An intelligent multimodal deepfake detection system that analyzes images, videos, and audio for signs of AI generation or manipulation.
Built for Vulnuris Security Solutions Pvt. Ltd. Hackathon Challenge.
- Image Analysis — Error Level Analysis (ELA), FFT Frequency Domain Analysis, Face Region Analysis, Noise Pattern Analysis, Metadata Inspection
- Video Analysis — Per-frame forensic analysis, Temporal Consistency checks, Face Stability tracking, Blink Pattern analysis
- Audio Analysis — Mel-Spectrogram analysis, MFCC Feature extraction, Pitch Consistency (F0), Audio Artifact detection
- Confidence score with detailed breakdown
- Visual forensic heatmaps and spectrograms
- Downloadable analysis reports
- Analysis history tracking
- Drag-and-drop file upload
- Support for 20+ file formats
- Python 3.8+
- pip
# Navigate to project directory
cd VERSANIX
# Install dependencies
pip install -r requirements.txt
# Run the application
python app.pyOpen your browser to http://localhost:5000
| Type | Formats |
|---|---|
| Image | PNG, JPG, JPEG, GIF, BMP, WebP, TIFF |
| Video | MP4, AVI, MOV, MKV, WMV, FLV, WebM |
| Audio | MP3, WAV, OGG, FLAC, AAC, M4A, WMA |
| Method | Endpoint | Description |
|---|---|---|
| GET | / |
Web interface |
| GET | /api/health |
Health check |
| POST | /api/analyze |
Analyze uploaded media file |
| GET | /api/history |
Get analysis history |
curl -X POST -F "file=@photo.jpg" http://localhost:5000/api/analyze- Error Level Analysis (ELA) — Re-saves image at known quality and compares pixel-level differences
- Frequency Domain Analysis — Applies FFT to detect GAN spectral fingerprints
- Face Region Analysis — Checks for blending artifacts and blur inconsistencies around faces
- Noise Pattern Analysis — Extracts and analyzes noise residuals for synthetic patterns
- Metadata Analysis — Examines EXIF data for AI tool signatures
- Per-Frame Analysis — Runs image forensics on sampled keyframes
- Temporal Consistency — Checks for score variations between frames
- Face Stability — Tracks face position jitter across frames
- Blink Analysis — Detects unnatural blink patterns
- Spectral Analysis — Analyzes mel-spectrogram for TTS smoothness
- MFCC Analysis — Checks for statistical anomalies in cepstral coefficients
- Pitch Consistency — Measures F0 steadiness (TTS tends to be unnaturally steady)
- Artifact Detection — Finds clicks, phase discontinuities, and splice artifacts
VERSANIX/
├── app.py # Flask application
├── requirements.txt # Python dependencies
├── README.md # This file
├── detectors/
│ ├── __init__.py
│ ├── image_detector.py # Image forensic analysis
│ ├── video_detector.py # Video deepfake detection
│ └── audio_detector.py # Audio synthesis detection
├── static/
│ ├── css/style.css # UI stylesheet
│ └── js/app.js # Frontend logic
├── templates/
│ └── index.html # Web interface
└── uploads/ # Temporary upload directory
- Backend: Python, Flask, Flask-CORS
- Image Processing: OpenCV, Pillow, NumPy, SciPy
- Audio Processing: Librosa, SoundFile
- Frontend: HTML5, CSS3, JavaScript (Vanilla)
- Design: Glassmorphism, Dark Theme, Inter Font
This tool uses heuristic-based forensic analysis techniques. Results are indicative and should be used alongside other verification methods for critical decisions. Detection accuracy may vary depending on the sophistication of the deepfake.
Team VERSANIX | Vulnuris Hackathon 2026