Deepfake Database

Deepfake Database


Deepfake Database


Deepfake Database

Use Case

Deepfake Database


Explore our extensive Deepfake Database, featuring diverse and real-world videos for deepfake detection, face recognition, and video forensics.

Deepfake Database


This dataset, originally created by the authors of the Mesonet Paper, has been enhanced to include a validation set. It uniquely gathers videos generated using the Deepfake technique, a method that creates realistic forged videos by training auto-encoders. Training these auto-encoders takes several days with conventional processors and can only be done for two specific faces at a time.

To ensure diversity, the authors collected a variety of publicly available videos from different platforms on the internet. These videos range in length from two seconds to three minutes and have a minimum resolution of 854 × 480 pixels. All videos are compressed using the H.264 codec with varying compression levels, mimicking real-world conditions.

Download Dataset

Data Collection and Processing

  • Face Extraction and Alignment: All faces were extracted using the Viola-Jones detector and aligned using a trained neural network for facial landmark detection. To ensure a balanced distribution of faces, the number of selected frames for extraction per video is proportional to the number of camera angles and illumination changes on the target face. On average, approximately 50 faces were extracted per scene.
  • Dataset Composition: The dataset includes both forged and real face images. Real faces were also extracted from various internet sources, maintaining the same resolution as the forged faces. The dataset was manually reviewed to remove misaligned and incorrectly detected faces. Both classes (real and forged faces) were balanced for good-resolution and poor-resolution images to avoid classification bias.

Key Features

  • Diversity: The dataset includes a wide variety of faces from different videos to ensure diversity.
  • Real-world Conditions: Videos are compressed using different levels of the H.264 codec, replicating real-world conditions.
  • Balanced Distribution: The number of faces extracted per video is balanced based on camera angles and illumination changes, ensuring a representative sample.
  • Manual Review: The dataset was manually reviewed to remove any misalignments and incorrect face detections, ensuring high-quality data.
  • Validation Set: Includes a validation set for model testing and evaluation.

Enhanced Content and Applications

This dataset can be used for:

  • Deepfake Detection: Training and evaluating models designed to detect forged videos.
  • Face Recognition: Enhancing the robustness of face recognition systems against deepfake attacks.
  • Video Forensics: Assisting in the development of forensic tools to identify and analyze video manipulations.
  • AI Ethics Research: Studying the ethical implications and potential misuse of deepfake technology.

Contact Us

Please enable JavaScript in your browser to complete this form.

Quality Data Creation


Guaranteed TAT


ISO 9001:2015, ISO/IEC 27001:2013 Certified


HIPAA Compliance


GDPR Compliance


Compliance and Security

Let's Discuss your Data collection Requirement With Us

To get a detailed estimation of requirements please reach us.

Scroll to Top