HomeDatasetsComplexDataLab/OpenFake
O

ComplexDataLab/OpenFake

Image Classification · ComplexDataLab· 34.8K
cc-by-nc-4.0 4.1 TB task_categories:image-classificationlanguage:enlicense:cc-by-nc-4.0size_categories:1M<n<10Mformat:parquet

OpenFake is a dataset and benchmark for detecting AI-generated images, with a focus on politically and socially salient content where misinformation risk is highest. It pairs real photographs with synthetic counterparts produced by a wide range of frontier proprietary generators, open-source diffusion models, and community fine-tunes. A separate in-the-wild test set is sourced from Reddit to evalu

Open in MLForge Sign up free Desktop app
# download instantly
mlforge datasets pull ComplexDataLab/OpenFake

Dataset details

Task
Image Classification
Language
en
License
cc-by-nc-4.0
Size
4.1 TB
Rows / images
2.5M
Creator
ComplexDataLab
Downloads
34.8K
Source
huggingface_datasets
Updated
2026-05-07

About ComplexDataLab/OpenFake

OpenFake is a dataset and benchmark for detecting AI-generated images, with a focus on politically and socially salient content where misinformation risk is highest. It pairs real photographs with synthetic counterparts produced by a wide range of frontier proprietary generators, open-source diffusion models, and community fine-tunes. A separate in-the-wild test set is sourced from Reddit to evaluate detector performance on naturally circulated synthetic media.