HomeDatasetsjasperai/monet
M

jasperai/monet

Text To Image · jasperai· 180.7K
apache-2.0 466 GB task_categories:text-to-imagetask_categories:image-feature-extractiontask_categories:zero-shot-image-classificationlanguage:enlicense:apache-2.0

Dataset Card for MONET MONET (Massive, Open, Non-redundant and Enriched Text-to-image dataset) is a large-scale, curated image-text dataset designed for training text-to-image (T2I) systems. It contains 104.9 million high-quality image-text pairs distilled from 2.9 billion raw pairs across nine heterogeneous open sources (6 real and 3 synthetic) through successive stages of safety filtering, domain-based filtering, exact and near-duplicate removal, and re-captioning with… See the full description on the dataset page: https://huggingface.co/datasets/jasperai/monet.

Open in MLForge Sign up free Desktop app
# download instantly
mlforge datasets pull jasperai/monet

Dataset details

Task
Text To Image
Language
en
License
apache-2.0
Size
466 GB
Creator
jasperai
Downloads
180.7K
Source
huggingface_datasets
Updated
2026-06-24

About jasperai/monet

MONET (Massive, Open, Non-redundant and Enriched Text-to-image dataset) is a large-scale, curated image-text dataset designed for training text-to-image (T2I) systems. It contains 104.9 million high-quality image-text pairs distilled from 2.9 billion raw pairs across nine heterogeneous open sources (6 real and 3 synthetic) through successive stages of safety filtering, domain-based filtering, exact and near-duplicate removal, and re-captioning with multiple vision-language models, and is further augmented with synthetically generated samples. Each image is released with pre-computed embeddings, structured annotations and pre-encoded VAE latents to accelerate downstream use.