tahoebio/Tahoe-100M

General · tahoebio · 30.4K downloads
cc0-1.0 · 435 GB · license:cc0-1.0 · size_categories:1B<n<10B · format:parquet · modality:tabular · modality:text

# download instantly
mlforge datasets pull tahoebio/Tahoe-100M

Dataset details

Task: General
License: cc0-1.0
Size: 435 GB
Rows / images: 4.3B
Classes: 10
Creator: tahoebio
Downloads: 30.4K
Source: huggingface_datasets
Updated: 2025-07-23

About tahoebio/Tahoe-100M

Tahoe-100M is a giga-scale single-cell perturbation atlas of over 100 million transcriptomic profiles from 50 cancer cell lines exposed to 1,100 small-molecule perturbations. It was generated on Vevo Therapeutics' Mosaic high-throughput platform and supports context-aware exploration of gene function, cellular states, and drug responses at single-cell resolution. The dataset is intended for developing AI models of cell biology, with applications across systems biology, drug discovery, and precision medicine.
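
Because the listing reports 435 GB of Parquet sourced from Hugging Face Datasets, it may be more practical to stream a few records than to pull everything first. The sketch below is one possible way to do that, not an official loader. It assumes the `datasets` Python package is installed and that the repository exposes a split named `train`, which this page does not confirm. `take` and `peek_rows` are hypothetical helper names introduced only for illustration.

```python
from itertools import islice
from typing import Any, Iterable, List

REPO_ID = "tahoebio/Tahoe-100M"  # dataset identifier as shown on this page


def take(rows: Iterable[Any], n: int) -> List[Any]:
    """Return the first n items of any iterable without reading the rest."""
    return list(islice(rows, n))


def peek_rows(n: int = 3, split: str = "train") -> List[Any]:
    """Stream the first n records instead of downloading the full dataset.

    Assumptions (not confirmed by this page): the `datasets` package is
    installed, network access is available, and a split called "train" exists.
    """
    # Imported lazily so that `take` stays usable without the dependency.
    from datasets import load_dataset

    # streaming=True yields records lazily rather than materialising files on disk.
    stream = load_dataset(REPO_ID, split=split, streaming=True)
    return take(stream, n)
```

Calling `peek_rows(3)` would then fetch three records over the network. Printing the keys of the first record is a quick way to see the actual column names, which are not listed here.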