gaia-benchmark/GAIA

General · gaia-benchmark · 24.6K downloads
License: Unknown · 98 MB · language:en · size_categories:n<1K · format:parquet · modality:audio · modality:document

GAIA dataset

GAIA is a benchmark that aims to evaluate next-generation LLMs (LLMs with augmented capabilities due to added tooling, efficient prompting, access to search, etc.). We added gating to prevent bots from scraping the dataset. Please do not reshare the validation or test set in a crawlable format.

Data and leaderboard

GAIA is made of more than 450 non-trivial questions, each with an unambiguous answer, requiring different levels of tooling and autonomy to… See the full description on the dataset page: https://huggingface.co/datasets/gaia-benchmark/GAIA.

# download instantly
mlforge datasets pull gaia-benchmark/GAIA
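
As an alternative to the command above, the files can probably also be fetched straight from the Hugging Face source listed on this page. The following is a minimal sketch, not a documented workflow for this dataset: it assumes the `huggingface_hub` package is installed, that access to the gated repository has already been granted on the dataset page, and that a token is available through the `HF_TOKEN` environment variable or a cached login. The helper names `dataset_page_url` and `download_gaia`, and the `gaia_data` folder, are hypothetical.

```python
import os

REPO_ID = "gaia-benchmark/GAIA"  # dataset id as shown on this page


def dataset_page_url(repo_id: str) -> str:
    """Return the Hugging Face dataset page URL for a repo id."""
    return f"https://huggingface.co/datasets/{repo_id}"


def download_gaia(local_dir: str = "gaia_data") -> str:
    """Download the dataset files and return the local folder path.

    Assumption: access to the gated repo was already granted, and a token
    is available via HF_TOKEN or a previously cached login.
    """
    # Imported lazily so this module loads even without huggingface_hub.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=REPO_ID,
        repo_type="dataset",  # the repo is a dataset, not a model
        local_dir=local_dir,  # hypothetical target folder
        token=os.environ.get("HF_TOKEN"),  # None falls back to the cached login
    )
```

Calling `download_gaia()` should return the path of the downloaded folder. Because the dataset is gated, the call is expected to fail with an authorization error until access has been requested at the URL returned by `dataset_page_url(REPO_ID)`.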

Dataset details

Task: General
Language: en
License: Unknown
Size: 98 MB
Creator: gaia-benchmark
Downloads: 24.6K
Source: huggingface_datasets
Updated: 2025-10-28