HomeDatasetslocuslab/TOFU
T

locuslab/TOFU

Question Answering · locuslab· 86.7K
mit 6.7 MB task_categories:question-answeringtask_ids:closed-domain-qaannotations_creators:machine-generatedlanguage_creators:machine-generatedmultilinguality:monolingual

The TOFU dataset serves as a benchmark for evaluating unlearning performance of large language models on realistic tasks. The dataset comprises question-answer pairs based on autobiographies of 200 different authors that do not exist and are completely fictitiously generated by the GPT-4 model. The goal of the task is to unlearn a fine-tuned model on various fractions of the forget set.

Open in MLForge Sign up free Desktop app
# download instantly
mlforge datasets pull locuslab/TOFU

Dataset details

Task
Question Answering
Language
en
License
mit
Size
6.7 MB
Rows / images
18.1K
Creator
locuslab
Downloads
86.7K
Source
huggingface_datasets
Updated
2025-03-27

About locuslab/TOFU

The TOFU dataset serves as a benchmark for evaluating unlearning performance of large language models on realistic tasks. The dataset comprises question-answer pairs based on autobiographies of 200 different authors that do not exist and are completely fictitiously generated by the GPT-4 model. The goal of the task is to unlearn a fine-tuned model on various fractions of the forget set.