locuslab/TOFU

Name: locuslab/TOFU
Creator: locuslab
License: mit
Keywords: huggingface, task_categories:question-answering, task_ids:closed-domain-qa, annotations_creators:machine-generated, language_creators:machine-generated, multilinguality:monolingual, source_datasets:original, language:en, license:mit, question-answering

Question Answering · locuslab· 86.7K

mit 6.7 MB task_categories:question-answeringtask_ids:closed-domain-qaannotations_creators:machine-generatedlanguage_creators:machine-generatedmultilinguality:monolingual

The TOFU dataset serves as a benchmark for evaluating unlearning performance of large language models on realistic tasks. The dataset comprises question-answer pairs based on autobiographies of 200 different authors that do not exist and are completely fictitiously generated by the GPT-4 model. The goal of the task is to unlearn a fine-tuned model on various fractions of the forget set.

Open in MLForge Sign up free Desktop app

# download instantly
mlforge datasets pull locuslab/TOFU

Dataset details

Task

Question Answering

Language

License

mit

Size

6.7 MB

Rows / images

18.1K

Creator

locuslab

Downloads

86.7K

Source

huggingface_datasets

Updated

2025-03-27

locuslab/TOFU

Dataset details

About locuslab/TOFU