Name: zai-org/LongBench
Creator: zai-org
License: Unknown
Keywords: huggingface, task_categories:question-answering, task_categories:text-generation, task_categories:summarization, task_categories:text-classification, language:en, language:zh, size_categories:1K<n<10K, arxiv:2308.14508, question-answering, text-generation, summarization

About zai-org/LongBench

LongBench is the first benchmark for bilingual, multitask, and comprehensive assessment of long context understanding capabilities of large language models. LongBench includes different languages (Chinese and English) to provide a more comprehensive evaluation of the large models' multilingual capabilities on long contexts. In addition, LongBench is composed of six major categories and twenty one different tasks, covering key long-text application scenarios such as single-document QA, multi-document QA, summarization, few-shot learning, synthetic tasks and code completion.

zai-org/LongBench

Dataset details

About zai-org/LongBench