From 67e4b2e68d8757904eaa90586dc697a956ba6a60 Mon Sep 17 00:00:00 2001
From: Xin Lv <lvxin14@163.com>
Date: Mon, 31 Jul 2023 10:43:38 +0800
Subject: [PATCH] Update README.md

---
 README.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/README.md b/README.md
index c358330..3b38cff 100644
--- a/README.md
+++ b/README.md
@@ -5,7 +5,7 @@
 
 阅读[中文版本](README_ZH.md).
 
-# LongBench: A Multilingual, Multitask Benchmark Tailored for Long Context Understanding
+# LongBench: A Multilingual, Multitask Benchmark for Long Context Understanding
 
 **LongBench** is the first benchmark for multilingual, multitask, and comprehensive assessment of **long context understanding** capabilities of large language models. LongBench includes different languages (Chinese and English) to provide a more comprehensive evaluation of the large models' multilingual capabilities on long contexts. In addition, LongBench consists of 20 different tasks, covering key long context application scenarios such as single-document QA, multi-document QA, summarization, few-shot learning, code completion, and synthetic tasks.
 
@@ -191,4 +191,4 @@ The following tables show the Zero-shot evaluation results (%) on all datasets,
 ## Citation
 This is a joint work by **THU-KEG** and **Zhipu AI**. We are currently working on the paper, and the citation information will be updated when it's ready. Please stay tuned~
 
-When citing our work, please cite all of the original dataset papers. The relevant citation information is listed [here](refs/ref.bib).
\ No newline at end of file
+When citing our work, please cite all of the original dataset papers. The relevant citation information is listed [here](refs/ref.bib).
-- 
GitLab