Skip to content
Snippets Groups Projects
Unverified Commit b69e6bf8 authored by Jack Tang's avatar Jack Tang Committed by GitHub
Browse files

Update README.md

parent db05e13f
No related branches found
No related tags found
No related merge requests found
......@@ -23,13 +23,14 @@ LongBench includes 13 English tasks, 5 Chinese tasks, and 2 code tasks, with the
| Code Completion | - | - | 2 |
## 🔍 Table of Contents
- [Leaderboard](#%F0%9F%96%A5%EF%B8%8FLeaderboard)
- [Leaderboard](#Leaderboard)
- [How to evaluate on LongBench](#how-to-evaluate-on-LongBench)
- [Evaluation Result on Each Dataset](#evaluation-result-on-each-dataset)
- [Acknowledgement](#acknowledgement)
- [Citation](#citation)
## 🖥️Leaderboard
## 🖥️ Leaderboard
<a name="Leaderboard"></a>
Here is the average scores (%) on the main task categories in both Chinese and English languages under the Zero-shot scenario. Please refer to this [link](task.md) for the evaluation metrics used for each task.
> Note: For text exceeding the processing length capability of the model, we truncate from the middle of the text, preserving information from the beginning and end, in accordance with the observations from [Lost in the Middle](https://arxiv.org/abs/2307.03172). Experiments show that this truncation method has the least impact on model performance.
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment