From fa02ded685d115d5bd79b813f1e35d2ffb3f133c Mon Sep 17 00:00:00 2001
From: Chester Hu <hcp199242@gmail.com>
Date: Mon, 10 Jun 2024 10:32:42 -0700
Subject: [PATCH] Create README.md

Add benchmarks top-level README
---
 tools/benchmarks/README.md | 4 ++++
 1 file changed, 4 insertions(+)
 create mode 100644 tools/benchmarks/README.md

diff --git a/tools/benchmarks/README.md b/tools/benchmarks/README.md
new file mode 100644
index 00000000..b94739eb
--- /dev/null
+++ b/tools/benchmarks/README.md
@@ -0,0 +1,4 @@
+# Benchmarks
+
+* inference - a folder containing benchmark scripts that run a throughput analysis of Llama model inference on various backends, including on-prem, cloud, and on-device.
+* llm_eval_harness - a folder containing a tool for evaluating fine-tuned Llama models, including quantized models, with a focus on quality.
-- 
GitLab