Skip to content
Snippets Groups Projects
Commit fa02ded6 authored by Chester Hu's avatar Chester Hu Committed by Suraj Subramanian
Browse files

Create README.md

Add benchmarks top-level README
parent 49018410
Loading
# Benchmarks
* inference - a folder contains benchmark scripts that apply a throughput analysis for Llama models inference on various backends including on-prem, cloud and on-device.
* llm_eval_harness - a folder contains a tool to evaluate fine-tuned Llama models including quantized models focusing on quality.
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please register or to comment