LMM-As-A-Judge: Multi-Modal Faithfulness and Relevancy Evaluators (#8945)
* add MM Relevancy
* minor fix
* add MM faithfulness evaluator; minor fix of nb utils
* add nb
* start executing evals
* use Union
* add comment to ignore of UP007
* finish evals
* use new image_to_text flag
* add nb to module_guides
Showing
- .pre-commit-config.yaml: 1 addition, 1 deletion
- docs/examples/evaluation/multi_modal/multi_modal_rag_evaluation.ipynb: 241 additions, 111 deletions
- docs/module_guides/evaluating/modules.md: 1 addition, 0 deletions
- llama_index/evaluation/multi_modal/__init__.py: 8 additions, 0 deletions
- llama_index/evaluation/multi_modal/faithfulness.py: 213 additions, 0 deletions
- llama_index/evaluation/multi_modal/relevancy.py: 194 additions, 0 deletions
- llama_index/response/notebook_utils.py: 6 additions, 2 deletions
- pyproject.toml: 1 addition, 0 deletions