llm-eval-analysis v1.4.6-beta.3

Latest
@Gaganv882 Gaganv882 released this 04 May 13:16
· 1 commit to main since this release

Version 1.4.6-beta.3 extends the multi-metric evaluation framework for human-bot dialogues. It adds support for additional datasets and improves analysis of outputs from LLMs such as Claude and GPT-4o, giving users more accurate insights and smoother evaluation runs.
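As a rough illustration of what multi-metric evaluation over a human-bot dialogue can look like, the sketch below aggregates a few simple per-turn metrics. All names here (`evaluate_dialogue`, `distinct_1`, the turn schema) are hypothetical for illustration and are not the llm-eval-analysis API.

```python
def distinct_1(text: str) -> float:
    """Fraction of unique tokens in a turn (a simple lexical-diversity metric)."""
    tokens = text.lower().split()
    return len(set(tokens)) / len(tokens) if tokens else 0.0

def evaluate_dialogue(turns: list[dict]) -> dict:
    """Compute several metrics over the bot turns of a dialogue.

    Each turn is a dict like {"speaker": "human" | "bot", "text": "..."}.
    """
    bot_turns = [t["text"] for t in turns if t["speaker"] == "bot"]
    n = len(bot_turns)
    return {
        "bot_turn_count": n,
        "avg_bot_turn_words": (
            sum(len(t.split()) for t in bot_turns) / n if n else 0.0
        ),
        "avg_distinct_1": (
            sum(distinct_1(t) for t in bot_turns) / n if n else 0.0
        ),
    }

dialogue = [
    {"speaker": "human", "text": "What is the capital of France?"},
    {"speaker": "bot", "text": "The capital of France is Paris."},
    {"speaker": "human", "text": "Thanks!"},
    {"speaker": "bot", "text": "You're welcome!"},
]
print(evaluate_dialogue(dialogue))
```

A real multi-metric run would swap these toy metrics for model-quality measures (e.g. relevance or coherence scores) and loop over a dataset of dialogues rather than a single one.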