EvalAP (Evaluation API and Platform) is a high-level service designed to perform evaluations for Etalab. This project provides an API to evaluate large language models (LLMs) and an interface to navigate datasets, models, metrics, and experiments.
For guidance on how to use this project, please refer to the following resources:
- The documentation: https://evalap.etalab.gouv.fr/doc
- The demo notebooks: notebooks/
- The public instance interface: https://evalap.etalab.gouv.fr/
You can open issues for bugs you've found or features you think are missing. You can also submit pull requests to this repository. To get started, take a look at CONTRIBUTING.md.
MIT License