-
Notifications
You must be signed in to change notification settings - Fork 12.4k
Open
Labels
enhancementNew feature or requestNew feature or requestgeneration qualityQuality of model outputQuality of model outputhelp wantedNeeds help from the communityNeeds help from the communityhigh priorityVery important issueVery important issueresearch 🔬
Description
Update 10 Apr 2024: #231 (comment)
It would be great to start doing this kind of quantitative analysis of ggml
-based inference:
https://bellard.org/ts_server/
It looks like Fabrice evaluates the models using something called LM Evaluation Harness:
https://github.com/EleutherAI/lm-evaluation-harness
I have no idea what this is yet, but would be nice to study it and try to integrate it here and in other ggml
-based projects.
This will be very important step needed to estimate the quality of the generated output and see if we are on the right track.
bitRAKE, cdulhanty, Coderx7, shinomakoi, lin72h and 3 more
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or requestgeneration qualityQuality of model outputQuality of model outputhelp wantedNeeds help from the communityNeeds help from the communityhigh priorityVery important issueVery important issueresearch 🔬