Test Results #10
Replies: 3 comments 2 replies
-
Test Results: Llama-3.1 70b instruct (fireworks.ai)=== Test Results Summary === Test Case: Arena Bench Hard Test Case: Big Code Bench Test Case: Maths Problem Test Case: GSM8K |
Beta Was this translation helpful? Give feedback.
-
Test Results: Gemma 2 2b it Q8_0 (local: llama_cpp_python)=== Test Results Summary === Test Case: Arena Bench Hard Test Case: Big Code Bench Test Case: Maths Problem Test Case: GSM8K |
Beta Was this translation helpful? Give feedback.
-
Test Results: Phi-3.5 Mini 128K Instruct (openrouter.ai)=== Test Results Summary === Test Case: Arena Bench Hard Test Case: Big Code Bench Test Case: Maths Problem Test Case: GSM8K |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Just a thread to post test results.
E.g. to use another backend like openrouter.ai:
I updated
test.py
to supportbase_url
Beta Was this translation helpful? Give feedback.
All reactions