LLM as a Judge

This exercise is to create LLM as a judge. It will evaluate and compare responses from two other AI models.

Model A: gpt-4o-mini
Model B: Gemini-2.0-flash
Model C (Judge) : GPT-4.1

Description:
We first user Model A to come up with a challenging question to test the intelligence of LLMs. Then fed this question to both Model A and Model B as an input and used Model C to evaluate and rank the responses from Model A and Model B.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
README.md		README.md
comapreAImodels		comapreAImodels
judge_AI		judge_AI

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

LLM as a Judge

Workflow is as below

About

Uh oh!

Releases

Packages

kushagra2103/compareAImodels

Folders and files

Latest commit

History

Repository files navigation

LLM as a Judge

Workflow is as below

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages