Skip to content

kushagra2103/compareAImodels

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 

Repository files navigation

LLM as a Judge

This exercise is to create LLM as a judge. It will evaluate and compare responses from two other AI models.

Model A: gpt-4o-mini
Model B: Gemini-2.0-flash
Model C (Judge) : GPT-4.1

Description:
We first user Model A to come up with a challenging question to test the intelligence of LLMs. Then fed this question to both Model A and Model B as an input and used Model C to evaluate and rank the responses from Model A and Model B.

Workflow is as below

image

About

This exercise is to make AI model judge responses from other LLM models.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published