Abduhu1

Abdullah Andrabi Abduhu1

Pinned Loading

Evaluating-SLMs-on-JEEBench Evaluating-SLMs-on-JEEBench Public

A comprehensive analysis of seven state-of-the-art SLMs on JEEBench, a rigorous benchmark for STEM reasoning. This project explores the impact of zero-shot, few-shot, and Chain-of-Thought prompting…

Python 1