Note
If you're a non-Stanford student and interested in submitting to the leaderboard, please create a pull request adding your result to the second table. To remain in the top 5, your submission must be verified: invite marcelroed to a minimal repo containing a uv project with `pyproject.toml`, `uv.lock`, and `main.py`. Your run should be reproducible on a single H100 by executing `uv run main.py`.
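For reference, a minimal `pyproject.toml` for such a project might look like the sketch below; the project name and dependencies are placeholders, not requirements.

```toml
# Hypothetical minimal pyproject.toml -- name and dependencies are placeholders.
[project]
name = "leaderboard-submission"
version = "0.1.0"
requires-python = ">=3.11"
dependencies = [
    "torch",
    "numpy",
]
```

Running `uv run main.py` will resolve these dependencies and generate `uv.lock` automatically if it does not already exist.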
To submit to the leaderboard, submit a pull request that adds your results to the Markdown table below. The table should be sorted by increasing loss.
Note that your submission can run for at most 1.5 hours on an H100, and that you may only use the OpenWebText training dataset that we provide. The code must clearly be your own work, and you can't use external implementations for systems-critical aspects of your model.
The top 3 submissions will receive a prize at the end of the quarter, and the top 3 external submissions will receive a T-shirt. To make this fair, we will reorder the top 5 scoring students based on our reproduced training runs. Make sure you save a snapshot of your best code so it can be reproduced by us! We will reach out to the top few students after results have stabilized. Leading submissions that cannot be verified will be removed.
In your pull request description, you should include:
- The final validation loss that was recorded
- A link to an associated learning curve that clearly shows a wallclock-time x-axis spanning less than 1.5 hours. You may either upload an image directly to the repo (use the `./images` folder) or link to a publicly viewable plot from a service like Weights & Biases (a minimal plotting sketch follows this list).
- A description of what you did
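As a sketch of the learning-curve requirement, the placeholder script below (made-up data, hypothetical paths) plots logged (wallclock, loss) pairs with hours on the x-axis.

```python
# Hedged sketch with fake data: plot validation loss against wallclock hours.
# In a real run, append (time.time() - start, val_loss) after each evaluation.
import os
import matplotlib.pyplot as plt

history = [(600 * i, 4.8 - 0.25 * i) for i in range(1, 9)]  # placeholder (seconds, loss)

hours = [t / 3600 for t, _ in history]
losses = [loss for _, loss in history]

plt.plot(hours, losses)
plt.axvline(1.5, linestyle="--", label="1.5 h budget")
plt.xlabel("wallclock time (hours)")
plt.ylabel("validation loss")
plt.legend()
os.makedirs("images", exist_ok=True)
plt.savefig("images/learning_curve.png")
```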
We are considering adding an automated validation loss check, since it's easy to measure your metrics incorrectly in a way that places you higher on the leaderboard than you should be. If your loss seems too good to be true, validate that your training and validation datasets are correct by checking decoded samples, and make sure your vocab is correct with 32k tokens. It should not be easy to get a validation loss better than 3.3.
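If you want a quick check, here is one possible sanity pass; the file path, dtype, and tokenizer interface are assumptions about your setup, not a prescribed format.

```python
# Hedged sanity check -- the path, dtype, and tokenizer here are assumptions.
import numpy as np

tokens = np.memmap("data/owt_valid.bin", dtype=np.uint16, mode="r")

# With a 32k vocab, every token id should be below 32,000.
assert int(tokens.max()) < 32_000, "token id out of range for a 32k vocab"

# Decode a short window and read it -- it should look like OpenWebText prose.
# print(tokenizer.decode(tokens[:256].tolist()))  # requires your tokenizer
```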