To submit to the leaderboard, submit a pull request that adds your results to the Markdown table below. The table should be sorted by increasing time in milliseconds.
You should be running on a single H100 with batch size 1
, sequence length 16384
, d_model=1024
and num_heads=16
.
The top 3 submissions will receive a prize at the end of the quarter. To make this fair, we will reorder the top 5 scoring students based on our own tests. Make sure you save a snapshot of your best code so it can be reproduced by us! We will reach out to the top 5 students after results have stabilized.
In your pull request description, you should also include:
- The time you recorded
- A description of what you did
Name | Forward + Backward Time (ms) | Verification status (leave empty) |
---|---|---|
Herman Brunborg | 5.364ms | Verified |
Matthew Noto | 6.778ms | Verified |
Varun Desai | 11.07ms | Verified |
Prateek Varshney | 22.59ms | Verified |
Stephen Ge | 28.45ms | |
naive baseline | 80 ms | Verified |