A hands-on collection of Jupyter notebooks exploring key concepts, architectures, and optimizations in Large Language Models (LLMs).
- Builds a basic LLM using PyTorch.
- Covers data preprocessing, model architecture, training loop, and inference.
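The core loop (preprocessed batch → forward → next-token loss → backward → optimizer step) can be sketched as follows. This is a minimal, illustrative example, not the notebook's actual code: the toy model, vocabulary size, and hyperparameters are placeholders.

```python
import torch
import torch.nn as nn

# Toy next-token LM: embedding followed by a linear head standing in for
# the real Transformer blocks. All sizes here are illustrative.
vocab, dim, seq = 50, 32, 16
model = nn.Sequential(nn.Embedding(vocab, dim), nn.Linear(dim, vocab))
opt = torch.optim.AdamW(model.parameters(), lr=1e-3)

tokens = torch.randint(0, vocab, (4, seq + 1))   # fake pre-tokenized batch
inp, tgt = tokens[:, :-1], tokens[:, 1:]         # shift by one for next-token targets

logits = model(inp)                              # (batch, seq, vocab)
loss = nn.functional.cross_entropy(logits.reshape(-1, vocab), tgt.reshape(-1))
loss.backward()
opt.step()
opt.zero_grad()
```

At inference time the same model is called autoregressively: sample or argmax the last position's logits, append the token, and repeat.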
- Enhances the previous notebook with modern techniques:
  - RMSNorm
  - Gated (SwiGLU) FFN
  - Rotary Positional Embeddings (RoPE)
- Includes gradient accumulation and mixed precision training.
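Two of the techniques above are compact enough to sketch inline. The following is an illustrative implementation of RMSNorm (normalize by the root mean square, no mean subtraction) and a SwiGLU feed-forward block; layer names and the hidden-size choice are assumptions, not necessarily what the notebook uses.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class RMSNorm(nn.Module):
    """Scale x by 1 / RMS(x) along the last dim, with a learned gain."""
    def __init__(self, dim, eps=1e-6):
        super().__init__()
        self.eps = eps
        self.weight = nn.Parameter(torch.ones(dim))

    def forward(self, x):
        inv_rms = x.pow(2).mean(dim=-1, keepdim=True).add(self.eps).rsqrt()
        return x * inv_rms * self.weight

class SwiGLU(nn.Module):
    """Gated FFN: down( SiLU(gate(x)) * up(x) ), as in LLaMA-style models."""
    def __init__(self, dim, hidden):
        super().__init__()
        self.gate = nn.Linear(dim, hidden, bias=False)
        self.up = nn.Linear(dim, hidden, bias=False)
        self.down = nn.Linear(hidden, dim, bias=False)

    def forward(self, x):
        return self.down(F.silu(self.gate(x)) * self.up(x))
```

With the default gain, RMSNorm's output has unit mean square per position, which is easy to verify numerically.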
- Deep dive into RoPE in PyTorch.
- Compares different RoPE implementations.
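As a reference point for what such implementations look like, here is one common RoPE variant that rotates interleaved even/odd channel pairs by position-dependent angles. This is a sketch of one possible implementation, not the specific variants the notebook compares; the function name and `base` default are assumptions.

```python
import torch

def rope(x, base=10000.0):
    """Apply rotary positional embeddings to x of shape (..., seq, dim).

    Channels are paired (even, odd); each pair at position p is rotated by
    an angle p * base**(-2i/dim), so relative offsets become relative rotations.
    """
    seq, dim = x.shape[-2], x.shape[-1]
    inv_freq = 1.0 / base ** (torch.arange(0, dim, 2, dtype=torch.float32) / dim)
    angles = torch.outer(torch.arange(seq, dtype=torch.float32), inv_freq)
    cos, sin = angles.cos(), angles.sin()        # each (seq, dim/2)
    x1, x2 = x[..., 0::2], x[..., 1::2]          # pair up even/odd channels
    out = torch.empty_like(x)
    out[..., 0::2] = x1 * cos - x2 * sin
    out[..., 1::2] = x1 * sin + x2 * cos
    return out
```

Because each pair is rotated, not scaled, RoPE leaves vector norms unchanged and acts as the identity at position 0, two properties worth asserting when comparing implementations.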