Skip to content

A hands-on collection of Jupyter notebooks covering key concepts, architectures, and optimizations in Large Language Models (LLMs).

License

Notifications You must be signed in to change notification settings

Ankur-singh/UnderstandingLLMs

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 

Repository files navigation

UnderstandingLLMs

A hands-on collection of Jupyter notebooks exploring key concepts, architectures, and optimizations in Large Language Models (LLMs).


Notebooks

1. LLM from Scratch - Part 1

Open In Colab

  • Build a basic LLM using PyTorch.
  • Covers data preprocessing, model architecture, training loop, and inference.

2. LLM from Scratch - Part 2

Open In Colab

  • Enhances the previous notebook with modern techniques:
    • RMSNorm
    • Gated (SwiGLU) FFN
    • Rotary Positional Embeddings (RoPE)
  • Includes gradient accumulation and mixed precision training.

3. Rotary Positional Embeddings (RoPE)

Open In Colab

  • Deep dive into RoPE in PyTorch.
  • Compares different RoPE implementations.

About

A hands-on collection of Jupyter notebooks covering key concepts, architectures, and optimizations in Large Language Models (LLMs).

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published