Commit 6940f6a (parent: 98bb2c3)

minor nits

Signed-off-by: Alex Chi <iskyzh@gmail.com>

2 files changed (+2, -4)

book/src/SUMMARY.md

Lines changed: 0 additions & 1 deletion
@@ -12,7 +12,6 @@
 - [RMSNorm and MLP](./week1-04-rmsnorm-and-mlp.md)
 - [The Qwen2 Model]()
 - [Generating the Response]()
-- [Loading the Model]()
 - [Sampling and Preparing for Week 2]()
 <!--
 - [Attention and Multi-Head Attention](./week1-01-attention.md)

book/src/week1-overview.md

Lines changed: 2 additions & 3 deletions
@@ -10,9 +10,8 @@ In this week, we will start from the basic matrix operations and see how those t
 Qwen2 model parameters into a model that generates text. We will implement the neural network layers used in the Qwen2
 model using mlx's matrix APIs.
 
-We will use the Qwen2-7B-Instruct model for this week. As we need to dequantize the model parameters, the 4GB model needs
-20GB of memory in week 1. If you do not have enough memory, you can consider using the smaller 0.5B model (we do not have
-infra to test it so you need to figure out things on your own unfortunately).
+We will use the Qwen2-7B-Instruct model for this week. As we need to dequantize the model parameters, the model of 4GB
+download size needs 20GB of memory in week 1. If you do not have enough memory, you can consider using the smaller 0.5B model.
 
 The MLX version of the Qwen2-7B-Instruct model we downloaded in the setup is an int4 quantized version of the original bfloat16 model.
 
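A quick check of the memory figures in the revised paragraph above, as a minimal sketch. The parameter count (~7.6B for Qwen2-7B-Instruct), the int4 packing (0.5 byte per parameter on disk), and the bfloat16 target width (2 bytes per parameter) are assumptions inferred from the model name and the commit text, not values stated in the diff:

```python
# Back-of-the-envelope arithmetic behind "4GB download, 20GB in memory".
# Assumed (not stated in the commit): ~7.6B parameters, int4 weights on
# disk, bfloat16 (2 bytes per parameter) after dequantization.
PARAMS = 7.6e9

download_gb = PARAMS * 0.5 / 1e9     # packed 4-bit weights: ~3.8 GB
dequantized_gb = PARAMS * 2.0 / 1e9  # bfloat16 copy: ~15.2 GB

print(f"download size ~= {download_gb:.1f} GB")
print(f"dequantized   ~= {dequantized_gb:.1f} GB")
```

Holding the quantized weights and the dequantized bfloat16 copy at the same time, plus activations, plausibly accounts for the ~20 GB working set the overview warns about.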
0 commit comments
