Skip to content

Commit 8396a61

Browse files
committed
clean
1 parent 9785b50 commit 8396a61

File tree

2 files changed

+1
-6
lines changed

2 files changed

+1
-6
lines changed

torchtitan/models/deepseek_v3/model/model.py

Lines changed: 0 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -368,15 +368,10 @@ def forward(self, tokens: torch.Tensor):
368368
Returns:
369369
torch.Tensor: Logits tensor of shape (batch_size, vocab_size).
370370
"""
371-
print("Input tokens:", tokens)
372371
h = self.tok_embeddings(tokens)
373-
print("After token embedding:", h)
374372

375373
for layer in self.layers.values():
376374
h = layer(h, self.freqs_cis)
377-
print(f"After layer {layer}: ", h)
378375
h = self.norm(h)
379-
print("After normalization:", h)
380376
output = self.output(h)
381-
print("Output logits:", output)
382377
return output

torchtitan/models/deepseek_v3/train_configs/debug_model.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -42,7 +42,7 @@ lr_min = 0.0
4242
local_batch_size = 8
4343
seq_len = 2048
4444
max_norm = 1.0 # grad norm clipping
45-
steps = 1
45+
steps = 10
4646
compile = false
4747
dataset = "c4_test" # supported datasets: c4_test (2K), c4 (177M)
4848

0 commit comments

Comments
 (0)