File tree
7 files changed
+106
-78
lines changed- torchtitan
- experiments/kernels/moe
- models/deepseek_v3
- model
- train_configs
7 files changed
+106
-78
lines changedLines changed: 28 additions & 28 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
72 | 72 |
| |
73 | 73 |
| |
74 | 74 |
| |
75 |
| - | |
| 75 | + | |
76 | 76 |
| |
77 |
| - | |
| 77 | + | |
78 | 78 |
| |
79 |
| - | |
| 79 | + | |
80 | 80 |
| |
81 |
| - | |
| 81 | + | |
82 | 82 |
| |
83 | 83 |
| |
84 | 84 |
| |
| |||
99 | 99 |
| |
100 | 100 |
| |
101 | 101 |
| |
102 |
| - | |
103 |
| - | |
104 |
| - | |
| 102 | + | |
105 | 103 |
| |
106 | 104 |
| |
107 | 105 |
| |
108 | 106 |
| |
109 | 107 |
| |
110 | 108 |
| |
111 |
| - | |
| 109 | + | |
112 | 110 |
| |
113 |
| - | |
| 111 | + | |
| 112 | + | |
114 | 113 |
| |
115 |
| - | |
| 114 | + | |
116 | 115 |
| |
117 | 116 |
| |
118 |
| - | |
119 |
| - | |
| 117 | + | |
120 | 118 |
| |
| 119 | + | |
121 | 120 |
| |
122 | 121 |
| |
| 122 | + | |
123 | 123 |
| |
124 | 124 |
| |
125 | 125 |
| |
126 | 126 |
| |
| 127 | + | |
127 | 128 |
| |
128 |
| - | |
| 129 | + | |
129 | 130 |
| |
130 | 131 |
| |
131 | 132 |
| |
132 | 133 |
| |
| 134 | + | |
133 | 135 |
| |
134 | 136 |
| |
135 | 137 |
| |
| |||
139 | 141 |
| |
140 | 142 |
| |
141 | 143 |
| |
| 144 | + | |
142 | 145 |
| |
143 | 146 |
| |
144 | 147 |
| |
145 | 148 |
| |
146 | 149 |
| |
147 |
| - | |
148 | 150 |
| |
149 | 151 |
| |
150 | 152 |
| |
151 | 153 |
| |
152 | 154 |
| |
| 155 | + | |
153 | 156 |
| |
154 | 157 |
| |
155 | 158 |
| |
| 159 | + | |
156 | 160 |
| |
157 |
| - | |
| 161 | + | |
158 | 162 |
| |
159 | 163 |
| |
160 | 164 |
| |
| |||
165 | 169 |
| |
166 | 170 |
| |
167 | 171 |
| |
168 |
| - | |
| 172 | + | |
169 | 173 |
| |
170 | 174 |
| |
171 | 175 |
| |
| |||
182 | 186 |
| |
183 | 187 |
| |
184 | 188 |
| |
| 189 | + | |
185 | 190 |
| |
186 | 191 |
| |
187 | 192 |
| |
188 |
| - | |
189 |
| - | |
190 |
| - | |
191 | 193 |
| |
192 | 194 |
| |
193 | 195 |
| |
| |||
196 | 198 |
| |
197 | 199 |
| |
198 | 200 |
| |
199 |
| - | |
| 201 | + | |
200 | 202 |
| |
201 |
| - | |
| 203 | + | |
202 | 204 |
| |
203 | 205 |
| |
204 | 206 |
| |
205 | 207 |
| |
206 | 208 |
| |
207 | 209 |
| |
208 |
| - | |
| 210 | + | |
209 | 211 |
| |
210 | 212 |
| |
211 | 213 |
| |
| |||
225 | 227 |
| |
226 | 228 |
| |
227 | 229 |
| |
228 |
| - | |
229 |
| - | |
230 |
| - | |
231 |
| - | |
232 |
| - | |
| 230 | + | |
233 | 231 |
| |
234 | 232 |
| |
235 | 233 |
| |
236 | 234 |
| |
237 | 235 |
| |
238 | 236 |
| |
| 237 | + | |
239 | 238 |
| |
240 | 239 |
| |
241 | 240 |
| |
| |||
273 | 272 |
| |
274 | 273 |
| |
275 | 274 |
| |
| 275 | + | |
276 | 276 |
| |
277 |
| - | |
278 | 277 |
| |
279 | 278 |
| |
280 | 279 |
| |
281 | 280 |
| |
282 | 281 |
| |
283 | 282 |
| |
284 | 283 |
| |
| 284 | + | |
285 | 285 |
| |
286 | 286 |
| |
287 | 287 |
| |
|
Lines changed: 5 additions & 5 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
32 | 32 |
| |
33 | 33 |
| |
34 | 34 |
| |
35 |
| - | |
36 |
| - | |
| 35 | + | |
| 36 | + | |
37 | 37 |
| |
38 |
| - | |
39 |
| - | |
40 |
| - | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
41 | 41 |
| |
42 | 42 |
| |
43 | 43 |
| |
|
Lines changed: 2 additions & 2 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
75 | 75 |
| |
76 | 76 |
| |
77 | 77 |
| |
78 |
| - | |
79 |
| - | |
| 78 | + | |
| 79 | + | |
80 | 80 |
| |
81 | 81 |
| |
82 | 82 |
| |
|
Lines changed: 5 additions & 37 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
14 | 14 |
| |
15 | 15 |
| |
16 | 16 |
| |
17 |
| - | |
| 17 | + | |
18 | 18 |
| |
19 | 19 |
| |
20 | 20 |
| |
| |||
260 | 260 |
| |
261 | 261 |
| |
262 | 262 |
| |
263 |
| - | |
264 |
| - | |
265 |
| - | |
266 |
| - | |
267 |
| - | |
268 |
| - | |
269 |
| - | |
270 |
| - | |
271 |
| - | |
272 |
| - | |
273 |
| - | |
274 |
| - | |
275 |
| - | |
276 |
| - | |
277 |
| - | |
278 |
| - | |
279 |
| - | |
280 |
| - | |
281 |
| - | |
282 |
| - | |
283 |
| - | |
284 |
| - | |
285 |
| - | |
286 |
| - | |
287 |
| - | |
288 |
| - | |
289 |
| - | |
290 |
| - | |
291 |
| - | |
292 |
| - | |
293 |
| - | |
294 |
| - | |
295 |
| - | |
296 |
| - | |
297 |
| - | |
298 |
| - | |
299 | 263 |
| |
300 | 264 |
| |
301 | 265 |
| |
| |||
316 | 280 |
| |
317 | 281 |
| |
318 | 282 |
| |
| 283 | + | |
319 | 284 |
| |
320 | 285 |
| |
321 | 286 |
| |
| |||
330 | 295 |
| |
331 | 296 |
| |
332 | 297 |
| |
| 298 | + | |
333 | 299 |
| |
334 | 300 |
| |
| 301 | + | |
335 | 302 |
| |
336 | 303 |
| |
337 | 304 |
| |
| |||
360 | 327 |
| |
361 | 328 |
| |
362 | 329 |
| |
| 330 | + | |
363 | 331 |
| |
364 | 332 |
| |
365 | 333 |
| |
|
Lines changed: 63 additions & 3 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
11 | 11 |
| |
12 | 12 |
| |
13 | 13 |
| |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
| 48 | + | |
| 49 | + | |
14 | 50 |
| |
15 | 51 |
| |
16 | 52 |
| |
| |||
212 | 248 |
| |
213 | 249 |
| |
214 | 250 |
| |
215 |
| - | |
| 251 | + | |
216 | 252 |
| |
217 | 253 |
| |
218 | 254 |
| |
219 | 255 |
| |
| 256 | + | |
| 257 | + | |
| 258 | + | |
| 259 | + | |
| 260 | + | |
| 261 | + | |
220 | 262 |
| |
221 | 263 |
| |
222 | 264 |
| |
| |||
266 | 308 |
| |
267 | 309 |
| |
268 | 310 |
| |
| 311 | + | |
| 312 | + | |
| 313 | + | |
| 314 | + | |
| 315 | + | |
| 316 | + | |
| 317 | + | |
| 318 | + | |
| 319 | + | |
269 | 320 |
| |
270 | 321 |
| |
271 | 322 |
| |
| |||
299 | 350 |
| |
300 | 351 |
| |
301 | 352 |
| |
| 353 | + | |
302 | 354 |
| |
303 | 355 |
| |
304 | 356 |
| |
| |||
311 | 363 |
| |
312 | 364 |
| |
313 | 365 |
| |
| 366 | + | |
314 | 367 |
| |
315 |
| - | |
| 368 | + | |
| 369 | + | |
| 370 | + | |
| 371 | + | |
316 | 372 |
| |
317 | 373 |
| |
318 | 374 |
| |
| |||
321 | 377 |
| |
322 | 378 |
| |
323 | 379 |
| |
324 |
| - | |
| 380 | + | |
325 | 381 |
| |
326 | 382 |
| |
327 | 383 |
| |
| 384 | + | |
| 385 | + | |
| 386 | + | |
| 387 | + | |
328 | 388 |
| |
329 | 389 |
| |
330 | 390 |
| |
|
0 commit comments