File tree
7 files changed
+97
-56
lines changed- large_language_model_pretraining/nemo
- utils
- mixture_of_experts_pretraining
7 files changed
+97
-56
lines changedLines changed: 37 additions & 15 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
35 | 35 |
| |
36 | 36 |
| |
37 | 37 |
| |
38 |
| - | |
39 |
| - | |
40 |
| - | |
41 | 38 |
| |
42 | 39 |
| |
43 | 40 |
| |
| |||
79 | 76 |
| |
80 | 77 |
| |
81 | 78 |
| |
82 |
| - | |
| 79 | + | |
83 | 80 |
| |
84 | 81 |
| |
85 | 82 |
| |
86 | 83 |
| |
87 |
| - | |
| 84 | + | |
88 | 85 |
| |
89 | 86 |
| |
90 | 87 |
| |
| |||
103 | 100 |
| |
104 | 101 |
| |
105 | 102 |
| |
106 |
| - | |
| 103 | + | |
| 104 | + | |
| 105 | + | |
107 | 106 |
| |
108 | 107 |
| |
109 | 108 |
| |
110 | 109 |
| |
111 | 110 |
| |
112 | 111 |
| |
113 | 112 |
| |
114 |
| - | |
| 113 | + | |
115 | 114 |
| |
116 | 115 |
| |
117 | 116 |
| |
118 | 117 |
| |
119 |
| - | |
120 |
| - | |
121 |
| - | |
| 118 | + | |
| 119 | + | |
122 | 120 |
| |
123 | 121 |
| |
124 | 122 |
| |
| |||
148 | 146 |
| |
149 | 147 |
| |
150 | 148 |
| |
151 |
| - | |
| 149 | + | |
152 | 150 |
| |
153 | 151 |
| |
154 | 152 |
| |
| |||
163 | 161 |
| |
164 | 162 |
| |
165 | 163 |
| |
166 |
| - | |
| 164 | + | |
167 | 165 |
| |
168 | 166 |
| |
169 | 167 |
| |
170 |
| - | |
| 168 | + | |
171 | 169 |
| |
172 | 170 |
| |
173 | 171 |
| |
| |||
176 | 174 |
| |
177 | 175 |
| |
178 | 176 |
| |
| 177 | + | |
| 178 | + | |
| 179 | + | |
| 180 | + | |
| 181 | + | |
| 182 | + | |
| 183 | + | |
| 184 | + | |
| 185 | + | |
| 186 | + | |
| 187 | + | |
| 188 | + | |
| 189 | + | |
| 190 | + | |
| 191 | + | |
| 192 | + | |
| 193 | + | |
| 194 | + | |
| 195 | + | |
| 196 | + | |
| 197 | + | |
| 198 | + | |
179 | 199 |
| |
180 | 200 |
| |
181 | 201 |
| |
182 | 202 |
| |
183 | 203 |
| |
184 | 204 |
| |
185 |
| - | |
| 205 | + | |
186 | 206 |
| |
187 | 207 |
| |
188 | 208 |
| |
189 | 209 |
| |
| 210 | + | |
| 211 | + | |
190 | 212 |
| |
191 | 213 |
| |
192 | 214 |
| |
| |||
234 | 256 |
| |
235 | 257 |
| |
236 | 258 |
| |
237 |
| - | |
| 259 | + |
Lines changed: 12 additions & 4 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
97 | 97 |
| |
98 | 98 |
| |
99 | 99 |
| |
100 |
| - | |
| 100 | + | |
| 101 | + | |
| 102 | + | |
101 | 103 |
| |
102 | 104 |
| |
103 | 105 |
| |
| |||
110 | 112 |
| |
111 | 113 |
| |
112 | 114 |
| |
| 115 | + | |
| 116 | + | |
| 117 | + | |
113 | 118 |
| |
114 | 119 |
| |
115 | 120 |
| |
116 | 121 |
| |
| 122 | + | |
| 123 | + | |
| 124 | + | |
| 125 | + | |
117 | 126 |
| |
118 | 127 |
| |
119 | 128 |
| |
| |||
138 | 147 |
| |
139 | 148 |
| |
140 | 149 |
| |
141 |
| - | |
142 | 150 |
| |
143 |
| - | |
| 151 | + | |
144 | 152 |
| |
145 |
| - | |
| 153 | + | |
146 | 154 |
| |
147 | 155 |
| |
148 | 156 |
| |
|
Lines changed: 26 additions & 19 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
71 | 71 |
| |
72 | 72 |
| |
73 | 73 |
| |
| 74 | + | |
74 | 75 |
| |
75 | 76 |
| |
| 77 | + | |
76 | 78 |
| |
77 | 79 |
| |
78 | 80 |
| |
79 | 81 |
| |
80 |
| - | |
81 |
| - | |
82 |
| - | |
83 |
| - | |
84 | 82 |
| |
85 | 83 |
| |
86 | 84 |
| |
| |||
151 | 149 |
| |
152 | 150 |
| |
153 | 151 |
| |
| 152 | + | |
154 | 153 |
| |
| 154 | + | |
155 | 155 |
| |
156 | 156 |
| |
157 | 157 |
| |
| |||
217 | 217 |
| |
218 | 218 |
| |
219 | 219 |
| |
220 |
| - | |
| 220 | + | |
221 | 221 |
| |
222 | 222 |
| |
223 |
| - | |
| 223 | + | |
224 | 224 |
| |
225 | 225 |
| |
226 | 226 |
| |
| |||
300 | 300 |
| |
301 | 301 |
| |
302 | 302 |
| |
303 |
| - | |
304 |
| - | |
| 303 | + | |
| 304 | + | |
305 | 305 |
| |
306 | 306 |
| |
307 | 307 |
| |
| |||
312 | 312 |
| |
313 | 313 |
| |
314 | 314 |
| |
| 315 | + | |
315 | 316 |
| |
316 | 317 |
| |
317 | 318 |
| |
| |||
351 | 352 |
| |
352 | 353 |
| |
353 | 354 |
| |
354 |
| - | |
355 |
| - | |
| 355 | + | |
| 356 | + | |
356 | 357 |
| |
357 | 358 |
| |
358 | 359 |
| |
| |||
383 | 384 |
| |
384 | 385 |
| |
385 | 386 |
| |
386 |
| - | |
| 387 | + | |
387 | 388 |
| |
388 |
| - | |
389 |
| - | |
390 |
| - | |
391 |
| - | |
| 389 | + | |
| 390 | + | |
| 391 | + | |
| 392 | + | |
392 | 393 |
| |
393 | 394 |
| |
394 | 395 |
| |
| |||
467 | 468 |
| |
468 | 469 |
| |
469 | 470 |
| |
470 |
| - | |
471 |
| - | |
472 |
| - | |
| 471 | + | |
| 472 | + | |
| 473 | + | |
| 474 | + | |
| 475 | + | |
| 476 | + | |
| 477 | + | |
| 478 | + | |
473 | 479 |
| |
474 | 480 |
| |
475 | 481 |
| |
| |||
502 | 508 |
| |
503 | 509 |
| |
504 | 510 |
| |
505 |
| - | |
| 511 | + | |
| 512 | + | |
506 | 513 |
| |
507 | 514 |
| |
508 | 515 |
| |
|
Lines changed: 3 additions & 11 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
59 | 59 |
| |
60 | 60 |
| |
61 | 61 |
| |
62 |
| - | |
63 |
| - | |
64 | 62 |
| |
65 | 63 |
| |
66 | 64 |
| |
67 | 65 |
| |
68 | 66 |
| |
69 | 67 |
| |
70 | 68 |
| |
71 |
| - | |
| 69 | + | |
72 | 70 |
| |
73 | 71 |
| |
| 72 | + | |
74 | 73 |
| |
75 | 74 |
| |
76 | 75 |
| |
| |||
107 | 106 |
| |
108 | 107 |
| |
109 | 108 |
| |
110 |
| - | |
111 |
| - | |
112 |
| - | |
113 |
| - | |
114 |
| - | |
115 |
| - | |
116 |
| - | |
117 |
| - | |
118 | 109 |
| |
119 | 110 |
| |
120 | 111 |
| |
| |||
145 | 136 |
| |
146 | 137 |
| |
147 | 138 |
| |
| 139 | + | |
148 | 140 |
| |
149 | 141 |
| |
150 | 142 |
|
Lines changed: 9 additions & 1 deletion
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
2 | 2 |
| |
3 | 3 |
| |
4 | 4 |
| |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
5 | 10 |
| |
6 | 11 |
| |
7 | 12 |
| |
| |||
26 | 31 |
| |
27 | 32 |
| |
28 | 33 |
| |
29 |
| - | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + |
Lines changed: 2 additions & 2 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
27 | 27 |
| |
28 | 28 |
| |
29 | 29 |
| |
30 |
| - | |
31 |
| - | |
| 30 | + | |
| 31 | + | |
32 | 32 |
| |
33 | 33 |
| |
34 | 34 |
|
Lines changed: 8 additions & 4 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
492 | 492 |
| |
493 | 493 |
| |
494 | 494 |
| |
495 |
| - | |
496 |
| - | |
497 |
| - | |
| 495 | + | |
| 496 | + | |
| 497 | + | |
498 | 498 |
| |
499 | 499 |
| |
500 |
| - | |
| 500 | + | |
| 501 | + | |
| 502 | + | |
| 503 | + | |
| 504 | + | |
501 | 505 |
| |
502 | 506 |
| |
503 | 507 |
| |
|
0 commit comments