File tree
843 files changed
+40634
-19412
lines changed- .buildkite
- lm-eval-harness
- configs
- nightly-benchmarks/scripts
- scripts
- hardware_ci
- .github
- ISSUE_TEMPLATE
- workflows
- benchmarks
- cutlass_benchmarks
- disagg_benchmarks
- fused_kernels
- kernels
- deepgemm
- overheads
- profiling
- cmake
- csrc
- attention
- core
- cpu
- cutlass_extensions
- gradlib
- moe
- marlin_kernels
- marlin_moe_wna16
- quantization
- compressed_tensors
- cutlass_w8a8
- c3x
- fp4
- fp8
- fused_kernels
- gguf
- gptq_allspark
- gptq_marlin
- rocm
- docker
- docs/source
- assets/deployment
- community
- deployment
- frameworks
- design/v1
- features
- quantization
- getting_started
- installation
- models
- serving
- examples
- lmcache
- offline_inference
- basic
- disaggregated-prefill-v1
- openai_batch
- online_serving
- disaggregated_serving
- opentelemetry
- gradlib
- requirements
- tests
- async_engine
- basic_correctness
- compile
- piecewise
- distributed
- engine
- entrypoints
- llm
- openai
- tool_parsers
- kernels
- attention
- core
- mamba
- moe
- quantization
- lora
- model_executor
- models
- language
- generation
- pooling
- multimodal
- generation
- vlm_utils
- processing
- quantization
- multimodal
- neuron/1_core
- plugins/lora_resolvers
- quantization
- runai_model_streamer_test
- samplers
- spec_decode
- e2e
- tensorizer_loader
- tpu
- lora
- v1
- core
- engine
- entrypoints
- llm
- openai
- kv_connector
- nixl_integration
- unit
- metrics
- spec_decode
- structured_output
- tpu
- worker
- weight_loading
- tools/ep_kernels
- vllm
- adapter_commons
- attention
- backends
- mla
- ops
- benchmarks
- compilation
- device_allocator
- distributed
- device_communicators
- kv_transfer
- kv_connector
- v1
- kv_lookup_buffer
- kv_pipe
- engine
- multiprocessing
- entrypoints
- cli
- openai
- tool_parsers
- executor
- inputs
- logging_utils
- lora
- ops
- triton_ops
- xla_ops
- punica_wrapper
- model_executor
- guided_decoding
- layers
- fused_moe
- configs
- mamba
- ops
- quantization
- compressed_tensors
- schemes
- kernels
- mixed_precision
- scaled_mm
- quark
- schemes
- utils
- model_loader
- models
- multimodal
- platforms
- plugins
- lora_resolvers
- profiler
- reasoning
- transformers_utils
- chat_templates
- configs
- processors
- tokenizers
- v1
- attention/backends
- mla
- core
- sched
- engine
- executor
- metrics
- spec_decode
- stats
- structured_output
- worker
- worker
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
843 files changed
+40634
-19412
lines changedLines changed: 12 additions & 8 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
8 | 8 |
| |
9 | 9 |
| |
10 | 10 |
| |
11 |
| - | |
| 11 | + | |
12 | 12 |
| |
13 | 13 |
| |
14 | 14 |
| |
15 | 15 |
| |
16 |
| - | |
| 16 | + | |
17 | 17 |
| |
18 | 18 |
| |
19 | 19 |
| |
| |||
28 | 28 |
| |
29 | 29 |
| |
30 | 30 |
| |
31 |
| - | |
32 |
| - | |
33 |
| - | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
34 | 36 |
| |
35 | 37 |
| |
36 | 38 |
| |
37 |
| - | |
38 |
| - | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
39 | 43 |
| |
40 | 44 |
| |
41 | 45 |
| |
| |||
45 | 49 |
| |
46 | 50 |
| |
47 | 51 |
| |
48 |
| - | |
| 52 | + |
Lines changed: 2 additions & 2 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
22 | 22 |
| |
23 | 23 |
| |
24 | 24 |
| |
25 |
| - | |
26 |
| - | |
| 25 | + | |
| 26 | + |
Lines changed: 11 additions & 0 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + |
Lines changed: 11 additions & 0 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + |
Lines changed: 11 additions & 0 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + |
Lines changed: 1 addition & 0 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
3 | 3 |
| |
4 | 4 |
| |
5 | 5 |
| |
| 6 | + |
Lines changed: 2 additions & 6 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
1 |
| - | |
2 |
| - | |
| 1 | + | |
3 | 2 |
| |
4 | 3 |
| |
5 | 4 |
| |
6 |
| - | |
| 5 | + | |
7 | 6 |
| |
8 |
| - | |
9 |
| - | |
10 |
| - |
Lines changed: 43 additions & 0 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + |
Lines changed: 0 additions & 59 deletions
This file was deleted.
Lines changed: 23 additions & 38 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
3 | 3 |
| |
4 | 4 |
| |
5 | 5 |
| |
6 |
| - | |
7 |
| - | |
8 |
| - | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
9 | 9 |
| |
10 | 10 |
| |
11 |
| - | |
12 |
| - | |
13 |
| - | |
14 | 11 |
| |
15 |
| - | |
16 |
| - | |
| 12 | + | |
17 | 13 |
| |
18 | 14 |
| |
19 | 15 |
| |
20 |
| - | |
21 |
| - | |
22 |
| - | |
23 |
| - | |
24 |
| - | |
25 |
| - | |
26 | 16 |
| |
27 |
| - | |
28 |
| - | |
29 |
| - | |
30 |
| - | |
31 |
| - | |
32 |
| - | |
33 |
| - | |
34 | 17 |
| |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
35 | 27 |
| |
36 | 28 |
| |
37 | 29 |
| |
38 | 30 |
| |
39 | 31 |
| |
40 | 32 |
| |
41 |
| - | |
42 |
| - | |
| 33 | + | |
| 34 | + | |
43 | 35 |
| |
44 | 36 |
| |
45 | 37 |
| |
46 |
| - | |
47 |
| - | |
48 |
| - | |
49 |
| - | |
50 |
| - | |
51 |
| - | |
52 |
| - | |
| 38 | + | |
| 39 | + | |
53 | 40 |
| |
54 |
| - | |
55 |
| - | |
| 41 | + | |
56 | 42 |
| |
57 |
| - | |
58 | 43 |
| |
59 | 44 |
| |
60 | 45 |
| |
61 | 46 |
| |
62 | 47 |
| |
63 |
| - | |
64 |
| - | |
65 |
| - | |
66 |
| - | |
| 48 | + | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
| 52 | + | |
67 | 53 |
| |
68 |
| - | |
69 | 54 |
|
0 commit comments