File tree
1,615 files changed
+29578
-8542
lines changed- .buildkite
- lm-eval-harness
- nightly-benchmarks/scripts
- scripts
- hardware_ci
- tpu
- .github
- ISSUE_TEMPLATE
- benchmarks
- cutlass_benchmarks
- disagg_benchmarks
- fused_kernels
- kernels
- deepgemm
- overheads
- cmake
- external_projects
- csrc
- attention
- mla
- cpu
- cutlass_extensions
- moe
- marlin_moe_wna16
- permute_unpermute_kernels
- prepare_inputs
- quantization
- compressed_tensors
- cutlass_w8a8
- c3x
- moe
- fp4
- fp8
- amd
- fused_kernels
- gguf
- gptq_marlin
- gptq
- machete
- rocm
- sparse/cutlass
- docker
- docs
- ci
- cli
- contributing
- model
- deployment
- design
- v1
- features
- quantization
- getting_started
- installation
- ai_accelerator
- cpu
- gpu
- mkdocs
- hooks
- javascript
- stylesheets
- models
- extensions
- serving
- usage
- examples
- offline_inference
- basic
- disaggregated-prefill-v1
- profiling_tpu
- qwen2_5_omni
- online_serving
- disaggregated_serving
- opentelemetry
- structured_outputs
- others
- lmcache
- disagg_prefill_lmcache_v1
- requirements
- tests
- async_engine
- basic_correctness
- benchmarks
- compile
- piecewise
- core
- block
- e2e
- detokenizer
- distributed
- encoder_decoder
- engine
- entrypoints
- llm
- offline_mode
- openai
- correctness
- tool_parsers
- fastsafetensors_loader
- kernels
- attention
- core
- mamba
- moe
- quantization
- kv_transfer
- lora
- metrics
- mistral_tool_use
- model_executor
- models
- language
- generation
- pooling
- multimodal
- generation
- vlm_utils
- pooling
- processing
- quantization
- mq_llm_engine
- multi_step
- multimodal
- neuron
- 1_core
- 2_core
- plugins_tests
- plugins
- lora_resolvers
- vllm_add_dummy_model
- vllm_add_dummy_model
- vllm_add_dummy_platform
- vllm_add_dummy_platform
- prefix_caching
- prompt_adapter
- quantization
- reasoning
- runai_model_streamer_test
- samplers
- spec_decode
- e2e
- standalone_tests
- tensorizer_loader
- tokenization
- tool_use
- tpu
- lora
- tracing
- v1
- core
- e2e
- engine
- entrypoints
- llm
- openai
- kv_connector
- nixl_integration
- unit
- metrics
- sample
- shutdown
- spec_decode
- structured_output
- tpu
- worker
- worker
- vllm_test_utils
- vllm_test_utils
- weight_loading
- worker
- tools
- ep_kernels
- profiler
- vllm
- adapter_commons
- assets
- attention
- backends
- mla
- ops
- blocksparse_attention
- utils
- benchmarks
- compilation
- core
- block
- device_allocator
- distributed
- device_communicators
- kv_transfer
- kv_connector
- v1
- kv_lookup_buffer
- kv_pipe
- engine
- multiprocessing
- output_processor
- entrypoints
- cli
- benchmark
- openai
- tool_parsers
- executor
- inputs
- logging_utils
- lora
- ops
- torch_ops
- triton_ops
- xla_ops
- punica_wrapper
- model_executor
- guided_decoding
- layers
- fused_moe
- configs
- mamba
- ops
- quantization
- compressed_tensors
- schemes
- kernels
- mixed_precision
- scaled_mm
- quark
- schemes
- utils
- model_loader
- models
- multimodal
- platforms
- plugins
- lora_resolvers
- profiler
- prompt_adapter
- reasoning
- spec_decode
- third_party
- transformers_utils
- chat_templates
- configs
- processors
- tokenizers
- triton_utils
- usage
- v1
- attention/backends
- mla
- core
- sched
- engine
- executor
- metrics
- sample
- ops
- tpu
- spec_decode
- structured_output
- worker
- worker
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
1,615 files changed
+29578
-8542
lines changedLines changed: 1 addition & 0 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
1 | 1 |
| |
| 2 | + | |
2 | 3 |
| |
3 | 4 |
| |
4 | 5 |
| |
|
Lines changed: 1 addition & 0 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
1 | 1 |
| |
| 2 | + | |
2 | 3 |
| |
3 | 4 |
| |
4 | 5 |
| |
|
Lines changed: 1 addition & 0 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
1 | 1 |
| |
| 2 | + | |
2 | 3 |
| |
3 | 4 |
| |
4 | 5 |
| |
|
Lines changed: 1 addition & 0 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
1 | 1 |
| |
| 2 | + | |
2 | 3 |
| |
3 | 4 |
| |
4 | 5 |
| |
|
Lines changed: 1 addition & 0 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
1 | 1 |
| |
| 2 | + | |
2 | 3 |
| |
3 | 4 |
| |
4 | 5 |
| |
|
Lines changed: 1 addition & 0 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
1 | 1 |
| |
| 2 | + | |
2 | 3 |
| |
3 | 4 |
| |
4 | 5 |
| |
|
Lines changed: 1 addition & 0 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
1 | 1 |
| |
| 2 | + | |
2 | 3 |
| |
3 | 4 |
| |
4 | 5 |
| |
|
Lines changed: 1 addition & 0 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
1 | 1 |
| |
| 2 | + | |
2 | 3 |
| |
3 | 4 |
| |
4 | 5 |
| |
|
Lines changed: 1 addition & 0 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
1 | 1 |
| |
| 2 | + | |
2 | 3 |
| |
3 | 4 |
| |
4 | 5 |
| |
|
Lines changed: 18 additions & 1 deletion
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
1 | 1 |
| |
2 | 2 |
| |
| 3 | + | |
3 | 4 |
| |
4 | 5 |
| |
5 | 6 |
| |
| |||
11 | 12 |
| |
12 | 13 |
| |
13 | 14 |
| |
| 15 | + | |
14 | 16 |
| |
15 | 17 |
| |
16 | 18 |
| |
| |||
28 | 30 |
| |
29 | 31 |
| |
30 | 32 |
| |
| 33 | + | |
31 | 34 |
| |
32 | 35 |
| |
33 | 36 |
| |
| |||
44 | 47 |
| |
45 | 48 |
| |
46 | 49 |
| |
| 50 | + | |
47 | 51 |
| |
48 | 52 |
| |
49 | 53 |
| |
50 | 54 |
| |
51 | 55 |
| |
52 | 56 |
| |
53 | 57 |
| |
| 58 | + | |
| 59 | + | |
| 60 | + | |
| 61 | + | |
| 62 | + | |
| 63 | + | |
| 64 | + | |
| 65 | + | |
| 66 | + | |
| 67 | + | |
| 68 | + | |
| 69 | + | |
54 | 70 |
| |
55 | 71 |
| |
56 | 72 |
| |
| |||
70 | 86 |
| |
71 | 87 |
| |
72 | 88 |
| |
| 89 | + | |
73 | 90 |
| |
74 | 91 |
| |
75 |
| - | |
| 92 | + | |
76 | 93 |
| |
77 | 94 |
| |
78 | 95 |
| |
|
0 commit comments