File tree
1,198 files changed
+93325
-30255
lines changed- .buildkite
- lm-eval-harness
- configs
- nightly-benchmarks
- scripts
- tests
- scripts
- hardware_ci
- .github
- ISSUE_TEMPLATE
- workflows
- benchmarks
- kernels
- cmake
- external_projects
- csrc
- attention
- mla
- core
- cpu
- cutlass_extensions
- epilogue
- mamba/causal_conv1d
- moe
- marlin_kernels
- marlin_moe_wna16
- permute_unpermute_kernels
- quantization
- cutlass_w8a8
- moe
- fp4
- fp8
- fused_kernels
- gguf
- gptq_allspark
- gptq_marlin
- marlin
- dense
- qqq
- sparse
- common
- rocm
- docker
- docs
- source
- _static
- api
- engine
- model
- multimodal
- offline_inference
- assets
- deployment
- design/v1/prefix_caching
- community
- contributing
- dockerfile
- model
- deployment
- frameworks
- integrations
- design
- v1
- features
- quantization
- getting_started
- installation
- ai_accelerator
- cpu
- gpu
- models
- extensions
- performance
- serving
- examples
- lmcache
- disagg_prefill_lmcache_v1
- configs
- offline_inference
- basic
- disaggregated-prefill-v1
- qwen2_5_omni
- online_serving
- chart-helm
- disagg_examples
- requirements
- tests
- basic_correctness
- benchmarks
- compile
- piecewise
- config
- core
- block/e2e
- distributed
- engine
- entrypoints
- llm
- openai
- correctness
- kernels
- attention
- core
- mamba
- moe
- quantization
- kv_transfer
- lora
- data
- metrics
- model_executor
- models
- decoder_only
- language
- vision_language
- embedding
- encoder_decoder
- audio_language
- language
- vision_language
- language
- generation
- pooling
- multimodal
- generation
- vlm_utils
- pooling
- processing
- quantization
- multimodal
- assets
- neuron/1_core
- quantization
- reasoning
- samplers
- spec_decode
- e2e
- tokenization
- tool_use
- tpu
- v1
- core
- e2e
- engine
- entrypoints
- llm
- sample
- shutdown
- spec_decode
- structured_output
- tpu
- worker
- worker
- worker
- tools
- vllm
- assets
- attention
- backends
- mla
- ops
- utils
- benchmarks
- compilation
- core
- device_allocator
- distributed
- device_communicators
- kv_transfer
- kv_connector
- v1
- kv_lookup_buffer
- kv_pipe
- engine
- multiprocessing
- output_processor
- entrypoints
- cli
- benchmark
- openai
- reasoning_parsers
- tool_parsers
- executor
- inputs
- lora
- ops/triton_ops
- punica_wrapper
- model_executor
- guided_decoding
- reasoner
- layers
- fused_moe
- configs
- mamba
- ops
- quantization
- compressed_tensors
- schemes
- kernels
- mixed_precision
- scaled_mm
- quark
- schemes
- utils
- model_loader
- models
- multimodal
- platforms
- profiler
- reasoning
- spec_decode
- third_party
- transformers_utils
- configs
- processors
- tokenizer_group
- tokenizers
- triton_utils
- usage
- v1
- attention/backends
- mla
- core
- sched
- engine
- executor
- metrics
- sample
- ops
- tpu
- spec_decode
- structured_output
- worker
- worker
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
1,198 files changed
+93325
-30255
lines changedLines changed: 1 addition & 0 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
| 1 | + | |
1 | 2 |
| |
2 | 3 |
| |
3 | 4 |
| |
|
Lines changed: 1 addition & 0 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
| 1 | + | |
1 | 2 |
| |
2 | 3 |
| |
3 | 4 |
| |
|
Lines changed: 1 addition & 0 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
| 1 | + | |
1 | 2 |
| |
2 | 3 |
| |
3 | 4 |
| |
|
Lines changed: 1 addition & 0 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
| 1 | + | |
1 | 2 |
| |
2 | 3 |
| |
3 | 4 |
| |
|
Lines changed: 1 addition & 0 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
| 1 | + | |
1 | 2 |
| |
2 | 3 |
| |
3 | 4 |
| |
|
Lines changed: 1 addition & 0 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
| 1 | + | |
1 | 2 |
| |
2 | 3 |
| |
3 | 4 |
| |
|
Lines changed: 1 addition & 0 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
| 1 | + | |
1 | 2 |
| |
2 | 3 |
| |
3 | 4 |
| |
|
Lines changed: 1 addition & 0 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
| 1 | + | |
1 | 2 |
| |
2 | 3 |
| |
3 | 4 |
| |
|
Lines changed: 1 addition & 0 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
| 1 | + | |
1 | 2 |
| |
2 | 3 |
| |
3 | 4 |
| |
|
Lines changed: 1 addition & 0 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
| 1 | + | |
1 | 2 |
| |
2 | 3 |
| |
3 | 4 |
| |
|
0 commit comments