Skip to content

Pull requests: vllm-project/llm-compressor

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Refactor to standardize pytest usage
#1844 opened Sep 18, 2025 by fynnsu Loading…
Move OBCQ to SparseGPT
#1842 opened Sep 18, 2025 by Roy214 Loading…
MSE observer for NVFP4
#1840 opened Sep 17, 2025 by shubhra Loading…
ready label check ready When a PR is ready for review
#1832 opened Sep 17, 2025 by brian-dellabetta Loading…
1 task done
Consolidate example script tests into single parametrized test ready When a PR is ready for review
#1801 opened Sep 5, 2025 by fynnsu Loading…
add support for per-head attention quantization
#1791 opened Sep 2, 2025 by eldarkurtic Loading…
[QuantizationFormat] Remove code inferring format ready When a PR is ready for review
#1786 opened Aug 29, 2025 by dsikka Draft
[MXFP4] Add mxfp4 support
#1783 opened Aug 28, 2025 by dsikka Draft
[Transform] Spinquant R3 ready When a PR is ready for review
#1778 opened Aug 27, 2025 by kylesayrs Loading…
[Multi-modifier] Support scoped application of quantization config/status ready When a PR is ready for review
#1772 opened Aug 21, 2025 by brian-dellabetta Loading…
5 tasks done
[Tracing] Support Cohere Vision, Decouple vision tower from first layer ready When a PR is ready for review
#1710 opened Aug 6, 2025 by kylesayrs Loading…
[Example] [VLM] Gemma3n
#1696 opened Jul 31, 2025 by kylesayrs Draft
[Autowrapper] Support Gemma3n, autowrapper improvements ready When a PR is ready for review
#1693 opened Jul 30, 2025 by kylesayrs Loading…
1686 Logic matching refactor
#1687 opened Jul 28, 2025 by ved1beta Loading…
add quantization_w4a4_fp4 qwen3 example
#1681 opened Jul 24, 2025 by wangwenmingaa Loading…
ProTip! Updated in the last three days: updated:>2025-09-16.