Skip to content

Pull requests: intel/auto-round

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Reduce VRAM usage of quantizing VLM models
#1777 opened May 4, 2026 by lvliang-intel Contributor Loading…
1 of 4 tasks
support vllm model quantization
#1775 opened Apr 30, 2026 by mengniwang95 Contributor Draft
4 tasks
fix compile
#1774 opened Apr 30, 2026 by wenhuach21 Contributor Loading…
4 tasks
support gptqmodel 7.0.0 and fix bug in CI
#1772 opened Apr 30, 2026 by xin3he Contributor Loading…
3 of 4 tasks
Optimize CUDA CI and Code Scan workflows
#1770 opened Apr 30, 2026 by XuehaoSun Contributor Loading…
4 tasks
Update base.py
#1768 opened Apr 29, 2026 by wenhuach21 Contributor Loading…
4 tasks
Update alg_ext.py
#1767 opened Apr 29, 2026 by wenhuach21 Contributor Loading…
4 tasks
[refactor] decouple calibaration code
#1765 opened Apr 29, 2026 by n1ck-guo Contributor Loading…
4 tasks
Fix QDQ inference OOM issue.
#1763 opened Apr 29, 2026 by changwangss Loading…
Fix incompatible weight names
#1759 opened Apr 29, 2026 by mengniwang95 Contributor Loading…
4 tasks
Debug/usable rotation
#1756 opened Apr 29, 2026 by ZaneMark Contributor Loading…
Try to fix hadamard regression
#1754 opened Apr 28, 2026 by wenhuach21 Contributor Loading…
4 tasks
Fix FP8 CT export metadata for KV cache and attention
#1752 opened Apr 28, 2026 by yiliu30 Contributor Loading…
Awq algorithm WIP
#1749 opened Apr 28, 2026 by WeiweiZhang1 Contributor Loading…
4 tasks
fix qwen3.6 vllm infer bug
#1746 opened Apr 27, 2026 by n1ck-guo Contributor Loading…
4 tasks
Fix rotation
#1724 opened Apr 23, 2026 by wenhuach21 Contributor Loading…
2 of 9 tasks
[Toolkit] Integrate AutoRound Toolkit
#1723 opened Apr 23, 2026 by Zhenzhong1 Contributor Loading…
2 of 6 tasks
feat: support Nemotron-H / Nemotron-Cascade-2 (#1711)
#1712 opened Apr 20, 2026 by michael-rabe Loading…
4 of 9 tasks
Continuously optimize AutoScheme RAM consumption
#1703 opened Apr 17, 2026 by lvliang-intel Contributor Loading…
2 of 9 tasks
chore: add shared agent config layout
#1700 opened Apr 17, 2026 by yiliu30 Contributor Loading…
support model_free WOQ quantization
#1699 opened Apr 17, 2026 by xin3he Contributor Loading…
4 of 9 tasks
Fix Qwen Omni quantization model issue for long form audio generation
#1698 opened Apr 17, 2026 by lvliang-intel Contributor Loading…
2 of 9 tasks
Fix module.to("meta") for models with plain Tensors
#1688 opened Apr 15, 2026 by yiliu30 Contributor Loading…
1 of 9 tasks
ProTip! Exclude everything labeled bug with -label:bug.