Uh oh!

There was an error while loading. Please reload this page.

intel / auto-round Public

Notifications You must be signed in to change notification settings
Fork 148
Star 1.5k

Code
Issues 64
Pull requests 14
Discussions
Actions
Projects
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security and quality
Insights

Pull requests: intel/auto-round

Labels 34 Milestones 5

New pull request New

14 Open 1,368 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

fix: replace bare except clauses with except Exception

#1981 opened Jul 1, 2026 by ramkrishs Contributor

Loading…

4 of 5 tasks

Support vLLM-based Model Quantization with llm_compressor Export

#1978 opened Jul 1, 2026 by changwangss Contributor

Loading…

4 tasks

Add hpu inference in CI test

#1976 opened Jul 1, 2026 by chensuyue Contributor

Loading…

4 tasks

[ARK] Support gemm using sycl-tla

#1968 opened Jun 30, 2026 by Zhenzhong1 Contributor • Draft

perf: back-pointer DP for AutoScheme bit allocation to cut path-copy RAM

#1959 opened Jun 26, 2026 by SuperMarioYL

Loading…

0.15.0

feat: add --dry-run VRAM/size estimation mode

#1958 opened Jun 26, 2026 by mvanhorn

Loading…

Fix UltraChat chat-template handling for Transformers v5

#1941 opened Jun 22, 2026 by Copilot AI • Draft

2 of 4 tasks

Add quantization support for DiffusionGemma

#1935 opened Jun 17, 2026 by lvliang-intel Contributor

Loading…

1 of 4 tasks

Added prefill strategy benchmarking script and results

#1923 opened Jun 15, 2026 by jijiaz

Loading…

[draft]refine device

#1900 opened Jun 9, 2026 by wenhuach21 Contributor • Draft

4 tasks

feat: add overlap function for multi-blocks compression

#1850 opened May 25, 2026 by ZaneMark Contributor

Loading…

3 tasks

Add moe prefill/ decode with int2/int4/int8 sym /asym and fp8 e4m3 e5m2

#1813 opened May 14, 2026 by Copilot AI

Loading…

4 tasks done

feat: support Nemotron-H / Nemotron-Cascade-2 (#1711)

#1712 opened Apr 20, 2026 by michael-rabe

Loading…

4 of 9 tasks

Continuously optimize AutoScheme RAM consumption

#1703 opened Apr 17, 2026 by lvliang-intel Contributor

Loading…

2 of 9 tasks

ProTip! Type g i on any issue or pull request to go back to the issue listing page.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!