Skip to content

[step 1]adjust gguf tuning algorithm#1649

Merged
wenhuach21 merged 29 commits into
mainfrom
wenhuach21-patch-1
May 18, 2026
Merged

[step 1]adjust gguf tuning algorithm#1649
wenhuach21 merged 29 commits into
mainfrom
wenhuach21-patch-1

Conversation

@wenhuach21
Copy link
Copy Markdown
Contributor

@wenhuach21 wenhuach21 commented Apr 2, 2026

Description

Please briefly describe your main changes, the motivation.

Type of Change

  • Bug fix
  • New feature
  • Documentation update
  • Performance improvement
  • Code refactoring
  • Other (please specify):

Related Issues

Fixes or relates to #

Checklist Before Submitting

  • My code has been tested locally.
  • Documentation has been updated as needed.
  • New or updated tests are included where applicable.

Copilot AI review requested due to automatic review settings April 2, 2026 09:44
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR updates the algorithm extension utilities in auto_round/alg_ext.py, primarily adjusting masking logic for loss computation and refining the iterative WLS quantization search rounding behavior.

Changes:

  • Tweaks the top-percent absolute value mask selection logic line (no functional change intended, but exposes a doc/behavior mismatch).
  • Adds the rounding perturbation term v into the quant_data_new computation inside iterative_wls_quant_search.

Comment thread auto_round/alg_ext.py
@wenhuach21 wenhuach21 changed the title Update alg_ext.py fix gguf issue in alg_ext.py Apr 3, 2026
@wenhuach21
Copy link
Copy Markdown
Contributor Author

/azp run Unit-Test-CUDA-AutoRound

@azure-pipelines
Copy link
Copy Markdown

Azure Pipelines successfully started running 1 pipeline(s).

@wenhuach21
Copy link
Copy Markdown
Contributor Author

/azp run Unit-Test-CUDA-AutoRound

@azure-pipelines
Copy link
Copy Markdown

Azure Pipelines successfully started running 1 pipeline(s).

@wenhuach21
Copy link
Copy Markdown
Contributor Author

/azp run Unit-Test-CUDA-AutoRound

@azure-pipelines
Copy link
Copy Markdown

Azure Pipelines successfully started running 1 pipeline(s).

@wenhuach21
Copy link
Copy Markdown
Contributor Author

/azp run Unit-Test-CUDA-AutoRound

@azure-pipelines
Copy link
Copy Markdown

Azure Pipelines successfully started running 1 pipeline(s).

@wenhuach21 wenhuach21 changed the title fix gguf issue in alg_ext.py refine alg_ext code to better support torch compile May 9, 2026
@wenhuach21 wenhuach21 added this to the 0.13.0 milestone May 9, 2026
@wenhuach21
Copy link
Copy Markdown
Contributor Author

/azp run Unit-Test-CUDA-AutoRound

@azure-pipelines
Copy link
Copy Markdown

Azure Pipelines successfully started running 1 pipeline(s).

@wenhuach21
Copy link
Copy Markdown
Contributor Author

/azp run Unit-Test-CUDA-AutoRound

@azure-pipelines
Copy link
Copy Markdown

Azure Pipelines successfully started running 1 pipeline(s).

# Conflicts:
#	auto_round/compressors_new/base.py
#	auto_round/data_type/int.py
@wenhuach21 wenhuach21 requested a review from n1ck-guo May 14, 2026 07:21
@wenhuach21
Copy link
Copy Markdown
Contributor Author

/azp run Unit-Test-CUDA-AutoRound

@azure-pipelines
Copy link
Copy Markdown

Azure Pipelines successfully started running 1 pipeline(s).

@n1ck-guo
Copy link
Copy Markdown
Contributor

The code in alg_ext.py should already be modified to autoroundv2, so changes to it will no longer take effect. You need to modify it accordingly to v2.

@wenhuach21 wenhuach21 changed the title refine alg_ext code to better support torch compile adjust gguf tuning algorithm May 15, 2026
@wenhuach21
Copy link
Copy Markdown
Contributor Author

/azp run Unit-Test-CUDA-AutoRound

@azure-pipelines
Copy link
Copy Markdown

Azure Pipelines successfully started running 1 pipeline(s).

@wenhuach21 wenhuach21 changed the title adjust gguf tuning algorithm [step 1]adjust gguf tuning algorithm May 18, 2026
@wenhuach21 wenhuach21 merged commit 770b7a3 into main May 18, 2026
46 checks passed
@wenhuach21 wenhuach21 deleted the wenhuach21-patch-1 branch May 18, 2026 03:11
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants