dynamic map checkpoint naming based on model objective. #1840

Open
xin3he wants to merge 1 commit into main from xinhe/5-18a

xin3he commented May 21, 2026

Description

This pull request fixes how model paths are remapped during quantization configuration and extends the evaluation helpers and tests for HuggingFace models, including multimodal support. Remapping is now applied only when the model's module paths do not already match the checkpoint namespace, which prevents incorrect path transformations for composite models. The evaluation utilities and tests also gain parameters for selecting the model type and limiting the number of evaluated samples.

Quantization remapping logic improvements:

  • Added logic in get_layer_config (convert_model.py) to apply the checkpoint conversion mapping only when the model's module paths do not already match the checkpoint namespace. This prevents incorrect remapping for composite models such as VLMs.
  • Updated the remapping of quant_block_list and extra_config to respect the new _should_remap flag, so both are remapped only when necessary.
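
For reviewers, the guard described above can be pictured roughly as follows. This is a minimal sketch and not the code in convert_model.py: the helper names (paths_match, apply_prefix_mapping, remap_if_needed), the plain prefix-to-prefix mapping, and the direction of the mapping are all assumptions made for illustration.

```python
def paths_match(module_names, layer_names):
    """True if every configured name is a module path or a parent of one."""
    modules = list(module_names)
    return all(
        any(m == name or m.startswith(name + ".") for m in modules)
        for name in layer_names
    )


def apply_prefix_mapping(name, mapping):
    """Rewrite the first matching prefix of `name`; otherwise return it as-is."""
    for old, new in mapping.items():
        if name == old or name.startswith(old + "."):
            return new + name[len(old):]
    return name


def remap_if_needed(module_names, layer_names, mapping):
    """Remap configured names only when they do not resolve in the model."""
    # Hypothetical stand-in for the _should_remap flag mentioned in the PR.
    should_remap = not paths_match(module_names, layer_names)
    if not should_remap:
        return list(layer_names)
    return [apply_prefix_mapping(n, mapping) for n in layer_names]


# Illustrative (made-up) module paths for a composite and a text-only model.
mapping = {"model.language_model": "model"}
composite = ["model.language_model.layers.0.mlp", "model.visual.blocks.0"]
text_only = ["model.layers.0.mlp"]
configured = ["model.language_model.layers.0"]

unchanged = remap_if_needed(composite, configured, mapping)
remapped = remap_if_needed(text_only, configured, mapping)
```

In the composite case the configured name already resolves to a module path, so it is left alone. In the text-only case it does not resolve, so the prefix mapping is applied.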

Evaluation helper enhancements:

  • Modified evaluate_accuracy in helpers.py to accept a model_type parameter, allowing evaluation of different model types (e.g., "hf", "hf-multimodal"). It also now passes the limit parameter to simple_evaluate for more controlled test runs.

Test improvements:

  • Refactored HuggingFace integration tests to use the new model_type and limit parameters, and added a new test for HuggingFace multimodal models (test_hf_multimodal).
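
As a rough illustration of the helper change, the sketch below shows how model_type and limit could be forwarded to lm-eval's simple_evaluate. It is not the code in helpers.py: build_eval_kwargs and evaluate_accuracy_sketch are hypothetical names, and the default batch size, model path, and task name are assumptions. Only the simple_evaluate keyword arguments used here (model, model_args, tasks, limit, batch_size) come from lm-eval itself.

```python
def build_eval_kwargs(model_path, tasks, model_type="hf", limit=None, batch_size=8):
    """Collect keyword arguments for lm_eval.simple_evaluate (hypothetical helper)."""
    return {
        "model": model_type,  # e.g. "hf" or "hf-multimodal"
        "model_args": f"pretrained={model_path}",
        "tasks": list(tasks),
        "limit": limit,  # cap on samples per task; None evaluates everything
        "batch_size": batch_size,
    }


def evaluate_accuracy_sketch(model_path, tasks, model_type="hf", limit=None):
    """Run the evaluation and return the per-task results dictionary."""
    # Imported lazily so the kwargs builder can be used without lm-eval installed.
    from lm_eval import simple_evaluate

    results = simple_evaluate(
        **build_eval_kwargs(model_path, tasks, model_type=model_type, limit=limit)
    )
    return results["results"]


# Placeholder path and task name, chosen only for illustration.
kwargs = build_eval_kwargs(
    "/tmp/quantized_model", ["lambada_openai"], model_type="hf-multimodal", limit=10
)
```

Keeping the argument assembly separate from the call makes it possible to check that the model type and limit are wired through without running a full evaluation.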

Type of Change

Bug fix

Related Issues

Fixes or relates to #1784 and #1836

Checklist Before Submitting

  • My code has been tested locally.
  • Documentation has been updated as needed.
  • New or updated tests are included where applicable.
  • The CUDA CI has passed. You can trigger it by commenting /azp run Unit-Test-CUDA-AutoRound.

Signed-off-by: Xin He <xin3.he@intel.com>

xin3he commented May 21, 2026

/azp run Unit-Test-CUDA-AutoRound

@azure-pipelines

Azure Pipelines successfully started running 1 pipeline(s).

Copilot AI left a comment

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.
