
Sparse MeZO: Less Parameters for Better Performance in Zeroth-Order LLM Fine-Tuning

Prerequisites:

pip install -r requirements.txt

Sparse MeZO Training

# LLaMA-7B training with Sparse MeZO
bash run.sh

Note that for vanilla MeZO training, you can simply set the ratio to 1.0 in run.sh.
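
For intuition, the sketch below outlines one sparse zeroth-order (MeZO-style) update step in PyTorch, where only a masked subset of the parameters, controlled by a sparsity ratio, is perturbed and updated. This is a minimal illustration under our own assumptions: the function name sparse_mezo_step, the magnitude-based masking rule, and the hyperparameter defaults are not taken from this repository's implementation; please refer to run.sh and the training code for the actual details.

# Minimal, illustrative sketch of one Sparse MeZO step (assumptions noted above).
import torch

@torch.no_grad()
def sparse_mezo_step(model, loss_fn, batch, eps=1e-3, lr=1e-6, ratio=0.25, seed=0):
    params = [p for p in model.parameters() if p.requires_grad]

    # Sparse mask keeping roughly `ratio` of each weight tensor
    # (smallest-magnitude entries here; the paper's criterion may differ).
    masks = []
    for p in params:
        k = max(1, int(ratio * p.numel()))
        threshold = p.abs().flatten().kthvalue(k).values
        masks.append((p.abs() <= threshold).to(p.dtype))

    def perturb(scale):
        # Reseeding reproduces the same random direction z every time.
        torch.manual_seed(seed)
        for p, m in zip(params, masks):
            z = torch.randn_like(p)
            p.add_(scale * eps * z * m)

    perturb(+1)                        # theta + eps * (z * mask)
    loss_plus = loss_fn(model, batch)
    perturb(-2)                        # theta - eps * (z * mask)
    loss_minus = loss_fn(model, batch)
    perturb(+1)                        # restore theta

    grad_est = (loss_plus - loss_minus) / (2 * eps)   # scalar SPSA estimate

    torch.manual_seed(seed)            # regenerate the same z for the update
    for p, m in zip(params, masks):
        z = torch.randn_like(p)
        p.add_(-lr * grad_est * z * m)

    return loss_plus

Setting the ratio to 1.0 makes every mask all-ones, which recovers the vanilla MeZO update mentioned above.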

Acknowledgements

Our code is based on the official MeZO repository.

Citation

@inproceedings{liu2025sparse,
  title={Sparse Me{ZO}: Less Parameters for Better Performance in Zeroth-Order {LLM} Fine-Tuning},
  author={Yong Liu and Zirui Zhu and Chaoyu Gong and Minhao Cheng and Cho-Jui Hsieh and Yang You},
  booktitle={The Thirty-ninth Annual Conference on Neural Information Processing Systems},
  year={2025},
  url={https://openreview.net/forum?id=Tjw0ACu3NL}
}