This repository provides a comprehensive framework for fault injection, sensitivity analysis, and optimization targeting transformer-based language models. It includes tools for Bit-Flip Attacks (BFA), layer-wise sensitivity scoring, and ablation studies to assess model robustness and performance degradation.
## Repository Structure

```
.
├── ablation_study.py        # Evaluates accuracy under various ablation strategies
├── attn_breaker_utils.py    # Utility functions for sensitivity metrics and tensor manipulation
├── genbfa_optimization.py   # Search-based optimization for the generalized bit-flip attack
├── genbfa_utils.py          # Helper functions for BFA (e.g., bit-level manipulation)
├── sensitivity_analysis.py  # Ranks layers/parameters by sensitivity to faults
└── requirements.txt         # Python package dependencies
```
## Features

- Layer-wise Sensitivity Scoring using gradient and weight norms
- Generalized Bit-Flip Attack (BFA) with search-based optimization
- Modular Bit Manipulation Utilities
- Ablation & Recovery Experiments to test model robustness
- Integration-ready pipeline with HuggingFace models and quantized weights
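The bit-manipulation utilities operate on two's-complement INT8 weights. A minimal sketch of the core primitive behind a bit-flip attack (the function name and signature here are illustrative, not the repository's actual `genbfa_utils` API):

```python
def flip_bit_int8(w: int, bit: int) -> int:
    """Flip one bit (0 = LSB, 7 = sign bit) of an int8 value.

    The value is reinterpreted as an unsigned byte, XOR-ed with a
    one-hot mask, and mapped back into the signed int8 range.
    """
    u = (w & 0xFF) ^ (1 << bit)       # flip the target bit in the raw byte
    return u - 256 if u >= 128 else u  # back to two's-complement int8

# Flipping the sign bit of an INT8 weight causes the largest perturbation:
print(flip_bit_int8(23, 7))  # -> -105 (sign bit flipped)
print(flip_bit_int8(23, 0))  # -> 22  (LSB flipped)
```

Because XOR is its own inverse, applying the same flip twice restores the original weight, which is what makes recovery experiments possible.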
## Installation

```bash
git clone https://github.com/TIES-Lab/attnbreaker.git
cd attentionbreaker
pip install -r requirements.txt
```

## Usage

### Sensitivity analysis

```bash
python sensitivity_analysis.py -m meta-llama/Llama-3.1-8B-Instruct --q int8 --d cuda:0
```

Ranks the layers of a HuggingFace model by their vulnerability to bit flips, based on combined gradient and weight magnitudes, at a given quantization level. INT8 and NF4 quantization are supported via BitsAndBytes. The default device is the CPU.
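One way to combine gradient and weight magnitudes into a per-layer score can be sketched as follows (function and parameter names are illustrative assumptions, not the actual `attn_breaker_utils` API; toy arrays stand in for model weights and gradients):

```python
import numpy as np

def layer_sensitivity(weight: np.ndarray, grad: np.ndarray, alpha: float = 0.5) -> float:
    """Blend normalized gradient and weight magnitudes into a scalar score.

    alpha trades off the gradient term against the weight-magnitude term.
    """
    g = np.abs(grad) / (np.abs(grad).max() + 1e-12)    # normalized gradient magnitude
    w = np.abs(weight) / (np.abs(weight).max() + 1e-12)  # normalized weight magnitude
    return float((alpha * g + (1 - alpha) * w).mean())

# Rank layers by descending sensitivity:
rng = np.random.default_rng(0)
layers = {name: (rng.normal(size=(8, 8)), rng.normal(size=(8, 8)))
          for name in ("q_proj", "k_proj", "v_proj")}
ranking = sorted(layers, key=lambda n: layer_sensitivity(*layers[n]), reverse=True)
```

The resulting ranking determines which layers the ablation and attack stages target first.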
### Ablation study

```bash
python ablation_study.py -m meta-llama/Llama-3.1-8B-Instruct --q int8 --d cuda:0
```

Ablates the top-k most sensitive layers and evaluates BFAs for different sensitivity scores and layer selections.
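Conceptually, a top-k ablation zeroes the layers with the highest sensitivity scores and re-evaluates accuracy. A minimal sketch (hypothetical helper, not the repository's implementation):

```python
import numpy as np

def ablate_layers(weights: dict, scores: dict, k: int) -> dict:
    """Zero out the k layers with the highest sensitivity scores."""
    top_k = set(sorted(scores, key=scores.get, reverse=True)[:k])
    return {name: (np.zeros_like(w) if name in top_k else w)
            for name, w in weights.items()}

weights = {"a": np.ones(3), "b": np.ones(3), "c": np.ones(3)}
scores = {"a": 0.9, "b": 0.1, "c": 0.5}
ablated = ablate_layers(weights, scores, k=2)  # zeros layers "a" and "c"
```

Comparing model accuracy before and after such ablations is what validates (or falsifies) a given sensitivity score.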
### GenBFA optimization

```bash
python genbfa_optimization.py -m meta-llama/Llama-3.1-8B-Instruct --q int8 --d cuda:0
```

Runs after sensitivity analysis and ablation, shrinking the critical parameter set for a BFA via evolutionary optimization.
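The evolutionary search can be pictured as evolving subsets of candidate bit flips toward a fitness that rewards damage to the model while penalizing subset size. A toy sketch with generic selection, crossover, and mutation operators (illustrative only, not the repository's actual algorithm or API):

```python
import random

def evolve_flip_set(candidates, fitness, pop_size=20, generations=30, seed=0):
    """Evolve a small subset of candidate (layer, index, bit) flips maximizing `fitness`.

    `fitness` scores a subset, e.g. model loss after flipping minus a size penalty.
    """
    rng = random.Random(seed)
    pop = [set(rng.sample(candidates, k=rng.randint(1, len(candidates))))
           for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        survivors = pop[: pop_size // 2]          # elitist selection
        children = []
        for _ in range(pop_size - len(survivors)):
            a, b = rng.sample(survivors, 2)       # crossover: subsample the union
            child = set(rng.sample(sorted(a | b), k=max(1, len(a | b) // 2)))
            if rng.random() < 0.3:                # mutation: toggle one candidate
                child.symmetric_difference_update({rng.choice(candidates)})
            children.append(child or {rng.choice(candidates)})
        pop = survivors + children
    return max(pop, key=fitness)

# Toy fitness: reward flipping one "critical" bit, penalize large flip sets.
critical = ("layer0", 5, 7)
candidates = [("layer0", i, 7) for i in range(6)]
best = evolve_flip_set(candidates, lambda s: (10 if critical in s else 0) - len(s))
```

With elitist selection, the best subset found never regresses across generations, which drives the search toward small, high-impact flip sets.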
## Citation

If you use this codebase in your research or publication, please consider citing:

```bibtex
@article{das2024attentionbreaker,
  title={GenBFA: An Evolutionary Optimization Approach to Bit-Flip Attacks on LLMs},
  author={Das, Sanjay and Bhattacharya, Swastik and Kundu, Souvik and Kundu, Shamik and Menon, Anand and Raha, Arnab and Basu, Kanad},
  journal={arXiv preprint arXiv:2411.13757},
  year={2024}
}
```

## Acknowledgments

Built using HuggingFace Transformers and inspired by recent work on fault injection and robustness analysis of LLMs.