Use separate Conda environments for each stage to avoid dependency conflicts.
Create the `llamafactory` environment:

```bash
conda create -n llamafactory python=3.12
conda activate llamafactory
pip install torch==2.8.0 torchvision==0.23.0 torchaudio==2.8.0 --index-url https://download.pytorch.org/whl/cu128
pip install https://github.com/Dao-AILab/flash-attention/releases/download/v2.8.3/flash_attn-2.8.3+cu12torch2.8cxx11abiFALSE-cp312-cp312-linux_x86_64.whl
pip install -r requirements-llamafactory.txt
```

Create the `verl` environment:

```bash
conda create -n verl python=3.10
conda activate verl
pip install torch==2.8.0 torchvision==0.23.0 torchaudio==2.8.0 --index-url https://download.pytorch.org/whl/cu128
pip install https://github.com/Dao-AILab/flash-attention/releases/download/v2.8.3/flash_attn-2.8.3+cu12torch2.8cxx11abiFALSE-cp310-cp310-linux_x86_64.whl
pip install -r requirements-verl.txt
```

Create the `visualization` environment:

```bash
conda create -n visualization python=3.10
conda activate visualization
pip install torch==2.8.0 torchvision==0.23.0 torchaudio==2.8.0 --index-url https://download.pytorch.org/whl/cu128
pip install https://github.com/Dao-AILab/flash-attention/releases/download/v2.8.3/flash_attn-2.8.3+cu12torch2.8cxx11abiFALSE-cp310-cp310-linux_x86_64.whl
pip install -r requirements-visualization.txt
```
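As a quick optional sanity check, you can verify from inside each activated environment that the CUDA build of PyTorch is usable and that the FlashAttention wheel imports cleanly (`flash_attn` is the package's standard module name):

```bash
# Optional check inside an activated environment: prints the torch version
# and whether a CUDA device is visible, and confirms flash-attn imports.
python -c "import torch, flash_attn; print(torch.__version__, torch.cuda.is_available())"
```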
Prepare the training data before starting model training.
PACS:

```bash
bash ./scripts/pacs/pacs_process.sh
```

VGG:

```bash
bash ./scripts/vgg/vgg_process.sh
```
Run the following scripts for two-stage training.

PACS:

```bash
bash ./scripts/pacs/pacs_train_stage1.sh
bash ./scripts/pacs/pacs_train_stage2.sh
```

VGG:

```bash
bash ./scripts/vgg/vgg_train_stage1.sh
bash ./scripts/vgg/vgg_train_stage2.sh
```
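Since each stage has its own Conda environment, a typical PACS run switches environments between stages. The stage-to-environment mapping below is an assumption based on the environment names (LLaMA-Factory for stage-1 SFT, verl for stage-2 RL):

```bash
# Assumed mapping: stage 1 runs in the llamafactory env, stage 2 in the verl env.
conda activate llamafactory
bash ./scripts/pacs/pacs_train_stage1.sh

conda activate verl
bash ./scripts/pacs/pacs_train_stage2.sh
```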
> [!IMPORTANT]
> If the reward stops improving during stage-2 RL training, run the script below to replace the non-weight files, then resume training:
>
> ```bash
> bash ./scripts/replace_non_weight_files.sh
> ```
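For example, a plateaued PACS run might be recovered as sketched below; that re-running the stage-2 script resumes from the latest checkpoint is an assumption, as the note above only prescribes running the replacement script before resuming:

```bash
# Recovery-flow sketch (resume behavior of the stage-2 script is assumed).
bash ./scripts/replace_non_weight_files.sh
bash ./scripts/pacs/pacs_train_stage2.sh
```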
Run the following scripts to evaluate model performance.

PACS:

```bash
bash ./scripts/pacs/pacs_test.sh
```

VGG:

```bash
bash ./scripts/vgg/vgg_test.sh
```
If you only need visualization for the face recognition scenario, you can directly use the provided checkpoint at `./model/qwen_to_clip_projector.pt`:

```bash
bash ./scripts/visualization/test.sh
```

If you want visualization support for other scenarios, train the mapping network first:

```bash
bash ./scripts/visualization/train.sh
```
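For a new scenario, a minimal end-to-end sketch is shown below; the use of the `visualization` environment here, and that `test.sh` picks up the newly trained mapping network, are both assumptions:

```bash
# Assumption: visualization scripts run in the visualization env.
conda activate visualization
bash ./scripts/visualization/train.sh   # train the mapping network for the new scenario
bash ./scripts/visualization/test.sh    # then visualize with the trained projector
```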
Our framework builds upon the excellent work of: