This is a PyTorch implementation of the paper *Attention to the Burstiness in Visual Prompt Tuning!*

```bibtex
@InProceedings{wang2025attention,
  author    = {Yuzhu Wang and Manni Duan and Shu Kong},
  title     = {Attention to the Burstiness in Visual Prompt Tuning!},
  booktitle = {International Conference on Computer Vision (ICCV)},
  year      = {2025},
}
```

The implementation mainly consists of three parts: image data preprocessing, loading of pre-trained models, and training scripts.

This repository includes some demos to help reproduce the results; see `demo_BPT_eval.ipynb`, `demo_BPT_det.ipynb`, and `demo_BPT_dis.ipynb` for details.

This repository is based on PyTorch==1.10.0 and timm==0.6.0.
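
As a quick sanity check (purely illustrative, not part of the repository), the pinned versions can be verified from Python:

```python
# Illustrative check that the pinned dependency versions are installed.
import torch
import timm

print("torch:", torch.__version__)  # expected: 1.10.0
print("timm:", timm.__version__)    # expected: 0.6.0
```
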
## Dataset

See Table 6 in the appendix for dataset details.

Fine-Grained Visual Classification tasks (FGVC): the datasets can be downloaded by following the official links.

The folder `./Dataset` implements image loading and preprocessing.
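
The exact preprocessing lives in `./Dataset`; the snippet below is only a rough sketch of what an FGVC loader of this kind typically looks like. The data path, the 224x224 crop, and the ImageNet normalization are illustrative assumptions, not the repository's actual settings.

```python
# Minimal sketch of an FGVC image loader; the real logic is in ./Dataset.
# The data path and augmentation choices here are illustrative assumptions.
import torch
from torchvision import datasets, transforms

train_tf = transforms.Compose([
    transforms.RandomResizedCrop(224),
    transforms.RandomHorizontalFlip(),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406],
                         std=[0.229, 0.224, 0.225]),
])

# Assumes an ImageFolder-style layout, e.g. data/CUB200/train/<class>/<image>.jpg
train_set = datasets.ImageFolder("data/CUB200/train", transform=train_tf)
train_loader = torch.utils.data.DataLoader(
    train_set, batch_size=32, shuffle=True, num_workers=4)
```
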
## Pre-trained Models

Take MAE pre-training as an example.

### BPT-fwhiten or BPT-twhiten

- These two methods are implemented in `./Models/MAE_bpt_shallow.py` and `./Models/MAE_bpt_deep.py`.
- For BPT-fwhiten or BPT-twhiten, we need to set `whitening=True` and prepare the pre-trained weights of `Wq`, `Wk` and the patch embeddings `X`. See Lines 33-57 of `./Models/MAE_bpt_shallow.py` for details, and the sketch after this list for a rough illustration.
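
As a rough illustration of what "prepare `Wq`, `Wk` and `X`" involves, the sketch below computes an inverse-square-root (whitening) matrix from the covariance of key features. It is a minimal, assumption-laden stand-in for the repository's actual procedure in `./Models/MAE_bpt_shallow.py`; the tensors `patch_embeds` and `w_k` are random placeholders.

```python
# Hedged sketch: whiten key features K = X @ Wk^T via an eigendecomposition of
# their covariance. The repo's real code is in ./Models/MAE_bpt_shallow.py
# (Lines 33-57); patch_embeds and w_k below are random placeholders.
import torch

def whitening_matrix(feats: torch.Tensor, eps: float = 1e-5) -> torch.Tensor:
    """Return Sigma^{-1/2} of the feature covariance."""
    feats = feats - feats.mean(dim=0, keepdim=True)
    cov = feats.T @ feats / (feats.shape[0] - 1)   # D x D covariance
    eigvals, eigvecs = torch.linalg.eigh(cov)      # symmetric eigendecomposition
    return eigvecs @ torch.diag((eigvals + eps).rsqrt()) @ eigvecs.T

patch_embeds = torch.randn(1000, 768)              # placeholder patch embeddings X
w_k = torch.randn(768, 768)                        # placeholder pre-trained Wk
W_white = whitening_matrix(patch_embeds @ w_k.T)   # folded into the key/prompt path
```
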
### BPT-bilinear

For BPT-bilinear, we set `whitening=False`.

```sh
# for instance, prompt tuning on the CUB-200 dataset via the shallow variant.
torchrun --nproc_per_node=4 \
    train_MAE.py \
    --model_name MAE_bpt_vit_b \
    --finetune ${MAE_Pretain_ckpt} \
    --drop_path 0.0 \
    --dataset CUB200 \
    --tuning_type "prompt" \
    --num_prompts 100 \
    --channels 75 \
    --epochs 100 \
    --batch_size 32 \
    --weight_decay 5e-3 \
    --wd_head 0.5 \
    --lr 5e-2 \
    --min_lr 1e-8 \
    --warmup_epochs 10 \
    --model_ema \
    --save_dir ${SAVE_PATH}
```

- `num_prompts` is 100 for the shallow variant, and 50 for the deep variant.
- For the deep variant, set `model_name` to `MAE_bpt_deep_vit_b` and turn on `--prompt_deep`; the sketch after this list illustrates what these options control.
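
For context on `num_prompts` and `--prompt_deep`, here is a minimal sketch of shallow vs. deep prompt insertion in the usual VPT style. The class and attribute names are placeholders rather than this repository's modules, and the BPT-specific whitening/bilinear steps are omitted.

```python
# Hedged sketch of shallow vs. deep prompt insertion (VPT convention).
# Names are placeholders; BPT's whitening / bilinear machinery is omitted.
import torch
import torch.nn as nn

class PromptedEncoder(nn.Module):
    def __init__(self, blocks, embed_dim=768, num_prompts=100, deep=False):
        super().__init__()
        self.blocks = nn.ModuleList(blocks)
        self.deep = deep
        # shallow: one prompt set; deep: a separate prompt set per block
        n_sets = len(blocks) if deep else 1
        self.prompts = nn.Parameter(0.02 * torch.randn(n_sets, num_prompts, embed_dim))

    def forward(self, x):                    # x: (B, 1 + num_patches, embed_dim)
        B, n_p = x.shape[0], self.prompts.shape[1]
        x = torch.cat([self.prompts[0].expand(B, -1, -1), x], dim=1)
        for i, blk in enumerate(self.blocks):
            if self.deep and i > 0:
                # swap the previous block's prompt outputs for fresh prompts
                x = torch.cat([self.prompts[i].expand(B, -1, -1), x[:, n_p:]], dim=1)
            x = blk(x)
        return x[:, n_p:]                    # drop prompt tokens before the head
```
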
The training recipes for other tasks are similar; please refer to `./script/MAE`.

### MoCo pre-training

```sh
# for instance, deep variant on dogs-120.
torchrun --nproc_per_node=4 \
    train_MoCo.py \
    --model_name bpt_deep_vit_b \
    --drop_path 0.0 \
    --dataset DOG120 \
    --tuning_type "prompt" \
    --num_prompts 50 \
    --channels 50 \
    --prompt_deep \
    --epochs 100 \
    --batch_size 32 \
    --weight_decay 2e-3 \
    --wd_head 0.1 \
    --lr 5e-2 \
    --min_lr 1e-8 \
    --warmup_epochs 10 \
    --model_ema \
    --save_dir ${SAVE_PATH}
```

The training script is similar to that of MAE pre-training.

Please refer to `./script/MoCo` and `./script/sup` for other task recipes.

We provide some test-accuracy curves for reference (BPT-bilinear-shallow with an MAE pre-trained ViT-B backbone):
- CUB-200
- NABirds-555
- CAR-196
- COCO, Cascade Mask R-CNN




