Memory-Efficient Video Compressive Sensing Using Cross-Frame Attention Network

This is a project of course Computational Imaging of Westlake University. This project seek to train a end-to-end snapshot compressive imaging (SCI) system to recover RGB videos from raw camera measurements. More specifically, we train a UNet contains our customized cross-frame attention machenism. The network takes a coarse estimation as input and predict the reconstruction of video frames.

Quickstart

Setup environment

Install Pytorch

This project is experimented on Pytorch-2.0, please refer to Pytorch's official webpage for installation.

Install dependency packages

git clone https://github.com/lizhiqi49/cfattn-sci
cd cfattn-sci
pip install -r requirements.txt

Setup dataset

The raw data we used is the same as this repository. When the raw data is downloaded, please use script data/data_gen.py to generate your training data.

Start training

Configure hyper-parameters

File unet_config.json is used to configure the architecture of our model, which is a U-Net. And the training config files of two training stages are under directory configs. You can also configure your own training hyper-parameters under configs/{exp_name}.yaml.

Configure Accelerate

This project uses library Accelerate for mixed-precision and distributed training, before training start, you need configure your accelerate using accelerate config on your shell.

Train!

For training stage 1:

accelerate launch train.py --config configs/sci_stage1.yaml

For training stage 2:

accelerate launch finetune.py --config configs/sci_stage2.yaml

Evaluation

Put your test frames in a directory and name those image files using single number, for example, '5.png'. And test the model using:

python test.py \
--pretrained_unet_path {your_pretrained_model_dir} \
--test_video_dir {your_test_data_dir} \
--use_cross_frame_attn      # remove this flag if do not want to use cf-attn

This command will save the reconstructed frames in an image ./test_result.png. If you want to save video, please add --save_video flag.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Memory-Efficient Video Compressive Sensing Using Cross-Frame Attention Network

Quickstart

Setup environment

Setup dataset

Start training

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
configs		configs
data		data
models		models
README.md		README.md
finetune.py		finetune.py
requirements.txt		requirements.txt
test.py		test.py
train.py		train.py
unet_config.json		unet_config.json

lizhiqi49/cfattn-sci

Folders and files

Latest commit

History

Repository files navigation

Memory-Efficient Video Compressive Sensing Using Cross-Frame Attention Network

Quickstart

Setup environment

Setup dataset

Start training

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages