mmq project

This repo contains results, notebooks, and code for quantizing BLIP2 under various configs. The diagram below outlines the main logic:

Links

To Edit and Run Repo

To create env, run, and score:

conda env create -f environment.yml
python run.py ./configs/1.json
python score.py ./results/1.json
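
The run and score commands above pair a config file with a results file of the same name. As a minimal sketch, assuming that naming convention holds for every config (the example only shows it for 1.json), the hypothetical helper below builds both command lines for a given config path. The function name commands_for and the argument-list format are illustrative and not part of the repo.

```python
from pathlib import PurePosixPath


def commands_for(config_path):
    """Return the (run, score) command lines for one config file.

    Assumption: ./results/<name>.json mirrors ./configs/<name>.json.
    """
    name = PurePosixPath(config_path).name  # e.g. "1.json"
    run_cmd = ["python", "run.py", config_path]
    score_cmd = ["python", "score.py", f"./results/{name}"]
    return run_cmd, score_cmd


if __name__ == "__main__":
    run_cmd, score_cmd = commands_for("./configs/1.json")
    print(" ".join(run_cmd))
    print(" ".join(score_cmd))
```

The returned lists can be passed to subprocess.run as they are, which avoids shell quoting issues when looping over many configs.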

IMPORTANT: The scoring part of this pipeline relies on the pycocoevalcap Python submodule. To clone it along with the repo, run:
git clone --recurse-submodules https://github.com/gautomdas/blip2-coco
If you have already downloaded the repo and the pycocoevalcap folder is still empty, run:
git submodule init && git submodule update
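
Since an uninitialized submodule shows up as an empty folder, a quick check before scoring can save a failed run. The snippet below is a small sketch of such a check. The function name submodule_is_populated is hypothetical, and it only tests that the folder contains something, not that the contents are the expected commit.

```python
from pathlib import Path


def submodule_is_populated(folder="pycocoevalcap"):
    """Return True if the folder exists and contains at least one entry."""
    path = Path(folder)
    return path.is_dir() and any(path.iterdir())


if __name__ == "__main__":
    if not submodule_is_populated():
        print("pycocoevalcap is empty: run git submodule init && git submodule update")
```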

To Recreate the Demo File

Download the COCO dataset to the data folder using the following script (assumes you have the environment loaded): python download_coco.py
From there you should be able to run all of demo.ipynb.
demo.ipynb goes over the 3 main steps in the diagram above.

The main files are as follows:

run.py: The single entry point for quantization and inference. It takes a config at ./configs/<#>.json and runs it.
blip_quantizer.py: The quantization class that quantizes the BLIP2 model.
inference_pipeline.py: The inference class that takes a model and tasks to produce results/<#>.json.
scoring_pipeline.py: The scoring class used to convert results to scores based on task. This is separate from the inferencer/quantizer because it only requires the CPU to run.
quant_functions.py: Functions that are Tensor->Tensor and perform quantization.
utils.py: Additional utils used for config loading and model printing.
multi_sbatch.py: Runs the run.py script over many GPUs and different configs.
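
quant_functions.py is described above as a set of Tensor->Tensor functions. As a rough illustration of what such a function can look like, here is a min-max uniform quantizer written on plain Python lists so that it runs without dependencies. This is a generic textbook scheme, not necessarily one of the functions in the repo. The name quantize_uniform and its parameters are assumptions.

```python
def quantize_uniform(values, n_bits):
    """Snap each float to one of 2**n_bits evenly spaced levels.

    Assumption: min-max (asymmetric) quantization over the whole input.
    The output stays in floating point ("fake quantization").
    """
    if n_bits < 1:
        raise ValueError("n_bits must be at least 1")
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant input has nothing to quantize.
        return list(values)
    steps = 2 ** n_bits - 1          # number of intervals between levels
    scale = (hi - lo) / steps        # width of one interval
    return [lo + round((v - lo) / scale) * scale for v in values]


if __name__ == "__main__":
    weights = [0.0, 0.2, 0.9, 1.0]
    print(quantize_uniform(weights, n_bits=1))
    print(quantize_uniform(weights, n_bits=2))
```

A tensor version would apply the same arithmetic elementwise. Lower bit widths leave fewer distinct output values, which is the effect the configs in this repo vary.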

Notebooks

demo.ipynb: The above figure demonstrated in an ipynb notebook
blip2_analysis.ipynb: Counting linear layers and params for the BLIP2 model
blip2_dropoff_coco.ipynb: A look at drop off between different quantizations over the whole model
dataset_usage.ipynb: A simple file showing how the COCO dataset (and others) are loaded
config_creator.ipynb: Create all combinations of configs based on:

for each bit width:
  for each model part (ViT, LLM, QFormer):
    for each of the 8 combinations of front/middle/end:
      try with the other 2 model parts both quantized, both not quantized, and each of the 2 mixed cases (one quantized, the other not)
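
The nested loops above can be sketched in Python as follows. This is one reading of the pseudocode, not the contents of config_creator.ipynb: it assumes the 8 combinations are the on/off states of front, middle, and end, and that the last line covers the 4 on/off states of the other two model parts. The bit widths and all dictionary keys are hypothetical placeholders.

```python
from itertools import product

BIT_WIDTHS = [8, 6, 4, 2]            # hypothetical values, not from the repo
PARTS = ["ViT", "LLM", "QFormer"]


def enumerate_configs(bit_widths=BIT_WIDTHS, parts=PARTS):
    """Build one dict per combination described in the pseudocode."""
    configs = []
    for bits, part in product(bit_widths, parts):
        others = [p for p in parts if p != part]
        # 2**3 = 8 on/off choices for the front, middle, and end blocks.
        for front, middle, end in product([False, True], repeat=3):
            # 2**2 = 4 on/off choices for the two remaining model parts.
            for other_states in product([False, True], repeat=2):
                configs.append({
                    "bits": bits,
                    "part": part,
                    "blocks": {"front": front, "middle": middle, "end": end},
                    "others_quantized": dict(zip(others, other_states)),
                })
    return configs


if __name__ == "__main__":
    print(len(enumerate_configs()))
```

Under these assumptions the total is (number of bit widths) x 3 x 8 x 4 configs.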

Running TODO

Add vqa2 dataset+test
Migrate datasets to HF
Look at error propagation through layers for quantizing
Add GPTQ and AWQ

Interesting Results

1082.json:

{
  "predictions": [
    {
      "image_id": 397133,
      "caption": "the new xiaomi mi box"
    },
    {
      "image_id": 37777,
      "caption": "a white and black image of a smartphone"
    },
    {
      "image_id": 252219,
      "caption": "a white and blue box with a black and white logo"
    },
    {
      "image_id": 87038,
      "caption": "a white and black table with a white and black table cloth"
    },
    {
      "image_id": 174482,
      "caption": "an image of a white table with a black and white image"
    },
    {
      "image_id": 403385,
      "caption": "an image of a white wall with a black and white image of a speaker"
    },
    {
      "image_id": 6818,
      "caption": "the new apple tv 4k"
    },
    {
      "image_id": 480985,
      "caption": "a white and black image of a computer screen"
    },
    {
      "image_id": 458054,
      "caption": "a white and black square with a white and black square"
    },
    ...
  ]
}
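
The excerpt above stores a list of objects with image_id and caption fields under a predictions key. As a small sketch, assuming the full results files share that layout, the hypothetical helper below counts caption words, which makes repeated phrases such as "white and black" easy to spot. The two sample entries are copied from the excerpt.

```python
import json
from collections import Counter


def caption_word_counts(results):
    """Count how often each lowercase word appears across all captions."""
    counts = Counter()
    for pred in results["predictions"]:
        counts.update(pred["caption"].lower().split())
    return counts


if __name__ == "__main__":
    sample = json.loads("""
    {
      "predictions": [
        {"image_id": 397133, "caption": "the new xiaomi mi box"},
        {"image_id": 37777, "caption": "a white and black image of a smartphone"}
      ]
    }
    """)
    print(caption_word_counts(sample).most_common(3))
```

For a real file, replace the inline sample with json.load on an open results file such as results/1082.json.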
