Hi! Thanks very much for the excellent work!
I am working on a vision-QA task using BLIP-2, which consists of three modules:
a ViT that extracts visual features,
a Q-Former that bridges the gap between the vision and language modalities,
and a T5-XXL that takes the question together with the Q-Former output and generates the answer.
I wonder whether it's possible to employ mm-cot as a utility library on top of the BLIP-2 model to enhance vision-QA inference? A rough sketch of what I have in mind is below.
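For context, here is a minimal sketch of the kind of pipeline I'm imagining. It assumes the HuggingFace `Blip2Processor` / `Blip2ForConditionalGeneration` API and a simple two-stage prompt (rationale first, then answer) inspired by mm-cot's idea; it is not mm-cot's actual interface, and the file name and prompts are placeholders.

```python
# Sketch only: two-stage "rationale then answer" prompting on top of BLIP-2,
# loosely inspired by mm-cot; not mm-cot's actual API.
import torch
from PIL import Image
from transformers import Blip2Processor, Blip2ForConditionalGeneration

model_name = "Salesforce/blip2-flan-t5-xxl"
processor = Blip2Processor.from_pretrained(model_name)
model = Blip2ForConditionalGeneration.from_pretrained(
    model_name, torch_dtype=torch.float16, device_map="auto"
)

image = Image.open("example.jpg")  # hypothetical input image
question = "What is the person in the picture doing?"

# Stage 1: ask the model to produce a rationale about the image.
rationale_prompt = f"Question: {question} Describe the relevant details in the image first."
inputs = processor(images=image, text=rationale_prompt, return_tensors="pt").to(
    model.device, torch.float16
)
rationale = processor.batch_decode(
    model.generate(**inputs, max_new_tokens=64), skip_special_tokens=True
)[0]

# Stage 2: feed the rationale back in and ask for the final short answer.
answer_prompt = f"Question: {question} Rationale: {rationale} Short answer:"
inputs = processor(images=image, text=answer_prompt, return_tensors="pt").to(
    model.device, torch.float16
)
answer = processor.batch_decode(
    model.generate(**inputs, max_new_tokens=16), skip_special_tokens=True
)[0]
print(answer)
```

Would integrating mm-cot's rationale generation into a flow like this be feasible, or would it require retraining / deeper changes to the BLIP-2 modules?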