- 🎓 USTB(Ph.D. degree)
- 📖 S.X.Zhang's scholar
- 🔭 Computer vision(目标检测-OCR)、VLM、AIGC
- ❤ Program language: Python C++
Research interesting in CV, OCR, GCN, MLLM, AIGC
- China Beijin
- https://gxym.github.io/
Pinned Loading
-
Tencent-Hunyuan/HunyuanImage-3.0
Tencent-Hunyuan/HunyuanImage-3.0 PublicHunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation
-
TextBPN-Plus-Plus
TextBPN-Plus-Plus PublicArbitrary Shape Text Detection via Boundary Transformer;The paper at: https://arxiv.org/abs/2205.05320, which has been accepted by IEEE Transactions on Multimedia (T-MM 2023).
-
VCapsBench
VCapsBench PublicVCapsBench: A Large-scale Fine-grained Benchmark for Video Caption Quality Evaluation
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.