jiajiz

1 follower · 0 following

Stars

filipecalegario / awesome-vibe-coding

A curated list of vibe coding references, collaborating with AI to write code.

1,366 149 Updated Oct 9, 2025

jingyi0000 / VLM_survey

Collection of AWESOME vision-language models for vision tasks

2,957 220 Updated Oct 14, 2025

facebookresearch / perception_models

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 1,671 107 Updated Sep 16, 2025

xiaomabenten / ruankao_itpm

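💯 Exam-preparation resource library for the 2025 Information System Project Manager certification (senior level of China's software qualification exam, "ruankao").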

Rich Text Format 985 243 Updated Oct 14, 2025

PRITHIVSAKTHIUR / FineTuning-SigLIP-2

Fine-Tuning SigLIP 2 for Single/Multi-Label Image Classification. Image classification vision-language encoder model fine-tuned for Image Classification Tasks

Jupyter Notebook 42 5 Updated Jul 22, 2025

tju-maoyan / AMNet

Recaptured Screen Image Demoiréing. (TCSVT 2020)

Python 26 3 Updated Apr 8, 2021

AmadeusITGroup / Moire-Pattern-Detection

Jupyter Notebook 130 36 Updated Oct 1, 2025

automl / trivialaugment

This is the official implementation of TrivialAugment and a mini-library for the application of multiple image augmentation strategies including RandAugment and TrivialAugment.

Python 164 27 Updated Mar 7, 2023

anand-subu / blog_resources

A repository of all code and resources of my published blog articles.

Python 34 10 Updated Sep 20, 2025

Sassanmtr / VELM

Official Repository for VELM, featured in CVPRW 2025 paper: "Detect, Classify, Act: Categorizing Industrial Anomalies with Multi-Modal LLMs"

Python 48 4 Updated Jun 28, 2025

GitHubDaily / GitHubDaily

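Regularly sharing high-quality, interesting, and practical open-source tutorials, developer tools, programming websites, and tech news from GitHub. A list of cool, interesting projects on GitHub.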

42,474 4,302 Updated Mar 20, 2025

cyfyifanchen / one-person-company

When in doubt, Vibe mechanics! One-Person Company AI Tools Series, continuously updated to help boost productivity and empower your solo business!

2,534 215 Updated May 8, 2025

Acmesec / theAIMythbook

The AI Mythbook (a guide to AI applications and security)

1,108 115 Updated Mar 24, 2025

HCPLab-SYSU / Book-of-MLM

"Multimodal Large Models: A New-Generation Paradigm of Artificial Intelligence Technology", by Liu Yang and Lin Liang

HTML 246 24 Updated Dec 5, 2024

jingyaogong / minimind

🚀🚀 [Large models] Train a small 26M-parameter GPT completely from scratch in just 2 hours!

Python 27,440 3,252 Updated Oct 12, 2025

TinyLLaVA / TinyLLaVA_Factory

A Framework of Small-scale Large Multimodal Models

Python 907 95 Updated Apr 26, 2025

nahidalam / maya

Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya

Python 116 11 Updated Aug 7, 2025

NVlabs / VILA

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 3,593 298 Updated Aug 6, 2025

Coobiw / MPP-LLaVA

Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Tr…

Jupyter Notebook 474 24 Updated Mar 10, 2025

chenin-wang / awesome_ai_paper

Python 20 3 Updated Oct 13, 2025

salesforce / LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,946 1,069 Updated Nov 18, 2024

PaddlePaddle / PaddleMIX

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high …

Python 701 222 Updated Sep 3, 2025

yujunhuics / Reyes

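A multimodal large model implemented from scratch and named Reyes (睿视): "R" for 睿 (rui, "insightful") and "eyes" for 眼. Reyes has 8B parameters, uses InternViT-300M-448px-V2_5 as its vision encoder and Qwen2.5-7B-Instruct as its language model, and connects the two through a two-layer MLP projection layer.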

Python 26 2 Updated Feb 15, 2025

TobyYang7 / Llava_Qwen2

Visual Instruction Tuning for Qwen2 Base Model

Python 38 2 Updated Jun 29, 2024

jingyaogong / minimind-v

🚀 [Large models] Train a 26M-parameter visual multimodal VLM from scratch in just 1 hour!

Python 4,854 506 Updated Oct 12, 2025

XiaoduoAILab / XmodelVLM

Python 70 6 Updated Jun 20, 2024

stay-leave / enhance_llm

Notes from hands-on practice with large models

Python 157 19 Updated Apr 6, 2025

AI-Study-Han / Zero-Qwen-VL

Train a LLaVA model with better Chinese support, with the training code and data open-sourced.

Python 74 12 Updated Sep 6, 2024

reilxlx / llava-Qwen2-7B-Instruct-Chinese-CLIP

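The llava-Qwen2-7B-Instruct-Chinese-CLIP model improves Chinese text recognition and the ability to interpret the implied meaning of memes, approaching the recognition level of gpt4o and claude-3.5-sonnet!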

Python 25 4 Updated Jul 23, 2024

yuanzhoulvpi2017 / zero_nlp

Chinese NLP solutions (large models, data, models, training, inference)

Jupyter Notebook 3,668 437 Updated Aug 5, 2025
