+
Skip to content
View jiajiz's full-sized avatar

Block or report jiajiz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A curated list of vibe coding references, collaborating with AI to write code.

1,366 149 Updated Oct 9, 2025

Collection of AWESOME vision-language models for vision tasks

2,957 220 Updated Oct 14, 2025

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 1,671 107 Updated Sep 16, 2025

💯2025年信息系统项目管理师(软考高级)备考资源库。

Rich Text Format 985 243 Updated Oct 14, 2025

Fine-Tuning SigLIP 2 for Single/Multi-Label Image Classification. Image classification vision-language encoder model fine-tuned for Image Classification Tasks

Jupyter Notebook 42 5 Updated Jul 22, 2025

Recaptured Screen Image Demoiréing. (TCSVT 2020)

Python 26 3 Updated Apr 8, 2021
Jupyter Notebook 130 36 Updated Oct 1, 2025

This is the official implementation of TrivialAugment and a mini-library for the application of multiple image augmentation strategies including RandAugment and TrivialAugment.

Python 164 27 Updated Mar 7, 2023

A repository of all code and resources of my published blog articles.

Python 34 10 Updated Sep 20, 2025

Official Repository for VELM, featured in CVPRW 2025 paper: "Detect, Classify, Act: Categorizing Industrial Anomalies with Multi-Modal LLMs"

Python 48 4 Updated Jun 28, 2025

坚持分享 GitHub 上高质量、有趣实用的开源技术教程、开发者工具、编程网站、技术资讯。A list cool, interesting projects of GitHub.

42,474 4,302 Updated Mar 20, 2025

遇事不决,Vibe 力学! One-Person Company AI Tools Series – continuously updated to help boost productivity and empower your solo business!

2,534 215 Updated May 8, 2025

Ai迷思录(应用与安全指南)

1,108 115 Updated Mar 24, 2025

《多模态大模型:新一代人工智能技术范式》作者:刘阳,林倞

HTML 246 24 Updated Dec 5, 2024

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 27,440 3,252 Updated Oct 12, 2025

A Framework of Small-scale Large Multimodal Models

Python 907 95 Updated Apr 26, 2025

Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya

Python 116 11 Updated Aug 7, 2025

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 3,593 298 Updated Aug 6, 2025

Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Tr…

Jupyter Notebook 474 24 Updated Mar 10, 2025
Python 20 3 Updated Oct 13, 2025

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,946 1,069 Updated Nov 18, 2024

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high …

Python 701 222 Updated Sep 3, 2025

从零到一实现了一个多模态大模型,并命名为Reyes(睿视),R:睿,eyes:眼。Reyes的参数量为8B,视觉编码器使用的是InternViT-300M-448px-V2_5,语言模型侧使用的是Qwen2.5-7B-Instruct,Reyes也通过一个两层MLP投影层连接视觉编码器与语言模型。

Python 26 2 Updated Feb 15, 2025

Visual Instruction Tuning for Qwen2 Base Model

Python 38 2 Updated Jun 29, 2024

🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!

Python 4,854 506 Updated Oct 12, 2025
Python 70 6 Updated Jun 20, 2024

大模型相关实践记录

Python 157 19 Updated Apr 6, 2025

训练一个对中文支持更好的LLaVA模型,并开源训练代码和数据。

Python 74 12 Updated Sep 6, 2024

模型 llava-Qwen2-7B-Instruct-Chinese-CLIP 增强中文文字识别能力和表情包内涵识别能力,接近gpt4o、claude-3.5-sonnet的识别水平!

Python 25 4 Updated Jul 23, 2024

中文nlp解决方案(大模型、数据、模型、训练、推理)

Jupyter Notebook 3,668 437 Updated Aug 5, 2025
Next
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载