Stars
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
Official Repo for Open-Reasoner-Zero
A python module to repair invalid JSON from LLMs
Ola: Pushing the Frontiers of Omni-Modal Language Model
[ECCV 2024] DC-Solver: Improving Predictor-Corrector Diffusion Sampler via Dynamic Compensation
[ICLR 2025] MLLM for On-Demand Spatial-Temporal Understanding at Arbitrary Resolution
hiddenswitch / ComfyUI
Forked from comfyanonymous/ComfyUIA powerful and modular stable diffusion GUI with a graph/nodes interface.
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。
Hurl, run and test HTTP requests with plain text.
Python API client for AUTOMATIC1111/stable-diffusion-webui
Pytorch framework for doing deep learning on point clouds.
Instant voice cloning by MIT and MyShell. Audio foundation model.
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Pointcept: a codebase for point cloud perception research. Latest works: Sonata (CVPR'25 Highlight), PTv3 (CVPR'24 Oral), PPT (CVPR'24), MSC (CVPR'23)
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Code for 3D-LLM: Injecting the 3D World into Large Language Models
Blender Python scripts for rendering images directly from command-line interface
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Painter & SegGPT Series: Vision Foundation Models from BAAI
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)
ChatReviewer: 使用ChatGPT分析论文优缺点,提出改进建议
Official implementation of "MeshDiffusion: Score-based Generative 3D Mesh Modeling" (ICLR 2023 Spotlight)
Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复