Python 6 Updated Jul 22, 2025

Code release for 'Struct2D: A Perception-Guided Framework for Spatial Reasoning in Large Multimodal Models'

Python 21 Updated Jul 18, 2025
Python 7,909 561 Updated Oct 10, 2025

Official PyTorch implementation of One-Minute Video Generation with Test-Time Training

Python 2,242 190 Updated Jun 5, 2025

[CVPR 2025] WildAvatar: Learning In-the-wild 3D Avatars from the Web

Python 118 4 Updated Mar 11, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,379 2,036 Updated Jul 17, 2025

Scalable and memory-optimized training of diffusion models

Python 1,284 136 Updated Jun 4, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 59,871 10,630 Updated Oct 12, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 18,765 3,119 Updated Oct 12, 2025

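This project aims to share the technical principles behind large models along with hands-on experience (large-model engineering and deploying large-model applications in production).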

HTML 21,257 2,496 Updated Oct 10, 2025

[ICCV 2025, Oral] TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models

Python 766 37 Updated Aug 8, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 50,246 8,795 Updated Sep 30, 2025

[CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents

Python 1,821 144 Updated Oct 4, 2025

The official implementation of RealisDance

Python 602 27 Updated Jun 20, 2025

SkyReels V1: The first and most advanced open-source human-centric video foundation model

Python 2,402 266 Updated Mar 10, 2025

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,944 1,066 Updated Nov 18, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 19,489 1,623 Updated Sep 30, 2025

Official repository for LTX-Video

Python 8,267 742 Updated Jul 21, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,147 1,099 Updated Aug 27, 2025

New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos

8,058 521 Updated Jun 9, 2025

A simple pip-installable Python tool to generate your own HTML citation world map from your Google Scholar ID.

Python 602 55 Updated Aug 13, 2025

FleVRS: Towards Flexible Visual Relationship Segmentation, NeurIPS 2024

Python 21 1 Updated Dec 9, 2024

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 2,076 128 Updated Aug 7, 2025

The best OSS video generation models, created by Genmo

Python 3,450 436 Updated Sep 5, 2025

Agent S: an open agentic framework that uses computers like a human

Python 7,207 801 Updated Oct 5, 2025

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 11,998 1,196 Updated Sep 7, 2025
Python 17 4 Updated Oct 22, 2024

[CVPR 2024] On the Content Bias in Fréchet Video Distance

Python 128 9 Updated Sep 28, 2024