+

bongmo

Follow

Bong Mo Kim bongmo

Follow

6 followers · 12 following

SK Telecom
@South Korea

Achievements

Achievements

Stars

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 14,074 1,073 Updated Oct 11, 2025

bytedance / UMO

🔥🔥 Official Repo of UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward

Python 147 2 Updated Sep 15, 2025

Tencent-Hunyuan / SRPO

Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference

Python 1,112 35 Updated Oct 11, 2025

Tencent-Hunyuan / HunyuanImage-2.1

HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation

Python 638 45 Updated Sep 29, 2025

kgcrom / cluefin

Your financial investment assistant

Python 104 24 Updated Oct 11, 2025

xdit-project / DistVAE

A parallelism VAE avoids OOM for high resolution image generation

Python 81 10 Updated Aug 4, 2025

xdit-project / xDiT

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 2,318 276 Updated Sep 30, 2025

hao-ai-lab / FastVideo

A unified inference and post-training framework for accelerated video generation.

Python 2,401 178 Updated Oct 12, 2025

XueZeyue / DanceGRPO

An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation

Python 978 47 Updated Oct 4, 2025

CodeGoat24 / Pref-GRPO

Official implementation of Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Python 167 7 Updated Sep 25, 2025

bytedance / USO

🔥🔥 Open-sourced unified customization model

Python 1,143 69 Updated Sep 12, 2025

KohakuBlueleaf / HDM

Home Made Diffusion Models

Python 158 4 Updated Sep 10, 2025

nxnai / Voost

[Official] Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off

317 22 Updated Aug 19, 2025

facebookresearch / dinov3

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 7,639 484 Updated Oct 3, 2025

stepfun-ai / NextStep-1

Python 553 15 Updated Sep 30, 2025

apple / embedding-atlas

Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and search embeddings and metadata.

TypeScript 3,922 191 Updated Oct 11, 2025

yifan123 / flow_grpo

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 1,411 69 Updated Sep 18, 2025

NVIDIA / TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…

Python 2,771 516 Updated Oct 9, 2025

QwenLM / Qwen-Image

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 5,622 302 Updated Sep 30, 2025

Li-Jinsong / DAEDAL

Official repository of "Beyond Fixed: Training-Free Variable-Length Denoising for Diffusion Large Language Models"

Python 136 5 Updated Sep 12, 2025

Wan-Video / Wan2.2

Wan: Open and Advanced Large-Scale Video Generative Models

Python 9,954 1,021 Updated Oct 12, 2025

Kwai-Keye / Keye

Python 676 12 Updated Sep 24, 2025

bytedance / XVerse

[NeurIPS 2025] Official implementation of "XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation".

Python 603 43 Updated Sep 26, 2025

FreedomIntelligence / ShareGPT-4o-Image

Python 265 11 Updated Jul 22, 2025

inclusionAI / Ming

Ming - facilitating advanced multimodal understanding and generation capabilities built upon the Ling LLM.

Jupyter Notebook 477 41 Updated Sep 25, 2025

SKT-AI / A.X-4.0

SKT A.X LLM 4.0

139 6 Updated Jul 10, 2025

bytedance / Dolphin

The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.

Python 7,359 591 Updated Sep 30, 2025

clovaai / donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Python 6,583 541 Updated Jul 11, 2024

MiniMax-AI / MiniMax-M1

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.

Python 2,914 246 Updated Jul 7, 2025

OpenBMB / MiniCPM

MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks

Jupyter Notebook 8,388 519 Updated Oct 8, 2025

点击这是indexloc提供的php浏览器服务，不要输入任何密码和下载