PyTorch implementation of UniFlow

Jupyter Notebook 44 Updated Oct 17, 2025

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,105 23 Updated Oct 15, 2025

Benchmarking Knowledge Transfer in Lifelong Robot Learning

Jupyter Notebook 983 201 Updated Mar 15, 2025

ULMEvalKit: One-Stop Eval ToolKit for Image Generation

Python 43 1 Updated Oct 14, 2025

“FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs the simplest scale design and is compatible with any VAE.

Python 155 4 Updated May 1, 2025

Code release for Ming-UniVision: Joint Image Understanding and Generation with a Continuous Unified Tokenizer

Python 103 4 Updated Oct 14, 2025

[ICCV 2025] LIRA

Python 16 2 Updated Oct 9, 2025

HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation

Python 2,218 91 Updated Oct 14, 2025

WorldVLA: Towards Autoregressive Action World Model

Python 450 19 Updated Oct 10, 2025

Official repo for NeurIPS 2025 poster: Unveiling the Spatial-temporal Effective Receptive Fields of Spiking Neural Networks

Python 3 Updated Sep 24, 2025

[NeurIPS-24] This is the official implementation of the paper "DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs".

Python 61 3 Updated Jun 17, 2024

[NeurIPS 2025] The official implementation of the paper "Safe-Sora: Safe Text-to-Video Generation via Graphical Watermarking"

Python 10 2 Updated Oct 10, 2025

[CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project

Python 176 2 Updated Mar 20, 2025

Sequence Parallelism for Long Training

Python 2 Updated Oct 13, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,729 274 Updated Jul 18, 2025

Unified Long Training Codebase

Python 1 Updated Oct 13, 2025

Large World Model -- Modeling Text and Video with Millions Context

Python 7,355 561 Updated Oct 19, 2024

Fully Open Framework for Democratized Multimodal Training

Python 540 36 Updated Oct 17, 2025

Multi-Level Triton Runner supporting Python, IR, PTX, and cubin.

Python 72 1 Updated Oct 16, 2025

[Fully open] [Encoder-free MLLM] Vision as LoRA

Python 340 28 Updated Jun 12, 2025

OneCAT: Decoder-Only Auto-Regressive Model for Unified Understanding and Generation

Python 222 5 Updated Sep 22, 2025
Python 1,105 140 Updated Sep 25, 2025

Code for our paper "Next Visual Granularity Generation".

Python 39 Updated Oct 7, 2025

[ICCV2025] Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation

Python 174 5 Updated May 21, 2025

Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models

Python 330 26 Updated Feb 23, 2025

A video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.

630 48 Updated Aug 27, 2025
Jupyter Notebook 3,394 321 Updated May 14, 2024

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook 17,021 1,544 Updated Sep 5, 2024
Python 555 15 Updated Sep 30, 2025

[NeurIPS 2025] Native-resolution diffusion Transformer

Python 286 17 Updated Oct 14, 2025