Highlights
- Pro
Stars
CVPR2023-Occupancy-Prediction-Challenge
VIP cheatsheet for Stanford's CME 295 Transformers and Large Language Models
Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini
Simulator-conditioned Driving Scene Generation
Curated list of awesome Cursor Rules .mdc files
Lets make video diffusion practical!
🔬 Visualize attention layers from Stable Diffusion
[CVPR-W] SAGA: Semantic Aware Gray color Augmentation for Visible-to-Thermal Domain Adaptation across Multi-View Drone and Ground-Based Vision Systems
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
Wan: Open and Advanced Large-Scale Video Generative Models
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
[ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer
[ICCV 2025] CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation
A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!
Official implementation for the AAAI2025 paper "PIXELS - Progressive Image Xemplar-based Editing with Latent Surgery"
[ICLR 2025 spotlight] 3DIS: Depth-Driven Decoupled Instance Synthesis for Text-to-Image Generation
4D Radar Object Detection for Autonomous Driving in Various Weather Conditions
[ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
TensorZero is an open-source stack for industrial-grade LLM applications. It unifies an LLM gateway, observability, optimization, evaluation, and experimentation.
Official code for "Style Aligned Image Generation via Shared Attention"
[CVPR 2023] OneFormer: One Transformer to Rule Universal Image Segmentation
The best OSS video generation models, created by Genmo
ControlLoRA Version 2: A Lightweight Neural Network To Control Stable Diffusion Spatial Information Version 2
Open source simulator for autonomous vehicles built on Unreal Engine / Unity, from Microsoft AI & Research
High-resolution models for human tasks.
set prompt to divided region