Stars
Simulation of manipulation tasks using Galaxea robots
A collection of high-quality models for the MuJoCo physics engine, curated by Google DeepMind.
This is the code for paper: XY-Tokenizer: Mitigating the Semantic-Acoustic Conflict in Low-Bitrate Speech Codecs. Demos, technical insights and experimental results are presented on
This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on
[CoRL 2025] UniSkill: Imitating Human Videos via Cross-Embodiment Skill Representations
Distributed, scalable benchmarking of generalist robot policies.
Eagle: Frontier Vision-Language Models with Data-Centric Strategies
A collection of tabletop tasks in Mujoco
Nvidia GEAR Lab's initiative to solve the robotics data problem using world models
Generative models for conditional audio generation
Official implementation of the paper: Task Reconstruction and Extrapolation for $\pi_0$ using Text Latent (https://arxiv.org/pdf/2505.03500)
Wan: Open and Advanced Large-Scale Video Generative Models
Taccel: Scaling-up Vision-based Tactile Robotics with High-performance GPU Simulation
VR-based Robot Teleoperation and Data Collection System for Unitree G1
Lumina Robotics Talent Call | Lumina社区具身智能招贤榜 | A list for Embodied AI / Robotics Jobs (PhD, RA, intern, full-time, etc
source code for project instinct website
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
A generative world for general-purpose robotics & embodied AI learning.
Implementation of π₀, the robotic foundation model architecture proposed by Physical Intelligence
Official implementation of Diffusion Policy Policy Optimization, arxiv 2024
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
CVT-Occ: Cost Volume Temporal Fusion for 3D Occupancy Prediction