- Carnegie Mellon University
- Pittsburgh, PA
- UTC -04:00
- chenbao.tech
- https://orcid.org/0009-0007-0042-0821
- https://scholar.google.com/citations?user=HOngPZAAAAAJ
Stars
[ICLR 2025] LAPA: Latent Action Pretraining from Videos
New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos
Code for "Differentiable Robot Rendering" (CoRL 2024)
A generative world for general-purpose robotics & embodied AI learning.
HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction
Official implementation of "Self-Improving Video Generation"
Estimating Body and Hand Motion in an Ego-sensed World
Repo for counting stars and contributing. Press F to pay respect to glorious developers.
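955: a list of companies that do not work overtime - work 955, work–life balance
Nipaplay: a cross-platform (Windows, Linux, macOS) local danmaku video player. A Mac substitute for 弹弹play. The main platform is macOS, where it is also developed; other platforms are ports only.
An Obsidian plugin to interact with your privacy-focused AI assistant, making your second brain even smarter!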
High-Resolution Image Synthesis with Latent Diffusion Models
A list of Human-Object Interaction Learning.
Recent LLM-based CV and related works. Welcome to comment/contribute!
This repo contains the official implementation of the ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video"
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…
No budget needed: clone yourself using your personal data and achieve cyber ascension! Clone yourself by tuning an LLM using your own data.
[RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
PyTorch code and models for the DINOv2 self-supervised learning method.
This is a list of awesome articles about object detection from video.