- United States
Lists (1)
Sort Name ascending (A-Z)
Stars
A generative world for general-purpose robotics & embodied AI learning.
Deep reinforcement learning without experience replay, target networks, or batch updates.
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Clean baseline implementation of PPO using an episodic TransformerXL memory
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Vectorization techniques for fast population-based training.
Multiplayer Rating System. No Friction.
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
A simple, performant and scalable Jax LLM!
Ongoing research training transformer models at scale
o1-engineer is a command-line tool designed to assist developers in managing and interacting with their projects efficiently. Leveraging the power of OpenAI's API, this tool provides functionalitie…
A library of reinforcement learning components and agents
Massively parallel rigidbody physics simulation on accelerator hardware.
(Crafter + NetHack) in JAX. ICML 2024 Spotlight.
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Awesome-LLM: a curated list of Large Language Model
A benchmark to evaluate language models on questions I've previously asked them to solve.
IbisML is a library for building scalable ML pipelines using Ibis.
Deep reinforcement learning for UAV in Gazebo simulation environment
Online Goal-Conditioned Reinforcement Learning in JAX. ICLR 2025 Spotlight.
LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.
Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022