
Automatic evals for LLMs

HTML 545 66 Updated Jun 27, 2025

Monte Carlo tree search in JAX

Python 2,541 209 Updated Sep 2, 2025
Python 84 12 Updated Sep 9, 2025

Unified Implementations of Offline Reinforcement Learning Algorithms

Python 112 5 Updated Oct 12, 2025

Train transformer language models with reinforcement learning.

Python 15,872 2,232 Updated Oct 14, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 27,391 2,510 Updated Oct 14, 2025

Hardware-Accelerated Reinforcement Learning Algorithms in pure Jax!

Python 237 19 Updated May 26, 2025

Optimal transport tools implemented with the JAX framework, to solve large scale matching problems of any flavor.

Python 658 116 Updated Oct 9, 2025

Wasserstein Auto-encoded Markov Decision Processes: a framework to distill Reinforcement Learning policies into verifiable controllers with bisimulation guarantees

Jupyter Notebook 4 1 Updated Aug 6, 2025

🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX

Python 843 114 Updated Oct 10, 2025

Multi-Agent Reinforcement Learning with JAX

Python 649 127 Updated Jul 7, 2025

A C++ framework for MDPs and POMDPs with Python bindings

C++ 660 102 Updated Mar 18, 2025

An extremely fast Python linter and code formatter, written in Rust.

Rust 43,056 1,575 Updated Oct 14, 2025

A clean customizable documentation theme for Sphinx

Sass 3,255 365 Updated Oct 13, 2025

🕹️ A diverse suite of scalable reinforcement learning environments in JAX

Python 764 92 Updated Oct 13, 2025

JAX and PZ RL envs + algorithms for swarms of CrazyFlies

Python 81 12 Updated Aug 28, 2024

Benchmarks for Multi-Objective Multi-Agent Decision Making

Python 108 19 Updated Oct 7, 2025

A new markup-based typesetting system that is powerful and easy to learn.

Rust 46,913 1,280 Updated Oct 14, 2025

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 17,121 2,816 Updated Dec 18, 2024

Evolution Strategies in JAX 🦎

Python 668 55 Updated Sep 20, 2025

TLDRs for ML in Drug Discovery papers

71 5 Updated Mar 5, 2023

Distributed Reinforcement Learning over HTTP

Python 5 Updated Jun 29, 2022

Multi-Objective Reinforcement Learning algorithms implementations.

Python 436 83 Updated Sep 2, 2025

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 8,046 868 Updated Jul 8, 2025

The M2L school 2022 tutorials

Jupyter Notebook 36 9 Updated Sep 17, 2022

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 30,271 3,661 Updated Oct 14, 2025

Iterative Linear-Quadratic Games!

C++ 173 50 Updated Jun 18, 2025

The Information Dynamics of Multidimensional Sequences

Jupyter Notebook 2 1 Updated Jul 26, 2024

🎓 Easily create a beautiful academic résumé or educational website using Hugo and GitHub. No code.

TeX 4,612 6,469 Updated Oct 12, 2025