+
Skip to content
View twni2016's full-sized avatar
:octocat:
Focusing
:octocat:
Focusing

Highlights

  • Pro

Block or report twni2016

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. llm-reasoning-uft llm-reasoning-uft Public

    Code for Offline Learning and Forgetting for Reasoning with Large Language Models

    Python 8

  2. self-predictive-rl self-predictive-rl Public

    Bridging State and History Representations: Understanding Self-Predictive RL -- ICLR 2024

    Jupyter Notebook 20 2

  3. Memory-RL Memory-RL Public

    When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)

    Python 63 5

  4. pomdp-baselines pomdp-baselines Public

    Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022

    Python 323 46

点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载