🐭
Poor little mouse me, I'm doomed again
  • Sun Yat-sen University
  • Beijing, China
zsychina/README.md

Hi there 👋

  • Education Experience:

    Undergraduate: DLUT School of Automation@20FALL

    Master: SYSU School of Computer Science@24FALL

  • Research Focus:

    I'm interested in reinforcement learning, agents, and using RL to improve the decision-making ability of LLM agents.

  • News

    I will start an internship as an RL engineer on the AutoGLM team at Zhipu AI this summer!

Looking forward to making friends and cooperating with you!

Pinned

  1. verl-python Public

    Support for `multi-step` Python calls in verl. Format: <python>code</python><output>results</output>

    Python 1

  2. meta-curriculum-llm Public

    Uses a meta scheduler in the RL curriculum training process for LLMs.

    Python 1

  3. Curriculum-LLM Public

    Uses automated curriculum learning to enhance the RL training process for LLMs.

    Python 5
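
The tag format in the verl-python description, <python>code</python><output>results</output>, can be illustrated with a small parser. This is only a sketch of how such a multi-step transcript might be split into (code, output) pairs. The helper name `parse_python_steps`, the regular expression, and the sample transcript are assumptions made for illustration and are not taken from the verl-python repository or from verl.

```python
import re

# Hypothetical pattern for one step: a <python> block followed by its <output> block.
# Assumption: tags are not nested and each code block has exactly one output block.
STEP_PATTERN = re.compile(
    r"<python>(?P<code>.*?)</python>\s*<output>(?P<output>.*?)</output>",
    re.DOTALL,
)


def parse_python_steps(transcript: str) -> list[tuple[str, str]]:
    """Return (code, output) pairs in the order they appear in the transcript."""
    return [
        (match.group("code").strip(), match.group("output").strip())
        for match in STEP_PATTERN.finditer(transcript)
    ]


# Made-up two-step transcript, used only to exercise the parser.
transcript = (
    "First compute the sum. <python>print(1 + 2)</python><output>3</output> "
    "Then square it. <python>print(3 ** 2)</python><output>9</output>"
)

for code, output in parse_python_steps(transcript):
    print(f"code={code!r} output={output!r}")
```

Text outside the tags is ignored here, and a non-greedy match keeps consecutive steps separate. A real rollout pipeline would likely also need to handle a trailing <python> block that has no <output> yet.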
