🐶
wuphf.com

Organizations

@saasbook @SoftwareDefinedBuildings @61c-teach @SWE-bench

john-b-yang/README.md

Hey there 👋

I'm John! Currently a 2nd year CS PhD student at Stanford University.

Check out john-b-yang.github.io for more.

Pinned

  1. SWE-agent/SWE-agent Public

    SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

    Python · 17.6k stars · 1.9k forks

  2. SWE-bench/SWE-bench Public

    SWE-bench: Can Language Models Resolve Real-world Github Issues?

    Python · 3.7k stars · 655 forks

  3. SWE-bench/SWE-smith Public

    [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents

    Python · 429 stars · 60 forks

  4. SWE-bench/experiments Public

    Open-sourced predictions, execution logs, trajectories, and results from model inference and evaluation runs on the SWE-bench task.

    Shell · 218 stars · 258 forks

  5. princeton-nlp/WebShop Public

    [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents

    Python · 411 stars · 81 forks

  6. princeton-nlp/intercode Public

    [NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898

    Python · 227 stars · 48 forks
