-
bytedance
- beijing
- geeker-smallwhite.com
- https://leetcode.cn/geeker-smallwhite/
Starred repositories
Calculates cognitive complexities of functions (and methods) in Go source code. (Golang cognitive complexity)
VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo
Roo Code gives you a whole dev team of AI agents in your code editor.
In-depth tutorials on LLMs, RAGs and real-world AI agent applications.
🌐 The open-source Agentic browser; privacy-first alternative to ChatGPT Atlas, Perplexity Comet, Dia.
《Agentic Design Patterns》中文翻译版
slime is an LLM post-training framework for RL Scaling.
A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architectures
CLI tool for configuring and monitoring Claude Code
Let your Claude able to think
Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Operator.
Lightweight coding agent that runs in your terminal
Open-source, secure environment with real-world tools for enterprise-grade agents.
Official repository for our paper "FullStack Bench: Evaluating LLMs as Full Stack Coders"
A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.
SSHM is a beautiful command-line tool that transforms how you manage and connect to your SSH hosts. Built with Go and featuring an intuitive TUI interface, it makes SSH connection management effort…
A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are commi…
Collection of leaked system prompts
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Code for the paper "Evaluating Large Language Models Trained on Code"
TDD-Bench-Verified is a new benchmark for generating test cases for test-driven development (TDD)
[ICML 2025🔥] ParallelComp: Parallel Long-Context Compressor for Length Extrapolation
SWE-bench: Can Language Models Resolve Real-world Github Issues?