- Shang hai, China
- https://blog.tommyyang.cn
Highlights
Lists (19)
Sort Name ascending (A-Z)
A2A
agent to agent protocolAI Agents
AgentAI_Infra
algo
big_data
data_analysisbooks
java_core
java核心项目LLM
LLM 相关 projectLLM-Agent-Frame
LLM-Agent-FrameLLM-bench
llm 评测LLM-DeepResearch
LLM-DeepResearchllm_learning
llm学习课程LLM-Paper
LLM PaperLLM-Tools
LLM相关工具mcp
mcp学习MedAI
medaiRAG
ragspring
spring家族Tools
ToolsStars
What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?
A simple yet powerful agent framework that delivers with open-source models
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
🔬 Online Heap Dump, GC Log, Thread Dump & JFR File Analyzer.
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
Universal memory layer for AI Agents; Announcing OpenMemory MCP - local and secure memory management.
A Survey of Reinforcement Learning for Large Reasoning Models
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
Youtu-GraphRAG boosts cost efficiency, inference accuracy, and cross-domain adaptability, pushing the boundaries of performance in complex QA.
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge ba…
Retrieval and Retrieval-augmented LLMs
Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini
LEAKED SYSTEM PROMPTS FOR CHATGPT, GEMINI, GROK, CLAUDE, PERPLEXITY, CURSOR, DEVIN, REPLIT, AND MORE! - AI SYSTEMS TRANSPARENCY FOR ALL! 👐
Translate PDF, EPub, webpage, metadata, annotations, notes to the target language. Support 20+ translate services.
Analysis code for Neurips 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"
R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
[COLM’25] DeepRetrieval — 🔥 The First Search Agent Trained by On-Policy Reinforcement Learning
🔍 Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025]
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
An open source deep research clone. AI Agent that reasons large amounts of web data extracted with Firecrawl
An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…
🤗 smolagents: a barebones library for agents that think in code.
Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)
🛰️ 基于真实医疗对话数据在ChatGLM上进行LoRA、P-Tuning V2、Freeze、RLHF等微调,我们的眼光不止于医疗问答
Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge. 本草(原名:华驼)模型仓库,基于中文医学知识的大语言模型指令微调