Stars
Open Agent Coding CLI, Koding with GLM, Qwen, Kimi, DeepSeek, etc. (welcome to use Kode to submit PRs)
Train transformer language models with reinforcement learning.
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…
Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.
Crawl all your citations from Google Scholar
Ranking Google Scholar search results based on the number of citations
[TMLR] A curated list of language modeling research for code (and other software engineering activities), plus related datasets.
🏡 GitHub Pages template for personal academic homepage
ClinicRealm: Re-evaluating Large Language Models with Conventional Machine Learning for Non-Generative Clinical Prediction Tasks
This is a continuously updated handbook that helps readers track the latest Text-to-SQL techniques in the literature and provides practical guidance for researchers and practitioners.
Chinese version of Stanford's modern information retrieval slides
An official implementation of Pangu-Weather
This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…
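Provides a practical interactive interface for LLMs such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other projects; PDF/LaTeX paper translation and summarization; parallel querying of multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…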
Awesome LLM for NLG Evaluation Papers
An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents
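Use ChatGPT to summarize arXiv papers. Accelerate the whole research workflow: use ChatGPT for full-paper summarization, professional translation, polishing, peer review, and review responses.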
800,000 step-level correctness labels on LLM solutions to MATH problems
[ACL 2023] Reasoning with Language Model Prompting: A Survey
Resource, Evaluation and Detection Papers for ChatGPT
ChatArena (or Chat Arena) is a set of multi-agent language game environments for LLMs. The goal is to develop the communication and collaboration capabilities of AIs.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Reading list on instruction tuning. A trend starting from Natural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).
OpenChat: Advancing Open-source Language Models with Imperfect Data
Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
LLMs built upon Evol Instruct: WizardLM, WizardCoder, WizardMath