- Fudan University
- Shanghai, China
- bug-orz.github.io
Stars
🚀 Train a 26M-parameter visual multimodal VLM from scratch in just 1 hour!
"A White-Box Guide to Building Large Models": a fully hand-built Tiny-Universe
A toolkit for knowledge distillation of large language models
[CVPR2025] Official implementation of "Decouple-Then-Merge: Finetune Diffusion Models as Multi-Task Learning"
[ACL2025 main] Official implementation of "LED-Merging: Mitigating Safety-Utility Conflicts in Model Merging with Location-Election-Disjoint"
Generating sets of formulaic alpha (predictive) stock factors via reinforcement learning.
Do Large Language Models Know What They Don’t Know?
[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
MNBVC (Massive Never-ending BT Vast Chinese corpus): an ultra-large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. MNBVC covers not only mainstream culture but also niche subcultures and even "Martian script". It includes plain-text Chinese data of every kind: news, essays, novels, books, magazines, papers, scripts, forum posts, wikis, classical poetry, lyrics, product descriptions, jokes, funny anecdotes, chat logs, and more.
Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]
A comprehensive library for implementing LLMs, including a unified training pipeline and thorough model evaluation.
ChatLM-Chinese-0.2B: a 0.2B-parameter Chinese dialogue model, with fully open-sourced code for the entire pipeline, including dataset sources, data cleaning, tokenizer training, model pretraining, SFT instruction fine-tuning, and RLHF optimization. Supports SFT fine-tuning for downstream tasks, with a triple-based information extraction fine-tuning example.
A repository for pretraining + SFT of a small-parameter Chinese LLaMa2 from scratch; a single 24G GPU is enough to obtain a chat-llama2 with basic Chinese Q&A ability.
SuperCLUE: A Comprehensive Benchmark for General-Purpose Foundation Models in Chinese
A framework for few-shot evaluation of language models.
Build a MiniLLM from 0 to 1 (pretrain + SFT + DPO, work in progress)
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
🚀 An efficient forwarding service designed for LLMs · OpenAI API Reverse Proxy
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
The repository for the paper "Evaluating Open-QA Evaluation"
The repository for the survey paper "Survey on Large Language Models Factuality: Knowledge, Retrieval and Domain-Specificity"
Use ChatGPT to summarize arXiv papers. Accelerate the whole research workflow: full-paper summarization, professional translation, polishing, reviewing, and review responses, all powered by ChatGPT.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Self-study on Larry Wasserman's "All of Statistics"