这是indexloc提供的服务,不要输入任何密码
Skip to content
View bug-orz's full-sized avatar

Block or report bug-orz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!

Python 4,231 436 Updated Apr 27, 2025

《大模型白盒子构建指南》:一个全手搓的Tiny-Universe

Jupyter Notebook 3,390 339 Updated Apr 30, 2025

a toolkit on knowledge distillation for large language models

Python 119 9 Updated Jul 21, 2025

[CVPR2025] Official implementation of "Decouple-Then-Merge: Finetune Diffusion Models as Multi-Task Learning"

Python 6 Updated Jun 6, 2025

[ACL2025 main] Official implementation of "LED-Merging: Mitigating Safety-Utility Conflicts in Model Merging with Location-Election-Disjoint"

Python 11 Updated Jul 1, 2025

Generating sets of formulaic alpha (predictive) stock factors via reinforcement learning.

Python 781 237 Updated Dec 18, 2024
8 1 Updated Oct 21, 2023

Do Large Language Models Know What They Don’t Know?

Python 99 5 Updated Nov 8, 2024

[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning

Python 359 30 Updated Sep 6, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 7,491 731 Updated Jul 24, 2025

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,917 280 Updated Jul 22, 2025

Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]

Python 145 10 Updated Oct 27, 2024

A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.

Python 829 104 Updated Jun 16, 2025

中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。

Python 1,567 178 Updated Apr 20, 2024

用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库;24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.

Python 2,826 340 Updated May 21, 2024
Python 3,879 389 Updated Jul 9, 2025

SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese

3,229 108 Updated Apr 28, 2025

A framework for few-shot evaluation of language models.

Python 9,656 2,572 Updated Jul 24, 2025

从0到1构建一个MiniLLM (pretrain+sft+dpo实践中)

Python 458 56 Updated Mar 23, 2025

An Awesome Collection for LLM Survey

372 36 Updated May 25, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 23,368 1,569 Updated Jul 25, 2025

🚀 大语言模型高效转发服务 · An efficient forwarding service designed for LLMs. · OpenAI API Reverse Proxy

Python 946 309 Updated Mar 15, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 54,935 6,755 Updated Jul 25, 2025

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 18,835 1,557 Updated Jul 23, 2025

The repository for paper <Evaluating Open-QA Evaluation>

Python 25 Updated Apr 9, 2024

The repository for the survey paper <<Survey on Large Language Models Factuality: Knowledge, Retrieval and Domain-Specificity>>

341 29 Updated Apr 25, 2024

Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复

Python 18,989 1,953 Updated Apr 4, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,576 2,644 Updated Jul 3, 2025

Self-study on Larry Wasserman's "All of Statistics"

Jupyter Notebook 1,121 301 Updated Dec 11, 2022
Next