这是indexloc提供的服务,不要输入任何密码
Skip to content
View bosung's full-sized avatar

Block or report bosung

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

PyTorch-native post-training at scale

Python 537 62 Updated Nov 18, 2025

A Practitioner's Guide to M(eow)ti Turn Agentic ReinfOrcement learning

Python 50 7 Updated Nov 17, 2025
Python 9 Updated Nov 18, 2025

Text Adventure Learning Environment Suite - Benchmark to evaluate language models on interactive text environments.

Jupyter Notebook 21 5 Updated Oct 27, 2025

A PyTorch native platform for training generative AI models

Python 4,722 604 Updated Nov 18, 2025

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 1,892 147 Updated Aug 26, 2025

LLM training code for Databricks foundation models

Python 4,355 577 Updated Oct 27, 2025

Supercharge Your Model Training

Python 5,436 457 Updated Nov 12, 2025

Implementation of paper Data Engineering for Scaling Language Models to 128K Context

Python 477 30 Updated Mar 19, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 63,377 11,364 Updated Nov 18, 2025

🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".

Jupyter Notebook 469 38 Updated Jan 19, 2024

Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in OpenAI gym

Python 283 29 Updated Jun 10, 2022

Some preliminary explorations of Mamba's context scaling.

Python 216 10 Updated Feb 8, 2024

🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)

Python 183 10 Updated Apr 9, 2024

Mamba SSM architecture

Python 16,450 1,496 Updated Nov 11, 2025

Structured state space sequence models

Jupyter Notebook 2,773 349 Updated Jul 17, 2024

LLM (Large Language Model) FineTuning

Jupyter Notebook 564 137 Updated Apr 1, 2025

Modeling, training, eval, and inference code for OLMo

Python 6,126 673 Updated Oct 24, 2025

Official Code for the WWW'24 Paper: "Towards Explainable Harmful Meme Detection through Multimodal Debate between Large Language Models"

Python 21 3 Updated Apr 16, 2025

Differentiable ODE solvers with full GPU support and O(1)-memory backpropagation.

Python 6,230 979 Updated Apr 4, 2025

🙈 Code for Zero-shot Triplet Extraction by Template Infilling (Kim et al; IJCNLP-AACL 2023)

Python 19 Updated Feb 17, 2024

PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

Jupyter Notebook 2,035 344 Updated Jul 14, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 31,603 3,838 Updated Jul 23, 2024

An open source implementation of CLIP.

Python 12,971 1,203 Updated Nov 4, 2025

The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.

Python 6,252 665 Updated Aug 17, 2025
Python 3 Updated Mar 7, 2023

Open Source Neural Machine Translation and (Large) Language Models in PyTorch

Python 6,969 2,259 Updated Oct 14, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 152,674 31,165 Updated Nov 18, 2025

https://sites.google.com/view/wngt19/dgt-task

Python 1 Updated Aug 14, 2019
JavaScript 66 25 Updated Jun 29, 2020
Next