+
Skip to content
View TKONIY's full-sized avatar
🌋
Working on Data x AI
🌋
Working on Data x AI

Organizations

@DBGroup-SUSTech

Block or report TKONIY

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Self-Adapting Language Models

Python 1,315 210 Updated Aug 1, 2025

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 8,040 796 Updated Oct 17, 2025

Triton implementation of Flash Attention2.0

Python 40 5 Updated Jul 31, 2023

Memory-efficient multi layer perceptron implementation in OpenAI Triton.

Python 12 Updated Jan 24, 2025

GPU programming related news and material links

1,741 98 Updated Sep 17, 2025

Material for gpu-mode lectures

Jupyter Notebook 5,175 516 Updated Sep 23, 2025

Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI (Kunlun Inc.), specializing in vision-language reasoning.

Python 2,945 270 Updated Aug 2, 2025

Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first paper to explore R1 for video]

Python 715 38 Updated Sep 19, 2025

SkyReels-V2: Infinite-length Film Generative model

Python 4,744 650 Updated Aug 11, 2025
Python 253 19 Updated Oct 14, 2025

Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"

Python 579 49 Updated Oct 11, 2025

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,058 203 Updated Oct 15, 2025

Ring attention implementation with flash attention

Python 900 87 Updated Sep 10, 2025

ring-attention experiments

Python 154 13 Updated Oct 17, 2024

Pie: Programmable LLM Serving

Python 35 8 Updated Oct 17, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 14,433 2,281 Updated Oct 17, 2025

PyTorch library for cost-effective, fast and easy serving of MoE models.

Python 252 18 Updated Oct 15, 2025

Fast CUDA matrix multiplication from scratch

Cuda 901 129 Updated Sep 2, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 3,597 265 Updated Oct 17, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 3,811 289 Updated Oct 16, 2025

A curated list of Multi-Modal Reinforcement Learning resources (continually updated)

531 20 Updated Sep 12, 2025

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,726 292 Updated Jun 12, 2025

[TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.

Python 1,877 190 Updated Apr 8, 2025

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 7,907 706 Updated May 31, 2024

[ICLR 2025] FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality

Python 250 10 Updated Dec 27, 2024

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,452 2,048 Updated Jul 17, 2025

A collection of awesome video generation studies.

TeX 650 30 Updated Oct 14, 2025

serverless agents

TypeScript 203 53 Updated Apr 13, 2025
C++ 696 118 Updated Sep 25, 2025

Brain-to-text with test time training

Python 23 Updated Oct 7, 2025
Next
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载