Lists (1)
Sort Name ascending (A-Z)
Starred repositories
A retargetable MLIR-based machine learning compiler and runtime toolkit.
collection of benchmarks to measure basic GPU capabilities
CUDA_C++_Best_Practices_Guide_CN
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
Development repository for the Triton language and compiler
Shared Middle-Layer for Triton Compilation
A high-throughput and memory-efficient inference and serving engine for LLMs
Universal LLM Deployment Engine with ML Compilation
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, s…
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, i…
总结梳理自然语言处理工程师(NLP)需要积累的各方面知识,包括面试题,各种基础知识,工程能力等等,提升核心竞争力
AiLearning:数据分析+机器学习实战+线性代数+PyTorch+NLTK+TF2