Starred repositories
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, xDC replica…
A simple, easy-to-use quantitative finance data package (easy utility for getting financial market data of China)
MinIO is a high-performance, S3 compatible object store, open sourced under GNU AGPLv3 license.
A Cloud Native Batch System (Project under CNCF)
ebpf-go is a pure-Go library to read, modify and load eBPF programs and attach them to various hooks in the Linux kernel.
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Burn is a next generation Deep Learning Framework that doesn't compromise on flexibility, efficiency and portability.
Machine Learning Journal for Intermediate to Advanced Topics.
A lightweight data processing framework built on DuckDB and 3FS.
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
Integrate the DeepSeek API into popular software
Real-time event streaming platform. Streaming CDC, stream processing, low-latency serving, and Iceberg management.
Developer-friendly, embedded retrieval engine for multimodal AI. Search More; Manage Less.
Unified MySQL, Postgres & FlightSQL Server, Powered by DuckDB.
KubeBlocks is a Kubernetes Operator designed to manage a variety of databases and streaming systems, including MySQL, PostgreSQL, MongoDB, Redis, RabbitMQ, RocketMQ, and more, within Kubernetes env…
Master programming by recreating your favorite technologies from scratch.
Distributed Machine Learning Patterns from Manning Publications by Yuan Tang https://bit.ly/2RKv8Zo
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
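cube studio is an open-source, cloud-native, one-stop AI platform for machine learning, deep learning, and large models: full MLOps pipeline, compute leasing platform, online notebook development, drag-and-drop pipeline orchestration, multi-node multi-GPU distributed training, hyperparameter search, inference serving with vGPU virtualization, edge computing, annotation platform with automated labeling, SFT fine-tuning / reward model / reinforcement learning training for large models such as deepseek, multi-node large-model inference with vllm/ollama/mindie, private knowledge base, AI model marketplace…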
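An open-source ticketing system that combines ticket statistics, task hooks, permission management, flexibly configurable workflows and templates, and more; it can also be described as a workflow engine. It aims to reduce cross-department communication overhead, automate task execution, improve work efficiency and quality, and cut unnecessary workload and human error.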
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Easy and fluent Go cron scheduling. This is a fork from https://github.com/jasonlvhit/gocron
MIT 6.828 Operating System Lab https://pdos.csail.mit.edu/6.828/2018/schedule.html