Starred repositories
Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 17+ clouds, or on-prem).
An implementation of the Ad hoc On-demand Distance Vector (AODV) routing protocol
AI fundamentals: GPU architecture, CUDA programming, large-model basics, and AI Agent related knowledge
🚀 The fast, Pythonic way to build MCP servers and clients
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Neon: Serverless Postgres. We separated storage and compute to offer autoscaling, code-like database branching, and scale to zero.
Lance Namespace is an open specification on top of the storage-based Lance table and file format to standardize access to a collection of Lance tables
Spark integrations for working with Lance datasets
Distributed query engine providing simple and reliable data processing for any modality and scale
A lightweight data processing framework built on DuckDB and 3FS.
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Apache Fluss is a streaming storage built for real-time analytics.
An integrated and collaborative cloud environment for building and running Spark applications on PKS/Kubernetes
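The original of this article was written by the well-known hacker Eric S. Raymond; it teaches you how to ask technical questions properly and get answers you are satisfied with.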
🔥 Blazing fast bulk data transfers between any cloud 🔥
An Open Standard for lineage metadata collection
Apache OpenDAL Go Binding Services Releases
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
A collection of RBIR projects and posts for anyone interested in joining this journey.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, Du…
🧰 The Rust SQL Toolkit. An async, pure Rust SQL crate featuring compile-time checked queries without a DSL. Supports PostgreSQL, MySQL, and SQLite.