Starred repositories
Testcontainers is a Python library that providing a friendly API to run Docker container. It is designed to create runtime environment to use during your automatic tests.
Testcontainers is a Java library that supports JUnit tests, providing lightweight, throwaway instances of common databases, Selenium web browsers, or anything else that can run in a Docker container.
Apache Iggy: Hyper-Efficient Message Streaming at Laser Speed
DuckLake is an integrated data lake and catalog format
An extremely fast Python type checker and language server, written in Rust.
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Terraform Provider for Confluent
Python tool for converting files and office documents to Markdown.
DORA (Dataflow-Oriented Robotic Architecture) is middleware designed to streamline and simplify the creation of AI-based robotic applications. It offers low latency, composable, and distributed dat…
OpenAPI Generator allows generation of API client libraries (SDK generation), server stubs, documentation and configuration automatically given an OpenAPI Spec (v2, v3)
📊 Monitoring examples for Confluent Cloud and Confluent Platform
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
A lightweight data processing framework built on DuckDB and 3FS.
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
Offline, privacy-first grammar checker. Fast, open-source, Rust-powered
Space and Time | Proof of SQL
This is a repo with links to everything you'd ever want to learn about data engineering
This repository contains a Kafka Connect source connector for copying data from IBM MQ into Apache Kafka.
AutoMQ is a diskless Kafka on S3. 10x Cost-Effective. No Cross-AZ Traffic Cost. Autoscale in seconds. Single-digit ms latency. Multi-AZ Availability.
A Go library that exposes executors, interfaces, data structures, and utility functions which combined a universal stream processor, invariant to any specific messaging system.
Streamline Apache Kafka with Conduktor Platform. 🚀
An open source documentation tool to bring discoverability to your event-driven architectures
Bring projects, wikis, and teams together with AI. AppFlowy is the AI collaborative workspace where you achieve more without losing control of your data. The leading open source Notion alternative.