Stars
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
Flink CDC is a streaming data integration tool
Upserts, Deletes And Incremental Processing on Big Data.
JanusGraph: an open-source, distributed graph database
Aims to be the best online Java study notes, with blog explanations and source code examples, covering Java SE and Java Web
Make stream processing easier! Easy-to-use streaming application development framework and operation platform.
Maxwell's daemon, a MySQL-to-JSON Kafka producer
SOFARPC is a high-performance, high-extensibility, production-level Java RPC framework.
Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.
Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
A Java server-side recording and playback solution based on JVM-Sandbox
SOFABolt is a lightweight, easy-to-use, high-performance remoting framework based on Netty.
Confluent Schema Registry for Kafka
An Open Standard for lineage metadata collection
ZooKeeper client wrapper and rich ZooKeeper framework
Extends the real-time SQL of open-source Flink; mainly implements joins between streams and dimension tables, and supports all native Flink SQL syntax
Apache Atlas - Open Metadata Management and Governance capabilities across the Hadoop platform and beyond
Apache BookKeeper - a scalable, fault tolerant and low latency storage service optimized for append-only workloads
The simple, stupid random Java beans/records generator
Apache Fluss is a streaming storage built for real-time analytics.
jsoniter (json-iterator) is a fast and flexible JSON parser available in Java and Go
Janino is a super-small, super-fast Java™ compiler.
SQL-based streaming analytics platform at scale
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.