Lists (3)
Sort Name ascending (A-Z)
Stars
An Open Standard for lineage metadata collection
Spark RAPIDS plugin - accelerate Apache Spark with GPUs
Apache DataFusion Comet Spark Accelerator
Alibaba Java Diagnostic Tool Arthas/Alibaba Java诊断利器Arthas
FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus Agent Tools, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae…
New file format for storage of large columnar datasets.
A markdown version emoji cheat sheet
Various Docker Compose examples of selfhosted FOSS and proprietary projects.
The open source version of the Amazon Athena documentation. To submit feedback & requests for changes, submit issues in this repository, or make proposed changes & submit a pull request.
DuckDB is an analytical in-process SQL database management system
Move the cursor between multiple displays using a shortcut. (Version 1.2)
Querybook is a Big Data Querying UI, combining collocated table metadata and a simple notebook interface.
A very simple mac-menubar app that reminds you to give your eyes a break every twenty minutes. Based on the 202020-rule: Look at something that's 20 metres away for 20 seconds every 20 minutes to p…
Sampling CPU and HEAP profiler for Java featuring AsyncGetCallTrace + perf_events
The Markdown-based note-taking app that doesn't suck.
Generate HTML & PDF documentation from Github wiki or any other markdown-based wiki.
A composable and fully extensible C++ execution engine library for data management systems.
List of changes announced for AWS that may break existing code
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
The official AWS SDK for Java 1.x (In Maintenance Mode, End-of-Life on 12/31/2025). The AWS SDK for Java 2.x is available here: https://github.com/aws/aws-sdk-java-v2/
A cross-platform (Windows, MAC, Linux) desktop application to view common bigdata binary format like Parquet, ORC, AVRO, etc. Support local file system, HDFS, AWS S3, Azure Blob Storage ,etc.
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (…
The Amazon Athena Query Federation SDK allows you to customize Amazon Athena with your own data sources and code.