MaxGekk

Organizations

@apache @databricks

Apache Spark Connect Client for Swift

Swift 27 7 Updated Oct 31, 2025

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

Scala 2,271 971 Updated Nov 8, 2025

A tool to get better debug info on Spark's memory usage

Scala 42 15 Updated Aug 21, 2019

Tips for developing Apache Spark, especially in IntelliJ IDEA

3 1 Updated Jan 24, 2020

Command line history manager for bash

C++ 29 3 Updated Mar 11, 2023

All the things about TPC-DS in Apache Spark

Scala 108 43 Updated Jun 15, 2023
Jupyter Notebook 7 2 Updated Aug 23, 2021

Task Metrics Explorer

Scala 14 9 Updated Apr 2, 2019

The open source developer platform to build AI agents and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integrated platform.

Python 22,953 4,998 Updated Nov 15, 2025

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 8,396 1,955 Updated Nov 14, 2025

Code that'll help you kickstart a personal website that showcases your work as a software developer.

HTML 7,575 6,647 Updated Dec 21, 2023

Spark Structured Streaming State Tools

Scala 34 8 Updated Jul 3, 2020

Parsing gigabytes of JSON per second: used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks

C++ 22,733 1,176 Updated Nov 11, 2025

Koalas: pandas API on Apache Spark

Python 3,369 367 Updated Mar 20, 2024

A lightweight library to inject LLVM bitcode into JVMs

C++ 86 7 Updated Dec 9, 2019

Log analyser / visualiser for Java HotSpot JIT compiler. Inspect inlining decisions, hot methods, bytecode, and assembly. View results in the JavaFX user interface.

Java 3,228 455 Updated Nov 11, 2025

Run Spark calculations from Ammonite

Scala 117 17 Updated Nov 3, 2025

Spark SQL index for Parquet tables

Scala 134 35 Updated May 6, 2021

A Scala library for interacting with the Slack API and real-time messaging interface

Scala 189 105 Updated Aug 28, 2024

Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive

Scala 184 34 Updated Oct 15, 2025

Qubole Sparklens tool for performance tuning Apache Spark

Scala 585 143 Updated Jun 26, 2024

This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spark jobs. It focuses on easing the collection and examination…

Scala 795 160 Updated Nov 6, 2025

Schema Registry integration for Apache Spark

Scala 40 18 Updated Nov 16, 2022

Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol

Scala 34 19 Updated Sep 8, 2022

Example project showing how to use Hive UDFs in Apache Spark

Scala 55 23 Updated Apr 23, 2019

Scala library for .netrc files

Scala 2 1 Updated Feb 1, 2018

Simple JDBC client for Apache Spark

Scala 7 1 Updated Dec 16, 2017

Apache Spark - A unified analytics engine for large-scale data processing

Scala 42,299 28,937 Updated Nov 15, 2025

Mirror of Apache Kafka

Java 31,297 14,786 Updated Nov 15, 2025