Drop-in Apache Spark replacement written in Rust, unifying batch processing, stream processing, and compute-intensive AI workloads.
-
Updated
May 6, 2026 - Rust
Drop-in Apache Spark replacement written in Rust, unifying batch processing, stream processing, and compute-intensive AI workloads.
Apache Spark Connect Client for Rust
Sparglim✨ makes PySpark App Configurable and Deploy Spark Connect Server Easier!
velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docker Compose.
Extension to {sparklyr} that allows you to interact with Spark & Databricks Connect
A reverse proxy server which allows secure connectivity to a Spark Connect server
Docker Compose environment for big data research and machine learning development
Collection of articles, using the Literate Programming style, about Data Engineering and Software Tooling in general
Interactive metadata visualizer for Apache Iceberg. Trace snapshots, manifests, and file lineages through a real-time, graph-based UI.
Real-time insights on bike availability using the JCDECAUX API. Data flows through Kafka, processed by Spark, and visualized in a Streamlit app. Deployed on Kubernetes.
User-land broadcast-variable workaround for PySpark on Spark Connect / Databricks Serverless
TypeScript client for Apache Spark Connect
Add a description, image, and links to the spark-connect topic page so that developers can more easily learn about it.
To associate your repository with the spark-connect topic, visit your repo's landing page and select "manage topics."