Skip to content
#

medallion-architecture

Here are 367 public repositories matching this topic...

End-to-end Data Lakehouse project built on Databricks, following the Medallion Architecture (Bronze, Silver, Gold). Covers real-world data engineering and analytics workflows using Spark, PySpark, SQL, Delta Lake, and Unity Catalog. Designed for learning, portfolio building, and job interviews.

  • Updated Jan 19, 2026
  • Jupyter Notebook

A cloud-native data pipeline and visualization project analyzing Formula 1 racing data using Azure, Databricks, Delta Lake, Tableau, and Python for insightful EDA and interactive dashboards.

  • Updated Jul 23, 2025
  • Jupyter Notebook

A production-ready PySpark project template with medallion architecture, Python packaging, unit tests, integration tests, CI/CD automation, Databricks Asset Bundles, and DQX data quality framework.

  • Updated Feb 20, 2026
  • Python

Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from Blizzard’s Hearthstone API. Focused on card statistics and attributes, this project reveals detailed insights into card mechanics, strengths, and trends to support BI and strategic analysis.

  • Updated Jan 23, 2025
  • Python

'Talk to Your Factory' demo leveraging Edge (Azure IoT Operations), Cloud (Microsoft Fabric), and a Factory Agent (Azure OpenAI), to streamline factory operations. It allows real-time, natural language communication with factory systems, helping operators quickly identify issues, boost efficiency, and minimize downtime.

  • Updated Apr 16, 2025
  • Python

Building a modern data warehouse with Microsoft SQL Server, including ETL processes with Bronze Layer, Silver Layer and the Gold Layer, data modeling and as well as analytics.

  • Updated Dec 3, 2025
  • TSQL

This repo provides a step-by-step approach to building a modern data warehouse using PostgreSQL. It covers the ETL (Extract, Transform, Load) process, data modeling, exploratory data analysis (EDA), and advanced data analysis techniques.

  • Updated Mar 7, 2025
  • PLpgSQL

End-to-end data pipeline transforming Olist e-commerce data through Azure cloud services. Implements medallion architecture (Bronze-Silver-Gold) with multi-source ingestion, Spark-based processing, and OLTP-to-OLAP optimization for analytics-ready datasets.

  • Updated Nov 4, 2025
  • Jupyter Notebook

A streaming data pipeline using only AWS that captures live Wikipedia edits from Wikimedia EventStreams, processes them through a medallion architecture, and surfaces insights via dashboards for content monitoring and risk detection with pipeline observability.

  • Updated Feb 3, 2026
  • Python

Improve this page

Add a description, image, and links to the medallion-architecture topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the medallion-architecture topic, visit your repo's landing page and select "manage topics."

Learn more