jessicaromero-ctrl/Big-Data-Engineering-Applications

Big Data Engineering Applications

Distributed Processing · Hadoop · Spark

Overview

Big Data engineering implementations covering distributed batch processing, stream analytics, and large-scale data pipeline design.

Projects

  • Clickstream Sessionization — Hadoop MapReduce pipeline for web session reconstruction
  • Distributed Aggregations — Large-scale data summarization with Spark
  • ETL Pipelines — Batch ingestion and transformation workflows
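As an illustration of the sessionization idea in the first project, the following is a minimal pure-Python sketch of the map, shuffle, and reduce steps. It is not the repository's implementation: the record layout `(user_id, timestamp, url)`, the function names, and the 30-minute inactivity threshold are all assumptions chosen for the example.

```python
from itertools import groupby
from operator import itemgetter

# Assumed inactivity threshold (30 minutes); not taken from the repository.
SESSION_GAP_SECONDS = 30 * 60


def map_phase(records):
    """Emit (user_id, (timestamp, url)) pairs, as a mapper would."""
    for user_id, timestamp, url in records:
        yield user_id, (timestamp, url)


def shuffle(pairs):
    """Group mapper output by key, standing in for Hadoop's shuffle and sort."""
    ordered = sorted(pairs, key=itemgetter(0))
    for user_id, group in groupby(ordered, key=itemgetter(0)):
        yield user_id, [value for _, value in group]


def reduce_phase(user_id, events, gap=SESSION_GAP_SECONDS):
    """Split one user's events into sessions wherever the time gap exceeds `gap`."""
    sessions = []
    current = []
    last_ts = None
    for ts, url in sorted(events):
        if last_ts is not None and ts - last_ts > gap:
            sessions.append(current)
            current = []
        current.append((ts, url))
        last_ts = ts
    if current:
        sessions.append(current)
    # Emit (user_id, session_index, events) for each reconstructed session.
    return [(user_id, index, session) for index, session in enumerate(sessions)]


def sessionize(records, gap=SESSION_GAP_SECONDS):
    """Run the three phases in-process over an iterable of click records."""
    output = []
    for user_id, events in shuffle(map_phase(records)):
        output.extend(reduce_phase(user_id, events, gap))
    return output
```

Calling `sessionize` on a list of `(user_id, timestamp, url)` tuples, with timestamps in seconds, returns one `(user_id, session_index, events)` tuple per session. In an actual Hadoop job the framework performs the shuffle, and the mapper and reducer run as separate tasks. The in-process `shuffle` here only mimics that grouping.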

Master's in Artificial Intelligence · Big Data · Universidad Politécnica Metropolitana de Hidalgo
