Supplementary materials for A4M33BDT Big data course at @cvut FEL
- Hive practice. Simple ETL pipeline, query, analytics window functions. Practice uses USA average temperature data available for download here.
- Map-Reduce practice. Simple WordCount in Map-Reduce. Inverted index. Uses data IMDB reviews data available for download here.
- Spark practice. WordCount, Inverted index. Data
- Spark SQL. Analytics on Spark. Instructions how to download data can be found here