Unified roadmap β app features, curriculum expansion, DX improvements, and long-horizon ideas.
Stack: Vite 7 Β· React 19 Β· TypeScript 5.9 Β· motion Β· zustand Β· zod Β· react-hot-toast Β· canvas-confetti
Last updated: Mar 2, 2026
Audit update (2026-02-28): Re-validated items marked complete against current codebase and in-app surfacing. Tasks found partially implemented have been moved back to incomplete with notes.
Click to expand completed items
- Toast feedback on progress save, exercise submission, theme toggle
- Confetti on quiz ace, phase unlock, lesson complete, full-curriculum finale
- Zod schemas for lesson frontmatter + build-time validation
- Page transitions (AnimatePresence fade + slide)
- Staggered card entrance, spring-physics progress bar, hover lift
- Parallax hero, scroll-triggered fade-in lessons, animated stats counters
-
layoutIdshared transitions, masonry stagger for concept graph - Zustand progress store + persist middleware
- Quiz attempt tracking, per-question analytics, spaced repetition surfacing
- User preferences store (theme, sidebar, font-size, code lang, density)
- Time-on-page / study streak / weekly chart / total learning time
- Glassmorphism dark-mode cards, animated gradient mesh hero
- Breadcrumb trail, "continue where you left off" banner
- Keyboard shortcut overlay (
?) - XP + badge gamification layer on Progress page
- Vite 7 module preload optimisation, bundle analysis script
- Playwright visual regression + reduced-motion tests
- Zustand + Zod unit tests
- Strict TypeScript
satisfiespatterns, error boundaries with toast fallbacks
These fix the most critical content gaps identified in the curriculum audit.
Validation note (2026-02-27): Cross-checked Agent-P2, Agent-P8, and Agent-P9 overview updates against underlying
Day_*/README.mdlesson scope and sequence. Section coverage is now explicitly evidenced in the overview docs (skills matrix, ROI tables, scenario walkthroughs, exam prompts, cloud-native/capstone bridges), and all listed Phase 2/8/9 overview depth criteria are now satisfied.
- Phase 2 Overview β expanded to Phase 5 depth standard (300+ lines, ROI table, 3-tier skills matrix, 5+ pitfalls, 4+ exam Qs)
- Phase 8 Overview β scenario walkthroughs, 2+ milestone exam questions, expert track, and
extras/folder are now present - Phase 9 Overview β Cloud-native SQL section and curriculum capstone preview are present; >15 KB length is acceptable for this overview
-
Day_37B_Probability_and_Statistics_for_MLβ distributions, Bayes theorem, CLT (prerequisite for Phase 5 Day 54) -
Day_37C_Sklearn_Pipelinesβ Pipeline, ColumnTransformer, custom transformers, CV -
Day_36B_Docker_Fundamentalsβ containers, images, Compose for data apps (Phase 3 bonus) -
Day_60B_LLM_Fine_Tuning_and_PEFTβ LoRA, QLoRA, Hugging Face PEFT library -
Day_60C_RAG_and_Vector_Databasesβ embeddings, ChromaDB, LangChain RAG pipeline -
Day_84B_dbt_Fundamentalsβ models, refs, tests, docs, dbt Cloud -
Day_96B_NoSQL_Deep_Diveβ MongoDB, Redis, Cassandra β when to use each
- Add
extras/to Phase 2 (sample DataFrames, advanced Pandas notebooks) - Add
extras/to Phase 5 (PEFT configs, RAG starters) - Add
extras/to Phase 8 (DDL scripts, sample datasets) - Add
extras/to Phase 9 (capstone data + solution scaffold)
Phase already implemented. Audit & polish pass needed.
- Verify all 12 day files meet the content depth standard (500+ words, 3+ exercises, 5 Q&A)
- Add
quiz.jsonto each Phase 10 day - Phase 10 Overview polish β ensure ROI table and expert track are present
- Add
extras/with LLM starter notebooks and prompt library
-
Day_121_Cloud_Fundamentalsβ AWS/GCP/Azure architecture, IAM, cost management -
Day_122_Object_Storageβ S3, GCS, Delta Lake, Iceberg table formats -
Day_123_Cloud_Data_Warehousesβ BigQuery, Snowflake, Redshift architecture -
Day_124_dbt_at_Scaleβ incremental models, snapshots, advanced patterns -
Day_125_Orchestrationβ Apache Airflow, Prefect, Dagster -
Day_126_Streaming_Pipelinesβ Kafka, Pub/Sub, Kinesis, real-time ETL -
Day_127_Lakehouse_Architectureβ Databricks, Unity Catalog, Delta Live Tables -
Day_128_Data_Contracts_and_Qualityβ Great Expectations, Soda, data SLAs -
Day_129_Cloud_Security_and_Complianceβ VPC, encryption, PII handling -
Day_130_Cost_Engineeringβ query optimisation for $/TB, slot management -
Day_131_Platform_Engineeringβ Terraform for data infrastructure -
Day_132_Capstone_Cloud_Data_Pipelineβ end-to-end cloud pipeline project - Phase 11 Overview (300+ lines) β expanded to 513 lines with day-by-day journey, 3-tier skills matrix, 6 pitfalls, 5 exam questions, scenario walkthroughs, and expert track
-
Day_133_Analytics_Engineer_Roleβ vs Data Analyst, Data Scientist, DE -
Day_134_Semantic_and_Metrics_Layersβ dbt Metrics, Cube.js, LookML -
Day_135_Self_Serve_Analyticsβ empowering stakeholders without SQL -
Day_136_Data_Mesh_Principlesβ domain ownership, data products -
Day_137_Product_Analytics_Deep_Diveβ retention, funnels, cohort analysis -
Day_138_AB_Testing_at_Scaleβ statistical rigor, experimentation platforms -
Day_139_Data_Products_and_Monetizationβ API-first data, embedded analytics -
Day_140_Capstone_Data_Productβ design a data product for a business unit - Phase 12 Overview
-
Day_68_AI_Agents_and_Tool_Useβ LangChain/LlamaIndex agents, function calling, ReAct (Phase 6) -
Day_69_Responsible_AI_in_Practiceβ model cards, Fairlearn, audit reporting (Phase 6) -
Day_84C_Reverse_ETL_and_Semantic_Layerβ Hightouch concepts, operational analytics (Phase 7) -
Day_96C_Streaming_SQL_Fundamentalsβ Kafka concepts, ksqlDB basics, real-time aggregations (Phase 8) -
Day_108C_Cloud_Native_SQLβ BigQuery ML, Snowflake Cortex, Redshift ML -
Day_108B_Curriculum_Capstoneβ ingest β clean β model β visualise β deploy (all 9+ phases)
Each lesson day needs a quiz.json sidecar for the app's quiz engine:
{
"day": 1,
"questions": [
{
"id": "d01q01",
"type": "multiple_choice",
"question": "What is a variable in Python?",
"options": ["A loop", "A labeled container for data", "A function", "A module"],
"answer": 1,
"explanation": "Variables are named storage locations..."
}
]
}- Script:
scripts/generate-quiz-stubs.jsβ scaffoldquiz.jsonfor all days that don't have one - App:
QuizEnginecomponent readsquiz.json, replaces markdown mastery-check section - App: wrong-answer analytics surfaced in the spaced-repetition store
-
Day_01_Introductionβ quiz.json (5 questions: print, arithmetic, exponentiation, print(), type conversion) -
Day_02_Variables_Builtin_Functionsβ quiz.json (5 questions: data types, len(), naming rules, int(), reassignment) -
Day_03_Operatorsβ quiz.json (5 questions: PEMDAS, modulo, !=, +=, logical and) -
Day_04_Stringsβ quiz.json (5 questions: slicing, f-strings, strip/lower, immutability, title()) -
Day_05_Listsβ quiz.json (5 questions: slicing, aliasing, append(), sorted(), max()) -
Day_06_Tuplesβ quiz.json (5 questions: single-element tuple, immutability, unpacking, use cases, sum()) -
Day_07_Setsβ quiz.json (5 questions: uniqueness, intersection, empty set, symmetric difference, hashability) -
Day_08_Dictionariesβ quiz.json (5 questions: get(), items(), comprehension, KeyError, len()) -
Day_09_Conditionalsβ quiz.json (5 questions: and logic, elif, falsy values, ternary, business scenario) -
Day_10_Loopsβ quiz.json (5 questions: range(), break, enumerate(), while vs for, zip()) -
Day_11_Functionsβ quiz.json (5 questions: mutable default, **kwargs, pure functions, composition, lambda) -
Day_11B_Generators_Iteratorsβ quiz.json (5 questions: iterator protocol, memory advantage, yield, yield from, pipeline) -
Day_11C_Debugging_Workflowsβ quiz.json (5 questions: traceback reading, KeyError, debug workflow, breakpoint(), logging) -
Day_12_List_Comprehensionβ quiz.json (5 questions: basic, filter, generator expr, nested, readability)
-
01_python_data_pipeline/β Phase 1β2 skills showcase -
02_web_dashboard/β Phase 3 Flask/Streamlit project -
03_ml_churn_predictor/β Phase 4β5 end-to-end ML model -
04_bi_analytics_suite/β Phase 6β7 Tableau/Power BI + SQL -
05_sql_data_warehouse/β Phase 8β9 full DDL + ETL -
06_llm_data_assistant/β Phase 10 RAG / agent demo
Each case study includes a comprehensive README.md with hand-holding walkthrough (step-by-step guidance with checkpoints), a starter.py scaffold, and a data_generator.py for synthetic data.
- 01 β Retail Customer Churn (Logistic Regression, XGBoost) β Phase 4β5
- 02 β Finance Fraud Detection (Anomaly Detection, GNN) β Phase 5
- 03 β Healthcare Patient Risk (Ensemble, Probabilistic) β Phase 5
- 04 β E-Commerce Recommendations (Collaborative Filtering) β Phase 5
- 05 β Marketing Campaign Attribution (A/B Testing, Causal Inference) β Phase 6
- 06 β Operations Demand Forecasting (Time Series, ARIMA, Prophet) β Phase 5
- 07 β HR Attrition Prediction (Classification, SHAP) β Phase 4β5
- 08 β SaaS Growth Analytics (Cohorts, Product Analytics) β Phase 7
- 09 β Supply Chain Inventory (LP, Simulation) β Phase 4
- 10 β Banking Credit Scoring (Scorecard, Fairness) β Phase 6
- Shareable progress link (base64-encoded state)
- "Challenge a friend" β send quiz links
- Discussion prompts at the end of each lesson
- Service Worker for offline lesson reading
- Cache lesson markdown on first visit
- Offline progress tracking with sync-on-reconnect
- PWA manifest for "Add to Home Screen"
-
scripts/audit-lessons.jsβ verify all days meet depth standard (500+ words, 3 exercises, 5 Q&A) -
scripts/generate-quiz-stubs.jsβ scaffold missingquiz.jsonfiles -
scripts/check-phase-overviews.jsβ flag overviews below 300 lines / 10 KB - CI gate: fail build if any lesson fails the depth audit
- Design "career tracks" routing: Data Scientist / Analytics Engineer / ML Engineer tracks
- App page linking days by specialisation
- "What's Next" sidebar section on each phase overview page
- Skill radar chart on Progress page
- Heatmap calendar of study activity (GitHub-style)
- Animated dependency tree of tech concepts
- 3D concept graph (Three.js / React Three Fiber) β stretch goal
- Increase unit test coverage to 80%+
- Snapshot tests for all major page components
- Accessibility audit (axe-core) in CI
| Skill | Coverage | 2026 Demand | Action |
|---|---|---|---|
| Python fundamentals | β Phase 1β2 | High | β |
| Pandas / NumPy | β Phase 2 | High | β |
| ML fundamentals | β Phase 4β5 | High | β |
| Deep learning | β Phase 5 | High | β |
| MLOps | β Phase 5 (Day 50, 65) | Very High | Minor depth increase |
| LLMs / GPT APIs | β Phase 10 | Critical | Polish pass |
| RAG & Vector DBs | β Phase 10 (Day 112) | Critical | Add Day 60C cross-ref |
| AI Agents | β Phase 10 (Day 115) | Critical | Add Day 67B Phase 6 bridge |
| dbt | High | Implement P1 | |
| Cloud (AWS/GCP/Azure) | Very High | Add Phase 11 | |
| Kafka / Streaming | High | Implement P2 | |
| BI / Tableau | β Phase 6β7 | High | β |
| SQL mastery | β Phase 8β9 | High | β |
| Data governance | β Phase 7β8 | High | β |
| Responsible AI | High | Add Day 67C Phase 6 |
- Multi-user mode: instructor dashboard, cohort progress overview
- AI-graded exercise submissions (code execution sandbox)
- Video lesson stubs β embed Loom / YouTube per day
- Phase 13: Financial Modelling & Quant Finance (Python, Monte Carlo, Black-Scholes)
- Phase 14: Web3 & Decentralised Data (Solidity basics, on-chain analytics)
- Localisation: Spanish & Mandarin translations of lesson summaries
- Mobile app (React Native): offline-first lesson reader with push-notification streaks
This is a living document. Update priorities as phases are completed or the market shifts.