Skip to content
View pratikshakau's full-sized avatar

Block or report pratikshakau

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
pratikshakau/README.md

Hi 👋, I'm Pratiksha Kaushik

MS in Applied Data Intelligence @ SJSU | Data Engineer | Agentic AI & LLM Systems Builder

🚀 About Me

I design and build production-ready data and AI systems — from scalable ETL pipelines to autonomous LLM-powered applications.

My work focuses on Agentic AI architectures, distributed data engineering, and real-world ML deployment.

🎓 MS Data Analytics/ Applied Data Intelligence — San José State University
⚡ Specializing in AI systems + data platforms
🔬 Interested in intelligent autonomous workflows
💼 Seeking Data Engineering / AI / ML roles


🧠 Core Expertise

✔ Agentic AI architectures (Planner → Executor → Reviewer systems)
✔ LLM application development & orchestration
✔ FastAPI backend design for ML services
✔ End-to-end data pipelines (batch + streaming)
✔ Cloud data warehousing & analytics engineering
✔ Production ML deployment


🧰 Tech Stack

🤖 AI / LLM / Agentic Systems

  • LangChain, LangGraph, Ollama
  • Hugging Face Transformers
  • Retrieval Augmented Generation (RAG)
  • Vector embeddings
  • Prompt engineering
  • Autonomous agent workflows
  • Local & hosted LLM deployment

👩‍💻 Backend & APIs

  • FastAPI
  • REST API design
  • Model serving
  • Async processing
  • Microservices architecture

🗄 Databases & Storage

  • SQL (PostgreSQL, MySQL)
  • Snowflake
  • Vector databases
  • Data warehousing
  • ETL / ELT pipelines

⚙️ Data Engineering

  • Apache Spark
  • Apache Airflow
  • dbt
  • Kafka streaming
  • Distributed systems
  • Data pipeline orchestration

☁️ Cloud & Infrastructure

  • AWS (S3, Glue, Redshift)
  • Docker containerization
  • CI/CD workflows
  • Scalable deployment

📊 Analytics & Visualization

  • Power BI
  • Tableau
  • Feature engineering
  • Statistical modeling

🔥 Featured Projects

🤖 Agentic AI Workflow System

Autonomous multi-agent architecture using LLM reasoning loops:

  • Planner → Task Executor → Validator
  • LangChain + local LLM orchestration
  • Context memory management
  • Structured decision pipelines

🧠 LLM API Backend (FastAPI + Hugging Face)

Production-ready ML inference service:

  • REST API for model prediction
  • Hugging Face transformer integration
  • Request validation & async processing
  • Scalable container deployment

📊 Distributed Data Pipeline Platform

Large-scale data ingestion and transformation system:

  • Airflow orchestration
  • Spark distributed processing
  • Data warehouse integration
  • Automated analytics pipeline

📈 Crypto Analytics Intelligence System

Real-time analytics + forecasting:

  • API data ingestion
  • Feature engineering
  • ML prediction models
  • Automated dashboards

🗃 Full-Stack ML Prediction Service

End-to-end machine learning application:

  • Model training pipeline
  • FastAPI inference backend
  • SQL data storage
  • Frontend integration

🎯 Career Mission

To engineer intelligent, scalable, and autonomous AI systems that integrate data infrastructure, machine learning, and real-time decision making.


⭐ Always open to collaborating on AI, data engineering, and LLM projects.

Popular repositories Loading

  1. Lab-Stock_Analysis Lab-Stock_Analysis Public

    Python lab1 main:check_my_grade and unit test Gradeapp_unittest

    Python 1

  2. Data-Warehouse-LAB1 Data-Warehouse-LAB1 Public

    End-to-end stock price pipeline using Airflow + Snowflake: extract Alpha Vantage daily prices, transform with pandas, and full-refresh load into RAW.STOCK_DATA, scheduled @daily. Includes idempoten…

    Python 1

  3. Data-Warehouse-DBT Data-Warehouse-DBT Public

    # DBT Project – Snowflake Connector (MSDA This repository contains my dbt project developed as part of the Week 10 Data Warehouse & Pipelines lab. The goal of this project is to build and test an E…

    1

  4. Pinecone_Airflow Pinecone_Airflow Public

    This project runs a full Pinecone vector search pipeline using Apache Airflow on Docker. It includes installing sentence-transformers, configuring Pinecone, preprocessing data, creating an index, g…

    Python 1

  5. flask-dev-portfolio flask-dev-portfolio Public

    A Flask-based personal blog and portfolio website with CRUD posts, project showcase, and a contact page. Built with Flask, SQLAlchemy, and Jinja2 templates.

    HTML 1

  6. pratikshakau pratikshakau Public

    1