pathak-ashutosh/README.md

Ashutosh Pathak

I build production AI systems, LLM-powered pipelines, multi-agent architectures, and the infrastructure required to make them reliable at scale. My focus is on the full engineering stack: from evaluation frameworks and retrieval systems to deployment, observability, and latency-sensitive inference.

I care about systems that actually work: well-defined failure modes, reproducible evaluation, and clean abstractions between components. Most of my recent work sits at the intersection of applied LLM engineering and data infrastructure: RAG systems, agentic workflows, and the tooling needed to iterate on them without breaking production.

I hold Bachelor's and Master's degrees in Computer Science (concentration in Machine Learning). I write about ML systems, evaluation, and engineering decisions at https://thenumbercrunch.com/.

Side projects include HiveHaven — a housing platform for international students in the U.S. — and PolNet, a political network visualization tool for analyzing U.S. congressional caucus data.

Current reading: Build a Large Language Model (From Scratch) by Sebastian Raschka.


Focus areas

LLM Systems, Multi-Agent Architectures, Agentic Workflows
Retrieval-Augmented Generation, Vector Search, Embedding Pipelines
Evaluation Frameworks, Observability, Model Behavior Analysis
Inference Optimization, MLOps, High-Throughput Serving


Toolchain

Python, JavaScript, C/C++, SQL
LangChain, LangGraph, LlamaIndex, Google ADK
PyTorch, Scikit-learn, Hugging Face
Elasticsearch, Neo4j, Postgres, BigQuery
Apache Spark, Databricks, Hadoop (HDFS)
Vertex AI, Vertex AI Agent Builder, Google Cloud Run, GCS
AWS SageMaker, Amazon Bedrock, Azure ML
Docker, Kubernetes, Git, DVC


Contact

LinkedIn: https://www.linkedin.com/in/pathak-ash/
X: https://x.com/pathak_jeee
Email: ashutoshpathak@thenumbercrunch.com
Writing: https://thenumbercrunch.com/

Pinned

  1. econberta (Public)

    Robust Extraction of Named Entities in Economics

    Jupyter Notebook, 2 stars

  2. clinical-risk-prediction (Public)

    Clinical risk prediction using electronic health records (EHRs)

    Jupyter Notebook, 2 stars

  3. spark-movie-recommendation (Public)

    A movie recommendation system built on the MovieLens 25M dataset using Python and Apache Spark

    Python, 4 stars, 1 fork

  4. liver-segmentation (Public)

    Liver segmentation using a U-Net architecture. Built anonymously for a senior as his final-year project during my undergrad.

    Jupyter Notebook, 2 stars

  5. sentiment-analysis-yelp-reviews (Public)

    Sentiment analysis on the Yelp dataset with Apache Spark

    Python, 1 star