anushreedas/README.md

Hi, I'm Anushree 👋

ML Engineer building production systems at the intersection of research and engineering.

I've worked across gaming, VR, aerospace, medical imaging, and cloud infrastructure — and found that machine learning is where all of it comes together. My work focuses on making ML systems that don't just perform well in notebooks, but hold up in production.

Currently at Arkatechture building financial data infrastructure, and pursuing an MS in Data Analytics at McDaniel College. Previously at Amazon Web Services (Computer Vision Data Platform) and Quantil/CDNetworks.


What I'm working on

  • 📄   ML research under review at ACM PODS — automated SQL data type inference for enterprise data pipelines
  • 🔍   Anomaly detection and time-series forecasting for data ingestion workflows
  • ✍️   Writing about production ML and the fundamentals that actually matter → medium.com/@anushreedas.2710

Selected projects

| Project | What it does | Stack |
| --- | --- | --- |
| BERT Confusion Emotion Detection | Fine-tuned BERT on time-series transcribed speech to detect multi-level human confusion with statistical significance testing across model variants | PyTorch, HuggingFace, BERT |
| Handwritten Math Expression Recognizer | Multi-stage pipeline for stroke segmentation, symbol classification, and Symbol Layout Tree generation outputting LaTeX and MathML | SVM, Random Forest, Python |
| CNN Dishwasher-Safe Classifier | End-to-end image classification with transfer learning, class imbalance handling, and per-class evaluation | PyTorch, ResNet, VGG16 |

Technical focus

ML & AI          scikit-learn · PyTorch · TensorFlow · BERT/Transformers
                 Random Forest · ensemble methods · computer vision · NLP
                 feature engineering · causal inference

Data Engineering ETL/ELT pipelines · Snowflake · AWS (S3, Lambda, Glue)
                 PySpark · SQL · data quality monitoring · schema management


Background

MS Data Analytics          McDaniel College                    2025 – 2026
MS Computer Science        Rochester Institute of Technology   2019 – 2021
BS Information Technology  University of Mumbai                2017

AWS Certified Cloud Practitioner (Dec 2024 – Dec 2027)


Writing

I write about what I'm actually building — production ML systems, data pipelines, and why the fundamentals matter before chasing the hype.


Connect

LinkedIn · Medium · Email

Pinned

  1. BERT_LLM_Detect_Confusion_Emotion Public

    Fine-tuning a Bidirectional Encoder Representations from Transformers (BERT) model to detect confusion in human speakers from time-series transcribed speech data

    Jupyter Notebook 1

  2. credit-card-transactions-datapipeline Public

    ETL pipeline and dimensional data warehouse for credit card transaction analysis, built in R with a star schema design, advanced dimensional modeling, and data quality assessment.

    TeX

  3. Handwritten_Math_Expression_Recognition Public

    Python

  4. madelonhulsebos/gittables Public

    Code for extracting, parsing and annotating tables from GitTables (https://gittables.github.io).

    Python 49 3

  5. CNN_Dishwasher-safe_or_Not Public

    Jupyter Notebook

  6. Content_Based_Retrieval_System_Dog_Images Public

    Python