Skip to content
View sushildalavi's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report sushildalavi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
sushildalavi/README.md
header

Typing SVG

profile views   followers


About Me

coding gif

I'm Sushil Dalavi, an AI Engineer at the USC Annenberg Norman Lear Center and an MS in Computer Science candidate at USC (2024 – 2026).

I architect production AI systems — AWS data platforms, hybrid retrieval pipelines, distributed LLM workflows, and multi-modal ML — with an emphasis on measurable outcomes, reliability, and reproducibility.


💼 Open to SDE / SWE / AI·ML Engineer / Applied AI roles
🏗️ AWS data platforms, distributed workflows, LLM inference gateways
🧠 Hybrid retrieval, reranking, MLOps, multi-modal alignment
📚 Building JobSense, ScribeAI, and ScholarRAG
🌍 Motivated by real-world product impact
Proud Real Madrid supporter
🍥 Huge anime geek


🎓 Education


USC

University of Southern California
MS in Computer Science
📍 Los Angeles, CA  |  📅 Aug 2024 – May 2026




University of Mumbai

University of Mumbai
BE in Computer Engineering
📍 Mumbai, India  |  📅 Jun 2019 – May 2023




💼 Work Experience


USC Annenberg Norman Lear Center

USC Annenberg Norman Lear Center
AI Engineer
📍 Los Angeles, CA  |  📅 Jun 2025 – Present




Reliance Jio

Reliance Jio Platforms
Software Engineer
📍 Navi Mumbai, India  |  📅 Dec 2023 – Jul 2024



📌 Highlights from USC Annenberg Norman Lear Center
  • Architected an AWS data platform (S3, Glue, SageMaker, Bedrock) ingesting, deduplicating, and normalizing 1M+ multi-region records for downstream ML training and retrieval workloads.
  • Shipped a multi-modal alignment system fusing audio, speaker diarization, and caption streams — reaching 99.3% F1 and 99.9% coverage on ground-truth evaluation.
  • Developed large-scale batch pipelines processing long-form video and audio through Whisper ASR, pyannote diarization, and model-based refinement stages.
  • Automated dataset QA, Unicode normalization, and deduplication in Python — lifting analysis-ready yield from 10,819 → 9,735 records with full reproducibility.
📌 Highlights from Reliance Jio Platforms
  • Trained and deployed ResNet-50 and DenseNet-121 deep vision networks for medical image anomaly detection — improving recall by 35% via transfer learning, augmentation, and loss tuning.
  • Optimized quantized transformer inference (BERT, GPT-2) on GPU with batched serving — cutting p95 latency by 30% while preserving accuracy gains.
  • Engineered demand-forecasting microservices (TFT, CatBoost, LSTM) over Hive SQL batch pipelines, reducing forecast MAPE by 25% for business-critical workloads.
  • Rolled out shadow-testing and canary-release workflows for 3 production ML upgrades, catching 2 latency regressions before fleet-wide deployment.


🛠️ Tech Stack

Languages

 

ML & Deep Learning

 

LLMs & Retrieval

Backend & Data Systems

 

Cloud & DevOps

 

AI-Assisted Development


🚀 Featured Projects

Durable distributed workflow platform — a fault-tolerant orchestration system on Temporal with 12 tool integrations, human-in-the-loop checkpoints, and a provider-agnostic inference gateway.

Highlights

  • Temporal-based orchestration with automated retries & end-to-end observability
  • Provider-agnostic inference gateway with multi-backend failover & Redis semantic caching
  • CI regression gates blocking merges on quality or cost drift
  • Hybrid retrieval (BM25 + dense + cross-encoder) fused with Reciprocal Rank Fusion

Stack

✍️ ScribeAI

Inference service with evaluation pipeline — async FastAPI service with SSE streaming, multi-backend routing (GPT-4o, Claude, fallback), and an MLflow-tracked evaluation harness.

Highlights

  • Graceful degradation under upstream failure across multiple LLM backends
  • MLflow-tracked evaluation: ROUGE, BLEU, BERTScore, faithfulness, leakage checks
  • Compliance-aware pipeline: 10+ PII types redacted, pgcrypto storage, append-only audit log
  • Automated regression alerts on metric drift across versioned releases

Stack

Retrieval and data engineering system — a hybrid retrieval pipeline for scholarly discovery with citation-aware grounding.

Highlights

  • Dense + BM25 + RRF + MiniLM rerank lifting MRR by 21.8% and nDCG@10 by 18.0% over a 120+ query eval harness
  • Duplicate indexing reduced by 50%, re-ingestion time by 60% via DOI/ID/title normalization + SHA-256 content hashing
  • Answer grounding lifted from 0.505 → 0.616 faithfulness; claim support 45.4% → 85.6%
  • Evidence-constrained generation with citation-aware prompting across heterogeneous scholarly sources

Stack

🏥 MedSOAP

Clinical documentation automation — generates structured SOAP notes from doctor-patient conversations.

Highlights

  • LLM-driven SOAP note generation with medical entity recognition
  • HIPAA-conscious architecture with audit trails
  • Fine-tuning and evaluation pipeline for clinical summarization
  • Explores healthcare-focused product design patterns

Stack


📊 GitHub Analytics

GitHub Stats    Top Languages

GitHub Streak

Activity Graph


🏆 Trophies

GitHub Trophies


💬 Quote I Live By




— Aristotle


🎯 Beyond the Code

🎬Love webseries and serious binge watching 🏊Swimming keeps me grounded
🏓Enjoy table tennis Lifelong football fan
🍥Huge anime geek 🎧Music always around

⚽ Hala Madrid

 

A proud Real Madrid supporter — I love the mentality, the standards, the legacy, and the winning culture.


🎧 Spotify


🔍 What I'm Looking For

I'm especially interested in opportunities where strong software engineering meets AI/ML, backend systems, and data-driven product building.


🤝 Let's Connect

built with love   powered by coffee

footer

Pinned Loading

  1. nanoserve nanoserve Public

    Benchmark-driven LLM serving engine built from scratch on Apple Silicon. OpenAI-compatible streaming API with continuous batching, paged KV-cache, prefix caching, and Prometheus/Grafana observabili…

    Python

  2. SOAPFlow-Clinical-Transcript-to-SOAP-Note-Platform SOAPFlow-Clinical-Transcript-to-SOAP-Note-Platform Public

    AI clinical scribe that turns doctor-patient conversation transcripts into structured SOAP notes. FastAPI + React 19 with six generation backends (OpenAI, Anthropic, Groq, Ollama, local MLX LoRA, r…

    Python

  3. citelens citelens Public

    Trust-calibrated scholarly QA system with sentence-level citation grounding and per-citation confidence scoring.

    Python

  4. QueryLens-PostgreSQL-Query-Performance-Monitor QueryLens-PostgreSQL-Query-Performance-Monitor Public

    Monitors PostgreSQL query performance via pg_stat_statements: fingerprints queries, snapshots EXPLAIN plans, detects regressions with deterministic rules, and surfaces slow queries in a React dashb…

    Python

  5. ReplayForge-Async-Workflow-Replay-Failure-Debugging-Platform ReplayForge-Async-Workflow-Replay-Failure-Debugging-Platform Public

    Production-grade async workflow replay & failure-debugging platform. Redis Streams consumer groups, exponential backoff retries, dead-letter queue, and a live React dashboard for inspecting timelin…

    Python

  6. SchemaPilot-API-Contract-Drift-Monitor SchemaPilot-API-Contract-Drift-Monitor Public

    Monitors live JSON APIs, infers schemas from observed responses, and detects breaking contract drift using deterministic, unit-tested rules. FastAPI + React + Postgres, with optional LLM-generated …

    TypeScript