🔭 I am a seasoned Staff AI/Data Engineer who architects resilient, scalable agentic workflows, data pipelines, and data/ML infrastructure on a cloud-native stack. I specialize in bridging the gap between experimental AI and production-grade infrastructure for global startups, designing and implementing highly reliable, low-latency systems that power real-time analytics engines and agentic workflows.
Primarily, I orchestrate autonomous AI agents, multi-agent systems, and advanced RAG pipelines, with a focus on complex prompt engineering and custom Claude Code plugins. I am also experienced in data engineering (both real-time and batch pipelines), LLM orchestration, machine learning pipelines, and MLOps infrastructure.
I also actively contribute to Apache Airflow, Apache Pinot, and other open-source projects, and for the last 2 years to LLM and AI agentic workflow projects as well.
"You have to dream before your dreams come true."
⚡️ Fun fact: I'm a huge fan of FC Barcelona, and I love traveling, hiking, and gaming. Please feel free to connect with me.
> 50+ merged PRs across 16+ organisations and many others over the last 7+ years

I orchestrate autonomous AI agents, multi-agent systems, and advanced RAG pipelines. Building on the latest advances in agentic workflows, I focus on complex prompt engineering and on developing custom Claude Code skills and plugins.
- AI Architect: I design and implement ultra-reliable, low-latency systems that power real-time analytics engines, agentic workflows, and live data ingestion.
- Real-Time Data Pipelines & AWS Infrastructure: I architect and maintain high-scale, cloud-native streaming solutions on Amazon Kinesis and Amazon MSK (Managed Streaming for Apache Kafka) that handle millions of events per second. Using Terraform/CDK for Infrastructure as Code and CloudWatch/OpenSearch for deep observability, I keep real-time ingestion pipelines resilient, schema-consistent, and highly available across complex AWS environments.
- AI Architect & Strategic Technical Leadership: I spearhead architecture and technical design for next-generation products, specifically focusing on agentic workflows and specialized systems for high-frequency data storage and replay. I bridge the gap between non-deterministic AI outputs and the rigid reliability required for financial and analytical data replay systems.
- Performance Optimization & Scalability: I obsessively optimize existing services for maximum throughput and minimal latency. This includes refining data ingestion services, stream processing pipelines, and Big Data warehouses (Snowflake/ClickHouse), alongside tuning container-based microservices (ECS/EKS) to ensure seamless horizontal and vertical scaling under heavy production loads.
- Claude Agentic Orchestration & Skill Development: I design and implement advanced autonomous systems using the Claude Agent SDK, building custom Claude Skills and Claude Plugins to extend LLM capabilities into real-world actions. By architecting multi-step Claude Agentic Workflows, I enable seamless tool-calling and sophisticated reasoning cycles, allowing AI agents to navigate complex, non-deterministic tasks while maintaining strict operational guardrails and enterprise-grade reliability.
- 🧹 Vibe Code Cleanup: As AI drastically accelerates initial code generation, I specialize in transforming fragile, AI-generated "vibe code" into secure, decoupled, and scalable enterprise systems. I audit, refactor, and harden these prototypes so they are robust enough for production and real-world traffic.
I get excited about opportunities where I can leverage big data to discover insights and identify patterns that have real human impact.
I love connecting with new people. Give me a shout at chethanuk@outlook.com!




