Skip to content
View lebathien's full-sized avatar
πŸ’­
Always
πŸ’­
Always

Block or report lebathien

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
LeBaThien/README.md

Hi there, I'm Thien (Lee) πŸ‘‹

Data Engineer | AI & Data Systems | Product Engineer

🌐 Portfolio β€’ πŸ’Ό LinkedIn β€’ 🧩 LeetCode


About Me

  • πŸ“ Based in Da Nang, Vietnam
  • πŸ’Ό 5+ years of experience in Data Engineering & AI Engineering
  • πŸ”­ Building data platforms, AI-powered products, and scalable data pipelines
  • πŸ€– Interested in LLMs, Agentic AI, and AI-driven Development
  • πŸ“Š Turning raw data into actionable business insights

Work Experience & Projects

AI Engineer @ Hire-Central (03/2024 - Present)

  • Hire Central - AI Product: CV extraction with LLMs, candidate scoring, semantic search with vector embeddings, AI chatbot, recruitment dashboards

Data Engineer / Fullstack Engineer @ EST-Rouge (03/2021 - Present)

  • ISC Design Platform: Data lakehouse on AWS S3, Airflow pipelines, API services, AI-based summarization for operational insights

Data Engineer @ TMA Solution (Offshore)

  • Working as offshore team, collaborating with DE team at US
  • TDP - Data Platform: Big data platform with Spark, Kafka streaming, Delta Lake, Debezium CDC
  • Healthy Analytic Platform: ETL on Databricks (Scala), Terraform IaC, CI/CD pipelines, SonarQube integration

Tech Stack

Languages

Python Java Scala JavaScript TypeScript SQL HTML CSS

Frameworks & Database

Spring Boot Angular NestJS Express.js FastAPI Node.js Bootstrap Selenium PostgreSQL MySQL pgvector

AI / ML

OpenAI OpenAI Agents LangChain Vector Embeddings

Big Data & Streaming

Apache Spark Kafka Airflow Hadoop Hive Delta Lake Debezium Databricks

BI & Analytics

Power BI Tableau Looker Studio

DevOps & Tools

Docker Kubernetes Terraform Git SonarQube

AWS

AWS S3 SQS Lambda Redshift QuickSight

Azure

Azure Data Factory Data Lake Storage Azure DevOps


Certifications

  • πŸ… Google Advanced Data Analytics Certificate (2025)
  • πŸ… AWS Certified Cloud Practitioner (2024)
  • πŸ… Databricks Generative AI Fundamentals (2024)
  • πŸ… Databricks Fundamentals (2024)
  • πŸ… Microsoft Certified: Azure Data Fundamentals (DP-900) (2022)
  • πŸ… TOEIC 620 - IIG Vietnam (2020)

"Always learning, always growing."

Pinned Loading

  1. data-crawling_jsoup data-crawling_jsoup Public

    Python

  2. data-crawling-java data-crawling-java Public

    HTML

  3. webdriver-auto webdriver-auto Public

    HTML

  4. C0721G2-Org-Sprint-02/C0721G2-livestock-farm-BE C0721G2-Org-Sprint-02/C0721G2-livestock-farm-BE Public

    Java

  5. C0721G2-Org/C0721G2-Repo-BE C0721G2-Org/C0721G2-Repo-BE Public

    Java

  6. C0721G2-Org/C0721G2-Repo-FE C0721G2-Org/C0721G2-Repo-FE Public

    JavaScript