Skip to content
View landerox's full-sized avatar

Block or report landerox

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
landerox/README.md

Cloud, Data & AI

Website: landerox.com | LinkedIn | Hugging Face

I build data and AI systems that are reliable, scalable, and practical to run.

From legacy IBM iSeries/AS400 to modern warehouse/lakehouse stacks, I have spent 15+ years building and improving data platforms. These days my focus is data architecture, data engineering, and production AI across Google Cloud, equivalent stacks on other cloud platforms, and open source solutions.

What I Build

  • Data pipelines: ETL/ELT, event-driven systems, and streaming + batch processing.
  • Modern data platforms: Warehouse/lakehouse architectures that teams can actually operate.
  • Cloud foundations: IaC, CI/CD, and delivery workflows that keep projects moving.
  • Applied AI: RAG pipelines, LLM integrations, and evaluation workflows.
  • Platform cleanup: Technical debt reduction, reliability hardening, and cost optimization.

How I Work

  • Patterns: Event-driven, Medallion, and Lambda/Kappa where they fit.
  • Reliability first: Data contracts, schema evolution, idempotency, deduplication, quality gates, and replayability.
  • Table strategy: Apache Iceberg first, plus BigQuery native and Delta/Hudi interoperability when needed.
  • Engineering standards: Pre-commit, IaC-first, SemVer, and Conventional Commits.

Tech Stack

Cloud Platforms (GCP Focused)

Data & Analytics

Google Cloud BigQuery BigLake Dataflow Dataproc Cloud Storage

AI & ML

Vertex AI Gemini Model Garden Feature Store

Compute & Messaging

Cloud Run GKE Pub/Sub Cloud Functions

Orchestration

Cloud Composer Cloud Workflows

DevOps & Infrastructure (IaC)

Terraform Terragrunt Docker Kubernetes Artifact Registry GitHub Actions GitLab CI Just uv Pre-commit SemVer Conventional Commits

Data Engineering & Orchestration

dbt Apache Airflow Dagster Airbyte Apache Hadoop Apache Spark Apache Kafka RabbitMQ Databricks Pandas Polars Apache Iceberg BigQuery Native Delta Lake Apache Hudi

Production AI & Applied MLOps

LangChain LangGraph Pydantic FastAPI MCP

Languages, Libraries & Formats

Python SQL Rust Scala Go Bash Pytest uv Parquet Avro JSON

Popular repositories Loading

  1. cloud-landerox-infra cloud-landerox-infra Public

    Public GCP Infrastructure as Code Baseline: Modular Terraform for IAM, Storage & Cloud Services

    HCL 1

  2. cloud-landerox-data cloud-landerox-data Public

    Public GCP Data Architecture Baseline: Hybrid Warehouse/Lakehouse with Batch + Streaming

    Python 1

  3. landerox.github.io landerox.github.io Public

    Personal site focused on data platforms, cloud architecture, automation, and production ai solutions.

    CSS 1

  4. landerox landerox Public

    Personal GitHub profile for cloud platforms, data architecture, and production ai solutions.

    1