Skip to content
View immanuel-peter's full-sized avatar

Highlights

  • Pro

Block or report immanuel-peter

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
immanuel-peter/README.md

Immanuel Peter

Profile Views

About Me

Computer Science + Physics at the University of Chicago. Incoming Member of Technical Staff Intern at Tensormesh.

Links

Current Work

  • Joining Tensormesh to work alongside the team behind LMCache on inference infrastructure.
  • Interested in software engineering, developer infrastructure, and self-driving neural networks.

Selected Projects

Hostess

Hostess is a deployment platform for multi-service applications built around a declarative hostess.yml. It combines a Go CLI, control plane, and Studio dashboard to deploy full stacks like Next.js, FastAPI, Postgres, and Redis with generated Kubernetes manifests, service discovery, secrets wiring, per-service deploys, and framework-specific operational views.

  • Live: hostess.sh
  • Stack: Go, Kubernetes, Next.js, PostgreSQL, GCP, Docker

Redis Operator

A Kubernetes Redis operator inspired by CloudNativePG's control-plane design. It manages pods and PVCs directly instead of relying on StatefulSets, which lets it enforce fencing-first failover, replica-first rolling updates, pod-level instance management, backup workflows, and deterministic behavior across standalone, sentinel, and cluster modes.

AutoMoE

A modular self-driving research stack built around a Mixture-of-Experts architecture instead of a single end-to-end model. The repo includes data pipelines for BDD100K, nuScenes, and CARLA, specialized perception experts, a context-aware gating network, a trajectory policy head, and released CARLA datasets on Hugging Face. The final integrated simulation stage is paused, but the project captures the full training, evaluation, and research workflow.

Matchbox

An AI-powered research matching platform that connects students with labs using semantic search and LLM-based fit scoring. It replaces fragmented outreach with a centralized pipeline for discovery, parsing, ranking, and review.

Grok Review

A small product that turns a public GitHub pull request URL into a streaming AI code review with line-by-line feedback.

Experience

Tensormesh

Incoming Member of Technical Staff Intern. Tensormesh helps enterprises reduce GPU cost by offloading reusable KV caches during inference.

Quantum Rings

Software Engineer Intern. Delivered 19 PRs and 43 contributions across schema refactors, queue-driven execution processing, telemetry aggregation with AWS SQS + TypeORM, and full-stack analytics dashboards with NestJS, Next.js, and Recharts.

Writing

Pinned Loading

  1. self-driving-model self-driving-model Public

    AutoMoE: a PyTorch Mixture‑of‑Experts self‑driving stack for CARLA with trained perception experts, a gating network, and a trajectory policy, plus datasets and training/inference scripts.

    Jupyter Notebook

  2. localrag localrag Public

    Terminal LLM Interface with Infinite Memory

    Python 2