# Loomis Lens

Loomis Lens is an interactive anatomical tool designed to help artists simplify the breakdown of the human head. My vision was to create a system that integrates real-time facial landmark recognition with head pose estimation to generate a proportionally accurate overlay. Following the Loomis Method, the application draws a 3D guide over the subject's face, helping artists visualize anatomical planes and perspective through either a live webcam feed or an uploaded image.
- How it Works
- Model Performance
- Dataset and Preprocessing
- Model Architecture
- Training Details
- 3D Asset Generation
- Deployment
## How it Works

The application functions as a digital assistant for anatomical study through a three-stage pipeline:
- Extraction: Extracts 468 3D facial landmarks from a camera feed or uploaded image using MediaPipe.
- Estimation: A deep neural network predicts rotational vectors (Sine and Cosine values for Yaw, Pitch, and Roll) to ensure mathematical continuity.
- Projection: A custom-modeled Loomis mesh is projected onto the user's face using a pinhole camera model and perspective mapping.
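The projection step can be sketched with a basic pinhole camera model. This is a minimal NumPy illustration, not the project's actual code; the function name, focal length, and image size below are assumptions.

```python
import numpy as np

def project_points(points_3d, focal_length, image_size):
    """Project 3D points (N, 3) onto the image plane with a pinhole camera model.

    Assumes the camera sits at the origin looking down +Z, the principal point
    is at the image centre, and focal_length is expressed in pixels.
    """
    cx, cy = image_size[0] / 2.0, image_size[1] / 2.0
    x, y, z = points_3d[:, 0], points_3d[:, 1], points_3d[:, 2]
    # Perspective divide: farther points land closer to the principal point.
    u = focal_length * x / z + cx
    v = focal_length * y / z + cy
    return np.stack([u, v], axis=1)

# Two points at the same (x, y) but different depths:
pts = np.array([[0.1, 0.0, 1.0], [0.1, 0.0, 2.0]])
uv = project_points(pts, focal_length=500.0, image_size=(640, 480))
```

Doubling the depth halves the offset from the image centre, which is what gives the overlay its perspective foreshortening.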
## Model Performance

The model was evaluated on the AFLW2000-3D benchmark set and significantly outperforms general-purpose landmark estimators:
| Metric | Performance | Context |
|---|---|---|
| Mean Absolute Error (MAE) | 4.04° | Outperforms standard landmark-based estimators. |
| Robust Accuracy (<10°) | 93.36% | Ensures a stable, "jitter-free" overlay for drawing. |
| Strict Accuracy (<5°) | 58.01% | Demonstrates high-precision tracking for detailed anatomical study. |
## Dataset and Preprocessing

- Source Data: Trained on the 300W-LP and AFLW-3D datasets.
- Augmentation: Implements real-time mirror augmentation, inverting Yaw and Roll angles to improve model generalization.
- Normalization: Landmarks are centered at the nose tip and scaled by interocular distance to achieve scale invariance.
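The normalization and mirror augmentation steps above can be sketched as follows. The landmark indices are assumptions based on MediaPipe Face Mesh conventions and may differ from the project's actual choices.

```python
import numpy as np

# Assumed landmark indices (MediaPipe Face Mesh convention; verify against the model):
NOSE_TIP = 1
LEFT_EYE, RIGHT_EYE = 33, 263

def normalize_landmarks(landmarks):
    """Center (N, 3) landmarks at the nose tip and scale by interocular distance."""
    centered = landmarks - landmarks[NOSE_TIP]
    interocular = np.linalg.norm(landmarks[LEFT_EYE] - landmarks[RIGHT_EYE])
    return centered / interocular

def mirror_augment(landmarks, yaw, pitch, roll):
    """Mirror landmarks about the vertical axis; Yaw and Roll invert, Pitch does not."""
    mirrored = landmarks.copy()
    mirrored[:, 0] *= -1.0
    return mirrored, -yaw, pitch, -roll
```

After normalization the interocular distance is exactly 1 and the nose tip sits at the origin, so the pose model sees the same input regardless of face size or position in frame.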
## Model Architecture

The final model is a custom Deep Neural Network implemented in TensorFlow with a focus on geometric precision:
- Input Processing: Uses Gaussian Noise (0.005) to improve robustness against shaky webcam feeds.
- Core Blocks: Features three Residual Blocks that utilize skip connections to maintain gradient stability during deep training.
- Attention Mechanism: Integrated Squeeze-and-Excitation (SE) blocks allow the model to dynamically re-weight facial landmark features.
- Activation: Uses the Swish activation function for smoother mathematical curves compared to standard ReLU.
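A minimal Keras sketch of an architecture along these lines, assuming the flattened landmark vector as input. Layer widths, the SE reduction ratio, and the exact placement of the SE gates are illustrative guesses, not the project's actual configuration.

```python
import tensorflow as tf
from tensorflow.keras import layers

def se_block(x, reduction=4):
    """Squeeze-and-Excitation: learn a sigmoid gate that re-weights features."""
    units = x.shape[-1]
    s = layers.Dense(units // reduction, activation="swish")(x)
    s = layers.Dense(units, activation="sigmoid")(s)
    return layers.Multiply()([x, s])

def residual_block(x, units):
    """Dense block with a skip connection for gradient stability, plus SE gating."""
    shortcut = x
    y = layers.Dense(units, activation="swish")(x)
    y = layers.Dense(units)(y)
    y = se_block(y)
    return layers.Activation("swish")(layers.Add()([shortcut, y]))

def build_model(n_landmarks=468, units=256):
    inp = layers.Input(shape=(n_landmarks * 3,))
    x = layers.GaussianNoise(0.005)(inp)   # robustness against shaky webcam input
    x = layers.Dense(units, activation="swish")(x)
    for _ in range(3):                     # three residual blocks, as described above
        x = residual_block(x, units)
    out = layers.Dense(6)(x)               # sin/cos pairs for Yaw, Pitch, Roll
    return tf.keras.Model(inp, out)
```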
## Training Details

This project implements a robust machine learning pipeline designed for high-accuracy regression:
- Target Representation: The model predicts Sine and Cosine values for each axis to avoid the "Gimbal Lock" problem and ensure mathematical continuity.
- Loss Function: A hybrid Huber Loss ($\delta = 2.0$) combined with a Margin Penalty ($\text{threshold} = 5.0^\circ$) to penalize outliers while remaining robust to noise.
- Sample Weighting: Uses a custom weighting scheme to prioritize accurate Pitch and Yaw estimation, which are critical for anatomical alignment.
- Optimizer: AdamW (Learning Rate: $3 \times 10^{-4}$, Weight Decay: 0.04) for improved weight regularization.
- Training Strategy: Uses Learning Rate Reduction on Plateau and Early Stopping (patience: 35) to converge to a good minimum without overfitting.
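The sin/cos target encoding and the hybrid loss can be sketched as follows. The margin-penalty term here is one plausible interpretation of the description above, not the project's exact formulation, and sample weighting is omitted for brevity.

```python
import numpy as np
import tensorflow as tf

def encode_angles(yaw, pitch, roll):
    """Encode degrees as a 6-vector [sin(y,p,r), cos(y,p,r)]: continuous at ±180°."""
    rad = np.deg2rad([yaw, pitch, roll])
    return np.concatenate([np.sin(rad), np.cos(rad)])

def decode_angles(vec):
    """Recover degrees from the sin/cos vector via atan2 (no discontinuity)."""
    return np.rad2deg(np.arctan2(vec[:3], vec[3:]))

def hybrid_loss(y_true, y_pred, delta=2.0, margin_deg=5.0):
    """Huber loss plus an extra penalty on residuals beyond a margin (assumed form)."""
    huber = tf.keras.losses.Huber(delta=delta)(y_true, y_pred)
    # Translate the degree threshold into the sin/cos residual scale.
    margin = np.sin(np.deg2rad(margin_deg))
    excess = tf.nn.relu(tf.abs(y_true - y_pred) - margin)
    return huber + tf.reduce_mean(excess)
```

Predicting sin/cos and decoding with `atan2` is what avoids the wrap-around discontinuity: a head at +179° and one at -179° produce nearly identical targets.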
## 3D Asset Generation

The Loomis head overlay was custom-modeled in Blender to ensure strict anatomical accuracy:
- Blender Workflow: A 3D mesh was generated following the proportions of the Loomis Method.
- Rendering: Optimized as a low-poly `.obj` file.
- Stability: The frontend uses Render Damping (0.9) to smooth the 3D guide's movement during real-time tracking.
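Damping of this kind is typically a per-frame exponential blend between the previous pose and the new measurement; a minimal sketch (applied to a single pose parameter, with the factor 0.9 from above):

```python
def damp(previous, target, damping=0.9):
    """Exponential smoothing: keep 90% of the previous value, blend in 10% of the new."""
    return damping * previous + (1.0 - damping) * target

# A sudden jump to 10.0 is absorbed gradually over several frames:
pose = 0.0
for measurement in [10.0, 10.0, 10.0]:
    pose = damp(pose, measurement)
```

The higher the damping factor, the steadier the overlay, at the cost of a slight lag when the head moves quickly.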
## Deployment

- Frontend: Hosted on Vercel at loomis-lens.vercel.app.
- Backend: Python API containerized via Docker and deployed on AWS Lambda/ECR for scalable, serverless inference.
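A container image for Lambda deployment via ECR might look like the following; the file names, handler, and dependency layout are assumptions, not the project's actual configuration.

```dockerfile
# Assumed layout: inference code in app.py exposing a `handler` function.
FROM public.ecr.aws/lambda/python:3.11

# Install inference dependencies (TensorFlow, MediaPipe, etc.)
COPY requirements.txt .
RUN pip install -r requirements.txt

# Copy the API code and the trained model into the Lambda task root.
COPY app.py ${LAMBDA_TASK_ROOT}
COPY model/ ${LAMBDA_TASK_ROOT}/model/

CMD ["app.handler"]
```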