A unified framework for training, evaluating, and benchmarking
point cloud segmentation methods on intraoral 3D dental scans.
Models • Install • Quick Start • Training • Evaluation • Extend • Contributing
This repository provides a modular, extensible benchmark for 3D tooth segmentation from intraoral scans. It includes:
- 18+ model implementations spanning dental-specific architectures, general point cloud backbones, and self-supervised pre-training methods
- A model registry with `@register_model` decorators — adding a new architecture requires zero changes to the training loop (sketched below)
- Unified `train.py` and `test.py` scripts that handle all models through a single CLI
- YAML-based configuration with sensible defaults and per-model overrides
- Copy-and-customize templates for new models and datasets
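For orientation, the registry boils down to a decorator that populates a name → class map which `train.py` queries at startup. A minimal sketch of the pattern (illustrative only — see `models/registry.py` for the actual implementation):

```python
# Minimal sketch of the registry pattern; names below are illustrative,
# not the repository's exact API (see models/registry.py).
_REGISTRY = {}

def register_model(name):
    """Decorator that maps a CLI name to a model class."""
    def wrapper(cls):
        _REGISTRY[name] = cls
        return cls
    return wrapper

def build_model(name, config):
    """Factory: look up a registered class and instantiate it."""
    if name not in _REGISTRY:
        raise KeyError(f"Unknown model '{name}'. Available: {sorted(_REGISTRY)}")
    return _REGISTRY[name](config)
```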
Attribution: This codebase is built upon ToothGroupNetwork by Team CGIP. We thank the original authors for their excellent baseline and data processing pipeline.
```
boilerplate_segmentation/
├── train.py                     # Unified training entry point (all models)
├── test.py                      # Unified evaluation entry point (all models)
├── smoke_test.py                # Pipeline verification (no GPU/data needed)
├── config_loader.py             # YAML + legacy Python config loader
│
├── configs/                     # YAML configuration files
│   ├── default.yaml             # Base config (inherited by all)
│   ├── dgcnn.yaml               # Per-model overrides
│   ├── pointnet.yaml
│   └── ...
│
├── models/                      # Model wrappers + modules
│   ├── registry.py              # @register_model decorator & factory
│   ├── base_model.py            # Abstract base class
│   ├── new_model_template.py    # Template for adding new models
│   ├── dgcnn_model.py           # DGCNN wrapper
│   ├── pointnet_model.py        # PointNet wrapper
│   ├── ...                      # (18 model wrappers total)
│   └── modules/                 # Neural network implementations
│       ├── dgcnn_module.py
│       └── ...
│
├── datasets/                    # Dataset classes
│   ├── base_dataset.py          # Abstract base dataset
│   ├── dental_dataset.py        # Dental scan dataset
│   └── new_dataset_template.py  # Template for adding new datasets
│
├── train_configs/               # Legacy Python configs (backward compat)
├── inference_pipelines/         # Model-specific inference pipelines
├── external_libs/               # PointNet2, PointOps CUDA extensions
│
├── generator.py                 # Legacy dataset (backward compat)
├── trainer.py                   # Training loop implementation
├── runner.py                    # Training orchestrator
├── loss_meter.py                # Loss aggregation utilities
├── augmentator.py               # Point cloud augmentations
├── gen_utils.py                 # General utilities
├── ops_utils.py                 # Point cloud operations
├── preprocess_data.py           # Raw mesh → preprocessed .npy
├── eval_visualize_results.py    # Metric computation & visualization
└── predict_utils.py             # Inference utilities
```
| Category | Model | Registry Name | Reference |
|---|---|---|---|
| Our Method | DentalMAE (Pretrain) | `dental_mae_pretrain` | — |
| | DentalMAE-Seg (Fine-tune) | `dental_mae_seg` | — |
| Dental-Specific | TGNet-FPS (Challenge Winner) | `tgnet_fps` | ToothGroupNetwork |
| | TGNet-BDL | `tgnet_bdl` | ToothGroupNetwork |
| | TSegNet | `tsegnet` | Paper |
| | TSegFormer | `tsegformer` | Paper |
| | MeshSegNet | `meshsegnet` | Paper |
| | TeethGNN | `teethgnn` | — |
| | HiCA | `hica` | — |
| | SGTNet | `sgtnet` | — |
| | SGTCNet | `sgtcnet` | — |
| | TSGCNet | `tsgcnet` | — |
| | Fast TGCN | `fast_tgcn` | — |
| | UpToothSeg | `uptoothseg` | — |
| | Dilated Tooth Seg | `dilated_tooth_seg` | — |
| General Backbones | PointNet | `pointnet` | Paper |
| | PointNet++ | `pointnetpp` | Paper |
| | DGCNN | `dgcnn` | Paper |
| | Point Transformer | `pointtransformer` | Paper |
List all models:
```bash
python train.py --list_models
```
- Python 3.8+
- PyTorch 1.7.1+ with CUDA 11.0+
- Ubuntu 18.04+ (tested)
```bash
# 1. Clone the repository
git clone https://github.com/your-username/tooth-segmentation-benchmark.git
cd tooth-segmentation-benchmark

# 2. Create a virtual environment (recommended)
conda create -n tooth_seg python=3.10 -y
conda activate tooth_seg

# 3. Install PyTorch (adjust for your CUDA version)
pip install torch torchvision --index-url https://download.pytorch.org/whl/cu118

# 4. Install dependencies
pip install wandb open3d multimethod termcolor trimesh easydict pyyaml scikit-learn

# 5. Install PointOps (required for Point Transformer & DentalMAE)
cd external_libs/pointops && python setup.py install && cd ../..

# 6. Verify installation
python smoke_test.py
```

Convert raw `.obj` meshes to preprocessed `.npy` point clouds:
```bash
python preprocess_data.py \
    --source_obj_data_path data_obj_parent_directory \
    --source_json_data_path data_json_parent_directory \
    --save_data_path path/to/preprocessed_data
```
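If you need to adapt preprocessing to other scan formats, the core mesh → `.npy` step is conceptually simple. A hedged sketch using `trimesh` (the real `preprocess_data.py` additionally handles the label JSONs, sampling, and normalization):

```python
# Conceptual sketch of the mesh -> .npy conversion; illustrative only.
import numpy as np
import trimesh

# process=False preserves the original vertex order; a multi-geometry OBJ
# would return a Scene instead of a Trimesh and need extra handling.
mesh = trimesh.load("00OMSZGW_lower.obj", process=False)
points = np.asarray(mesh.vertices, dtype=np.float32)         # (N, 3) XYZ
normals = np.asarray(mesh.vertex_normals, dtype=np.float32)  # (N, 3) normals
feat = np.concatenate([points, normals], axis=1)             # (N, 6) = input_feat
np.save("00OMSZGW_lower.npy", feat)
```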
```bash
# Train DGCNN with YAML config
python train.py \
    --model_name dgcnn \
    --config configs/dgcnn.yaml \
    --experiment_name dgcnn_baseline \
    --data_dir path/to/preprocessed_data \
    --epochs 200

# Train with WandB logging
python train.py \
    --model_name pointnet \
    --config configs/pointnet.yaml \
    --experiment_name pointnet_v1 \
    --data_dir path/to/preprocessed_data
```
```bash
python test.py \
    --model_name dgcnn \
    --config configs/dgcnn.yaml \
    --checkpoint ckpts/dgcnn_baseline_val.h5 \
    --data_dir path/to/preprocessed_data \
    --test_split base_name_test_fold.txt \
    --save_predictions results/dgcnn/
```

The `train.py` script handles all models via the model registry:
```bash
python train.py --model_name <MODEL> --config <CONFIG> [OPTIONS]
```

| Argument | Description | Default |
|---|---|---|
| `--model_name` | Model name from registry | `dgcnn` |
| `--config` | Path to `.yaml` or `.py` config | `configs/dgcnn.yaml` |
| `--experiment_name` | Name for checkpoints & logging | `experiment` |
| `--data_dir` | Preprocessed data directory | — |
| `--train_split` | Train split txt file | `base_name_train_fold.txt` |
| `--val_split` | Validation split txt file | `base_name_val_fold.txt` |
| `--epochs` | Number of epochs (overrides config) | `200` |
| `--batch_size` | Batch size (overrides config) | `1` |
| `--lr` | Learning rate (overrides config) | — |
| `--resume` | Checkpoint path to resume from | — |
| `--wandb_off` | Disable WandB logging | `False` |
| `--device` | Force device (`cuda`/`cpu`) | `auto` |
| `--val_every` | Validate every N epochs | `1` |
```bash
# Phase 1: Self-supervised pre-training
python train.py \
    --model_name dental_mae_pretrain \
    --config configs/default.yaml \
    --experiment_name mae_pretrain \
    --data_dir path/to/unlabeled_data

# Phase 2: Supervised fine-tuning
python train.py \
    --model_name dental_mae_seg \
    --config configs/default.yaml \
    --experiment_name mae_finetune \
    --data_dir path/to/labeled_data
```
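How the fine-tuning phase picks up the pre-trained weights depends on the model wrapper; a common pattern is to transfer encoder weights and leave the segmentation head randomly initialized. A sketch under the assumption that checkpoint keys are prefixed with `encoder.` (the key names and checkpoint layout here are illustrative, not the repository's exact format):

```python
import torch

def load_pretrained_encoder(seg_model, ckpt_path):
    """Transfer encoder weights from a pre-training checkpoint into an
    instantiated segmentation model; strict=False leaves the segmentation
    head at its random initialization."""
    ckpt = torch.load(ckpt_path, map_location="cpu")
    state = ckpt.get("model_state_dict", ckpt)  # assumed checkpoint layout
    encoder_state = {k: v for k, v in state.items() if k.startswith("encoder.")}
    missing, unexpected = seg_model.load_state_dict(encoder_state, strict=False)
    print(f"transferred {len(encoder_state)} tensors; "
          f"{len(missing)} params left at random init")
    return seg_model
```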
The original `start_train.py` is still available for backward compatibility:

```bash
python start_train.py \
    --model_name dgcnn \
    --config_path train_configs/dgcnn.py \
    --experiment_name dgcnn_exp \
    ...
```

Evaluation uses the same registry-driven CLI:

```bash
python test.py --model_name <MODEL> --config <CONFIG> --checkpoint <CKPT> [OPTIONS]
```

| Argument | Description |
|---|---|
| `--checkpoint` | Path to model checkpoint (`.h5`) |
| `--test_split` | Test split txt file |
| `--save_predictions` | Directory to save prediction JSONs |
| `--num_classes` | Number of classes (default: 17) |
Output metrics:
- Mean IoU (Intersection over Union)
- Mean F1 (Dice Score)
- Accuracy (overall and per-class)
- Per-class IoU and F1 breakdown
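For reference, per-class IoU and F1 follow the standard confusion-count definitions. A minimal NumPy sketch (the repository's exact computation lives in `eval_visualize_results.py`):

```python
import numpy as np

def per_class_iou_f1(pred, gt, num_classes=17):
    """Compute per-class IoU and F1 from flat label arrays; classes absent
    from both prediction and ground truth are skipped via NaN."""
    ious, f1s = [], []
    for c in range(num_classes):
        tp = np.sum((pred == c) & (gt == c))
        fp = np.sum((pred == c) & (gt != c))
        fn = np.sum((pred != c) & (gt == c))
        ious.append(tp / (tp + fp + fn) if (tp + fp + fn) else np.nan)
        f1s.append(2 * tp / (2 * tp + fp + fn) if (2 * tp + fp + fn) else np.nan)
    return np.nanmean(ious), np.nanmean(f1s), ious, f1s
```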
To visualize predictions against ground truth for a single scan:

```bash
python eval_visualize_results.py \
    --mesh_path path/to/obj_file \
    --gt_json_path path/to/gt_json_file \
    --pred_json_path path/to/predicted_json_file
```

To add a new model, copy the template:

```bash
cp models/new_model_template.py models/my_model_model.py
```

Edit `my_model_model.py` — fill in the TODO markers:
```python
from models.registry import register_model
from models.base_model import BaseModel

@register_model("my_model")  # ← Name used in CLI
class MyModel(BaseModel):
    def __init__(self, config, module=None):
        if module is None:
            module = MyModule  # ← Your nn.Module class
        super().__init__(config, module)

    def get_loss(self, outputs, gt):
        # ← Define your loss computation
        ...

    def step(self, batch_idx, batch_item, phase="train"):
        # ← Forward pass + optimization
        ...
```

Create a config:

```bash
cp configs/dgcnn.yaml configs/my_model.yaml
# Edit optimizer, scheduler, model_parameter as needed
```

Train:

```bash
python train.py --model_name my_model --config configs/my_model.yaml ...
```

That's it. No changes to `train.py`, `test.py`, or any other file. The `@register_model` decorator handles everything.
To add a new dataset, copy the template:

```bash
cp datasets/new_dataset_template.py datasets/my_dataset.py
```

Your dataset must inherit from `BaseSegDataset` and return a dict with:
| Key | Shape | Description |
|---|---|---|
| `feat` | `[C, N]` | Point features (channels-first) |
| `gt_seg_label` | `[1, N]` | Segmentation labels |
| `category` | `[2]` | One-hot jaw category |
| `mesh_path` | `str` | Source file path |
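A minimal subclass might look like the following. This is a sketch under assumed hooks (`self.file_list`, a label column stored in the `.npy`); follow the TODOs in `new_dataset_template.py` for the real API:

```python
# Illustrative dataset returning the required dict; BaseSegDataset's actual
# hooks may differ from the assumptions made here.
import numpy as np
import torch
from datasets.base_dataset import BaseSegDataset

class MyDataset(BaseSegDataset):
    def __getitem__(self, idx):
        path = self.file_list[idx]               # assumed: one entry per split line
        data = np.load(path)                     # preprocessed .npy array
        feat = torch.from_numpy(data[:, :6]).T   # [C, N], channels-first
        labels = torch.from_numpy(data[:, 6:7]).long().T  # [1, N], assumed column
        jaw = torch.tensor([1.0, 0.0])           # [2] one-hot jaw category
        return {
            "feat": feat.float(),
            "gt_seg_label": labels,
            "category": jaw,
            "mesh_path": path,
        }
```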
We use the 3DTeethSeg'22 Challenge dataset.
```
data_obj_parent_directory/
├── 00OMSZGW/
│   ├── 00OMSZGW_lower.obj
│   └── 00OMSZGW_upper.obj
└── ...

data_json_parent_directory/
├── 00OMSZGW/
│   ├── 00OMSZGW_lower.json
│   └── 00OMSZGW_upper.json
└── ...
```
For self-supervised pre-training, download the unlabeled scans from OneDrive.
Run the smoke test to verify the entire pipeline without real data or GPU:
```bash
python smoke_test.py
```

This tests: model registry (18 models), YAML config loading & merging, dataset instantiation with dummy data, DataLoader batching, loss infrastructure, and CLI script importability.
We welcome contributions! Here's how to get started:
- Fork the repository
- Create a branch for your feature: `git checkout -b feature/my-new-model`
- Implement your changes (see Adding New Models)
- Test with `python smoke_test.py`
- Submit a Pull Request with a clear description
| Area | Difficulty | Description |
|---|---|---|
| New Model | ★★ | Add a new segmentation architecture |
| New Dataset | ★★ | Add support for another dental dataset |
| Metrics | ★ | Add boundary F1, Hausdorff distance |
| Visualization | ★★ | Add 3D prediction visualization tools |
| Documentation | ★ | Improve docstrings and examples |
| Performance | ★★★ | Multi-GPU training, mixed precision |
- Follow PEP 8 conventions
- Add docstrings to all public functions and classes
- Use the model registry — never add `if`/`elif` chains for new models
- Write a corresponding YAML config for every new model
- Ensure `python smoke_test.py` passes before submitting
```yaml
# configs/my_model.yaml — Inherits from configs/default.yaml
tr_set:
  optimizer:
    NAME: "adam"        # adam | sgd | adamw
    lr: 0.001
    weight_decay: 0.0001
  scheduler:
    sched: "cosine"     # cosine | exp | step
    full_steps: 40
    min_lr: 0.00001
model_parameter:
  input_feat: 6         # XYZ + normals
  num_classes: 17       # 16 teeth + gingiva
training:
  epochs: 200
  val_every: 1
wandb:
  wandb_on: false
  project: "tooth_seg"
```

CLI arguments override YAML values (e.g. `--lr 0.0005` overrides `optimizer.lr`).
This project is released for academic and research purposes. Please refer to the original ToothGroupNetwork repository for license details.
- ToothGroupNetwork: Lim, H., et al. "3D Dental Segmentation via Tooth Group Network." MICCAI 2022. GitHub
- 3DTeethSeg Challenge: grand-challenge.org
- PointNet: Qi, C.R., et al. "PointNet: Deep Learning on Point Sets." CVPR 2017.
- PointNet++: Qi, C.R., et al. "PointNet++: Deep Hierarchical Feature Learning." NeurIPS 2017.
- DGCNN: Wang, Y., et al. "Dynamic Graph CNN for Learning on Point Clouds." TOG 2019.
- Point Transformer: Zhao, H., et al. "Point Transformer." ICCV 2021.
This codebase is built upon the excellent work of Team CGIP's ToothGroupNetwork. We are grateful for their open-source contribution which made this benchmark possible.
If you find this work useful, please consider giving it a ⭐