Feeder Model Training

Training scripts for feeder bee detection models. Supports two model architectures:

POLO -- Point-detection model based on YOLO (ultralytics fork). Predicts bee locations as single points with class-specific radii on full-resolution images.
Localizer -- Lightweight fully-convolutional heatmap model (~248K params). Classifies 128x128 grayscale patches as containing a bee or background.

Both models detect four classes: UnmarkedBee, MarkedBee, BeeInCell, UpsideDownBee.

Setup

Requires Python 3.10+ and PyTorch with CUDA support (see pytorch.org).

# Install mosaic-behavior with POLO and localizer extras
pip install "mosaic-behavior[polo,localizer] @ git+https://github.com/ecodylicscience/mosaic.git"

# Clone this repo
git clone <repo-url>
cd feeder-model-training

Dataset

The training data is distributed as a tarball (feeder_bee_datasets_v1.tar.gz), separate from this repo. Extract it on the training machine:

tar xzf feeder_bee_datasets_v1.tar.gz

This produces the following structure:

Dataset	Path	Contents	Purpose
A	`polo/cvat_only`	CVAT annotations only	POLO baseline
B	`polo/merged`	CVAT + HDF5 + pseudo-labels	POLO merged training
C	`localizer/cvat`	128x128 patches from CVAT	Localizer baseline
All datasets share the same test set (CVAT images only) for fair model comparison. The train/valid/test split is stratified by camera type (feeder vs exit cam) to ensure proportional representation.

Localizer resolution scaling

The 2019 pretrained localizer was trained on BeesBook colony cam images at 38 px/tag. Feeder cam images have ~58 px/tag. To maintain compatibility with pretrained encoder weights, localizer patches are extracted from images pre-scaled by 0.655x (38/58). At inference time, input images must be downscaled by the same factor and detected coordinates mapped back to original image space. This is handled automatically by evaluate.py and documented in config.py.

Quick Start

# Train POLO on merged dataset (recommended)
python train_polo.py --dataset /path/to/feeder_bee_datasets_v1

# Train POLO on CVAT-only baseline
python train_polo.py --dataset /path/to/feeder_bee_datasets_v1 --variant cvat_only

# Train localizer on CVAT patches
python train_localizer.py --dataset /path/to/feeder_bee_datasets_v1

# Train localizer with pretrained weights
python train_localizer.py --dataset /path/to/feeder_bee_datasets_v1 \
    --weights /path/to/localizer_2019_weights.pt

# Evaluate a trained POLO model
python evaluate.py --type polo --dataset /path/to/feeder_bee_datasets_v1 \
    --model runs/polo/merged_20260313/weights/best.pt

# Evaluate a trained localizer
python evaluate.py --type localizer --dataset /path/to/feeder_bee_datasets_v1 \
    --model runs/localizer/cvat_20260313/weights/best.pt

Configuration

All scripts accept --help for full argument documentation. Key parameters:

train_polo.py

Argument	Default	Description
`--dataset`	(required)	Path to `feeder_bee_datasets_v1/`
`--variant`	`merged`	`merged` or `cvat_only`
`--model`	`polo26n.yaml`	Architecture (nano/small/medium/large)
`--epochs`	200	Max training epochs
`--batch`	16 (8 for merged)	Batch size
`--patience`	50	Early stopping patience
`--loc`	5.0	Localization loss weight
`--dor`	0.8	Distance of Reference threshold
`--augmentation`	`heavy`	Augmentation preset
`--device`	auto	`0` (cuda), `mps`, `cpu`

train_localizer.py

Argument	Default	Description
`--dataset`	(required)	Path to `feeder_bee_datasets_v1/`
`--variant`	`cvat`	`cvat` or `merged`
`--epochs`	300	Max training epochs
`--batch-size`	128	Batch size
`--lr`	0.001	Learning rate
`--patience`	40	Early stopping patience
`--weights`	None	Pretrained weights (.pt or .h5)
`--freeze-encoder`	False	Train head only
`--device`	auto	`0` (cuda), `mps`, `cpu`

Output Structure

Training runs are saved to runs/<model_type>/<run_name>/:

runs/polo/merged_20260313_143022/
    weights/
        best.pt        # best checkpoint (by validation metric)
        last.pt        # final epoch checkpoint
    results.csv        # per-epoch metrics
    args.yaml          # training configuration

Classes

ID	Name	Notes
0	UnmarkedBee	Most common class
1	MarkedBee	Rare in feeder cam images, more common in exit cam
2	BeeInCell	Absent from CVAT annotations (no comb cells at feeder). Present in HDF5/pseudo-label sources only.
3	UpsideDownBee	Bees walking upside down on feeder

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
config.py		config.py
evaluate.py		evaluate.py
pyproject.toml		pyproject.toml
train_localizer.py		train_localizer.py
train_polo.py		train_polo.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Feeder Model Training

Setup

Dataset

Localizer resolution scaling

Quick Start

Configuration

train_polo.py

train_localizer.py

Output Structure

Classes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Feeder Model Training

Setup

Dataset

Localizer resolution scaling

Quick Start

Configuration

train_polo.py

train_localizer.py

Output Structure

Classes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages