MBGv2: Frame slicing and small object detection

This repository contains the complete codebase for experiments with the MBGv2 dataset reported in my dissertation. The project focuses on object detection in high-resolution (4K) images using frame slicing techniques, then further fine-tuning YOLO models.

The codebase enables execution of two main experiments:

1. Frame Slicing with Annotation Generation

Slices 4K resolution frames from the MBGv2 dataset into 640x640 excerpts with overlapping regions, automatically generating annotations in both COCO and YOLO formats for sliced objects. This preprocessing step is essential for detecting small objects (such as mosquito breeding sites like tires) in high-resolution images.

2. YOLO Model Fine-tuning

Further trains a YOLO model (default: YOLOv8s) on the sliced dataset. The default settings follow the experimental configuration used in the dissertation but can be customized for additional experimentation.

Installation

Prerequisites

First, install uv, a fast Python package manager:

# On macOS and Linux
curl -LsSf https://astral.sh/uv/install.sh | sh

Then clone the repository and install the project dependencies:

cd MBGv2-d

# Install all dependencies
uv sync

How to Run

The sh folder contains shell scripts for the frame slicing and training pipelines.

The commands below show the basic command lines to reproduce the experiments from the dissertation. For detailed information on customizable parameters, read the 'sh' folder README.md.

Frame Slicing

To reproduce the frame slicing experiments — which slice annotated 4K frames from the MBGv2 dataset into 640x640 excerpts with overlapping regions:

chmod +x sh/frame_slicing/process_all_folds.sh

# Run the script (uses all available CPU cores by default)
./sh/frame_slicing/process_all_folds.sh \
  --image-dir /path/to/mbgv2/frames \
  --annotations-dir /path/to/annotations \
  --overlap-ratio 0.067 \
  --object-name tire

# Or specify number of parallel workers
./sh/frame_slicing/process_all_folds.sh \
  --image-dir /path/to/mbgv2/frames \
  --annotations-dir /path/to/annotations \
  --overlap-ratio 0.067 \
  --object-name tire \
  --n-workers 8

The output will contain train/val folders containing the sliced images, COCO and YOLO annotations.

Custom parameters example:

chmod +x sh/frame_slicing/process_all_folds.sh
bash sh/frame_slicing/process_all_folds.sh \
  --image-dir [path to folder containing frames (images)] \
  --annotations-dir [path to dir w/ COCO annotations] \
  --overlap-ratio 0.067 \
  --object-name tire \
  --n-workers 4

Model Training (Fine-tuning)

Subsequently train and validate a YOLOv8s model with the sliced dataset:

chmod +x sh/yolo/train_all_folds.sh
bash sh/yolo/train_all_folds.sh --data_dir [Path to dir w/ training data .YAML] --hyp-config [path to training hyperparameter config yaml]

This script will reproduce the training and evaluation across the 5 folds for the MBGv2. It will also display the average F1 score for each fold, as well as the optimal threshold for detection.

Custom Frame Slicing (beyond dissertation)

For more flexible frame slicing with custom fold selection, dataset splits, and min_area_ratio values, you can use the script custom_fold_processing.sh. For further instructions of customisation, read the README.md inside sh.

chmod +x sh/frame_slicing/custom_fold_processing.sh

# Basic usage with mandatory parameters (uses all available CPU cores by default)
./sh/frame_slicing/custom_fold_processing.sh \
  --image-dir "/path/to/frames" \
  --annotations-dir "/path/to/annotations" \
  --object-name "watertank" \
  --overlap-ratio 0.1 \
  --folds "0-4" \
  --splits "train val" \
  --min-area-ratios "0.0"

# With custom number of parallel workers
./sh/frame_slicing/custom_fold_processing.sh \
  --image-dir "/path/to/frames" \
  --annotations-dir "/path/to/annotations" \
  --object-name "watertank" \
  --overlap-ratio 0.1 \
  --folds "0-4" \
  --splits "train val" \
  --min-area-ratios "0.0" \
  --n-workers 6

Multiple definitions are accepted for the frame slicing process:

Custom fold selection: Process specific folds (e.g., "0,2-4", "1,3", "0-20", "1")
Flexible dataset splits: Choose which splits to process ("train", "val", "test", "train val train", etc.)
Custom min_area_ratio ranges: Define specific values ("0.0,0.5,1.0") or ranges ("0.5-0.8")
Parallel processing: Automatically parallelizes all fold+min_area_ratio+split combinations for optimal performance

Examples:

# Process only folds 0-2 with train and val splits (with 4 parallel workers)
./sh/frame_slicing/custom_fold_processing.sh \
  --image-dir "./frames" --annotations-dir "./annotations" \
  --object-name "tire" --overlap-ratio 0.067 \
  --folds "0-2" --splits "train val" \
  --n-workers 4

# Process specific min_area_ratios for single fold (uses all available CPUs)
./sh/frame_slicing/custom_fold_processing.sh \
  --image-dir "./frames" --annotations-dir "./annotations" \
  --object-name "watertank" --overlap-ratio 0.1 \
  --folds "0" --splits "train val test" \
  --min-area-ratios "0.0,0.5,1.0"

Expected Data Structure

To run the experiments, you must have the MBGv2 dataset. This codebase is structured to process the dataset published in Isabelle Vaz de Mello (2024)'s experiments with Faster-RCNN, which includes frames and COCO annotations:

Input (MBGv2 Dataset)

MBGv2_dataset/
├── frames/                      # Original 4K resolution images
│   ├── image1.jpg
│   ├── image2.jpg
│   └── ...
└── coco_json_folds/            # COCO annotations organized by folds
    └── 5folds/
        └── tire/               # Object class (tire|watertank)
            └── 40m/            # Drone height parameter
                ├── coco_format_.json
                ├── fold1_val.json
                └── ...

Output (After Processing)

MBGv2_sliced/                   # Sliced dataset
├── fold1/
│   └── 00/                     # Min area ratio folder
│       ├── data.yaml           # YOLO dataset configuration
│       ├── images/             # Sliced 640x640 images
│       ├── labels/             # YOLO format annotations
│       └── coco_annotations/   # COCO format annotations
└── ...

results/                        # Training results
├── fold1/
│   ├── run1/                   # Individual training runs
│   └── ...
├── fold1.log                   # Training logs
└── ...

Development

# Run all tests
uv run make test

# Format code
uv run make format

# Lint code
uv run make check

Configuration

Default Training Hyperparameter Files

config/hyp.mosquito.tire.yaml: Sample settings for tire detection
config/hyp.mosquito.watertank.yaml: Sample settings for watertank detection
config/hyp_mosquito_tire_dissertation.yaml: Dissertation experiment settings for tire detection

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
MBGv2_dissertation		MBGv2_dissertation
config		config
sh		sh
tests		tests
.gitignore		.gitignore
Makefile		Makefile
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MBGv2: Frame slicing and small object detection

1. Frame Slicing with Annotation Generation

2. YOLO Model Fine-tuning

Installation

Prerequisites

How to Run

Frame Slicing

Model Training (Fine-tuning)

Custom Frame Slicing (beyond dissertation)

Expected Data Structure

Input (MBGv2 Dataset)

Output (After Processing)

Development

Configuration

Default Training Hyperparameter Files

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

MBGv2: Frame slicing and small object detection

1. Frame Slicing with Annotation Generation

2. YOLO Model Fine-tuning

Installation

Prerequisites

How to Run

Frame Slicing

Model Training (Fine-tuning)

Custom Frame Slicing (beyond dissertation)

Expected Data Structure

Input (MBGv2 Dataset)

Output (After Processing)

Development

Configuration

Default Training Hyperparameter Files

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages