YOLO Annotation Editor

A comprehensive WPF application serving two equally important workflows: YOLO object detection dataset management and PaddleOCR text recognition dataset management. Covers the full pipeline from raw images to training-ready data — annotation, dataset manipulation, quality control, and live inference — for both YOLO v11 and PaddleOCR models.

Features

Dataset Editor

Interactive Annotation Editor: Create, edit, and delete bounding box annotations with click-and-drag; select existing annotations to reclassify or delete them
Class Search & Quick Select: Filter the class dropdown by name or ID while annotating — single matches are auto-selected
Dataset Management: Load YOLO format datasets via YAML file; auto-detects train/val splits and generates empty label files for images missing them
Decimal Separator Fix: On load, detects label files using comma as decimal separator and offers to convert the entire dataset to dot notation
Edit State Tracking: Green indicators per image show which have been reviewed; toggle individual states or batch-mark all images up to the current one
Advanced Search: Filter the image list by filename or class name in real time
Statistics & Visualization: Bar chart of annotation counts per class, plus totals for images, annotations, unique classes, and edit progress
YOLO Auto-Annotation: Load an ONNX model (CUDA with CPU fallback) to detect annotations on the current image, redetect clearing existing ones, or run batch redetection across the entire dataset; auto-triggers on unannotated images when a model is loaded and edit mode is active
Batch Label Editor: Multi-step wizard to select images, draw a spatial region on a reference image, pick a target class, preview the impact, and reclassify all labels whose center falls within the region across the selected images
Keyboard Navigation: Arrow keys to move between images; Ctrl+E toggle edit mode; Ctrl+S save; Delete remove selected annotation; Ctrl+D detect / Ctrl+Shift+D redetect with YOLO
Zoom & Pan: Zoom control locks during edit mode for precise annotation placement

YAML Editor

YAML Configuration Management: Intuitive interface for managing dataset YAML files and class definitions
Path Settings: Configure base paths and train/validation/test directories
Class Management: Add, remove, and edit class definitions with search and sorting capabilities
File Operations: Create, open, save, and save-as functionality for YAML configurations

YOLO Preview

Live Object Detection: Real-time YOLO object detection preview using ONNX models
Multiple Input Sources: Support for webcam and screen capture
Model Flexibility: Load custom ONNX models with automatic fallback from CUDA to CPU
Detection Control: Toggle detection on/off and control capture settings

PaddleOCR Runner

Multiple Built-in Models: English V3, English V4, and Chinese V5 via one-click load
Custom Local Models: Load your own trained PaddleOCR model by pointing to separate detection and recognition model directories (V5 format)
Per-region Results: Displays each detected text region with its confidence score
Performance Metrics: Shows total inference time in milliseconds
Image Display: Renders the loaded image alongside results for quick visual verification

OCR Annotation

Text Annotation Interface: Dedicated tool for labeling image crops with their text content, producing a labels.txt file in PaddleOCR format (filename text)
Automatic OCR Detection: One-click OCR on the current image using the loaded PaddleOCR model to pre-fill the text label
Dataset Preparation: On folder load, offers to convert PNG files to JPG and rename all files to sequential numeric names (0.jpg, 1.jpg, …) — the format expected by PaddleOCR training pipelines
Auto-save: Labels are saved automatically on every navigation change and on application close
Progress Tracking: Per-image labeled/unlabeled indicator (green check / red warning) and a running Labeled: N/Total counter
Navigation: Previous/Next buttons with current position display

OCR Dataset Tools

TRDG to PaddleOCR Conversion: Convert Text Recognition Data Generator output (labels.txt + images) to PaddleOCR training format (rec_gt_train.txt / rec_gt_val.txt + split image folders) with a configurable train/val ratio slider
Dataset Merging: Merge any number of PaddleOCR datasets into one; choose image scaling mode — fixed height (aspect-ratio preserved), variable height (random within a min/max range), or no scaling — with configurable JPEG quality and optional random seed; outputs sequentially numbered images and a merged labels.txt
Auto-Discovery: Scan a root directory (up to 3 levels deep) to automatically find all folders that contain a labels.txt and at least one image, and queue them for merging
Character Set Generation: Scan a dataset's labels to extract every unique character and write a one-character-per-line dictionary file; shows a breakdown of letters, digits, punctuation, and spaces
Dataset Analysis: Total samples, unique characters, avg/min/max label length, per-character frequency table with percentage and dictionary coverage flag, dictionary comparison (characters in dataset missing from dict and vice versa), and training recommendations based on dataset size, character set breadth, and dictionary alignment

YOLO Dataset Tools

Merge Datasets: Combine multiple YOLO datasets into one, merging class maps and resolving filename conflicts automatically; outputs a ready-to-use dataset.yaml
Split Datasets: Split a dataset into N equal parts, into parts of a fixed image count, or extract a random subset — with optional seed for reproducibility; choose which splits (train/val/test) to include
Consolidate: Move all images and labels from any split into train, collapsing the directory structure
Train/Val/Test Split: Redistribute a flat dataset into train/val/test splits with configurable ratios
Filter Datasets: Subset a dataset by class presence (contains / does not contain, AND / OR logic), by first N images, or by random N images; filtered output includes only the relevant classes in the YAML
Analyze Dataset: Per-split and per-class annotation counts, images with no annotations, bounding box size distribution, aspect ratio statistics, and duplicate detection; export results to CSV or a full HTML report
Balance Datasets: Merge a primary and secondary dataset while balancing class representation
Validate Dataset: Check image-label consistency, detect corrupt images, missing label files, and out-of-bounds annotations; export a clean subset of only valid images

Getting Started

Installation

Download the latest release from our GitHub Releases page. The application uses Velopack for easy installation and automatic updates.

Requirements

Windows 10/11
.NET 8.0 Runtime

Usage

Loading a Dataset

Launch the application
Click "Browse..." to select your YOLO dataset YAML file
The application will load all images and annotations from the dataset

Editing Annotations

Select an image from the thumbnail list
Toggle "Edit Mode" to enable annotation editing
Click and drag on the image to create a new bounding box
Select a class from the right panel dropdown (or type to search)
Click an existing annotation to select and reclassify or delete it
Use Delete or "Delete Selected" to remove the selected annotation
Click "Save Changes" or press Ctrl+S to save

Tracking Edit Progress

Green indicators show which images have been edited
Use "Toggle Edit State" to manually mark an image as edited/unedited
Use "Mark All Edited Until Here" to batch mark multiple images
View overall edit progress in the Statistics tab

YOLO Auto-Annotation

Browse and load an ONNX model in the toolbar (CUDA with CPU fallback)
Enable Edit Mode and navigate to an image
Click "Detect with YOLO" to add detections on top of existing annotations, or "Redetect with YOLO" to clear and redetect
Use "Batch Redetect" to run detection across all images in the dataset
With a model loaded and Edit Mode active, the editor auto-detects on unannotated, unreviewed images

Batch Label Editor

Click "Batch Label Editor" (requires a loaded dataset)
Step 1: Select the images to modify (filter, select all, invert)
Step 2: Draw a region on a reference image to define where labels will be changed
Step 3: Choose the target class for all labels whose center falls inside the region
Step 4: Review the summary and apply — results are written directly to label files

YAML Editor

Navigate to the YAML Editor tab
Use the toolbar to create, open, save, or save as YAML files
Configure dataset paths in the Path Settings section
Manage class definitions using the dedicated buttons and data grid
Search/filter classes and sort by ID or name

YOLO Preview

Navigate to the YOLO Preview tab
Select an ONNX model file using the Browse button
Choose source type: Webcam or Screen capture
Select input device and start/stop capture
Toggle YOLO detection on/off and view real-time results

OCR Annotation

Navigate to the OCR Annotation tab
Open a folder containing images
Use "Detect with OCR" for automatic text recognition
Navigate through images and edit text annotations
Progress is automatically saved

OCR Dataset Tools

Navigate to the OCR Dataset Tools tab
Convert TRDG to PaddleOCR: Select the TRDG directory and an output directory, set the train/val split ratio, and convert
Merge Datasets: Add dataset folders manually or use Auto-Discover; pick a scaling mode and quality; merge into a single numbered dataset
Generate Character Set: Select a dataset folder and an output .txt path; the tool writes one character per line and previews the full set
Analyze Dataset: Select a dataset folder, optionally load a dictionary file for comparison, and run the analysis to review statistics and recommendations

YOLO Dataset Tools

Navigate to the YOLO Dataset Tools tab
Merge: Add dataset folders, select an output directory, and merge
Split: Choose a dataset, pick a split mode (N parts / by count / random subset), and split
Filter: Load classes from the dataset, select which to include/exclude with AND/OR logic, and filter
Analyze: Run a full analysis and export results to CSV or HTML
Balance: Select primary and secondary datasets to produce a class-balanced merge
Validate: Scan for corrupt images, missing labels, and out-of-bounds annotations; export only valid images

Development

Building from Source

Clone the repository

git clone https://github.com/flowhl/YoloAnnotationEditor.git

Open the solution in Visual Studio 2022 or later
Restore NuGet packages
Build the solution

Dependencies

.NET 8.0
WPF (Windows Presentation Foundation)
LiveChartsCore - for statistics visualization
YamlDotNet - for parsing YAML config files
OpenCVSharp4 - for image manipulation
Denxorz.ZoomControl - for zoom functionality
SkiaSharp - for high-performance 2D graphics
YoloDotNet - for YOLO object detection with ONNX models
Sdcb.PaddleOCR - for OCR functionality
Velopack - for automatic updates

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Fork the repository
Create your feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add some amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.

Third-Party Licenses

Denxorz.ZoomControl - Microsoft Public License (MS-PL)
Other dependencies may have their own licenses - see respective projects for details

Acknowledgments

YOLO - for the annotation format
OpenCV - for image processing capabilities
PaddleOCR - for OCR functionality

About

YOLO Annotation Editor was created to simplify the process of creating and maintaining high-quality datasets for computer vision applications, now expanded with comprehensive OCR dataset management capabilities. For questions or support, please open an issue on our GitHub repository.

Name		Name	Last commit message	Last commit date
Latest commit History 79 Commits
.github/workflows		.github/workflows
YoloAnnotationEditor		YoloAnnotationEditor
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

YOLO Annotation Editor

Features

Dataset Editor

YAML Editor

YOLO Preview

PaddleOCR Runner

OCR Annotation

OCR Dataset Tools

YOLO Dataset Tools

Getting Started

Installation

Requirements

Usage

Loading a Dataset

Editing Annotations

Tracking Edit Progress

YOLO Auto-Annotation

Batch Label Editor

YAML Editor

YOLO Preview

OCR Annotation

OCR Dataset Tools

YOLO Dataset Tools

Development

Building from Source

Dependencies

Contributing

License

Third-Party Licenses

Acknowledgments

About

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 23

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages