Train-time data-augmentation for parameterised learning

## Overview

Parameterised learning is useful in HEP, for example in cases where a classifier should learn multiple signal hypotheses (e.g. a heavy Higgs of several possible masses) see [Baldi et al., 2016](https://arxiv.org/abs/1601.07913).

In this example the signal would have a parameterised input equal to the true resonant mass, and the background would be randomly assigned resonant masses. Once trained, the entire dataset can be set to a particular resonant mass in order to perform inference for a given hypothesis. This last part is already possible with the `ParametrisedPrediction` class.

## Data augmentation for parameterised learning

Currently the random assignment  of parameterised-feature values for background (in the example above) is performed once when preparing the data for training. It could well be possible that it is useful to perform this random assignment during training, which may provide some of the benefits of train-time data augmentation.

## Implementation

To avoid conflicts with `HEPAugFoldYielder`, and due to the fact that this only wants to be performed during training, this secondary form of augmentation should probably implemented as a callback. It also needs to account for the possibility that multiple parameterisation features may be used, and that only a subset of the data may need to be changed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Train-time data-augmentation for parameterised learning #68

Overview

Data augmentation for parameterised learning

Implementation

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Train-time data-augmentation for parameterised learning #68

Description

Overview

Data augmentation for parameterised learning

Implementation

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions