I'm currently curating a cactus wren dataset to train a whittle model on Edge Impulse. The data combines Xeno-canto recordings, which I ran through BirdNET to cut out 3 s segments of cactus wren and 5 other regional species, with supplementary data from sagemics: field recordings of cactus wren and the 5 other species of interest. To create logical test/validation splits, I need all of this data in a format that I can run through my stratified group folding script.
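As a rough sketch of what that format could look like (not my actual folding script): one row per 3 s segment, with a `label` column for the species and a `group` column for the parent recording, so that segments cut from the same recording never end up on both sides of a split. The recording IDs, file names, column names, the `other_species_1` label, and the greedy `assign_folds` helper below are all hypothetical and only there for illustration.

```python
from collections import Counter, defaultdict

# Hypothetical parent recordings: (source, recording ID, species label, number of 3 s segments).
RECORDINGS = [
    ("xeno_canto", "xc_1001", "cactus_wren", 3),
    ("xeno_canto", "xc_1002", "cactus_wren", 2),
    ("sagemics", "field_01", "cactus_wren", 2),
    ("xeno_canto", "xc_2001", "other_species_1", 2),
    ("sagemics", "field_02", "other_species_1", 2),
    ("xeno_canto", "xc_2002", "other_species_1", 1),
]


def build_manifest(recordings):
    """Return one row per segment; the group is the parent recording."""
    rows = []
    for source, rec_id, label, n_segments in recordings:
        for i in range(n_segments):
            rows.append({
                "file": f"{rec_id}_{i * 3:03d}.wav",  # assumed naming: start offset in seconds
                "label": label,
                "source": source,
                "group": f"{source}:{rec_id}",
            })
    return rows


def assign_folds(rows, n_folds):
    """Greedy group-aware assignment: a whole group goes to the fold that
    currently holds the fewest segments of that group's majority label."""
    labels_by_group = defaultdict(Counter)
    for row in rows:
        labels_by_group[row["group"]][row["label"]] += 1

    fold_label_counts = [Counter() for _ in range(n_folds)]
    fold_sizes = [0] * n_folds
    fold_of_group = {}

    # Largest groups first; ties broken by name so the result is deterministic.
    ordered = sorted(
        labels_by_group,
        key=lambda g: (-sum(labels_by_group[g].values()), g),
    )
    for group in ordered:
        counts = labels_by_group[group]
        majority = counts.most_common(1)[0][0]
        best = min(
            range(n_folds),
            key=lambda k: (fold_label_counts[k][majority], fold_sizes[k], k),
        )
        fold_of_group[group] = best
        fold_label_counts[best].update(counts)
        fold_sizes[best] += sum(counts.values())

    for row in rows:
        row["fold"] = fold_of_group[row["group"]]
    return rows


rows = assign_folds(build_manifest(RECORDINGS), n_folds=2)
for row in rows:
    print(row["fold"], row["label"], row["group"], row["file"])
```

The same `label` and `group` columns are what scikit-learn's `StratifiedGroupKFold.split(X, y, groups)` takes as `y` and `groups`, so a manifest in this shape should work with either a custom script or that class.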