impact of reshape(-1) vs mean(0)

https://github.com/microsoft/Pengi/blob/31d5e37d0cf5a7bdcdcb4fbd8f2072674f993cea/wrapper.py#L167

Hi, I noticed a potential issue with this function. Assuming a stereo input file of 10 s duration sampled at 44.1 kHz, torchaudio.load() returns a tensor of shape (2, 441000), and the reshaped output would have shape (882000,). Since .reshape(-1) lays the two channels end to end rather than mixing them, any subsequent repetition or trimming (selection of a random subset of the time series) applied to the flattened tensor would treat the channels unevenly. Would .mean(0), which averages the signal across channels, be a better choice? It would keep the original length of 441000 samples and avoid this uneven behaviour in the subsequent operations.
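To make the difference concrete, here is a minimal sketch using a synthetic tensor in place of a real audio file. The variable names are my own and are not taken from wrapper.py; the all-zeros and all-ones channels are only an assumption chosen so the two channels are easy to tell apart.

```python
import torch

sample_rate = 44100                      # 44.1 kHz, as in the example above
duration_s = 10
num_samples = sample_rate * duration_s   # 441000 samples per channel

# Synthetic stand-in for the (channels, samples) tensor from torchaudio.load():
# left channel is all zeros, right channel is all ones.
left = torch.zeros(num_samples)
right = torch.ones(num_samples)
stereo = torch.stack([left, right])      # shape (2, 441000)

flattened = stereo.reshape(-1)           # shape (882000,)
mono = stereo.mean(0)                    # shape (441000,)

# reshape(-1) places the channels end to end: all of left, then all of right.
first_half_is_left = torch.equal(flattened[:num_samples], left)
second_half_is_right = torch.equal(flattened[num_samples:], right)

print(tuple(stereo.shape), tuple(flattened.shape), tuple(mono.shape))
print(first_half_is_left, second_half_is_right)
```

With the flattened tensor, a crop taken from the first 441000 samples contains only the left channel, while mean(0) gives a single signal of the original duration in which both channels contribute equally.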

Please let me know if I am missing something here.

Regards,
Aashish

impact of reshape(-1) vs mean(0) #21
