Does anyone have experience with how this deals with background noise? It appears to be analyzing audio spectra, but can it detect a birdsong against the background of a busy road or a nearby railway? I've seen and heard so many birds near where I live (red-tailed hawks, northern flickers, European starlings, magpies, finches, ...), but my audio recordings of them have always been hampered by the sounds of human activity.
The original paper [1] mentions data augmentation of the training dataset, including the addition of background noise, so noise robustness seems to be part of the initial design.
As always with non-stationary noisy signals, though, any estimator will reach its limit at some point.
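For anyone curious what "background noise addition" can look like in practice, here is a minimal sketch of mixing a noise clip into a clean clip at a chosen signal-to-noise ratio. This is only an illustration of the general technique, not the paper's actual pipeline: the function names `rms` and `mix_at_snr`, the 5 dB target, and the synthetic tone and white noise are all my own assumptions. Real road or rail noise is non-stationary, which white noise is not.

```python
import math
import random

def rms(signal):
    """Root-mean-square amplitude of a sequence of samples."""
    return math.sqrt(sum(x * x for x in signal) / len(signal))

def mix_at_snr(clean, noise, snr_db):
    """Return clean + scaled noise so the added noise sits snr_db below the clean signal."""
    if len(clean) != len(noise):
        raise ValueError("clean and noise must have the same length")
    noise_rms = rms(noise)
    if noise_rms == 0:
        return list(clean)  # silent noise clip: nothing to add
    # SNR_dB = 20 * log10(rms(clean) / rms(scaled_noise)), solved for the noise level
    target_noise_rms = rms(clean) / (10 ** (snr_db / 20))
    gain = target_noise_rms / noise_rms
    return [c + gain * n for c, n in zip(clean, noise)]

# Toy data (hypothetical): a 3 kHz tone stands in for a bird call,
# seeded white noise stands in for traffic.
sample_rate = 16000
rng = random.Random(0)
clean = [math.sin(2 * math.pi * 3000 * t / sample_rate) for t in range(sample_rate)]
noise = [rng.gauss(0.0, 1.0) for _ in range(sample_rate)]
noisy = mix_at_snr(clean, noise, snr_db=5.0)
```

Training on many such mixes at varying SNRs is the usual way to make a classifier less sensitive to background noise, but as noted above, once the call is buried deep enough no amount of augmentation will recover it.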