Skip to main content

Research Repository

Advanced Search

Identifying bird species by their calls in Soundscapes

Maclean, Kyle; Triguero, Isaac

Identifying bird species by their calls in Soundscapes Thumbnail


Authors

Kyle Maclean



Abstract

In many real data science problems, it is common to encounter a domain mismatch between the training and testing datasets, which means that solutions designed for one may not transfer well to the other due to their differences. An example of such was in the BirdCLEF2021 Kaggle competition, where participants had to identify all bird species that could be heard in audio recordings. Thus, multi-label classifiers, capable of coping with domain mismatch, were required. In addition, classifiers needed to be resilient to a long-tailed (imbalanced) class distribution and weak labels. Throughout the competition, a diverse range of solutions based on convolutional neural networks were proposed. However, it is unclear how different solution components contribute to overall performance. In this work, we contextualise the problem with respect to the previously existing literature, analysing and discussing the choices made by the different participants. We also propose a modular solution architecture to empirically quantify the effects of different architectures. The results of this study provide insights into which components worked well for this challenge.

Citation

Maclean, K., & Triguero, I. (2023). Identifying bird species by their calls in Soundscapes. Applied Intelligence, 53, 21485-21499. https://doi.org/10.1007/s10489-023-04486-8

Journal Article Type Article
Acceptance Date Jan 23, 2023
Online Publication Date Mar 20, 2023
Publication Date 2023-10
Deposit Date Apr 3, 2023
Publicly Available Date Apr 6, 2023
Journal Applied Intelligence
Print ISSN 0924-669X
Electronic ISSN 1573-7497
Publisher Springer Verlag
Peer Reviewed Peer Reviewed
Volume 53
Pages 21485-21499
DOI https://doi.org/10.1007/s10489-023-04486-8
Keywords Multi-label classification · Signal processing · Domain mismatch · Convolutional neural networks
Public URL https://nottingham-repository.worktribe.com/output/19009871
Publisher URL https://link.springer.com/article/10.1007/s10489-023-04486-8

Files





You might also like



Downloadable Citations