Kyle Maclean
Identifying bird species by their calls in Soundscapes
Maclean, Kyle; Triguero, Isaac
Abstract
In many real data science problems, it is common to encounter a domain mismatch between the training and testing datasets, which means that solutions designed for one may not transfer well to the other due to their differences. An example of such was in the BirdCLEF2021 Kaggle competition, where participants had to identify all bird species that could be heard in audio recordings. Thus, multi-label classifiers, capable of coping with domain mismatch, were required. In addition, classifiers needed to be resilient to a long-tailed (imbalanced) class distribution and weak labels. Throughout the competition, a diverse range of solutions based on convolutional neural networks were proposed. However, it is unclear how different solution components contribute to overall performance. In this work, we contextualise the problem with respect to the previously existing literature, analysing and discussing the choices made by the different participants. We also propose a modular solution architecture to empirically quantify the effects of different architectures. The results of this study provide insights into which components worked well for this challenge.
Citation
Maclean, K., & Triguero, I. (2023). Identifying bird species by their calls in Soundscapes. Applied Intelligence, 53, 21485-21499. https://doi.org/10.1007/s10489-023-04486-8
Journal Article Type | Article |
---|---|
Acceptance Date | Jan 23, 2023 |
Online Publication Date | Mar 20, 2023 |
Publication Date | 2023-10 |
Deposit Date | Apr 3, 2023 |
Publicly Available Date | Apr 6, 2023 |
Journal | Applied Intelligence |
Print ISSN | 0924-669X |
Electronic ISSN | 1573-7497 |
Publisher | Springer Verlag |
Peer Reviewed | Peer Reviewed |
Volume | 53 |
Pages | 21485-21499 |
DOI | https://doi.org/10.1007/s10489-023-04486-8 |
Keywords | Multi-label classification · Signal processing · Domain mismatch · Convolutional neural networks |
Public URL | https://nottingham-repository.worktribe.com/output/19009871 |
Publisher URL | https://link.springer.com/article/10.1007/s10489-023-04486-8 |
Files
s10489-023-04486-8
(1.7 Mb)
PDF
Publisher Licence URL
https://creativecommons.org/licenses/by/4.0/
You might also like
Machine Learning Pipeline for Energy and Environmental Prediction in Cold Storage Facilities
(2024)
Journal Article
Local-global methods for generalised solar irradiance forecasting
(2024)
Journal Article
Hyper-Stacked: Scalable and Distributed Approach to AutoML for Big Data
(2023)
Presentation / Conference Contribution
Explaining time series classifiers through meaningful perturbation and optimisation
(2023)
Journal Article
Downloadable Citations
About Repository@Nottingham
Administrator e-mail: discovery-access-systems@nottingham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search