Kavan Fatehi
LABERT: A Combination of Local Aggregation and Self-Supervised Speech Representation Learning for Detecting Informative Hidden Units in Low-Resource ASR Systems
Fatehi, Kavan; Kucukyilmaz, Ayse
Abstract
With advances in deep learning methodologies, Automatic Speech Recognition (ASR) systems have seen impressive results. However, ASR in Low-Resource Environments (LREs) are challenged by a lack of training data for the specific target domain. We propose that data sampling criteria for choosing more informative speech samples can be critical to addressing the problem of training data bottleneck. Our proposed Local Aggregation BERT (LABERT) method for self-supervised speech representation learning fuses an active learning model with an adapted local aggregation metric. Active learning is used to pick informative speech units, whereas the aggregation metric forces the model to move similar data together in the latent space while separating dissimilar instances to detect hidden units in LRE tasks. We evaluate LABERT with two LRE datasets: I-CUBE and UASpeech to explore the performance of our model in the LRE ASR problems.
Citation
Fatehi, K., & Kucukyilmaz, A. (2023, August). LABERT: A Combination of Local Aggregation and Self-Supervised Speech Representation Learning for Detecting Informative Hidden Units in Low-Resource ASR Systems. Presented at Interspeech 2023, Dublin, Ireland
Presentation Conference Type | Edited Proceedings |
---|---|
Conference Name | Interspeech 2023 |
Start Date | Aug 20, 2023 |
End Date | Aug 24, 2023 |
Acceptance Date | May 17, 2023 |
Online Publication Date | Aug 21, 2023 |
Publication Date | Aug 21, 2023 |
Deposit Date | Jun 22, 2023 |
Publicly Available Date | Aug 21, 2023 |
Series Title | Interspeech Conference |
Series ISSN | 1990-9772 |
Book Title | Interspeech 2023 |
Keywords | Self-Supervised Learning; BERT; Local Aggre- gation Function; Low-Resource Environment ASR |
Public URL | https://nottingham-repository.worktribe.com/output/22183323 |
Related Public URLs | https://www.isca-speech.org/archive/pdfs/interspeech_2023/fatehi23_interspeech.pdf |
Files
LABERT For INTERSPEECH 2023
(651 Kb)
PDF
You might also like
A Taxonomy of Domestic Robot Failure Outcomes: Understanding the impact of failure on trustworthiness of domestic robots
(2024)
Presentation / Conference Contribution
Charting Ethical Tensions in Multispecies Technology Research through Beneficiary-Epistemology Space
(2024)
Presentation / Conference Contribution
TAS for Cats: An Artist-led Exploration of Trustworthy Autonomous Systems for Companion Animals
(2023)
Presentation / Conference Contribution
Somabotics Toolkit for Rapid Prototyping Human-Robot Interaction Experiences using Wearable Haptics
(2023)
Presentation / Conference Contribution
Somabotics Toolkit for Rapid Prototyping Human-Robot Interaction Experiences using Wearable Haptics
(2023)
Presentation / Conference Contribution
Downloadable Citations
About Repository@Nottingham
Administrator e-mail: discovery-access-systems@nottingham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search