Siyang Song
Noise Invariant Frame Selection: A Simple Method to Address the Background Noise Problem for Text-independent Speaker Verification
Song, Siyang; Zhang, Shuimei; Schuller, Björn W.; Shen, Linlin; Valstar, Michel
Authors
Shuimei Zhang
Björn W. Schuller
Linlin Shen
Michel Valstar
Abstract
The performance of speaker-related systems usually degrades heavily in practical applications largely due to the presence of background noise. To improve the robustness of such systems in unknown noisy environments, this paper proposes a simple pre-processing method called Noise Invariant Frame Selection (NIFS). Based on several noisy constraints, it selects noise invariant frames from utterances to represent speakers. Experiments conducted on the TIMIT database showed that the NIFS can significantly improve the performance of Vector Quantization (VQ), Gaussian Mixture Model-Universal Background Model (GMM-UBM) and i-vector-based speaker verification systems in different unknown noisy environments with different SNRs, in comparison to their baselines. Meanwhile, the proposed NIFS-based speaker verification systems achieves similar performance when we change the constraints (hyper-parameters) or features, which indicates that it is robust and easy to reproduce. Since NIFS is designed as a general algorithm, it could be further applied to other similar tasks.
Citation
Song, S., Zhang, S., Schuller, B. W., Shen, L., & Valstar, M. (2018). Noise Invariant Frame Selection: A Simple Method to Address the Background Noise Problem for Text-independent Speaker Verification. In Proceedings of the 2018 International Joint Conference on Neural Networks (IJCNN). https://doi.org/10.1109/IJCNN.2018.8489497
Conference Name | International Joint Conference on Neural Networks 2018 |
---|---|
Conference Location | Rio de Janeiro, Brazil |
Start Date | Jul 8, 2018 |
End Date | Jul 13, 2018 |
Acceptance Date | Mar 15, 2018 |
Online Publication Date | Oct 14, 2018 |
Publication Date | 2018 |
Deposit Date | Apr 30, 2018 |
Publicly Available Date | Mar 29, 2024 |
Peer Reviewed | Peer Reviewed |
Series ISSN | 2161-4407 |
Book Title | Proceedings of the 2018 International Joint Conference on Neural Networks (IJCNN) |
ISBN | 978-1-5090-6015-3 |
DOI | https://doi.org/10.1109/IJCNN.2018.8489497 |
Public URL | https://nottingham-repository.worktribe.com/output/945806 |
Publisher URL | https://ieeexplore.ieee.org/document/8489497 |
Related Public URLs | http://www.ecomp.poli.br/~wcci2018/ |
Additional Information | © 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. |
Files
noise-invariant-frame.pdf
(413 Kb)
PDF
You might also like
Identifying Variation in the Newborn Life Support Procedure: An Automated Method
(2023)
Conference Proceeding
A modelling approach to studying variations in newborn life support procedure
(2023)
Journal Article
Benchmarking framework for machine learning classification from fNIRS data
(2023)
Journal Article
Design and Evaluation of Virtual Human Mediated Tasks for Assessment of Depression and Anxiety
(2021)
Conference Proceeding
Designing an Adaptive Embodied Conversational Agent for Health Literacy: a User Study
(2021)
Conference Proceeding
Downloadable Citations
About Repository@Nottingham
Administrator e-mail: digital-library-support@nottingham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search