ePrints@IIScePrints@IISc Home | About | Browse | Latest Additions | Advanced Search | Contact | Help

Whisper to Neutral Mapping Using I-Vector Space Likelihood and a Cosine Similarity Based Iterative Optimization for Whispered Speaker Verification

Naini, AR and Achuth Rao, MV and Ghosh, PK (2022) Whisper to Neutral Mapping Using I-Vector Space Likelihood and a Cosine Similarity Based Iterative Optimization for Whispered Speaker Verification. In: 27th National Conference on Communications, NCC 2022, 24 - 27 May 2022, Virtual, Online at Mumbai, India, pp. 130-135.

[img] PDF
NCC 2022_130-135_2022.pdf - Published Version
Restricted to Registered users only

Download (367kB) | Request a copy
Official URL: https://doi.org/10.1109/NCC55593.2022.9806732

Abstract

In this work, we propose an iterative optimization algorithm to learn a feature mapping (FM) from the whispered to neutral speech features. Such an FM can be used to improve the performance of speaker verification (SV) systems when presented with a whispered speech. In one of previous works, the equal error rate (EER) in an SV task has been shown to improve by 24. based on an FM network trained using a cosine similarity based loss function over that using a mean squared error based objective function. As the mapped whispered features obtained in this manner may not lie in the trained i-vector space, we, in this work, iteratively optimize the i-vector space likelihood (by updating T-matrix) and a cosine similarity based loss function for learning the parameters of the FM network. The proposed iterative optimization improves the EER by 26 compared to when the FM network parameters are learned based on only cosine similarity based loss function without any T-matrix update, which is a special case of the proposed iterative optimization.

Item Type: Conference Paper
Publication: 2022 National Conference on Communications, NCC 2022
Publisher: Institute of Electrical and Electronics Engineers Inc.
Additional Information: The copyright for this article belongs to the Institute of Electrical and Electronics Engineers Inc.
Keywords: Frequency modulation; Iterative methods; Mapping; Mean square error; Speech recognition, Cosine similarity; Equal error rate; Feature mapping; I vectors; Iterative Optimization; Loss functions; Neutral mappings; Speaker verification; T-matrix; Whispered speech, Vector spaces
Department/Centre: Division of Electrical Sciences > Electrical Engineering
Date Deposited: 10 Aug 2022 05:54
Last Modified: 10 Aug 2022 05:54
URI: https://eprints.iisc.ac.in/id/eprint/75787

Actions (login required)

View Item View Item