ePrints@IIScePrints@IISc Home | About | Browse | Latest Additions | Advanced Search | Contact | Help

Speaker conditioned acoustic-to-articulatory inversion using x-vectors

Illa, A and Ghosh, PK (2020) Speaker conditioned acoustic-to-articulatory inversion using x-vectors. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 25-29 October 2020, Shanghai; China, pp. 1376-1380.

[img]
Preview
PDF
INTERSPEECH-Vol-2020-October-1376-1380.pdf - Published Version

Download (646kB) | Preview
Official URL: https://dx.doi.org/10.21437/Interspeech.2020-1222

Abstract

Speech production involves the movement of various articulators, including tongue, jaw, and lips. Estimating the movement of the articulators from the acoustics of speech is known as acoustic-to-articulatory inversion (AAI). Recently, it has been shown that instead of training AAI in a speaker specific manner, pooling the acoustic-articulatory data from multiple speakers is beneficial. Further, additional conditioning with speaker specific information by one-hot encoding at the input of AAI along with acoustic features benefits the AAI performance in a closed-set speaker train and test condition. In this work, we carry out an experimental study on the benefit of using x-vectors for providing speaker specific information to condition AAI. Experiments with 30 speakers have shown that the AAI performance benefits from the use of x-vectors in a closed set seen speaker condition. Further, x-vectors also generalizes well for unseen speaker evaluation. Copyright © 2020 ISCA

Item Type: Conference Paper
Publication: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Publisher: International Speech Communication Association
Additional Information: cited By 0; Conference of 21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020 ; Conference Date: 25 October 2020 Through 29 October 2020; Conference Code:165507
Keywords: Petroleum reservoir evaluation; Speech communication; State assignment, Acoustic features; Articulatory data; Articulatory inversion; Closed set; Performance benefits; Speaker specific informations; Speech production; Test condition, Vectors
Department/Centre: Division of Electrical Sciences > Electrical Engineering
Date Deposited: 13 Jan 2021 08:50
Last Modified: 13 Jan 2021 08:50
URI: http://eprints.iisc.ac.in/id/eprint/67633

Actions (login required)

View Item View Item