Afshan, Amber and Ghosh, Prasanta Kumar (2016) Better acoustic normalization in subject independent acoustic-to-articulatory inversion: benefit to recognition. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, MAR 20-25, 2016, Shanghai, PEOPLES R CHINA, pp. 5395-5399.
PDF
IEEE_Int_Con_Aco_Spe_Sig_Pro_Pro_5395_2016.pdf - Published Version Restricted to Registered users only Download (273kB) | Request a copy |
Abstract
In subject independent acoustic-to-articulatory inversion (SII), the training and test subjects are in general different, whereas subject dependent inversion (SDI) uses the same training and test subjects. Thus, acoustic normalization is used to compensate for the mismatch between the training and the test subjects in SII. We show that a better acoustic normalization not only results in better articulatory estimates using SII, but also improves the broad class phonetic recognition accuracy, when the articulatory features estimated from SII are used for recognition. Recognition experiments using male and female subjects from the MOCHA-TIMIT corpus also show that there is no significant difference between the recognition accuracy using the articulatory features obtained by the best acoustic normalization in SII and that obtained using SDI as well as directly measured articulatory features.
Item Type: | Conference Proceedings |
---|---|
Series.: | International Conference on Acoustics Speech and Signal Processing ICASSP |
Additional Information: | Copy right for this article belongs to the IEEE, 345 E 47TH ST, NEW YORK, NY 10017 USA |
Department/Centre: | Division of Electrical Sciences > Electrical Engineering |
Date Deposited: | 20 Jan 2017 04:28 |
Last Modified: | 20 Jan 2017 04:28 |
URI: | http://eprints.iisc.ac.in/id/eprint/55936 |
Actions (login required)
View Item |