Illa, A and Ghosh, PK (2019) An investigation on speaker specific articulatory synthesis with speaker independent articulatory inversion. In: 20th Annual Conference of the International Speech Communication Association: Crossroads of Speech and Language, INTERSPEECH 2019, 15 - 19 September 2019, Graz, pp. 121-125.
PDF
INTERSPEECH_2019.pdf - Published Version Restricted to Registered users only Download (249kB) | Request a copy |
Abstract
Estimating speech representations from articulatory movements is known as articulatory-to-acoustic forward (AAF) mapping. Typically this mapping is learned using directly measured articulatory movement in a subject-specific manner. Such AAF mapping has been shown to benefit the speech synthesis applications. In this work, we investigate the speaker similarity and naturalness of utterances generated by AAF which is driven by the articulatory movements from a subject (referred to as cross speaker) different from the speaker (target speaker) used for training AAF mapping. Experiments are performed with directly measured articulatory data from 9 speakers (8 target speakers and 1 cross speaker), which are recorded using Electromagnetic articulograph AG501. Experiments are also performed with articulatory features estimated using speaker independent acoustic-to-articulatory inversion (SI-AAI) model trained on 26 reference speakers. Objective evaluation on target speakers reveal that the articulatory features estimated from SI-AAI result in a lower Mel-cepstrum distortion compared to that using directly measured articulatory features. Further, listening tests reveal that the directly measured articulatory movements preserve the speaker similarity better than estimated ones. Although, for naturalness, articulatory movements predicted by SI-AAI perform better than the direct measurements.
Item Type: | Conference Paper |
---|---|
Publication: | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH |
Publisher: | International Speech Communication Association |
Additional Information: | The copyright for this article belongs to International Speech Communication Association. |
Keywords: | Acoustic-to-articulatory inversion; Articulatory-to-acoustic mapping; Voice conversion |
Department/Centre: | Division of Electrical Sciences > Electrical Engineering |
Date Deposited: | 05 Dec 2022 09:36 |
Last Modified: | 05 Dec 2022 09:36 |
URI: | https://eprints.iisc.ac.in/id/eprint/78250 |
Actions (login required)
View Item |