ePrints@IIScePrints@IISc Home | About | Browse | Latest Additions | Advanced Search | Contact | Help

ESTIMATION OF THE INVARIANT AND VARIANT CHARACTERISTICS IN SPEECH ARTICULATION AND ITS APPLICATION TO SPEAKER IDENTIFICATION

Prasad, Abhay and Periyasamy, Vijitha and Ghosh, Prasanta Kumar (2015) ESTIMATION OF THE INVARIANT AND VARIANT CHARACTERISTICS IN SPEECH ARTICULATION AND ITS APPLICATION TO SPEAKER IDENTIFICATION. In: 40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), APR 19-24, 2014, Brisbane, AUSTRALIA, pp. 4265-4269.

[img] PDF
IEEE_Int_Con_Aco_Spe_Sig_Pro_4265_2015.pdf - Published Version
Restricted to Registered users only

Download (2MB) | Request a copy
Official URL: http://dx.doi.org/10.1109/ICASSP.2015.7178775

Abstract

Speech articulation varies across speakers for producing a speech sound due to the differences in their vocal tract morphologies, though the speech motor actions are executed in terms of relatively invariant gestures 1]. While the invariant articulatory gestures are driven by the linguistic content of the spoken utterance, the component of speech articulation that varies across speakers reflects speaker-specific and other paralinguistic information. In this work, we present a formulation to decompose the speech articulation from multiple speakers into the variant and invariant aspects when they speak the same sentence. The variant component is found to be a better representation for discriminating speakers compared to the speech articulation which includes the invariant part. Experiments with real-time magnetic resonance imaging (rtMRI) videos of speech production from multiple speakers reveal that the variant component of speech articulation yields a better frame-level speaker identification accuracy compared to the speech articulation as well as acoustic features by 29.9% and 9.4% (absolute) respectively.

Item Type: Conference Proceedings
Series.: International Conference on Acoustics Speech and Signal Processing ICASSP
Publisher: IEEE
Additional Information: Copy right for this article belongs to the IEEE, 345 E 47TH ST, NEW YORK, NY 10017 USA
Keywords: speech articulation; invariant gestures; speaker identification
Department/Centre: Division of Electrical Sciences > Electrical Engineering
Date Deposited: 19 Aug 2016 09:36
Last Modified: 19 Aug 2016 09:36
URI: http://eprints.iisc.ac.in/id/eprint/54295

Actions (login required)

View Item View Item