
Multicomponent 2-D AM-FM Modeling of Speech Spectrograms

Dhiman, Jitendra Kumar and Sharma, Neeraj and Seelamantula, Chandra Sekhar (2018) Multicomponent 2-D AM-FM Modeling of Speech Spectrograms. In: 19th Annual Conference of the International Speech Communication Association (INTERSPEECH 2018), 02-06 Sept., 2018, Hyderabad International Convention Centre (HICC), Hyderabad, pp. 736-740.

Interspeech 2018(1).pdf - Published Version
Restricted to Registered users only

Official URL: https://doi.org/10.21437/Interspeech.2018-1937

Abstract

In contrast to 1-D short-time analysis of speech, 2-D modeling of spectrograms characterizes speech attributes directly in the joint time-frequency plane. Building on existing 2-D models for analyzing a spectrogram patch, we propose a multicomponent 2-D AM-FM representation for spectrogram decomposition. The proposed representation comprises a DC component, a fundamental-frequency carrier and its harmonics, and a spectrotemporal envelope, all in 2-D. The number of harmonics required is patch-dependent. The AM and FM are estimated using the Riesz transform, and the component weights are estimated using a least-squares approach. The proposed representation improves over existing state-of-the-art approaches for both male and female speakers, as quantified by reconstruction SNR and the perceptual evaluation of speech quality (PESQ) metric. Further, we perform an overlap-add on the DC component, pooling all the patches, to obtain a time-frequency (t-f) aperiodicity map for the speech signal. We verify its effectiveness in improving speech synthesis quality by using it in an existing state-of-the-art vocoder.
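
The two estimation steps named in the abstract (Riesz-transform AM/FM estimation, then a least-squares fit of component weights) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names `riesz_demodulate` and `fit_component_weights`, the FFT-domain Riesz kernels, the basis `[1, A cos(k*phi)]`, and the synthetic periodic patch are all assumptions made for illustration. In this simplified form the fundamental term `A cos(phi)` reproduces the zero-mean patch by construction, so the example shows only the mechanics, not the reported gains.

```python
import numpy as np


def riesz_demodulate(patch):
    """Estimate a 2-D amplitude (AM) and folded phase from a patch.

    Hypothetical helper: applies the Riesz transform in the FFT domain
    and forms the monogenic amplitude and phase of the zero-mean patch.
    """
    f = patch - patch.mean()              # remove the DC term first
    rows, cols = f.shape
    u = np.fft.fftfreq(rows)[:, None]     # vertical frequency grid
    v = np.fft.fftfreq(cols)[None, :]     # horizontal frequency grid
    radius = np.hypot(u, v)
    radius[0, 0] = 1.0                    # avoid 0/0 at the origin
    spectrum = np.fft.fft2(f)
    # Riesz kernels: -j*u/|w| and -j*v/|w|
    r1 = np.real(np.fft.ifft2(spectrum * (-1j * u / radius)))
    r2 = np.real(np.fft.ifft2(spectrum * (-1j * v / radius)))
    quadrature = np.hypot(r1, r2)
    amplitude = np.hypot(f, quadrature)   # AM estimate
    phase = np.arctan2(quadrature, f)     # phase folded into [0, pi]
    return amplitude, phase


def fit_component_weights(patch, amplitude, phase, n_harmonics):
    """Least-squares weights for a DC term plus harmonic carriers.

    Assumed basis (illustrative only): 1, A*cos(phi), ..., A*cos(K*phi).
    """
    columns = [np.ones(patch.size)]
    for k in range(1, n_harmonics + 1):
        columns.append((amplitude * np.cos(k * phase)).ravel())
    basis = np.column_stack(columns)
    weights, *_ = np.linalg.lstsq(basis, patch.ravel(), rcond=None)
    reconstruction = (basis @ weights).reshape(patch.shape)
    return weights, reconstruction


# Synthetic, exactly periodic 64x64 patch: DC of 2.0 plus one 2-D carrier.
m, n = np.meshgrid(np.arange(64), np.arange(64), indexing="ij")
theta = 2 * np.pi * (3 * m + 5 * n) / 64
patch = 2.0 + 0.5 * np.cos(theta)

amplitude, phase = riesz_demodulate(patch)
weights, reconstruction = fit_component_weights(patch, amplitude, phase, n_harmonics=2)
```

On a real spectrogram patch the carrier is not periodic within the patch, so windowing and a patch-dependent choice of `n_harmonics` would be needed; both are outside the scope of this sketch.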

Item Type: Conference Proceedings
Series: Interspeech
Publisher: ISCA-INT SPEECH COMMUNICATION ASSOC
Additional Information: 19th Annual Conference of the International Speech Communication Association (INTERSPEECH 2018), Hyderabad, INDIA, SEP 02-06, 2018
Keywords: multicomponent 2-D AM-FM modeling; aperiodicity parameter; Riesz transform
Department/Centre: Division of Electrical Sciences > Electrical Engineering
Date Deposited: 17 Jul 2019 06:22
Last Modified: 25 Aug 2022 11:27
URI: https://eprints.iisc.ac.in/id/eprint/62915
