ePrints@IIScePrints@IISc Home | About | Browse | Latest Additions | Advanced Search | Contact | Help

A spectro-temporal demodulation technique for pitch estimation

Dhiman, JK and Adiga, N and Seelamantula, CS (2017) A spectro-temporal demodulation technique for pitch estimation. In: 18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017, 20 - 24 August 2017, Stockholm, pp. 2306-2310.

Full text not available from this repository.
Official URL: https://10.21437/Interspeech.2017-1138


We consider a two-dimensional demodulation framework for spectro-temporal analysis of the speech signal. We construct narrowband (NB) speech spectrograms, and demodulate them using the Riesz transform, which is a two-dimensional extension of the Hilbert transform. The demodulation results in timefrequency envelope (amplitude modulation or AM) and timefrequency carrier (frequency modulation or FM). The AM corresponds to the vocal tract and is referred to as the vocal tract spectrogram. The FM corresponds to the underlying excitation and is referred to as the carrier spectrogram. The carrier spectrogram exhibits a high degree of time-frequency consistency for voiced sounds. For unvoiced sounds, such a structure is lacking. In addition, the carrier spectrogram reflects the fundamental frequency (F0) variation of the speech signal. We develop a technique to determine the F0 from the carrier spectrogram. The time-frequency consistency is used to determine which time-frequency regions correspond to voiced segments. Comparisons with the state-of-the-art F0 estimation algorithms show that the proposed F0 estimator has high accuracy for telephone channel speech and is robust to noise.

Item Type: Conference Paper
Publication: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Publisher: International Speech Communication Association
Additional Information: The copyright for this article belongs to International Speech Communication Association
Keywords: Amplitude modulation; Demodulation; Frequency estimation; Frequency modulation; Mathematical transformations; Modulation; Natural frequencies; Optical variables measurement; Spectrographs; Speech, Demodulation techniques; Fundamental frequencies; Hilbert transform; Pitch estimation; Riesz transform; State of the art; Temporal analysis; Time frequency, Speech communication
Department/Centre: Division of Electrical Sciences > Electrical Engineering
Date Deposited: 25 Jul 2022 05:22
Last Modified: 25 Jul 2022 05:22
URI: https://eprints.iisc.ac.in/id/eprint/74715

Actions (login required)

View Item View Item