Kulkarni, P and Sadasivan, J and Adiga, A and Seelamantula, CS (2020) Epoch Estimation from a Speech Signal Using Gammatone Wavelets in a Scattering Network. In: 2020 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2020, 4-8, May 2020, Barcelona, Spain, pp. 7364-7368.
PDF
ica_iee_int_con_aco_spe_sig_pro_pro_7364-7368_2020.pdf - Published Version Restricted to Registered users only Download (1MB) | Request a copy |
Abstract
In speech production, epochs are glottal closure instants where significant energy is released from the lungs. Extracting an epoch accurately is important in speech synthesis, analysis, and pitch oriented studies. The time-varying characteristics of the source and the system, and channel attenuation of low-frequency components by telephone channels make estimation of epoch from a speech signal a challenging task. In this paper, we propose a new technique that employs a Gammatone wavelet filterbank and compute a scattering sequence whose local maxima define the candidate epochs in the speech signal. Results are presented for both normal and telephone channel speech by considering the differential electroglottograph from CMU-Arctic database as the ground-truth. The proposed method gives significant improvements with respect to multiple performance metrics when compared with state-of-the-art techniques for epoch estimation.
Item Type: | Conference Paper |
---|---|
Publication: | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings |
Publisher: | Institute of Electrical and Electronics Engineers Inc. |
Additional Information: | The copyright of this article belongs to Institute of Electrical and Electronics Engineers Inc. |
Department/Centre: | Division of Electrical Sciences > Electrical Engineering |
Date Deposited: | 27 Aug 2020 04:19 |
Last Modified: | 27 Aug 2020 04:19 |
URI: | http://eprints.iisc.ac.in/id/eprint/66379 |
Actions (login required)
View Item |