ePrints@IIScePrints@IISc Home | About | Browse | Latest Additions | Advanced Search | Contact | Help

Multivariate Autoregressive Spectrogram Modeling for Noisy Speech Recognition

Ganapathy, Sriram (2017) Multivariate Autoregressive Spectrogram Modeling for Noisy Speech Recognition. In: IEEE SIGNAL PROCESSING LETTERS, 24 (9). pp. 1373-1377.

[img] PDF
IEEE_Sig_Pro_Let_24-9_1373_2017.pdf - Published Version
Restricted to Registered users only

Download (371kB) | Request a copy
Official URL: http://doi.org/10.1109/LSP.2017.2724561


The performance of an automatic speech recognition (ASR) system is highly degraded in the presence of noise and reverberation. The autoregressive (AR) modeling approach, which preserves the high energy regions of the signal that are less susceptible to noise, first, presents a potential method for robust feature extraction. Second, there are strong correlations in the spectrotemporal domain of the speech signal, which are generally absent in noise. In this letter, we propose a novel method for speech feature extraction, which combines the advantages of AR approach and joint time-frequency processing using the multivariate AR modeling (MAR). Specifically, the subband discrete cosine transform coefficients obtained from multiple speech bands are used in the MAR framework to derive the Riesz temporal envelopes that provide features for ASR. We perform several speech recognition experiments in the Aurora-4 database with clean and multicondition training. In these experiments, the proposed features provide significant improvements over other noise robust feature extraction methods (relative improvements of 24% in clean training and 14% in multicondition training over mel features). Furthermore, the speech recognition experiments in REVERB challenge database illustrates the extension of the MAR modeling method for suppressing reverberant artifacts.

Item Type: Journal Article
Additional Information: Copy right for this article belongs to the IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC, 445 HOES LANE, PISCATAWAY, NJ 08855-4141 USA
Department/Centre: Division of Electrical Sciences > Electrical Engineering
Date Deposited: 01 Sep 2017 06:01
Last Modified: 01 Sep 2017 06:01
URI: http://eprints.iisc.ac.in/id/eprint/57686

Actions (login required)

View Item View Item