
JOINT DICTIONARY TRAINING FOR BANDWIDTH EXTENSION OF SPEECH SIGNALS

Sadasivan, Jishnu and Mukherjee, Subhadip and Seelamantula, Chandra Sekhar (2016) Joint Dictionary Training for Bandwidth Extension of Speech Signals. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, March 20-25, 2016, Shanghai, People's Republic of China, pp. 5925-5929.

IEEE_Int_Con_Aco_Spe_Sig_Pro_Pro_5925_2016.pdf - Published Version
Restricted to Registered users only

Official URL: http://dx.doi.org/10.1109/ICASSP.2016.7472814

Abstract

We address the problem of extending the bandwidth of speech signals, which is important for enhancing the quality and intelligibility of telephone speech. The low-pass filtering effect of telephone communication channels eliminates the high-frequency components of the speech signal, and these components must be recovered to maintain speech quality. We adopt a joint-dictionary training approach to recover the missing spectral information. By exploiting the sparsity of the spectrogram frames, the dictionaries for the wide-band (WB) and the corresponding narrow-band (NB) spectrogram frames are trained in a coupled manner to learn the mapping from NB to WB frames. We refer to this approach as joint dictionary training for bandwidth extension (JDTBE). To ensure that the reconstructed bandwidth-extended speech is consistent with the measurement, we propose applying a suitable affine transformation that depends on the properties of the telephone channel. We study the effect of the choice of sparsity on the quality of the reconstructed speech for both male and female speakers. A comparison of the proposed JDTBE algorithm with a bandwidth extension technique based on stochastic modeling shows that JDTBE achieves better subjective listening test scores.
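
The coupled-dictionary idea in the abstract can be illustrated with a minimal sketch. The abstract does not name the training algorithm, the sparse solver, or the form of the affine consistency step, so everything here is an assumption made for illustration: the dictionary update uses the method of optimal directions (MOD), the sparse coding uses a greedy orthogonal matching pursuit (OMP), the affine transformation is omitted, and all function names (`omp`, `train_joint_dictionary`, `extend_bandwidth`) are hypothetical. Frames are stored one per column.

```python
import numpy as np

def omp(D, y, k):
    """Greedy OMP: approximate y using at most k columns (atoms) of D."""
    residual = y.copy()
    support = []
    coef = np.zeros(D.shape[1])
    for _ in range(k):
        # Select the atom most correlated with the current residual.
        idx = int(np.argmax(np.abs(D.T @ residual)))
        if idx in support:
            break
        support.append(idx)
        # Refit all selected atoms by least squares.
        sol, *_ = np.linalg.lstsq(D[:, support], y, rcond=None)
        residual = y - D[:, support] @ sol
    if support:
        coef[support] = sol
    return coef

def train_joint_dictionary(X_nb, X_wb, n_atoms, k, n_iter=10, seed=0):
    """Learn coupled NB/WB dictionaries from paired frames (assumed MOD update)."""
    rng = np.random.default_rng(seed)
    Z = np.vstack([X_nb, X_wb])  # stack each NB frame on top of its WB frame
    # Initialise atoms from randomly chosen training columns (needs n_atoms <= frames).
    D = Z[:, rng.choice(Z.shape[1], n_atoms, replace=False)].copy()
    D /= np.linalg.norm(D, axis=0) + 1e-12
    for _ in range(n_iter):
        # Sparse-code every stacked frame with the current joint dictionary.
        A = np.column_stack([omp(D, z, k) for z in Z.T])
        # MOD step: least-squares dictionary for the current codes.
        D = Z @ np.linalg.pinv(A)
        D /= np.linalg.norm(D, axis=0) + 1e-12
    n_nb = X_nb.shape[0]
    return D[:n_nb], D[n_nb:]  # NB and WB sub-dictionaries share one code

def extend_bandwidth(D_nb, D_wb, nb_frame, k):
    """Code an NB frame on D_nb, then synthesise the WB frame with D_wb."""
    norms = np.linalg.norm(D_nb, axis=0) + 1e-12
    # OMP expects unit-norm atoms; rescale the code back afterwards.
    coef = omp(D_nb / norms, nb_frame, k) / norms
    return D_wb @ coef
```

In this sketch the sparsity level `k` plays the role of the sparsity choice studied in the abstract. A caller would pass paired NB and WB spectrogram frames to `train_joint_dictionary` and then apply `extend_bandwidth` to each NB frame of a test utterance.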

Item Type: Conference Proceedings
Series: International Conference on Acoustics Speech and Signal Processing ICASSP
Additional Information: Copyright for this article belongs to the IEEE, 345 E 47th St, New York, NY 10017, USA
Department/Centre: Division of Electrical Sciences > Electrical Communication Engineering
Division of Electrical Sciences > Electrical Engineering
Date Deposited: 20 Jan 2017 04:29
Last Modified: 20 Jan 2017 04:29
URI: http://eprints.iisc.ac.in/id/eprint/55938
