ePrints@IIScePrints@IISc Home | About | Browse | Latest Additions | Advanced Search | Contact | Help

Speech Enhancement Using the Minimum-Probability-of-Error Criterion

Sadasivan, Jishnu and Mukherjee, Subhadip and Seelamantula, Chandra Sekhar (2018) Speech Enhancement Using the Minimum-Probability-of-Error Criterion. In: 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018;, 2-6th September, 2018, 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018;, pp. 1141-1145.

[img] PDF
Interspeech(11)(2018.pdf - Published Version
Restricted to Registered users only

Download (836kB) | Request a copy
Official URL: https://dx.doi.org/10.21437/Interspeech.2018-1294

Abstract

We propose a novel speech denoising framework by minimizing the probability of error (PE), which measures the deviation probability of the estimate from its true value. To develop the minimum PE (MPE) criterion, one requires the knowledge of the noise probability density function (p.d.f.), which may not be available in a parametric form in speech denoising applications. Therefore, we adopt two approaches for modeling the noise p.d.f.: (i) Gaussian modeling based on adaptive variance estimation; and (ii) a Gaussian mixture model (GMM) in view of its approximation capabilities. We consider discrete cosine transform (DCT) domain shrinkage, where the optimum shrinkage parameter is obtained by minimizing an estimate of the PE. A performance assessment for real-world noise types shows that for input signal-to-noise ratios (SNR) greater than 5 dB, the proposed MPE-based point-wise shrinkage estimators outperform three benchmark techniques in terms of segmental SNR and short-time objective intelligibility (STOI) scores.

Item Type: Conference Proceedings
Series.: Interspeech
Publisher: ISCA-INT SPEECH COMMUNICATION ASSOC
Additional Information: 19th Annual Conference of the International-Speech-Communication-Association (INTERSPEECH 2018), Hyderabad, INDIA, AUG 02-SEP 06, 2018
Keywords: Minimum probability of error; Speech denoising; Gaussian mixture model; point-wise shrinkage estimator
Department/Centre: Division of Electrical Sciences > Electrical Communication Engineering
Division of Electrical Sciences > Electrical Engineering
Date Deposited: 10 Jun 2020 07:00
Last Modified: 10 Jun 2020 07:00
URI: http://eprints.iisc.ac.in/id/eprint/62920

Actions (login required)

View Item View Item