Raj, RGP and Kumar, R and Jayesh, MK and Purushothaman, A and Ganapathy, S and Shaik, MAB (2021) Srib-leap submission to far-field multi-channel speech enhancement challenge for video conferencing. In: 22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021, 30 Aug - 03 Sep 2021, Brno, pp. 326-330.
PDF
INTERSPEECH_2021.pdf - Published Version Restricted to Registered users only Download (311kB) | Request a copy |
Abstract
This paper presents the details of the SRIB-LEAP submission to the ConferencingSpeech challenge 2021. The challenge involved the task of multi-channel speech enhancement to improve the quality of far field speech from microphone arrays in a video conferencing room. We propose a two stage method involving a beamformer followed by single channel enhancement. For the beamformer, we incorporated self-attention mechanism as inter-channel processing layer in the filter-and-sum network (FaSNet), an end-to-end time-domain beamforming system. The single channel speech enhancement is done in log spectral domain using convolution neural network (CNN)-long short term memory (LSTM) based architecture. We achieved improvements in objective quality metrics - perceptual evaluation of speech quality (PESQ) of 0:5 on the noisy data. On subjective quality evaluation, the proposed approach improved the mean opinion score (MOS) by an absolute measure of 0:9 over the noisy audio. Copyright © 2021 ISCA.
Item Type: | Conference Paper |
---|---|
Publication: | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH |
Publisher: | International Speech Communication Association |
Additional Information: | The copyright for this article belongs to International Speech Communication Association |
Department/Centre: | Division of Electrical Sciences > Electrical Engineering |
Date Deposited: | 03 Dec 2021 08:50 |
Last Modified: | 03 Dec 2021 08:50 |
URI: | http://eprints.iisc.ac.in/id/eprint/70636 |
Actions (login required)
View Item |