
Subband Weighting for Binaural Speech Source Localization

Karthik, Girija Ramesan and Suresh, Parth and Ghosh, Prasanta Kumar (2018) Subband Weighting for Binaural Speech Source Localization. In: 19th Annual Conference of the International Speech Communication Association, 2 September 2018 through 6 September 2018, International Convention Centre (HICC), Hyderabad, pp. 861-865. (In Press)

Interspeech 2018(2).pdf - Published Version
Official URL: https://doi.org/10.21437/Interspeech.2018-2173

Abstract

We consider the task of speech source localization from a binaural recording using the interaural time difference (ITD). A typical approach is to process the binaural speech with gammatone filters and calculate a frame-level ITD in each subband. During training, the ITDs in each gammatone subband are modelled statistically with Gaussian mixture models (GMMs) for every direction. Given binaural test speech, the source is localized using the maximum likelihood (ML) criterion. In this work, we propose a subband weighting scheme in which the subband likelihoods are weighted according to their reliability. We measure the reliability of a subband by the average frame-level localization error obtained for that subband. These reliability values are used as weights on the subband likelihoods before the likelihoods are combined for ML estimation. We also introduce a non-linear warping of these weights to accommodate and analyse a larger space of possible subband weights. Experiments on Subject_003 from the CIPIC database show that weighting the subbands performs better than combining the likelihoods without weights.
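
The combination step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the formula that maps localization error to a weight, nor the form of the warping function, so the inverse-error reliability and the power-law warp below are assumptions chosen only for illustration. The function names, the `warp_exponent` parameter, and the toy log-likelihood values are likewise hypothetical. The per-subband GMM log-likelihoods are taken as given inputs.

```python
import numpy as np

def reliability_weights(mean_frame_error, warp_exponent=1.0):
    """Turn per-subband average localization errors into normalized weights.

    Assumptions (not from the paper): reliability is the inverse of the
    error, and the non-linear warp is a power law. warp_exponent = 0
    gives uniform weights, i.e. the unweighted scheme.
    """
    err = np.asarray(mean_frame_error, dtype=float)
    reliability = 1.0 / (err + 1e-6)        # lower error -> higher reliability
    warped = reliability ** warp_exponent   # hypothetical warping function
    return warped / warped.sum()            # weights sum to 1

def localize(subband_loglik, weights=None):
    """Pick the direction with the largest weighted sum of log-likelihoods.

    subband_loglik has shape (n_directions, n_subbands); entry [d, b] is the
    log-likelihood of the test ITDs in subband b under direction d's GMM.
    """
    ll = np.asarray(subband_loglik, dtype=float)
    if weights is None:
        # Unweighted baseline: every subband contributes equally.
        weights = np.full(ll.shape[1], 1.0 / ll.shape[1])
    combined = ll @ np.asarray(weights, dtype=float)
    return int(np.argmax(combined))

# Toy values: 3 candidate directions, 2 subbands.
toy_loglik = np.array([[-10.0,  -1.0],
                       [ -2.0, -20.0],
                       [ -8.0,  -8.0]])
toy_weights = reliability_weights([5.0, 45.0])  # subband 0 is more reliable

baseline_direction = localize(toy_loglik)
weighted_direction = localize(toy_loglik, toy_weights)
```

In this toy case the unweighted decision is dominated by the unreliable second subband, while the weighted decision follows the more reliable first subband. Varying `warp_exponent` changes how strongly the weights favour the reliable subbands.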

Item Type: Conference Proceedings
Series: Interspeech
Publisher: ISCA-INT SPEECH COMMUNICATION ASSOC
Additional Information: 19th Annual Conference of the International Speech Communication Association (INTERSPEECH 2018), Hyderabad, India, 2-6 September 2018
Keywords: gammatone filters; interaural time difference; warping function
Department/Centre: Division of Electrical Sciences > Electrical Engineering
Date Deposited: 13 Feb 2020 11:31
Last Modified: 13 Feb 2020 11:32
URI: http://eprints.iisc.ac.in/id/eprint/62917
