ePrints@IIScePrints@IISc Home | About | Browse | Latest Additions | Advanced Search | Contact | Help

Binaural speech source localization using template matching of interaural time difference patterns

Karthik, GR and Ghosh, PK (2018) Binaural speech source localization using template matching of interaural time difference patterns. In: 2018 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2018, 15 - 20 April 2018, Calgary, pp. 5164-5168.

[img] PDF
ICASSP 2018_2018_5164-5168_2018.pdf - Published Version
Restricted to Registered users only

Download (1MB) | Request a copy
Official URL: https://doi.org/10.1109/ICASSP.2018.8462586

Abstract

In this paper we present a template based algorithm for localizing speech sources from a binaural recording. Binaural recordings are associated with head related transfer functions (HRTFs) for each direction which are specific to the object, say head, in between the two microphones. So, using these HRTFs and time-frequency representations of the binaural signals, we learn direction specific two dimensional reference templates using histograms of interaural time difference (ITD) in each frequency subband. These are called ITD pattern templates (IPTs). Test templates are then compared with each of the reference IPTs. The reference IPT, that matches best with the test template, provides the estimated direction of arrival for the test speech source. Experimental results obtained using subject-003 from the CIPIC database show that IPT based localization performs better than existing methods where the ITD distribution is modeled using Gaussian mixture model. Given n time-frequency points, we also present a method with complexity O(n) to compute the IPT, thus making it computationally efficient.

Item Type: Conference Paper
Publication: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Publisher: Institute of Electrical and Electronics Engineers Inc.
Additional Information: The copyright for this article belongs to the Institute of Electrical and Electronics Engineers Inc.
Keywords: Binaural localization; Gammatone filters; Interaural time difference
Department/Centre: Division of Electrical Sciences > Electrical Engineering
Date Deposited: 05 Aug 2022 10:48
Last Modified: 05 Aug 2022 10:48
URI: https://eprints.iisc.ac.in/id/eprint/75381

Actions (login required)

View Item View Item