
Dirichlet Latent Variable Model: A Dynamic Model Based on Dirichlet Prior for Audio Processing

Kumar, Anurendra and Guha, Tanaya and Ghosh, Prasanta Kumar (2019) Dirichlet Latent Variable Model: A Dynamic Model Based on Dirichlet Prior for Audio Processing. In: IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 27 (5). pp. 919-931.

Iee_ACM_Tra_Aud_Spe_lan_Pro_27-5_2019.pdf - Published Version
Restricted to Registered users only

Official URL: https://doi.org/10.1109/TASLP.2019.2903288

Abstract

We propose a dynamic latent variable model for learning latent bases from time-varying, non-negative data. We take a probabilistic approach to modeling the temporal dependence in data by introducing a dynamic Dirichlet prior, that is, a Dirichlet distribution with dynamic parameters. This new distribution allows us to assure non-negativity and avoid intractability when sequential updates are performed (otherwise encountered in using Dirichlet prior). We refer to the proposed model as the Dirichlet latent variable model (DLVM). We develop an expectation maximization algorithm for the proposed model, and also derive a maximum a posteriori estimate of the parameters. Furthermore, we connect the proposed DLVM to two popular latent basis learning methods: probabilistic latent component analysis (PLCA) and non-negative matrix factorization (NMF). We show that 1) PLCA is a special case of our DLVM, and 2) DLVM can be interpreted as a dynamic version of NMF. The usefulness of DLVM is demonstrated for three audio processing applications: speaker source separation, denoising, and bandwidth expansion. To this end, a new algorithm for source separation is also proposed. Through extensive experiments on benchmark databases, we show that the proposed model outperforms several relevant existing methods in all three applications.

Item Type: Journal Article
Publication: IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING
Publisher: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
Additional Information: Copyright of this article belongs to IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
Keywords: Latent variable model; Dirichlet distribution; time varying; non negative; NMF; exponential family distributions
Department/Centre: Division of Electrical Sciences > Electrical Engineering
Date Deposited: 24 May 2019 13:07
Last Modified: 24 May 2019 13:07
URI: http://eprints.iisc.ac.in/id/eprint/62534
