Kumar, A and Guha, T and Ghosh, P (2018) A Dynamic Latent Variable Model for Source Separation. In: 2018 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2018, 15- 20 April 2018, Calgary, Canada, pp. 2871-2875.
|
PDF
iee_2018_ica_2018_2871-287_2018.pdf - Published Version Download (639kB) | Preview |
Abstract
We propose a novel latent variable model for learning latent bases for time-varying non-negative data. Our model uses a mixture multinomial as the likelihood function and proposes a Dirichlet distribution with dynamic parameters as a prior, which we call the dynamic Dirichlet prior. An expectation maximization (EM) algorithm is developed for estimating the parameters of the proposed model. Furthermore, we connect our proposed dynamic Dirichlet latent variable model (dynamic DLVM) to the two popular latent basis learning methods - probabilistic latent component analysis (PLCA) and non-negative matrix factorization (NMF). We show that (i) PLCA is a special case of the dynamic DLVM, and (ii) dynamic DLVM can be interpreted as a dynamic version of NMF. The effectiveness of the proposed model is demonstrated through extensive experiments on speaker source separation, and speech-noise separation. In both cases, our method performs better than relevant and competitive baselines. For speaker separation, dynamic DLVM shows 1.38 dB improvement in terms of source to interference ratio, and 1 dB improvement in source to artifact ratio. © 2018 IEEE.
Item Type: | Conference Paper |
---|---|
Publication: | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings |
Publisher: | Institute of Electrical and Electronics Engineers Inc. |
Additional Information: | The copyright for this article belongs to the Author. |
Department/Centre: | Division of Electrical Sciences > Electrical Engineering |
Date Deposited: | 27 Aug 2022 09:07 |
Last Modified: | 27 Aug 2022 09:07 |
URI: | https://eprints.iisc.ac.in/id/eprint/76244 |
Actions (login required)
View Item |