Mondal, Bishwarup and Sreenivas, TV (2001) Mixture Gaussian Envelope Chirp Model for Speech and Audio. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '01, 7-11 May, Utah,USA, Vol.2, 857-860.
|
PDF
mixture.pdf Download (407kB) |
Abstract
We develop a parametric sinusoidal analysis/synthesis model which can be applied to both speech and audio signals. These signals are characterised by large amplitude variations and small frequency variation within a short analysis frame. The model comprises of a Gaussian mixture representation for the envelope and a sum of linear chirps for the frequency components. A closed form solution is derived for the frequency domain parameters of a chirp with Gaussian-mixture envelope, based on the spectral moments. An iterative algorithm is developed to select and estimate prominent chirps based on the psycho-acoustic masking threshold. The model can adaptively select the number of time-domain and frequency-domain parameters to suit a particular type of signal. Experimental evaluation of the technique has shown that about 2 to 4 parameters/ms is sufficient for near transparent quality reconstruction of a variety of wide-band music and speech signals.
Item Type: | Conference Paper |
---|---|
Publisher: | IEEE |
Additional Information: | Copyright 1990 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. |
Department/Centre: | Division of Electrical Sciences > Electrical Communication Engineering |
Date Deposited: | 14 Feb 2006 |
Last Modified: | 19 Sep 2010 04:23 |
URI: | http://eprints.iisc.ac.in/id/eprint/5334 |
Actions (login required)
View Item |