Meghanani, A and Ramakrishnan, AG (2020) Pitch-synchronous discrete cosine transform features for speaker identification and verification. In: ICPRAM 2020 - Proceedings of the 9th International Conference on Pattern Recognition Applications and Methods 2020, 22 - 24 February 2020, Valletta, Malta, pp. 395-401.
|
PDF
ICPRAM2020_1_395-401_2020.pdf - Published Version Download (375kB) | Preview |
Abstract
We propose a feature called pitch-synchronous discrete cosine transform (PS-DCT), derived from the voiced part of the speech for speaker identification (SID) and verification (SV) tasks. PS-DCT features are derived from the ‘time-domain, quasi-stationary waveform shape’ of the voiced sounds. We test our PS-DCT feature on TIMIT, Mandarin and YOHO datasets. On TIMIT with 168 and Mandarin with 855 speakers, we obtain the SID accuracies of 99.4% and 96.1%, respectively, using a Gaussian mixture model-based classifier. In the i-vector-based SV framework, fusing the ‘PS-DCT based system’ with the ‘MFCC-based system’ at the score level reduces the equal error rate (EER) for both YOHO and Mandarin datasets. In the case of limited test data and session variabilities, we obtain a significant reduction in EER, up to 5.8% (for test data of duration < 3 sec).
Item Type: | Conference Paper |
---|---|
Publication: | ICPRAM 2020 - Proceedings of the 9th International Conference on Pattern Recognition Applications and Methods |
Publisher: | SciTePress |
Additional Information: | The copyright of the article belongs to the Authors. |
Keywords: | Continuous speech recognition; Gaussian distribution; Loudspeakers; Time domain analysis, Equal error rate; Gaussian Mixture Model; MFCC; Pitch synchronous; Quasi-stationary; Speaker identification; Speaker verification; Waveform shape, Discrete cosine transforms |
Department/Centre: | Division of Electrical Sciences > Electrical Engineering |
Date Deposited: | 29 Sep 2020 11:11 |
Last Modified: | 05 Dec 2023 09:45 |
URI: | https://eprints.iisc.ac.in/id/eprint/65176 |
Actions (login required)
View Item |