ePrints@IIScePrints@IISc Home | About | Browse | Latest Additions | Advanced Search | Contact | Help

FACTOR ANALYSIS METHODS FOR JOINT SPEAKER VERIFICATION AND SPOOF DETECTION

Dhanush, BK and Suparna, S and Aarthy, R and Likhita, C and Shashank, D and Harish, H and Ganapathy, Sriram (2017) FACTOR ANALYSIS METHODS FOR JOINT SPEAKER VERIFICATION AND SPOOF DETECTION. In: IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), MAR 05-09, 2017, New Orleans, LA, pp. 5385-5389.

[img] PDF
Iee_Int_Con_5385_2017.pdf - Published Version
Restricted to Registered users only

Download (231kB) | Request a copy
Official URL: http://dx.doi.org/10.1109/ICASSP.2017.7953185

Abstract

The performance of a speaker verification system is severely degraded by spoofing attacks generated from artificial speech synthesizers. Recently, several approaches have been proposed for classifying natural and synthetic speech (spoof detection) which can be used in conjunction with a speaker verification system. In this paper, we attempt to develop a joint modelling approach which can detect the presence of spoofing attacks while also performing the speaker verification task. We propose a factor modelling approach where the spoof variability subspace and the speaker variability subspace are jointly trained. The lower dimensional projections in these sub-spaces are used for speaker verification as well as spoof detection tasks. We also investigate the benefits of linear discriminant analysis (LDA), widely used in speaker recognition, for the spoof detection task. Several experiments are performed using the speaker and spoofing (SAS) database. For speaker verification, we compare the performance of the proposed method with a baseline method of fusing a conventional speaker verification system and a spoof detection system. In these experiments, the proposed approach provides substantial improvements for spoof detection (relative improvements of 20% in EER over the baseline) as well as speaker verification under spoofing conditions (relative improvements of 40% in EER over the baseline).

Item Type: Conference Paper
Series.: International Conference on Acoustics Speech and Signal Processing ICASSP
Publisher: 10.1109/ICASSP.2017.7953185
Additional Information: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), New Orleans, LA, MAR 05-09, 2017 Copy right for this article belongs to the IEEE, 345 E 47TH ST, NEW YORK, NY 10017 USA
Department/Centre: Division of Electrical Sciences > Electrical Engineering
Date Deposited: 20 Jan 2018 05:47
Last Modified: 20 Jan 2018 05:47
URI: http://eprints.iisc.ac.in/id/eprint/58842

Actions (login required)

View Item View Item