ePrints@IIScePrints@IISc Home | About | Browse | Latest Additions | Advanced Search | Contact | Help

NISP: A Multi-lingual Multi-accent Dataset for Speaker Profiling

Kalluri, SB and Vijayasenan, D and Ganapathy, S and Ragesh Rajan, M and Krishnan, P (2021) NISP: A Multi-lingual Multi-accent Dataset for Speaker Profiling. In: International Conference on Acoustics, Speech and Signal Processing - Proceedings, 06 - 11 June 2021, Toronto, pp. 6953-6957.

[img]
Preview
PDF
ICASSP_2021.pdf - Published Version

Download (772kB) | Preview
Official URL: https://doi.org/10.1109/ICASSP39728.2021.9414349

Abstract

Many commercial and forensic applications of speech demand the extraction of information about the speaker characteristics, which falls into the broad category of speaker profiling. The speaker characteristics needed for profiling include physical traits of the speaker like height, age, and gender of the speaker along with the native language of the speaker. Many of the datasets available have only partial information for speaker profiling. In this paper, we attempt to overcome this limitation by developing a new dataset which has speech data from five different Indian languages along with English. The metadata information for speaker profiling applications like linguistic information, regional information, and physical characteristics of a speaker are also collected. We call this dataset as NITK-IISc Multilingual Multi-accent Speaker Profiling (NISP) dataset. The description of the dataset, potential applications, and baseline results for speaker profiling on this dataset are provided in this paper.

Item Type: Conference Paper
Publication: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Publisher: Institute of Electrical and Electronics Engineers Inc.
Additional Information: The copyright for this article belongs to Institute of Electrical and Electronics Engineers.
Keywords: NISP datas et; Physical parameters; Speaker profiling; Voice forensics Extraction of information; Forensic applications; Linguistic information; Metadata information; Physical characteristics; Profiling application; Regional information; Speaker characteristics Linguistics
Department/Centre: Division of Electrical Sciences > Electronic Systems Engineering (Formerly Centre for Electronic Design & Technology)
Division of Electrical Sciences > Electrical Communication Engineering
Division of Electrical Sciences > Electrical Communication Engineering > Electrical Communication Engineering - Technical Reports
Division of Electrical Sciences > Electrical Engineering
Date Deposited: 06 Jun 2023 10:13
Last Modified: 06 Jun 2023 10:13
URI: https://eprints.iisc.ac.in/id/eprint/81819

Actions (login required)

View Item View Item