ePrints@IIScePrints@IISc Home | About | Browse | Latest Additions | Advanced Search | Contact | Help

Voistutor 2.0: A Speech Corpus with Phonetic Transcription for Pronunciation Evaluation of Indian L2 English Learners

Pal, P and Yarra, C and Ghosh, PK (2022) Voistutor 2.0: A Speech Corpus with Phonetic Transcription for Pronunciation Evaluation of Indian L2 English Learners. In: 2022 25th Conference of the Oriental COCOSDA International Committee for the Co-Ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2022 - Proceedings, 24 - 26 November 2022, Hanoi.

[img] PDF
COCOSDA_2022.pdf - Published Version
Restricted to Registered users only

Download (385kB) | Request a copy
Official URL: https://doi.org/10.1109/O-COCOSDA202257103.2022.99...

Abstract

In computer assisted pronunciation training (CAPT), robust automatic models are critical for pronunciation assessment and mispronunciation detection and diagnosis (MDD). In the modelling, besides the audio data of second language (L2) learners, CAPT requires manually annotated ratings of overall pronunciation quality, and the MDD uses manually annotated phonetic transcriptions. Though the pronunciation quality and the mispronunciation are interdependent, to the best of our knowledge, none of the existing corpora contains both ratings and phonetic transcriptions. This could be due to the cost involved in obtaining phonetic transcriptions. However, a corpus with both kinds of information could benefit the researchers to obtain robust models by exploring the interdependencies. For addressing this, we develop voisTUTOR 2.0 corpus considering the existing voisTUTOR corpus referred to as voisTUTOR 1.0. We obtain phonetic transcriptions manually from a linguist for the entire Indian L2 learners' English audio data (26529 utterances approximately 14 hours) in voisTUTOR 1.0 for which overall quality ratings and binary scores of factors influencing the pronunciation quality are available. A preliminary analysis of voisTUTOR 2.0 suggests that the phonetic errors correlated with the ratings and the binary scores indicating mispronunciations and phoneme quality. © 2022 IEEE.

Item Type: Conference Paper
Publication: 2022 25th Conference of the Oriental COCOSDA International Committee for the Co-Ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2022 - Proceedings
Publisher: Institute of Electrical and Electronics Engineers Inc.
Additional Information: The copyright for this article belongs to Institute of Electrical and Electronics Engineers Inc.
Keywords: Computer aided analysis; Computer aided diagnosis; Computer aided instruction; Linguistics; Modeling languages, Audio data; Computer assisted; Detection and diagnosis; L2-english corpus; Mispronunciation detection and diagnose; Mispronunciation detections; Phonetic transcriptions; Pronunciation quality; Pronunciation trainings; Speech corpora, Quality control
Department/Centre: Division of Electrical Sciences > Electrical Engineering
Date Deposited: 09 Feb 2023 11:20
Last Modified: 09 Feb 2023 11:20
URI: https://eprints.iisc.ac.in/id/eprint/80124

Actions (login required)

View Item View Item