ePrints@IIScePrints@IISc Home | About | Browse | Latest Additions | Advanced Search | Contact | Help

Automatic Native Language Identification Using Novel Acoustic and Prosodic Feature Selection Strategies

Yarra, C and Achuth Rao, MV and Kumar Ghosh, P (2018) Automatic Native Language Identification Using Novel Acoustic and Prosodic Feature Selection Strategies. In: 2018 15th IEEE India Council International Conference (INDICON), 16-18 Dec. 2018, Coimbatore, India, India.

[img] PDF
IEEE_COU_INT_CON_2018.pdf - Published Version
Restricted to Registered users only

Download (295kB) | Request a copy
Official URL: https://dx.doi.org/10.1109/INDICON45594.2018.89870...


We consider the problem of automatic identification of native language (L1) of non-native English (L2) speakers from eleven L1 backgrounds. Analyzing the influence of each L1 pronunciation variabilities on L2 pronunciation, different sets of linguistic units are chosen to compute supra-segmental features by considering the acoustic and prosodic variations within and across these sets. Using these features, we implement a multi-class classifier comprising 55 binary (one versus another) support vector machine (SVM) classifiers. We select optimal set of features for each binary classifier using two feature selection strategies (FSSs) based on Fisher discriminant ratio (FDR). The first strategy considers the features that maximizes the each binary classifier performance. However, the second strategy selects the features by maximizing a multi-class classifier performance for which an algorithm is proposed. Experiments are performed on the ETS corpus of non-native spoken English, comprising 4099 files. When the proposed features along with FSSs are used, the unweighted average recall (UAR) on the test set for each selection strategy is found to be 1.3 and 2.1 (absolute) higher compared to using all features; as well as 3.0 and 3.8 higher than the baseline technique respectively. © 2018 IEEE.

Item Type: Conference Paper
Publication: INDICON 2018 - 15th IEEE India Council International Conference
Publisher: Institute of Electrical and Electronics Engineers Inc.
Additional Information: cited By 0; Conference of 15th IEEE India Council International Conference, INDICON 2018 ; Conference Date: 16 December 2018 Through 18 December 2018; Conference Code:158392
Keywords: Automatic identification; Feature extraction; Linguistics; Natural language processing systems; Support vector machines, Binary classifiers; Fisher discriminant; Linguistic units; Multi-class classifier; Native language; Non-native; Optimal sets; Prosodic features, Classification (of information)
Department/Centre: Division of Electrical Sciences > Electrical Engineering
Date Deposited: 21 Sep 2020 07:23
Last Modified: 21 Sep 2020 07:23
URI: http://eprints.iisc.ac.in/id/eprint/65145

Actions (login required)

View Item View Item