ePrints@IIScePrints@IISc Home | About | Browse | Latest Additions | Advanced Search | Contact | Help

Relationship between spoken Indian languages by clustering of long distance bigram features of speech

Girish, K V Vijay and Vijai, Veena and Ramakrislman, A G (2017) Relationship between spoken Indian languages by clustering of long distance bigram features of speech. In: IEEE Annual India Conference (INDICON), DEC 16-18, 2016, Bangalore, INDIA.

[img] PDF
2016_Iee_Ann_ind_Con.pdf - Published Version
Restricted to Registered users only

Download (207kB) | Request a copy
Official URL: http://dx.doi.org/10.1109/INDICON.2016.7839074

Abstract

In this paper, a novel method of identifying relationships between languages has been proposed. Our analysis deals with four major Indian languages, as well as Sanskrit and English. We have made use of long distance bigram Mel Frequency Cepstrum Coefficient features and different linkage measures to test the similarities between the clusters formed. Phylogenetic trees have been constructed to provide a visual understanding of the same. The results obtained match with already existing knowledge about language families. For all types of linkage measures, the closest language to Hindi is Marathi and for Tamil, it is Telugu. Since K-medoids give expected language relationships, they are used to learn dictionaries in order to see if they are useful in language identification as well. We have reported the results of one-vs-one classification and found that accuracy improves in the case of English when the weights recovered are multiplied with joint probability of the cluster associated with that medoid.

Item Type: Conference Proceedings
Additional Information: IEEE Annual India Conference (INDICON), Bangalore, INDIA, DEC 16-18, 2016
Department/Centre: Division of Electrical Sciences > Electrical Engineering
Depositing User: Id for Latest eprints
Date Deposited: 10 Jun 2017 04:42
Last Modified: 10 Jun 2017 04:42
URI: http://eprints.iisc.ac.in/id/eprint/57209

Actions (login required)

View Item View Item