Automatic Speech Recognition for Sanskrit

Anoop, CS and Ramakrishnan, AG (2019) Automatic Speech Recognition for Sanskrit. In: 2019 2nd International Conference on Intelligent Computing, Instrumentation and Control Technologies, ICICICT 2019, 5-6th July 2019, Kannur, Kerala, pp. 1146-1151.

PDF
ICICICT 2019.pdf - Published Version
Restricted to Registered users only
Download (3MB) | Request a copy

Official URL: https://dx.doi.org/10.1109/ICICICT46008.2019.89932...

Abstract

This paper presents our work on building a speaker independent, large vocabulary continuous speech recognition system for Sanskrit using HMM Toolkit (HTK). To our knowledge, this is the maiden attempt on a Sanskrit automatic speech recognizer. A Sanskrit speech corpus with a vocabulary size of 8370 words is built. The corpus contains orthographic, phoneme and word level transcriptions of 1360 sentences. The speech data were collected from 3 sources: All India Radio website, Indian Heritage Group under C-DAC and Vyoma Linguistic Labs Foundation. Mel Frequency Cepstral Coefficients together with 0th order coefficient and delta and acceleration parameters are used as features. Triphone HMMs, trained using HTK, are used as acoustic model. Bigram probabilities with backoff smoothing are used as language model. Both phoneme and word level recognizers were developed on the Sanskrit corpus. The system provides a word level accuracy of 89.64 and a sentence level correctness of 58.76 on the test set of 274 sentences. A graphical user interface for the speech recognizer is built using Java Swings. Â© 2019 IEEE.

Item Type:	Conference Poster
Publication:	2019 2nd International Conference on Intelligent Computing, Instrumentation and Control Technologies, ICICICT 2019
Publisher:	Institute of Electrical and Electronics Engineers Inc.
Additional Information:	cited By 0; Conference of 2nd International Conference on Intelligent Computing, Instrumentation and Control Technologies, ICICICT 2019 ; Conference Date: 5 July 2019 Through 6 July 2019; Conference Code:157717
Keywords:	Continuous speech recognition; Graphical user interfaces; Hidden Markov models; Intelligent computing; Speech recognition, MFCC; Sanskrit; Sanskrit ASR; Speech corpora; TIMIT, Speech
Department/Centre:	Division of Electrical Sciences > Electrical Engineering
Date Deposited:	09 Jun 2020 05:57
Last Modified:	09 Jun 2020 05:57
URI:	http://eprints.iisc.ac.in/id/eprint/64834

Actions (login required)

View Item


	Powered by EPrints		A service from The J.R.D. Tata Memorial Library Indian Institute of Science, Bengaluru-560012, India