Anoop, CS and Ramakrishnan, AG (2019) Automatic Speech Recognition for Sanskrit. In: 2019 2nd International Conference on Intelligent Computing, Instrumentation and Control Technologies, ICICICT 2019, 5-6th July 2019, Kannur, Kerala, pp. 1146-1151.
PDF
ICICICT 2019.pdf - Published Version Restricted to Registered users only Download (3MB) | Request a copy |
Abstract
This paper presents our work on building a speaker independent, large vocabulary continuous speech recognition system for Sanskrit using HMM Toolkit (HTK). To our knowledge, this is the maiden attempt on a Sanskrit automatic speech recognizer. A Sanskrit speech corpus with a vocabulary size of 8370 words is built. The corpus contains orthographic, phoneme and word level transcriptions of 1360 sentences. The speech data were collected from 3 sources: All India Radio website, Indian Heritage Group under C-DAC and Vyoma Linguistic Labs Foundation. Mel Frequency Cepstral Coefficients together with 0th order coefficient and delta and acceleration parameters are used as features. Triphone HMMs, trained using HTK, are used as acoustic model. Bigram probabilities with backoff smoothing are used as language model. Both phoneme and word level recognizers were developed on the Sanskrit corpus. The system provides a word level accuracy of 89.64 and a sentence level correctness of 58.76 on the test set of 274 sentences. A graphical user interface for the speech recognizer is built using Java Swings. © 2019 IEEE.
Item Type: | Conference Poster |
---|---|
Publication: | 2019 2nd International Conference on Intelligent Computing, Instrumentation and Control Technologies, ICICICT 2019 |
Publisher: | Institute of Electrical and Electronics Engineers Inc. |
Additional Information: | cited By 0; Conference of 2nd International Conference on Intelligent Computing, Instrumentation and Control Technologies, ICICICT 2019 ; Conference Date: 5 July 2019 Through 6 July 2019; Conference Code:157717 |
Keywords: | Continuous speech recognition; Graphical user interfaces; Hidden Markov models; Intelligent computing; Speech recognition, MFCC; Sanskrit; Sanskrit ASR; Speech corpora; TIMIT, Speech |
Department/Centre: | Division of Electrical Sciences > Electrical Engineering |
Date Deposited: | 09 Jun 2020 05:57 |
Last Modified: | 09 Jun 2020 05:57 |
URI: | http://eprints.iisc.ac.in/id/eprint/64834 |
Actions (login required)
View Item |