Online speech translation system for Tamil

Madhavaraj, A and Kumar, Shiva H R and Ramakrishnan, AG (2018) Online speech translation system for Tamil. In: 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018, 2 September 2018 through 6 September 2018, Hyderabad International Convention Centre (HICC)Hyderabad, pp. 1966-1967.

PDF
Interspeech 2018.pdf - Published Version
Restricted to Registered users only
Download (262kB) | Request a copy

Official URL: https://dx.doi.org/10.21437/Interspeech.2018-3035

Abstract

In this paper, we present an application, which recognizes spoken Tamil utterances and speaks out the recognized text in Tamil through our Tamil text-to-speech (TTS) system. Further, we translate the recognized Tamil text to English using google translate and play it through our English TTS. Our Tamil speech recognition system, which can recognize about 75,000 words, has been trained on a 150-hour transcribed speech corpus. We have trained a deep neural network for the acoustic model and employed tri-gram language models to build our recognition system. Our Thirukkural TI'S system performs unit-selection based, concatenative speech synthesis, using 2.5 hours of Tamil spoken utterances transcribed at the phone-level. Our English TTS uses 2.7 hours of phone-transcribed utterances. This is a technology demonstration of a complete web application, which, when perfected, could be used to assist Tamil users in learning English, by speaking in Tamil into the system. The playback of the recognized text from Tamil TTS serves to demonstrate the effectiveness of the Tamil ASR to the majority of the conference registrants (who cannot read the recognized Tamil text.

Item Type:	Conference Proceedings
Series.:	Interspeech
Publisher:	ISCA-INT SPEECH COMMUNICATION ASSOC
Additional Information:	19th Annual Conference of the International-Speech-Communication-Association (INTERSPEECH 2018), Hyderabad, INDIA, AUG 02-SEP 06, 2018
Keywords:	Speech recognition; text-to-speech; Tamil. English; translation; deep neural networks; acoustic model; language model; web application
Department/Centre:	Division of Electrical Sciences > Electrical Engineering
Date Deposited:	11 Mar 2020 10:53
Last Modified:	11 Mar 2020 10:53
URI:	http://eprints.iisc.ac.in/id/eprint/62922

Actions (login required)

View Item


	Powered by EPrints		A service from The J.R.D. Tata Memorial Library Indian Institute of Science, Bengaluru-560012, India