ePrints@IIScePrints@IISc Home | About | Browse | Latest Additions | Advanced Search | Contact | Help

Browse by Author

Up a level
Export as [feed] Atom [feed] RSS 1.0 [feed] RSS 2.0
Group by: Item Type | No Grouping
Number of items: 60.

Conference Proceedings

Yarra, Chandana SChiranjeevi and Aggarwal, Ritu and Mittal, Sanjeev Kumar and Kausthubha, NK and Raseena, KT and Singh, Astha and Ghosh, Prasanta Kumar (2018) Automatic visual augmentation for concatenation based synthesized articulatory videos from real-time MRI data for spoken language training. In: 19th Annual Conference of the International Speech Communication, 2-6, September 2018, Hyderabad International Convention Centre (HICC)Hyderabad, pp. 3127-3131.

Illa, Aravind and Ghosh, Prasanta Kumar (2018) Low resource acoustic-to-articulatory inversion using bi-directional long short term memory. In: 19th Annual Conference of the International Speech Communication, 2 September 2018 to 6 September, Hyderabad International Convention Centre (HICC)Hyderabad, pp. 3122-3126.

Karthik, Girija Ramesan and Suresh, Parth and Ghosh, Prasanta Kumar (2018) Subband Weighting for Binaural Speech Source Localization. In: 19th Annual Conference of the International Speech Communication, 2 September 2018 through 6 September 2018, International Convention Centre (HICC)Hyderabad, pp. 861-865. (In Press)

Meenakshi, G Nisha and Ghosh, Prasanta Kumar (2018) Whispered speech to neutral speech conversion using bidirectional LSTMs. In: 19th Annual Conference of the International Speech Communication, 2-6 September, 2018, Hyderabad International Convention Centre (HICC)Hyderabad, pp. 491-495.

Illa, Aravind and Patel, Deep and Yamini, BK and Meera, SS and Shivashankar, N and Veeramani, Preethish-Kumar and Vengalil, Seena and Polavarapu, Kiran and Nashi, Saraswati and Nalini, Atchayaram and Ghosh, Prasanta Kumar (2018) COMPARISON OF SPEECH TASKS FOR AUTOMATIC CLASSIFICATION OF PATIENTS WITH AMYOTROPHIC LATERAL SCLEROSIS AND HEALTHY SUBJECTS. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), APR 15-20, 2018, Calgary, CANADA, pp. 6014-6018.

Desai, Urvish and Yarra, Chiranjeevi and Ghosh, Prasanta Kumar (2018) CONCATENATIVE ARTICULATORY VIDEO SYNTHESIS USING REAL-TIME MRI DATA FOR SPOKEN LANGUAGE TRAINING. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), APR 15-20, 2018, Calgary, CANADA, pp. 4999-5003.

Karjol, Pavan and Kumar, Ajay M and Ghosh, Prasanta Kumar (2018) SPEECH ENHANCEMENT USING MULTIPLE DEEP NEURAL NETWORKS. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), APR 15-20, 2018, Calgary, CANADA, pp. 5049-5053.

Koparkar, Advait and Ghosh, Prasanta Kumar (2018) A SUPERVISED AIR-TISSUE BOUNDARY SEGMENTATION TECHNIQUE IN REAL-TIME MAGNETIC RESONANCE IMAGING VIDEO USING A NOVEL MEASURE OF CONTRAST AND DYNAMIC PROGRAMMING. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), APR 15-20, 2018, Calgary, CANADA, pp. 5004-5008.

Valliappan, CA and Mannem, Renuka and Ghosh, Prasanta Kumar (2018) Air-Tissue Boundary Segmentation in Real-Time Magnetic Resonance Imaging Video using Semantic Segmentation with Fully Convolutional Networks. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, September, 2018, Hyderabad, pp. 3132-3136.

Rao, Achuth M and Krishnamurthy, Rahul and Gopikishore, Pebbili and Priyadharshini, Veeramani and Ghosh, Prasanta Kumar (2018) Automatic glottis localization and segmentation in stroboscopic videos using deep neural network. In: 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018, 2 September 2018 through 6 September 2018, Hyderabad International Convention Centre (HICC)Hyderabad, pp. 3007-3011.

Anand, PA and Yarra, Chiranjeevi and Kausthubha, NK and Ghosh, Prasanta Kumar (2018) Intonation tutor by SPIRE (In-SPIRE): An online tool for an automatic feedback to the second language learners in learning intonation. In: 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018, 2-6 September, 2018, Hyderabad International Convention Centre (HICC)Hyderabad, pp. 546-547. (In Press)

Reddy, Abinay N and Rao, Achuth M and Meenakshi, G Nisha and Ghosh, Prasanta Kumar (2018) Reconstructing neutral speech from tracheoesophageal speech. In: 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018, 2 September 2018 through 6 September 2018, International Convention Centre (HICC)Hyderabad; India, pp. 1541-1545.

Rao, Achuth M. and Kausthubha, N K and Yadav, Shivani and Gope, Dipanjan and Krishnaswamy, Uma Maheswari and Ghosh, Prasanta Kumar (2017) Automatic Prediction of Spirometry Readings from Cough and Wheeze for Monitoring of Asthma Severity. In: 25th European Signal Processing Conference (EUSIPCO), AUG 28-SEP 02, 2017, GREECE, pp. 41-45.

Sadhu, Samik and Ghosh, Prasanta Kumar (2017) Low Resource Point Process Models for Keyword Spotting Using Unsupervised Online Learning. In: 25th European Signal Processing Conference (EUSIPCO), AUG 28-SEP 02, 2017, GREECE, pp. 538-542.

Rao, Achuth M and Ghosh, Prasanta Kumar (2017) Pitch Prediction from Mel-generalized Cepstrum - a Computationally Efficient Pitch Modeling Approach for Speech Synthesis. In: 25th European Signal Processing Conference (EUSIPCO), AUG 28-SEP 02, 2017, GREECE, pp. 1629-1633.

Prasad, Abhay and Ghosh, Prasanta Kumar (2016) Automatic classification of eating conditions from speech using acoustic feature selection and a set of hierarchical support vector machine classifiers. In: 16th Annual Conference of the International-Speech-Communication-Association (INTERSPEECH 2015), SEP 06-10, 2015, Dresden, GERMANY, pp. 884-888.

Afshan, Amber and Ghosh, Prasanta Kumar (2016) Better acoustic normalization in subject independent acoustic-to-articulatory inversion: benefit to recognition. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, MAR 20-25, 2016, Shanghai, PEOPLES R CHINA, pp. 5395-5399.

Narwekar, Abhishek and Ghosh, Prasanta Kumar (2016) A Comparative Study of Articulatory Features From Facial Video and Acoustic-To-Articulatory Inversion for Phonetic Discrimination. In: 11th International Conference on Signal Processing and Communications (SPCOM), JUN 12-15, 2016, Indian Inst Sci, Banglore, INDIA.

Nagesh, Supriya and Yarra, Chiranjeevi and Deshmukh, Om D and Ghosh, Prasanta Kumar (2016) A ROBUST SPEECH RATE ESTIMATION BASED ON THE ACTIVATION PROFILE FROM THE SELECTED ACOUSTIC UNIT DICTIONARY. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, MAR 20-25, 2016, Shanghai, PEOPLES R CHINA, pp. 5400-5404.

Gaonkar, Aditya P and Bhuthesh, R and Gope, Dipanjan and Ghosh, Prasanta Kumar (2016) Robust real-time pulse rate estimation from facial video using sparse spectral peak tracking. In: 11th International Conference on Signal Processing and Communications (SPCOM), JUN 12-15, 2016, Indian Inst Sci, Banglore, INDIA.

Meenakshi, Nisha G and Ghosh, Prasanta Kumar (2016) A discriminative analysis within and across voiced and unvoiced consonants in neutral and whispered speech in multiple Indian languages. In: 16th Annual Conference of the International-Speech-Communication-Association (INTERSPEECH 2015), SEP 06-10, 2015, Dresden, GERMANY, pp. 781-785.

Meenakshi, Nisha G and Ghosh, Prasanta Kumar (2015) Automatic Gender Classification Using the Mel Frequency Cepstrum of Neutral and Whispered Speech: a Comparative Study. In: 21st National Conference on Communications (NCC), FEB 27-MAR 01, 2015, Indian Inst Technol, Bombay, INDIA.

Prasad, Abhay and Periyasamy, Vijitha and Ghosh, Prasanta Kumar (2015) ESTIMATION OF THE INVARIANT AND VARIANT CHARACTERISTICS IN SPEECH ARTICULATION AND ITS APPLICATION TO SPEAKER IDENTIFICATION. In: 40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), APR 19-24, 2014, Brisbane, AUSTRALIA, pp. 4265-4269.

Parida, Satyabrata and Kumar, Pattern Ashok and Ghosh, Prasanta Kumar (2015) Estimation of the air-tissue boundaries of the vocal tract in the mid-sagittal plane from electromagnetic articulograph data. In: 16th Annual Conference of the International-Speech-Communication-Association (INTERSPEECH 2015), SEP 06-10, 2015, Dresden, GERMANY, pp. 2147-2151.

Sujith, P and Ghosh, Prasanta Kumar (2014) MAXIMUM A-POSTERIORI ESTIMATION OF MISSING SAMPLES WITH CONTINUITY CONSTRAINT IN ELECTROMAGNETIC ARTICULOGRAPHY DATA. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), MAY 04-09, 2014, Florence, ITALY.

Conference Paper

Singh, Astha and Meenakshi, G Nisha and Ghosh, Prasanta Kumar (2018) Relating articulatory motions in different speaking rates. In: 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018;, 2-6 Sept. 2018, Hyderabad International Convention Centre (HICC)Hyderabad, pp. 2992-2996.

Karjol, Pavan and Ghosh, Prasanta Kumar (2018) Speech enhancement using deep mixture of experts based on hard expectation maximization. In: 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018, 2-6, September 2018, Hyderabad International Convention Centre (HICC)Hyderabad; India, pp. 3254-3258.

Mekhala, HS and Yamini, BK and Ketan, J and Pal, P and Shivashankar, N and Ghosh, Prasanta Kumar (2017) Classification of healthy subjects and patients with essential vocal tremor using empirical mode decomposition of high resolution pitch contour. In: 23rd National Conference on Communications, NCC 2017, 02-04 March 2017, Chennai, India, pp. 1-6.

Rao, M V Achuth and Ghosh, Prasanta Kumar (2017) Pitch prediction from Mel-frequency cepstral coefficients using sparse spectrum recovery. In: 23rd National Conference on Communications, NCC 2017, 02-04 March 2017, Chennai, India, pp. 1-6.

Raghavan, Srinivasa and Meenakshi, Nisha and Mittal, Sanjeev Kumar and Yarra, Chiranjeevi and Mandal, Anupam and Prasanna Kumar, KR and Ghosh, Prasanta Kumar (2017) A comparative study on the effect of different codecs on speech recognition accuracy using various acoustic modeling techniques. In: 23rd National Conference on Communications, NCC 2017, 02-04 March 2017, Chennai, India, pp. 1-6.

Yarra, Chiranjeevi and Deshmukh, Om D and Ghosh, Prasanta Kumar (2017) AUTOMATIC DETECTION OF SYLLABLE STRESS USING SONORITY BASED PROMINENCE FEATURES FOR PRONUNCIATION EVALUATION. In: IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), MAR 05-09, 2017, New Orleans, LA, pp. 5845-5849.

Bansal, Sahil and Ghosh, Anindita and Seelamantula, Chandra Sekhar and Gurrala, Gurunath and Ghosh, Prasanta Kumar (2017) Adaptive Frequency Estimation Using Iterative DESA with RDFT-Based Filter. In: IEEE PES Asia-Pacific Power and Energy Engineering Conference (APPEEC), NOV 08-10, 2017, Bangalore, INDIA.

Fotedar, Gaurav and Gaonkar, Aditya P and Chatterjee, Saikat and Ghosh, Prasanta Kumar (2017) Automatic recognition of social roles using long term role transitions in small group interactions. In: 17th Annual Conference of the International-Speech-Communication-Association (INTERSPEECH 2016), SEP 08-12, 2016, San Francisco, CA, pp. 2065-2069.

Illa, Aravind and Meenakshi, Nisha G and Ghosh, Prasanta Kumar (2017) A COMPARATIVE STUDY OF ACOUSTIC-TO-ARTICULATORY INVERSION FOR NEUTRAL AND WHISPERED SPEECH. In: International Conference on Acoustics Speech and Signal Processing ICASSP, MAR 05-09, 2017, New Orleans, LA, pp. 5075-5079.

Nazreen, P M and Ramakrishnan, A G and Ghosh, Prasanta Kumar (2017) A class-specific speech enhancement for phoneme recognition: a dictionary learning approach. In: 17th Annual Conference of the International-Speech-Communication-Association (INTERSPEECH 2016), SEP 08-12, 2016, San Francisco, CA, pp. 3728-3732.

Ghosh, Prasanta Kumar (2007) Speech segmentation using extrema based signal track length measure. In: IEEE International Conference on Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. , 15-20 April 2007, Honolulu, HI .

Ghosh, Prasanta Kumar and Sreenivas, TV (2006) Dynamic programming based optimum non-uniform samples for speech reconstruction and coding. In: 31st IEEE International Conference on Acoustics, Speech and Signal Processing,, May 14-19, 2006, Toulouse, France, pp. 1221-1224.

Ghosh, Prasanta Kumar and Sreenivas, TV (2006) Extrema based unwarping for time-varying pitch estimation. In: Proceedings. National Conf. Communications (NCC), January 2006, New Delhi.

Ghosh, Prasanta Kumar and Sreenivas, TV (2005) Waveform Reconstruction from Non-Uniform Samples with Application to Speech Coding. In: IEEE-Eurasip Nonlinear Signal and Image Processing, 2005. NSIP 2005. Abstracts, 18-20 May, Sapporo,Japan, p. 35.

Journal Article

Illa, Aravind and Ghosh, Prasanta Kumar (2020) The impact of speaking rate on acoustic-to-articulatory inversion. In: COMPUTER SPEECH AND LANGUAGE, 59 . pp. 75-90.

Yarra, Chiranjeevi and Nagesh, Supriya and Deshmukh, Om D and Ghosh, Prasanta Kumar (2019) Noise robust speech rate estimation using signal-to-noise ratio dependent sub-band selection and peak detection strategy. In: JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 146 (3). pp. 1615-1628.

Kumar, Anurendra and Guha, Tanaya and Ghosh, Prasanta Kumar (2019) Dirichlet Latent Variable Model: A Dynamic Model Based on Dirichlet Prior for Audio Processing. In: IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 27 (5). pp. 919-931.

Rao, Achuth M and Ghosh, Prasanta Kumar (2019) Glottal Inverse Filtering Using Probabilistic Weighted Linear Prediction. In: IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 27 (1). pp. 114-124.

Yarra, Chiranjeevi and Ghosh, Prasanta Kumar (2018) Automatic intonation classification using temporal patterns in utterance-level pitch contour and perceptually motivated pitch transformation. In: JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 144 (5). EL471-EL476.

Rao, Achuth M and Ghosh, Prasanta Kumar (2018) PSFM-A Probabilistic Source Filter Model for Noise Robust Glottal Closure Instant Detection. In: IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 26 (9). pp. 1645-1657.

Rao, Achuth M and Victory, Shiny J and Ghosh, Prasanta Kumar (2018) Effect of source filter interaction on isolated vowel-consonant-vowel perception. In: JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 144 (2). EL95-EL99.

Meenakshi, Nisha G and Ghosh, Prasanta Kumar (2018) Reconstruction of articulatory movements during neutral speech from those during whispered speech. In: JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 143 (6). pp. 3352-3364.

Yarra, Chiranjeevi and Deshmukh, Om D and Ghosh, Prasanta Kumar (2018) A frame selective dynamic programming approach for noise robust pitch estimation. In: JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 143 (4). pp. 2289-2300.

Koluguri, Nithin Rao and Meenakshi, G Nisha and Ghosh, Prasanta Kumar (2017) Spectrogram Enhancement Using Multiple Window Savitzky-Golay (MWSG) Filter for Robust Bird Sound Detection. In: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 25 (6). pp. 1183-1192. ISSN 2329-9290

Pattem, Ashok Kumar and Illa, Aravind and Afshan, Amber and Ghosh, Prasanta Kumar (2017) Optimal sensor placement in electromagnetic articulography recording for speech production study. In: COMPUTER SPEECH AND LANGUAGE, 47 . pp. 157-174.

Periyasamy, Vijitha and Pramanik, Manojit and Ghosh, Prasanta Kumar (2017) Review on Heart-Rate Estimation from Photoplethysmography and Accelerometer Signals During Physical Exercise. In: JOURNAL OF THE INDIAN INSTITUTE OF SCIENCE, 97 (3). pp. 313-324.

Prathosh, AP and Sujith, P and Ramakrishnan, AG and Ghosh, Prasanta Kumar (2016) Cumulative Impulse Strength for Epoch Extraction. In: IEEE SIGNAL PROCESSING LETTERS, 23 (4). pp. 424-428.

Prasad, Abhay and Ghosh, Prasanta Kumar (2016) Information theoretic optimal vocal tract region selection from real time magnetic resonance images for broad phonetic class recognition. In: COMPUTER SPEECH AND LANGUAGE, 39 . pp. 108-128.

Yarra, Chiranjeevi and Deshmukh, Om D and Ghosh, Prasanta Kumar (2016) A mode-shape classification technique for robust speech rate estimation and syllable nuclei detection. In: SPEECH COMMUNICATION, 78 . pp. 62-71.

Afshan, Amber and Ghosh, Prasanta Kumar (2015) Improved subject-independent acoustic-to-articulatory inversion. In: SPEECH COMMUNICATION, 66 . pp. 1-16.

Murthy, Navaneet KLakshminarasimha and Madhusudana, Pavan C and Suresha, Pradyumna and Periyasamy, Vijitha and Ghosh, Prasanta Kumar (2015) Multiple Spectral Peak Tracking for Heart Rate Monitoring from Photoplethysmography Signal During Intensive Physical Exercise. In: IEEE SIGNAL PROCESSING LETTERS, 22 (12). pp. 2391-2395.

Meenakshi, Nisha G and Ghosh, Prasanta Kumar (2015) Robust Whisper Activity Detection Using Long-Term Log Energy Variation of Sub-Band Signal. In: IEEE SIGNAL PROCESSING LETTERS, 22 (11). pp. 1859-1863.

Li, Ming and Kim, Jangwon and Lamrnert, Adam and Ghosh, Prasanta Kumar and Ramanarayanan, Vikram and Narayanan, Shrikanth (2015) Speaker verification based on the fusion of speech acoustics and inverted articulatory signals. In: COMPUTER SPEECH AND LANGUAGE, 36 . pp. 196-211.

Kim, Jangwon and Lammert, Adam C and Ghosh, Prasanta Kumar and Narayanan, Shrikanth S (2014) Co-registration of speech production datasets from electromagnetic articulography and real-time magnetic resonance imaging. In: JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 135 (2). EL115-EL121.

Editorials/Short Communications

Periyasamy, Vijitha and Pramanik, Manojit and Ghosh, Prasanta Kumar (2017) Review on Heart-Rate Estimation from Photoplethysmography and Accelerometer Signals During Physical Exercise. In: JOURNAL OF THE INDIAN INSTITUTE OF SCIENCE, 97 (3). pp. 313-324.

This list was generated on Sat Apr 20 19:28:50 2024 IST.