ePrints@IIScePrints@IISc Home | About | Browse | Latest Additions | Advanced Search | Contact | Help

Browse by IISc Authors

Up a level
Export as [feed] Atom [feed] RSS 1.0 [feed] RSS 2.0
Group by: Item Type | No Grouping
Number of items: 62.

Conference Proceedings

Muguli, A and Pinto, L and Nirmala, R and Sharma, N and Krishnan, P and Ghoshy, PK and Kumar, R and Bhat, S and Chetupalli, SR and Ganapathy, S and Ramoji, S and Nanda, V (2021) DiCOVA challenge: Dataset, task, and baseline system for COVID-19 diagnosis using acoustics. In: 22nd Annual Conference of the International Speech Communication Association, 30 - 3 September 2021, Brno, pp. 4241-4245.

Agrawal, P and Ganapathy, S (2021) Representation Learning for Speech Recognition Using Feedback Based Relevance Weighting. In: 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 6 - 11 June 2021, Virtual, Toronto, pp. 6883-6887.

Conference Paper

Prabhu, D and Jyothi, P and Ganapathy, S and Unni, V (2023) Accented Speech Recognition With Accent-specific Codebooks. In: UNSPECIFIED, pp. 7175-7188.

Krishna, V and Ganapathy, S (2023) Pseudo-Label Based Supervised Contrastive Loss for Robust Speech Representations. In: UNSPECIFIED.

Singh, P and Kaul, A and Ganapathy, S (2023) Supervised Hierarchical Clustering Using Graph Neural Networks for Speaker Diarization. In: UNSPECIFIED.

Bhattacharya, D and Dutta, D and Sharma, NK and Chetupalli, SR and Mote, P and Ganapathy, S and Chandrakiran, C and Nori, S and Suhail, KK and Gonuguntla, S and Alagesan, M (2022) Analyzing the impact of SARS-CoV-2 variants on respiratory sound signals. In: 23rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2022, 18 - 22 September 2022, Incheon, pp. 2473-2477.

Bhattacharya, D and Dutta, D and Sharma, NK and Chetupalli, SR and Mote, P and Ganapathy, S and Chandrakiran, C and Nori, S and Suhail, KK and Gonuguntla, S and Alagesan, M (2022) Coswara: A website application enabling COVID-19 screening by analysing respiratory sound samples and health symptoms. In: 23rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2022, 18 - 22 September 2022, Incheon, pp. 1957-1958.

Dutta, D and Bhattacharya, D and Ganapathy, S and Poorjam, AH and Mittal, D and Singh, M (2022) Interpretable Acoustic Representation Learning on Breathing and Speech Signals for COVID-19 Detection. In: 23rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2022, 18 - 22 September 2022, Incheon, pp. 2863-2867.

Agarwal, S and Ganapathy, S and Takahashi, N (2022) Leveraging Symmetrical Convolutional Transformer Networks for Speech to Singing Voice Style Transfer. In: 23rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2022, 18 - 22 September 2022, Incheon, pp. 3013-3017.

Rath, SP and Bandarupalli, TS and Shah, N and Onoe, N and Ganapathy, S (2022) Semi-supervised Acoustic and Language Modeling for Hindi ASR. In: 23rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2022, 18 - 22 September 2022, Incheon, pp. 3528-3532.

Chetupalli, SR and Ganapathy, S (2022) Speaker conditioned acoustic modeling for multi-speaker conversational ASR. In: 23rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2022, 18 - 22 September 2022, Incheon, pp. 3834-3838.

Jayesh, MK and Sharma, M and Vonteddu, P and Shaik, MAB and Ganapathy, S (2022) Transformer Networks for Non-Intrusive Speech Quality Prediction. In: 23rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2022, 18 - 22 September 2022, Incheon, pp. 4078-4082.

Kumar, R and Purushothaman, A and Sreeram, A and Ganapathy, S (2022) END-TO-END SPEECH RECOGNITION WITH JOINT DEREVERBERATION OF SUB-BAND AUTOREGRESSIVE ENVELOPES. In: 47th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2022, 23 - 27 May 2022, Virtual, Online at Singapore, pp. 1805-1809.

Dutta, S and Ganapathy, S (2022) MULTIMODAL TRANSFORMER WITH LEARNABLE FRONTEND AND SELF ATTENTION FOR EMOTION RECOGNITION. In: 47th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2022, 23 - 27 May 2022, Virtual, Online at Singapore, pp. 5932-5936.

Sharma, NK and Chetupalli, SR and Bhattacharya, D and Dutta, D and Mote, P and Ganapathy, S (2022) THE SECOND DICOVA CHALLENGE: DATASET AND PERFORMANCE ANALYSIS FOR DIAGNOSIS OF COVID-19 USING ACOUSTICS. In: 47th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2022, 23 - 27 May 2022, Virtual, Online at Singapore, pp. 556-560.

Krishna, V and Ganapathy, S (2022) SELF SUPERVISED REPRESENTATION LEARNING WITH DEEP CLUSTERING FOR ACOUSTIC UNIT DISCOVERY FROM RAW SPEECH. In: 47th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2022, 23 - 27 May 2022, Virtual, Online at Singapore, pp. 3268-3272.

Singh, P and Ganapathy, S (2021) Self-Supervised Metric Learning with Graph Clustering for Speaker Diarization. In: 2021 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2021, 3 - 17 December 2021, Cartagena, pp. 90-97.

Kalluri, SB and Vijayasenan, D and Ganapathy, S and Ragesh Rajan, M and Krishnan, P (2021) NISP: A Multi-lingual Multi-accent Dataset for Speaker Profiling. In: International Conference on Acoustics, Speech and Signal Processing - Proceedings, 06 - 11 June 2021, Toronto, pp. 6953-6957.

Katthi, JR and Ganapathy, S (2021) Deep multiway canonical correlation analysis for multi-subject eeg normalization. In: 2021 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2021, 6 June - 11 June 2021, Virtual, Toronto, pp. 1245-1249.

Avila, F and Poorjam, AH and Mittal, D and Dognin, C and Muguli, A and Kumar, R and Chetupalli, SR and Ganapathy, S and Singh, M (2021) Investigating feature selection and explainability for COVID-19 diagnostics from cough sounds. In: 22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021, 30 Aug - 03 Sep 2021, Brno, pp. 4246-4250.

Singh, P and Varma, R and Krishnamohan, V and Chetupalli, SR and Ganapathy, S (2021) LEAP submission for the third DIHARD diarization challenge. In: 22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021, 30 Aug - 03 Sep 2021, Brno, pp. 2538-2542.

Raj, RGP and Kumar, R and Jayesh, MK and Purushothaman, A and Ganapathy, S and Shaik, MAB (2021) Srib-leap submission to far-field multi-channel speech enhancement challenge for video conferencing. In: 22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021, 30 Aug - 03 Sep 2021, Brno, pp. 326-330.

Krishnamohan, V and Soman, A and Gupta, A and Ganapathy, S (2020) Audiovisual correspondence learning in humans and machines. In: 21st Annual Conference of the International Speech Communication Association, INTERSPEECH, 25 October 2020, Shanghai; China, pp. 4462-4466.

Chetupalli, SR and Ganapathy, S (2020) Context dependent RNNLM for automatic transcription of conversations. In: 21st Annual Conference of the International Speech Communication Association, INTERSPEECH, 25-29, October 2020, Shanghai; China;, pp. 886-890.

Ramoji, S and Krishnan, P and Ganapathy, S (2020) Neural PLDA modeling for end-to-end speaker verification. In: 21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020, 25 October 2020, Shanghai; China, pp. 4333-4337.

Katthi, JR and Ganapathy, S and Kothinti, S and Slaney, M (2020) Deep Canonical Correlation Analysis for Decoding the Auditory Brain. In: 42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society, EMBC-2020, 20-24 July 2020, Montreal; Canada, pp. 3505-3508.

Purushothaman, A and Sreeram, A and Ganapathy, S (2020) 3-D acoustic modeling for far-field multi-channel speech recognition. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 4-8 May 2020, Barcelona; Spain, pp. 6964-6968.

Kumar, R and Sreeram, A and Purushothaman, A and Ganapathy, S (2020) Unsupervised neural mask estimator for generalized eigen-value beamforming based ASR. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 4-8 May 2020, Barcelona; Spain, pp. 7494-7498.

Singh, K and Kumar, N and Sinha, R and Ramoji, S and Ganapathy, S (2020) IITG- Indigo Submissions for NIST 2018 Speaker Recognition Evaluation and Post-Challenge Improvements. In: 26th National Conference on Communications NCC 2020, 21-23 Feb. 2020, Kharagpur, India, India.

Sharma, N and Krishnan, P and Kumar, R and Ramoji, S and Chetupalli, SR and Nirmala, R and Kumar Ghosh, P and Ganapathy, S (2020) Coswara - A database of breathing, cough, and voice sounds for COVID-19 diagnosis. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 25 October 2020 through 29 October 2020, Shanghai; China, pp. 4811-4815.

Purushothaman, A and Sreeram, A and Kumar, R and Ganapathy, S (2020) Deep learning based dereverberation of temporal envelopes for robust speech recognition. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 25-29 October 2020, Shanghai; China, pp. 1688-1692.

Sharma, N and Krishnamohan, V and Ganapathy, S and Gangopadhayay, A and Fink, L (2020) On the Impact of Language Familiarity in Talker Change Detection. In: 2020 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2020, 4-8, May 2020, Barcelona, Spain, pp. 6249-6253.

Agrawal, P and Ganapathy, S (2020) Robust raw waveform speech recognition using relevance weighted representations. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 25-29 October 2020, Shanghai; China, pp. 1649-1653.

Praveen, K and Gupta, A and Soman, A and Ganapathy, S (2019) Second Language Transfer Learning in Humans and Machines Using Image Supervision. In: 2019 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2019, 15-18, December 2019, Singapore, pp. 1040-1047.

Malhotra, K and Bansal, S and Ganapathy, S (2019) Active learning methods for low resource end-to-end speech recognition. In: 20th Annual Conference of the International Speech Communication Association: Crossroads of Speech and Language, INTERSPEECH 2019, 15 - 19 September 2019, Graz, pp. 2215-2219.

Padi, B and Mohan, A and Ganapathy, S (2019) Attention based hybrid I-vector BLSTM model for language recognition. In: 20th Annual Conference of the International Speech Communication Association: Crossroads of Speech and Language, INTERSPEECH 2019, 15 - 19 September 2019, Graz, pp. 1263-1267.

Agrawal, P and Ganapathy, S (2019) Unsupervised raw waveform representation learning for ASR. In: 20th Annual Conference of the International Speech Communication Association: Crossroads of Speech and Language, INTERSPEECH 2019, 15 - 19 September 2019, Graz, pp. 3451-3455.

Ryant, N and Church, K and Cieri, C and Cristia, A and Du, J and Ganapathy, S and Liberman, M (2019) The second dihard diarization challenge: Dataset, task, and baselines. In: 20th Annual Conference of the International Speech Communication Association: Crossroads of Speech and Language, INTERSPEECH 2019, 15 - 19 September 2019, Graz, pp. 978-982.

Sharma, N and Ganesh, S and Ganapathy, S and Holt, LL (2019) Analyzing Human Reaction Time for Talker Change Detection. In: 44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019, 12 - 17 May 2019, Brighton, pp. 7135-7139.

Kalluri, SB and Vijayasenan, D and Ganapathy, S (2019) A Deep Neural Network Based End to End Model for Joint Height and Age Estimation from Short Duration Speech. In: 44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019, 12 May 2019 -17 May 2019, Brighton, pp. 6580-6584.

Agrawal, P and Ganapathy, S (2019) Deep Variational Filter Learning Models for Speech Recognition. In: 44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019, 12 May 2019-17 May 2019, Brighton, pp. 5731-5735.

Padi, B and Mohan, A and Ganapathy, S (2019) End-to-end Language Recognition Using Attention Based Hierarchical Gated Recurrent Unit Models. In: 44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019, 12 - 17 May 2019, Brighton, pp. 5966-5970.

Ramoji, S and Mohan, A and Mysore, B and Bhatia, A and Singh, P and Vardhan, H and Ganapathy, S (2019) The Leap Speaker Recognition System for NIST SRE 2018 Challenge. In: 44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019, 12 May 2019-17 May 2019, Brighton, pp. 5771-5775.

Bansal, S and Malhotra, K and Ganapathy, S (2019) Speaker and Language Aware Training for End-To-End ASR. In: 2019 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2019, 15-18, December 2019, Singapore, pp. 494-501.

Ansari, TK and Kumar, R and Singh, S and Ganapathy, S (2018) Deep learning methods for unsupervised acoustic modeling-Leap submission to ZeroSpeech challenge 2017. In: IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2017, 16 - 20 December 2017, Okinawa, pp. 754-761.

Siddhant, A and Jyothi, P and Ganapathy, S (2018) Leveraging native language speech for accent identification using deep Siamese networks. In: 2017 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2017, 16 - 20 December 2017, Okinawa, pp. 621-628.

Kumar, N and Das, RK and Jelil, S and Dhanush, BK and Kashyap, H and Murty, KSR and Ganapathy, S and Sinha, R and Prasanna, SRM (2017) IITG-indigo system for NIST 2016 SRE challenge. In: 18th Annual Conference of the International Speech Communication Association, 20 August 2017, Stockholm, pp. 2859-2863.

Agrawal, P and Ganapathy, S (2017) Speech representation learning using unsupervised data-driven modulation filtering for robust ASR. In: 18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017, 20 - 24 August 2017, Stockholm, pp. 2446-2450.

Journal Article

Baghel, S and Ramoji, S and Jain, S and Chowdhuri, PR and Singh, P and Vijayasenan, D and Ganapathy, S (2024) Summary of the DISPLACE challenge 2023-DIarization of SPeaker and LAnguage in Conversational Environments. In: Speech Communication, 161 .

Chetupalli, SR and Krishnan, P and Sharma, N and Muguli, A and Kumar, A and Nanda, V and Pinto, LM and Ghosh, PK and Ganapathy, S (2023) Multi-Modal Point-of-Care Diagnostics for COVID-19 Based on Acoustics and Symptoms. In: IEEE Journal of Translational Engineering in Health and Medicine, 11 . pp. 199-210.

Sharma, NK and Muguli, A and Krishnan, P and Kumar, R and Chetupalli, SR and Ganapathy, S (2022) Towards sound based testing of COVID-19�Summary of the first Diagnostics of COVID-19 using Acoustics (DiCOVA) Challenge. In: Computer Speech and Language, 73 .

Purushothaman, A and Sreeram, A and Kumar, R and Ganapathy, S (2022) Dereverberation of autoregressive envelopes for far-field speech recognition. In: Computer Speech and Language, 72 .

Soman, A and Ramachandran, P and Ganapathy, S (2022) ERP Evidences of Rapid Semantic Learning in Foreign Language Word Comprehension. In: Frontiers in Neuroscience, 16 .

Reddy Katthi, J and Ganapathy, S (2021) Deep Correlation Analysis for Audio-EEG Decoding. In: IEEE Transactions on Neural Systems and Rehabilitation Engineering, 29 . pp. 2742-2753.

Dutta, D and Agrawal, P and Ganapathy, S (2021) A Multi-Head Relevance Weighting Framework for Learning Raw Waveform Audio Representations. In: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021-O . pp. 191-195.

Sharma, NK and Krishnamohan, V and Ganapathy, S and Gangopadhayay, A and Fink, L (2020) Acoustic and linguistic features influence talker change detection. In: Journal of the Acoustical Society of America, 148 (5). EL414-EL419.

Kalluri, SB and Vijayasenan, D and Ganapathy, S (2020) Automatic speaker profiling from short duration speech data. In: Speech Communication, 121 . pp. 16-28.

Agrawal, P and Ganapathy, S (2020) Interpretable Representation Learning for Speech and Audio Signals Based on Relevance Weighting. In: IEEE/ACM Transactions on Audio Speech and Language Processing, 28 . pp. 2823-2836.

Padi, B and Mohan, A and Ganapathy, S (2020) Towards Relevance and Sequence Modeling in Language Recognition. In: IEEE/ACM Transactions on Audio Speech and Language Processing, 28 . pp. 1223-1232.

Kadimesetty, VS and Gutta, S and Ganapathy, S and Yalavarthy, PK (2019) Convolutional neural network-based robust denoising of low-dose computed tomography perfusion maps. In: IEEE Transactions on Radiation and Plasma Medical Sciences, 3 (2). pp. 137-152.

Soman, A and Madhavan, CR and Sarkar, K and Ganapathy, S (2019) An EEG study on the brain representations in language learning. In: Biomedical Physics and Engineering Express, 5 (2).

Other

Bhattacharya, D and Sharma, NK and Dutta, D and Chetupalli, SR and Mote, P and Ganapathy, S and Chandrakiran, C and Nori, S and Suhail, KK and Gonuguntla, S and Alagesan, M (2023) Coswara: A respiratory sounds and symptoms dataset for remote screening of SARS-CoV-2 infection. Nature Research.

This list was generated on Sun Dec 22 00:31:14 2024 IST.