Rao, Achuth M and Krishnamurthy, Rahul and Gopikishore, Pebbili and Priyadharshini, Veeramani and Ghosh, Prasanta Kumar (2018) Automatic glottis localization and segmentation in stroboscopic videos using deep neural network. In: 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018, 2 September 2018 through 6 September 2018, Hyderabad International Convention Centre (HICC)Hyderabad, pp. 3007-3011.
PDF
Interspeech(6)2018.pdf - Published Version Restricted to Registered users only Download (2MB) |
Abstract
Exact analysis of the glottal vibration patten is vital for assessing voice pathologies. One of the primary steps in this analysis is automatic glottis segmentation, which, in turn, has two main parts, namely, glottis localization and the glottis segmentation. In this paper, we propose a deep neural network (DNN) based automatic glottis localization and segmentation scheme. We pose the problem as a classification problem where colors of each pixel and its neighborhood is classified as belonging to inside or outside the glottis region. We further process the classification result to get the biggest cluster, which is declared as the segmented glottis. The proposed algorithm is evaluated on a dataset comprising of stroboscopic videos from 18 subjects where the glottis region is marked by the three Speech Language Pathologists (SLPs). On average, the proposed DNN based segmentation scheme achieves a localization performance of 65.33% and segmentation DICE score of 0.74 (absolute), which is better than the baseline scheme by 22.66% and 0.09 respectively. We also find that the DICE score obtained by the DNN based segmentation scheme correlates well with the average DICE score computed between annotation provided by any two SLPs suggesting the robustness of the proposed glottis segmentation scheme.
Item Type: | Conference Proceedings |
---|---|
Series.: | Interspeech |
Publisher: | ISCA-INT SPEECH COMMUNICATION ASSOC |
Additional Information: | 19th Annual Conference of the International-Speech-Communication-Association (INTERSPEECH 2018), Hyderabad, INDIA, AUG 02-SEP 06, 2018 |
Keywords: | Glottal segmentation; DNN; stroboscope |
Department/Centre: | Division of Electrical Sciences > Electrical Engineering |
Date Deposited: | 13 Mar 2020 08:05 |
Last Modified: | 13 Mar 2020 08:05 |
URI: | http://eprints.iisc.ac.in/id/eprint/62929 |
Actions (login required)
View Item |