
Air tissue boundary segmentation using regional loss in real-time Magnetic Resonance Imaging video for speech production

Roy, A and Belagali, V and Ghosh, PK (2022) Air tissue boundary segmentation using regional loss in real-time Magnetic Resonance Imaging video for speech production. In: 23rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2022, 18 - 22 September 2022, Incheon, pp. 3113-3117.

Official URL: https://doi.org/10.21437/Interspeech.2022-10579

Abstract

The SegNet model has been shown to provide the best performance in air-tissue boundary (ATB) segmentation of real-time Magnetic Resonance Imaging (rtMRI) videos in seen-subject conditions. SegNet uses overall binary cross entropy as its loss function. However, such a global loss function does not place enough emphasis on the regions that are most prone to errors. In this work, we explore regional loss functions, used together with the global loss, that focus on areas of the contours previously identified as error prone. Evaluation uses the global Dynamic Time Warping (DTW) distance as well as regional metrics: EVEL and VELrDTW for contour1, and ETB and TBrDTW for contour2. We show that combining regional and global losses improves both the regional and the global evaluation metrics. For the best combination of losses, the two regional metrics improve by 37.2% and 25.3% for contour1 and by 23.9% and 28.4% for contour2 over a baseline model that uses only the global loss. The global DTW distance, in turn, improves by 11.2% for contour1 and 5.6% for contour2.
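
As a rough illustration of the idea of adding a regional term to a global loss, the sketch below combines a binary cross entropy averaged over the whole mask with the same loss averaged only over an error-prone region. The abstract does not state the exact form of the regional losses, so everything here is an assumption made for illustration: the names binary_cross_entropy, combined_loss, region_mask and regional_weight are hypothetical, and the regional term is a masked binary cross entropy rather than the authors' formulation (the keywords suggest that Dice loss is also involved).

```python
import numpy as np


def binary_cross_entropy(pred, target, mask=None, eps=1e-7):
    """Mean binary cross entropy over all pixels, or only over `mask` if given."""
    pred = np.clip(pred, eps, 1.0 - eps)  # avoid log(0)
    per_pixel = -(target * np.log(pred) + (1.0 - target) * np.log(1.0 - pred))
    if mask is None:
        return float(per_pixel.mean())
    mask = np.asarray(mask, dtype=bool)
    if not mask.any():
        return 0.0  # empty region contributes nothing
    return float(per_pixel[mask].mean())


def combined_loss(pred, target, region_mask, regional_weight=1.0):
    """Global loss plus a weighted regional loss (illustrative assumption only)."""
    global_term = binary_cross_entropy(pred, target)
    regional_term = binary_cross_entropy(pred, target, mask=region_mask)
    return global_term + regional_weight * regional_term


if __name__ == "__main__":
    # Toy 4x4 segmentation mask with one badly predicted pixel inside the region.
    target = np.zeros((4, 4))
    target[1:3, 1:3] = 1.0
    pred = np.full((4, 4), 0.1)
    pred[1:3, 1:3] = 0.9
    pred[1, 1] = 0.2  # error inside the hypothetical error-prone region

    region_mask = np.zeros((4, 4), dtype=bool)
    region_mask[1, 1:3] = True

    print("global only:", binary_cross_entropy(pred, target))
    print("global + regional:", combined_loss(pred, target, region_mask))
```

In this toy setting a single wrong pixel is diluted when averaged over the whole image, but dominates the regional average, which is the motivation the abstract gives for adding regional terms.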

Item Type: Conference Paper
Publication: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Publisher: International Speech Communication Association
Additional Information: The copyright for this article belongs to International Speech Communication Association.
Keywords: Entropy; Function evaluation; Magnetism; Resonance; Speech communication; Tissue; Air-tissue boundary segmentation; Binary cross entropy loss; Boundary segmentation; Cross entropy; Dice loss; Entropy loss; Real-time; Real-time magnetic resonance imaging video; Segnet; Tissue boundary; Tongue base; Vela; Magnetic resonance imaging
Department/Centre: Division of Electrical Sciences > Electrical Engineering
Date Deposited: 07 Nov 2022 10:21
Last Modified: 07 Nov 2022 10:21
URI: https://eprints.iisc.ac.in/id/eprint/77849
