Kasar, T and Kumar, D and Anil Prasad, MN and Girish, D and Ramakrishnan, AG (2011) MAST: multi-script annotation toolkit for scenic text. In: Proceedings of the 2011 Joint Workshop on Multilingual OCR and Analytics for Noisy Unstructured Text Data, 2011, New York, NY, USA.
PDF
Mul_OCR_Ana_Noi_Unst_Tex_Dat_1_2011.pdf - Published Version Restricted to Registered users only Download (2MB) | Request a copy |
Abstract
This paper describes a semi-automatic tool for annotation of multi-script text from natural scene images. To our knowledge, this is the maiden tool that deals with multi-script text or arbitrary orientation. The procedure involves manual seed selection followed by a region growing process to segment each word present in the image. The threshold for region growing can be varied by the user so as to ensure pixel-accurate character segmentation. The text present in the image is tagged word-by-word. A virtual keyboard interface has also been designed for entering the ground truth in ten Indic scripts, besides English. The keyboard interface can easily be generated for any script, thereby expanding the scope of the toolkit. Optionally, each segmented word can further be labeled into its constituent characters/symbols. Polygonal masks are used to split or merge the segmented words into valid characters/symbols. The ground truth is represented by a pixel-level segmented image and a '.txt' file that contains information about the number of words in the image, word bounding boxes, script and ground truth Unicode. The toolkit, developed using MATLAB, can be used to generate ground truth and annotation for any generic document image. Thus, it is useful for researchers in the document image processing community for evaluating the performance of document analysis and recognition techniques. The multi-script annotation toolokit (MAST) is available for free download.
Item Type: | Conference Proceedings |
---|---|
Publisher: | Association for Computing Machinery |
Additional Information: | Copyright of this article belongs to Association for Computing Machinery. |
Keywords: | Annotation Tool; Ground Truth; Scene Text; Multi-Script Documents; Camera-Based Document Analysis; MAST |
Department/Centre: | Division of Electrical Sciences > Electrical Engineering |
Date Deposited: | 19 Apr 2013 09:15 |
Last Modified: | 19 Apr 2013 09:15 |
URI: | http://eprints.iisc.ac.in/id/eprint/46228 |
Actions (login required)
View Item |