IIIT Hyderabad Publications |
|||||||||
|
Can RNNs Reliably Separate Script and Language at Word and Line Level?Authors: Ajeet Kumar Singh, C V Jawahar Conference: International Conference on Document Analysis and Recognition, (ICDAR 2015 2015) Date: 2015-08-23 Report no: IIIT/TR/2015/33 AbstractIn this work, we investigate the utility of Recurrent Neural Networks (RNNs) for script and langauge identification. Both these problems have been attempted in the past with representations computed from the distribution of connected components or characters (e.g. texture, n-gram). Often these features are computed from a larger segment (a paragraph or a page). We argue that one can predict the script or language with minimal evidence (e.g. given only a word or a line) very accurately with the help of a pre-trained RNN. We propose a simple and generic solution for the task of script and language identification which do not require any special tuning. Our method represents the word images as a sequence of feature vectors, and employ the RNNs for the identification. We verify the method on a large corpus of more than 15.03M words from 55K document images comprising 15 scripts and languages. We report an accurate script and language identification at word and line level. Full paper: pdf Centre for Visual Information Technology |
||||||||
Copyright © 2009 - IIIT Hyderabad. All Rights Reserved. |