IIIT Hyderabad Publications
HMM Based Chunker for Hindi

Authors: Akshay Singh, S M Bendre, Rajeev Sangal
Conference: Published in the Proceedings of IJCNLP-05: The Second International Joint Conference on Natural Language Processing, 11-13 October 2005, Jeju Island, Republic of Korea
Date: 2005-12-01
Report no: IIIT/TR/2005/9

Abstract

This paper presents an HMM-based chunk tagger for Hindi. Various tagging schemes for marking chunk boundaries are discussed along with their results. Contextual information is incorporated into the chunk tags in the form of part-of-speech (POS) information. This information is also added to the tokens themselves to achieve better precision. Error analysis is carried out to reduce the number of common errors. It is found that for certain classes of words, using the POS information alone is more effective than using a combination of word and POS tag as the token. Finally, chunk labels are also marked on the chunks.

Full paper: pdf
Centre: Language Technologies Research Centre
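The ideas named in the abstract (boundary tagging schemes, POS-enriched chunk tags, and a choice between POS-only and word+POS tokens) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the tag names `B`, `I`, `E`, `S`, the scheme names `"2-tag"` and `"4-tag"`, the `word_POS` and `POS:tag` string formats, the function names, and the transliterated example sentence with its POS labels. The sketch only prepares the token and tag sequences that an HMM tagger could be trained on; it does not implement the HMM itself.

```python
from typing import List, Tuple

# A chunked sentence: a list of chunks, each a list of (word, pos) pairs.
Chunk = List[Tuple[str, str]]


def boundary_tags(chunks: List[Chunk], scheme: str = "2-tag") -> List[str]:
    """Return one chunk-boundary tag per token.

    Hypothetical schemes (names and tag inventory are assumptions):
      "2-tag": B = first token of a chunk, I = any other token.
      "4-tag": B = first, I = middle, E = last, S = single-token chunk.
    """
    tags = []
    for chunk in chunks:
        n = len(chunk)
        for i in range(n):
            if scheme == "2-tag":
                tags.append("B" if i == 0 else "I")
            elif scheme == "4-tag":
                if n == 1:
                    tags.append("S")
                elif i == 0:
                    tags.append("B")
                elif i == n - 1:
                    tags.append("E")
                else:
                    tags.append("I")
            else:
                raise ValueError(f"unknown scheme: {scheme}")
    return tags


def make_tokens(chunks: List[Chunk], use_word: bool = True) -> List[str]:
    """Build the observation sequence: word_POS tokens, or POS tags alone."""
    tokens = []
    for chunk in chunks:
        for word, pos in chunk:
            tokens.append(f"{word}_{pos}" if use_word else pos)
    return tokens


def contextual_tags(chunks: List[Chunk], scheme: str = "2-tag") -> List[str]:
    """Enrich each boundary tag with the token's POS, e.g. 'NN:I'."""
    flat = [pair for chunk in chunks for pair in chunk]
    return [f"{pos}:{tag}"
            for (_, pos), tag in zip(flat, boundary_tags(chunks, scheme))]


# Illustrative sentence (transliterated; POS labels are assumed, not from the paper).
chunks = [
    [("laal", "JJ"), ("kitaab", "NN")],
    [("mez", "NN"), ("par", "PSP")],
    [("hai", "VM")],
]

print(boundary_tags(chunks, "2-tag"))
print(boundary_tags(chunks, "4-tag"))
print(make_tokens(chunks, use_word=True))
print(make_tokens(chunks, use_word=False))
print(contextual_tags(chunks, "2-tag"))
```

Switching `use_word` to `False` gives the POS-only observation sequence, the kind of token that the abstract reports as more effective for certain classes of words.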