IIIT Hyderabad Publications |
|||||||||
|
Unsupervised Improvement of Morphological Analyzer for Inflectionally Rich LanguagesAuthors: Akshar Bharati,Rajeev Sangal,S M Bendre,M.N.S.S.K. Pavan Kumar, Aishwarya Conference: Published in the Proceedings of NLPRS-2001, Tokyo, 27-30 November 2001 Date: 2001-11-30 Report no: IIIT/TR/2001/4 AbstractThis paper presents an algorithm for unsupervised learning of morphological analysis and generation of inflectionally rich languages like Hindi, given a low coverage morph and a corpus of raw text. It assumes no particular theoretical model of morph, but can work with any morph that defines classes of stem that behave similarly. The morph learning algorithm uses the concept of 'observable paradigm'. The results of the algorithm are encouraging with the coverage of a primitive morph going up from 32% to about 63% and that of an advanced morph going up from 96% to about 97%. Full paper: pdf Centre for Language Technologies Research Centre |
||||||||
Copyright © 2009 - IIIT Hyderabad. All Rights Reserved. |