IIIT Hyderabad Publications |
|||||||||
|
Approximate Grammar for Information ExtractionAuthors: V Sriram,Ravi Sekhar Reddy,Rajeev Sangal Conference: Published in the proceedings of International Conference on Universal Knowledge and Language(ICUKL),Goa, 25 Novmeber to 29 November, 2002. Date: 2002-12-01 Report no: IIIT/TR/2002/7 AbstractIn this paper, we present the concept of Approximate grammar and how it can be used to extract information from a document. As the structure of informational strings cannot be defined well in a document, we cannot use the conventional grammar rules to represent the information. Hence, the need arises to design an approximate grammar than can be used effectively to accomplish the task of Information extraction. Approximate grammars are a novel step in this direction. The rules of an approximate grammar can be given by a user or the machine can learn the rules from an annotated document. We have performed our experiments in both the above areas and the results have been impressive. Full paper: pdf Centre for Language Technologies Research Centre |
||||||||
Copyright © 2009 - IIIT Hyderabad. All Rights Reserved. |