IIIT Hyderabad Publications |
|||||||||
|
Domain Specific Facts Extraction Using Weakly Supervised Active Learning ApproachAuthors: Vinay Pande,Tanmoy Mukherjee,Vasudeva Varma Conference: The 2013 IEEE/WIC/ACM International Conference on Web Intelligence (WI13) Location Atlanta, USA. Date: 2013-11-17 Report no: IIIT/TR/2013/91 AbstractAn ontology is defined using concepts and relationships between the concepts. In this paper, we focus on second problem: relation extraction from plain text. Generic Knowledge Bases like YAGO, Freebase, and DBPedia have made accessible huge collections of facts and their properties from various domains. But acquiring and maintaining various facts and their relations from domain specific corpus becomes very important and challenging task due to low availability of annotated data. Here, we proposed a label propagation based semisupervised approach for relation extraction by choosing most informative instances for annotation. We also proposed weakly supervised approach for data annotation using generic ontologies like Freebase, which further reduces the cost of annotating data manually. We checked efficiency of our approach by performing experiments on various domain specific corpora. Full paper: pdf Centre for Search and Information Extraction Lab |
||||||||
Copyright © 2009 - IIIT Hyderabad. All Rights Reserved. |