IIIT Hyderabad Publications |
|||||||||
|
Improved Topic Models for Social Media via Community Detection using User Interaction and Content SimilarityAuthors: Prateek Mehta,Vasudeva Varma Conference: International FRUCT Conference on Intelligence, Social Media and Web (ISMW FRUCT-2016 2016) Date: 2016-08-28 Report no: IIIT/TR/2016/66 AbstractTopic models such as Latent Dirichlet Allocation (LDA) have historically served as a successful tool for various data mining applications on conventional documents such as news articles or academic abstracts. However, standard use of topic models on social media posts pose several poblems because social media posts are short, messy and generated non-uniformly by the users of the social media platforms. In this paper we propose a new approach of community based document pooling to train better topic models over social media posts and address these problems without modifying the basic machinery of LDA. We compare our approach to the popular user based pooling scheme and show significant improvement in the quality of topic models. Full paper: pdf Centre for Language Technologies Research Centre |
||||||||
Copyright © 2009 - IIIT Hyderabad. All Rights Reserved. |