IIIT Hyderabad Publications |
|||||||||
|
Specialty MiningAuthors: Hanuma Kumar,Rohit Paravastu,Vikram Pudi Conference: Intl Conference on Data Warehousing and Knowledge Discovery (DaWaK 2010) Location Bilbao, Spain Date: 2010-08-30 Report no: IIIT/TR/2010/34 AbstractIn this paper, we consider the problem of mining the special properties of a given record in a relational dataset. In our formulation, a property is a combination of multiple attribute-value pairs. The support of a property is the number of records that satisfy it. We consider a property as special if its support occurs to us as a shock and the measure of this shock factor is more than a user defined threshold. We provide a way to define this notion of shock based on entropy. We also output the shock factor for records in the dataset in a convenient, easily-interpretable manner. An illustrated example is provided on how users can interpret the results. Experiments on real and synthetic data sets reveal interesting properties of data records that cannot be mined using traditional approaches. Full paper: pdf Centre for Data Engineering |
||||||||
Copyright © 2009 - IIIT Hyderabad. All Rights Reserved. |