Differences

This shows you the differences between two versions of the page.

Link to this comparison view

blogs:pub2010:efficient_pattern_mining_of_uncertain_data_with_sampling [2011/05/03 12:14] (current)
jvogelaar
Line 1: Line 1:
 +====== Efficient Pattern Mining of Uncertain Data with Sampling ======
  
 +
 +
 +T.G.K. Calders, C. Garboni and B. Goethals \\
 +//In M.J. Zaki, J.X. Yu, B. Ravindran & V. Pudi (Eds.), Advances in Knowledge Discovery and Data Mining (14th Pacific-Asia Conference, PAKDD 2010, Hyderabad, India, June 21-24, 2010. Proceedings,​ Part I). (Lecture Notes in Computer Science, Vol. 6118, pp. 480-487). Berlin: Springer (DOI 10.1007/​978-3-642-13657-3_51)
 +//
 +
 +===== Abstract =====
 +Mining frequent itemsets from transactional datasets is a well known problem with good algorithmic solutions. In the case of uncertain data, however, several new techniques have been proposed. Unfortunately,​ these proposals often suffer when a lot of items occur with many different probabilities. Here we propose an approach based on sampling by instantiating “possible worlds” of the uncertain data, on which we subsequently run optimized frequent itemset mining algorithms. As such we gain efficiency at a surprisingly low loss in accuracy. These is confirmed by a statistical and an empirical evaluation on real and synthetic data.
 +
 +===== Links =====
 +[[http://​alexandria.tue.nl/​campusonly/​Metis235727.pdf| Download PDF]] (183 KB)