M.Tech / M.E / PhD Thesis | Computer Science & Engineering | India | Volume 4 Issue 11, November 2015
Mining Sequential Patterns from Probabilistic with Source Level Uncertainty
Venkata Sasidhar Puli
Sequential Pattern Mining (SPM) is an important data mining problem. Although it is assumed in classical SPM that the data to be mined is deterministic, it is recognized that data obtained from a wide variety of data sources is inherently noisy or uncertain, such as data from sensors or data being collected from the web from different (potentially conflicting) data sources. Probabilistic database is a popular framework for modeling uncertainty. Recently, several data mining and ranking problems have been studied in probabilistic databases. In this work we proposed one of the uncertainty models for spm, namely source level uncertainty which is covered under the framework of probabilistic databases framework. We give a dynamic programming algorithm to compute the source support probability and hence the expected support of a sequence in a source-level uncertain database. We then propose optimizations to speed up the support computation task. Next, we propose probabilistic SPM algorithms based on the candidate generation and pattern growth frameworks for the source-level uncertainty model and the expected support measure. We implement these algorithms and give an empirical evaluation of the probabilistic SPM algorithms and show the scalability of these algorithms under different parameter settings using both real and synthetic datasets. Finally, we demonstrate the effectiveness of the probabilistic SPM framework at extracting meaningful patterns in the presence of noise.
Keywords: Uncertainty, SPM, Probabilistic database, optimization
Edition: Volume 4 Issue 11, November 2015
Pages: 241 - 244
How to Cite this Article?
Venkata Sasidhar Puli, "Mining Sequential Patterns from Probabilistic with Source Level Uncertainty", International Journal of Science and Research (IJSR), https://www.ijsr.net/search_index_results_paperid.php?id=NOV151090, Volume 4 Issue 11, November 2015, 241 - 244
90 PDF Views | 82 PDF Downloads
Similar Articles with Keyword 'Uncertainty'
Managing Uncertainty in Supply Chain Operating Cost Using Genetic Algorithm
Dr. Niju P. Joseph, Dr. Priyanka Surendran
Survey Paper on User's Location Hiding In Geosocial Recommendation Applications
Mayura Phadnis, Kanchan Varpe
Maintaining Privacy in Location Sharing Using LOCX
Syeda Maimuna Afreen, Shameem Akther
A Framework On: Decision Tree for Dynamic Uncertain Data
Megha Pimpalkar, Garima Singh
Ameliorating Brain Image Segmentation Using Fuzzy Clustering Techniques
Sana Tak, Toran Verma
Similar Articles with Keyword 'optimization'
Balancing the Trade-Offs between Data Availability and Query Delay in MANET's
Umar I. Masumdar, N. S. Killarikar
Paid and Non-Paid Marketing Strategies for Search Engine Optimization
Elton D'souza; Gursheen Grewal; Divya Unnikrishnan; Neelam Phadnis
Effective Approach for Localizing Jammers in Wireless Sensor Network
Ashwini S. Chimankar, V. S. Nandedkar
Classification of Data Using LAD
Aishwarya Jadhav, Vaishali Nandedkar
Clustering Medical Data Using Subspace and Parallel Approximation Algorithm
B. Thenmozhi, P. Shanthi