M.Tech / M.E / PhD Thesis | Information Technology | India | Volume 3 Issue 7, July 2014
Hybrid Approach for Outlier Detection in High Dimensional Dataset
Rohini Balkrishna Gurav, Sonali Rangdale
Abstract: An object that does not obey the behavior of normal data objects is called as Outlier. In many data analysis process, a large number of data are being recorded or sampled as data set. It is very important in data mining to find rare events, anomalies, exceptions etc. Outlier detection has important applications in many fields in which the data can contain high dimensions. Resulting the intended knowledge of outliers will become inefficient and even infeasible in high dimensional space. I devised an outlier detection structure which is based on clustering. Clustering is an unsupervised type of data mining and it does not require trained or labeled data. Combination of density based and partition clustering method for taking improvement of both densities based and distance based outlier detection. Weights are allocated to attributes depending upon their individual significance in mining task and weights are adaptive in nature. Weighted attributes are useful to reduce or remove the effect of noisy attributes. In view of the challenges of streaming data, the schemes are incremental and adaptive to concept development. In high dimensional data the number of attributes associated with the dataset is very large and it makes the dataset unmanageable. Thus a Feature Extraction technique is used to reduce the number of attributes to a manageable value.
Keywords: Attribute weighting, Dataset, DBSCAN, k-mean, unsupervised method
Edition: Volume 3 Issue 7, July 2014,
Pages: 1743 - 1746
How to Cite this Article?
Rohini Balkrishna Gurav, Sonali Rangdale, "Hybrid Approach for Outlier Detection in High Dimensional Dataset", International Journal of Science and Research (IJSR), https://www.ijsr.net/get_abstract.php?paper_id=20071403, Volume 3 Issue 7, July 2014, 1743 - 1746
How to Share this Article?
Similar Articles with Keyword 'Dataset'
Analysis of NSL-KDD Dataset for Fuzzy Based Intrusion Detection System
Macdonald Mukosera, Thabiso Peter Mpofu, Budwell Masaiti
Detection of Outliers Using Hybrid Algorithm on Categorical Datasets
Rachana P. Jakkulwar, Prof. R. A. Fadnavis
Similar Articles with Keyword 'Attribute'
Re-encryption based Attribute Revocation in Data Access Control for Multiauthority Cloud Storage
Kalyani G. Ktangale, Milind Penurkar
Privacy-Preservation of Centralized and Distributed Social Network by Using L-Diversity Algorithm
Shankaranand, P. Rajasekar