Towards Identifying Feature Subset Selection for Mining High Dimensional Data

Punnana Sarath Kumar, Ganiya Rajendra Kumar

doi:10.21275/17121405

Towards Identifying Feature Subset Selection for Mining High Dimensional Data

Punnana Sarath Kumar, Ganiya Rajendra Kumar

Abstract: High dimensional data is the data which has many features. Some features might have representative characteristics that can help in reducing search space in data mining activities. Therefore it is important to identify such features. The feature subset selection can improve the performance of data mining on high dimensional data. This will help in extracting business intelligence that can help in making expert decisions. However, it is challenging task to identify feature subset that is representative of all possible characteristics. Song et al. , of late, proposed a framework that can be used to select feature subset from high dimensional data. Clustering is involved in their approach. Similarly in this paper we built a prototype system that demonstrates the feature subset selection. The application uses clustering and the results reveal that they are encouraging. The results are also compared with other algorithms like C4.5, Nave Bayes, IB1 and RIPPER.

Keywords: Data mining, feature subset selection, clustering

How to Cite?: Punnana Sarath Kumar, Ganiya Rajendra Kumar, "Towards Identifying Feature Subset Selection for Mining High Dimensional Data", Volume 3 Issue 12, December 2014, International Journal of Science and Research (IJSR), Pages: 1934-1937, https://www.ijsr.net/getabstract.php?paperid=17121405, DOI: https://dx.doi.org/10.21275/17121405

Download Citation: APA | MLA | BibTeX | EndNote | RefMan