An Efficient Approach for High Dimensional Data Clustering of Gene Expression using Dynamic Error Threshold Estimation Model
International Journal of Science and Research (IJSR)

www.ijsr.net | Open Access | Fully Refereed | Peer Reviewed International Journal

ISSN: 2319-7064


Research Paper | Computer Science & Engineering | India | Volume 2 Issue 3, March 2013

K. Arun Prabha, A. Amutha

Clustering is the classification of objects into different groups or, more precisely, the partitioning of a data set into subsets. Data clustering is a common technique for statistical data analysis and is used in many fields, including machine learning, data mining, pattern recognition, and bioinformatics. Gene expression data are high dimensional, which motivates the development of clustering algorithms suited to them. Existing systems rely on popular algorithms such as k-means and CAST, but applying these algorithms to a large genome-scale gene expression data set is difficult in practice. A novel method for clustering large gene data sets is introduced. Existing work uses the TCLUST algorithm, in which a Correlation Coefficient Graph (CCG) is constructed to maintain the gene expression data values and a Tanimoto Coefficient Graph (TCG) is used to measure the similarity between gene expression data. The proposed work uses an enhanced TCLUST algorithm, called E-TCLUST. The enhanced Tanimoto clustering method exploits co-connectedness to cluster large, sparse expression data efficiently. A dynamic error threshold estimation model sets threshold values and filters out data below the given threshold. In the proposed work, a tree structure is constructed to represent the input samples, and graphs are used to identify variations. A graph re-arrangement mechanism reduces the number of iterations, and the processing time is also reduced. Extensive evaluation of the method shows optimized performance, which is depicted as a graph. The algorithm is applied to a genome-scale gene expression data set, and gene set enrichment analysis is used to obtain highly significant biological clusters. Both the TCLUST and E-TCLUST algorithms have been implemented, and their performance has been tested on three different data sets. The data sets are real gene expression data from yeast samples generated using microarray technology.
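
To make the two graphs named in the abstract concrete, the following Python sketch builds a thresholded correlation graph and then a Tanimoto graph over it. It is only an illustration under stated assumptions, not the authors' TCLUST or E-TCLUST implementation: the abstract does not give the construction, so the use of Pearson correlation, the comparison of neighbourhood vectors, the fixed (not dynamically estimated) thresholds, and the function names `correlation_graph`, `tanimoto`, and `tanimoto_graph` are all hypothetical choices.

```python
import numpy as np

def correlation_graph(expr, threshold):
    """Boolean adjacency matrix: genes i and j are linked when their
    Pearson correlation is at least `threshold` (assumed CCG construction)."""
    corr = np.corrcoef(expr)       # rows = genes, columns = samples
    adj = corr >= threshold        # edges below the threshold are filtered out
    np.fill_diagonal(adj, True)    # assumption: a gene is its own neighbour
    return adj

def tanimoto(a, b):
    """Tanimoto coefficient of two binary vectors: |a AND b| / |a OR b|."""
    both = np.logical_and(a, b).sum()
    either = np.logical_or(a, b).sum()
    return both / either if either else 0.0

def tanimoto_graph(adj, threshold):
    """Link two genes when their neighbourhoods in the correlation graph
    overlap strongly, i.e. when they are co-connected (assumed TCG construction)."""
    n = adj.shape[0]
    tcg = np.zeros((n, n), dtype=bool)
    for i in range(n):
        for j in range(i + 1, n):
            if tanimoto(adj[i], adj[j]) >= threshold:
                tcg[i, j] = tcg[j, i] = True
    return tcg

# Toy expression matrix (made-up values): genes 0-1 rise together, genes 2-3 fall together.
expr = np.array([
    [1.0, 2.0, 3.0, 4.0],
    [2.0, 4.0, 6.0, 8.1],
    [4.0, 3.0, 2.0, 1.0],
    [8.0, 6.0, 4.0, 2.2],
])
ccg = correlation_graph(expr, threshold=0.9)
tcg = tanimoto_graph(ccg, threshold=0.5)
```

In this toy case the Tanimoto graph links gene 0 with gene 1 and gene 2 with gene 3, and nothing across the two groups. The pairwise loop is quadratic in the number of genes, so a genome-scale data set would need a sparse or vectorised variant.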

Keywords: Clustering, Gene Expression, Micro-array, Bio-informatics, Data mining

Edition: Volume 2 Issue 3, March 2013

Pages: 194 - 196

How to Cite this Article?

K. Arun Prabha, A. Amutha, "An Efficient Approach for High Dimensional Data Clustering of Gene Expression using Dynamic Error Threshold Estimation Model", International Journal of Science and Research (IJSR), https://www.ijsr.net/search_index_results_paperid.php?id=IJSROFF2013083, Volume 2 Issue 3, March 2013, 194 - 196

