Text Studies Classification of Database of Genotypes and Phenotypes using K-Nearest Neighbor Algorithm
International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
www.ijsr.net | Open Access | Fully Refereed | Peer Reviewed International Journal

ISSN: 2319-7064

Research Paper | Computer Science & Engineering | India | Volume 3 Issue 6, June 2014

Text Studies Classification of Database of Genotypes and Phenotypes using K-Nearest Neighbor Algorithm

Kolekar Suresh S, Kumbhar Satish S

The database of genotypes and phenotypes (dbGaP) is the new database to store and distribute data from studies of genome wide association. dbGaP launch by National Library of Medicine (NLM) which is part of National Institutes of Health (NIH). Searching relevant studies of particular interest accurately and completely is challenging task due to keyword based search method of dbGaP Entrez system. For given queries; the dbGaP retrieval system returns several studies that are unrelated; and it is very difficult to find how particular studies are retrieved and why they come out in a particular sequence. Thus; users have to evaluate every study description carefully to find relevant studies; which is time consuming task. Text mining is emerging research field which enable users to extract useful information from text documents and deals with retrieval; classification; clustering and machine learning techniques to classify different text document. In this research; an empirical approach is proposed and implemented with K-nearest neighbor (KNN) machine learning algorithms to classify dbGaP study text in heart; lung and blood studies. It is evident from results that this text based classification outperforms conventional keyword based search of document retrieval system provided by dbGaP.

Keywords: Bioinformatics, Data Mining, Text Mining, database of Genotypes and Phenotypes

Edition: Volume 3 Issue 6, June 2014

Pages: 1146 - 1149

Share this Article

How to Cite this Article?

Kolekar Suresh S, Kumbhar Satish S, "Text Studies Classification of Database of Genotypes and Phenotypes using K-Nearest Neighbor Algorithm", International Journal of Science and Research (IJSR), https://www.ijsr.net/search_index_results_paperid.php?id=2014436, Volume 3 Issue 6, June 2014, 1146 - 1149

84 PDF Views | 67 PDF Downloads

Download Article PDF



Similar Articles with Keyword 'Bioinformatics'

Research Paper, Computer Science & Engineering, India, Volume 3 Issue 6, June 2014

Pages: 1884 - 1886

Performance Analysis of Clustal W Algorithm on Linux Cluster

Swati Jasrotia, Salam Din

Share this Article

Research Paper, Computer Science & Engineering, India, Volume 3 Issue 6, June 2014

Pages: 1146 - 1149

Text Studies Classification of Database of Genotypes and Phenotypes using K-Nearest Neighbor Algorithm

Kolekar Suresh S, Kumbhar Satish S

Share this Article

Survey Paper, Computer Science & Engineering, India, Volume 5 Issue 3, March 2016

Pages: 1267 - 1271

Survey on Matrix Factorization Using Information Fusion

Rutuja Mane, A. N. Bandal

Share this Article

Research Paper, Computer Science & Engineering, India, Volume 2 Issue 1, January 2013

Pages: 239 - 249

Conceptual study on Ontology based Application and its Specification in Information Science

Chauhan Vipul, Chauhan Falguni

Share this Article

Research Paper, Computer Science & Engineering, India, Volume 4 Issue 4, April 2015

Pages: 1087 - 1091

Implementation of Hadoop Based Framework for Parallel Processing of Biological Data

Praveen Kumar B, Nirmala Bariker

Share this Article

Similar Articles with Keyword 'Data Mining'

Research Paper, Computer Science & Engineering, Kenya, Volume 7 Issue 5, May 2018

Pages: 1409 - 1411

Student's Performance Prediction Using FP-Tree Data Mining Techniques

Eliakim Ombati Akama

Share this Article

Research Paper, Computer Science & Engineering, India, Volume 4 Issue 10, October 2015

Pages: 2041 - 2047

Adaptive Analysis of Knowledge Engineering and Pattern Recognition on Medical Data

Jani Basha, Dasari. Rajesh

Share this Article

Research Paper, Computer Science & Engineering, India, Volume 4 Issue 11, November 2015

Pages: 1165 - 1168

Privacy Preserving Closed Frequent Pattern Mining

Anju Vijayan

Share this Article

Survey Paper, Computer Science & Engineering, India, Volume 6 Issue 1, January 2017

Pages: 1979 - 1983

A Survey on Mining User-Aware Rare Sequential Pattern

Salmath Amina KP, Farzin Ahammed T

Share this Article

Survey Paper, Computer Science & Engineering, India, Volume 3 Issue 11, November 2014

Pages: 2214 - 2216

Privacy-Preserving Mining of Association Rules in Cloud

Vishal Ravindra Redekar, Dr. K.N.Honwadkar

Share this Article

Similar Articles with Keyword 'Text Mining'

Survey Paper, Computer Science & Engineering, India, Volume 3 Issue 11, November 2014

Pages: 2205 - 2207

A Survey of Generating Multi-Document Summarizations

Patil Ajita S., P. M. Mane

Share this Article

Survey Paper, Computer Science & Engineering, India, Volume 4 Issue 10, October 2015

Pages: 1750 - 1751

Automatic Emotion Generation and Summarization form Perceptual Text ? A Survey

Sayalee Sandeep Raut, Kavita P. Shirsat

Share this Article

Survey Paper, Computer Science & Engineering, India, Volume 3 Issue 11, November 2014

Pages: 2366 - 2368

A Survey on Effective Quality Enhancement of Text Clustering & Classification Using METADATA

Padmaja Shivane, Rakesh Rajani

Share this Article

Review Papers, Computer Science & Engineering, India, Volume 4 Issue 2, February 2015

Pages: 2461 - 2466

A Review of Text Mining Techniques Associated with Various Application Areas

Dr. Shilpa Dang, Peerzada Hamid Ahmad

Share this Article

Review Papers, Computer Science & Engineering, India, Volume 3 Issue 11, November 2014

Pages: 747 - 749

A Survey of Friendbook Recommendation Services

Pankaj L. Pingate, S. M. Rokade

Share this Article
Top