An Improved Mining of Biomedical Data from Web Documents Using Clustering

Nikita Gupta, Gunjan Pahuja

doi:10.21275/NOV161318

An Improved Mining of Biomedical Data from Web Documents Using Clustering

Nikita Gupta, Gunjan Pahuja

Abstract: Now a days web is the main source of information in every field. Web is also expanding exponentially day by day. To get the relevant information is very time consuming and is not a very easy task. Mainly users go for the various search engines to search any information. But sometimes search engines are not able to give the useful results as most of the web documents are present in unstructured manner. Data mining is extraction of information from large database. Text mining uses the many techniques of data mining. In web, biomedical documents are also increasing at a very fast pace but most of them are unstructured text. These documents can be very helpful in diagnostics, treatment and prevention of any disease. There are we millions of documents on internet about a specific term so to obtain a relevant document is very difficult. The goal of this is to apply text mining techniques to retrieve useful biomedical web documents. Here a more efficient mechanism is proposed which uses the optimised K-means clustering algorithm where it can group the similar documents in one place. This approach will help the user to get all the relevant biomedical documents at one place. On comparing our approach with the original k-means algorithm and found that our algorithm on an average giving 99.06 % F-measure.

Keywords: Data Mining, Biomedical Data, Clustering, K-means

How to Cite?: Nikita Gupta, Gunjan Pahuja, "An Improved Mining of Biomedical Data from Web Documents Using Clustering", Volume 5 Issue 2, February 2016, International Journal of Science and Research (IJSR), Pages: 1396-1400, https://www.ijsr.net/getabstract.php?paperid=NOV161318, DOI: https://dx.doi.org/10.21275/NOV161318

Download Citation: APA | MLA | BibTeX | EndNote | RefMan