International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064


Downloads: 122

Survey Paper | Computer Science & Engineering | India | Volume 3 Issue 12, December 2014


A Survey Report on: Methodology for Extraction of Information from Web Pages by Using Clustering Algorithm

Mahesh Dabade | Shriniwas Gadage [2]


Abstract: This paper is about data extraction from top-k web pages, which explain top k occurrences of a subject that will be of ordinary interest. For example Best Catches ever, 50 best Android diversions 2014: our top picks, and so on. Contrasted with other sorted out data on the web including advertizing data, data in top-k gives is bigger and effective, of high caliber, and by and large additional fascinating. In this way best k gives are very important. For sample, it will likewise help improve open-domain information bottoms (to help projects, for example, inquiry or reality replying). In this report, we introduce an efficient system that extracts top-k providers from pages with superior performance. Specifically, we procure more than 1.69 million top-k gives from a site corpus of 1.59 billion pages with 91.9 % exactness and 72.29 % review.


Keywords: data extraction, top-k provides, record extraction, open-domain information, clustering


Edition: Volume 3 Issue 12, December 2014,


Pages: 345 - 347


How to Download this Article?

You Need to Register Your Email Address Before You Can Download the Article PDF


How to Cite this Article?

Mahesh Dabade, Shriniwas Gadage, "A Survey Report on: Methodology for Extraction of Information from Web Pages by Using Clustering Algorithm", International Journal of Science and Research (IJSR), Volume 3 Issue 12, December 2014, pp. 345-347, https://www.ijsr.net/get_abstract.php?paper_id=SUB14411

Similar Articles with Keyword 'data extraction'

Downloads: 97

Review Papers, Computer Science & Engineering, India, Volume 3 Issue 11, November 2014

Pages: 1191 - 1194

Web Data Extraction by Using Trinity

Sayali Khodade | Nilav Mukharjee

Share this Article

Downloads: 101

Research Paper, Computer Science & Engineering, India, Volume 4 Issue 11, November 2015

Pages: 1579 - 1582

Data Hiding in H.264/AVC Video Encryption with XOR-ed User Information and Data in File Format

Neenu Shereef

Share this Article
Top