Survey Paper | Computer Science & Engineering | India | Volume 3 Issue 12, December 2014
A Survey Report on: Methodology for Extraction of Information from Web Pages by Using Clustering Algorithm
Mahesh Dabade, Shriniwas Gadage
Abstract: This paper is about data extraction from top-k web pages, that is, pages that describe the top k instances of a subject of general interest, for example "Best Catches ever" or "50 best Android diversions 2014: our top picks". Compared with other structured data on the web, including advertising data, the data in top-k lists is larger and richer, of higher quality, and generally more interesting, which makes top-k lists very valuable. For example, they can help enrich open-domain knowledge bases that support applications such as search or fact answering. In this report, we present an efficient system that extracts top-k lists from web pages with high performance. Specifically, we extract more than 1.69 million top-k lists from a web corpus of 1.59 billion pages with 91.9% precision and 72.29% recall.
Keywords: data extraction, top-k provides, record extraction, open-domain information, clustering
Edition: Volume 3 Issue 12, December 2014
Pages: 345 - 347
How to Cite this Article?
Mahesh Dabade, Shriniwas Gadage, "A Survey Report on: Methodology for Extraction of Information from Web Pages by Using Clustering Algorithm", International Journal of Science and Research (IJSR), https://www.ijsr.net/get_abstract.php?paper_id=SUB14411, Volume 3 Issue 12, December 2014, 345 - 347