International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064


Downloads: 110

India | Computer Science Engineering | Volume 4 Issue 12, December 2015 | Pages: 2102 - 2105


Novel Web Data Extraction Using Template Extraction and Filtering Non Information

Jaishree G Waghmare, Vikas B Maral

Abstract: Web is huge repository of information which contains different types of data in various forms. As we need to extract only the relevant data from web. Web data extractors are used to automatically extract the data from web documents. To study the problems related to web data Extraction different scientific tools are used which has broad range of applications. As we want only relevant data is to be extracted from the web. In our proposed system data is extracted using template extraction. Template matching will be based upon depth and data similarity and also removing the non-information part from the web pages by using filtering. The proposed system works on input document of variable depth.

Keywords: Information Filtering, Non Information, Template Extraction Unsupervised learning, Web data extraction



Rate This Article!



Received Comments

No approved comments available.


Top