International Journal of Science and Research (IJSR)

Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064



Review Papers | Computer Science & Engineering | India | Volume 4 Issue 12, December 2015


Novel Web Data Extraction Using Template Extraction and Filtering Non Information

Jaishree G Waghmare | Vikas B Maral


Abstract: The web is a huge repository of information that holds different types of data in various forms, and only the relevant data needs to be extracted from it. Web data extractors are used to automatically extract data from web documents. Different scientific tools, which have a broad range of applications, are used to study the problems related to web data extraction. In our proposed system, data is extracted using template extraction. Template matching is based on depth and data similarity, and the non-information part of the web pages is removed by filtering. The proposed system works on input documents of variable depth.
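
The idea of matching a template by depth and data similarity, then filtering the non-information part, can be illustrated with a minimal Python sketch. This is not the authors' algorithm, which the abstract does not specify. It assumes that several pages generated from the same template are available, that a text node is located by its depth and tag path, and that exact text equality is a crude stand-in for data similarity. The names TextNodeCollector, text_nodes, extract_data and min_share are hypothetical and introduced only for this illustration.

```python
from collections import Counter
from html.parser import HTMLParser

# Tags that never have a closing tag, so they must not be pushed on the stack.
VOID_TAGS = {"br", "img", "hr", "meta", "link", "input"}
# Text inside these tags is never page content.
SKIPPED_TAGS = {"script", "style"}


class TextNodeCollector(HTMLParser):
    """Collect (depth, tag path, text) for every non-empty text node."""

    def __init__(self):
        super().__init__()
        self.stack = []   # currently open tags, outermost first
        self.nodes = []   # collected (depth, path, text) triples

    def handle_starttag(self, tag, attrs):
        if tag not in VOID_TAGS:
            self.stack.append(tag)

    def handle_endtag(self, tag):
        # Pop back to the matching open tag; tolerate sloppy markup.
        if tag in self.stack:
            while self.stack and self.stack.pop() != tag:
                pass

    def handle_data(self, data):
        text = data.strip()
        if text and not SKIPPED_TAGS.intersection(self.stack):
            self.nodes.append((len(self.stack), "/".join(self.stack), text))


def text_nodes(html):
    """Parse one page and return its (depth, path, text) triples in order."""
    parser = TextNodeCollector()
    parser.feed(html)
    parser.close()
    return parser.nodes


def extract_data(pages, min_share=1.0):
    """Return, per page, the text nodes that are not shared template text.

    A node counts as template (non-information) when the same text occurs at
    the same depth and tag path in at least min_share of the pages.
    """
    if len(pages) < 2:
        raise ValueError("at least two pages from the same template are needed")
    parsed = [text_nodes(page) for page in pages]
    # Count in how many pages each (depth, path, text) triple occurs.
    seen_in = Counter()
    for nodes in parsed:
        seen_in.update(set(nodes))
    threshold = min_share * len(pages)
    return [
        [text for depth, path, text in nodes
         if seen_in[(depth, path, text)] < threshold]
        for nodes in parsed
    ]
```

Because each node carries its own depth, pages of different nesting depth can be compared without aligning whole trees. A real extractor would likely replace the exact-text comparison with a tolerant similarity measure.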


Keywords: Information Filtering, Non Information, Template Extraction, Unsupervised Learning, Web Data Extraction


Edition: Volume 4 Issue 12, December 2015


Pages: 2102 - 2105

