Downloads: 110 | Views: 377
Review Papers | Computer Science & Engineering | India | Volume 4 Issue 12, December 2015 | Popularity: 6.3 / 10
Novel Web Data Extraction Using Template Extraction and Filtering Non Information
Jaishree G Waghmare, Vikas B Maral
Abstract: Web is huge repository of information which contains different types of data in various forms. As we need to extract only the relevant data from web. Web data extractors are used to automatically extract the data from web documents. To study the problems related to web data Extraction different scientific tools are used which has broad range of applications. As we want only relevant data is to be extracted from the web. In our proposed system data is extracted using template extraction. Template matching will be based upon depth and data similarity and also removing the non-information part from the web pages by using filtering. The proposed system works on input document of variable depth.
Keywords: Information Filtering, Non Information, Template Extraction Unsupervised learning, Web data extraction
Edition: Volume 4 Issue 12, December 2015
Pages: 2102 - 2105
DOI: https://www.doi.org/10.21275/NOV152454
Please Disable the Pop-Up Blocker of Web Browser
Verification Code will appear in 2 Seconds ... Wait