Web Data Extraction by Using Trinity
International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064


Downloads: 97 | Views: 420

Review Papers | Computer Science & Engineering | India | Volume 3 Issue 11, November 2014 | Popularity: 7 / 10


     

Web Data Extraction by Using Trinity

Sayali Khodade, Nilav Mukharjee


Abstract: Internet presents a huge collection of useful information so extracting information from web document has become research area for which web data extractors are used. In this article we proposed technique which works on two or more web documents generated by same sever side template and learns a regular expression that models it and then used it for extracting data from similar documents. The technique introduces some shared pattern that do provide any relevant data. We have to compared our technique to others in the literature on large collection of web documents; our results determine that our proposal better than others and no negative impact on its effectiveness.


Keywords: Web Data Extraction, Automatic wrapper generation, Web Crawler, Unsupervised learning


Edition: Volume 3 Issue 11, November 2014


Pages: 1191 - 1194



Please Disable the Pop-Up Blocker of Web Browser

Verification Code will appear in 2 Seconds ... Wait



Text copied to Clipboard!
Sayali Khodade, Nilav Mukharjee, "Web Data Extraction by Using Trinity", International Journal of Science and Research (IJSR), Volume 3 Issue 11, November 2014, pp. 1191-1194, https://www.ijsr.net/getabstract.php?paperid=OCT141128, DOI: https://www.doi.org/10.21275/OCT141128

Top