International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064


Downloads: 114

Research Paper | Computer Science | India | Volume 2 Issue 9, September 2013


Focused Crawling System based on Improved LSI

Radhika Gupta [3] | AP Nidhi [4]


Abstract: In this research work we have developed a semi-deterministic algorithm and a scoring system that takes advantage of the Latent Semantic indexing scoring system for crawling web pages that belong to particular domain or is specific to the topic. The proposed algorithm calculates a preference factor in addition to the LSI score to determine which web page needs to preferred for crawling by the multi threaded crawler application, by doing this we were able to produce a retrieval system that has high recall and precision values as it builds a queue which is specific to a particular domain/topic which would not have been possible in Breath first and only LSI based information retrieval systems.


Keywords: LSI, Breath first crawler, focused crawler


Edition: Volume 2 Issue 9, September 2013,


Pages: 61 - 64


How to Download this Article?

You Need to Register Your Email Address Before You Can Download the Article PDF


How to Cite this Article?

Radhika Gupta, AP Nidhi, "Focused Crawling System based on Improved LSI", International Journal of Science and Research (IJSR), Volume 2 Issue 9, September 2013, pp. 61-64, https://www.ijsr.net/get_abstract.php?paper_id=12013157

Top