International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
www.ijsr.net | Most Trusted Research Journal Since Year 2012

ISSN: 2319-7064



Review Papers | Computer Science & Engineering | India | Volume 4 Issue 4, April 2015

A Review on Identifying the Main Content From Web Pages

Madhura R. Kaddu, Dr. R. B. Kulkarni

A web page is a web document in which huge amount of information is available and because of rapid growth of World Wide Web there is a great advantage to anyone, the user can easily access the web pages from any place through the internet. In the web page contains noisy information like menus, footers, unnecessary links, logos, etc and the main content. Most of the users are interested in only main content.But the main problem with the extraction process is to greater performance impact on web summarization, question answering system, information retrieval application because of the web page is collection of noisy and main content. So we propose an extraction process for identifying main content from web pages. In the extraction process consist of an automatic extraction techniques and hand crafted rules. In the automatic extraction techniques process the first step is to the web page is segmented into web page block and the second step is to differentiate main content from irrelevant or noisy content. In the hand crafted rule process extracts the main content from web pages by using rules which are already generated.

Keywords: DOM Tree, Content extraction, Web mining, Machine learning method, Web page Segmentation

Edition: Volume 4 Issue 4, April 2015

Pages: 2630 - 2634


How to Cite this Article?

Madhura R. Kaddu, Dr. R. B. Kulkarni, "A Review on Identifying the Main Content From Web Pages", International Journal of Science and Research (IJSR), https://www.ijsr.net/search_index_results_paperid.php?id=SUB153719, Volume 4 Issue 4, April 2015, 2630 - 2634

33 PDF Views | 26 PDF Downloads

Download Article PDF



Similar Articles with Keyword 'DOM Tree'

Review Papers, Computer Science & Engineering, India, Volume 4 Issue 4, April 2015

Pages: 2630 - 2634

A Review on Identifying the Main Content From Web Pages

Madhura R. Kaddu, Dr. R. B. Kulkarni

Share this article

Dissertation Chapters, Computer Science & Engineering, India, Volume 3 Issue 4, April 2014

Pages: 178 - 184

Mining Contents in Web Pages and Ranking of Web Pages Using Cosine Similarity

Divya C.

Share this article

Review Papers, Computer Science & Engineering, India, Volume 4 Issue 1, January 2015

Pages: 398 - 401

eDEW: Effective Data Extraction from Web

Shalaka Patil

Share this article

Research Paper, Computer Science & Engineering, Sudan, Volume 6 Issue 9, September 2017

Pages: 337 - 342

Intrusion Detection System Using Weka Data Mining Tool

Asma Abbas Hassan, Alaa F. Sheta, Talaat M. Wahbi

Share this article

Research Paper, Computer Science & Engineering, Egypt, Volume 6 Issue 10, October 2017

Pages: 126 - 131

Early Prediction of Student Success Using a Data Mining Classification Technique

Mohamed Hegazy Mohamed, Hoda Mohamed Waguih

Share this article



Similar Articles with Keyword 'Web mining'

Research Paper, Computer Science & Engineering, India, Volume 4 Issue 12, December 2015

Pages: 1255 - 1257

Data Mining in E-Commerce for Electronics Products

Manpreet Kaur, Jyoti Arora

Share this article

Research Paper, Computer Science & Engineering, India, Volume 3 Issue 6, June 2014

Pages: 2094 - 2098

An Improved Web Mining Technique to Fetch Web Data Using Apriori and Decision Tree

Rupinder Kaur, Kamaljit Kaur

Share this article

Research Paper, Computer Science & Engineering, India, Volume 3 Issue 12, December 2014

Pages: 2681 - 2688

A Proposed Framework Using Neural Network in Web Mining for Improving the Performance of E-Learning System

Dar Masroof Amin, Atul Garg

Share this article

Research Paper, Computer Science & Engineering, India, Volume 4 Issue 6, June 2015

Pages: 1598 - 1602

Search Result Optimization using Annotators

Vishal A. Kamble, Amit B. Chougule

Share this article

Research Paper, Computer Science & Engineering, India, Volume 4 Issue 8, August 2015

Pages: 1640 - 1647

Privacy Preservation Protection for Personalized Web User by k-Anonymity with Profile Construction for Web Search Engines

Uma Maheswari.T, Dr.V. Kavitha

Share this article



Similar Articles with Keyword 'Machine learning method'

Research Paper, Computer Science & Engineering, India, Volume 3 Issue 12, December 2014

Pages: 1141 - 1146

Traffic Allocation Technique in Computer Networks

Malgireddy Saidi Reddy

Share this article

Review Papers, Computer Science & Engineering, India, Volume 4 Issue 4, April 2015

Pages: 1060 - 1064

Review on Cost Estimation Prediction Using ANN

Anshul, Nitin Jain

Share this article

Review Papers, Computer Science & Engineering, India, Volume 4 Issue 4, April 2015

Pages: 2630 - 2634

A Review on Identifying the Main Content From Web Pages

Madhura R. Kaddu, Dr. R. B. Kulkarni

Share this article

Research Paper, Computer Science & Engineering, India, Volume 6 Issue 6, June 2017

Pages: 2467 - 2471

Machine Learning using MapReduce

Satwik Kumar Shiri, Satyam Thusu

Share this article

Survey Paper, Computer Science & Engineering, Tanzania, Volume 4 Issue 7, July 2015

Pages: 697 - 702

A Survey of Artificial Neural Networks machine learning methods and Applications in Bio-Neuron System

Thangaraj E, Subinson G, S. Rimlon Shibi

Share this article
Top