International Journal of Science and Research (IJSR)

Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064



Burma | Information Technology | Volume 8 Issue 1, January 2019 | Pages: 1511 - 1516


Comparison of Keyword-based and Semantic-based Web Page Clustering Systems

Ei Ei Moe, Hnin Hnin Htun

Abstract: Web page clustering is useful for many applications, such as categorization, cleaning, schema detection, and automatic extraction. Clustering approaches can be classified along several dimensions: hierarchical or flat, online or offline, soft or hard, and document-based or keyword-based. Keyword-based web page clustering uses the single words or compound words occurring in the web page set as the features for clustering. These words cannot precisely represent the content of a web page, because synonymy and polysemy introduce ambiguity. Semantic analysis helps to resolve this ambiguity. This paper therefore presents both a keyword-based and a semantic-based web page clustering system and compares their performance. In the semantic analysis, the words in each web page are first mapped to word senses using a supervised word sense disambiguation method. The semantic-based system then uses both keywords and semantic features for clustering. After running both clustering processes, the comparison indicates that the semantic-based web page clustering system is more precise and effective than the keyword-based system.
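
The contrast described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the sense inventory `SENSES`, the sense labels, and the three sample pages are hypothetical, and a simple cue-word overlap heuristic stands in for the supervised word sense disambiguation method used in the paper. The sketch only compares page similarity, the quantity a clustering algorithm would operate on, under keyword features and under keyword-plus-sense features.

```python
import math
from collections import Counter

# Hypothetical toy sense inventory: word -> {sense id: cue words}.
# A real system would use a trained (supervised) WSD model instead.
SENSES = {
    "car": {"automobile#1": set()},
    "automobile": {"automobile#1": set()},
    "bank": {
        "bank#finance": {"loan", "money", "deposit"},
        "bank#river": {"river", "water", "fishing"},
    },
}


def tokenize(text):
    """Lowercase and split on whitespace (deliberately simplistic)."""
    return text.lower().split()


def keyword_features(text):
    """Keyword-based representation: raw term counts."""
    return Counter(tokenize(text))


def disambiguate(word, context):
    """Pick the sense whose cue words overlap most with the page's words.

    Stand-in heuristic, NOT the supervised method from the paper.
    Returns None for words missing from the toy inventory.
    """
    senses = SENSES.get(word)
    if not senses:
        return None
    return max(senses, key=lambda s: len(senses[s] & context))


def semantic_features(text):
    """Keywords plus one extra feature per disambiguated word sense."""
    tokens = tokenize(text)
    context = set(tokens)
    feats = Counter(tokens)
    for tok in tokens:
        sense = disambiguate(tok, context)
        if sense is not None:
            feats[sense] += 1
    return feats


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# Hypothetical sample pages: two about car finance, one about a river.
finance_1 = "car loan from the bank"
finance_2 = "automobile deposit at bank"
river = "fishing on the river bank"

kw_same = cosine(keyword_features(finance_1), keyword_features(finance_2))
kw_diff = cosine(keyword_features(finance_1), keyword_features(river))
sem_same = cosine(semantic_features(finance_1), semantic_features(finance_2))
sem_diff = cosine(semantic_features(finance_1), semantic_features(river))

print(f"keyword:  same topic {kw_same:.3f}, different topic {kw_diff:.3f}")
print(f"semantic: same topic {sem_same:.3f}, different topic {sem_diff:.3f}")
```

In this toy setting the keyword representation rates the river page as closer to the first finance page than the second finance page is, because "car" and "automobile" do not match and "bank" matches regardless of meaning. Adding sense features reverses that ordering, which is the kind of effect the abstract attributes to semantic analysis.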

Keywords: Semantic, Word Sense Disambiguation, Clustering

How to Cite?: Ei Ei Moe, Hnin Hnin Htun, "Comparison of Keyword-based and Semantic-based Web Page Clustering Systems", Volume 8 Issue 1, January 2019, International Journal of Science and Research (IJSR), Pages: 1511-1516, https://www.ijsr.net/getabstract.php?paperid=ART20194450, DOI: https://dx.doi.org/10.21275/ART20194450

