International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064


Downloads: 119 | Views: 190

M.Tech / M.E / PhD Thesis | Information Technology | Burma | Volume 8 Issue 1, January 2019


Comparison of Keyword-based and Semantic-based Web Page Clustering Systems

Ei Ei Moe | Hnin Hnin Htun [2]


Abstract: Today, web page clustering is useful for many applications such as categorization, cleaning, schema detection and automatic extractions. Web page clustering is classified into different categories that are hierarchical and flat clustering, online and offline clustering, soft and hard clustering, and document-based and keywords-based clustering. Among them, keyword-based web page clustering uses the single words or compounds words occurring in the web page set as the features for clustering. In this situation, these words cant precisely represent the content of the web page because the synonyms and polysemous of the word can lead the ambiguity problems. Semantic analysis is useful to solve this ambiguity problem. So, this system proposes both keyword-based and semantic-based web page clustering system, and then compares the performance between them. In the semantic analysis, words in each web page are first mapped to word senses by using supervised based word sense disambiguation method. Then, semantic-based web page clustering system uses both keywords and semantic features for clustering. After performing each cluster process, this system points out the semantic-based web page clustering system is more precise and effective than the keyword-based clustering system.


Keywords: Semantic, Word Sense Disambiguation, Clustering


Edition: Volume 8 Issue 1, January 2019,


Pages: 1511 - 1516


How to Download this Article?

Type Your Valid Email Address below to Receive the Article PDF Link


Verification Code will appear in 2 Seconds ... Wait

Top