Semantic Similarity Using Web Search Engine

Meghana Raut | Nityaspandana Nalamari | Darshana Rane

Abstract: Measuring the semantic similarity between words is an important component in various tasks on the web such as relation extraction, document clustering, and automatic metadata extraction. Despite the usefulness of semantic similarity measures in these applications, accurately measuring semantic similarity between two words (or entities) is still difficult. We propose a method to estimate semantic similarity using page counts and text snippets retrieved from a web search engine for two words. Specifically, we define various word co-occurrence measures using page counts and integrate those with lexical patterns extracted from text snippets. To identify the numerous semantic relations that exist between two given words, we propose a pattern extraction algorithm and a pattern clustering algorithm. The optimal combination of page counts-based co-occurrence measures and lexical pattern clusters is obtained using support vector machines.

Keywords: web mining, information retrieval, page counts, snippets

Edition: Volume 2 Issue 12, December 2013,

Pages: 92 - 94

An Ensemble Framework for Web Content Extraction to User Query Obfuscations

Umarani. P. M | Sumathi. P

Share this Article

Downloads: 125 | Weekly Hits: ⮙1 | Monthly Hits: ⮙2

M.Tech / M.E / PhD Thesis, Software Engineering, India, Volume 3 Issue 4, April 2014

Pages: 1 - 4

Conceptual Cohesion of Classes(C3) Metrics

Girish K.K.

Share this Article

Semantic Similarity Using Web Search Engine

Similar Articles with Keyword 'information retrieval'

An Ensemble Framework for Web Content Extraction to User Query Obfuscations

Conceptual Cohesion of Classes(C3) Metrics