International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064

Downloads: 108 | Views: 192

Review Papers | Computer Science & Engineering | India | Volume 4 Issue 11, November 2015 | Rating: 6.6 / 10

Specific Personal Alias Withdrawal from Web and Clustering of Similar Web Documents

Snehal S. Shinde | Prakash R. Devale

Abstract: There are many names available for a person, place or an entity on the web. If accurate alias of a particular individual is identified it becomes very useful in numerous web related tasks like information extraction, relation extraction, biomedical fields, sentiment analysis, personal name disambiguation, etc. Here, one method is projected based on referential ambiguity to find the correct alias for a given name. After accepting real name as input lexical patterns are achieved from the web. Candidate aliases are extracted with the help of these patterns. The candidate aliases are ranked using various ranking scores like co occurrence frequency, web dice, hub discounting, and degree distribution. This method improves the recall and attains a statistically considerable mean reciprocal rank. Using candidate aliases and data files, related web documents are bunched or grouped. Grouping achieves high accuracy and reduces the complexity.

Keywords: Web mining, ranking, clustering, web text analysis, co-occurrence frequency

Edition: Volume 4 Issue 11, November 2015,

Pages: 2503 - 2506

How to Download this Article?

Type Your Valid Email Address below to Receive the Article PDF Link

Verification Code will appear in 2 Seconds ... Wait