Review Papers | Computer Science & Engineering | India | Volume 4 Issue 11, November 2015
Specific Personal Alias Withdrawal from Web and Clustering of Similar Web Documents
Snehal S. Shinde, Prakash R. Devale
A person, place, or other entity is often referred to on the web by many different names. Accurately identifying the aliases of a particular individual is useful in numerous web-related tasks, such as information extraction, relation extraction, biomedical applications, sentiment analysis, and personal name disambiguation. This paper proposes a method, based on referential ambiguity, for finding the correct aliases of a given name. Given a real name as input, lexical patterns are retrieved from the web, and candidate aliases are extracted using these patterns. The candidate aliases are then ranked using several ranking scores, including co-occurrence frequency, web Dice, hub discounting, and degree distribution. The method improves recall and attains a statistically significant mean reciprocal rank. Using the candidate aliases and data files, related web documents are then clustered. The clustering achieves high accuracy and reduces complexity.
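The ranking and evaluation steps named in the abstract can be illustrated with a minimal Python sketch. The abstract does not give formulas, so this assumes the common definition of the web Dice score, 2 * hits(name AND alias) / (hits(name) + hits(alias)) over search-engine page counts, and the usual definition of mean reciprocal rank. The function names and the page counts in the usage note are hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Sequence, Set, Tuple


def web_dice(hits_name: int, hits_alias: int, hits_both: int) -> float:
    """Dice coefficient over page counts (assumed definition, not from the paper)."""
    total = hits_name + hits_alias
    if total == 0:
        return 0.0
    return 2.0 * hits_both / total


def rank_candidates(hits_name: int,
                    candidates: Dict[str, Tuple[int, int]]) -> List[str]:
    """Sort candidate aliases by descending web Dice score.

    `candidates` maps alias -> (hits_alias, hits_both); a hypothetical layout.
    """
    return sorted(
        candidates,
        key=lambda alias: web_dice(hits_name, *candidates[alias]),
        reverse=True,
    )


def reciprocal_rank(ranked: Sequence[str], correct: Set[str]) -> float:
    """1 / position of the first correct alias, or 0.0 if none is ranked."""
    for position, alias in enumerate(ranked, start=1):
        if alias in correct:
            return 1.0 / position
    return 0.0


def mean_reciprocal_rank(runs: Sequence[Tuple[Sequence[str], Set[str]]]) -> float:
    """Average reciprocal rank over (ranked list, correct aliases) pairs."""
    if not runs:
        return 0.0
    return sum(reciprocal_rank(r, c) for r, c in runs) / len(runs)
```

For example, with invented counts, `rank_candidates(100, {"a": (50, 30), "b": (200, 10)})` places `"a"` ahead of `"b"`, because `"a"` co-occurs with the name on a larger share of its pages. The other scores listed in the abstract (co-occurrence frequency, hub discounting, degree distribution) would need their own scoring functions and are not sketched here.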
Keywords: Web mining, ranking, clustering, web text analysis, co-occurrence frequency
Edition: Volume 4 Issue 11, November 2015
Pages: 2503 - 2506
How to Cite this Article?
Snehal S. Shinde, Prakash R. Devale, "Specific Personal Alias Withdrawal from Web and Clustering of Similar Web Documents", International Journal of Science and Research (IJSR), https://www.ijsr.net/search_index_results_paperid.php?id=NOV151782, Volume 4 Issue 11, November 2015, 2503 - 2506