An Algorithm of Word Indexing Model for Document Summarization based on Perspective of Document
International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
www.ijsr.net | Open Access | Fully Refereed | Peer Reviewed International Journal

ISSN: 2319-7064

Research Paper | Computer Science & Engineering | India | Volume 5 Issue 3, March 2016

An Algorithm of Word Indexing Model for Document Summarization based on Perspective of Document

Meha Shah, Chetna Chand

Natural language processing (NLP) is an area of computer science, artificial intelligence, and computational linguistics connected with the communications between computers and natural languages. There are many challenges in NLP involve natural language understanding, that is, enabling computers to derive meaning from human or natural language input, and others involve natural language generation. Document summarization is a part of it. Many different classes of such process based on machine learning are developed. In researches earlier document summarization mostly use the similarity between sentences in the document to extract the most significant sentences. The documents as well as the sentences are indexed using traditional term indexing measures, which do not take the context into consideration. Therefore, the sentence similarity values remain independent of the context. In this paper, we propose a context sensitive document indexing model based on the Bernoulli model of randomness. The Bernoulli model of randomness has been used to find the probability of the co-occurrences of two terms in a large corpus. A new approach using the lexical association between terms to give a context sensitive weight to the document terms has been proposed. The resulting indexing weights are used to compute the sentence similarity matrix. The proposed sentence similarity measure has been used with the baseline graph-based ranking models for sentence extraction. Experiments have been conducted over the benchmark DUC data sets and it has been shown that the proposed Bernoulli-based sentence similarity model provides consistent improvements over the baseline Intra Link and Uniform Link methods.

Keywords: Data mining, Document Summarization, Text mining, Stemming, Sentence Similarity, Context Similarity

Edition: Volume 5 Issue 3, March 2016

Pages: 1687 - 1690

Share this Article

How to Cite this Article?

Meha Shah, Chetna Chand, "An Algorithm of Word Indexing Model for Document Summarization based on Perspective of Document", International Journal of Science and Research (IJSR), https://www.ijsr.net/search_index_results_paperid.php?id=NOV162223, Volume 5 Issue 3, March 2016, 1687 - 1690

119 PDF Views | 91 PDF Downloads

Download Article PDF



Similar Articles with Keyword 'Data mining'

Research Paper, Computer Science & Engineering, Kenya, Volume 7 Issue 5, May 2018

Pages: 1409 - 1411

Students Performance Prediction Using FP-Tree Data Mining Techniques

Eliakim Ombati Akama

Share this Article

Research Paper, Computer Science & Engineering, India, Volume 4 Issue 11, November 2015

Pages: 1165 - 1168

Privacy Preserving Closed Frequent Pattern Mining

Anju Vijayan

Share this Article

Review Papers, Computer Science & Engineering, India, Volume 3 Issue 12, December 2014

Pages: 708 - 710

A Review on Efficient Algorithms for Mining High Utility Item Sets

Nutan Sarode, Devendra Gadekar

Share this Article

Research Paper, Computer Science & Engineering, India, Volume 4 Issue 11, November 2015

Pages: 1227 - 1231

An Efficient Clustering Based High Utility Infrequent Weighted Item Set Mining Approach

Dr. N. Umadevi, A. Gokila Devi

Share this Article

Research Paper, Computer Science & Engineering, India, Volume 4 Issue 4, April 2015

Pages: 3069 - 3074

On the Utilization Aspect of Document Data for Mining the Side Information

N.S. Krishna Prasad, S. Dhana Sekaran

Share this Article

Similar Articles with Keyword 'Document Summarization'

Survey Paper, Computer Science & Engineering, India, Volume 3 Issue 11, November 2014

Pages: 2205 - 2207

A Survey of Generating Multi-Document Summarizations

Patil Ajita S., P. M. Mane

Share this Article

Review Papers, Computer Science & Engineering, India, Volume 4 Issue 5, May 2015

Pages: 3156 - 3159

Review on Multi Document Summarization Using Ontology

Rajshree S Hingane, Devendra P Gadekar

Share this Article

Research Paper, Computer Science & Engineering, India, Volume 5 Issue 3, March 2016

Pages: 1687 - 1690

An Algorithm of Word Indexing Model for Document Summarization based on Perspective of Document

Meha Shah, Chetna Chand

Share this Article

Research Paper, Computer Science & Engineering, India, Volume 4 Issue 3, March 2015

Pages: 1944 - 1948

Result Evaluation of Graph Based Multi Document Summarization

Vijay Sonawane, Rakesh Salam

Share this Article

Review Papers, Computer Science & Engineering, India, Volume 4 Issue 8, August 2015

Pages: 1382 - 1384

Text Summarization using weighted Archetypal Analysis

Vaishali Shakhapure, A. R. Kulkarani

Share this Article

Similar Articles with Keyword 'Text mining'

Survey Paper, Computer Science & Engineering, India, Volume 3 Issue 11, November 2014

Pages: 2205 - 2207

A Survey of Generating Multi-Document Summarizations

Patil Ajita S., P. M. Mane

Share this Article

Review Papers, Computer Science & Engineering, India, Volume 4 Issue 2, February 2015

Pages: 2461 - 2466

A Review of Text Mining Techniques Associated with Various Application Areas

Dr. Shilpa Dang, Peerzada Hamid Ahmad

Share this Article

Review Papers, Computer Science & Engineering, India, Volume 3 Issue 11, November 2014

Pages: 747 - 749

A Survey of Friendbook Recommendation Services

Pankaj L. Pingate, S. M. Rokade

Share this Article

Research Paper, Computer Science & Engineering, India, Volume 4 Issue 4, April 2015

Pages: 3069 - 3074

On the Utilization Aspect of Document Data for Mining the Side Information

N.S. Krishna Prasad, S. Dhana Sekaran

Share this Article

Research Paper, Computer Science & Engineering, India, Volume 5 Issue 2, February 2016

Pages: 1396 - 1400

An Improved Mining of Biomedical Data from Web Documents Using Clustering

Nikita Gupta, Gunjan Pahuja

Share this Article

Similar Articles with Keyword 'Stemming'

Research Paper, Computer Science & Engineering, India, Volume 5 Issue 3, March 2016

Pages: 1687 - 1690

An Algorithm of Word Indexing Model for Document Summarization based on Perspective of Document

Meha Shah, Chetna Chand

Share this Article

Research Paper, Computer Science & Engineering, India, Volume 4 Issue 6, June 2015

Pages: 1862 - 1864

User Profile Based Client Side Instant Search Mechanism With Use of TLB Mechanism and Fuzzy Search

Rupali A. Ingale, J. L. Chaudhari

Share this Article

Case Studies, Computer Science & Engineering, India, Volume 5 Issue 10, October 2016

Pages: 1987 - 1990

Improving Auto Bug Triage by Effective Data Reduction

Roshna V. Sangle, Rajendra D. Gawali

Share this Article

Survey Paper, Computer Science & Engineering, India, Volume 4 Issue 10, October 2015

Pages: 1707 - 1708

Survey Paper on Twitter Sentiment Analysis Using Portar Stemming Algorithm

Nishad Patil, Tingre Sayali, Thorat Kalyani, Shivshetty Swapnil, Patil Shwetal

Share this Article

Review Papers, Computer Science & Engineering, India, Volume 5 Issue 1, January 2016

Pages: 710 - 712

World Wide Web Metasearch Using TF-IDF Method

S. P. Phadtare, S. B. Magdum

Share this Article

Similar Articles with Keyword 'Sentence Similarity'

M.Tech / M.E / PhD Thesis, Computer Science & Engineering, India, Volume 4 Issue 5, May 2015

Pages: 3018 - 3020

A Query-Based Summarizer based on the Context

Divya Vidyadharan, Anju CR

Share this Article

Research Paper, Computer Science & Engineering, India, Volume 5 Issue 3, March 2016

Pages: 1687 - 1690

An Algorithm of Word Indexing Model for Document Summarization based on Perspective of Document

Meha Shah, Chetna Chand

Share this Article
Top