International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064


Downloads: 103

Research Paper | Computer Science & Engineering | India | Volume 5 Issue 3, March 2016


An Algorithm of Word Indexing Model for Document Summarization based on Perspective of Document

Meha Shah | Chetna Chand [2]


Abstract: Natural language processing (NLP) is an area of computer science, artificial intelligence, and computational linguistics connected with the communications between computers and natural languages. There are many challenges in NLP involve natural language understanding, that is, enabling computers to derive meaning from human or natural language input, and others involve natural language generation. Document summarization is a part of it. Many different classes of such process based on machine learning are developed. In researches earlier document summarization mostly use the similarity between sentences in the document to extract the most significant sentences. The documents as well as the sentences are indexed using traditional term indexing measures, which do not take the context into consideration. Therefore, the sentence similarity values remain independent of the context. In this paper, we propose a context sensitive document indexing model based on the Bernoulli model of randomness. The Bernoulli model of randomness has been used to find the probability of the co-occurrences of two terms in a large corpus. A new approach using the lexical association between terms to give a context sensitive weight to the document terms has been proposed. The resulting indexing weights are used to compute the sentence similarity matrix. The proposed sentence similarity measure has been used with the baseline graph-based ranking models for sentence extraction. Experiments have been conducted over the benchmark DUC data sets and it has been shown that the proposed Bernoulli-based sentence similarity model provides consistent improvements over the baseline Intra Link and Uniform Link methods.


Keywords: Data mining, Document Summarization, Text mining, Stemming, Sentence Similarity, Context Similarity


Edition: Volume 5 Issue 3, March 2016,


Pages: 1687 - 1690


How to Download this Article?

You Need to Register Your Email Address Before You Can Download the Article PDF


How to Cite this Article?

Meha Shah, Chetna Chand, "An Algorithm of Word Indexing Model for Document Summarization based on Perspective of Document", International Journal of Science and Research (IJSR), Volume 5 Issue 3, March 2016, pp. 1687-1690, https://www.ijsr.net/get_abstract.php?paper_id=NOV162223

Similar Articles with Keyword 'Data mining'

Downloads: 0

Survey Paper, Computer Science & Engineering, India, Volume 11 Issue 8, August 2022

Pages: 947 - 949

COVID-19 Prediction using Machine Learning Algorithms

Saily Suresh Patil

Share this Article

Downloads: 0

Review Papers, Computer Science & Engineering, India, Volume 13 Issue 3, March 2024

Pages: 1036 - 1039

An Investigation of the Applications of Artificial Intelligence and Other New Technologies in Smart Energy Infrastructure

Karan Chawla [5]

Share this Article
Top