Removing Dedepulication Using Pattern Serach Suffix Arrays
International Journal of Science and Research (IJSR)

www.ijsr.net | Open Access | Fully Refereed | Peer Reviewed International Journal

ISSN: 2319-7064

Research Paper | Computer Science & Engineering | India | Volume 4 Issue 11, November 2015


Pratiksha Dhande, Supriya Kumari, Sushmita Tupe, Laukik Shah

With the growth of duplicate records in data sets such as voter card or PAN card registries, removing these duplicates is a major challenge. Record linkage is the process of matching records from several databases that refer to the same entities. When applied to a single database, this process is known as de-duplication. This paper investigates how to remove duplicates with the help of suffix arrays. A suffix array is a well-organized data structure for pattern searching. The paper covers similarity metrics that are commonly used to detect similar field entries and presents a broad set of duplicate detection algorithms that can identify near-duplicate records in a database. It also covers multiple techniques for improving the effectiveness and scalability of approximate duplicate detection algorithms. Finally, based on these algorithms, the paper presents how to remove duplicates from a data set.
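As a rough illustration of pattern searching with a suffix array, the sketch below builds a suffix array naively and uses binary search to locate every occurrence of a pattern. It is not taken from the paper: the function names `build_suffix_array` and `find_occurrences`, the newline-joined record layout, and the sample records are all assumptions made for this example, and the naive construction is only suitable for short inputs.

```python
from __future__ import annotations


def build_suffix_array(text: str) -> list[int]:
    """Return the start indices of all suffixes of text in lexicographic order."""
    # Naive construction by sorting the suffixes directly; fine for a short demo.
    return sorted(range(len(text)), key=lambda i: text[i:])


def find_occurrences(text: str, sa: list[int], pattern: str) -> list[int]:
    """Return the sorted start positions of pattern in text, using binary search on sa."""
    m = len(pattern)

    # Lower bound: first suffix whose m-character prefix is >= pattern.
    lo, hi = 0, len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start = lo

    # Upper bound: first suffix whose m-character prefix is > pattern.
    hi = len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid

    return sorted(sa[start:lo])


# Classic small example.
sa = build_suffix_array("banana")
print(sa)                                     # [5, 3, 1, 0, 4, 2]
print(find_occurrences("banana", sa, "ana"))  # [1, 3]

# Hypothetical records joined into one text; a repeated ID hints at a duplicate.
records = ["ABCDE1234F Asha Rao", "PQRSX5678K Ravi Kumar", "ABCDE1234F A. Rao"]
text = "\n".join(records)
hits = find_occurrences(text, build_suffix_array(text), "ABCDE1234F")
print(len(hits) > 1)                          # True: the ID appears more than once
```

All suffixes that start with the pattern sit next to each other in the sorted order, so two binary searches are enough to find the whole range. An exact match like this only flags candidate duplicates; near-duplicate fields would still need a similarity metric of the kind the abstract mentions.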

Keywords: String search, pattern matching, suffix array, suffix tree


Pages: 1217 - 1219


How to Cite this Article?

Pratiksha Dhande, Supriya Kumari, Sushmita Tupe, Laukik Shah, "Removing Dedepulication Using Pattern Serach Suffix Arrays", International Journal of Science and Research (IJSR), https://www.ijsr.net/search_index_results_paperid.php?id=NOV151363, Volume 4 Issue 11, November 2015, 1217 - 1219



