International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064


Downloads: 111 | Views: 218

Review Papers | Aerospace Engineering | India | Volume 6 Issue 1, January 2017 | Popularity: 6.8 / 10


     

Progressive Detection of Duplicate Data

Deepa Bhattacharya, Sapna Patle


Abstract: Data duplicate detection is the process of identifying multiple representations of same or real world entities. Nowadays, data duplicate detection methods are needed to process larger datasets in shorter time maintaining the quality of the datasets and also the entities duplicated becomes increasingly difficult. This application focus on the duplicates in hierarchical datas like XML file. The data can be detected using the detection methods. Here the datasets are loaded in the applications and the processing, extraction, cleaning, separation and detection are carried out to remove the duplicated data. Comprehensive experiments show that our progressive algorithms can double the efficiency over time of traditional duplicate detection and significantly improve upon related work


Keywords: Duplicate detection, entity resolution, progressiveness, and data cleaning


Edition: Volume 6 Issue 1, January 2017


Pages: 1647 - 1649



Make Sure to Disable the Pop-Up Blocker of Web Browser




Text copied to Clipboard!
Deepa Bhattacharya, Sapna Patle, "Progressive Detection of Duplicate Data", International Journal of Science and Research (IJSR), Volume 6 Issue 1, January 2017, pp. 1647-1649, https://www.ijsr.net/getabstract.php?paperid=15011702