India | Computer Science Engineering | Volume 3 Issue 6, June 2014 | Pages: 362 - 365
Efficient Text Clustering for Distributed Network
Abstract: Text clustering is an important technique for improving the quality of information retrieval in both centralized and distributed environments. Most existing text clustering algorithms are designed for centralized execution and do not work well in highly distributed environments. In this paper, a probabilistic text clustering algorithm for distributed networks, such as peer-to-peer networks, is proposed. The algorithm achieves high scalability when assigning documents to clusters: it enables a peer to compare each of its documents with only a few selected clusters while maintaining cluster quality.
Keywords: text clustering, k-means, P2P network, DHT, centroid
How to Cite?: Chithra Purushothaman, Lakshmi S, "Efficient Text Clustering for Distributed Network", Volume 3 Issue 6, June 2014, International Journal of Science and Research (IJSR), Pages: 362-365, https://www.ijsr.net/getabstract.php?paperid=20131789, DOI: https://dx.doi.org/10.21275/20131789