Downloads: 126
India | Computer Science Engineering | Volume 3 Issue 6, June 2014 | Pages: 362 - 365
Efficient Text Clustering for Distributed Network
Abstract: Text clustering is an important technique for improving the quality of information retrieval in both centralized and distributed environment. Most of the existing text clustering algorithms are designed for central execution; which are not work well on highly distributed environment. In this paper; an algorithm called probabilistic text clustering for distributed network such as peer to peer network is proposed. This algorithm achieves high scalability for assigning documents to clusters. It enables a peer to compare each of its documents only with very few selected clusters; maintain cluster quality.
Keywords: text clustering, k- means, p2p network, DHT, centroid
Rating submitted successfully!
Received Comments
No approved comments available.