International Journal of Science and Research (IJSR)


ISSN: 2319-7064


Research Paper | Information Technology | India | Volume 3 Issue 5, May 2014 | Rating: 6.9 / 10

A Fuzzy C-Means Clustering Based on Density Sensitive Distance Metric with a Novel Penalty Term

S. Omprakash | T. Senthamarai | M. Hemalatha

Abstract: A cluster is a group of objects that are similar to one another and dissimilar to the objects in other clusters. The main objective of clustering is to discover collections of comparable objects based on a similarity metric, which is generally specified by the user according to the requirements of the application. The distance between two objects in a cluster should therefore be well defined using an effective distance measure. Several clustering approaches are available, such as Penalty Fuzzy C-Means, but these techniques are not suitable for all applications or for very large data collections. The proposed approach uses an effective fuzzy clustering technique. Fuzzy Possibilistic C-Means (FPCM) is an effective algorithm for clustering unlabeled data that produces both membership and typicality values during the clustering process. Penalized and compensated terms are embedded in the objective function of the modified fuzzy possibilistic clustering method to construct the Penalized FPCM (PFPCM). To improve clustering accuracy, the third proposed approach uses the Improved Penalized Fuzzy C-Means (IPFCM). Its penalty term takes the spatial dependence of the objects into consideration; it is inspired by the Neighborhood Expectation Maximization (NEM) algorithm and modified according to the criterion of FCM. The improved penalized constraints are intended to give a better calculation of the distance between clusters and to increase clustering accuracy. The performance of the proposed approaches is evaluated on datasets from the University of California, Irvine (UCI) Machine Learning Repository, such as Iris, Wine, Lung Cancer, and Lymphography. The evaluation parameters are clustering accuracy, Mean Squared Error (MSE), execution time, and convergence behavior.
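
For readers unfamiliar with the baseline that the abstract builds on, the following is a minimal sketch of standard Fuzzy C-Means in NumPy. It is not the paper's FPCM, PFPCM, or IPFCM: it contains no typicality values, penalty term, or density sensitive distance. The function name `fuzzy_c_means` and all parameters (`m`, `max_iter`, `tol`, `seed`) are assumptions chosen for illustration, and plain squared Euclidean distance is used.

```python
import numpy as np

def fuzzy_c_means(X, n_clusters, m=2.0, max_iter=100, tol=1e-5, seed=0):
    """Standard FCM sketch (illustrative only, not the paper's penalized variant).

    X: array of shape (n_samples, n_features); m > 1 is the fuzzifier.
    Returns (centers, U), where U[k, i] is the membership of sample k in cluster i.
    """
    rng = np.random.default_rng(seed)
    n_samples = X.shape[0]
    # Random initial memberships; each row is normalized to sum to 1.
    U = rng.random((n_samples, n_clusters))
    U /= U.sum(axis=1, keepdims=True)
    for _ in range(max_iter):
        # Center update: mean of the data weighted by memberships raised to m.
        Um = U ** m
        centers = (Um.T @ X) / Um.sum(axis=0)[:, None]
        # Squared Euclidean distance from every sample to every center.
        d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        d2 = np.maximum(d2, 1e-12)  # guard against division by zero
        # Membership update: u_ki is proportional to d_ki^(-2/(m-1)).
        inv = d2 ** (-1.0 / (m - 1.0))
        U_new = inv / inv.sum(axis=1, keepdims=True)
        converged = np.abs(U_new - U).max() < tol
        U = U_new
        if converged:
            break
    return centers, U
```

Hard labels for computing clustering accuracy can be taken as `U.argmax(axis=1)`. One possible reading of the MSE criterion, assumed here because the abstract does not define it, is the mean squared distance from each sample to its assigned center.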

Keywords: Clustering, FCM, PFCM, MPFCM, Dataset

Edition: Volume 3 Issue 5, May 2014

Pages: 347 - 353
