Professional Documents
Culture Documents
Volume: 4 Issue: 6 27 - 31
_______________________________________________________________________________________
An Enhanced K-Medoid Clustering Algorithm
Abstract Data mining is a technique of mining information from the raw data. It is a non trivial process of identifying valid and useful patterns
in data. Some of the major Data Mining techniques used for analysis are Association, Classification and Clustering etc. Clustering is used to
group homogenous kind of data, but it is different approach from classification process. In the classification process data is grouped on the
predefined domains or subjects. A basic clustering technique represents a list of topics for each data and calculates the distance for how
accurately a data fit into a group. The Cluster is helpful to get fascinating patterns and structures from an outsized set of knowledge. There are a
lots of clustering algorithms that have been proposed and they can be divided as: partitional, grid, density, model and hierarchical based. This
paper propose the new enhanced algorithm for k-medoid clustering algorithm which eliminates the deficiency of existing k-medoid algorithm. It
first calculates the initial medoids k as per needs of users and then gives relatively better cluster. It follows an organized way to generate initial
medoid and applies an effective approach for allocation of data points into the clusters. It reduces the mean square error without sacrificing the
execution time and memory use as compared to the existing k-medoid algorithm.
Keywords- Data Mining, Clustering, Partitional Clustering, K-Medoid, Enhanced K-Medoid Algorithm.
__________________________________________________*****_________________________________________________
25 23.96 18.15 Figure 2: Graphs Represent Number of Clustering and Execution Time
Comparison for Iris Dataset
29
IJRITCC | June 2016, Available @ http://www.ijritcc.org
_______________________________________________________________________________________
International Journal on Recent and Innovation Trends in Computing and Communication ISSN: 2321-8169
Volume: 4 Issue: 6 27 - 31
_______________________________________________________________________________________
C) Memory Used TABLE 4: PERFORMANCE PARAMETERS
REFERENCES
5 204596 204652 [1] L. Kaufman and P. J. Rousseau, Finding Groups in Data:
an Introduction to Cluster Analysis, John Wiley & Sons,
10 204544 201960
1990.
[2] A. K. Jain, M. N. Murty, and P. J. Flynn, Data
Clustering: A review. ACM Computing Surveys, Vol.
15 204576 212792 31 No. 3, pp.264 323, 1999.
[3] J. Han and M. Kamber. Data Mining: Concepts and
Techniques, Morgan Kaufmann Publishers, August
20 212768 210132
2000.
[4] Rui Xu and Donlad Wunsch, Survey of Clustering
25 212788 220984 Algorithm, IEEE Transactions on Neural Networks,Vol.
16, No. 3, May 2005.
[5] Sanjay Garg, Ramesh and Chandra Jain, Variation of K-
Mean Algorithm: A study for High Dimensional Large
Figure 3: Graphs Represent Number of Clusters and memory required Data Sets, Information Technology Journal,Vol. 5, No.
Comparison for Iris Dataset 6, pp.1132 1135, 2006.
[6] K. A. Abdul Nazeer and M. P. Sebastian Improving the
The comparison between the algorithms that is k-medoid Accuracy and Efficiency of the K-Means Clustering
and proposed k- medoid algorithm is done on the Iris data Algorithm Proceedings of the World Congress on
set which contains 150 data points with five attributes. Engineering , Vol.1, pp.1-3, July 2009.
[7] T. Velmurugan and T. Santhanam, A Survey of
Table 4 describe the performance summary of both Partition Based Clustering Algorithms in Data Mining:
An Experimental Approach, Journal, Vol. 10, No. 3, pp.
algorithms. According to obtained result the proposed
478- 484, 2011
algorithm is able to clustering the data points. Therefore the [8] Shalini S Singh and NC Chauhan, K- means v/s K-
proposed algorithm based on partitional clustering algorithm medoids: A Comparative Study, National Conference
is adoptable and efficient on Recent Trends in Engineering & Technology, 2011.
30
IJRITCC | June 2016, Available @ http://www.ijritcc.org
_______________________________________________________________________________________
International Journal on Recent and Innovation Trends in Computing and Communication ISSN: 2321-8169
Volume: 4 Issue: 6 27 - 31
_______________________________________________________________________________________
[9] Rui Xu and Donlad Wunsch, Survey of Clustering [12] Abhishek Patel and Purnima Singh, "New Approach For
Algorithm, IEEE Transactions on Neural Networks, Vol. K-mean and K-Medoids Algorithm", International
16, No. 3, May 2005. Journal of Computer Applications Technology and
[10] M. S. Chen, J. Han and P. S. Yu., Data Mining: An Research, 2013
Overview from a Database Perspective, IEEE [13] H.S. Park, and C.H. Jun, "A Simple and Fast Algorithm
Transactions on Knowledge and Data Engineering, Vol. for K-Medoids Clustering", Department of Industerial
8, pp. 866-883, 1998 and Management Engineering POSTECH, 2009.
[11] Bharat Pardeshi and Durga Toshniwal, "Improved K- [14] R. Fisher, UCI Machine Learning Repository, 1936.
Medoid Clustering Based On Cluster Validity Index and https://archive.ics.uci.edu/ml/datasets/Iris
Object Density", IEEE, 2010.
31
IJRITCC | June 2016, Available @ http://www.ijritcc.org
_______________________________________________________________________________________