
A Fuzzy Kohonen Local Information C-Means Clustering for Remote Sensing Imagery


Krishna Kant Singh¹, M. J. Nigam², Kirat Pal¹ and Akansha Mehrotra¹
¹Earthquake Engineering Department, Indian Institute of Technology, Roorkee, India
²Electronics and Computer Engineering Department, Indian Institute of Technology, Roorkee, India

ABSTRACT
This paper presents a neuro-fuzzy clustering algorithm, Fuzzy Kohonen Local Information C-Means (FKLICM), for classification of remote sensing images. The proposed algorithm hybridizes the conventional Kohonen clustering network with Fuzzy Local Information C-Means (FLICM) to produce a more efficient and accurate clustering algorithm. The algorithm first forms a fused image from three multispectral bands and the pan band of Landsat 7 Enhanced Thematic Mapper Plus (ETM+) using the Brovey transform. The fused image is a three-band image with higher resolution and better visual perception. It is then reduced to a single-band image using principal component analysis (PCA). The FKLICM algorithm is applied to the PC-1 image to classify the remote sensing image into different land cover types. Integrating the neural network with a fuzzy system combines the advantages and overcomes the limitations of both technologies. The experimental results of the proposed algorithm are compared with those of two other algorithms, FCM and GIFP-FCM. The classification results and accuracy assessment show that FKLICM yields better results than the other methods.
Keywords:
Brovey transform, Clustering, FCM, FLICM, Kohonen clustering network, PCA.

1. INTRODUCTION

Remote sensing images are widely used in applications such as urban area monitoring and planning, disaster management, climate studies, natural hazard monitoring and land cover monitoring of forest resources. Remote sensing applications employ image classification to identify the different land cover types and assign them a unique grey level to create a thematic image. An important unsupervised classification method is clustering, which groups a set of patterns or vectors into different clusters based on the internal homogeneity and the external separation [1–3]. In recent years, artificial neural networks and fuzzy logic-based methods have been widely used for classification of remote sensing images owing to their inherent advantages over the traditional approaches [4]. Image classification techniques fall under two categories, unsupervised and supervised classification. Supervised classification techniques require analyst-specified training data to perform classification. Unsupervised classification is a technique that analyses a large number of unknown pixels and groups them into homogeneous regions or classes based on natural groupings present in the image values. The widely used unsupervised classifiers are Kohonen clustering network (KCN) [5], fuzzy c-means (FCM) [6, 7] and fuzzy Kohonen clustering network (FKCN) [8, 9]. The KCN, developed by Kohonen, groups the input data into clusters based on competitive learning using the Euclidean distance metric. The cluster unit whose weight vector matches the input vector closely is selected as the winner. The weights of the winning node and its neighbours in the Kohonen layer are updated to more closely resemble the input vector. KCNs have several limitations. FCM and other methods based on FCM perform clustering by optimizing the value of an objective function. The limitation of FCM is that it gets stuck at local optima and an appropriate value of fuzziness index m is required for better performance of FCM. Höppner and Klawonn proposed IFP-FCM, which assigns crisp membership degrees but is less sensitive to noise [10]. Fuzzy local information c-means clustering (FLICM) is an improvement of the FCM algorithm. It introduces a fuzzy factor which improves the clustering results as well as making the algorithm insensitive to noise [11]. Generalized fuzzy c-means clustering algorithm with improved fuzzy partitions (GIFP-FCM) [12] is a generalized form of FCM and IFP-FCM; it provides better clustering and overcomes the limitations of FCM and IFP-FCM. One of the drawbacks of GIFP-FCM is that it is sensitive to noise as it does not take into consideration the spatial information contained in the pixels.

IETE TECHNICAL REVIEW | VOL 31 | NO 1 | JANFEB 2014 75


Singh KK, et al.: A Fuzzy Kohonen Local Information C-Means Clustering for Remote Sensing Imagery

To overcome this problem, another clustering algorithm, the fuzzy clustering algorithm with nonlocal adaptive spatial constraint (FCA-NLASC), was proposed [13]. FCA-NLASC has a nonlocal adaptive spatial constraint term, which is useful in the case of noisy image segmentation. A number of image segmentation techniques are discussed in [14, 15]. In this paper, a fuzzy Kohonen local information c-means clustering (FKLICM) algorithm for classification of remote sensing images is proposed. The algorithm first fuses the multispectral (MS) bands and pan band using the Brovey transform to obtain a higher-resolution image. The fused image contains three bands. Thus, to obtain a single-band image, PCA transformation is applied on the fused image and its first component, that is, the PC-1 image, is extracted. The FKLICM clustering algorithm is applied on the PC-1 image to classify it into different classes. FKLICM is a neuro-fuzzy model that combines the Kohonen clustering network with the FLICM clustering algorithm. The experimental results show that the proposed hybridized neuro-fuzzy model, FKLICM, is much more efficient and effective for classification of remote sensing images.

2. BACKGROUND

In this section, the Kohonen clustering network, fuzzy c-means and FLICM clustering algorithms are examined to identify their advantages and limitations.

2.1 KCN

The KCN is the simplest neural network, without any activation function and hidden layer. The network has only two layers, the input layer and output layer. The neuron closest to the input vector in terms of Euclidean distance is the winner neuron [5]. The weight of the winner and its predefined neighbours are updated using a learning rule. The operation of KCN is summarized below.

Step 1. Initialization: initialize the cluster centres $z_i$ $(1 \le i \le c)$, learning rate $Y$ $(0 \le Y \le 1)$, threshold $\varepsilon$ $(\varepsilon > 0)$ and topological neighbourhood parameters.

Step 2. Selection of winner: calculate the squared Euclidean distance for $i = 1, 2, \ldots, c$.

$$d_{ik}^2 = \|x_k - z_i\|^2 \quad \text{for } k = 1, 2, \ldots, N \text{ and } i = 1, 2, \ldots, c \tag{1}$$

where $x_k$ denotes the kth pixel of the input image X and N is the total number of pixels in the image X. The winning output neuron is decided by

$$\min\{d_{ik}^2\} \quad \text{for } k = 1, 2, \ldots, N \text{ and } i = 1, 2, \ldots, c$$

Step 3. Weight update: the weight of the output neuron is updated by

$$z_{i,t} = z_{i,t-1} + Y_{ik,t}\,(x_k - z_{i,t-1}) \tag{2}$$

where $Y$ is the learning rate.

Step 4. Update the learning rate $Y$.

Step 5. If $\|z_{i,t} - z_{i,t-1}\| > \varepsilon$ then go to step 2, otherwise go to step 6.

Step 6. Output the final clustering result.

2.2 FCM

The FCM algorithm is widely used for image clustering [7]. FCM groups the data points into c clusters. Each cluster has a cluster centre, $z_i$. The cluster centres are computed by optimizing the value of an objective function. Each point is assigned a fuzzy membership in the range [0, 1] in every cluster, and the memberships of a point across all clusters sum to one. The fuzzy membership values are placed in the membership matrix. The FCM algorithm minimizes the objective function, $J_m$, given by Equation (3):

$$J_m = \sum_{k=1}^{N} \sum_{i=1}^{c} u_{ik}^{m} \, \|z_i - x_k\|^2 \tag{3}$$

Clustering is done by iteratively optimizing the objective function ($J_m$). The steps involved in FCM are as follows.

Step 1. Initialize the cluster centres $z_i$ $(1 \le i \le c)$, fuzziness index $m$ $(1 < m < \infty)$, fuzzy partition matrix U, threshold $\varepsilon$ $(\varepsilon > 0)$ and number of iterations.

Step 2. Calculate the fuzzy membership matrix $U = [u_{ik}]$ by Equation (4):

$$u_{ik} = \left( \sum_{l=1}^{c} \left( \frac{\|z_i - x_k\|}{\|z_l - x_k\|} \right)^{2/(m-1)} \right)^{-1} \tag{4}$$

for $1 \le i \le c$ and $1 \le k \le N$, where c is the number of clusters, $x_k$ denotes the kth pixel of the input image X and N is the total number of pixels in the image X.


Step 3. The cluster centres are updated by Equation (5):

$$z_i = \frac{\sum_{k=1}^{N} u_{ik}^{m} \, x_k}{\sum_{k=1}^{N} u_{ik}^{m}} \tag{5}$$

Step 4. If $\|U^{t+1} - U^{t}\| > \varepsilon$ then go to step 2, otherwise go to step 5.

Step 5. Output the final clustered image by assigning the pixel $x_k$ to the class with the highest membership value.

The limitation of FCM is that its performance depends upon the choice of fuzziness index m. An inappropriate value of m leads to unsatisfactory results. The inability of FCM to consider local spatial information makes it highly sensitive to noise. Thus, FCM performs poorly in the case of noisy images.

2.3 FLICM

To further improve the performance of FCM, a new clustering technique named FLICM was introduced by Krinidis and Chatzis [11]. It uses the local spatial and grey-level information in its objective function. The objective function of FLICM is given by Equation (6):

$$J_m = \sum_{k=1}^{N} \sum_{i=1}^{c} \left[ u_{ik}^{m} \, \|z_i - x_k\|^2 + a_{ik} \right] \tag{6}$$

The fuzzy factor $a_{ik}$ is mathematically expressed as follows:

$$a_{ik} = \sum_{j \in N_r} \frac{1}{d(k, j) + 1} \, (1 - u_{ij})^{m} \, \|z_i - x_j\|^2 \tag{7}$$

where the kth pixel of the input image X is the centre of the local window $N_r$, $x_j$ represents the neighbourhood pixels and $d(k, j)$ is the spatial Euclidean distance between pixels k and j. The fuzzy membership matrix $U = [u_{ik}]$ is computed using Equation (8):

$$u_{ik} = \left( \sum_{l=1}^{c} \left( \frac{\|z_i - x_k\|^2 + a_{ik}}{\|z_l - x_k\|^2 + a_{lk}} \right)^{1/(m-1)} \right)^{-1} \tag{8}$$

for $1 \le i \le c$ and $1 \le k \le N$, where $x_k$ denotes the kth pixel of the input image X and N is the total number of pixels in the image X. The cluster centres are updated by Equation (9):

$$z_i = \frac{\sum_{k=1}^{N} u_{ik}^{m} \, x_k}{\sum_{k=1}^{N} u_{ik}^{m}} \tag{9}$$

On convergence of the algorithm, the pixels of the image are assigned to the class with the highest membership value to obtain the classified image.

3. PROPOSED ALGORITHM

The proposed algorithm consists of two main steps:

1. image fusion and dimensionality reduction;
2. FKLICM.

These are discussed in the following sections. The flowchart of the proposed algorithm is shown in Figure 1.

Figure 1: Flowchart of the proposed algorithm.
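As a recap of the FLICM update from Section 2.3, the following Python sketch performs one iteration of Equations (7)–(9) on a small grey-level image. It is only an illustration, not the authors' Matlab implementation. The function name `flicm_step`, the 3 × 3 window, the exclusion of the centre pixel from its own neighbourhood, the small constant `tiny`, and the toy image with its initial values are all assumptions made for this example.

```python
import math

def flicm_step(img, centres, u, m=2.0):
    """One FLICM iteration on a 2-D grey-level image (a list of rows).

    u[i][r][c] is the membership of pixel (r, c) in cluster i.
    Returns (new_u, new_centres). Assumes a 3x3 local window N_r.
    """
    h, w, c = len(img), len(img[0]), len(centres)
    tiny = 1e-12  # assumption: guards against division by zero

    def fuzzy_factor(i, r, col):
        # Equation (7): each neighbour j is weighted by 1 / (d(k, j) + 1).
        total = 0.0
        for dr in (-1, 0, 1):
            for dc in (-1, 0, 1):
                rr, cc = r + dr, col + dc
                # Assumption: skip the centre pixel and out-of-image positions.
                if (dr == 0 and dc == 0) or not (0 <= rr < h and 0 <= cc < w):
                    continue
                dist = math.hypot(dr, dc)
                total += ((1.0 - u[i][rr][cc]) ** m
                          * (img[rr][cc] - centres[i]) ** 2 / (dist + 1.0))
        return total

    new_u = [[[0.0] * w for _ in range(h)] for _ in range(c)]
    for r in range(h):
        for col in range(w):
            # Squared distance plus fuzzy factor, as used in Equation (8).
            cost = [(img[r][col] - centres[i]) ** 2 + fuzzy_factor(i, r, col) + tiny
                    for i in range(c)]
            for i in range(c):
                s = sum((cost[i] / cost[l]) ** (1.0 / (m - 1.0)) for l in range(c))
                new_u[i][r][col] = 1.0 / s

    # Equation (9): membership-weighted mean of the pixel values.
    new_centres = []
    for i in range(c):
        num = sum(new_u[i][r][col] ** m * img[r][col]
                  for r in range(h) for col in range(w))
        den = sum(new_u[i][r][col] ** m for r in range(h) for col in range(w))
        new_centres.append(num / den)
    return new_u, new_centres

# Toy example: dark left half, bright right half, one isolated bright pixel.
image = [[10, 10, 10, 200, 200, 200] for _ in range(6)]
image[2][1] = 200
centres = [0.0, 255.0]
u = [[[0.5] * 6 for _ in range(6)] for _ in range(2)]
for _ in range(20):
    u, centres = flicm_step(image, centres, u)
labels = [[max(range(2), key=lambda i: u[i][r][c]) for c in range(6)]
          for r in range(6)]
```

Setting the fuzzy factor to zero in this sketch would reduce the membership update to the FCM form of Equation (4), which is one way to see how the local term changes the result near noisy pixels.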


3.1 Image Fusion and Dimensionality Reduction

Satellite images consist of multiple bands. These bands contain complementary information. Image fusion is done to combine the information in different bands into one image. The new image obtained is more appropriate for further processing as it has higher resolution and better visual quality. In this paper, Brovey transformation [16] is used to fuse the RGB bands of the MS image with the pan band. Brovey transformation is a numerical technique that uses a ratio algorithm to fuse images. The mathematical definition of Brovey transformation is given in Equation (10):

$$I_{fuse,n}(i, j) = \frac{I_n(i, j) \, I_{pan}(i, j)}{\sum_{b=1}^{3} I_b(i, j)} \tag{10}$$

where $I_{fuse,n}$ is the nth band of the fused image, $I_n$ is the nth original MS band image and $I_{pan}$ is the original pan band image. In this paper, a Landsat 7 ETM+ image is used, which consists of eight bands as shown in Table 1 [17]. Three MS bands are fused with the pan band; thus, the value of n ranges from 1 to 3 in Equation (10).

Table 1: Landsat 7 ETM+ bands

Band number   Spectral range (μm)   Ground resolution (m)
1             0.45–0.515            30
2             0.52–0.605            30
3             0.63–0.69             30
4             0.75–0.90             30
5             1.55–1.75             30
6             10.40–12.5            60
7             2.09–2.35             30
Pan           0.52–0.90             15

The fused image is a three-dimensional image consisting of three bands and can be expressed as a 3D column vector, $I_{fuse}$:

$$I_{fuse}(x, y) = \begin{bmatrix} I_{fuse,1} \\ I_{fuse,2} \\ I_{fuse,3} \end{bmatrix} \tag{11}$$

Further, the first component (PC-1) of principal component analysis (PCA) [18–20] is used to reduce the dimensionality of the image to one dimension. If the images are of size $H \times W$ there will be a total of $HW$ such vectors comprising all of the pixels in the images. For simplicity, $I_{fuse}^{k}$ is used to represent the vector $I_{fuse}(x, y)$, while k represents an index with $1 \le k \le N$ where $N = H \times W$. To create the feature vector using PCA, we proceed as follows. The average vector, $\bar{x}$, of a vector population can be approximated by

$$\bar{x} = \frac{1}{N} \sum_{k=1}^{N} I_{fuse}^{k} \tag{12}$$

The difference between each vector and the average vector is $D_k = I_{fuse}^{k} - \bar{x}$. The covariance matrix $C_I$ has eigenvectors $e_i$ and corresponding eigenvalues $\lambda_i$. The covariance matrix $C_I$ can be approximated by

$$C_I = \frac{1}{N - 1} \sum_{k=1}^{N} D_k D_k^{T} \tag{13}$$

where we use $N - 1$ instead of $N$ to obtain an unbiased estimate of $C_I$. It is assumed that the generated eigenvectors of $C_I$ are sorted in decreasing order based on the eigenvalues, that is, $\lambda_i \ge \lambda_{i+1}$.

The feature vector space is obtained by projecting $I_{fuse}(x, y)$ onto eigenvector space for each pixel at spatial location (i, j) using PCA, that is,

$$X(i, j) = \begin{bmatrix} X_1(i, j) \\ X_2(i, j) \\ X_3(i, j) \end{bmatrix} = A \left( I_{fuse}(x, y) - \bar{x} \right) \tag{14}$$

The eigenvectors of $C_I$ are arranged as rows of the matrix A. The first row of the matrix is the eigenvector with the largest eigenvalue and so on. $X_1$ is the PC-1 image obtained from Equation (14).

3.2 FKLICM

The FKLICM algorithm is proposed in this paper. FKLICM uses a neuro-fuzzy hybrid approach. It is a hybridization of KCN with FLICM. KCN is the simplest neural network, without any activation function and hidden layer. Thus, hybridizing KCN with FLICM is less complex than other neuro-fuzzy systems. The use of a neuro-fuzzy method overcomes the limitations of conventional methods and has the advantages of both neural networks and fuzzy systems. To integrate FLICM with KCN, the following algorithm is used.

Step 1. Initialize the cluster centres $z_i$ $(1 \le i \le c)$, the threshold $\varepsilon$ $(\varepsilon > 0)$ and topological neighbourhood parameters. Set $t = 1$, maximum iteration limit $t_{max}$ and $m > 1$.


Step 2. Calculate $m_t$, the fuzzy membership matrix $U = [u_{ik}]$ and the learning rate $Y_{ik,t}$ using Equations (15)–(17), respectively.

$$m_t = m + \frac{t\,(m - 1)}{t_{max}} \tag{15}$$

$$u_{ik} = \left( \sum_{l=1}^{c} \left( \frac{\|z_i - x_k\|^2 + a_{ik}}{\|z_l - x_k\|^2 + a_{lk}} \right)^{1/(m_t - 1)} \right)^{-1} \tag{16}$$

for $1 \le i \le c$ and $1 \le k \le N$, where $x_k$ denotes the kth pixel of the PC-1 image $X_1$ and $a_{ik}$ is calculated using Equation (7).

$$Y_{ik,t} = (u_{ik})^{m_t} \tag{17}$$

Step 3. The weight of the output neuron is updated by

$$z_{i,t} = z_{i,t-1} + \frac{\sum_{k=1}^{N} Y_{ik,t} \, (x_k - z_{i,t-1})}{\sum_{s=1}^{N} Y_{is,t}} \tag{18}$$

Step 4. Update the learning rate $Y_{ik,t}$.

Step 5. Set $t = t + 1$.

Step 6. If $\|z_{i,t} - z_{i,t-1}\| > \varepsilon$ and $t < t_{max}$ then go to step 2, otherwise go to step 7.

Step 7. Output the final clustered image by assigning the pixel $x_k$ to the class with the highest membership value.

The use of spatial contextual information makes the algorithm less sensitive to noise. Also, the inter-cluster distance is large enough to avoid overlapping of clusters. The network architecture consists of two layers: input layer and output layer. The input layer consists of N neurons; the kth neuron represents the kth pixel of the image to be clustered. The output of the network is c neurons corresponding to the c cluster centres. The architecture of the network is shown in Figure 2.

Figure 2: Network architecture of FKLICM.

4. EXPERIMENTAL RESULTS

In this section, the performance of the FKLICM is evaluated. The algorithm was implemented in Matlab R2012b. The experiment was conducted on a Landsat 7 ETM+ image with eight bands over Ishinomaki City, Japan, acquired on 18 February 2003 (image courtesy US Geological Survey) [21]. The study image is shown in Figure 3(a). The geographical coordinates of the area are 38°25′00″ North, 141°18′00″ East. To test the performance of the proposed FKLICM, it is compared with FCM and GIFP-FCM. The classification results are shown in Figure 3. The parameters used in the experiment are c = 3, m = 2 and ε = 1 × 10⁻⁶. The qualitative analysis of the results shows that FKLICM gives better results than the other two methods. Further, the accuracy assessment of FKLICM, FCM and GIFP-FCM was performed using ERDAS™ software. An error matrix was used for a series of descriptive and analytical statistical analyses. Overall accuracy and kappa coefficient were used for the assessment of the proposed algorithm. A total of 256 reference points were chosen using stratified random sampling.

The error matrices of all three methods are given in Table 2, where classes 1–3 represent vegetation, land and urban area, and water, respectively. The error matrix shows that the number of misclassifications in FKLICM is minimal while GIFP-FCM shows improvement over FCM. The overall accuracy is 96.88% and the kappa coefficient is 0.9485 for FKLICM, which is higher than the other two methods. Thus, the quantitative and qualitative analysis shows that the proposed FKLICM provides better accuracy and classification results.

The kappa coefficient value lies between 0 and 1. The higher the value of kappa, the higher the classification accuracy is. Accuracy and kappa coefficient values of the three methods are listed in Table 3. It can be seen that FKLICM has the highest overall accuracy and the highest value of kappa coefficient, confirming that the proposed method outperforms the other two.
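The overall accuracy and kappa values quoted above can be recomputed directly from the error matrices reported in Table 2. The sketch below does this with the standard definitions (observed agreement on the diagonal, chance agreement from the row and column totals). The function name `overall_accuracy_and_kappa` is hypothetical, and the paper itself obtained these figures with ERDAS software rather than with this code.

```python
def overall_accuracy_and_kappa(matrix):
    """Return (overall accuracy, Cohen's kappa) for a square error matrix."""
    n = sum(sum(row) for row in matrix)
    # Observed agreement: fraction of reference points on the diagonal.
    observed = sum(matrix[i][i] for i in range(len(matrix))) / n
    row_totals = [sum(row) for row in matrix]
    col_totals = [sum(col) for col in zip(*matrix)]
    # Chance agreement expected from the marginal totals.
    expected = sum(r * c for r, c in zip(row_totals, col_totals)) / (n * n)
    kappa = (observed - expected) / (1.0 - expected)
    return observed, kappa

# Error matrices copied from Table 2 (256 reference points each).
error_matrices = {
    "FCM": [[96, 7, 1], [4, 109, 4], [2, 3, 30]],
    "GIFP-FCM": [[99, 4, 1], [2, 112, 3], [1, 3, 31]],
    "FKLICM": [[102, 1, 1], [2, 114, 1], [1, 2, 32]],
}

for name, mat in error_matrices.items():
    acc, kappa = overall_accuracy_and_kappa(mat)
    print(f"{name}: overall accuracy = {100 * acc:.2f}%, kappa = {kappa:.4f}")
```

For the FKLICM matrix, 248 of the 256 reference points lie on the diagonal, which gives the 96.88% overall accuracy, and the same calculation yields a kappa of about 0.9485, in agreement with Table 3.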


Figure 3: (a) Original Landsat 7 ETM+ image; (b) classified image with FCM; (c) classified image with GIFP-FCM, α = 0.7; (d) classified image with FKLICM.

Table 2: Error matrix

Method      Class      Class 1   Class 2   Class 3
FCM         Class 1    96        7         1
            Class 2    4         109       4
            Class 3    2         3         30
GIFP-FCM    Class 1    99        4         1
            Class 2    2         112       3
            Class 3    1         3         31
FKLICM      Class 1    102       1         1
            Class 2    2         114       1
            Class 3    1         2         32

Table 3: Accuracy and kappa

Method      Overall accuracy (%)   Kappa
FCM         91.80                  0.8649
GIFP-FCM    94.53                  0.9099
FKLICM      96.88                  0.9485

5. CONCLUSION

In this paper, a novel clustering algorithm, FKLICM, for classifying remote sensing images is proposed. FKLICM is a neuro-fuzzy hybridization of KCN and FLICM. The input image is first fused using Brovey transformation to convert it into a three-band image that has higher resolution and is visually enhanced. This three-band image is converted into a single-band image using PCA. The PC-1 image is then classified into different classes using the FKLICM algorithm. The use of a neuro-fuzzy model embeds the advantages of neural networks as well as those of fuzzy systems. The local information makes it less sensitive to noise and also improves clustering performance. Experimental results and accuracy assessment show that FKLICM gives better overall accuracy and kappa as compared with other state-of-the-art methods.
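The preprocessing chain recapped here (Brovey fusion, Equation (10), followed by extraction of the PC-1 image, Equations (12)–(14)) can be sketched in plain Python as follows. This is only an illustration under stated assumptions: the function names are hypothetical, pixels whose MS bands sum to zero are set to zero, and the dominant eigenvector is found by power iteration from an all-ones start vector instead of a full eigendecomposition, so the sign of PC-1 depends on that start vector.

```python
def brovey_fuse(ms_bands, pan):
    """Equation (10): ms_bands is a list of three H x W bands, pan is H x W."""
    h, w = len(pan), len(pan[0])
    fused = [[[0.0] * w for _ in range(h)] for _ in ms_bands]
    for r in range(h):
        for c in range(w):
            total = sum(band[r][c] for band in ms_bands)
            for n, band in enumerate(ms_bands):
                # Assumption: a zero band sum yields a zero fused value.
                fused[n][r][c] = band[r][c] * pan[r][c] / total if total else 0.0
    return fused

def first_principal_component(bands, iters=200):
    """Equations (12)-(14), first row of A only: returns the H x W PC-1 image."""
    b, h, w = len(bands), len(bands[0]), len(bands[0][0])
    pixels = [[bands[n][r][c] for n in range(b)]
              for r in range(h) for c in range(w)]
    n_pix = len(pixels)
    mean = [sum(p[n] for p in pixels) / n_pix for n in range(b)]   # Eq. (12)
    diffs = [[p[n] - mean[n] for n in range(b)] for p in pixels]
    cov = [[sum(d[i] * d[j] for d in diffs) / (n_pix - 1)          # Eq. (13)
            for j in range(b)] for i in range(b)]
    # Power iteration: converges to the eigenvector with the largest eigenvalue
    # unless the start vector happens to be orthogonal to it.
    vec = [1.0] * b
    for _ in range(iters):
        nxt = [sum(cov[i][j] * vec[j] for j in range(b)) for i in range(b)]
        norm = sum(v * v for v in nxt) ** 0.5
        if norm == 0.0:
            break
        vec = [v / norm for v in nxt]
    # Projection onto the first eigenvector, i.e. the first row of Eq. (14).
    pc1 = [sum(vec[n] * d[n] for n in range(b)) for d in diffs]
    return [pc1[r * w:(r + 1) * w] for r in range(h)]

# Made-up 2 x 2 example values, for illustration only.
ms = [[[10.0, 20.0], [30.0, 40.0]],
      [[20.0, 10.0], [5.0, 60.0]],
      [[30.0, 30.0], [15.0, 0.0]]]
pan = [[90.0, 120.0], [100.0, 50.0]]
fused = brovey_fuse(ms, pan)
pc1_image = first_principal_component(fused)
```

A useful sanity check on the fusion step is that the three fused bands at each pixel sum to the pan value, which follows directly from the ratio form of Equation (10).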


REFERENCES

1. P. Hansen and B. Jaumard, “Cluster analysis and mathematical programming,” Math. Program., Vol. 79, pp. 191–215, 1997.
2. A. Jain and R. C. Dubes, Algorithms for Clustering Data. Englewood Cliffs, NJ: Prentice-Hall, 1988.
3. R. Xu and D. Wunsch, “Survey of clustering algorithms,” IEEE Trans. Neural Networks, Vol. 16, no. 3, pp. 645–78, 2005.
4. J. F. Mas and J. J. Flores, “The application of artificial neural networks to the analysis of remotely sensed data,” Int. J. Remote Sensing, Vol. 29, no. 3, pp. 617–63, 2008.
5. T. Kohonen, Self-Organization and Associative Memory, 3rd edn. Springer, Berlin, 1989.
6. M. Filippone, F. Camastra, F. Masulli, and S. Rovetta, “A survey of kernel and spectral methods for clustering,” Pattern Recognit., Vol. 41, no. 1, pp. 176–90, 2008.
7. J. C. Bezdek, Pattern Recognition with Fuzzy Objective Function Algorithms. New York: Plenum Press, 1981.
8. E. C. Tsao, J. C. Bezdek, and N. R. Pal, “Fuzzy Kohonen clustering network,” Pattern Recognit., Vol. 27, no. 5, pp. 757–64, 1994.
9. C. Zhang and F. Qiu, “Hyperspectral image classification using an unsupervised neuro-fuzzy system,” J. Appl. Remote Sensing, Vol. 6, no. 1, p. 063515-1, 2012.
10. F. Höppner and F. Klawonn, “Improved fuzzy partitions for fuzzy regression models,” Int. J. Approx. Reason., Vol. 32, no. 2, pp. 85–102, 2003.
11. S. Krinidis and V. Chatzis, “A robust fuzzy local information c-means clustering algorithm,” IEEE Trans. Image Process., Vol. 19, no. 5, pp. 1328–37, 2010.
12. L. Zhu, F. L. Chung, and S. Wang, “Generalized fuzzy c-means clustering algorithm with improved fuzzy partitions,” IEEE Trans. System, Man Cybernet. Pt B: Cybernet., Vol. 39, no. 3, pp. 578–91, 2009.
13. F. Zhao, L. Jiao, H. Liu, and X. Gao, “A novel fuzzy clustering algorithm with non local adaptive spatial constraint for image segmentation,” Signal Process., Vol. 91, no. 4, pp. 988–99, 2011.
14. K. K. Singh and A. Singh, “A study of image segmentation algorithms for different types of images,” Int. J. Comput. Sci. Issues, Vol. 7, no. 5, September 2010.
15. D. Jayadevappa, S. S. Kumar, and D. S. Murty, “Medical image segmentation algorithms using deformable models: A review,” IETE Tech. Rev., Vol. 28, no. 3, pp. 248–55, 2011.
16. S. Dahiya, P. K. Garg, and M. K. Jat, “A comparative study of various pixel-based image fusion techniques as applied to an urban environment,” Int. J. Image Data Fusion, pp. 1–17, 2013.
17. National Aeronautics and Space Administration. Available: http://landsat.gsfc.nasa.gov/about/etm+.html
18. R. C. Gonzalez and R. E. Woods, Digital Image Processing. Englewood Cliffs, NJ: Prentice-Hall, 2008.
19. T. Celik, “Unsupervised change detection in satellite images using principal component analysis and k-means clustering,” IEEE Geosci. Remote Sensing Lett., Vol. 6, no. 4, pp. 772–6, 2009.
20. K. K. Singh, A. Mehrotra, M. J. Nigam, and K. Pal, “Unsupervised change detection from remote sensing images using hybrid genetic FCM,” in Proceedings of 2nd Students’ Conference on Engineering and Systems (SCES2013), April 2013, pp. 423–7.
21. USGS Global Visualization Viewer. Available: http://glovis.usgs.gov

Authors
Krishna Kant Singh is currently pursuing a PhD from Indian Institute of Technology, Roorkee, India. He holds BTech (EI) and MTech (VLSI) degrees. His areas of specialization include digital image processing, VLSI technology and neural networks. He has authored more than 12 research papers in the above areas in national and international journals and conferences.

E-mail: krishnaiitr2011@gmail.com

Madhav J. Nigam is a professor in the Electronics and Computer Engineering Department, Indian Institute of Technology, Roorkee, India. His areas of research include digital image processing, control systems and neural networks. He has authored a number of research papers in the above-mentioned areas.

E-mail: mkndnfec@gmail.com

Kirat Pal is a professor in the Earthquake Engineering Department of the Indian Institute of Technology Roorkee. He has authored more than 61 research papers in national and international journals and conferences.

E-mail: kiratfeq@iitr.ernet.in

Akansha Mehrotra is currently pursuing a PhD from Indian Institute of Technology, Roorkee, India. She has to her credit a number of research papers in international conferences and journals. Her areas of interest include digital image processing, remote sensing and neural networks.

E-mail: akanshasing@gmail.com

DOI: 10.1080/02564602.2014.891375; Copyright © 2014 by the IETE
