Professional Documents
Culture Documents
Abstract. This paper proposes a novel human face recognition system using
curvelet transform and Kernel based principal component analysis. Traditionally multiresolution analysis tools namely wavelets and curvelets have been
used in the past for extracting and analyzing still images for recognition and
classification tasks. Curvelet transform has gained significant popularity over
wavelet based techniques due to its improved directional and edge representation capability. In the past features extracted from curvelet subbands were
dimensionally reduced using principal component analysis for obtaining an enhanced representative feature set. In this work we propose to use an improved
scheme using kernel based principal component analysis (KPCA) for a comprehensive feature set generation. KPCA performs a nonlinear principal component analysis (PCA) using an integral kernel operator function and obtains
features that are more meaningful than the ones extracted using a linear PCA.
Extensive experiments were performed on a comprehensive database of face
images and superior performance of KPCA based human face recognition in
comparison with state-of-the-art recognition is established.
1 Introduction
Human face recognition has attracted considerable attention during the last few decades. Human faces represent one of the most common visual patterns in our environment, and humans have a remarkable ability to recognize faces. Face recognition has
received significant consideration and is evident by the emergence of international
face recognition conferences, protocols and commercially available products. Some of
the reasons for this trend are wide range of commercial and law enforcement applications and availability of feasible techniques after decades of research.
Developing a face recognition model is quite difficult since faces are complex,
multidimensional structures and provide a good example of a class of natural objects
that do not lend themselves to simple geometric interpretations, and yet the human
visual cortex does an excellent job in efficiently discriminating and recognizing these
images. Automatic face recognition systems can be classified into two categories
namely, constituent and face based recognition [2-3]. In the constituent based
approach, recognition is achieved based on the relationship between human facial
features such as eyes, nose, mouth and facial boundary [4-5]. The success of this
M. Kamel and A. Campilho (Eds.): ICIAR 2009, LNCS 5627, pp. 512521, 2009.
Springer-Verlag Berlin Heidelberg 2009
513
approach relies significantly on the accuracy of the facial feature detection. Extracting
facial features accurately is extremely difficult since human faces have similar facial
features with subtle changes that make them different.
Face based approaches [1,6-7] capture and define the image as a whole. The human
face is treated as a two-dimensional intensity variation pattern. In this approach recognition is performed through identification and matching of statistical properties.
Principal component analysis (PCA) [7-8] has been proven to be an effective face
based approach. Kirby et al [7] proposed using Karhunen-Loeve (KL) transform to
represent human faces using a linear combination of weighted eigenvectors. Standard
PCA based techniques suffer from poor discriminatory power and high computational
load. In order to eliminate the inherent limitations of standard PCA based systems,
face recognition approaches based on multiresolution tools have emerged and have
significantly improved accuracy with a considerable reduction in computation.
Wavelet based approach using PCA for human face recognition [9] proposed by
Feng et al has utilized a midrange frequency subband for PCA representation and has
achieved improved accuracy and class separability. In a recent work, Mandal et al
[10] has shown that a new multiresolution tool, curvelet along with PCA can be used
for human face recognition with superior performance than the standard wavelet
subband decomposition. In this paper we propose to use coarse level curvelet coefficients together with a kernel based principal component analysis (KPCA) for face
recognition. Experimental results on five well known face databases demonstrate that
dimensionally reduced curvelet coefficients using KPCA offer better recognition in
comparison with PCA based curvelet coefficients.
The remainder of the paper is divided into 4 sections. Section 2 discusses the
curvelet transform, its variants along with their implementation details followed by a
discussion of kernel based PCA in section 3. The proposed methodology is described
in section 4. Experimental results are discussed in section 5 followed by conclusion,
acknowledgment and references.
514
Following wavelets, research community has witnessed intense efforts for development of better directional and decomposition tools; contourlets and ridgelets.
Curvelet transform [11] is a recent addition to the family of multiresolution analysis tool that is designed and targeted to represent smooth objects with discontinuity
along a general curve. Curvelet transform overcomes limitations of existing multiresolution analysis schemes and offers improved directional capacity to represent
edges and other singularities along curves. Curvelet transform is a multiscale nonstandard pyramid with numerous directions and positions at each length and scale.
Curvelets outperform wavelets in situations that require optimal sparse representation
of objects with edges, representation of wave propagators, image reconstruction with
missing data etc.
2.1 Continuous -Time Curvelet Transform
Since the introduction of curvelet transform researchers have developed numerous
algorithmic strategies [12] for its implementation based on its original architecture.
Let us consider a 2D space, i.e. 2 , with a spatial variable x and a frequency-domain
variable , and let r and represent polar coordinates in frequency-domain. W(r)
and V(t) are radial and angular window respectively. Both windows are smooth, nonnegative, real valued and supported by arguments r [1 / 2,2]and t [ 1,1]. For j j0 ,
frequency window Uj in Fourier domain is defined by [11]
2 j / 2 .
,
U j (r , ) = 2 3 j / 4 W 2 j r V
2
(1)
where j / 2 is the integral part of j / 2 . Thus the support of U j is a polar wedge defined by the support of W and V applied with scale-dependent window widths in each
direction. Windows W and V will always obey the admissibility conditions:
W (2 r ) = 1,
r >0 .
j =
V (t l ) = 1,
(2)
t .
l =
We define curvelets (as function of x=(x1,x2)) at a scale 2 j , orientation l , and position xk( j,l ) = Rl1 (k1.2 j , k 2 .2 j / 2 ) by j ,k ,l (x ) = j (R (x xk( j ,l ) )), where R is an orthogonal rotal
tion matrix. A curvelet coefficient is simply computed by computing the inner product
of an element f L2(R2) and a curvelet j,k,l ,
c( j , k , l ) = f , j ,k ,l =
f ( x)
j ,k ,l dx .
(3)
R2
Curvelet transform also contains coarse scale elements similar to wavelet theory.
For k1 , k2 Z , we define a coarse level curvelet as:
j ,k ( x ) = j (x 2 j k ) , j ( ) = 2 j W0 (2 j ) .
^
(4)
515
components of the constructions. The figure on the left represents the induced
tiling of the frequency plane. In Fourier space, curvelets are supported near a
parabolic wedge. Shaded area in left portion of fig.1 represents a generic wedge.
The figure on the right shows the spatial Cartesian grid associated with a given
scale and orientation. Plancherels theorem is applied to express c( j, k , l ) as an
integral over the frequency plane as:
c( j, k , l ) =
(2 )2
f ( ) j ,k ,l ( ) d =
1
(2 ) 2
f ( )U (R )e
i xk( j , l ) ,
d .
(5)
In this work curvelet based features of human faces are extracted using FDCT via
the wrapping technique. Coarse level coefficients are selected for face representation
and their dimensionality is reduced using kernel based principal component analysis.
Approximate coefficients are selected since they contain an overall structure of the
image instead of high frequency detailed information which is insignificant and does
not greatly impact the recognition accuracy. Fig. 2 shows an image from FERET
database along with its approximate coefficients and detailed higher frequency coefficients at eight angles in the next coarsest level.
516
(a)
(b)
(c)
(d)
(e)
(f)
(g)
(h)
(i)
(j)
Fig. 2. (a) Original face image, (b) approximate curvelet coefficient, (c-j) 2nd coarsest level
curvelet coefficients at 8 varying angles
C= 1 x
m
j =1
x Tj .
(6)
C = 1 ( x ) (x )
m
j =1
(7)
517
Kernel Type
Gaussian kernel
Polynomial kernel
Sigmoid kernel
Mathematical Identity
2
k ( xi, xj ) = exp
( x i x j )
2 2
k ( xi, xj ) = ( xi . x j + ) d , d = 1,2,3....
tanh ( k ( xi, xj) + )
4 Proposed Method
Our proposed method deals with classification of face images using k-NN based classification utilizing reduced dimension feature vectors obtained from curvelet space.
Images from each dataset are converted into gray level image with 256 gray levels.
Conversion from RGB to gray level format along with a two fold reduction in the
image size was the only pre-processing performed on the images. In addition to the
mentioned adaptations there were no further changes made in an image that may lead
to image degradation. We randomly divide image database into two sets namely training set and testing set. Recently, research community has observed dimensionality
reduction techniques being applied on data to be classified for real-time, accurate and
efficient processing. All images within each datasets are of the same dimension, i.e.
RxC. Similar image sizes support the assembly of equal sized curvelet coefficients
and feature vector extraction with identical level of global content. Curvelet transform
of every image is computed and only coarse level coefficients are extracted. Curvelet
transform is a relatively new technique for multiresolution analysis that better deals
with singularities in higher dimension, and better localization of higher frequency
components with minimized aliasing effects. Vectorization is the next step to convert
518
our curvelet coefficients into UxV dimension vector, called as curvelet vector,
whereas UxV << RxC.
Applying k-NN classifier on curvelet vectors could be computationally expensive
due to higher dimensionality of data originating from large image databases. Outliers
and irrelevant image points being included into classification task can also effect the
performance of our algorithm hence KPCA is implemented to reduce the dimension
of curvelet vectors. KPCA was proposed in the pioneering work of [14] which computes principal components in a higher dimension feature space which is non-linearly
related to the input space. Hence KPCA can reliably extract non-linear principal components while maintaining global content of the input space.
A Polynomial function based KPCA is used in our proposed method for dimensionality reduction to construct KPCA feature vectors. KPCA feature vectors retain
the global structure of input space that facilitates accurate classification with lower
computational complexity, diminished outliers and irrelevant information. Next,
k-NN algorithm is trained using labeled KPCA feature vectors computed in step 5
(Table 2). We selected the k-NN based classification scheme due to its attractive
properties and better performance in image-to-class scenario compared with other
parametric classification schemes as argued by Boiman et. al. [15]. Finally, test image set feature vectors are classified using k-NN scheme utilizing Euclidean distance
metric to compute the dissimilarity level between input images. Table 2 consists of
detailed steps that demonstrate our proposed techniques.
Table 2. Main steps of our proposed classification scheme
INPUT: Randomly divide image dataset into two subsets TRi and TEj where i={1,2,,n} and
j={1,2,,m} representing training and test image, each of size RxC, sets respectively.
OUTPUT: Classifier - f(x)
hsZ
shs
<dZ<d
^
u ATR = K TR ATR
u ATE = K TE ATE
, A x
KW
dEE<W
EE
519
5 Experimental Results
We tested our proposed method comprehensively using 5 distinctive faces databases
namely, FERET, AT&T, Georgia Tech, Faces94 and JAFFE data sets. Before divulging into the experimental details and the results achieved using the proposed method,
we will briefly describe the datasets used for face recognition.
5.1 Datasets
The FERET was sponsored by the Department of Defense in order to develop automatic face recognition capabilities that could be employed to assist security, intelligence and law enforcement personnel in the performance of their duties. The final
corpus consists of 14051 eight-bit grayscale images of human heads with views ranging from frontal to left and right profiles.
AT&T face database contains 10 different images for each of 40 distinctive subjects. Images of some subjects were taken at different times, with varying lighting
conditions, facial expressions and facial details. All images were taken against a dark
homogeneous background with the subjects in an upright, frontal position with a
small tolerance for side movement.
Georgia Tech database contains images of 50 people and contains 15 color images
for every subject. Most of the images were taken in two different sessions to take into
account the variations in illumination conditions, facial expression, and appearance.
In addition to this, the faces were captured at different scales and orientations.
Faces94 database was generated at University of Essex and contain a series of 20
images per individual. Faces94 database is wide-ranging and contains 20 images of
152 distinctive individuals. The database contains images of people of various racial
origins, mainly first year undergraduate students, so the majority of individuals are
between 18-20 years old but some older staff member and students are also present.
Some individuals are wearing glasses and/or beards.
Finally a Japanese female facial expression (JAFFE) database is also used to rigorously test the performance of the proposed method. The database contained 220 images of varying facial expressions posed by 10 Japanese female models.
5.2 Results and Discussion
As described earlier image datasets are converted from RGB to gray level and image
size is reduced by a factor of 2 in our experiments. In the FERET database different
number of images exists for different subjects so 45% of images from each subject
were used as prototypes and the remaining 55% for testing. Five images of each subject from AT&T database are randomly selected as prototype and the remaining 5 are
used for testing the recognition accuracy. Similarly 9 images of each subject of the
Georgia Tech dataset, 8 images of each subject from the Faces94 dataset and 9 images
of each subject from the JAFFE dataset are randomly selected for training. Both the
testing and training sets of images are decomposed using curvelet transform at 5
scales and 8 different angles. Amongst the curvelet coefficients only approximate
coefficients are selected as feature vectors since they closely represent and approximate the input image. The selected feature vectors are dimensionally reduced with
520
5
10
15
20
25
5
6
7
8
9
6 Conclusion
We proposed a novel face recognition technique using nonlinear curvelet feature
subspace. Curvelet transform is used as multiresolution analysis tool to compute
sparse features. Localized high frequency response with minimized aliasing, better
521
References
1. Turk, M.A., Pentland, A.P.: Face Recognition using Eigenfaces. In: Proc. Computer Vision and Pattern Recognition, pp. 586591 (1991)
2. Chow, G., Li, X.: Towards a system for automatic facial feature detection. Pattern Recognition 26(12), 17391755 (1993)
3. Zhao, W., Chellappa, R., Rosenfeld, A., Phillips, P.J.: Face Recognition: A Literature Survey. In: ACM Computing Surveys, pp. 399458 (2003)
4. Goudail, F., Lange, E., Iwamoto, T., Kyuma, K., Otsu, N.: Face recognition system using
local autocorrelations and multiscale integration. IEEE Trans. Pattern Anal. Mach. Intell. 18(10), 10241028 (1996)
5. Valentin, D., Abdi, H., OToole, A.J., Cottrell, G.W.: Connectionist models of face processing: A Survey. Pattern Recognition 27, 12091230 (1994)
6. Swets, D.L., Weng, D.L.: Using discriminant eigenfeatures for image retrieval. IEEE
Trans. Pattern Anal. Mach. Intell. 18(8), 831836 (1996)
7. Kirby, M., Sirovich, L.: Application of the Karhunen-Loeve procedure for the characterizati-on of human faces. IEEE Trans. Pattern Anal. Mach. Intell. 12, 103108 (1990)
8. Pentland, A., Moghaddam, B., Starner, T.: View-based and modular eigenspaces for face
recognition. In: Proc. IEEE Conf. Computer Vision and Pattern Recognition, Seattle, pp.
8491 (1994)
9. Feng, G.C., Yuen, P.C., Dai, D.Q.: Human Face Recognition using PCA on Wavelet Subband. Journal of Electronic Imaging 9(2), 226233 (2000)
10. Mandal, T., Wu, Q.M.J.: Face Recognition using Curvelet Based PCA. In: ICPR (2008)
11. Cands, E.J., Demanet, L., Donoho, D.L., Ying, L.: Fast discrete curvelet transforms. In:
Multiscale Model. Simul., pp. 861899 (2005)
12. Starck, J.L., Elad, M., Donoho, D.L.: Redundant multiscale transforms and their application for morphological component analysis. Advances in Imaging and Electron Physics 132, 287342 (2004)
13. Cover, T.M.: Geometrical and Statistical Properties of Systems of Linear Inequalities with
Applications in Pattern Recognition. IEEE Transaction on Electronic Computers 14(3),
326334 (1965)
14. Scholkopf, B., Smola, A., Muller, K.R.: Nonlinear component analysis as a kernel eigenvalue problem. Neural Computation 12(5), 12991319 (1998)
15. Boiman, O., Shechtman, E., Irani, M.: In Defense of Nearest-Neighbour Based Image
Classification. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
(June 2008)