Professional Documents
Culture Documents
ABSTRACT
A comparative study of deep learning and machine learning techniques which is used in pattern recognition domain is
introduced. This study includes an analysis and specification of the most researches that had been published in this scope with
an accurate analysis of their advantages and disadvantages. The research proposes a new classification of the pattern recognition
systems according to many concepts which are: dataset type and size, learning type, fusion techniques and the type of
application which is useful to use in recognition approaches. As a result, it can be noted that some research did not give
important to the effect of the size of the dataset and its impact on the accuracy and some researches did not give important to the
type of database and application which we wanted to obtained in the selection of appropriate technology.
Keywords :— Machine Learning, Deep Learning, Pattern Recognition, Supervised Learning, Unsupervised Learning,
Classification, Regression, Dimensionality Reduction, Clustering.
Feature
.
.
.
.
.
.
.
.
.
Extraction output
Feature
Classification
Input Vector
data Machine learning
output
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Input
data Deep learning
III. METHODOLOGY SEN measures the proportion of actual positives that are
correctly identified as such (e.g., the percentage of sick people
A. Performance Measure who are correctly identified as having the condition).
Supervised learning
[24], 2019 (322 images) Konstanin [17], classification Diseases Diagnosis
2015
Accuracy is not entirely depending on the type of data Brandon [9], regression Face recognition
used but the data type and size play an important role in 2016
determining its value. The type of algorithm used is one of the Sharada [31], classification Plants diseases
2016 detection
most important and most important factors affecting on
accuracy and This is what we will discuss later. Aharon [1], 2017 Regression Emotion
recognition
F. Classification According to Learning Type Neal [20], 2016 clustering Predict poverty
Unsupervised
learning
The learning type in machine learning and deep learning Alessandra [2], dimensionality decrease the
2018 reduction dimensionality of
algorithms are divided into two basic types: supervised
sensorial features
learning and unsupervised learning. The suitable type is
Juan [15], 2018 Reduction climate
selected according to the type of application to be obtained. data features
The supervised learning usually used in the recognition,
prediction, detection and diagnosis of diseases. Nithin VR [21]
used supervised learning(regression) in prediction movie G. Classification According to Techniques
success, Konstanin [17] used supervised learning The methods used in the recognition systems vary
(classification) to detect cancer and later Sharada [31] used according to the application to be obtained. In the
supervised learning in plant diseases detection. supervised classification applications, algorithms such as (PCA) and deep
learning also used in recognition application, Brandon [9] use neural networks were used in the extraction features and then
supervised learning(regression)in face recognition and Aharon the use of these features as input for one type of classifier such
[1] used it in emotion recognition. as SVM, Random forest, k-nearest neighbors and ANN. some
The other type of learning is unsupervised learning which of research tend to integrate more than one type of deep
is used to draw inferences from datasets consisting of input learning network together or used gene selection algorithms to
data without labeled responses. The most common have more powerful classifier. Table (3) present the
unsupervised learning method is cluster analysis, which is classification of deep learning and machine learning
used for exploratory data analysis to find hidden patterns or algorithms published researches between 2010 and 2019
grouping in data. researchers used clustering in predictive according to Technique.
aspects. Nell [20] used clustering algorithms to predict
poverty Using survey and satellite data from five African TABLE (3)
countries (Nigeria, Tanzania, Uganda and Malawi). CLASSIFICATION MACHINE LEARNING AND DEEP LEARNING
ALGORITHMS ACCORDING TO TECHNIQUES
TABLE (5)
AN OVERALL COMPARISON OF THE MOST IMPORTANT RESEARCH IN THE PATTERN RECOGNITION SYSTEM USING DEEP LEARNING AND MACHINE LEARNING
TECHNIQUES.
Researcher, Performance
application Dataset Techniques weakness
date Measurements
SVM, Linear The need
Logistic Regression,
Nithin [21], Predicting Movie IMDB Dataset regression MSE=20% to consider additional
SVM Regression and
2014 Success (1050 records) logistic regression features to improve
Linear Regression
MSE= 12.5%. performance
KONSTANIN[1 Cancer prognosis Seer(162500 SVM and ANN (machine difficult to define significant
7] , 2015 and classification records) learning) ACC=71% features
Gene Expression two cancer
Devi [13], Consume lots of computation
Data microarray SVM and gene selection ACC=67%
2015 time and less accuracy
Classification datasets
Zhihua [39], Lung cancer ACC=86.54 % Small dataset and separate it
(NSCLC) dataset Random forest
2015. classification SEN=84.37% to test and train set
Traffic Collected from 17% flow data can’t be
Zhanyi [38],
identification internal ANN MSE=10% identified even the precise
2015
network rules are made
don't know ahead of time
Saahil [27], Predicting Stock CRSP stock which stocks the model will
SVM ACC=60%
2015 Price Direction database be able to predict accurately
and which it will not.
Brandon [9], Lot of time to train new
Face recognition LFW dataset CNN and SVM ACC=98%
2016 classification model
Weather
Yunjie [37] Detecting Small data set lead to less
broadcast CNN ACC=89%
2016, extreme weather accuracy
dataset
large-scale
Deep CNN for increase the network depth
Alexis [4], data sets
Natural Language Deep CNN MSE=37% lead to less error test but
2016 introduced by
Processing more complexity
Zhang et al
A lack of temporal labels(i.e.,
Neal [20], Predict poverty satellite CNN and mean
=0.81 the exact date of each image
2016 (regression) imagery dataset clustering
is unknown)
Plant diseases 54306 images of
Sharada [31], There is no standard data set
detection and diseased and CNN ACC=99%
2016 to train and test the model
classification healthy plant
The large amount of
institution's CART predictor variables so it is
Benjamin [8], cardiovascular
electronic (classification and MSE = 0.12 often challenging to know
2017 risk prediction
health record regression tree) which and how many should
be used in model.
Optimal
Pima dataset
Suhant [33] Predictive Deep neural network
(6912 records) MSE= 0.211 ACC=81% Limited and small dataset
2017, analytics of and RF classifier
Diabetic
Emotion
Aharon [1] IEMOCAP LSTM ACC=68% The Complexity of the
Recognition from LSTM,CNN
2017, dataset CNN ACC= 66% building model
Speech
TABLE (5)
CONTINUED
Researcher, Performance
application Dataset Techniques weakness
date Measurements
Ravikumar Brain tumor MRI filtering Linear SVM separate the data
SVM ACC=96%
[25], 2017 classification dataset only for two classes
Sahar [28], financial Deep neural network only 10% of messages in Stock
StockTwits data ACC=75%
2018 sentiment (LSTM, CNN ) Twits are labelled so that’s