Bayesian Network Approach To Root Cause Diagnosis

International Journal of Machine Tools & Manufacture 45 (2005) 7591
www.elsevier.com/locate/ijmactool
A Bayesian network approach to root cause diagnosis of process

variations
S. Dey, J.A. Stori
Department of Mechanical and Industrial Engineering, 140 Mechanical Engineering Building, 1206 West Green Street, MC-244,
University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
Received 28 October 2003; accepted 29 June 2004
Abstract
This paper addresses the challenges inherent in root cause diagnosis of process variations in a production machining environ-
ment. We develop and present a process monitoring and diagnosis approach based on a Bayesian belief network for incorporat-
ing multiple process metrics from multiple sensor sources in sequential machining operations to identify the root cause of process
variations and provide a probabilistic condence level of the diagnosis. The vast majority of previous work in machining process
monitoring has focused on single-operation tool wear monitoring. The focus of the present work is to develop a methodology for
diagnosing the root cause of process variations that are often confounded in process monitoring systems, namely workpiece hard-
ness, stock size, and tool wear variations. To achieve this goal, multiple sensor metrics have been identied with statistical corre-
lations to the process faults of interest. Data from multiple sensors on sequential machining operations are then combined
through a causal belief network framework to provide a probabilistic diagnosis of the root cause of the process variation.
# 2004 Elsevier Ltd. All rights reserved.
Keywords: Process monitoring; Process diagnosis; Fault diagnosis; Bayesian belief network
1. Introduction for example. Due to the stochastic nature of manufac-

turing process variations, a probabilistic analysis tech-
Machining process monitoring has received a great nique based on Bayesian inferencing has been
deal of attention in the academic research literature. A developed and will be described in this paper. Before
wide variety of sensors, modeling, and data analysis providing an overview of the developed approach, we
techniques have been developed, often focusing on tool briey review some of the relevant prior work in pro-
condition monitoring. However, the majority of these cess monitoring and diagnosis.
techniques have yet to experience widespread industrial
deployment. Industrial process monitoring systems in
1.1. Sensors in process monitoring
use today tend to rely on relatively simple signal
thresholds for triggering alarms, and are not generally 1.1.1. Tool condition monitoring
capable of diagnosing the cause of the alarm. One Tool condition monitoring has received a tremendous
important challenge in industrial process monitoring is amount of attention in the academic literature. Li and
isolating the root cause of a process variation. In auto- Mathew [1] and Dimla [2] provide excellent reviews of the
motive transfer line applications, for example, false various direct and indirect methods used to predict tool
alarms may be generated in monitoring systems as a wear and failure in turning. Direct methods include
result of variations in casting geometry and hardness. optical techniques as well as radioactivity analysis, work-
Such variations may be dicult to distinguish from piece size measurement, and tool/workpiece distance
increases in the sensor signals resulting from tool wear, measurement techniques. Indirect methods include cut-
ting force, acoustic emission, sound, vibration, spindle

Corresponding author. Tel.: +1-217-244-7762; fax: +1-217-333-
power, cutting temperature and surface roughness
1942. measurement. Jantunen [3] recently reviewed prior work
E-mail address: jastori@uiuc.edu (J.A. Stori). in tool condition monitoring for drilling. Below, we
0890-6955/$ - see front matter # 2004 Elsevier Ltd. All rights reserved.
doi:10.1016/j.ijmachtools.2004.06.018
76 S. Dey, J.A. Stori / International Journal of Machine Tools & Manufacture 45 (2005) 7591
briey discuss some of the sensors that have been com- Spindle current sensors have also been applied to tool
monly used in tool condition monitoring applications. condition monitoring of drilling and milling processes.
Cutting force sensors have proven popular for tool The time response of such sensors is inherently limited.
condition monitoring applications. The success of these For this reason, high frequency components of cutting
approaches may be attributed, at least in part, to the force are dicult to observe. Feed drive motor sensors
prevalence of analytic and mechanistic process models have also been used for tool condition monitoring and
capable of predicting cutting forces as a function of cutting force estimation. Previous applications of a spin-
various process parameters. One important practical dle power sensor to monitor tool wear include [2023]. Li
hindrance to the industrial deployment of cutting force et al. [24] apply neural network and fuzzy logic analysis
sensors for process monitoring sensors is the high cost techniques to estimate cutting forces and detect the onset
and intrusive nature of multi-axis dynamometers in a of accelerated wear using the feed motor current.
production environment. A variety of model-based tool
condition monitoring techniques using cutting force 1.1.2. Dimensional and geometric variations
sensors have been proposed [46]. Statistical techniques Dimensional variations in the incoming raw material
such as time series and autoregressive models based on will result in unanticipated depth of cut variations, and
cutting force sensors have been developed [79] as well corresponding increased loads on the cutting tool and
as neural network approaches [1012]. spindle. Spindle and cutter run-out will result in varying
Acoustic emission refers to the transient elastic and unbalanced cutting forces. Seethaler and Yellowly
waves generated from a localized source or sources [25] investigated the individual tooth run-out in milling
within a material. These waves are generated at high using linear force models. They showed experimentally
frequencies, usually above 500 kHz, and are picked up how the methods could be used to distinguish run-out
by AE transducers mounted in close proximity to the from edge breakage. Sastry et al. [26] developed a meth-
source of emission. Acoustic emission is sensitive to the odology that used variations in spindle speed to compen-
microscopic activities related to plastic deformation sate for radial run-out. Several researchers have
and friction in the cutting zone. developed techniques for the identication of the depths
Dornfeld pioneered some of the important early of cut in milling using cutting force data. Altintas and
eorts to develop acoustic emission techniques for tool Yellowly [27,28] used the ratio of two orthogonal cutting
condition monitoring of machining processes [13,14]. A force components to identify the axial and radial depth of
variety of applications of AE techniques to tool con- cut. They assumed a zero helix angle, and required the t-
dition monitoring by other research groups have been ting of polynomial functions for each feedrate value. Choi
reported in the literature some of which may be found and Yang [29] proposed a technique for calculating the
in the following review papers. depths of cut based only on the shape of the cutting force
Jemielniak [15] discusses the dierent acoustic emis- prole. Yang et al. [30,31] has developed a detailed diag-
sion and ultrasonic sensors used for tool condition nosis methodology for axial and radial depth of cut varia-
monitoring. Li [16] presents a compact review of vari- tions also based on an analysis of the cutting forces and
ous issues in acoustic emission monitoring for turning, comparison with model predictions. Their approach relies
including AE generation, classication and signal cor- only on the resultant cutting force prole.
rection, signal processing methodologies, and tool wear
1.1.3. Workpiece hardness
estimation methods.
Workpiece hardness variation does not appear to
The majority of AE monitoring applications have
have received a signicant amount of prior attention in
relied on the root mean square (rms) value of the acoustic
the process monitoring and diagnosis literature, yet it
emission signal. Other metrics such as the energy content
remains an important issue challenge for the successful
in various frequency ranges, and time domain statistical
industrial application of process monitoring systems.
parameters such as skew and kurtosis have been explored.
Production castings may contain signicant hardness
Applications of accelerometers or piezoelectric
variations that are dicult to distinguish from other
vibration sensors to tool condition monitoring have also
process variations such as tool wear or workpiece
been reported. El-Wardany et al. [17] investigated the
geometry variations. We were unable to nd any pub-
use of vibration signals for online drill wear monitoring
lished references addressing sensor-based identication
and breakage. Features sensitive to tool wear were
of workpiece hardness variations.
identied in both the time domain (ratio of absolute
mean and kurtosis metrics) and the frequency domain 1.2. Inference and diagnostic methods
(power spectra and cepstra ratio). Yao et al. [18] resear-
ched the detection and estimation of groove wear at the A variety of diagnostic or inferencing techniques
minor cutting edge of the tool using vibration sig- have been used in process monitoring, including arti-
natures. Rotberg et al. [19] also described the appli- cial neural networks, expert systems, fuzzy logic and
cation of a vibration sensor to tool wear monitoring. statistical methods.
S. Dey, J.A. Stori / International Journal of Machine Tools & Manufacture 45 (2005) 7591 77
1.2.1. Neural networks cess involving a number of states linked by a number

Articial neural networks, also known as connec- of discrete transitions. Each transition has an associa-
tionist architectures, parallel distributed processing, ted probability and each state has an associated obser-
and neuromorphic systems, are systems where inputs vation output. In hidden Markov models, the
are mapped to known outputs via weights during a transitions and output are stochastic. They have been
training process. The learned weights are used to pre- used extensively and with great eectiveness in speech
dict corresponding outputs for given inputs. Their recognition applications. Ertunc et al. [36] uses hidden
structure is modeled loosely on the neural conduction Markov models to classify the wear state of the tool in
system existing in the human body. Neural networks drilling using cutting force signals. Atlas et al. [37]
have proven successful as pattern recognition engines applied hidden Markov models to tool wear monitor-
and robust classiers, with the ability to generalize in ing in milling using a vibration sensor.
making decisions about imprecise input data. The
advantage of articial neural networks lies in their 1.2.3. Statistical techniques
resilience against distortions in the input data and their Statistical approaches to modeling and monitoring
capability of learning. Many types of articial neural of machining processes most often involve regression-
networks have been developed with varying structures based and time domain techniques such as time series
and connectivity. or moving averages. Liang and Dornfeld [13] used an
Dimla et al. [32] have compiled an excellent review autoregressive time series model to detect tool wear
of applications of neural networks to tool condition using acoustic emission signals. Yao and Fang [7]
monitoring applications. A number of individual eorts developed a stochastic method based on multivariate
have been referenced previously. Neural networks have autoregressive moving average vector models to quan-
also been used successfully to combine inputs from tify the process dynamics embedded in multi-dimen-
multiple sensors for tool condition monitoring. sional force and vibration signals to monitor tool wear
in nish turning. Kumar et al. [38] used a time series
1.2.2. Fuzzy logic and hidden Markov models approach to extract features from force and vibration
Fuzzy set theory, originally introduced by Lot signals and pattern recognition techniques to identify
Zadeh in the 1960s, emulates human reasoning in its the state of tool wear. Tarn and Tomizuka [23] used an
use of approximate information and uncertainty to autoregressive model to detect tool breakage using
generate decisions. Fuzzy set theory implements classes spindle current and cutting force signals.
or groupings of data with boundaries that are not shar- Govekar et al. [39] used spectral and time series
ply dened. Any methodology or theory implementing analysis methods to develop characteristics for tool
crisp denitions such as classical set theory, arithme- condition monitoring using data obtained from mul-
tic, and programming, may be fuzzied by general- tiple sensors including acoustic emission. Yan et al. [40]
izing the concept of a crisp set to a fuzzy set with used an autoregressive model to extract features from
blurred boundaries. The benet of extending crisp acoustic emission and cutting force sensors to detect
theory and analysis methods to fuzzy techniques is the tool failure in milling.
strength in solving real-world problems, which inevi-
tably entail some degree of imprecision and noise in the 1.2.4. Bayesian techniques
variables and parameters measured and processed for Perhaps the most widely applied technique in prob-
the application. Accordingly, linguistic variables are a abilistic inferencing is the Bayesian network. Bayesian
critical aspect of some fuzzy logic applications, where networks provide a exible structure for modeling and
general terms such a large, medium, and small evaluating uncertainty. The network consists of nodes
are each used to capture a range of numerical values. representing random variables connected by arcs that
Du et al. [33] demonstrated the eectiveness of using quantify a causal relationship between the nodes.
fuzzy set theory in tool condition monitoring. A total McCabe [41] provides an overview of Bayesian belief
of 11 monitoring indices were identied to describe sig- networks and their application. Bayesian networks
nature characteristics of force, vibration, and spindle have proven useful for a variety of monitoring and pre-
power sensor signals. Li et al. [34,35] used both fuzzy dictive purposes. Applications have been documented
set and wavelet transformation techniques to extract in the medical domain [4244], image-processing, target
features from the spindle and feed servo motor current recognition [45,46], and pattern matching [47].
signals in drilling and establish relationships between Bayesian networks have also been used for monitor-
these features and the cutting conditions for dierent ing and diagnostic applications in manufacturing. Lewis
tool states. and Ransing [48] present an interesting study of a
Hidden Markov models are an extension of Markov semantically constrained Bayesian network for diag-
chains and used to model processes whose character- nosis in manufacturing. The algorithm they use is based
istics vary with time. A Markov chain is a random pro- on Bayesian theory but is modied to enable belief
revision. The presence of uncertainty is also quantied a Bayesian belief network provides a convenient for-
by a relationship. Rigorous constraints have been malism for representing conditional probabilistic rela-
developed and are used to check the consistency of net- tionships between attributes of interest.
work values. The network is initialized and then used to
diagnose a pressure die casting process. Wolbrecht et al. 2.1. Bayesian inferencing
[49] describe a real time monitoring and diagnosis sys-
tem that identies component failures quickly in a Given a hypothesis H and an evidence e, Bayes the-
multi-stage process. Prateepasen et al. [50] demonstrates orem may be stated as follows:
the ability to combine signals from acoustic emission PejH PH
and vibration sensors for tool wear monitoring. The AE PHje 1
Pe
rms signal and the coherence function between the axial
and feed directions for the accelerometer signals were where P(H|e) is the conditional probability of hypoth-
found to be sensitive to tool wear. esis H being true given the evidence e, also know as the
posterior probability or posterior, P(H) is the prob-
1.3. Motivation ability of hypothesis H being true or the prior prob-
ability, and P(e|H) is the probability of evidence e
Despite the tremendous amount of prior work in
occurring given hypothesis H is true. This is also
sensor-based tool condition monitoring, very little
known as the likelihood that evidence e will materialize
attention has been placed on the important task of
if hypothesis H is true. As Pearl states: The impor-
diagnosing the root cause of process variations when
tance of [Bayes Theorem] is that it expresses a quan-
multiple potential sources of variation exist. For
tity, P(H|e)which is often dicult to assessin terms
example, increases in workpiece hardness, stock size,
of quantities that often can be drawn directly from our
and tool wear, may be confounded in a production
experiential knowledge [51]. The aim in monitoring
process monitoring application. The intent of the
and diagnosis is to be able to incorporate previous
present eort is to explore the possibility of using mul-
knowledge of the process with present observed evi-
tiple sensors and data obtained from sequential
dence to come up with the most plausible explanation
machining operations to diagnose the root cause of
of how the process is behaving. Bayes theorem incor-
process variations. We have adopted a Bayesian prob-
porates this type or predictive and diagnostic support.
abilistic framework to explicitly address the uncertainty
inherent in the root cause diagnosis.
In Section 2 of this paper, we motivate the appli- 2.2. The structure of a Bayesian belief network
cation of the Bayesian belief network and describe its
Bayesian networks are directed acyclic graphs in
structure. The procedure for learning from a training
which each node represents a random variable that can
database, and rules for the updation and propagation
take on two or more discrete values. The arcs in the
of evidence are described. In Section 3, we describe the
network signify the existence of direct causal inuences
experimental procedure and methodology that were
between the linked variables, and the strengths of these
used to identify the correlative sensor metrics for pro-
inuences are quantied by conditional probabilities.
cess fault diagnosis, as well as the resulting structure of
The directionality of the arrows is essential for display-
the Bayesian belief network. In Section 4, results are
ing causality and non-transitive dependencies. The cau-
presented demonstrating the eectiveness of the tech-
sal relationships expressed through the network
nique for root cause diagnosis of process faults using
structure are based on prior knowledge, experience, or
an acoustic emission and spindle power sensor and
statistically observed correlations.
data from two subsequent machining operations.
Each node has distinct states or levels that signify the
values that the particular node can take. A Bayesian
2. Bayesian belief networks network must be an acyclic graph, although it may be
multiply connected. Each node is associated with a
Process monitoring and fault diagnosis requires the conditional probability table that denes probabilities
assimilation of numerous noisy and incomplete sources for the dierent states of the node given the states of
of evidence in order to infer when a change in process the parents.
state has occurred and identify the root cause of that Fig. 1 illustrates the variables and notation to be
change. Such inferences will necessarily be imperfect, used in describing a Bayesian network. The notation
and it would be valuable if the conclusions could be has been adopted from Ramoni and Sebastiani [52]. In
augmented with some quantiable measure of its cer- this subset of a Bayesian network, there are three
tainty or uncertainty. nodes, representing the random variables X1, X2, and
Bayesian inferencing techniques are well suited to X3. The links between nodes X1 and X2 as well as X2
such problems. In particular, a data structure known as and X3 indicate an explicit conditional dependence. In
Fig. 1. Notation to be used in describing a Bayesian network.
other words, node X3 is conditionally dependent only identied, and initial values for the conditional prob-
on the state of nodes X1 and X2. The matrix h3 quanti- abilities must be obtained. We will discuss the develop-
es this conditional dependence. h3 contains the con- ment of an appropriate structure for our application in
ditional probabilities of all possible states of node X3 Section 3 of this paper. We rst discuss techniques for
given all possible states of the parents. The individual estimating the unknown conditional probabilities, hijk,
elements of matrix h3 are denoted h3jk PX3 given a prior set of data, or a database, D. The follow-
kjP3 p3j where p3j indicates the state of the parents ing development follows [52].
of X3. In this particular example, nodes X1 and X2 can A database consists of a series of instances of prior
each take on two possible states, so P3 can take on node states. The objective of a learning procedure is to
four possible values, p3j, j 1 . . . 4. The vector p3j assign values to the conditional probability matrix that
holds the individual states of the parents. are consistent with the database. One common way in
The general notation is as follows: each random which this is formulated is through maximization of
variable Xi is discrete and has a nite set of ci states. the likelihood function, P(D|h). In other words, it is
The state of the node Xi is given by xik. The state of desired to identify conditional probability values that
the parents of Xi is stored in the variable Pi. The num- maximize the likelihood of the database having
ber of states of Pi is denoted by qi, and a particular occurred.
state is indicated by pij. Thus, the subscript i indicates Let n(xik|pij) be the P
frequency of (xik|pij) in the data-
the node in question, j stands for the parent state base, and let npij k nxik jpij be the frequency of
among a total of qi states and k stands for the state of pij. It is well known that the ML estimate of hijk is the
the variable Xi among a total of ci states. Each element relevant frequency of the cases in the database [53]:
of the conditional probability matrix is dened as: hijk nxik jpij =npij 2
hijk PXi xik jPi pij . The vector pij holds the
individual states of the parents of Xi and has cardin- A major drawback of this method is that hijk 0 when-
ality equal to the number of parents. ever nxik jpij 0, so that the estimate can be too
extreme when the database is sparse or incomplete. Thus,
2.3. Learning conditional probabilities in a Bayesian the type of database available for learning is what prim-
belief network arily governs the choice of a suitable learning technique.
A complete database contains data for all possible com-
Before a Bayesian network may be used for process binations of the variables and their states. A database is
fault diagnosis, the structure of the network must be sparse when data for all the possible combinations of the
variables and their states are not available. A database is sparse training database, and the assumption of a uni-
said to be incomplete when in a particular data set, form prior expectation.
values of some of the variables are missing.
Learning for incomplete databases poses the most 2.4. Bayesian belief updatingpropagation of evidence
signicant statistical challenges. A summary of a num-
ber of common techniques for learning with incomplete Once a Bayesian network has been constructed and
databases can be found in [54]. In our process monitor- initial conditional probability tables have been gener-
ing application, it will typically be possible to observe ated by training on prior data, the network is ready for
all sensor metrics for all data instances. Thus, we will the introduction of external evidence based on sensor
not address learning with an incomplete database. observations.
Nevertheless, obtaining training data will be expensive, Upon the introduction of external evidence, those
requiring extensive experimentation. For this reason, it nodes representing sensor observations are forced to
is likely that a training database for a process monitor- take on a specic value while the remaining nodes must
ing application will be sparse. adjust their own belief (probability of being in each
The Bayesian approach is to regard hijk as a random of the potential states) in order to remain consistent
variable whose prior distribution represents the obser- with the new evidence. It is these updated beliefs that
vers belief about the conditional probability before will be observed and relied upon as the output of the
data are seen. Application of Bayesian learning process monitoring and diagnosis system.
involves the use of a database of training examples to A variety of techniques for inferencing in a Bayesian
update the prior belief on the parameters using Bayes network have been developed. We have adopted a
theorem. We rst require an assumption on the prior message passing methodology for belief updating
distribution of the parameters, h. developed by Pearl [51]. This method oers a number
For convenience, a Dirichlet prior distribution is typi- of appealing characteristics relevant for applications in
cally chosen for the multinomial parameters, hij. This process monitoring and diagnosis. Specically, the
choice of prior satises the desired assumption of mutual message passing methodology oers a local triggering
independence of the parameters introduced by Spie- mechanism and a computationally simple local updat-
gelhalter and Lauritzen [53] and results in a conjugate ing procedure. Finally, the triggering mechanism and
sampling situation, namely that after data are seen, the updating action is conceptually meaningful. Alternative
distribution of parameters remains in the Dirichlet fam- approaches typically rely on a global supervisor, or
ily [55]. Interestingly, Geiger and Heckerman [55] dem- monitor that must select the sequence of propagation
onstrate that the under these assumptions, a Dirichlet and computation throughout the network.
prior distribution is, in fact, the only possible choice. The In this section, we provide our formal implemen-
assumption of complete data, parameter independence, tation of Pearls message passing methodology for
and Dirichlet priors makes it possible to update each belief updating in a multiply connected tree (polytree).
vector of parameters, hij, independently [54]. For a detailed development of the message passing
For our application, we also assume the prior distri- mechanism itself, the reader is directed to Pearl [51],
bution of hij to be a Dirichlet distribution with para- Chapter 4. The notation and implementation of the
meters faij1 ; . . . aijci g. The prior density of hij is then belief updating rules has been changed signicantly so
given by [52]: as to maintain consistency with the data structures and
P notation used previously in this paper.
XC
k aijk a The probabilities of interest will be the belief, or likeli-
phij hijkijk 3
k
Ca ijk hood of each of the states of a particular node, given all
evidence received thus far. This evidence includes all train-
The parameters aijk encode an observers prior belief. ing data, as well as the current instantiation of particular
In the absence of prior information, we will assume nodes as a result of sensor observations. During the belief
that aijk a=ci qi for all i, j and k, so that the prior updation and inferencing, the conditional probability vec-
probability of (xik|pij) is simply 1=ci [55]. tors, hij, remain xed. These updated belief probabilities
As previously discussed, the posterior density of h will be denoted as a vector B(Xi), indicating the prob-
will remain within the family of Dirichlet distributions, ability, or likelihood of each of the possible states of Xi.
and the posterior expectation of hijk can be readily The message passing procedure occurs as follows:
obtained [52]: when new evidence is introduced into the network,
aijk nxik jpij each node updates its own belief, based on messages
Ehijk D 4 received from its parents and children, and correspond-
aij npij
ingly generates messages to be sent to its parents as
Eq. (4) will be used to initialize the conditional prob- well as to its children. Fig. 2 describes the notation to
abilities for our process monitoring application given a be used in message passing. The set of parents of node
Fig. 2. Message passing conventions for belief updating.
Xi will be denoted Xi . The number of parents will equal For all k, and all j,
the cardinality of the set, denoted jXi j. The set of chil- 8 9
dren of node Xi will be denoted Wi , with cardinality < jXj
Y =
jWi j. A message from one of the parents, m 2 Xi , to Nkl Nkl hijk xmi pij o ;
: o1;Xo6p ;
node Xi will be denoted xim , while a message from a
child, n 2 Wi , to node Xi will be denoted win . l pij ojXo p
The steps are as follows: Condition N on values of wi :
X
ci
Step 1Belief updating. When node Xi is activated, For all l, wip l Nkl wi k
it simultaneously inspects the messages commu- k1
nicated by its parents, xmi , and the messages com-
municated by its children, win . It then updates it is Step 3Generation of messages to be sent to children.
belief according to the following relationships: For each child, c,
8 9 ( )
< Y jWi j =X jXi j
Y
For all k; Bi k wi k xi k;
xic k w k hijk xmi pij o ;
1 :o1;W o6c in ; j o1
Bi P Bi i
k Bi k m Xi o; n Wi o
jWi j
Y
For all k; wi k win k; n Wi o The propagation of messages is triggered by instantia-
o1 ( ) tion of evidence into the network. Once the evidence
X jXi j
Y enters the network via the evidence nodes, the beliefs of
For all k; xi k hij k xim pij o ;
j o1
these nodes are updated. The next step is to post mes-
sages to all the parents and children of the availability
m Xi o
of evidence and set ags on them that indicate that they
Step 2Generation of messages to be sent to parents. should be updated. The nodes containing these ags are
For each parent, p 2 Xi of node Xi. then required to update their own belief. For each node
Create conditional probability matrix N: whose belief is updated, new messages are again gener-
ated and sent to the children and parents. Flags are also cause diagnosis application, the network is limited to
set for the parents and children as the messages are sent. two levels. The top level contains the root nodes encap-
The presence of undirected loops in the network sulating the state of the process variations. The second
introduces signicant challenges to the message propa- level contains evidence nodes representing the states of
gation and belief updating procedure. Because there the various sensor metrics. The selection of the individ-
may be multiple paths between two nodes, the message ual process metrics, and the determination of whether a
passing could propagate indenitely. The problem, in causal link between a particular root node and a parti-
general, is NP hard. For large networks with many cular evidence node is to exist in the network will be
loops, the computational challenges become signicant. described in detail in Section 3.
For small networks with a small number of loops, a Once the network structure has been determined
threshold approach may be employed [56]. The propa- (initialization), initial values for the conditional prob-
gation in such a case would follow all the rules of nor- abilities must be learned from a training database. The
mal propagation for singly connected networks. It learning procedure has been described previously in
would also compare the change in the belief between Section 3.2. A training database is provided through an
successive belief updations of a node. If the dierence initial experimental design.
is not greater than a small threshold, which is pre- After the database structure has been identied and the
determined, the propagation is terminated. conditional probability tables have been initialized, the
network is ready for application. The rst step in appli-
2.5. Application of the Bayesian belief network cation of the network is the instantiation of evidence.
methodology When new sensor data are available, the correspond-
ing metrics are evaluated, and the evidence nodes are
We now provide an overview of the procedure for forced into the corresponding state.
applying the Bayesian belief network to root cause The next step is propagating the evidence that has
diagnosis of process variation. In Section 3, we will been instantiated into the network to the dierent
describe the specic sensor metrics and experimental nodes in the network. Evidence propagation proceeds
methodology for training and utilizing the network. per the rules described previously in Section 2.4. This is
Fig. 3 graphically illustrates the procedure for train- followed by the updating of the belief in light of the
ing and applying the network. The rst step is the evidence received from both parents and children.
identication of the network structure. For our root After the belief of a particular node is updated, its
Fig. 3. Initialization and application of the Bayesian belief network methodology.

Table 1
Acoustic emission and spindle power metrics investigated
Acoustic emission sensor metrics Spindle power sensor metrics

Standard deviation of the AE data from the mean Mean of the spindle power data
Mean of the 10% largest amplitude peaks Standard deviation of the spindle power data
Mean of the absolute value of the AE data Mean of the 10% largest amplitude peaks
Pulse rate with a variety of threshold levels Power spectral density at tooth passing frequency (milling)
Peak amplitude in the FFT of the data Power spectral density in 10 Hz bands (drilling only)
Frequency at which the peak amplitude occurs Power spectral density between 10 and 100 Hz (drilling)
Power spectral density in 25 kHz frequency bands Mean of data during partial immersion (drilling only)
Power spectral density between 75 and 500 kHz
belief propagated to the surrounding nodes in accord- 3.1. Development of sensor metrics
ance with the propagation rules. This is an iterative
procedure and is carried out until saturation is reached. In order to identify a variety of sensor metrics that
Saturation is obtained when there are no other nodes had statistically signicant correlations with the process
to propagate the evidence or its eect to. faults of interest, a set of preliminary experiments were
conducted. A wide range of both time domain and fre-
quency domain metrics were investigated, and ANOVA
3. Development of sensor metrics and network analysis was used to determine statistical signicance.
structure The intent of the metrics is to encapsulate signicant
A primary objective of this work was to make use of features in the data stream with a scalarizing variable
input from multiple sensors on multiple processes in an that would be suitable as an evidence node in the Baye-
sian network. The metrics investigated are listed in
attempt to more reliably discern the root cause of pro-
Table 1.
cess faults. The three faults of interest were dimen-
The acoustic emission sensor was sampled at 1 MHz
sional workpiece variation, tool wear, and workpiece
and band pass ltered between 10 and 500 kHz. Dur-
hardness. These three faults would commonly manifest
ing face milling, the AE sensor responds in bursts to
themselves in similar ways from a process monitoring
the engagement of the individual inserts. The sensor
perspective (i.e. increase in power levels, cutting forces,
metrics were evaluated over these data bursts.
etc.). We are interested in attempting to discriminate
between these commonly confounded root causes.
The case study that was developed to validate the 3.1.1. Preliminary experimental study and ANOVA
analysis
methodology involved two sequential machining opera-
After the candidate set of metrics had been identied, a
tions carried out on a single workpiece. The workpiece
three variable, two-level full factorial experimental study
material was 100 diameter 4140 bar stock with hardness
was performed to identify those metrics with a statistically
varying from 24 to 32 Rockwell C. A face milling oper-
signicant correlation to the process faults of interest. The
ation was conducted using a 45 lead, 3.1500 diameter
v
three variables were the process variations of interest: tool

face mill with four carbide inserts (Kennametal part
wear, workpiece hardness, and dimensional workpiece
#SEKT443AEEN7GP) to machine a at on one end of
variation. After the subset of statistically signicant varia-
the bar stock. Next, a blind hole was drilled into this tions was identied, those corresponding metrics were
v
newly generated at. The drill was a 118 point angle, used as nodes in the Bayesian network.
v 00
45 helix, 0.375 diameter tin-coated HSS twist drill A 23 full factorial experimental design was executed
(Kennametal part #KHSS13324). The process para- with two repetitions. The sequence of experiments was
meters were as recommended by the Machining Data randomized. The average value of the repetitions was
handbook. It was hoped that the cumulative infor- used in the ANOVA analysis. The workpiece hardness
mation from the sequential process steps might aid in ranged from 24.0 to 32.0 Rockwell C. The geometric
discriminating between multiple causes. stock size variation was 0.07000 .
Care was taken to utilize sensors that may be cost A full-factorial experimental design was conducted
eective and practical in a production machining because we are also interested in the interaction eects
environment. For this reason, direct measurement of between the variables. The ANOVA analysis was con-
cutting forces through a multi-axis dynamometer was ducted for each variable using the F-test with a 95%
avoided. The two sensors employed were an acoustic condence interval.
emission sensor and a spindle power sensor, both pro- The metrics were chosen based on the following set
vided by Montronix, Inc. Data from each sensor were of rules that were intended to distinguish between sig-
obtained during each of the two machining operations. nicant causal relationships and confounded eects:
Table 2
Acoustic emission and spindle power metrics identied through ANOVA analysis
Process Sensor Label Metric ANOVA sensitivity

AEFM1 Standard deviation TWM
AEFM2 Pulse rate, 0.8 threshold SS, TWM
AEFM3 PSD (75475 kHz) SS, TWM, SS TWM
Acoustic emission
AEFM4 Mean of peaks SS
Face milling AEFM5 Frequency of peak SS
AEFM6 Mean, absolute value SS, TWM, SS TWM
PFM1 Mean WH, TWM
Spindle power PFM2 Standard deviation SS, TWM
PFM3 Mean of peaks SS, TWM
AED1 Mean of peaks SS TWD
AED2 Pulse rate, 0.9 threshold WH TWD
Acoustic emission AED3 Mean, absolute value TWD
AED4 PSD TWD
Drilling AED5 Standard deviation TWD
PD1 Dierence between means, full and partial engagement WH
PD2 Mean during partial engagement SS
Spindle power
PD3 Standard deviation TWD
PD4 PSD (10100 Hz) TWD
1 If only the main eects are signicant, all the main two sensors employed. The last column of the table
eects that are signicant are included as causative lists the root nodes with which a statistically signi-
agents for the metric in question. cant correlation was found. Each of these relation-
2 If the main eects and one or more rst order inter- ships represents a causal arc that will be added to
actions are signicant, then: the Belief network.
If the interactions are the combinations of variables
3.2. Initialization of the network structure
that show up as signicant in the main eects, then
include the variables in the interaction as being Fig. 4 illustrates the resulting network structure. There
causative agents for the metric in question. are a total of 18 evidence nodes representing individual
If the interactions are the combinations of vari- sensor metrics. There are four root notes, representing the
ables that are other than the variables that show four root causes of concern: stock size variation, work-
up as signicant in the main eects, the result is piece hardness variation, tool wear state of the face mill,
confounded and ignore the result. and tool wear state of the endmill. Each of these root
3 If none of the main eects are signicant and one or cause nodes contained two discrete states, high and low.
more rst order interactions are signicant then: The evidence nodes were discretized with either two or
If there are multiple interactions, the result is con- three states. Each of the links represents a causal relation-
founded and thus should be ignored. ship as enumerated in Table 2. In the event that a statisti-
If there is one interaction then the variables in the cally signicant relationship between a particular metric
interaction are the causative agents for the metric and an interaction eect exists and was not eliminated by
in question. the ltering rules above, a causal link between both of the
4 If the second order interaction is signicant, then: corresponding root nodes and the sensor metric in ques-
If none of the main eects or the rst order interaction were added to the Bayesian network.
tions are signicant, then all the variables involved
in the second order interaction are taken as signi- 4. Results and conclusions
cant and causal agents for the metric in question.
If some or all of the main eects are signicant The network shown in Fig. 4 was trained on the 23 full
with no rst order interaction being signicant, factorial DOE with two repetitions as described in Section
the results are confounded and thus ignored. 3. The raw training data and the corresponding table of
If any of the rst order interactions are signi- states from which the conditional probabilities were
cant, the results are confounded and ignored. learned is containing Appendix A, Tables A1A3. Sub-
If all the eects are signicant, the results are sequent to training the network, its diagnostic perform-
completely confounded and are ignored. ance was evaluated on 18 new trials. For these trials,
experimental training data were obtained, and the metrics
Application of the above rules resulted in the were evaluated and introduced as evidence into the net-
identication of the causally signicant sensor work. The belief updation procedure was executed and
metrics listed in Table 2 for the two processes and the beliefs of the root nodes were recorded. Tables 35
Fig. 4. Structure of Bayesian belief network for root cause diagnosis.
summarize the predictive success of the network on these A percentage in the neighborhood of 50% represents
18 trials. Three dierent scenarios are reported: belief diagnostic ambivalencethe network is unable to dis-
updation using milling evidence only (Table 3), drilling cern the state given the training data and current sen-
evidence only (Table 4), and both milling and drilling evi- sor evidence. A percentage below 50% represents an
dence (Table 5). The percentages reported in the tables incorrect diagnosisthe belief is stronger in the
represent the belief in the correct state of the trial. opposite (incorrect) state. Tables 35 have been color-
For example, consider the rst trial (rst row of coded to provide a visual summary of the diagnostic
data) in Table 3. In this trial, and as indicated on the successlighter shades of gray indicate increasing
rst row of the table, the tool wear of the face mill was levels of diagnostic condence.
in the low state, the tool wear of the drill was in the Table 6 and Fig. 5 summarize the diagnostic per-
high state, the stock size dimension was in the low formance of the network. As may be seen in Fig. 5, the
state, and the workpiece hardness was in the low state. predictive success increases with the use of data from
Because Table 3 represents a situation in which only both processes. This is an important observation. Sel-
the milling evidence is utilized, there is no ability to dom has data from subsequent machining operations
predict the tool wear state of the drill. For this reason, been combined to provide a greater level of condence
N/A appears in the corresponding cell of the table. in the diagnosis of process faults. Using evidence from
Belief predictions are available, however, for each of both the milling and drilling metrics, the network was
the remaining three diagnoses. In this trial, all three of able to correctly diagnose the state at a 60% condence
these beliefs are greater than 80%, indicating a greater level in all 18 trials for the drilling tool wear, workpiece
than 80% belief in the correct state of the stock size, hardness, and stock size nodes, and provided a correct
face mill wear state, and workpiece hardness level. diagnosis in 17 of 18 cases for the face mill tool wear
Higher percentages represent greater degrees of cer- state. As the condence threshold is increased to 70%
tainty, and therefore more successful diagnostic ability. and 80%, the predictive success declines, as would be
Table 3
Diagnostic beliefs on 18 trials using milling evidence only
Table 4
Diagnostic beliefs on 18 trials using drilling evidence only
Table 5
Diagnostic beliefs on 18 trials using milling and drilling evidence
Table 6
Diagnostic success on 18 trials at 60%, 70%, and 80% condence levels
Process fault
TWM TWD SS WH
Diagnosis condence 60% 70% 80% 60% 70% 80% 60% 70% 80% 60% 70% 80%
Drilling only, both sensors N/A N/A N/A 15 15 8 17 16 8 17 15 6
Evidence Milling only, both sensors 16 16 12 N/A N/A N/A 15 14 11 18 17 11
utilized Both processes, both sensors 17 17 13 18 16 10 18 16 12 18 18 13
expected. Nevertheless, using evidence from both pro- in 10 of the 18 trials, and the stock size variation,
cesses, the network was capable of achieving an 80% workpiece hardness, and face mill tool wear state in
condence level in between 10 and 13 of the 18 trials. either 12 or 13 of the 18 trials.
In conclusion, the Bayesian belief network method-
ology appears to be an eective tool for explicitly address- Acknowledgements
ing uncertainty and utilizing data from multiple sources.
In this application, data from two non-intrusive sensors The authors gratefully acknowledge the support of
(spindle power and acoustic emission) and two subsequent Montronix, Inc., Ford Motor Company, the NIST
processes (face milling and drilling) have been combined ATP program, and the National Science Foundation
to successfully diagnose a state of tool wear, workpiece (award DMI-9941214) for support of this work. Dr.
hardness, and a stock size dimensional variation. W. Kline and R. Sriram of Montronix, Inc. are person-
After training on a data set of 16 trials, the network ally acknowledged for their support and guidance.
was able to correctly diagnose the correct state at a
60% condence level in all but one of the 18 test cases. Appendix A.
The belief network was able to correctly diagnose the
state of the drill tool wear at the 80% condence level Tables A.1A.3.
Fig. 5. Diagnostic success on 18 trials at 60% and 80% condence levels.
Table A.1
Face milling metrics from training data
Acoustic emission metrics Spindle power metrics

AEfm1 AEfm4 AEfm6 AEfm2 AEfm5 AEfm3 Pfm1 Pfm2 Pfm3
1 0.56 1.59 0.42 1519 140,512 0.64 75.84 15.74 16.77
2 0.52 1.54 0.39 1328 140,343 0.57 74.90 15.51 16.60
3 0.61 1.21 1.05 920 920 920.31 80.95 16.79 18.01
4 0.53 1.04 0.88 649 649 649.19 80.04 15.78 16.52
5 0.49 1.48 0.36 1244 136,072 0.53 79.75 13.28 12.88
6 0.51 1.60 0.36 1209 133,606 0.52 78.87 14.21 14.16
7 0.49 1.03 0.80 416 416 416.75 85.27 14.56 14.74
8 0.49 1.07 0.80 407 407 407.99 84.69 15.25 15.59
9 0.56 1.59 0.42 1519 140,512 0.64 76.16 16.18 17.67
10 0.52 1.54 0.39 1328 140,343 0.57 73.86 14.83 15.64
11 0.61 1.21 1.05 920 920 920.31 81.46 16.95 18.42
12 0.53 1.04 0.88 649 649 649.19 79.94 15.66 16.57
13 0.49 1.48 0.36 1244 136,072 0.53 79.83 13.29 12.94
14 0.51 1.60 0.36 1209 133,606 0.52 75.30 13.43 12.17
15 0.49 1.03 0.80 416 416 416.75 85.37 14.32 14.47
16 0.49 1.07 0.80 407 407 407.99 80.63 14.50 13.45
Table A.2
Drilling metrics from training data
Acoustic emission metrics Spindle power metrics

AEd5 AEd1 AEd3 AEd2 AEd4 Pd3 Pd1 Pd4 Pd2
1 0.18 1.17 0.13 247 0.06 7.66 48.09 25.63 77.73
2 0.22 1.36 0.15 757 0.10 8.08 61.17 29.05 74.97
3 0.20 1.47 0.14 476 0.08 7.87 52.21 29.20 81.40
4 0.20 1.33 0.13 543 0.08 8.26 64.15 32.45 78.05
5 0.16 1.13 0.12 226 0.05 7.13 49.68 25.39 73.39
6 0.17 1.12 0.12 287 0.06 7.91 70.05 26.19 72.43
7 0.15 0.84 0.11 51 0.04 7.12 44.74 22.72 78.73
8 0.14 1.00 0.10 104 0.04 7.24 43.86 21.08 81.19
9 0.18 0.99 0.13 147 0.06 8.53 62.81 32.23 73.84
10 0.20 1.20 0.14 513 0.08 8.04 58.61 38.77 72.60
11 0.18 1.34 0.12 327 0.06 7.37 39.27 28.99 80.50
12 0.20 1.33 0.13 543 0.08 8.26 64.15 32.45 78.05
13 0.18 1.38 0.13 561 0.07 7.57 53.83 23.33 73.96
14 0.18 1.36 0.12 362 0.06 7.14 52.41 21.49 73.87
15 0.19 1.27 0.13 425 0.07 7.16 42.78 22.63 82.04
16 0.15 1.06 0.11 244 0.04 6.97 45.88 25.07 79.28
Table A.3
Evidence and process states for training data
Trial Process state Face milling metric state Drilling metric state
TWM SSV Hardness TWD AEFM1 AEFM2 AEFM3 AEFM4 PFM1 PFM2 AEFM5 PFM3 AEFM6 PD1 AED1 AED2 PD2 AED3 AED4 AED5 PD3 PD4
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 0 1 0 1 0 1 0 1
2 1 1 1 1 1 1 1 1 1 0 1 1 1 1 1 1 1 1 1 1 1 0
3 1 1 1 1 1 0 1 1 0 1 1 1 1 0 1 0 1 1 1 1 0 0
4 1 1 1 1 1 1 1 1 0 1 1 1 1 1 1 1 0 1 0 1 1 0
5 1 1 1 1 1 1 1 1 0 1 1 1 1 0 0 1 1 1 1 1 1 1
6 1 1 1 1 1 1 1 1 0 1 1 1 1 1 0 0 1 1 0 1 0 1
7 1 1 1 1 1 1 0 1 1 0 1 0 1 1 1 1 0 1 1 1 1 1
8 1 1 1 1 1 1 0 1 1 0 1 0 1 1 1 1 1 1 1 1 1 1
9 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 0 1 1 0
10 1 1 1 1 1 1 1 1 1 0 1 0 1 0 0 0 1 1 1 1 1 1
11 1 1 1 1 1 0 1 1 0 1 1 1 1 1 1 0 1 1 0 1 1 0
S. Dey, J.A. Stori / International Journal of Machine Tools & Manufacture 45 (2005) 7591
12 1 1 1 1 1 1 1 1 0 0 1 1 1 1 1 1 0 1 0 1 1 0
13 1 1 1 1 1 1 1 1 0 1 1 1 1 0 1 1 1 1 0 1 0 1
14 1 1 1 1 1 1 1 1 1 1 1 1 1 0 1 0 1 1 0 1 1 1
15 1 1 1 1 1 1 0 1 1 1 1 0 1 1 1 0 1 1 0 1 1 1
16 1 1 1 1 1 1 0 1 0 1 1 1 1 1 0 1 1 1 1 1 1 1
89
References [23] J.H. Tarn, M. Tomizuka, On-line monitoring of tool and cutting
conditions in milling, Journal of Engineering for Industry 111
[1] D. Li, J. Mathew, Tool wear and failure monitoring techniques (3) (1989) 206212.
for turninga review, International Journal of Machine Tools [24] X. Li, A. Djordjevich, P.K. Venuvinod, Current-sensor-based
and Manufacture 30 (4) (1990) 579598. feed cutting force intelligent estimation and tool wear condition
[2] E. Dimla, Sensor signals for tool-wear monitoring in metal cut- monitoring, IEEE Transactions on Industrial Electronics 47 (3)
ting operationsa review of methods, International Journal of (2000) 697702.
Machine Tools and Manufacture 40 (8) (2000) 10731098. [25] R.J. Seethaler, I. Yellowly, Identication of radial runout in
[3] E. Jantunen, A summary of methods applied to tool condition milling operations, Journal of Manufacturing Science and
monitoring in drilling, International Journal of Machine Tools Engineering 121 (3) (1999) 524531.
and Manufacture 42 (9) (2002) 9971010. [26] S. Sastry, S.G. Kapoor, R.E. DeVor, Compensation of progress-
[4] Y. Koren, T.-R. Ko, G.A. Ulsoy, K. Danai, Flank wear esti- ive radial run-out in face-milling by spindle speed variation,
mation under varying cutting conditions, Journal of Dynamic International Journal of Machine Tools and Manufacture 40 (8)
Systems, Measurement and Control 113 (2) (1991) 300307. (2000) 11211139.
[5] S.C. Lin, R.J. Yang, Force-based model for tool wear monitor- [27] Y. Altintas, I. Yellowly, The identication of radial width and
ing in face milling, International Journal of Machine Tools and axial depth of cut in peripheral milling, International Journal of
Manufacture 35 (9) (1995) 12011211. Machine Tools and Manufacture 27 (3) (1987) 367381.
[6] S.K. Choudhury, S. Rath, In-process tool wear estimation in [28] Y. Altintas, I. Yellowly, In process prediction of tool failure in
milling using cutting force model, Journal of Materials Proces- milling using cutting force models, Journal of Engineering for
sing Technology 99 (1) (2000) 113119. Industry 111 (2) (1989) 149157.
[7] Y. Yao, X.D. Fang, Modeling of multivariate time-series for [29] J.G. Choi, M.Y. Yang, In-process prediction of cutting depths in
tool wear estimate in nish turning, International Journal of end milling, International Journal of Machine Tools and Manu-
Machine Tools and Manufacture 32 (4) (1992) 495508. facture 39 (5) (1999) 705721.
[8] H.V. Ravindra, Y.G. Srinivasa, R. Krishnamurthy, Modeling of [30] L. Yang, R. Zhu, R.E. DeVor, S.G. Kapoor, W.A. Kline,
tool wear based on cutting forces in turning, Wear 169 (1) (1993) Identication of stock size variation and its application to pro-
2532. cess monitoring in end milling, Proceedings of the 2000 Japan
[9] S.A. Kumar, H.V. Ravindra, Y.G. Srinivasa, In-process tool USA Symposium on Flexible Automation, 2000, pp. 18, Paper
wear monitoring through time series modeling and pattern rec- No. 2000JUSFA-13212.
ognition, International Journal of Production Research 35 (3) [31] L. Yang, R.E. DeVor, S.G. Kapoor, A model-based method-
(1997) 739751. ology for detection of depth of cut variations in end milling,
[10] T. Szecsi, Cutting force modeling using articial neural networks, Proceedings of the 2002 JapanUSA Symposium on Flexible
Journal of Materials Processing Technology 9293 (1999) 344349. Automation, 2002.
[11] S.C. Lin, R.J. Lin, Tool wear monitoring in face milling using [32] D.E. Dimla Jr., P.E. Lister, N.J. Leighton, Neural network sol-
force signals, Wear 198 (12) (1996) 136142. ution to the tool condition monitoring problem in metal cutting-
[12] S.C. Lin, C.J. Ting, Tool wear monitoring in drilling using force a critical review of methods, International Journal or Machine
signals, Wear 180 (12) (1995) 5360. Tools and Manufacture 37 (9) (1997) 12191241.
[13] S.Y. Liang, D.A. Dornfeld, Tool wear detection using time-ser- [33] R.X. Du, M.A. Elbestawi, S. Li, Tool condition monitoring in
ies analysis of acoustic emission, Journal of Engineering for turning using fuzzy set theory, International Journal of Machine
Industry 111 (3) (1989) 198205. Tools and Manufacture 32 (6) (1992) 781796.
[14] E.N. Diei, D.A. Dornfeld, Acoustic emission sensing of tool [34] X. Li, S.K. Tso, J. Wang, Real-time tool condition monitoring
wear in face milling, Journal of Engineering for Industry 109 (3) using wavelet transforms and fuzzy techniques, IEEE Transac-
(1987) 234240. tions on Systems, Man and Cybernetics Part C: Applications
[15] K. Jemielniak, Some aspects of AE application in tool condition and Reviews 30 (3) (2000) 352357.
monitoring, Ultrasonics 38 (18) (2000) 604608. [35] X. Li, S.K. Tso, Drill wear monitoring based on current signals,
[16] X. Li, A brief review: acoustic emission method for tool wear Wear 231 (2) (1999) 172178.
monitoring during turning, International Journal of Machine [36] H.M. Ertunc, K.A. Loparo, H. Ocak, Tool wear condition
Tools and Manufacture 42 (2) (2002) 157165. monitoring in drilling operations using hidden Markov models,
[17] T.I. El-Wardany, D. Gao, M.A. Elbestawi, Tool condition International Journal of Machine Tools and Manufacture 41 (9)
monitoring in drilling using vibration signature analysis, Inter- (2001) 13631384.
national Journal of Machine Tools and Manufacture 36 (6) [37] L. Atlas, M. Ostendorf, G.D. Bernard, Hidden Markov models
(1996) 687711. for monitoring machining tool-wear, Proceedings of the IEEE
[18] Y. Yao, X.D. Fang, G. Arndt, Online estimation of groove wear International Conference on Acoustics, Speech and Signal Pro-
in the minor cutting edge for nish machining, Annals of the cessing, vol. 6, 2000, pp. 38873890.
CIRP 40 (1) (1991) 4144. [38] S.A. Kumar, H.V. Ravindra, Y.G. Srinivasa, In-process tool
[19] J. Rotberg, S. Braun, E. Lenz, Vibration generation models for wear monitoring through time series modeling and pattern rec-
cutting tool diagnostics, Diagnostics, Vehicle Dynamics and Spe- ognition, International Journal of Production Research 35 (3)
cial Topics, 18(5)ASME Design Engineering Division, DE, 1989, (1997) 739751.
pp. 16. [39] E. Govekar, J. Gradisek, I. Grabec, Analysis of acoustic emis-
[20] N. Constantinides, S. Bennett, An investigation of methods for sion signals and monitoring of machining processes, Ultrasonics
online estimation of tool wear, International Journal of Machine 38 (18) (2000) 598603.
Tools and Manufacture 27 (2) (1987) 225237. [40] D. Yan, T.I. El-Wardany, M.A. Elbestawi, A multi-sensor strat-
[21] D. Cuppini, G. DErrico, G. Rutelli, Tool wear monitoring based egy for tool failure detection in milling, International Journal of
on cutting power measurement, Wear 139 (2) (1990) 303311. Machine Tools and Manufacture 35 (3) (1995) 383398.
[22] R.X. Du, M.A. Elbestawi, S. Li, Tool condition monitoring in [41] B. McCabe, Belief networks for engineering applications, Inter-
turning using fuzzy set theory, International Journal of Machine national Journal of Technology Management 21 (34) (2001)
Tools and Manufacture 32 (6) (1992) 781796. 257270.
[42] B. Zheng, Y.-H. Chang, X.-H. Wang, W.F. Good, Comparison [49] E. Wolbrecht, B. DAmbrosio, R. Paasch, D. Kirby, Monitoring
of articial neural network and Bayesian belief network in a and diagnosis of a multi-stage manufacturing process using Baye-
computer-assisted diagnosis scheme for mammography, Proceed- sian networks, Articial Intelligence for Engineering Design, Analy-
ings of the International Joint Conference on Neural Networks, sis and Manufacturing: Aiedam 14 (1) (2000) 5367.
vol. 6, 1999, pp. 41814185. [50] A. Prateepasen, Y.H.J. Au, B.E. Jones, Acoustic emission and
[43] C.E. Kahn, J.J. Laur, G.F. Carrera, A Bayesian network for vibration for tool-wear monitoring in single-point machining
diagnosis of primary bone tumors, Journal of Digital Imaging 14 using belief network, IEEEInstrumentation and Measurement
(2) (2001) 5657. Technology Conference, vol. 3, 2001, pp. 15411546.
[44] P. Antal, H. Verrelst, D. Timmerman, Y. Moreau, S. Van [51] J. Pearl, Probabilistic Inference in Intelligent Systems: Networks of
Huel, B. De Moor, et al., Bayesian networks in ovarian cancer Plausible Inference, Morgan Kaufman, San Mateo (CA), 1988.
diagnosis: potentials and limitations, Proceedings of the IEEE [52] M. Ramoni, P. Sebastiani, Parameter estimation in Bayesian
Symposium on Computer based Medical Systems, 2000, pp. 103 networks for incomplete databases, Intelligent Data Analysis 2
108. (2) (1998).
[45] V. Pavlovic, B.J. Frey, T.S. Huang, Time-series classication [53] D.J. Spiegelhalter, S.L. Lauritzen, Sequential updating of con-
using mixed-state dynamic Bayesian networks, Proceedings of ditional probabilities on directed graphical structures, Networks
the IEEE Computer Society Conference on Computer Vision 20 (1990) 157224.
and Pattern Recognition, vol. 2, 1999, pp. 609615. [54] D. Heckerman, A tutorial on learning with Bayesian networks,
[46] K.-C. Chang, J. Liu, J. Zhou, Bayesian probabilistic inference in: M. Jordan (Ed.), Learning in Graphical Models, MIT Press,
for target recognition, Proceedings of SPIEThe International Cambridge (MA), 1999, Also appears as Technical Report
Society for Optical Engineering 2755 (1996) 158165. MSR-TR-95-06, Microsoft Research.
[47] C.-H. Wu, Sub-syllable-based discriminative segmental Bayesian [55] D. Geiger, D. Heckerman, A characterization of the Dirichlet
network for mandarin speech keyword spotting, IEEE Proceed- distribution through global and local independence, The Annals
ings: Vision, Image and Signal Processing 144 (2) (1997) 6571. of Statistics 25 (1997) 13441369.
[48] R.W. Lewis, R.S. Ransing, Semantically constrained Bayesian [56] K.P. Murphy, Y. Weiss, M.I. Jordan, Loopy belief propagation for
network for manufacturing diagnosis, International Journal of approximate inference: an empirical study, UAI 99: Proceedings of
Production Research 35 (8) (1997) 21712187. the 15 Conference on Uncertainty in Articial Intelligence, 1999.

Bayesian Network Approach To Root Cause Diagnosis

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Bayesian Network Approach To Root Cause Diagnosis

Uploaded by

Copyright:

Available Formats

International Journal of Machine Tools & Manufacture 45 (2005) 7591

A Bayesian network approach to root cause diagnosis of process

Received 28 October 2003; accepted 29 June 2004

1. Introduction for example. Due to the stochastic nature of manufac-

1.2.1. Neural networks cess involving a number of states linked by a number

Fig. 1. Notation to be used in describing a Bayesian network.

Fig. 2. Message passing conventions for belief updating.

Fig. 3. Initialization and application of the Bayesian belief network methodology.

Acoustic emission sensor metrics Spindle power sensor metrics

three variables were the process variations of interest: tool

Process Sensor Label Metric ANOVA sensitivity

Fig. 4. Structure of Bayesian belief network for root cause diagnosis.

Fig. 5. Diagnostic success on 18 trials at 60% and 80% condence levels.

Acoustic emission metrics Spindle power metrics

Acoustic emission metrics Spindle power metrics

You might also like