Fault Diagnosis System of Induction Motors Based o
Fault Diagnosis System of Induction Motors Based o
Research Article
Fault Diagnosis System of Induction Motors Based on
Multiscale Entropy and Support Vector Machine with Mutual
Information Algorithm
Shuang Pan,1 Tian Han,1 Andy C. C. Tan,2 and Tian Ran Lin3
1
School of Mechanical Engineering, University of Science and Technology Beijing, 30 Xueyuan Road, Haidian District,
Beijing 100083, China
2
Faculty of Engineering, Universiti Tunku Abdul Rahman, Sungai Long Campus, Kajang, Malaysia
3
School of Mechanical Engineering, Qingdao Technological University, 777 Jialingjian Road, Qingdao 266520, China
Copyright © 2016 Shuang Pan et al. This is an open access article distributed under the Creative Commons Attribution License,
which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
An effective fault diagnosis method for induction motors is proposed in this paper to improve the reliability of motors using a com-
bination of entropy feature extraction, mutual information, and support vector machine. Sample entropy and multiscale entropy are
used to extract the desired entropy features from motor vibration signals. Sample entropy is used to estimate the complexity of the
original time series while multiscale entropy is employed to measure the complexity of time series in different scales. The entropy
features are directly extracted from the nonlinear, nonstationary induction motor vibration signals which are then sorted by using
mutual information so that the elements in the feature vector are ranked according to their importance and relevant to the faults. The
first five most important features are selected from the feature vectors and classified using support vector machine. The proposed
method is then employed to analyze the vibration data acquired from a motor fault simulator test rig. The classification results con-
firm that the proposed method can effectively diagnose various motor faults with reasonable good accuracy. It is also shown that the
proposed method can provide an effective and accurate fault diagnosis for various induction motor faults using only vibration data.
Table 1: Typical induction motor faults and the corresponding Induction motor
occurrence possibilities. Power Mechanical
supply load
Bearing Stator Rotor Others
faults faults faults MSE MI
IEEE 42% 28% 8% 22% Vibration data Entropy Feature
acquisition calculation selection
EPRI 40% 38% 10% 12%
for data processing. Feature extraction is thus a critical initial Xim Xjm
step in a monitoring and fault diagnosis system. The accuracy
of feature extraction has a direct effect on the accuracy
of the final diagnosis outcome. An important criterion is ±r
Sample data
that a feature extraction method should be able to preserve
the critical information for decision-making. Based on this
consideration, SampEn and MSE algorithms are adopted in
this paper to extract the usable effective features from the ±r
vibration data.
Pincus [25] proposed a statistical measure, Approximate
Entropy (ApEn), to quantify the regularity in a short and 0 1 2 i i+1 j j+1 N
noisy time series. ApEn reflects the likelihood of sequences Sample number
that are close to each other. That is to say, within given
Figure 2: A schematic illustration of the distance calculation (𝑚 =
tolerance r, m consecutive data points remain close when one 2).
more data point is known [26]. In general, lower values of
ApEn reflect more regular time series, while higher values are
associated with less predictable time series. ApEn measures
calculation is shown in Figure 2. The distance
the time series for similar segments, and a large number
between the two vectors is calculated using
of such segments yield a small value of ApEn. Importantly,
the ApEn algorithm counts each sequence as matching itself
𝑑 [𝑋𝑖𝑚 , 𝑋𝑗𝑚 ] = max [𝑥 (𝑖 + 𝑘) − 𝑥 (𝑗 + 𝑘)] ,
to avoid the occurrence of ln(0) in the calculations. This 𝑘=0,1,...,𝑚−1
condition has led to bias in the calculation of ApEn. In (2)
practice, the bias causes ApEn to lack two important expected 𝑖, 𝑗 = 1, 2, . . . , 𝑁 − 𝑚, 𝑖 ≠ 𝑗.
properties: (1) ApEn is heavily dependent on the record
length and is uniformly lower than the expected value for (3) Let 𝑛𝑖𝑚 (𝑟) be the number of distances within r and let
short records and (2) it lacks relative consistency; that is to 𝐵𝑖𝑚 (𝑟) be the total number of m-dimension matched
say, if ApEn of one dataset is higher than that of another, it vector pairs:
should, but does not, remain higher for all tested conditions.
To overcome the shortcomings of ApEn, Richman and 𝑛𝑖𝑚 (𝑟)
𝐵𝑖𝑚 (𝑟) = , 𝑖 = 1, 2, . . . , 𝑁 − 𝑚. (3)
Moorman [27] developed and characterized a new family (𝑁 − 𝑚 − 1)
of statistical measures, sample entropy (SampEn), which
do not count the self-matches. SampEn is precisely the (4) Define the average value of 𝐵𝑖𝑚 (𝑟) such that
negative natural logarithm of the conditional probability
that two sequences with similar m points remain similar 𝑁−𝑚
𝐵𝑖𝑚 (𝑟)
at the next point, where self-matches are not included in 𝐵𝑚 (𝑟) = ∑ . (4)
𝑖=1 (𝑁 − 𝑚)
calculating the probability. Thus, a lower value of SampEn
also indicates more self-similarity in a time series. In addition
to the benefit of eliminating self-matches, SampEn algorithm (5) Repeat steps (1)–(4) for 𝑚 + 1, and 𝐵𝑚+1 (𝑟) can be
is also simpler and uses only about one-half of the time obtained.
compared with ApEn algorithm for the same calculation. (6) For finite N, SampEn is defined as the logarithm of the
SampEn is largely independent of the data record length and ratio of 𝐵𝑚+1 (𝑟) and 𝐵𝑚 (𝑟):
displays relative consistency under various circumstances.
Mathematically, SampEn algorithm can be summarized as 𝐵𝑚+1 (𝑟)
follows. SampEn (𝑚, 𝑟, 𝑁) = − ln [ ]
𝐵𝑚 (𝑟) (5)
Let X be a time series of length 𝑁 : 𝑋 =
{𝑥(1), 𝑥(2), . . . , 𝑥(𝑁)}: 𝑚
= ln 𝐵 (𝑟) − ln 𝐵 𝑚+1
(𝑟) ,
(1) Construct template vectors with dimension m by where 𝑁 is the length of the time series, 𝑚 is the
using the following equation: dimension of sequences to be compared, and r is the
tolerance level for accepting matches. It is convenient
to set the tolerance as 𝑟 × SD, where SD is the
𝑋𝑖𝑚 = [𝑥 (𝑖) , 𝑥 (𝑖 + 1) , . . . , 𝑥 (𝑖 + 𝑚 − 1)] , standard deviation of the dataset. It is worth noting
(1)
𝑖 = 1, 2, . . . , 𝑁 − 𝑚. that the parameters m, r, and N must be fixed for each
calculation.
(2) A match occurs when the distance between two 2.2. Multiscale Entropy (MSE). Regularity in a time series
template vectors (𝑋𝑖𝑚 , 𝑋𝑗𝑚 ) is smaller than predefined is sometimes presented as coarse granularity; thus, the time
tolerance r. A schematic illustration of the distance series needs to be explored using different scales. Costa et al.
4 Shock and Vibration
x1 x2 x3 x4 x5 x6 xi xi+1
Scale 2
··· xi + xi+1
y1 y2 y3 yj =
2
x1 x2 x3 x4 x5 x6 xi xi+1 xi+2
Scale 3
···
y1 y2 xi + xi+1 + xi+2
yj =
3
[28] suggested that it is generally difficult to distinguish the Mutual information (MI) is a measure of the variables’
interbeat interval time series of different diseased and healthy mutual dependence. One major advantage of MI for fea-
states when only a single-scale SampEn is used. Therefore, the ture selection is its ability to detect nonlinear relationships
concept of MSE is proposed to resolve this difficulty. For the between variables. MI can also be defined for groups of
sake of brevity, only the two main steps of MSE algorithm are variables (or equivalently for multidimensional variables),
discussed here: which allows one to take the joint relevance and redundancy
of features into account during the feature selection process.
(1) A coarse-graining step, which is used to obtain the Mutual information 𝐼(𝑋, 𝑌) is the amount of uncertainty in
representations of the original time series on different X due to the knowledge of 𝑌. Mathematically, MI can be
time scales. calculated as [23]
(2) The SampEn step, which is used to quantify the
𝑝 (𝑥, 𝑦)
regularities of the coarse-grained time series. 𝐼 (𝑋, 𝑌) = ∑𝑝 (𝑥, 𝑦) log , (7)
𝑥,𝑦 𝑝 (𝑥) 𝑝 (𝑦)
To obtain the coarse-grained time series at a scale factor,
𝜏, the original time series is divided into nonoverlapping where 𝑝(𝑥, 𝑦) is the joint probability distribution function
windows of length 𝜏, and the data points inside each window of 𝑋 and 𝑌 and 𝑝(𝑥) and 𝑝(𝑦) are the marginal probability
are averaged. A coarse-grained time series in MSE is defined distribution functions for 𝑋 and 𝑌. MI is employed in this
using study to select a subset of relevant features, which considers
feature-class mutual information to determine an optimal set
𝑗𝜏 of features by sorting entropy features in descending order.
1 𝑁
𝑦𝑗 (𝜏) = ∑ 𝑥 (𝑖) , 1≤𝑗≤ . (6) It is noted that MI has not been employed previously for
𝜏 𝑖=(𝑗−1)𝜏+1 𝜏 induction motor fault diagnosis. It is used in this study to
select the effective entropy features closely related to a motor
For 𝜏 = 1, the coarse-grained time series is the original fault.
time series. As 𝜏 increases, the length of the resulting coarse-
grained time series decreases. Figure 3 describes 𝜏 coarse- 2.4. Support Vector Machine. SVM algorithm is capable of
grained time series divided from the original time series for handling large feature spaces since the training of SVM
a scale factor of 𝜏. In the conventional MSE algorithm, the is carried out in such a manner that the dimension of
MSE at a scale factor of 𝜏 is defined as the SampEn of the classified vectors does not have any distinct influence on the
first coarse-grained time series. The SampEn for each coarse- performance of SVM as it does on the performance of other
grained time series is calculated and plotted as a function of conventional classifiers. This property is very useful for fault
the scale factor 𝜏 in the final step to complete the procedure classification since the number of features for fault diagnosis
of multiscale entropy analysis. need not be limited. Furthermore, SVM-based classifier also
has better generalization properties than other conventional
2.3. Mutual Information for Feature Selection. The extracted classifiers attributed to the minimum structural misclassifi-
SampEn and MSE features are able to discriminate different cation risk when training the SVM classifier, whereas other
types of motor faults. Nevertheless, a feature vector with high traditional classifiers are usually trained to minimize the
dimension will be very time consuming to analysis. There- empirical risk [29, 30]. SVM can also be used in nonlinear
fore, it is our aim to reduce the feature space by selecting only classification tasks with the selection of the kernel functions.
the most effective features containing the fault information The definition of a legitimate kernel function is given by
to improve the performance of SVM classification and the Mercer’s theorem which states that the function must be
effectiveness of fault diagnosis. continuous and positive definite. Any function that satisfies
Shock and Vibration 5
Kernel 𝐾 (x𝑖 , x𝑗 )
Linear function 𝐾 (x𝑖 , x𝑗 ) = x𝑖𝑇 × x𝑗
Polynomial function 𝐾(x𝑖 , x𝑗 ) = (𝑔x𝑖𝑇 × x𝑗 + 𝑟)𝑑 , 𝑔 > 0
− x𝑖 − x𝑗
Radial basis function 𝐾 (x𝑖 , x𝑗 ) = exp ( 2 )
2𝑔
Normal
Rotor unbalance
Phase unbalance
Angular misalignments
Parallel misalignments
Bowed rotor
Faulty bearing
0 0.25 0.50
Time (s)
FBO
2 2
BR
SampEn
FBO
AMIS PMIS
AMIS
MUN NOR PUN MSE
PMIS MUN NOR
1.5 1.5 BR
BRB PUN
BRB
1 1
0 20 40 60 80 100 120 140 160 0 20 40 60 80 100 120 140 160
Sample number Sample number
Figure 7: Consistency analysis of the two chosen entropy features. (a) Consistency analysis of SampEn (tolerance, 𝑟 = 0.2) and (b) consistency
analysis of MSE (scale factor, 𝜏 = 2).
4. Application of the Proposed Algorithm for fault, and the green line represents the mean entropy value for
Motor Fault Diagnosis each fault. The analysis of the distribution of the two entropy
features using the “3s” criteria indicates that both SampEn
4.1. Consistency Analysis for Each Fault Condition. An effec- and MSE satisfy the consistency requirement.
tive feature should be able to distinguish different induction
motor fault conditions and satisfy the consistency require- 4.2. Entropy Calculation and Feature Selection. As described
ment for each induction motor fault. A consistency analysis is in Section 2, three parameters, namely, the tolerance r, the
thus required before the use of entropy features for induction dimension of sequences m, and the length of the time series N,
motor fault diagnosis since motor vibration data typically are used in the calculation of SampEn. The length of each data
contains nonlinear components and noise. Random errors sample is automatically determined by the data acquisition
in the experiment can cause variation of the calculated setup, which is 𝑁 = 4096. The dimension of sequences,
entropy feature values. A large variation of the calculated 𝑚 = 2, is used in this calculation. The tolerance r is commonly
entropy values will render the features inappropriate for fault expressed as a fraction of the standard deviation (SD) of
classification. The “3s” rule in statistics is adopted in this the time series that affects the SampEn value. A small 𝑟 will
study to evaluate the consistency of the two entropy features, lead to a poor conditional probability, while a large r will
SampEn and MSE. result in losing too much system information. In principle,
Figure 7 shows the consistency analysis of the two entropy r should be a value that minimizes the effect of noise on
features for the simulated motor faults. The black dots are the the calculation of the SampEn. The value of r chosen in this
entropy values of 20 data samples for each simulated motor study ranges between 0.1SD and 0.7SD. Figure 8 illustrates
8 Shock and Vibration
𝜏=1
𝜏=2
𝜏=3
𝜏=4
𝜏=5
AMIS FBO
Dataset (160)
Correctly classified Wrongly classified Classification accuracy
Training set Test set
Set-1 (80) Set-2 (80) 74 6 92.50%
Set-2 (80) Set-1 (80) 76 4 95.00%
Average classification accuracy (all ten features) 93.75%
Set-1 (80) Set-2 (80) 77 3 96.25%
Set-2 (80) Set-1 (80) 77 3 96.25%
Average classification accuracy (the first five features using MI) 96.25%
datasets for the validation of the proposed induction motor method. Only three wrongly classified cases are found from
fault diagnosis system. In a 𝑘-fold cross-validation method, the 160 datasets for the MUN and NOR cases due to the
the dataset is divided into 𝑘 subsets, and the holdout method similarity of the waveform (see Figure 6) and entropy values
[31] is repeated 𝑘 times. At each time, 𝑘−1 subsets are used for (see Figure 10) between these two cases.
training and 𝑘th subset is used for testing. The average error Figure 13 compares the classification accuracy of the
for all 𝑘 trials is computed. The second-fold test accuracy proposed method and another method previously developed
of the proposed method is shown in Table 6. The average by the same authors [1]. It is shown that the method proposed
classification accuracy using the first five features and the MI in this study is as accurate or better for motor fault diagnosis
algorithm is 96.25%. in most cases except for one (NOR). The proposed method
The prediction accuracy of the SVM model using the can identify 8 different operational conditions of induction
radial basis function is graphically illustrated in Figure 12. It is motors with high accuracy which is an advantage over
shown that the SVM classifier can identify the specific motor a previous work [20] where a single fault such as shaft
fault with 100% accuracy for most faults using the proposed misalignment was studied. The proposed method also differs
10 Shock and Vibration
2.4 1
2.2
2 0.95
1.8
1.6 0.9
Accuracy
MSE
1.4
1.2 0.85
1
0.8 0.8
0.6
0.4 0.75
1 2 3 4 5 6 7 8 9 10 1 2 3 4 5 6 7 8 9
c
Scale factor
0.98
0.94
5. Conclusion
A fault diagnosis algorithm using two entropy features 0.92
(SampEn and MSE), MI algorithm, and SVM classifier was
proposed in this paper for induction motor fault diagnosis.
The major steps and their sequential use in this algorithm 0.9
1 2 3 4 5
are (1) extracting the entropy features from the nonlinear, g
nonstationary induction motor vibration signals; (2) sorting
the features by using MI algorithm so that the elements in the Linear function
feature vector are ranked according to their importance and Polynomial function
relevant to the faults; (3) selecting the five most important fea- Radial basis function
tures from the feature vectors and then classifying them using (b) Parameter 𝑔
SVM method. The proposed classification and fault diagnosis
technique was validated using the induction motor vibration Figure 11: Classification accuracy of the three SVM models using
data in the study. The classification results confirmed that different parameter values in the data training.
the proposed fault diagnosis algorithm can effectively detect
8 various faults of the induction motor with an average
accuracy of 96.25%. The algorithm proposed in this study
thus provides an effective tool for a reliable fault diagnosis of
a multifault mechanical system such as an induction motor as proposed in the paper can largely improve the effectiveness
using only vibration CM data. The study presented in this and accuracy of machine fault classification and diagnosis
paper highlights the notions that (1) entropy features are and thus is very useful in the development of automatic fault
effective parameters to extract useful information from the diagnosis systems.
vibration data, which can satisfy the consistency requirement
for induction motor fault diagnosis; (2) mutual information
technique is an effective tool to reduce the feature space Conflict of Interests
extracted from the CM data, which can achieve a higher
recognition rate with selected features; (3) the combination The authors declare that there is no conflict of interests
of entropy, mutual information, and support vector machine regarding the publication of this paper.
Shock and Vibration 11
Rotating
Machinery
International Journal of
The Scientific
Engineering Distributed
Journal of
Journal of
Journal of
Control Science
and Engineering
Advances in
Civil Engineering
Hindawi Publishing Corporation Hindawi Publishing Corporation
http://www.hindawi.com Volume 2014 http://www.hindawi.com Volume 2014
Journal of
Journal of Electrical and Computer
Robotics
Hindawi Publishing Corporation
Engineering
Hindawi Publishing Corporation
http://www.hindawi.com Volume 2014 http://www.hindawi.com Volume 2014
VLSI Design
Advances in
OptoElectronics
International Journal of
International Journal of
Modelling &
Simulation
Aerospace
Hindawi Publishing Corporation Volume 2014
Navigation and
Observation
Hindawi Publishing Corporation
http://www.hindawi.com Volume 2014
in Engineering
Hindawi Publishing Corporation
http://www.hindawi.com Volume 2014
Engineering
Hindawi Publishing Corporation
http://www.hindawi.com Volume 2014
Hindawi Publishing Corporation
http://www.hindawi.com
http://www.hindawi.com Volume 2014
International Journal of
International Journal of Antennas and Active and Passive Advances in
Chemical Engineering Propagation Electronic Components Shock and Vibration Acoustics and Vibration
Hindawi Publishing Corporation Hindawi Publishing Corporation Hindawi Publishing Corporation Hindawi Publishing Corporation Hindawi Publishing Corporation
http://www.hindawi.com Volume 2014 http://www.hindawi.com Volume 2014 http://www.hindawi.com Volume 2014 http://www.hindawi.com Volume 2014 http://www.hindawi.com Volume 2014