Rolling element bearings are critical components in rotating machines and their states in operation directly affect industrial productivity and workers' safety. Fault diagnosis in the early stage could provide maintenance guidance and avoid huge economic losses so a fast and precise method is necessary. Traditionally, the framework of fault diagnosis for bearings consists of five parts: data acquisition, data preprocessing, feature extraction, dimensionality reduction, and fault classification[1-3].
Prominent features are critical in fault diagnosis, so multiple domain features have been investigated, including root mean square (RMS), kurtosis, and skewness in time domain and frequency domain, and short-time Fourier transform (STFT)[4] in time-frequency domain[5]. To deal with nonlinear dynamic characteristics of bearing fault signals, several entropy-based approaches have been proposed, such as approximate entropy (ApEn)[6-7], sample entropy (SaEn)[8-9], fuzzy entropy[10-11], multiscale entropy (MSE)[12-13], and permutation entropy (PE)[14-15]. PE can measure the complexity of time series through comparing adjacent values, which is simple, immune to noise, and suitable for online monitoring. Based on the PE method, multi-scale permutation entropy (MPE) was developed by Aziz and Arif[16] and was employed to estimate the complexity of the signals in different scales. MPE excels at its stability and robustness and performs better than PE in the application of bearing fault diagnosis[17-21].
Traditional approaches for dimensionality reduction are principal component analysis (PCA)[1] and modifications like locally linear embedding (LLE)[2], Laplacian score (LS)[3], and non-negative matrix factorization (NMF)[4]. With the selected features, the fault diagnosis should be performed with a classifier, which consists of two types based on the vibration data, i.e., statistical models[1, 3] and data-driven models[2, 11, 13, 15, 20-22]. The two classifiers have their own advantages for bearing fault classification but are with deficiencies that they are complex and difficult to implement.
Since the essence of fault diagnosis is the problem of pattern recognition, distance would be an essential factor for the discrimination of different feature vectors. Hence, the nearest neighbor principle should be considered and the fault type could be identified with the smallest distance. For example, dynamic time wrapping[23] has been applied in bearing fault diagnosis[24], which measures the similarity of vectors with different length, but is with high computation complexity at the same time. In addition, not all the distances (e.g., the Euclidean distance) could differentiate the pattern. Based on the above analysis, MSD[25] is applied in this paper for bearing fault classification, which considers the shape of feature vectors and is simple to implement because no tuning parameters need to be set up or optimized compared with the abovementioned classifiers.
The rest of this paper is organized as follows: a brief review of the theoretical background concerning MPE and MSD is provided in Section 2; Section 3 illustrates the procedure of MPE-MSD, and two experiments considering different categories are designed and implemented to verify effectiveness of the proposed method in Section 4; finally, conclusions are drawn in Section 5.
2 Theoretical Background 2.1 Multi-scale Permutation EntropyPE[14] was firstly used for detecting the dynamic change of time series data and to overcome former entropy method limitations such as the requirement of long datasets and high computational cost. The definition and implement procedure of PE are described as follows.
Considering a time series {x(k), k=1, 2, …, N}, the m dimensional vector at time i can thus be constructed as
$ \begin{array}{l} x_i^m = \{ x(i),x(i + \tau ), \cdots ,x(i + (m - 1)\tau )\} ,\\ \;\;\;\;\;\;\;i = 1,2, \cdots ,N - (m - 1)\tau \end{array} $ | (1) |
where xim is a re-constructed series, m represents the embedding dimension, and τ is the time delay. Let πj=(r0, r1, …, rm-1), then xim has a permutation πj if it satisfies
$ x\left( {i + {r_0}\tau } \right) \le x\left( {i + {r_1}\tau } \right) \le \cdots \le x\left( {i + {r_{m - 1}}\tau } \right) $ | (2) |
where 0≤rn≤m-1, and rn-1 < rn when
$ x\left( {t + {r_{n - 1}}\tau } \right) = x\left( {t + {r_n}\tau } \right) $ |
For each permutation πj, 1≤j≤m!, the relative frequency can be defined as
$ p\left( {{\mathit{\boldsymbol{\pi }}_j}} \right) = \frac{{\# \left\{ {i|i \le N - (m - 1)\tau ,x_i^m\;{\rm{has type }}{\mathit{\boldsymbol{\pi }}_j}} \right\}}}{{N - (m - 1)\tau }} $ | (3) |
where # represents the number of xim belonging to the type πj. Then PE with m dimension is defined as
$ {H_{{\rm{PE}}}}(m) = - \sum\limits_j^{m!} p \left( {{\mathit{\boldsymbol{\pi }}_j}} \right)\ln \left[ {p\left( {{\mathit{\boldsymbol{\pi }}_j}} \right)} \right] $ | (4) |
The MPE algorithm based on PE comprises the following two steps:
1) Considering a time series {x(k), k=1, 2, …, N}, as illustrated in Fig. 1, the coarse-grained time series yj(s) with scale factor s can be constructed as
$ y_j^{(s)} = \frac{1}{s}\sum\limits_{i = (j - 1)s + 1}^{js} {{x_i}} \;\;\;1 \le j \le \frac{N}{s} $ | (5) |
2) The PEs of each coarse-grained time series are calculated based on Eqs. (1)-(5) and then expressed as the function of the scale factor s
$ {\rm{MPE}}(x,s,m,\tau ) = {\rm{PE}}\left( {y_j^{(s)},m,\tau } \right) $ | (6) |
Four parameters should be specified before using MPE, including N, m, τ, and s. In this paper, the parameters are set as m=4, τ=1, s=20[21]. The normalized MPE is expressed as
$ \begin{array}{*{20}{c}} {{\rm{NorMPE}}(x,t,m,\tau ) = {\rm{MPE}}(x,t,m,\tau )/}\\ {\sum\limits_{t = 1}^s {\rm{MPE}} (x,t,m,\tau )} \end{array} $ | (7) |
Considering vectors as objects, the morphology similarity distance is proposed to estimate similarity. The morphology similarity distance between two n-dimensionality vectors Li=(li1, li2, …, lin) and Lj=(lj1, lj2, …, ljn) is defined as[25]
$ {D_{{\rm{MSD}}}} = {D_{{\rm{Euclid}}}} \times \left( {2 - {\rm{ASD}}/{D_{{\rm{Manhattan}}}}} \right) $ | (8) |
where DEuclid is the Euclid distance, DManhattan is the Manhattan distance, ASD is the absolute sum of the differences as follows:
$ \begin{array}{l} {D_{{\rm{MSD}}}}\left( {{L_i},{L_j}} \right) = \sqrt {\sum\limits_{k = 1}^n {{{\left( {{l_{ik}} - {l_{jk}}} \right)}^2}} } \times \\ \;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\left( {2 - \frac{{\left| {\sum\limits_{k = 1}^n {\left( {{l_{ik}} - {l_{jk}}} \right)} } \right|}}{{\sum\limits_{k = 1}^n {\left| {{l_{ik}} - {l_{jk}}} \right|} }}} \right) \end{array} $ | (9) |
The traditional index used for similarity is the Minkowski metric
$ d\left( {j,k} \right) = {\left( {\sum\limits_{i = 1}^n {{{\left| {{x_{ji}} - {x_{ki}}} \right|}^r}} } \right)^{1/r}} $ | (10) |
The traditional distances Euclidean (r=2) and Manhattan distances (r=1) can be regarded as special cases of Minkowski distance. However, they both neglect the difference between vectors and fail to reflect shape similarities such as translation, compression, and stretch.
For example, there are four vectors as follows:
$ {\mathit{\boldsymbol{x}}_0} = (5,5,5),{\mathit{\boldsymbol{x}}_1} = (4,4,4) $ |
$ {\mathit{\boldsymbol{x}}_2} = (5,4,3),{\mathit{\boldsymbol{x}}_3} = (5,4,7) $ |
Let vector x0 be the reference vector. The vector differences, Manhattan distance, Euclidean distance, and MSD from x1, x2, and x3 to x0 are computed and listed in Table 1.
3 Procedure of Fault Diagnosis
A procedure based on MPE and MSD is established for the fault classification of rolling element bearings, and the steps are as follows:
1) Collect vibration signals of healthy and different types of fault bearings;
2) Calculate MPE values of the original signals with different fault types and construct normalized vectors as reference samples and testing samples;
3) Assign the number of the reference samples to each fault type, compute the MSD between every testing sample and all reference samples, and then average the MSD values with the specified fault;
4) Based on the nearest neighbor principle, choose the minimum value among the MSD means from all fault types, and the corresponding fault type can be identified.
The flowchart of the proposed method is shown in Fig. 2.
4 Application 4.1 Case 1: Artificially Seeded Damage Bearings
The bearing data were obtained from Bearing Data Centre of Case Western Reserve University[26] and the bearing experiment system is shown in Fig. 3. During the experiment, the drive end bearing 6205-2RS JEM SKF was investigated and defective bearings were seeded with single point faults using electro-discharge machining. Vibration signals from the accelerometer were placed at the 12 o'clock position at the drive end of the motor housing with the sampling frequency 48 kHz, including normal, ball fault (BF), inner race fault (IRF), and outer race fault (ORF) (at the 6 o'clock position).
To verify the effectiveness of the proposed method, three groups of tests at defect size 0.007 inch and loading 1 HP were investigated, as depicted in Table 2, including different fault types (Group A) with waveforms shown in Fig. 4(a), different defective sizes (Group B) for damage severity, and a combination of fault types and defective sizes (Group C) for complex conditions. In every condition, non-overlapping segments with length N=4 096 were extracted, and MPE values were computed for each segment.
To illustrate specific fault types in the signal, the envelop spectra based on Teager energy operator corresponding to BF, IRF, and ORF are shown in Fig. 4(b). The remarkable amplitudes occurring in the location of the fault frequencies clearly indicated the fault types. The structural parameters of the bearing are listed in Table 3, and the corresponding fault frequencies are computed as follows:
Ball spin frequency (BSF)
$ {f_B} = 0.5{f_r}\left( {1 - {d^2}{{\cos }^2}\alpha /{D^2}} \right)D/d $ | (11) |
Ball pass frequency on inner race (BPFI)
$ {f_I} = 0.5{f_r}(1 + d\cos \alpha /D)z $ | (12) |
Ball pass frequency on outer race (BPFO)
$ {f_O} = 0.5{f_r}(1 - d\cos \alpha /D)z $ | (13) |
where d is the diameter of the rolling element, D is the pitch diameter, α is the contact angle, z is the number of rolling elements, and fr is the shaft speed.
Then, samples with numbers 10, 20, and 30 were utilized as reference and the rest were used for verification. The classification results are listed in Table 4.
The classification accuracies reached 100%, 100%, and 97.54% in Group A, Group B, and Group C with 10 reference samples respectively, as illustrated in Fig. 5. Moreover, the accuracy was not highly dependent on the number of the reference samples according to the comparison among different sample numbers in Table 4. In Fig. 5(a), for example, when the bearing was normal, the mean MSD between the testing sample and the normal reference sample was the minimum, which was the same in the other three fault conditions. The four types of fault could be clearly identified based on the prominent distance between the minimum mean MSD and the other three MSDs. As illustrated in Fig. 5(b), distances from the normal condition were BF014 < BF007 < BF201 at the first 100 samples, which means the damage severity was non-monotone because of the non-continuous feature, which could not represent the overall tendency. In Fig. 5(c), a more complicated condition was considered, where a ten-sample interval was set to display the result clearly. Compared with the previous work, considering the same four conditions as normal, BF, IRF, and ORF as listed in Table 5, the results in our work are satisfied.
4.2 Case 2: Run-to-Failure Bearings
The data were obtained from Intelligent Maintenance System (IMS), University of Cincinnati. Four bearings were tested at one time on the same shaft in the bearing test rig as shown in Fig. 6. The shaft is 2 000 r/min and a radial load of 6 000 LB is forced on the shaft with a spring mechanism. The data sampling rate is 20 kHz with the data length 20 480 points. Three groups of tests were conducted in the experiment in total and Ref. [28] can provide more details. Table 6 presents four types of fault bearings and the typical waveforms are shown in Fig. 7. One hundred samples were extracted from each bearing fault condition, and 400 samples in whole were generated with length N=20 480. Then MPE values were computed for each sample.
Following the same abovementioned procedure, samples with numbers 10, 20, 30 were utilized as reference, and the rest were used for verification. The classification accuracies were all 100% with 10, 20, 30 samples, and the results are illustrated in Fig. 8. A comparison of previous work considering the same four conditions as normal, BF, IRF, and ORF is listed in Table 7, which shows the advantage of the proposed method.
The whole life of Bearing 1 in Set No. 2 with ORF was employed for the investigation of damage severity. MPE values were computed at every sample time point. The first tenth points were abandoned because they were not stable at the beginning, and the confidence value (CV) was introduced as an index to evaluate the severity. A normalization function combined the Sigmoid function with MSD is introduced as[31]
$ {\rm{CV}} = 2 - \frac{2}{{1 + {e^{ - {c_0}{\rm{MSD}}}}}} $ | (14) |
in which the scale parameter
$ {c_0} = - \ln \left( {\frac{{{\rm{C}}{{\rm{V}}_{{\rm{pre}}}}}}{{2 - {\rm{C}}{{\rm{V}}_{{\rm{pre}}}}}}} \right)/{\rm{Mean}}\left( {{\rm{MS}}{{\rm{D}}_{{\rm{normal}}}}} \right) $ | (15) |
where Mean (MSDnormal) is the average of all MSDs under normal condition, and CVpre corresponds to the Mean (MSDnormal), which was determined artificially. In this paper, the scale parameter c0=10.8 and CVpre=0.99. For comparison, additional features like RMS, kurtosis, and skewness were considered, and the corresponding CV with MPE is illustrated in Fig. 9. The comparison shows that the CV with MPE only was stable and could monotonously represent the damage severity along the time.
5 Conclusions
To fast and accurately diagnose the fault type in bearings, this paper presents a novel bearing fault diagnosis method based on MPE-MSD. From the above research, some conclusions are drawn as follows:
1) MPE could be employed as the representative feature for fault diagnosis and MSD is an efficient and simple approach for classification without the set-up or optimization of the tuning parameters.
2) Two experiments were performed to verify the feasibility of the proposed method. In comparison with previous studies, high accuracies were obtained in fault diagnosis for different conditions without feature selection.
3) A normalization function combining the Sigmoid function with MSD is proposed for continuous damage severity without considering full fault dataset. The degradation path shows stability and monotonicity with MPE, compared with the conditions considering MPE, RMS, kurtosis, and skewness.
[1] |
Liu J, Hu Y, Wu B, et al. A hybrid generalized hidden Markov model-based condition monitoring approach for rolling bearings. Sensors, 2017, 17(5): 1143. DOI:10.3390/s17051143 (0) |
[2] |
Yang Y, Jiang D. Casing vibration fault diagnosis based on variational mode decomposition, local linear embedding, and support vector machine. Shock and Vibration, 2017.Article ID 5963239. DOI: 10.1155/2017/5963239.
(0) |
[3] |
Zheng J. Rolling bearing fault diagnosis based on partially ensemble empirical mode decomposition and variable predictive model-based class discrimination. Archives of Civil & Mechanical Engineering, 2016, 16(4): 784-794. DOI:10.1016/j.acme.2016.05.003 (0) |
[4] |
Gao H, Liang L, Chen X, et al. Feature extraction and recognition for rolling element bearing fault utilizing short-time Fourier transform and non-negative matrix factorization. Chinese Journal of Mechanical Engineering, 2015, 28(1): 96-105. DOI:10.3901/CJME.2014.1103.166 (0) |
[5] |
Caesarendra W, Tjahjowidodo T. A review of feature extraction methods in vibration-based condition monitoring and its application for degradation trend estimation of low-speed slew bearing. Machines, 2017, 5(4): 21. DOI:10.3390/machines5040021 (0) |
[6] |
Pincus S M. Approximate entropy as a measure of system complexity. Proceedings of the National Academy of Sciences of the United States of America, 1999, 88(6): 2297-2301. DOI:10.1073/pnas.88.6.2297 (0) |
[7] |
Li K, Su L, Wu J, et al. A rolling bearing fault diagnosis method based on variational mode decomposition and an improved kernel extreme learning machine. Applied Sciences, 2017, 7(10): 1004. DOI:10.3390/app7101004 (0) |
[8] |
Richman J S, Randall Moorman J. Physiological time-series analysis using approximate entropy and sample entropy. American Journal of Physiology: Heart and Circulatory Physiology, 2000, 278(6): H2039-H2049. DOI:10.1152/ajpheart.2000.278.6.H2039 (0) |
[9] |
Han M, Pan J. A fault diagnosis method combined with LMD, sample entropy and energy ratio for roller bearings. Measurement, 2015, 76: 7-19. DOI:10.1016/j.measurement.2015.08.019 (0) |
[10] |
Chen W, Zhuang J, Yu W, et al. Measuring complexity using FuzzyEn, ApEn, and SampEn. Medical Engineering & Physics, 2009, 31(1): 61-68. DOI:10.1016/j.medengphy.2008.04.005 (0) |
[11] |
Zheng J, Cheng J, Yang Y. A rolling bearing fault diagnosis approach based on LCD and fuzzy entropy. Mechanism and Machine Theory, 2013, 70: 441-453. DOI:10.1016/j.mechmachtheory.2013.08.014 (0) |
[12] |
Costa M, Goldberger A L, Peng C K. Multiscale entropy analysis of complex physiologic time series. Physical Review Letters, 2002, 89: 068102(1-4). DOI:10.1103/PhysRevLett.89.068102 (0) |
[13] |
Liu H, Han M. A fault diagnosis method based on local mean decomposition and multi-scale entropy for roller bearings. Mechanism and Machine Theory, 2014, 75: 67-78. DOI:10.1016/j.mechmachtheory.2014.01.011 (0) |
[14] |
Bandt C, Pompe B. Permutation entropy: A natural complexity measure for time series. Physical Review Letters, 2002, 88: 174102(1-4). DOI:10.1103/PhysRevLett.88.174102 (0) |
[15] |
Liu X, Chen Y, Yang J. A novel fault diagnosis method for rolling bearing based on EEMD-PE and multiclass relevance vector machine. Proceedings of the 2017 International Instrumentation and Measurement Technology Conference. Piscataway: IEEE, 2017: 1-6. DOI:10.1109/I2MTC.2017.7969729
(0) |
[16] |
Aziz W, Arif M. Multiscale permutation entropy of physiological time series. Proceedings of the 2005 Pakistan Section Multitopic Conference. Piscataway: IEEE, 2005: 1-6. DOI:10.1109/INMIC.2005.334494
(0) |
[17] |
Gao Y, Villecco F, Li M, et al. Multi-scale permutation entropy based on improved LMD and HMM for rolling bearing diagnosis. Entropy, 2017, 19(4): 176. DOI:10.3390/e19040176 (0) |
[18] |
Zhao L, Wang L, Yan R. Rolling bearing fault diagnosis based on wavelet packet decomposition and multi-scale permutation entropy. Entropy, 2015, 17(9): 6447-6461. DOI:10.3390/e17096447 (0) |
[19] |
Vakharia V, Gupta V K, Kankar P K. A multiscale permutation entropy based approach to select wavelet for fault diagnosis of ball bearings. Journal of Vibration and Control, 2015, 21(16): 3123-3131. DOI:10.1177/1077546314520830 (0) |
[20] |
Wu S D, Wu P H, Wu C W, et al. Bearing fault diagnosis based on multiscale permutation entropy and support vector machine. Entropy, 2012, 14(8): 1343-1356. DOI:10.3390/e14081343 (0) |
[21] |
Li Y, Xu M, Wei Y, et al. A new rolling bearing fault diagnosis method based on multiscale permutation entropy and improved support vector machine based binary tree. Measurement, 2016, 77: 80-94. DOI:10.1016/j.measurement.2015.08.034 (0) |
[22] |
Zhang X, Liang Y, Zhou J, et al. A novel bearing fault diagnosis model integrated permutation entropy, ensemble empirical mode decomposition and optimized SVM. Measurement, 2015, 69: 164-179. DOI:10.1016/j.measurement.2015.03.017 (0) |
[23] |
Berndt D J, Clifford J. Using dynamic time warping to find patterns in time series. Proceedings of the AAAI-94 Workshop on Knowledge Discovery in Databases. AAAI Press, 1994: 359-370.
(0) |
[24] |
Tian Y, Wang Z, Lu C. Self-adaptive bearing fault diagnosis based on permutation entropy and manifold-based dynamic time warping. Mechanical Systems & Signal Processing, 2019, 114: 658-673. DOI:10.1016/j.ymssp.2016.04.028 (0) |
[25] |
Li Z, Yuan J, Zhang W. Fuzzy C-mean algorithm with morphology similarity distance. Proceedings of the 2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery. Piscataway: IEEE, 2009: 90-94. DOI:10.1109/FSKD.2009.180
(0) |
[26] |
Case Western Reserve University. Bearing Data Center. Available online: http://csegroups.case.edu/bearingdatacenter/pages/apparatus-procedures.
(0) |
[27] |
Li Y, Xu M, Wang R, et al. A fault diagnosis scheme for rolling bearing based on local mean decomposition and improved multiscale fuzzy entropy. Journal of Sound & Vibration, 2016, 360: 277-299. DOI:10.1016/j.jsv.2015.09.016 (0) |
[28] |
Qiu H, Lee J, Lin J, et al. Wavelet filter-based weak signature detection method and its application on roller bearing prognostics. Journal of Sound & Vibration, 2006, 289(4-5): 1066-1090. DOI:10.1016/j.jsv.2005.03.007 (0) |
[29] |
Yu J. Local and nonlocal preserving projection for bearing defect classification and performance assessment. IEEE Transactions on Industrial Electronics, 2012, 59(5): 2363-2376. DOI:10.1109/TIE.2011.2167893 (0) |
[30] |
Wu L, Yao B, Peng Z, et al. Fault diagnosis of roller bearings based on a wavelet neural network and manifold learning. Applied Sciences, 2017, 7(2): 158. DOI:10.3390/app7020158 (0) |
[31] |
Wang Z, Lu C, Wang Z, et al. Fault diagnosis and health assessment for bearings using the Mahalanobis-Taguchi system based on EMD-SVD. Transactions of the Institute of Measurement and Control, 2013, 35(6): 798-807. DOI:10.1177/0142331212472929 (0) |