A Multiplier-Free Convolution Neural Network Hardware Accelerator for Real-Time Bearing Condition Detection of CNC Machinery
<p>The proposed CNN network architecture.</p> "> Figure 2
<p>The confusion matrix of the proposed CNN network architecture.</p> "> Figure 3
<p>From software training to quantization, parameter extraction, final software verification, and hardware implementation.</p> "> Figure 4
<p>The proposed CNN network block diagram.</p> "> Figure 5
<p>Accuracy vs. presenting bits of the input value.</p> "> Figure 6
<p>Effect on accuracy vs. bit number of the fixed representation and dynamic fixed point.</p> ">
Abstract
:1. Introduction
- In this work, a CNN-based hardware accelerator has been implemented for bearing faults, which can achieve real-time processing in industrial environments.
- An improved incremental network quantization (INQ) method has been proposed to reduce the memory usage of the proposed hardware accelerator.
- The complex multiplier operations have been removed from the proposed method to realize a multiplier-free accelerator.
- The power consumption of the proposed accelerator can be reduced to 342 mW with a 140 MHz clock frequency.
- The accuracy of the proposed accelerator can achieve 95.12% with only 8.69 K parameters.
2. Exploring Different Fault Diagnosis Methods
- (1)
- K-Nearest Neighbor (K-NN)
- (2)
- Support Vector Machine (SVM)
- (3)
- Principal Component Analysis (PCA)
- (4)
- Neural Network (NN)
3. The Proposed Methodology
3.1. Software Design
3.1.1. Dataset Preprocessing
3.1.2. Architecture of the Proposed CNN Model
3.2. Hardware Implementation
Algorithm 1. Select order and training |
Algorithm 2. Quantization method |
if (validation_acc >= SettheAccuracyRate) ; ; else return (Net); |
Algorithm 3. Quantization network |
4. Experimental Results
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Bellini, A.; Filippetti, F.; Tassoni, C.; Capolino, G.-A. Advances in diagnostic techniques for induction machines. IEEE Trans. Ind. Electron. 2008, 55, 4109–4126. [Google Scholar] [CrossRef]
- Motor Reliability Working Group. Report of large motor reliability survey of industrial and commercial installations, Part I. IEEE Trans. Ind. Appl. 1985, 21, 853–864. [Google Scholar]
- Motor Reliability Working Group. Report of large motor reliability survey of industrial and commercial installations, Part II. IEEE Trans. Ind. Appl. 1985, 21, 865–872. [Google Scholar]
- Motor Reliability Working Group. Report of large motor reliability survey of industrial and commercial installations, Part III. IEEE Trans. Ind. Appl. 1987, 23, 153–158. [Google Scholar]
- Albrecht, P.F.; Appiarius, J.C.; McCoy, R.M.; Owen, E.L.; Sharma, D.K. Assessment of the reliability of motors in utility applications—Updated. IEEE Power Eng. Rev. 1986, 6, 31–32. [Google Scholar] [CrossRef]
- Li, G.; Li, J.; Fan, H.; Cao, Y.; Xu, M.; Wei, J.; Dong, L. Model-based fault diagnosis method for gyro. In Proceedings of the IEEE 3rd Information Technology, Networking, Electronic and Automation Control Conference (ITNEC), Chengdu, China, 15–17 March 2019; pp. 1004–1007. [Google Scholar]
- Liu, T.; Luo, H.; Yang, Z. A novel data-driven fault classification method and its application to DC motor. In Proceedings of the IEEE International Conference on Industrial Technology (ICIT), Melbourne, Australia, 13–15 February 2019; pp. 1261–1266. [Google Scholar]
- Niu, G. Data-Driven Technology for Engineering System Health Management: Design Approach, Feature Construction, Fault Diagnosis, Prognostics, Fusion and Decisions; Springer Science + Business Media Singapore and Science Press: Beijing, China, 2017. [Google Scholar]
- Case Western Reserve University (CWRU) Bearing Data Center Website. Available online: https://engineering.case.edu/bearingdatacenter (accessed on 21 November 2023).
- Liang, M.; Zhou, K. Probabilistic bearing fault diagnosis using Gaussian process with tailored feature extraction. Int. J. Adv. Manuf. Technol. 2021, 119, 2059–2076. [Google Scholar] [CrossRef]
- Mao, W.T.; Liu, Y.M.; Ding, L.; Li, Y. Imbalanced fault diagnosis of rolling element bearing based on generative adversarial network: A comparative study. IEEE Access 2019, 7, 9515–9530. [Google Scholar] [CrossRef]
- Gao, Y.; Liu, X.; Xiang, J. FEM simulation-based generative adversarial networks to detect bearing faults. IEEE Trans. Ind. Inform. 2020, 16, 4961–4971. [Google Scholar] [CrossRef]
- Zhang, W.; Zhang, P.; He, X.; Zhang, D. Convolutional neural network based two-layer transfer learning for bearing fault diagnosis. IEEE Access 2022, 10, 109779–109794. [Google Scholar] [CrossRef]
- Zhang, R.; Gu, Y. A transfer learning framework with a one-dimensional deep subdomain adaptation network for bearing fault diagnosis under different working conditions. Sensors 2022, 22, 1624. [Google Scholar] [CrossRef]
- Zhu, M.H.; Gupta, S. To prune or not to prune: Exploring the efficacy of pruning for model compression. arXiv 2017, arXiv:1710.01878v2. [Google Scholar]
- Han, S.; Mao, H.; Dally, W.J. Deep compression: Compressing deep neural networks with pruning trained quantization and Huffman coding. arXiv 2016, arXiv:1510.00149v5. [Google Scholar]
- Lin, X.; Zhao, C.; Pan, W. Towards accurate binary convolutional neural network. In Proceedings of the Conference on Neural Information Processing System (NIPS 2017), Long Beach, CA, USA, 4–9 December 2017; pp. 345–353. [Google Scholar]
- Li, F.; Zhang, B.; Liu, B. Ternary weight networks. arXiv 2016, arXiv:1605.04711v2. [Google Scholar]
- Zhou, A.; Yao, A.; Guo, Y.; Xu, L.; Chen, Y. Incremental network quantization: Towards lossless CNNs with low-precision weights. arXiv 2017, arXiv:1702.03044v2. [Google Scholar]
- Sánchez, R.-V.; Lucero, P.; Vásquez, R.E.; Cerrada, M.; Macancela, J.-C.; Cabrera, D. Feature ranking for multi-fault diagnosis of rotating machinery by using random forest and KNN. J. Intell. Fuzzy Syst. 2018, 34, 3463–3473. [Google Scholar] [CrossRef]
- FernáNdez-Francos, D.; Martínez-Rego, D.; Fontenla-Romero, O.; Alonso-Betanzos, A. Automatic bearing fault diagnosis based on one-class v-SVM. Comput. Ind. Eng. 2013, 64, 357–365. [Google Scholar] [CrossRef]
- Fadda, M.L.; Moussaoui, A. Hybrid SOM–PCA method for modeling bearing faults detection and diagnosis. J. Braz. Soc. Mech. Sci. Eng. 2018, 40, 268. [Google Scholar] [CrossRef]
- Al-Raheem, K.F.; Roy, A.; Ramachandran, K.P.; Harrison, D.K.; Grainger, S. Application of the Laplace-wavelet combined with ANN for rolling bearing fault diagnosis. J. Vib. Acoust. 2008, 130, 051007. [Google Scholar] [CrossRef]
- Yang, Y.; Fu, P.; He, Y. Bearing fault automatic classification based on deep learning. IEEE Access 2018, 6, 71540–71554. [Google Scholar] [CrossRef]
- Deng, W.; Liu, H.; Xu, J.; Zhao, H.; Song, Y. An improved quantum-inspired differential evolution algorithm for deep belief network. IEEE Trans. Instrum. Meas. 2020, 69, 7319–7327. [Google Scholar] [CrossRef]
- Xie, Y.; Zhang, T. Feature extraction based on DWT and CNN for rotating machinery fault diagnosis. In Proceedings of the 29th Chinese Control and Decision Conference (CCDC), Chongqing, China, 28–30 May 2017; pp. 3861–3866. [Google Scholar]
- Wen, L.; Li, X.; Gao, L.; Zhang, Y. A new convolutional neural network-based data-driven fault diagnosis method. IEEE Trans. Ind. Electron. 2018, 65, 5990–5998. [Google Scholar] [CrossRef]
- Iqbal, M.; Madan, A.K. CNC machine-bearing fault detection based on convolutional neural network using vibration and acoustic signal. J. Vib. Eng. Technol. 2022, 10, 1613–1621. [Google Scholar] [CrossRef]
- Li, X.; Yu, S.; Lei, Y.; Li, N.; Yang, B. Intelligent machinery fault diagnosis with event-based camera. IEEE Trans. Ind. Inform. 2023. [Google Scholar] [CrossRef]
- Chung, C.-C.; Liang, Y.-P.; Chang, Y.-C.; Chang, C.-M. A binary weight convolutional neural network hardware accelerator for analysis faults of the CNC machinery on FPGA. In Proceedings of the 2023 International VLSI Symposium on Technology, Systems and Applications (VLSI-TSA/VLSI-DAT), Hsinchu, Taiwan, 17–20 April 2023; pp. 1–4. [Google Scholar]
- Zhao, W.; Fu, H.; Luk, W.; Yu, T.; Wang, S.; Feng, B.; Ma, Y.; Yang, G. F-CNN: An FPGA-based framework for training convolutional neural networks. In Proceedings of the Conference on Application-Specific Systems, Architectures and Processors (ASAP 2016), Manchester, UK, 6–8 July 2016; pp. 107–114. [Google Scholar]
- Liu, Z.; Dou, Y.; Jiang, J.; Xu, J.; Li, S.; Zhou, Y.; Xu, Y. Throughput-optimized FPGA accelerator for deep convolutional neural networks. ACM Trans. Reconfigurable Technol. Syst. 2017, 10, 17:1–17:23. [Google Scholar] [CrossRef]
- Dai, R.; Tang, Y. Accelerator implementation of Lenet-5 convolution neural network based on FPGA with HLS. In Proceedings of the Conference on Circuits, System and Simulation (ICCSS), Nanjing, China, 13–15 June 2019; pp. 64–67. [Google Scholar]
- Zhang, L.; Bu, X.; Li, B. XNORCONV: CNNs accelerator implemented on FPGA using a hybrid CNNs structure and an inter-layer pipeline method. IET Image Process. 2019, 14, 105–113. [Google Scholar] [CrossRef]
- Hailesellasie, M.T.; Hasan, S.R. MulNet: A flexible CNN processor with higher resource utilization efficiency for constrained devices. IEEE Access 2019, 7, 47509–47524. [Google Scholar] [CrossRef]
Label | Fault Diameter | Bearing Status |
---|---|---|
0 | No fault | health |
1 | 0.007 inch | ball fault |
2 | 0.007 inch | inner race fault |
3 | 0.007 inch | outer race fault |
4 | 0.014 inch | ball fault |
5 | 0.014 inch | inner race fault |
6 | 0.014 inch | outer race fault |
7 | 0.021 inch | ball fault |
8 | 0.021 inch | inner race fault |
9 | 0.021 inch | outer race fault |
Label | Training Images | Validation Images | Test Images |
---|---|---|---|
0 | 259 | 54 | 100 |
1 | 401 | 120 | 125 |
2 | 398 | 115 | 133 |
3 | 1120 | 260 | 339 |
4 | 465 | 96 | 125 |
5 | 391 | 73 | 113 |
6 | 423 | 114 | 138 |
7 | 400 | 113 | 132 |
8 | 414 | 104 | 129 |
9 | 1046 | 280 | 326 |
Total | 5317 | 1329 | 1660 |
Kernel Size | Training (64%) | Validation (16%) | Test (20%) |
---|---|---|---|
1 × 3 | 92.41% | 90.66% | 92.34% |
1 × 5 | 98.70% | 97.14% | 97.16% |
2 × 2 | 95.35% | 92.55% | 94.09% |
3 × 3 | 97.25% | 94.65% | 94.39% |
Architecture | Output Filter | Parameter | Test | |||
---|---|---|---|---|---|---|
Conv1 | Conv2 | Conv3 | Conv4 | |||
1 | 4 | 8 | 16 | × | 990 | 85.36% |
2 | 8 | 16 | 32 | × | 3570 | 95.54% |
3 | 16 | 32 | 64 | × | 13,530 | 97.34% |
4 | 32 | 64 | 128 | × | 54,150 | 97.53% |
5 | 4 | 8 | 16 | 16 | 2270 | 89.15% |
6 | 8 | 16 | 32 | 32 | 8690 | 97.16% |
7 | 16 | 32 | 64 | 64 | 34,010 | 97.28% |
8 | 32 | 64 | 128 | 128 | 134,570 | 97.46% |
[24] | [25] | [27] | This Work | |
---|---|---|---|---|
Algorithm | DNN | MSIQDE + DBN | 2-D CNN | 2-D CNN |
Parameter (K) | 1110 | 266 | 10,899 | 8.69 |
Category | 7 | 10 | 10 | 10 |
Accuracy | 100% | 99.7% | 99.77% | 97.16% |
Training | Validation | Test | |
---|---|---|---|
Unquantized | 98.70% | 97.14% | 97.16% |
Quantize 50% | 97.98% | 97.74% | 95.78% |
Quantize 75% | 98.00% | 97.81% | 97.04% |
Quantize 87.5% | 98.00% | 97.74% | 96.80% |
Quantize 100% | 97.28% | 95.93% | 95.54% |
Version | Training | Validation | Test |
---|---|---|---|
v1 | 97.28% | 95.93% | 95.54% |
v2 | 92.56% | 91.94% | 92.46% |
v3 | 97.91% | 93.15% | 93.85% |
v4 | 98.45% | 93.07% | 93.43% |
v5 | 96.40% | 93.37% | 92.77% |
v6 | 96.00% | 94.50% | 92.95% |
v7 | 98.56% | 96.08% | 95.48% |
v8 | 94.86% | 93.00% | 94.75% |
v9 | 98.23% | 91.87% | 93.73% |
v10 | 95.33% | 93.30% | 91.38% |
Layer | Maximum | Minimum | Integer Bits |
---|---|---|---|
Conv1 | 13.078 | −13.988 | 5 |
Conv2 | 41.105 | −67.315 | 8 |
Conv3 | 273.757 | −218.08 | 10 |
Conv4 | 1081.706 | −1536.202 | 12 |
FC | 437.778 | −637.388 | 11 |
Label | Test Image | Correct Image | Accuracy |
---|---|---|---|
0 | 100 | 100 | 100% |
1 | 125 | 118 | 94.40% |
2 | 133 | 110 | 82.71% |
3 | 339 | 331 | 97.64% |
4 | 125 | 112 | 89.60% |
5 | 113 | 109 | 96.46% |
6 | 138 | 137 | 99.28% |
7 | 132 | 127 | 96.21% |
8 | 129 | 115 | 89.15% |
9 | 326 | 320 | 98.16% |
Total | 1660 | 1579 | 95.12% |
32-bit Floating Point | This Work | Reduction | |
---|---|---|---|
Weight | 277,760 | 34,720 | 87.5% |
Bias | 320 | 180 | 43.75% |
Input ROM | 131,072 | 36,864 | 71.88% |
Ofmap RAM1 | 262,144 | 147,456 | 43.75% |
Ofmap RAM2 | 131,072 | 73,728 | 43.75% |
Total | 802,368 | 292,948 | 63.49% |
[31] | [32] | [33] | [34] | [35] | This Work | |
---|---|---|---|---|---|---|
Technology | FPGA Altera Stratix V | FPGA VC709 | FPGA ZYBO Z7 | FPGA Zynq-7000 | FPGA Zynq XC7Z045 | FPGA VC707 |
Algorithm | F-CNN | CNN | CNN | CNN | CNN | CNN |
Architecture | LeNet-5 | LeNet | LeNet-5 | LeNet-5 | LeNet | LeNet |
Dataset | MNIST | CIFAR10 | MNIST | MNIST | MNIST | CWRU |
Parameters (K) | 430 | N/A | 13.47 | 61.5 | 33.6 | 8.69 |
Frequency (MHz) | 150 | 100 | 100 | 150 | N/A | 5/140 |
Precision (bits) | 32-bit | 8-bit | 32-bit | N/A | 8-bit | 18-bit |
Power (W) | 27.3 | 25.2 | 1.8 | N/A | 0.029 | 0.342/0.616 |
LUT | 69,510 | 233,215 | 14,659 | 36,798 | 2800 | 9306/9689 |
DSP | 23 | 2907 | 125 | 214 | 5 | 0/0 |
FF | 87,580 | 307,617 | 14,172 | N/A | 2700 | 5703/5703 |
BRAM | 510 | 477 | 119.5 | 123 | 7 | 8.5/8.5 |
GOPS | 62.06 | 424.7 | 0.343 | N/A | 0.1 | 0.025/0.6938 |
GOPS/W | 2.27 | 16.85 | 0.19 | N/A | 3.45 | 0.073/1.126 |
Accuracy | N/A | 79.64% | N/A | 98.4% | 98.68% | 95.12% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Liang, Y.-P.; Hung, M.-Y.; Chung, C.-C. A Multiplier-Free Convolution Neural Network Hardware Accelerator for Real-Time Bearing Condition Detection of CNC Machinery. Sensors 2023, 23, 9437. https://doi.org/10.3390/s23239437
Liang Y-P, Hung M-Y, Chung C-C. A Multiplier-Free Convolution Neural Network Hardware Accelerator for Real-Time Bearing Condition Detection of CNC Machinery. Sensors. 2023; 23(23):9437. https://doi.org/10.3390/s23239437
Chicago/Turabian StyleLiang, Yu-Pei, Ming-You Hung, and Ching-Che Chung. 2023. "A Multiplier-Free Convolution Neural Network Hardware Accelerator for Real-Time Bearing Condition Detection of CNC Machinery" Sensors 23, no. 23: 9437. https://doi.org/10.3390/s23239437
APA StyleLiang, Y.-P., Hung, M.-Y., & Chung, C.-C. (2023). A Multiplier-Free Convolution Neural Network Hardware Accelerator for Real-Time Bearing Condition Detection of CNC Machinery. Sensors, 23(23), 9437. https://doi.org/10.3390/s23239437