Application of a Hybrid Model Based on a Convolutional Auto-Encoder and Convolutional Neural Network in Object-Oriented Remote Sensing Classification
<p>An example of an image object: (<b>a</b>) the original image object; (<b>b</b>) the post-processing image object.</p> "> Figure 2
<p>Architecture of the CAE_CNN model (the green layers represent the feature maps obtained by convolution operations, and the yellow layers represent the feature maps obtained by max pooling operations).</p> "> Figure 3
<p>Example of feature maps (the red box in feature map ② represents a central 53 × 53 grid).</p> "> Figure 4
<p>Variation of each indicator.</p> "> Figure 5
<p>Grey histograms ((<b>a</b>–<b>f</b>) represent the grey histograms of feature maps 1, 2, 3, 8, 10, and 11, respectively. The grey histograms of feature maps 5, 6, 7, and 9 are similar to the grey histogram of feature map 3; and the grey histograms of feature maps 4 and 12 are similar to the grey histogram representing feature map 1.).</p> "> Figure 6
<p>Overall accuracy with the compression ratio change.</p> "> Figure 7
<p>Architecture of the Best_CNN model (the green layers represent the feature maps obtained by convolution operations, and the yellow layers represent the feature maps obtained by max pooling operations).</p> "> Figure 8
<p>Loss function values during model training.</p> "> Figure 9
<p>Classification accuracy with decreasing training set size.</p> ">
Abstract
:1. Introduction
2. Related Work
3. Hybrid Model Based on CAE and CNN
3.1. Method for the Production of an Object-Oriented Remote Sensing Data Set
3.2. Architecture of the Designed Hybrid Model
4. Feature Extraction Based on the CAE Model
5. Parameters of the CAE_CNN Model
6. Results and Discussion
6.1. Overall Accuracy of the Different Models
6.2. Temporal Efficiency of Different Models
6.3. Dependence on Labelled Samples
7. Conclusions
Supplementary Materials
Acknowledgments
Author Contributions
Conflicts of Interest
References
- Bengio, Y. Learning Deep Architectures for AI. Found. Trends Mach. Learn. 2009, 2, 1–127. [Google Scholar] [CrossRef]
- Kavukcuoglu, K.; Sermanet, P.; Boureau, Y.L.; Gregor, K.; Lecun, Y. Learning convolutional feature hierarchies for visual recognition. In Proceedings of the International Conference on Neural Information Processing Systems, Vancouver, Canada, 6–9 December 2010; pp. 1090–1098. [Google Scholar]
- Hu, W.; Huang, Y.Y.; Wei, L.; Zhang, F.; Li, H.C. Deep convolutional neural networks for hyperspectral image classification. J. Sens. 2015, 2, 1–12. [Google Scholar] [CrossRef]
- Yue, J.; Zhao, W.Z.; Mao, S.J.; Liu, H. Spectral–spatial classification of hyperspectral images using deep convolutional neural networks. Remote Sens. Lett. 2015, 6, 468–477. [Google Scholar] [CrossRef]
- Lee, H.; Kwon, H. Contextual deep CNN based hyperspectral classification. In Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, Beijing, China, 10–15 July 2016; p. 1. [Google Scholar]
- Han, X.B.; Zhong, Y.F.; Zhao, B.; Zhang, L.P. Scene classification based on a hierarchical convolutional sparse auto-encoder for high spatial resolution imagery. Int. J. Remote Sens. 2017, 38, 514–536. [Google Scholar] [CrossRef]
- Zhang, F.; Du, B.; Zhang, L.P. Scene classification via a gradient boosting random convolutional network framework. IEEE Trans. Geosci. Remote Sens. 2016, 54, 1793–1802. [Google Scholar] [CrossRef]
- Dstl Satellite Imagery Competition, 1st Place Winner’s Interview: Kyle Lee. Available online: http://blog.kaggle.com/2017/04/26/dstl-satellite-imagery-competition-1st-place-winners-interview-kyle-lee/ (accessed on 12 October 2017).
- Ronneberger, O.; Fischer, P.; Brox, P. U-Net convolutional networks for biomedical image segmentation. In Proceedings of the Medical Image Computing and Computer-Assisted Intervention-MICCAI, Munich, Germany, 5–9 October 2015; pp. 234–241. [Google Scholar]
- Baatz, M.; Schäpe, A. Multiresolution Segmentation: An optimization approach for high quality multi-scale image segmentation. Angew. Geogr. Informationsverarbeitung 2000, 12–23. [Google Scholar]
- Zhang, H.; Liu, X.Y.; Yang, S.; Yu, L.I. Retrieval of remote sensing images based on semisupervised deep learning. J. Remote Sens. 2017, 21, 406–414. [Google Scholar]
- Yang, N. Feature Selection for Object-oriented Classification of High Resolution Remote Sensing Images. Master’s Thesis, Xi’an University of Science and Technology, Xi’an, China, July 2012. [Google Scholar]
- Hinton, G.E.; Osindero, S.; Teh, Y.W. A fast learning algorithm for deep belief nets. Neural Comput. 2006, 18, 1527–1554. [Google Scholar] [CrossRef] [PubMed]
- Simonyan, K.; Zisserman, A. Very deep convolutional networks for large-scale image recognition. arXiv, 2015; arXiv:1409.1556. [Google Scholar]
- He, K.; Zhang, X.Y.; Ren, S.Q.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
- Szegedy, C.; Liu, W.; Jia, Y.Q.; Sermanet, P.; Reed, S.; Anguelov, D.; Erhan, D.; Vanhoucke, V.; Rabinovich, A. Going deeper with convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015; pp. 1–9. [Google Scholar]
- Szegedy, C.; Vanhoucke, V.; Ioffe, S.; Shlens, J.; Wojna, Z. Rethinking the inception architecture for computer vision. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 2818–2826. [Google Scholar]
- Szegedy, C.; Ioffe, S.; Vanhoucke, V.; Alemi, A. Inception-v4, inception-resnet and the impact of residual connections on learning. In Proceedings of Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, San Francisco, CA, USA, 4–9 February 2017. [Google Scholar]
- Chollet, F. Xception: Deep learning with depthwise separable convolutions. arXiv, 2017; arXiv:1610.02357. [Google Scholar]
- Krizhevsky, A. Learning Multiple Layers of Features from Tiny Images. Master’s Thesis, University of Toronto, ON, Canada, April 2009. [Google Scholar]
- Deng, J.; Wei, D.; Richard, S.; Li, L.J.; Li, K.; Li, F.F. Imagenet: A large-scale hierarchical image database. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA, 20–25 June 2009; pp. 248–255. [Google Scholar]
- Lin, T.Y.; Maire, M.; Belongie, S.; Hays, J.; Perona, P.; Ramanan, D.; Dollár, P.; Zitnick, C.L. Microsoft coco: common objects in context. In Proceedings of the European Conference on Computer Vision, Zurich, Switzerland, 6–12 September 2014; pp. 740–755. [Google Scholar]
- Hinton, G.E.; Salakhutdinov, R.R. Reducing the dimensionality of data with neural networks. Science 2006, 313, 504–507. [Google Scholar] [CrossRef] [PubMed]
- Tan, S.Q.; Li, B. Stacked convolutional auto-encoders for steganalysis of digital images. In Proceedings of the Asia-Pacific Signal and Information Processing Association Summit and Conference, Chiang Mai, Thailand, 9–12 December 2014; pp. 1–4. [Google Scholar]
- Geng, C.; Song, J.X. Human Action recognition based on convolutional neural networks with a convolutional auto-encoder. Proceedings of 2015 5th International Conference on Computer Sciences and Automation Engineering, Sanya, China, 14–15 November 2015. [Google Scholar]
- Masci, J.; Meier, U.; Cireşan, D.; Schmidhuber, J. Stacked Convolutional auto-encoders for hierarchical feature extraction. In Proceedings of the International Conference on Artificial Neural Networks, Espoo, Finland, 14–17 June 2011; pp. 52–59. [Google Scholar]
- Zayene, O.; Seuret, M.; Touj, S.M.; Hennebert, J.; Ingold, R.; Ben Amara, N.E. Text detection in Arabic news video based on SWT operator and convolutional auto-encoders. In Proceedings of the Document Analysis Systems, Santorini, Greece, 11–14 April 2016; pp. 13–18. [Google Scholar]
- Han, X.B.; Zhong, Y.F.; Zhao, B.; Zhang, L.P. Unsupervised hierarchical convolutional sparse auto-encoder for high spatial resolution imagery scene classification. In Proceedings of the International Conference on Natural Computation, Zhangjiajie, China, 15–17 August 2015; pp. 42–46. [Google Scholar]
- Maimaitimin, M.; Watanabe, K.; Maeyama, S. Stacked convolutional auto-encoders for surface recognition based on 3d point cloud data. Artif. Life Robot. 2017, 22, 259–264. [Google Scholar] [CrossRef]
- Ioffe, S.; Szegedy, C. Batch normalization: accelerating deep network training by reducing internal covariate shift. In Proceedings of the International Conference on Machine Learning, Lille, France, 6–11 July 2015; pp. 448–456. [Google Scholar]
- Alberti, M.; Seuret, M.; Ingold, R.; Liwicki, M. What You Expect is NOT What You Get! Questioning Reconstruction/Classification Correlation of Stacked Convolutional Auto-Encoder Features. arXiv, 2017; arXiv:1703.04332. [Google Scholar]
- Haralick, R.M. Texture features for image classification. IEEE Trans. Syst. Man Cybern. 1973, 6, 610–621. [Google Scholar] [CrossRef]
- Park, B.; Lawrence, K.C.; Windham, W.R.; Chen, Y.R.; Chao, K. Discriminant analysis of dual-wavelength spectral images for classifying poultry carcasses. Comput. Electron. Agric. 2002, 33, 219–231. [Google Scholar] [CrossRef]
- Liu, Q.; Liu, X.P.; Zhang, L.J.; Zhao, L.M. Image texture feature extraction & recognition of Chinese herbal medicine based on gray level co-occurrence matrix. Adv. Mater. Res. 2013, 605–607, 2240–2244. [Google Scholar] [CrossRef]
- Jiang, J.; Reddy, M. Open-loop rate control for JPEC-LS near lossless image compression. Electron. Lett. 1999, 35, 465–466. [Google Scholar] [CrossRef]
Object | Point 1 | Point 2 | Point 3 | Point 4 | Point 5 | Point 6 | |
---|---|---|---|---|---|---|---|
Band1 | 2908 | 4022 | 3080 | 3957 | 4330 | 3112 | 3665 |
Band4 | 2560 | 3075 | 2176 | 2999 | 3535 | 2342 | 2606 |
Band8 | 3629 | 2695 | 2053 | 2767 | 4405 | 3026 | 3286 |
Road | Forest | Green Space | Water Body | Residence | Total | |
---|---|---|---|---|---|---|
Road | 91 | 3 | 6 | 0 | 0 | 100 |
Forest | 1 | 97 | 2 | 0 | 0 | 100 |
Green space | 1 | 11 | 88 | 0 | 0 | 100 |
Water body | 0 | 0 | 0 | 100 | 0 | 100 |
Residence | 4 | 0 | 0 | 0 | 96 | 100 |
Total | 97 | 111 | 96 | 100 | 96 | 500 |
Producer’s accuracy | 0.938 | 0.874 | 0.917 | 1.000 | 1.000 | |
Overall accuracy | 0.944 | |||||
Kappa value | 0.930 |
Road | Forest | Green Space | Water Body | Residence | Total | |
---|---|---|---|---|---|---|
Road | 0 | 0 | 100 | 0 | 0 | 100 |
Forest | 0 | 0 | 100 | 0 | 0 | 100 |
Green space | 0 | 0 | 100 | 0 | 0 | 100 |
Water body | 0 | 0 | 100 | 0 | 0 | 100 |
Residence | 0 | 0 | 100 | 0 | 0 | 100 |
Total | 0 | 0 | 500 | 0 | 0 | 500 |
Producer’s accuracy | 0.20 | |||||
Overall accuracy | 0.20 | |||||
Kappa value | 0.00 |
Road | Forest | Green Space | Water Body | Residence | Total | |
---|---|---|---|---|---|---|
Road | 84 | 1 | 13 | 0 | 2 | 100 |
Forest | 0 | 95 | 5 | 0 | 0 | 100 |
Green space | 0 | 17 | 83 | 0 | 0 | 100 |
Water body | 1 | 0 | 0 | 99 | 0 | 100 |
Residence | 3 | 0 | 0 | 0 | 97 | 100 |
Total | 88 | 113 | 101 | 99 | 99 | 500 |
Producer’s accuracy | 0.954 | 0.841 | 0.822 | 1 | 0.980 | |
Overall accuracy | 0.916 | |||||
Kappa value | 0.895 |
CNN (CAE_CNN) | Best_CNN | |
---|---|---|
Convolution layer | 1242 | 2895 |
Fully-connected layer | 3005 | 2885 |
Total | 4247 | 5780 |
© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Cui, W.; Zhou, Q.; Zheng, Z. Application of a Hybrid Model Based on a Convolutional Auto-Encoder and Convolutional Neural Network in Object-Oriented Remote Sensing Classification. Algorithms 2018, 11, 9. https://doi.org/10.3390/a11010009
Cui W, Zhou Q, Zheng Z. Application of a Hybrid Model Based on a Convolutional Auto-Encoder and Convolutional Neural Network in Object-Oriented Remote Sensing Classification. Algorithms. 2018; 11(1):9. https://doi.org/10.3390/a11010009
Chicago/Turabian StyleCui, Wei, Qi Zhou, and Zhendong Zheng. 2018. "Application of a Hybrid Model Based on a Convolutional Auto-Encoder and Convolutional Neural Network in Object-Oriented Remote Sensing Classification" Algorithms 11, no. 1: 9. https://doi.org/10.3390/a11010009
APA StyleCui, W., Zhou, Q., & Zheng, Z. (2018). Application of a Hybrid Model Based on a Convolutional Auto-Encoder and Convolutional Neural Network in Object-Oriented Remote Sensing Classification. Algorithms, 11(1), 9. https://doi.org/10.3390/a11010009