A New Lightweight Convolutional Neural Network for Multi-Scale Land Surface Water Extraction from GaoFen-1D Satellite Images
<p>The study area and the GaoFen-1 D dataset (A, B, C, D, E, and G are used for training and F is used for testing).</p> "> Figure 2
<p>Steps of sample generation.</p> "> Figure 3
<p>Flowchart of remote sensing image preprocessing.</p> "> Figure 4
<p>The structure of Squeeze and Excitation (SE) blocks. (<b>a</b>–<b>c</b>) are the structures of cSE, sSE, and scSE, respectively.</p> "> Figure 5
<p>(<b>a</b>–<b>c</b>) Convolution kernels with different dilation rates of 0, 1, and 2, respectively.</p> "> Figure 6
<p>The structure of the Lightweight Multi-Scale Land Surface Water Extraction Network (LMSWENet).</p> "> Figure 7
<p>The accuracy curves of training and validation.</p> "> Figure 8
<p>The loss curves of training and validation.</p> "> Figure 9
<p>Land surface water information extraction results for different water types. (<b>a</b>(<b>1</b>)) is the raw image of artificial ditches; (<b>a</b>(<b>2</b>–<b>7</b>)) are the water areas extracted from (<b>a</b>(<b>1</b>)) using manually labeling, LMSWENet, DeeplabV3+, FCN, PSPNet, and UNet, respectively; (<b>b</b>(<b>1</b>)–<b>g</b>(<b>7</b>)) are the water areas extracted from various types of water with agricultural water, riverside, lakes, open pools, puddles, and tiny streams, respectively.</p> "> Figure 10
<p>Land surface water information extraction results in a confusing area. (<b>a</b>(<b>1</b>)) is the raw image of highways; (<b>a</b>(<b>2</b>–<b>7</b>)) are the water areas extracted from (<b>a</b>(<b>1</b>)) using manually labeling, LMSWENet, DeeplabV3+,FCN, PSPNet, and UNet, respectively; (<b>b</b>(<b>1</b>)–<b>e</b>(<b>7</b>)) are the water areas extracted from different surface environments with mountain shadows, architecture shadows, agricultural land, and playgrounds, respectively.</p> "> Figure 11
<p>Land surface water information extraction results for LMSWENet, LMSWENet “without scSE Blocks”, and LMSWENet “without Dilated Convolutions”. (<b>a</b>(<b>1</b>)) is the raw image of architecture shadows, and (<b>a</b>(<b>2</b>–<b>5</b>)) are the water areas extracted from (<b>a</b>(<b>1</b>)) using manually labeling, LMSWENet, LMSWENet “without scSE Blocks”, and LMSWENet “without Dilated Convolutions”, respectively; (<b>b</b>(<b>1</b>)–<b>e</b>(<b>7</b>)) are the water areas extracted from geography scenes with highways, ponds, small puddles, and rivers, respectively.</p> "> Figure 12
<p>The principle of the “sliding window” prediction method.</p> "> Figure 13
<p>Land surface water map of Wuhan for 2020 (LSWMWH-2020).</p> "> Figure 14
<p>The distribution of validation points in Wuhan. (<b>a</b>,<b>b</b>) are the distributions of random validation points and equidistant validation points, respectively.</p> "> Figure 15
<p>Border area of water and non-water and the water extraction results. (<b>a</b>(<b>1</b>),<b>b</b>(<b>1</b>)) and are the raw image of ponds mixed with vegetation and sediment; (<b>a</b>(<b>2</b>),<b>b</b>(<b>2</b>)) are the water areas extracted from (<b>a</b>(<b>1</b>),<b>b</b>(<b>1</b>)) using LMSWENet; (<b>c</b>(<b>1</b>),<b>c</b>(<b>2</b>)) are the raw image of wetland and the water extraction result.</p> ">
Abstract
:1. Introduction
2. Study Materials
2.1. Study Area and Date
2.2. Sample Generation
3. Method
3.1. Spatial and Channel Squeeze and Excitation
3.2. Dilated Convolution
3.3. LMSWENet
4. Experiment
4.1. Training
4.2. Accuracy Evaluation Criteria
4.3. Comparative Experiment
4.3.1. Comparison of Training Process
4.3.2. Comparison of Performance for Different Water Types
4.3.3. Comparison of Performance for Confusing Area
4.3.4. Comparison of Accuracy Using Evaluation Criteria
4.3.5. Comparison of Performance for LMSWENet, LMSWENet “without scSE Blocks” and LMSWENet “without Dilated Convolutions”
5. Land Surface Water Mapping Method and Accuracy Evaluation
5.1. Land Surface Water Mapping Method
5.2. Land Surface Water Mapping Accuracy Evaluation
6. Discussion
6.1. Effects of the CNN’s Structure on Water Extraction
6.2. Analysis of Mapping Results
7. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
HSRRSI | High Spatial Resolution Remote Sensing Image |
CNN | Convolutional Neural Network |
LMSWENet | Lightweight Multi-Scale Land Surface Water Extraction Network |
FCN | Fully Convolutional Network |
PSPNet | Pyramid Scene Parsing Network |
LSWMWH-2020 | Land Surface Water Map of Wuhan for 2020 |
NDWI | Normalized Difference Water Index |
MNDWI | Modified Normalized Difference Water Index |
GNDWI | Gaussian Normalized Difference Water Index |
SVM | Support Vector Machine |
MLC | Maximum Likelihood Classification |
DT | Decision Tree |
RF | Random Forest |
DS | Decorrelation Stretch |
scSE | Spatial and Channel Squeeze and Excitation (scSE) |
PMS | Panchromatic Multispectral Sensor |
GXL | PCI GeoImaging Accelerator |
cSE | Spatial Squeeze and Channel Excitation |
sSE | Channel Squeeze and Spatial Excitation |
ASPP | Atrous Spatial Pyramid Pooling |
GPU | Graphics Processing Unit |
cuDNN | CUDA Deep Neural Network |
Adam | Adam Adaptive Moment Estimation |
PA | Pixel Accuracy |
ER | Error Rate |
WP | Water Precision |
MP | Mean Precision |
WIoU | Water Intersection over Union |
MIoU | Mean Intersection over Union |
TP | True Positive |
TN | True Negative |
FN | False Negative |
FP | False Positive |
FLOPs | Floating-point Operations |
References
- Feyisa, G.L.; Meilby, H.; Fensholt, R.; Proud, S. Automated Water Extraction Index: A new technique for surface water mapping using Landsat imagery. Remote Sens. Environ. 2014, 140, 23–35. [Google Scholar] [CrossRef]
- Alderman, K.; Turner, L.; Tong, S. Floods and human health: A systematic review. Environ. Int. 2012, 47, 37–47. [Google Scholar] [CrossRef] [Green Version]
- Bond, N.R.; Lake, P.S.; Arthington, A.H. The impacts of drought on freshwater ecosystems: An Australian perspective. Hydrobiologia 2008, 600, 3–16. [Google Scholar] [CrossRef] [Green Version]
- Charron, D.F.; Thomas, M.K.; Waltner-Toews, D.; Aramini, J.J.; Edge, T.; Kent, R.A.; Maarouf, A.R.; Wilson, J. Vulnerability of waterborne diseases to climate change in canada: A review. J. Toxicol. Environ. Health Part A 2004, 67, 1667–1677. [Google Scholar] [CrossRef] [PubMed]
- Kondo, H.; Seo, N.; Yasuda, T.; Hasizume, M.; Koido, Y.; Ninomiya, N.; Yamamoto, Y. Post-flood—Infectious Diseases in Mozambique. Prehospital Disaster Med. 2002, 17, 126–133. [Google Scholar] [CrossRef]
- Lake, P.S. Ecological effects of perturbation by drought in flowing waters. Freshw. Biol. 2003, 48, 1161–1172. [Google Scholar] [CrossRef]
- Li, K.; Wu, S.; Dai, E.; Xu, Z. Flood loss analysis and quantitative risk assessment in China. Nat. Hazards 2012, 63, 737–760. [Google Scholar] [CrossRef]
- Dan, L.I.; Wu, B.; Chen, B.W.; Xue, Y.; Zhang, Y. Review of water body information extraction based on satellite remote sensing. J. Tsinghua Univ. (Sci. Technol.) 2020, 60, 147–161. [Google Scholar] [CrossRef]
- Xiao, Y.F.; Lin, Z. A study on information extraction of water body using band1 and band7 of TM imagery. Sci. Surv. Mapp. 2010, 35, 226–227+216. [Google Scholar] [CrossRef]
- Zhang, M.H. Extracting Water-Body Information with Improved Model of Spectal Relationship in a Higher Mountain Area. Geogr. Geo-Inf. Sci. 2008, 24, 1416. [Google Scholar]
- Warner, T.A.; Nerry, F. Does single broadband or multispectral thermal data add information for classification of visible, near- and shortwave infrared imagery of urban areas? Int. J. Remote Sens. 2009, 30, 2155–2171. [Google Scholar] [CrossRef]
- Acharya, T.D.; Lee, D.H.; Yang, I.T.; Lee, J.K. Identification of Water Bodies in a Landsat 8 OLI Image Using a J48 Decision Tree. Sensors 2016, 16, 1075. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Isikdogan, F.; Bovik, A.C.; Passalacqua, P. Surface Water Mapping by Deep Learning. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2017, 10, 4909–4918. [Google Scholar] [CrossRef]
- Guo, H.; He, G.; Jiang, W.; Yin, R.; Yan, L.; Leng, W. A Multi-Scale Water Extraction Convolutional Neural Network (MWEN) Method for GaoFen-1 Remote Sensing Images. ISPRS Int. J. Geo-Inf. 2020, 9, 189. [Google Scholar] [CrossRef] [Green Version]
- McFeeters, S.K. The use of the Normalized Difference Water Index (NDWI) in the delineation of open water features. Int. J. Remote Sens. 1996, 17, 1425–1432. [Google Scholar] [CrossRef]
- Xu, H. A study on information extraction of water body with the modified normalized difference water index (MNDWI). J. Remote Sens. 2005, 9, 589–595. [Google Scholar]
- Shen, Z.; Xia, L.; Li, J.; Luo, J. Automatic and high-precision extraction of rivers from remotely sensed images with Gaussian normalized water index. J. Image Graph. 2013, 18, 421–428. [Google Scholar]
- Christianini, N.; Shawe-Taylor, J. An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods; Cambridge University Press: Cambridge, UK, 2000. [Google Scholar]
- Deng, R.; Huang, J.; Wang, F.-M. Research on extraction method of water body with DS spectral enhancement based on HJ-1 images. Guang Pu Xue Yu Guang Pu Fen Xi=Guang Pu 2011, 31, 3064–3068. [Google Scholar]
- Friedl, M.; Brodley, C. Decision tree classification of land cover from remotely sensed data. Remote Sens. Environ. 1997, 61, 399–409. [Google Scholar] [CrossRef]
- Breiman, L. Random forests, machine learning 45. J. Clin. Microbiol. 2001, 2, 199–228. [Google Scholar]
- Aung, E.; Tint, T. Ayeyarwady River Regions Detection and Extraction System from Google Earth Imagery. In Proceedings of the 2018 IEEE International Conference on Information Communication and Signal Processing (ICICSP) 2018, Singapore, 28–30 September 2018; pp. 74–78. [Google Scholar]
- Rao, P.; Jiang, W.; Wang, X.; Chen, K. Flood Disaster Analysis Based on MODIS Data—Taking the Flood in Dongting Lake Area in 2017 as an Example. J. Catastroph. 2019, 34, 203–207. [Google Scholar] [CrossRef]
- Penatti, O.; Nogueira, K.; dos Santos, J. Do deep features generalize from everyday objects to remote sensing and aerial scenes domains? In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, Boston, MA, USA, 7–12 June 2015; pp. 44–51. [Google Scholar]
- Huang, B.; Zhao, B.; Song, Y. Urban land-use mapping using a deep convolutional neural network with high spatial resolution multispectral remote sensing imagery. Remote Sens. Environ. 2018, 214, 73–86. [Google Scholar] [CrossRef]
- Cheng, B.; Liang, C.; Liu, X.; Liu, Y.; Ma, X.; Wang, G. Research on a novel extraction method using Deep Learning based on GF-2 images for aquaculture areas. Int. J. Remote Sens. 2020, 41, 3575–3591. [Google Scholar] [CrossRef]
- Shao, Z.; Cai, J.; Fu, P.; Hu, L.; Liu, T. Deep learning-based fusion of Landsat-8 and Sentinel-2 images for a harmonized surface reflectance product. Remote Sens. Environ. 2019, 235, 111425. [Google Scholar] [CrossRef]
- Carvalho, O.; Júnior, O.D.C.; Albuquerque, A.; Bem, P.; Silva, C.; Ferreira, P.; Moura, R.; Gomes, R.; Guimarães, R.; Borges, D. Instance Segmentation for Large, Multi-Channel Remote Sensing Imagery Using Mask-RCNN and a Mosaicking Approach. Remote Sens. 2020, 13, 39. [Google Scholar] [CrossRef]
- LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef]
- LeCun, Y.; Bottou, L.; Bengio, Y.; Haffner, P. Gradient-based learning applied to document recognition. Proc. IEEE 1998, 86, 2278–2324. [Google Scholar] [CrossRef] [Green Version]
- Long, J.; Shelhamer, E.; Darrell, T. Fully Convolutional Networks for Semantic Segmentation. arXiv 2014, arXiv:1605.06211. [Google Scholar]
- Chen, L.-C.; Papandreou, G.; Kokkinos, I.; Murphy, K.; Yuille, A.L. DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs. IEEE Trans. Pattern Anal. Mach. Intell. 2018, 40, 834–848. [Google Scholar] [CrossRef] [PubMed]
- Ronneberger, O.; Fischer, P.; Brox, T. U-Net: Convolutional Networks for Biomedical Image Segmentation. In International Conference on Medical Image Computing and Computer-Assisted Intervention; Springer: Cham, Switzerland, 2015; pp. 234–241. [Google Scholar] [CrossRef] [Green Version]
- Zhao, H.; Shi, J.; Qi, X.; Wang, X.; Jia, J. Pyramid Scene Parsing Network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 6230–6239. [Google Scholar]
- Aceto, G.; Ciuonzo, D.; Montieri, A.; Pescape, A. Mobile Encrypted Traffic Classification Using Deep Learning: Experimental Evaluation, Lessons Learned, and Challenges. IEEE Trans. Netw. Serv. Manag. 2019, 16, 445–458. [Google Scholar] [CrossRef]
- O’Shea, T.; Hoydis, J. An Introduction to Deep Learning for the Physical Layer. IEEE Trans. Cogn. Commun. Netw. 2017, 3, 563–575. [Google Scholar] [CrossRef] [Green Version]
- Jia, H.H.; Yu, G.M. Assessing the Carrying Capacity of Water Resources in Wuhan City, China. Adv. Mater. Res. 2014, 905, 357–361. [Google Scholar] [CrossRef]
- Tengfei, L.; Weili, J.; Guojin, H. Nested Regression Based Optimal Selection (NRBOS) of Rational Polynomial Coefficients. Photogramm. Eng. Remote Sens. 2014, 80, 261–269. [Google Scholar] [CrossRef]
- Wang, T.; Zhang, G.; Jiang, Y.; Wang, S.; Huang, W.; Li, L. Combined Calibration Method Based on Rational Function Model for the Chinese GF-1 Wide-Field-of-View Imagery. Photogramm. Eng. Remote Sens. 2016, 82, 291–298. [Google Scholar] [CrossRef]
- Su, Y.Y.; Li, Y.J.; Zhou, Z.F. Remote sensing image fusion methods and their quality evaluation. Geotechnol. Investig. Surv. 2012, 40, 70–74. [Google Scholar]
- Roy, A.G.; Navab, N.; Wachinger, C. Concurrent Spatial and Channel ‘Squeeze & Excitation’ in Fully Convolutional Networks. In Proceedings of the Medical Image Computing and Computer Assisted Intervention—MICCAI 2018, Granada, Spain, 16–20 September 2018; pp. 421–429. [Google Scholar]
- Liu, S.; Huang, D.; Wang, Y. Receptive Field Block Net for Accurate and Fast Object Detection. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; Springer: Berlin, Germany, 2018; pp. 385–400. [Google Scholar]
- Yu, F.; Koltun, V. Multi-Scale Context Aggregation by Dilated Convolutions. arXiv 2016, arXiv:1511.07122. [Google Scholar]
Category | Extraction Method of Land Surface Water |
---|---|
Threshold algorithm | Threshold value of single wave band [8] |
Law of the spectrum relevance [9,10] | |
Index of water body [15,16,17] | |
Machine learning | Support vector machine [22] |
Maximum likelihood classification [19] | |
Decision tree [20] | |
Random forest [23] | |
Deep learning | Convolutional neural network [13,14] |
Symbol | Description |
---|---|
The input image. | |
The feature map of the channel of an input image. | |
= Hight of a feature map, and = Width of a feature map. | |
A descriptor encoding the global spatial information. | |
The channel dimensions of feature maps | |
The spatial location of a point in the feature map. | |
The vectors obtained through two cascaded fully connected layers | |
Weights of two fully-connected layers. | |
The function | |
The Sigmoid function. | |
The mapping between and . | |
The tensor after spatial-dimension recalibration. | |
A vector in spatial position (i, j) along all channels | |
A projection tensor representing the linearly combined representation. | |
The weight of a convolutional layer. | |
The mapping between and . | |
The tensor after spatial-dimension recalibration. | |
The concatenation of and . |
Images | Central Longitude (°E) | Central Latitude (°N) | Imaging Time (M-D-Y) | Image Size (pixel×pixel) | Image Usage |
---|---|---|---|---|---|
A | 114.1 | 31.3 | 11-09-2020 | 41,329 × 40,988 | Training |
B | 114.0 | 30.8 | 11-09-2020 | 41,335 × 40,983 | Training |
C | 113.8 | 30.2 | 11-09-2020 | 40,004 × 39,643 | Training |
D | 114.7 | 31.3 | 11-13-2020 | 41,188 × 40,839 | Training |
E | 115.1 | 30.8 | 10-11-2020 | 41,070 × 40,715 | Training |
G | 114.4 | 30.2 | 11-13-2020 | 41,224 × 40,854 | Training |
F | 114.5 | 30.8 | 11-13-2020 | 41,211 × 40,876 | Testing |
Parameter | Value |
---|---|
Loss function | Mean cross-entropy |
Optimizer | Adam |
Learning rate | 0.001, reduce to 1/2 of the previous every ten epochs |
Batch size | 4 |
Train epoch | 200 |
Accuracy Evaluation Criteria | Definition | Formula |
---|---|---|
PA | The ratio of the correctly predicted pixel numbers to the total pixel numbers | |
ER | The ratio of erroneously predicted pixel numbers to the total pixel numbers | |
WP | The ratio of the number of correctly predicted water pixels to the number of the labelled water pixels | |
MP | The mean Precision for all classes (water and background) | |
WIoU | The ratio of the intersection to the union of the ground truth water and the predicted water area | |
MIoU | The mean IoU for all classes (water and background) | |
Recall | The proportion of the number of correctly predicted water pixels and the number of the actual target feature pixels | |
F1-Score | The harmonic means for water precision and recall |
CNN | Number of Trainable Parameters (Million) | FLOPs(G) | Training Time (s/epoch) |
---|---|---|---|
LMSWENet | 11.35 | 49.24 | 65.63 |
DeeplabV3+ | 26.71 | 54.80 | 82.53 |
FCN | 18.64 | 80.56 | 75.64 |
PSPNet | 17.46 | 117.28 | 78.86 |
UNet | 17.27 | 160.60 | 72.53 |
CNN | PA | ER | WPI | MPI | WIoU | MIoU | Recall | F1-Score |
---|---|---|---|---|---|---|---|---|
LMSWENet | 98.23% | 1.77% | 98.17% | 98.16% | 92.33% | 94.99% | 93.94% | 96.01% |
DeeplabV3+ | 97.85% | 2.15% | 97.55% | 97.63% | 91.10% | 94.12% | 93.27% | 95.34% |
FCN | 97.96% | 2.04% | 98.52% | 98.08% | 91.43% | 94.36% | 92.71% | 95.52% |
PSPNet | 97.29% | 2.71% | 95.57% | 96.63% | 88.99% | 92.70% | 92.81% | 94.17% |
UNet | 98.07% | 1.93% | 98.29% | 98.09% | 91.76% | 94.60% | 93.25% | 95.70% |
CNN | PA | ER | WPI | MPI | WIoU | MioU | Recall | F1-Score |
---|---|---|---|---|---|---|---|---|
LMSWENet | 98.23% | 1.77% | 98.17% | 98.16% | 92.33% | 94.99% | 93.94% | 96.01% |
LMSWENet “without scSE Blocks” | 97.95% | 2.05% | 98.74% | 98.15% | 92.29% | 94.74% | 93.39% | 95.99% |
LMSWENet “without Dilated Convolutions” | 97.35% | 2.65% | 98.70% | 97.76% | 88.90% | 92.70% | 89.95% | 91.98% |
Type of Validation Point | Classification Data | ||||
---|---|---|---|---|---|
Water | Not Water | Total | |||
Random validation points | Reference data | Water | 91 | 19 | 110 |
Not water | 5 | 235 | 240 | ||
Total | 96 | 254 | 350 | ||
Equidistant validation points | Water | 64 | 10 | 74 | |
Not water | 4 | 266 | 270 | ||
Total | 68 | 276 | 344 |
Type of Validation Point | PA | ER | WPI | MPI | WIoU | MIoU | Recall | F1-Score |
---|---|---|---|---|---|---|---|---|
Random validation points | 93.14% | 6.86% | 82.73% | 90.32% | 79.13% | 84.93% | 94.79% | 88.35% |
Random validation points | 95.93% | 4.07% | 86.45% | 92.50% | 82.05% | 88.53% | 94.12% | 90.14% |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Duan, Y.; Zhang, W.; Huang, P.; He, G.; Guo, H. A New Lightweight Convolutional Neural Network for Multi-Scale Land Surface Water Extraction from GaoFen-1D Satellite Images. Remote Sens. 2021, 13, 4576. https://doi.org/10.3390/rs13224576
Duan Y, Zhang W, Huang P, He G, Guo H. A New Lightweight Convolutional Neural Network for Multi-Scale Land Surface Water Extraction from GaoFen-1D Satellite Images. Remote Sensing. 2021; 13(22):4576. https://doi.org/10.3390/rs13224576
Chicago/Turabian StyleDuan, Yueming, Wenyi Zhang, Peng Huang, Guojin He, and Hongxiang Guo. 2021. "A New Lightweight Convolutional Neural Network for Multi-Scale Land Surface Water Extraction from GaoFen-1D Satellite Images" Remote Sensing 13, no. 22: 4576. https://doi.org/10.3390/rs13224576
APA StyleDuan, Y., Zhang, W., Huang, P., He, G., & Guo, H. (2021). A New Lightweight Convolutional Neural Network for Multi-Scale Land Surface Water Extraction from GaoFen-1D Satellite Images. Remote Sensing, 13(22), 4576. https://doi.org/10.3390/rs13224576