1. Introduction
Wetlands play an important role in ecosystem management, ranging from the local to the global scale [1]. They are defined as the boundary area of terrestrial and aquatic ecosystems, having features of both [2]. Wetlands generate a variety of benefits to society and nature, such as providing fertile soils for agriculture, food, and habitat for shorebirds, generating oxygen, adjusting climate, improving water quality, etc. They have been regarded as one of the most valuable resources in the world [3]. In recent decades, however, the extensive loss of wetlands has occurred in many countries throughout the world due to the high speed of economic development and population increase [4]. Therefore, the study of wetland conservation has generated considerable interest among researchers and achieved significant progress [5].
As a prelude to wetland conservation, the land-use/cover change (LUCC) of wetlands needs to be assessed to obtain thematic information. The conventional approach to LUCC, ground survey, is time-consuming and laborious, and is also prone to errors in inaccessible areas. Satellite multispectral imagery has therefore been widely used in recent years to characterize wetland ecosystems, since it can cover large geographic areas at the spatial and spectral resolutions that different requirements call for. Satellite multispectral imagery is mainly applied in five ways: (1) the survey and recognition of wetland resources [6,7]; (2) the generation of landscape-type thematic maps for wetlands [8,9]; (3) the dynamic change analysis of wetlands [10,11]; (4) the evaluation and inversion of biomass in wetland areas [12,13]; and (5) the evolution of wetland landscape patterns [14].
The quality of the remotely-sensed data and the ability of the classifier directly affect the dynamic spatio-temporal analysis. A remote sensing image with a higher spatial resolution, or a classifier with higher accuracy, leads to an analysis that fits reality better. However, data with higher spatial resolution come at a higher cost. Thus, many researchers have attempted to generate a more accurate spatio-temporal analysis of wetlands by enhancing the classification accuracy. Based on previous studies, a wide array of classification approaches has been developed and applied for monitoring and analyzing the LUCC of wetland resources. Decision trees (DTs), due to their intuitive simplicity and computational efficiency, are widely used. They can easily accommodate data from all measurement scales [15]. Many applications have been explored for various wetlands [16,17,18]. However, DTs are prone to overfitting, which lowers classification accuracy. Resampling statistical methods, like bagging and boosting, can mitigate this issue and enhance accuracy, but they also make the interpretation of the results more complex [19]. Maximum likelihood classification (MLC), according to Ozesmi and Bauer’s research [4], leads to higher accuracy than DT and is the most commonly used supervised classification algorithm. Moreover, a large number of classification algorithms based on machine learning have been proposed recently and implemented successfully in various fields with higher accuracy, such as support vector machines (SVMs) [20] and artificial neural networks (ANNs) [21]. However, some of the machine learning approaches are highly complex and computationally expensive. Thus, how to generate the LUCC with high accuracy and efficiency is still the main issue in the spatio-temporal analysis of wetlands.
The main purpose of this study is to analyze the evolution of Chongming Dongtan with high accuracy and efficiency using remote sensing data, an analysis that rests mainly on the classification results. Therefore, we took a first step to perform the LUCC for the Chongming Dongtan wetland using an optimized extreme learning machine (ELM) [22], namely the Kernel-ELM (K-ELM). Then, in order to demonstrate the performance and benefits of K-ELM, we conducted a comparative study of K-ELM, MLC, SVM, and conventional ELM with multispectral remotely-sensed data from 2009. The spatio-temporal dynamic changes were analyzed in terms of area change between different years. Finally, the experimental results allow a clear conclusion to be made about the dynamic change of Chongming Dongtan from 1986 to 2013. The spatio-temporal analysis of the Dongtan wetland will help researchers learn about its evolution more completely. It also makes the relationship between different objects more intuitive to analyze. Generally, the novelty of our research is divided into two parts: higher classification accuracy and efficiency generated by K-ELM for the Chongming Dongtan wetland, and a long-term analysis of spatio-temporal changes of the Dongtan wetland. The results could provide a reliable scientific basis for urban planning, ecological resource management, and sustainable development.
2. Study Area and Data Used
The Dongtan wetland is at the mouth of the Yangtze River, located in the far east of Chongming Island, in Shanghai, China, between 121°50′ and 122°05′ east longitude, and ranging from 31°25′ to 31°38′ north latitude. Due to the influence of subtropical oceanic monsoons, it has a mild and moist climate. Since 1949, it has undergone several reclamations and seven dikes (shown in Figure 1) have been built to protect the bank and to promote siltation. Nowadays, the wetland consists of freshwater, salt marshes (mainly dominated by Spartina alterniflora, Phragmites australis, and Scirpus mariqueter) [23], tidal creeks, and intertidal mudflats. Due to its unique climate and ecosystem, it has been regarded as an important habitat for many wildlife species and a migratory stopover site for shorebirds along the East Asian–Australasian Flyway. However, with the increase in development intensity and human activities, the Dongtan wetland also faces many common ecological problems, such as sea level rise, inundation, alien species invasion, and fresh water shortage. In short, human interference exceeds the wetland’s ability to maintain its biodiversity. On the other hand, biological succession in the mudflat wetland, which forms the eastern beach of Chongming Island, extends further each year, which has made Chongming Island a rapidly developing and globally rare ecosystem.
Eight cloudless and terrain-corrected Landsat images of Chongming Island from 1986 to 2013 were utilized in this study (Table 1). Growing season (July and August) images were selected for two reasons. At this time of year, the different kinds of vegetation are all at a stable stage, and the climatic conditions make it easier to obtain cloud-free images. The experimental data comprise six original spectral bands (bands 1–5 and 7 for TM and ETM+, bands 2–7 for OLI) with 30 m spatial resolution.
A reference dataset containing 437 samples was constructed from the ground survey in 2009 and was used to train the classifier and evaluate the classification results. Because the bare flat is distributed along the coastline, it is difficult and dangerous to obtain enough ground survey points there. We therefore obtained only 32 samples for the bare flat, while each of the other nine classes contains 45 samples.
3. Methodology
3.1. Extreme Learning Machine
The extreme learning machine (ELM) is a simple learning method for single-hidden layer feed-forward networks (SLFNs), randomly assigning the input weights and hidden layer biases [24].
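The output of a conventional SLFN with $L$ hidden nodes, whose output nodes are linear, can be obtained for $N$ training samples by:

$$\sum_{i=1}^{L} \boldsymbol{\beta}_i \, g(\mathbf{w}_i \cdot \mathbf{x}_j + b_i) = \mathbf{t}_j, \quad j = 1, \ldots, N \tag{1}$$

where $\mathbf{x}_j$ is the input vector with m nodes; $\mathbf{t}_j$ is the output vector with n nodes; $\mathbf{w}_i$ is the weight vector connecting the input nodes and the ith hidden node; $\boldsymbol{\beta}_i$ is the weight vector connecting the output nodes and the ith hidden node; $b_i$ is the bias of the ith hidden node; and $g(\cdot)$ is the activation function of the hidden layer. Then Equation (1) can be written as:

$$\mathbf{H}\boldsymbol{\beta} = \mathbf{T} \tag{2}$$

where $\boldsymbol{\beta} = [\boldsymbol{\beta}_1, \ldots, \boldsymbol{\beta}_L]^{T}$, $\mathbf{T} = [\mathbf{t}_1, \ldots, \mathbf{t}_N]^{T}$, and $\mathbf{H}$ is the hidden layer output matrix of the neural network, which can be expressed as [25]:

$$\mathbf{H} = \begin{bmatrix} g(\mathbf{w}_1 \cdot \mathbf{x}_1 + b_1) & \cdots & g(\mathbf{w}_L \cdot \mathbf{x}_1 + b_L) \\ \vdots & \ddots & \vdots \\ g(\mathbf{w}_1 \cdot \mathbf{x}_N + b_1) & \cdots & g(\mathbf{w}_L \cdot \mathbf{x}_N + b_L) \end{bmatrix}_{N \times L} \tag{3}$$

According to Huang et al. [24], the input weights and hidden layer biases can be randomly assigned provided that the activation function is infinitely differentiable. The least-squares solution of the weight vector $\boldsymbol{\beta}$ is expressed as:

$$\hat{\boldsymbol{\beta}} = \mathbf{H}^{\dagger}\mathbf{T} \tag{4}$$

where $\mathbf{H}^{\dagger}$ is the Moore-Penrose generalized inverse of $\mathbf{H}$ [26]. $\mathbf{H}^{\dagger}$ can be obtained, e.g., by orthogonal projection, an iterative method, or singular value decomposition. When $\mathbf{H}\mathbf{H}^{T}$ is nonsingular:

$$\mathbf{H}^{\dagger} = \mathbf{H}^{T}\left(\mathbf{H}\mathbf{H}^{T}\right)^{-1} \tag{5}$$

Using ridge regression theory [27], the resultant solution is more stable and tends to have better generalization performance when a positive value is added to the diagonal of $\mathbf{H}\mathbf{H}^{T}$ [28]. Then the solution of $\boldsymbol{\beta}$ becomes:

$$\hat{\boldsymbol{\beta}} = \mathbf{H}^{T}\left(\frac{\mathbf{I}}{C} + \mathbf{H}\mathbf{H}^{T}\right)^{-1}\mathbf{T} \tag{6}$$

where $\mathbf{I}$ is the identity matrix and $C$ is the regularization parameter. In our study we set $C$ to a fixed value.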
Hence the learning procedure of SLFNs is simplified as follows:
- Step 1: Define the training set, the activation function, and the number of hidden nodes.
- Step 2: Randomly generate the input weights and biases.
- Step 3: Calculate the hidden layer output matrix according to Equation (3).
- Step 4: Obtain the output weights based on Equation (6).
ELM is a fast learning algorithm in which the hidden layer does not need to be tuned. It has been demonstrated that ELM has higher scalability and less computational complexity than the conventional feedforward network learning algorithms [28].
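The four-step procedure can be illustrated with a minimal NumPy sketch. It is not the implementation used in this study: the sigmoid activation, the Gaussian random weights, the default values of `n_hidden` and `C`, and the function names `train_elm` and `predict_elm` are all assumptions made for illustration, and the output weights are assumed to follow the common ridge-regularized form $\mathbf{H}^{T}(\mathbf{I}/C + \mathbf{H}\mathbf{H}^{T})^{-1}\mathbf{T}$.

```python
import numpy as np

def _hidden_output(X, W, b):
    # Sigmoid activation (an assumed choice) applied to the random projection.
    return 1.0 / (1.0 + np.exp(-(X @ W + b)))

def train_elm(X, T, n_hidden=50, C=1.0, seed=0):
    """X: (N, m) inputs; T: (N, n) one-hot targets. Returns (W, b, beta)."""
    rng = np.random.default_rng(seed)
    # Step 2: randomly generate input weights and hidden biases (never tuned).
    W = rng.normal(size=(X.shape[1], n_hidden))
    b = rng.normal(size=n_hidden)
    # Step 3: hidden layer output matrix H, shape (N, n_hidden).
    H = _hidden_output(X, W, b)
    # Step 4: ridge-regularized output weights, H^T (I/C + H H^T)^{-1} T.
    A = np.eye(X.shape[0]) / C + H @ H.T
    beta = H.T @ np.linalg.solve(A, T)
    return W, b, beta

def predict_elm(X, W, b, beta):
    # The class with the largest linear output wins.
    return np.argmax(_hidden_output(X, W, b) @ beta, axis=1)
```

Only the linear system in Step 4 is solved during training, which is why no iterative tuning of the hidden layer is involved.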
3.2. Kernel-Extreme Learning Machine
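In this study an optimized ELM on the basis of kernel theory was used, namely the kernel-extreme learning machine (K-ELM) proposed by Yao et al. [29]. In the K-ELM, the activation function is replaced by a ‘kernel trick’ [30]. Theoretically, since K-ELM combines a kernel trick with ELM, it gives a better general performance and less computational complexity than the conventional popular learning algorithms for feedforward neural networks. Based on the kernel theory, the kernel function for an SLFN [29] can be defined as:

$$\boldsymbol{\Omega}_{ELM} = \mathbf{H}\mathbf{H}^{T}, \quad \Omega_{ELM\,i,j} = \mathbf{h}(\mathbf{x}_i) \cdot \mathbf{h}(\mathbf{x}_j) = K(\mathbf{x}_i, \mathbf{x}_j) \tag{7}$$

where $\mathbf{h}(\mathbf{x})$ is the hidden layer output (a row of $\mathbf{H}$) for input $\mathbf{x}$. Solution (6) can now be rewritten as:

$$f(\mathbf{x}) = \mathbf{h}(\mathbf{x})\mathbf{H}^{T}\left(\frac{\mathbf{I}}{C} + \mathbf{H}\mathbf{H}^{T}\right)^{-1}\mathbf{T} = \begin{bmatrix} K(\mathbf{x}, \mathbf{x}_1) & \cdots & K(\mathbf{x}, \mathbf{x}_N) \end{bmatrix}\left(\frac{\mathbf{I}}{C} + \boldsymbol{\Omega}_{ELM}\right)^{-1}\mathbf{T} \tag{8}$$

In K-ELM we only need to define the type of kernel function instead of the activation function and the number of hidden nodes, which removes the random feature mapping and makes the machine much more stable. In our experiment, a third-order polynomial function was used as the kernel function.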
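A compact NumPy sketch of a kernel ELM with a third-order polynomial kernel is given below for illustration. It assumes the kernel has the form $(\mathbf{u} \cdot \mathbf{v} + c_0)^3$ with $c_0 = 1$ and that the output is obtained by solving $(\mathbf{I}/C + \boldsymbol{\Omega})\boldsymbol{\alpha} = \mathbf{T}$; the function names, the offset `coef0`, and the default `C` are hypothetical and are not taken from the study.

```python
import numpy as np

def poly_kernel(A, B, degree=3, coef0=1.0):
    # Polynomial kernel matrix between the rows of A and the rows of B.
    # coef0 is an assumed offset; the source only states "third-order".
    return (A @ B.T + coef0) ** degree

def train_kelm(X, T, C=1.0):
    """X: (N, m) training inputs; T: (N, n) one-hot targets."""
    omega = poly_kernel(X, X)  # kernel matrix replaces H H^T
    # Solve (I/C + Omega) alpha = T rather than forming an explicit inverse.
    return np.linalg.solve(np.eye(len(X)) / C + omega, T)

def predict_kelm(X_new, X_train, alpha):
    # f(x) = [K(x, x_1), ..., K(x, x_N)] alpha; pick the largest output.
    return np.argmax(poly_kernel(X_new, X_train) @ alpha, axis=1)
```

No hidden-layer size or random weights appear anywhere in the sketch; the kernel type and the regularization parameter are the only choices left to the user.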
3.3. Support Vector Machine
Support vector machines (SVMs) are based on the statistical learning theory developed by Vapnik [20], aiming to find a separating hyperplane with the maximum geometric margin between classes. SVMs have been widely introduced for solving classification, regression, and other learning problems. The ν-support vector classification (ν-SVM) [31] was selected to recognize the different objects of the Dongtan wetland in this study. The primal optimization problem can be represented by:

$$\min_{\mathbf{w},\, b,\, \boldsymbol{\xi},\, \rho} \; \frac{1}{2}\mathbf{w}^{T}\mathbf{w} - \nu\rho + \frac{1}{l}\sum_{i=1}^{l}\xi_i \quad \text{subject to} \quad y_i\left(\mathbf{w}^{T}\phi(\mathbf{x}_i) + b\right) \geq \rho - \xi_i, \;\; \xi_i \geq 0, \;\; i = 1, \ldots, l, \;\; \rho \geq 0 \tag{9}$$

where $\mathbf{x}_i$ are the $l$ training vectors; $\phi$ maps $\mathbf{x}_i$ into a higher-dimensional space by means of a kernel trick; $\nu$ is a parameter giving an upper bound on the fraction of training errors and a lower bound on the fraction of support vectors [32]; $y_i$ is the class label; $\xi_i$ are slack variables; $\rho$ is an additional variable; $\mathbf{w}$ and $b$ are the components of the separating hyperplane:

$$\mathbf{w}^{T}\phi(\mathbf{x}) + b = 0 \tag{10}$$

where $\mathbf{x}$ is a matrix of reflectance of pure endmembers.
This optimization problem can be solved by obtaining its Lagrangian and transforming it to its dual problem [31].
3.4. Maximum Likelihood Classification
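Maximum likelihood classification is one of the most commonly used parametric supervised classifiers [33]. This method distinguishes objects by computing the weighted distance or likelihood based on Bayesian theory [34]. The discriminant function $g_i(\mathbf{x})$ for each pixel $\mathbf{x}$ is:

$$g_i(\mathbf{x}) = \ln p(\omega_i) - \frac{1}{2}\ln\left|\boldsymbol{\Sigma}_i\right| - \frac{1}{2}(\mathbf{x} - \mathbf{m}_i)^{T}\boldsymbol{\Sigma}_i^{-1}(\mathbf{x} - \mathbf{m}_i) \tag{11}$$

where $p(\omega_i)$, $\boldsymbol{\Sigma}_i$, and $\mathbf{m}_i$ are the prior probability, covariance, and mean value of the $i$th class, respectively.
In a hard classification, each object is assigned to the class for which it has the highest probability.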
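As an illustration, the Gaussian discriminant and the hard assignment rule can be sketched in NumPy. This assumes the standard log-likelihood form $\ln p(\omega_i) - \tfrac{1}{2}\ln|\boldsymbol{\Sigma}_i| - \tfrac{1}{2}(\mathbf{x}-\mathbf{m}_i)^{T}\boldsymbol{\Sigma}_i^{-1}(\mathbf{x}-\mathbf{m}_i)$, with priors estimated from class frequencies in the training set; the function names are hypothetical.

```python
import numpy as np

def fit_mlc(X, y):
    """Estimate prior, mean vector and covariance matrix for every class."""
    params = {}
    for c in np.unique(y):
        Xc = X[y == c]
        params[int(c)] = (len(Xc) / len(X), Xc.mean(axis=0), np.cov(Xc, rowvar=False))
    return params

def discriminant(x, prior, mean, cov):
    d = x - mean
    _, logdet = np.linalg.slogdet(cov)  # log-determinant, numerically stable
    return np.log(prior) - 0.5 * logdet - 0.5 * d @ np.linalg.solve(cov, d)

def classify(x, params):
    # Hard classification: pick the class with the largest discriminant.
    return max(params, key=lambda c: discriminant(x, *params[c]))
```

Each class needs more training samples than feature dimensions so that its covariance matrix stays nonsingular.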
4. Experiment
Figure 2 presents a flowchart of the main process in this study. To validate the effectiveness and efficiency of K-ELM, a comprehensive comparison between K-ELM and three other conventional methods was made using the 2009 image. In the comparison experiment the classification results were evaluated by three accuracy indices (overall, producer’s, and user’s accuracy) and by computation time. Then the spatio-temporal analysis of the Dongtan wetland from 1986 to 2013 was generated with the classification results of the K-ELM.
4.1. Pre-Processing
In order to eliminate the atmospheric influence on radiance in the remote sensing imagery and to retrieve the real surface reflectance, the atmospheric interference was removed by FLAASH [35] in this study. Ten ground control points were selected for the geometric correction. The root mean square errors of the images were all less than one pixel.
4.2. Band Analysis and Feature Space Construction
The feature space consists of an optimal band combination, the normalized difference vegetation index (NDVI), and the wetness dimension of the Kauth–Thomas (K-T) transformation.
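For reference, NDVI is the normalized difference between near-infrared and red reflectance. The sketch below assumes per-pixel surface reflectance values as inputs (near-infrared and red are bands 4 and 3 for TM/ETM+ and bands 5 and 4 for OLI); the zero-denominator guard is an illustrative choice, not something described in the study.

```python
def ndvi(nir, red):
    """Normalized difference vegetation index for one pixel."""
    total = nir + red
    # Guard against division by zero for pixels with no signal.
    return 0.0 if total == 0 else (nir - red) / total
```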
The optimum index factor (OIF) [36], which weighs the variance of each individual band, was utilized to select the optimal band combination. According to the results of the OIF, bands 3–5 (red, near-infrared, and short-wave infrared) for TM/ETM+ data and bands 3, 5, and 6 (green, near-infrared, and short-wave infrared) for OLI data were chosen.
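A small sketch of an OIF-based band selection follows. It assumes the commonly used definition, in which the OIF of a band combination is the sum of the band standard deviations divided by the sum of the absolute pairwise correlation coefficients; whether the study used exactly this variant is not stated. The `bands` mapping (band name to a flat list of pixel values) and the function names are hypothetical.

```python
from itertools import combinations
from statistics import mean, pstdev

def pearson(a, b):
    # Population Pearson correlation between two equally long pixel lists.
    ma, mb = mean(a), mean(b)
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b)) / len(a)
    return cov / (pstdev(a) * pstdev(b))

def oif(bands, combo):
    # High spread and low redundancy between bands give a high OIF.
    std_sum = sum(pstdev(bands[k]) for k in combo)
    corr_sum = sum(abs(pearson(bands[i], bands[j])) for i, j in combinations(combo, 2))
    return std_sum / corr_sum

def best_combo(bands, size=3):
    # Exhaustively rank all band combinations of the requested size.
    return max(combinations(sorted(bands), size), key=lambda c: oif(bands, c))
```

With six bands there are only 20 three-band combinations, so the exhaustive search is cheap.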
4.3. Classifier Training and Accuracy Evaluation
According to the ground survey in 2009, the classification system to the west of the dike consists of green land, building land, farmland, water area, reclaimed land, and bare land, while to the east it comprises Spartina alterniflora, Phragmites australis, Scirpus mariqueter, and bare flat. Therefore, there are ten distinct classes in total in our study area.
For the 2009 image, a total of 437 reference samples were obtained from the ground survey, and the training and testing samples were selected from this reference dataset. A three-fold cross-validation was applied to evaluate the classification accuracy. Thus, 292 training samples, selected randomly and separately from each class in proportion to its size, were utilized to train the classifier, and the remaining 145 samples were regarded as testing data. For images of other years the training samples were defined by a priori knowledge. The final selected training samples for each class have a separability greater than 1.9 in ENVI software (Harris Geospatial Solutions, Broomfield, CO, USA) (Deng 2010). The number of training samples for each class in other years is equal to that in the 2009 image.
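The proportional, per-class selection of training and testing samples can be sketched as a stratified three-fold split. The helper below is an illustrative assumption about how such folds might be built, not the procedure actually used; the function name and seed are hypothetical.

```python
import random

def stratified_folds(labels, n_folds=3, seed=0):
    """Return n_folds lists of sample indices, each stratified by class."""
    rng = random.Random(seed)
    by_class = {}
    for idx, lab in enumerate(labels):
        by_class.setdefault(lab, []).append(idx)
    folds = [[] for _ in range(n_folds)]
    for idxs in by_class.values():
        rng.shuffle(idxs)  # random selection within each class
        for k in range(n_folds):
            # Every n_folds-th sample of the class goes to fold k.
            folds[k].extend(idxs[k::n_folds])
    return folds
```

With nine classes of 45 samples and one class of 32, the folds hold 146, 146, and 145 samples, so holding out the smallest fold gives the 292/145 split quoted above, while the other two folds give 291/146.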
In this study we applied a K-ELM with a third-order polynomial kernel function to distinguish between the objects. The numbers of nodes in the input layer and output layer are equal to the dimension of the feature space and the number of classes, respectively. To examine the effectiveness of the K-ELM, it was compared with three other methods (MLC, SVM, and standard ELM) using the 2009 image. The assessment is divided into two parts: accuracy and efficiency. The classification accuracy was based on the three-fold cross-validation result and evaluated by the average values of overall accuracy, producer’s accuracy, and user’s accuracy. In addition, computation time was recorded and used to compare efficiency.
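The three accuracy indices can all be read off a confusion matrix. The sketch below assumes rows index the reference (ground survey) class and columns index the predicted class; the function name is hypothetical.

```python
def accuracy_metrics(cm):
    """cm[i][j]: number of samples of reference class i predicted as class j."""
    n = len(cm)
    total = sum(sum(row) for row in cm)
    # Overall accuracy: correctly classified samples over all samples.
    overall = sum(cm[i][i] for i in range(n)) / total
    # Producer's accuracy: correct samples over the reference total (row sum).
    producers = [cm[i][i] / sum(cm[i]) for i in range(n)]
    # User's accuracy: correct samples over the predicted total (column sum).
    users = [cm[i][i] / sum(cm[r][i] for r in range(n)) for i in range(n)]
    return overall, producers, users
```

Under cross-validation, the metrics would be computed once per fold and then averaged.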
4.4. Land-Use/Cover Change
Based on the classification results, a quantitative change detection was generated by calculating the area difference between images. An index called the LUCC dynamic degree was developed to measure the rate of change, which gives a better description of the quantitative change of area year by year. The expression is given by:

$$K = \frac{U_b - U_a}{U_a} \times 100\% \tag{12}$$

where $K$ is the LUCC dynamic degree; $U_a$ and $U_b$ are the area of a given class in the year of $a$ and $b$, respectively. After obtaining each year’s LUCC dynamic degree, we analyzed the dynamic spatio-temporal change.
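A minimal sketch of the dynamic degree computation is given below. It assumes the index is the relative area change between two dates expressed as a percentage; the optional `years` argument, which turns this into an annualized rate, is an added assumption, as are the function name and the example values in the comment.

```python
def dynamic_degree(area_start, area_end, years=None):
    """Relative change in class area between two dates, in percent."""
    change = (area_end - area_start) / area_start * 100.0
    # Dividing by the interval length gives a per-year rate (assumed variant).
    return change if years is None else change / years

# Illustrative values only: a class growing from 50 to 60 km^2 over 5 years
# changes by 20% in total, or 4% per year.
```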
5. Results
5.1. Classification Result and Evaluation
To examine the performance and benefits of the K-ELM method the classification accuracy and efficiency were evaluated by comparing them with the other three commonly utilized methods: MLC, ELM, and SVM.
Figure 3 shows the 2009 classification results of the different methods. Different colors indicate different objects. According to the ground survey data, the result of the K-ELM is more consistent with the actual distribution. This is especially true for the east part (marked by a black rectangle), which in practice is mainly occupied by Phragmites australis, Scirpus mariqueter, and bare flat: K-ELM distinguishes them clearly, while MLC and SVM misclassify some areas as grassland and Spartina alterniflora, respectively. The ELM could not effectively recognize Scirpus mariqueter either.
Then, based on the classification results, the accuracy of the different methods was assessed with the producer’s and user’s accuracy for each class (given in Table 2) and the overall accuracy for the entire research area (given in Table 3). The computation time, also shown in Table 3, was recorded to assess the efficiency of each method. It can be seen that the K-ELM performs best with the highest overall accuracy (86.07%), followed by the SVM (85.35%). The conventional ELM, whose accuracy is 82.59%, performs worse than the SVM, while the MLC has the lowest accuracy, 79.31%. However, the classification ability of the four classifiers varies slightly between classes. K-ELM generates both the highest producer’s and user’s accuracy for Spartina alterniflora, Scirpus mariqueter, Phragmites australis, reclamation land, and water, while SVM and ELM work better for farmland and green land. MLC is more suitable for building land and bare land recognition. As for efficiency, SVM is the most time consuming, with a computation time more than twice as long as that of ELM. The computation time of K-ELM is 1.38 s, which is much less than that of MLC and SVM.
Generally, compared with the conventional ELM, the ‘kernel trick’ increases the classification accuracy of K-ELM significantly, especially for the area where Spartina alterniflora, Scirpus mariqueter, and Phragmites australis are mixed. Even though the ‘kernel trick’ limits efficiency to some extent, the efficiency of K-ELM is still remarkable and much higher than that of SVM and MLC. Overall, the K-ELM is advantageous when applied to the LUCC of the Dongtan wetland on Chongming Island.
5.2. Dynamic Spatio-Temporal Analysis
Eight images, from 1986 to 2013, were classified by the K-ELM, shown in Figure 4, after which the change detection and spatio-temporal analysis were performed.
The area statistics of the ten classes and the corresponding LUCC dynamic degree are given in Table 4 and Table 5, respectively. In order to make the results more intuitive, the corresponding bar graph of area statistics is shown in Figure 5. Among the classes, farmland increased quickly during 1990–1995 and 1995–2000, with growth rates of 27.41% and 15.73%, respectively. Its changing trend has a strong relationship with the two dikes built in 1991 and 1998. The average annual growth is 4.07 km², whereas the total area changed only slightly after the dike was built in 1998.
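Per-class area statistics of this kind can be derived from a classified map by counting pixels and multiplying by the pixel footprint. The sketch below assumes a 30 m pixel (900 m², or 0.0009 km²) and a classification stored as nested lists of class labels; the function name and data layout are hypothetical.

```python
from collections import Counter

def class_areas_km2(class_map, pixel_size_m=30.0):
    """Area of each class in km^2 from a 2-D grid of class labels."""
    counts = Counter(label for row in class_map for label in row)
    pixel_km2 = pixel_size_m ** 2 / 1e6  # 30 m x 30 m = 0.0009 km^2
    return {label: n * pixel_km2 for label, n in counts.items()}
```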
The increase in building land reached its peak between 2007 and 2009, at 13.73 km², due to the development of the tourist industry, indicating quick economic development and an increasing population on Chongming Island during that period.
The reclaimed land tended to increase whenever a new dike was built. After 2003 it began to decline, falling at its highest rate, −36.23%, from 2007 to 2009.
The water area is comprised of an east part (outside the dike) and a west part (inside the dike). The east water area depends on the sea tide, while the west part, which mainly consists of lakes and rivers, is strongly affected by rainfall. It generally has a stable trend.
The change in green land is greatly influenced by the increase in farmland. Before 2003 the green land area kept increasing, but after that the trend reversed. By analyzing the classification results in different years, we found that the lost green land was converted to farmland.
The three kinds of vegetation (Spartina alterniflora, Phragmites australis, Scirpus mariqueter) differed distinctly from each other. Spartina alterniflora, after being introduced into Chongming Island in the middle of the 1990s, shows an obvious trend of southward expansion while taking over the growth area of Phragmites australis. This is consistent with the area statistics of Spartina alterniflora and Phragmites australis. Scirpus mariqueter is located in the east of the wetland. Therefore, its area is greatly affected by the tide and cannot be extracted accurately.
The bare flat extends eastward and its expansion in a northerly direction is larger than that towards the south. Though it did not change much from 2003 to 2007, its total area still kept increasing.
To visualize the change analysis, change detection maps were generated for the different periods. Since our study focuses on the change of the wetland, which is bounded by the dikes and consists of Spartina alterniflora, Phragmites australis, Scirpus mariqueter, and bare flat, the change detection only contains these four objects, as shown in Figure 6, where yellow indicates the increasing area of the wetland and blue shows the decreasing area.
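The gain/loss map described here can be sketched as a per-pixel comparison of wetland membership between two classified maps. The class label strings, the `WETLAND` set, and the function name below are hypothetical placeholders, and the two maps are assumed to be co-registered grids of equal size.

```python
# Hypothetical labels for the four wetland classes east of the dike.
WETLAND = {"spartina", "phragmites", "scirpus", "bare_flat"}

def wetland_change(before, after, wetland=WETLAND):
    """Label each pixel as wetland 'gain', 'loss' or 'same' between two dates."""
    result = []
    for row_before, row_after in zip(before, after):
        out = []
        for cb, ca in zip(row_before, row_after):
            was, now = cb in wetland, ca in wetland
            if now and not was:
                out.append("gain")   # drawn in yellow in the change map
            elif was and not now:
                out.append("loss")   # drawn in blue in the change map
            else:
                out.append("same")
        result.append(out)
    return result
```

A transition between two wetland classes, for example Scirpus mariqueter to Phragmites australis, counts as "same" because the pixel remains wetland.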
It can be seen that the wetland extended quickly towards the east and its area generally kept increasing, except for the period between 2003 and 2007. Comparing the classification results of 2003 and 2007 shows that the large area of wetland that disappeared was covered by sea water, which might have been caused by tidal influence. During two other periods, 1990 to 1995 and 2000 to 2003, the main reason for the decrease in wetland area was that two new dikes were finished in 1991 and 2001, respectively. The region of the wetland was redefined each time a new dike was built, and the part to the west of the new dike was no longer part of the wetland. Dikes built at different times are shown in Figure 1. Overall, the change in the wetland has a strong relationship with dike building.
6. Discussion
Compared with the conventional ANNs, the K-ELM combines ELM theory with a kernel trick. The users, therefore, only need to select a kernel function when applying it, instead of defining the activation function, the number of hidden nodes, and the input weights. The nonlinear mapping of an ELM is replaced by the kernel function, which maps the original inputs into a higher-dimensional space. This enables the K-ELM to be a more stable and accurate classifier. It has advantages for LUCC, especially for a region that contains similar and complex kinds of objects.
The classification results reveal the distribution characteristics of objects and provide their corresponding area size. Since the spatio-temporal analysis is based on the classification results, the accuracy of those results has a significant effect on the change detection.
With the rapid development of Chongming Dongtan the building land (west of the dike) grew quickly. Meanwhile, there was a sustained transformation between green land and farmland, which increased the landscape diversity. During the periods of new dike building, the area of reclamation increased as well. Generally, human activities to the west of the dike have a great influence on the ecosystem. In the wetland ecosystem, the bare flat, which held the highest proportion before 1995, decreased rapidly as a large amount of vegetation was cultivated. Additionally, the area of bare flat is also affected by the tide. For instance, the rapid increase in the bare flat in 2003 was mainly due to the low tide. After Spartina alterniflora was introduced into Dongtan in 1995, the existence of local vegetation (Phragmites australis and Scirpus mariqueter) became seriously threatened. Thus, some measures were taken to manage the invasion of Spartina alterniflora. Meanwhile, the Chongming Dongtan Nature Reserve was established in 1998 and work began on it one year later, which was beneficial for improving the health of Chongming Dongtan. All these efforts are significant in keeping the ecological balance and raising the overall health level.
The spatio-temporal analysis, therefore, is advantageous for learning about the historical changes to Chongming Dongtan and identifying the rules of its evolution. To some extent, it could provide useful information for constructing and managing Chongming Dongtan.
7. Conclusions
In this study a novel method, which mainly takes advantage of the kernel extreme learning machine (K-ELM) algorithm, was applied to distinguish between different objects and extract their distribution information over the Dongtan wetland located on Chongming Island, Shanghai, China. In the K-ELM the activation function is replaced by a kernel function, so the random feature mapping of activation functions and input connection weights is not necessary. Thus, the K-ELM is more stable and effective than a conventional ELM. In addition, the experimental results demonstrate that the K-ELM, whose overall accuracy is 86.07%, has a higher classification accuracy than the other three classification methods (MLC, SVM, and ELM).
Additionally, another novelty of this study is the dynamic spatio-temporal analysis of the Dongtan wetland from 1986 to 2013, which is based on the classification results. The statistical information of the ten classes in the Dongtan wetland, in terms of area, LUCC dynamic degree, and change detection visualization, was analyzed in detail. Generally, dike building has a great influence on changes in the wetland region. These quantitative results not only help in assessing the subsequent ecosystems and supporting their management effectively, but also support more reliable and scientific decisions regarding the harmonious development of the ecological environment, regional management, urban planning, etc.
In the long-term, dynamic monitoring and analysis can play a very important role in the Dongtan wetland’s protection and rational use. It could provide a reliable scientific basis for rational planning, development, ecological balance, and regional sustainability of the wetland area. According to the results of the remote sensing monitoring and spatio-temporal quantitative analysis the government could obtain a correct evaluation in order to execute ecological regional planning for the Dongtan zone.
Despite its success our study leads to some issues that need to be improved in future work. Firstly, as the spatial resolution of TM images is 30 m, there is a large number of mixed pixels in the image which can cause classification errors and cannot be reclassified, except when adding extra information. Therefore, some high spatial resolution satellite images, such as WorldView-2 or QuickBird, should be utilized. In addition, since the training dataset was not very large in our experiment, we did not consider the rare class issue which is, however, very important for classifier training. Finally, this study is mainly based on remotely-sensed images. More multi-source data, like environmental survey data and economics data, should be applied jointly to improve the assessment.
In addition, a monitoring network system should be established for future studies, which involves monitoring environmental qualities including land use, water quality, atmospheric environment, acoustic environment, and solid quality. It would, furthermore, involve aquatic and wetland ecosystems in order to realize comprehensive monitoring and evaluation.