1. Introduction
Extreme rainfall analysis is crucial for climate modeling, floodplain management and hazard assessment [
1,
2]. As Earth’s climate changes, there is a significant impact of extreme weather effects around the world [
3], highlighting the need for robust methods to assess their impacts. Among these tools, probabilistic methods based on Extreme Value Theory (EVT) play a central role in predicting such events [
4]. Currently, two main EVT-based approaches are widely used [
5]: (a) the block maxima (BM) method and (b) the point process (PP) method, also known as peaks-over-threshold (POT). The BM method utilizes maximum values from fixed data blocks, usually annual, while the PP method analyzes excesses above a predefined high threshold within continuous time series data [
5].
Although convenient and straightforward, the BM method has certain limitations, namely its inability to include information on precipitation’s seasonal cycles and sensitivity to parameter estimation errors. For instance, Katz et al. [
4] illustrated how the exclusion of an extreme point can disproportionately affect BM parameter estimates. A comprehensive review of these methods, including parameter estimation and model selection, is provided by Nerantzaki and Papalexiou [
6], and a review of recent developments in software implementations is provided by Bezlile et al. [
7].
In 2023, new parameters for ombrian curves in Greece were developed as part of the National Flood Risk Management Plans (NFRMP), aligning with the EU Directive 2007/60/EC [
8]. These parameters were calculated using the BM method without incorporating continuous precipitation data, limiting their ability to estimate extreme values.
In this study, uninterrupted daily rainfall records ranging over a century (1901–2023) from the National Observatory of Athens meteorological station in Thiseion are analyzed. This research examines the existence of trends in annual maxima and seasonality in daily precipitation values. An appropriate high threshold is selected and two different PP models are fitted using the maximum likelihood method to assess whether incorporating seasonality is statistically significant. Finally, the results are compared with those from the NFRMP. This study aims to address gaps in the NFRMP methodology by improving the understanding of non-stationarity in extreme rainfall analysis. It emphasizes the uncertainties of predictions and models, contributing to a more effective revision of the NFRMP.
2. Materials and Methods
2.1. Data
Continuous daily rainfall records from 1901 to 2023, obtained from the National Observatory of Athens meteorological station at Thiseion (latitude: 37.972° N; longitude: 23.717° E; altitude: 107 m above mean sea level), were used in the analysis. Notably, the station has not been relocated over the years. Daily precipitation values do not show apparent long-term trends over time. Also, no large-scale clustering patterns were found, including neither prolonged periods of intense rainfall nor diminished rainfall that could indicate temporal dependence (
Figure 1).
2.2. Non-Parametric Trend Analysis
The presence of a monotonic trend in the annual maxima series of daily precipitation was examined using Kendall’s Tau rank correlation test as implemented in the R language for statistical computing and graphics [
9]. Kendall’s Tau measures the strength of the monotonic relationship between two variables, is resistant to the effect of outliers and is a classic statistical method applied to hydrological data [
10]. Tau is expressed as
, where is
S is
where
is the annual maxima for a period of n years, and
. In the presence of ties, the formula for
is more complicated [
11]. The
p-value of Tau under the null hypothesis of no association is computed, in the case of no ties, using the algorithm AS71, given by Best and Gipps [
12]. When ties are present, a normal approximation, with continuity correction, is used by taking
normally distributed with mean zero and variance, as given by Kendall [
11].
2.3. Extreme Value Distribution Functions
As mentioned, the two main approaches based on Extreme Value Theory for analyzing extremes are the BM and PP methods. The Generalized Extreme Value (GEV) distribution is used to fit the BM:
where
,
is the location parameter,
is the scale parameter and
is the shape parameter [
13].
The generalized Pareto (GP) distribution is used to fit excesses over a high threshold:
where
is the high threshold,
,
is the scale parameter (depends on
) and
is the shape parameter.
2.4. Non-Stationarity of Extremes
If the data are believed to be non-stationary, it is possible to integrate this information into the parameters of the distribution functions (Equations (2) and (3)) using a regression-like approach [
4,
14]. For example, in
the location parameter
varies with time
, where
,
and
are the regression coefficients.
2.5. Point Process Approach
In the presence of continuous time series data, methods such as the PP approach, which incorporate all available information, are more effective because they (a) utilize both the frequency and magnitude of the threshold exceedances, (b) account for temporal patterns such as seasonality and (c) provide a comprehensive statistical framework for modeling extreme events while being less sensitive to parameter estimation errors [
13].
Modeling extreme values using a PP approach was introduced probabilistically by Leadbetter et al. [
15,
16] and as a statistical method by Smith [
17]. A PP model consists of two components: (a) a Poisson process, which models the exceedance of a high threshold, and (b) the generalized Pareto distribution, which describes the excesses above the threshold. In brief, following Coles [
13] and Katz [
4], if a process is stationary and the data above a threshold do not cluster, the limiting form of the process is non-homogeneous Poisson with intensity measure
. Given a set
,
is:
where the
,
and
parameters have the same meanings as in Equation (2). Maximum likelihood estimation (MLE) can be used to estimate the parameters of a PP by optimizing the log-likelihood function using numerical methods [
14]:
where
is the number of years in the data, so that the parameters represent the annual maxima. Similarly, Equation (6) uses excesses, where the parameters are in terms as those in Equation (2). More details about the method can be found in Coles (Chapter 7 in [
13]).
2.6. Selection of High Threshold
Before fitting the parameters of a PP, an appropriate threshold must be selected. This choice involves a trade-off between low variance (a lower threshold provides more data) and reduced bias (a higher threshold yields less biased estimates). The ideal threshold should produce parameter estimates that are consistent with those obtained from any higher threshold, within uncertainty bounds (e.g., confidence intervals) [
13,
14].
2.7. Statistical Test of Nested Models
To determine whether adding covariates to the parameters in the aforementioned regression-like manner improves the model, the likelihood ratio test can be applied [
13,
18]. Let
be the negative log-likelihood of the base model
and
be that of the nested model
. The likelihood ratio statistic is then given by:
When testing the null hypothesis , the significance level can be determined using the quantile of the distribution, with the degrees of freedom equal to the difference in the number of model parameters. The null hypothesis is rejected when .
3. Results and Discussion
The daily precipitation values per month were analyzed using four key central moments, namely the mean, standard deviation, skewness and kurtosis, as well as additional statistical properties such as the median and coefficient of variation (
Table 1). These analyses aimed to identify seasonality in the data. The results show that there is a clear pattern of higher precipitation during the months from October to March. Notably, the highest recorded value in September 1950 was 143 mm. Positive skewness indicates that the data are asymmetric, while positive kurtosis indicates a heavy-tailed distribution.
Annual maxima also did not reveal any apparent long-term trends over time (
Figure 2). Kendall’s Tau rank correlation test result indicates that the null hypothesis that annual maxima do not change over time could not be rejected for a significance level α = 5% (
).
A high threshold of 10 mm was selected for the PP model by examining the stability of the parameter estimates across a range of values. Then, MLE was employed on 1509 rainfall data points that fell over the 10 mm threshold. Two different models were used: (a) a stationary model and (b) one that incorporates the observed seasonality into the location
and shape
parameters of the PP model:
where the day of the year is
The logarithm is used in order to keep the scale σ values positive.
A likelihood ratio test for the two nested models confirmed that incorporating seasonality is statistically significant (), as expected.
Figure 3 depicts the effective design value, with 95% confidence intervals, for a 100-year return period (probability
p = 0.01) from the non-stationary PP model (b). In this way, a general sense of the annual cycle in extreme precipitation is given for each day of the year. The rainfall records align with the effective 100-year design values, revealing the seasonal pattern, with higher rainfall extremes occurring in winter compared to summer.
Finally, the results from the PP method were compared to those from the NFRMP (
Table 2). The stationary PP model’s estimates for various return periods were consistent with the NFRMP values for the upper 95% confidence interval (CI). Notably, the non-stationary PP model, which accounted for seasonality, produced estimates up to 22% higher for the upper 95% CI. It should be noted that the reported CIs were calculated using a normal approximation; however, the use of profile likelihoods would allow skewed intervals and produce even higher values for the upper CI bound [
13,
20].
4. Conclusions
The results of this study indicate that incorporating seasonality (non-stationarity) into probabilistic extreme analysis of rainfall leads to a significant difference in the rainfall design values. This is because statistical methods, such as the point process approach, can more precisely account for uncertainties involved in modeling and prediction. As a result, these means produce more accurate assessments of future extremes that are less likely to be contradicted by observed hydrological events, like the recent Storm Daniel in Greece. Overall, our findings underscore the need to revise the methodology currently employed in the National Flood Risk Management Plans. Such a revision could have a substantial impact on flood risk assessments and management strategies in Greece.
However, some limitations should be acknowledged. To begin with, this study focuses on a single long-term rainfall record, and extending this approach to regional datasets could reveal spatial dependence in extreme rainfall, which is essential for nation-wide applications. Moreover, the point process models require the careful selection of thresholds and covariates, introducing potential uncertainties in parameter estimation. Finally, the impacts of climate change on extreme rainfall remain a modeling challenge that should be incorporated.
Future research should focus on integrating non-stationary regional models with dynamic climatic projections. Incorporating these methodologies into flood risk management plans will require collaboration among researchers, policymakers and stakeholders to address technical, economical and regulatory challenges.
Author Contributions
Conceptualization, methodology, K.V.; writing—original draft preparation, K.V.; writing—review and editing, A.L. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Data Availability Statement
The original data presented in the study are openly available under license CC-BY 4.0 at
https://climpact.gr/ (accessed on 14 January 2025).
Acknowledgments
The data importing, analysis and presentation were carried out using the open source R language for statistical computing and graphics [
9] using the packages hydroscoper [
21], hyetor [
22], eXtremes [
14] and ggplot2 [
23].
Conflicts of Interest
The authors declare no conflicts of interest.
References
- Chow, V.T.; Maidment, D.R.; Larry, W. Applied Hydrology; McGraw-Hill, Inc.: New York, NY, USA, 1988. [Google Scholar]
- Maidment, D.R. Handbook of Hydrology; McGraw-Hill: New York, NY, USA, 1992. [Google Scholar]
- Intergovernmental Panel On Climate Change (IPCC). Climate Change 2021—The Physical Science Basis: Working Group I Contribution to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change, 1st ed.; Cambridge University Press: Cambridge, UK, 2023; ISBN 978-1-00-915789-6. [Google Scholar]
- Katz, R.W.; Parlange, M.B.; Naveau, P. Statistics of Extremes in Hydrology. Adv. Water Resour. 2002, 25, 1287–1304. [Google Scholar] [CrossRef]
- Bücher, A.; Zhou, C. A horse race between the block maxima method and the peak–over–threshold approach. Stat. Sci. 2021, 36, 360–378. [Google Scholar] [CrossRef]
- Nerantzaki, S.D.; Papalexiou, S.M. Assessing Extremes in Hydroclimatology: A Review on Probabilistic Methods. J. Hydrol. 2022, 605, 127302. [Google Scholar] [CrossRef]
- Belzile, L.R.; Dutang, C.; Northrop, P.J.; Opitz, T. A Modeler’s Guide to Extreme Value Software. Extremes 2023, 26, 595–638. [Google Scholar] [CrossRef]
- Koutsoyiannis, D.; Iliopoulou, T.; Koukouvinos, A.; Malamos, N.; Mamassis, N.; Dimitriadis, P.; Tepetidis, N.; Markantonis, D. Production of Maps with Updated Parameters of the Ombrian Curves at Country Level (Implementation of the EU Directive 2007/60/EC in Greece); Technical Report; Ministry of Ennvironment and Energy: Athens, Greece, 2023; p. 230. [Google Scholar]
- R Core Team. R: A Language and Environment for Statistical Computing; Foundation for Statistical Computing: Vienna, Austria, 2024. [Google Scholar]
- Helsel, D.R.; Hirsch, R.M.; Ryberg, K.R.; Archfield, S.A.; Gilroy, E.J. Statistical Methods in Water Resources; U.S. Geological Survey: Reston, VA, USA, 2020. [Google Scholar]
- Kendall, M.G. Rank Correlation Methods; C. Griffin: Salisbury South, Australia, 1948. [Google Scholar]
- Best, D.J.; Gipps, P.G. Algorithm AS 71: The Upper Tail Probabilities of Kendall’s Tau. J. R. Stat. Soc. Ser. C 1974, 23, 98–100. [Google Scholar] [CrossRef]
- Coles, S. An Introduction to Statistical Modeling of Extreme Values; Springer Series in Statistics; Springer: London, UK, 2001; ISBN 978-1-84996-874-4. [Google Scholar]
- Gilleland, E.; Katz, R.W. extRemes 2.0: An Extreme Value Analysis Package in R. J. Stat. Soft. 2016, 72, 1–39. [Google Scholar] [CrossRef]
- Leadbetter, M.R.; Lindgren, G.; Rootzén, H. Extremes and Related Properties of Random Sequences and Processes; Springer Science & Business Media: New York, NY, USA, 1983; ISBN 978-1-4612-5451-5. [Google Scholar]
- Resnick, S.I. Extreme Values, Regular Variation and Point Processes; Springer Series in Operations Research and Financial Engineering; Springer: New York, NY, USA, 1987; ISBN 978-0-387-75952-4. [Google Scholar]
- Smith, R.L. Extreme Value Analysis of Environmental Time Series: An Application to Trend Detection in Ground-Level Ozone. Stat. Sci. 1989, 4, 367–377. [Google Scholar]
- Thomas, M. Statistical Analysis of Extreme Values: With Applications to Insurance, Finance, Hydrology, and Other Fields; Birkhäuser Verlag: Basel, Switzerland, 2001. [Google Scholar]
- Cleveland, W.S.; Grosse, E.; Shyu, W.M. Local Regression Models. In Statistical Models in S; Routledge: London, UK, 2017; pp. 309–376. [Google Scholar]
- Coles, S.; Pericchi, L.R.; Sisson, S. A Fully Probabilistic Approach to Extreme Rainfall Modeling. J. Hydrol. 2003, 273, 35–50. [Google Scholar] [CrossRef]
- Vantas, K. Hydroscoper: R Interface to the Greek National Data Bank for Hydrological and Meteorological Information. J. Open Source Softw. 2018, 3, 625. [Google Scholar] [CrossRef]
- Vantas, K. Hyetor: R Package to Analyze Fixed Interval Precipitation Time Series. 2020. Available online: https://github.com/kvantas/hyetor (accessed on 14 January 2025).
- Wickham, H. Ggplot2; Use R! Springer International Publishing: Cham, Switzerland, 2016; ISBN 978-3-319-24275-0. [Google Scholar]
| Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).