7.9
CiteScore
 
3.6
Impact Factor
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
Filter by Categories
ABUNDANCE ESTIMATION IN AN ARID ENVIRONMENT
Case Study
Correspondence
Corrigendum
Editorial
Full Length Article
Invited review
Letter to the Editor
Original Article
Research Article
Retraction notice
REVIEW
Review Article
SHORT COMMUNICATION
Short review
7.2
CiteScore
3.7
Impact Factor
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
Filter by Categories
ABUNDANCE ESTIMATION IN AN ARID ENVIRONMENT
Case Study
Correspondence
Corrigendum
Editorial
Full Length Article
Invited review
Letter to the Editor
Original Article
Research Article
Retraction notice
REVIEW
Review Article
SHORT COMMUNICATION
Short review
View/Download PDF

Translate this page into:

Research Article
2025
:37;
3782025
doi:
10.25259/JKSUS_378_2025

Development of extreme value models for precipitation forecasting using a nonstationary process for critical environment sustainable management

Faculty of Interdisciplinary Studies, Khon Kaen University, Nong Khai, 43000, Thailand
Department of Mathematics, Faculty of Science, Mahasarakham University, Maha Sarakham, 44150, Thailand

*Corresponding authorE-mail address: monchaya.c@msu.ac.th (M. Chiangpradit)

Licence
This is an open-access article distributed under the terms of the Creative Commons Attribution-Non Commercial-Share Alike 4.0 License, which allows others to remix, transform, and build upon the work non-commercially, as long as the author is credited and the new creations are licensed under the identical terms.

Abstract

In recent years, there has been a noticeable shift in precipitation patterns, characterized by more frequent and intense precipitation events. These changes suggest a departure from historical norms and highlight the growing impact of climate variability. Therefore, the objective of this research was to develop a daily maximum precipitation forecasting model for Thailand using the Extreme Value Theory (EVT) and the Generalized Extreme Value (GEV) distribution using eight non-stationary process models. This research utilized reanalysis data from the National Oceanic and Atmospheric Administration (NOAA) Physical Sciences Laboratory, which allowed for the analysis of daily accumulated precipitation data at a spatial scale of 50 km2, with 208 grid points. The results indicate that Model 2 ( μ ( t ) = β 0 + β t where σ and ξ are constants) was the appropriate model for three grid points, while Model 1 ( μ , σ , and ξ are constants) was the appropriate model for the remaining 205 grid points. Estimating the return levels for the return periods of 2, 5, 10, 25, 50, and 100 years showed the areas with the most extreme daily precipitation in Thailand. The results of this analysis can also be used to inform and support decision-making when conducting response planning in high-risk areas, which can be helpful for efficient resource planning and prevention actions.

Keywords

Extreme value theorem
GEV distribution
Non-stationary process
Precipitation

1. Introduction

Greenhouse gases in the Earth"s atmosphere have increased rapidly, especially since the Industrial Revolution. Over the past five decades, human activities have caused the concentration of carbon dioxide to surge from 280 ppm, a level that had remained stable for millions of years. In 2022, the concentration reached 421 ppm, which is 50% higher than pre-industrial levels. This has intensified global warming, leading to changes in the climate from its original state and triggering cascading effects on ecosystems and livelihoods, potentially causing a variety of impacts. Thailand is inevitably affected by global warming, and the impacts of "climate change" are becoming increasingly evident. The recurring severe flooding appears to be worsening over time. Statistics from the Meteorological Department of Thailand over the past 10 years indicate that more severe thunderstorms have affected many parts of the country.

Over the past 30 years, Thai communities have frequently experienced natural disasters. The severity of these natural disasters has destroyed houses and damaged farms. Heavy precipitation can cause floods and landslides, particularly in flood-prone areas. Precipitation forecasting is crucial to prepare for, prevent, and address the potential damage from such events. The forecast results provide information about future precipitation, which is beneficial for resource planning to enable effective preventive measures. Many researchers have developed models to forecast precipitation. For instance, Barman et al. (2021) applied Seasonal Autoregressive Integrated Moving Average (SARIMA) and Autoregressive Integrated Moving Average (ARIMA) models to analyze the monthly time series of precipitation in Assam from 1901 to 2017. In 2022, Zhao et al. studied an ARIMA model combined with a Radial Basis Function (RBF) network model to forecast monthly precipitation in Nanchang, Jiangxi Province. Zhao et al. (2023) proposed models combining Ensemble Empirical Mode Decomposition (EEMD), a Long Short-Term Memory (LSTM) neural network, and ARIMA for monthly precipitation forecasting using data from Luoyang spanning 1973 to 2021. Bora and Hazarika (2023) studied the forecasting patterns of precipitation distribution and proposed an ARIMA model to forecast precipitation in Assam and Meghalaya, India. Khan et al. (2023) used an ARIMA model to forecast precipitation in the Klang River Basin, Selangor. Chandran et al. (2023) studied 34 years of precipitation data (from 1976 to 2009) from 10 sub-basins of the Vaigai River in Tamil Nadu using an ARIMA model to identify the appropriate model for precipitation forecasting. Abd-Elhamid et al. (2024) used precipitation data from satellite and ground-based photographs and proposed an ARIMA model for forecasting precipitation in Syria. Sharma and Singh (2024) built an ARIMA model to forecast precipitation over Mandi district, Himachal Pradesh, India. Hendri et al. (2024) studied precipitation data exhibiting seasonal variations using two Holt-Winters models: the additive model and the multiplicative model.

The aforementioned studies focused on developing a model to forecast short-term precipitation. However, spatially accurate climate forecasting, particularly for the long term, remains both crucial and challenging, since it enables relevant agencies to plan effective long-term responses. Many researchers have applied extreme value theory (EVT) to develop models for forecasting long-term extreme values. Analyzing this data with extreme values is crucial for preparing for large-scale catastrophic events, helping to reduce potential losses if such events occur in the future. Therefore, accurate analysis of extreme value data is essential for obtaining precise results and for providing preliminary information for further management. Many researchers have used EVT to develop models for forecasting long-term extreme precipitation, including Chutiman et al. (2019), Busababodhin et al. (2021), Guayjarernpanishk et al. (2023), Sopipan (2023), Chaubey et al. (2023), Lymperi and Varouchakis (2024), and Acero et al. (2024). These studies used data collected from weather stations for analysis. Limitations in data acquisition from these stations may arise due to factors such as the distance between stations and data loss during collection. The data analysis methods in the aforementioned studies apply EVT under a stationary process.

A review of existing literature highlights critical gaps in precipitation forecasting. While many studies focus on short-term predictions or apply EVT for long-term extremes, they often assume a stationary process, which is inconsistent with inherently non-stationary climate data, and primarily rely on weather station data, which suffers from spatial sparsity and potential data loss. This research aims to bridge these gaps by developing a long-term extreme precipitation forecasting model that explicitly incorporates non-stationary processes within an EVT framework and leverages satellite data to overcome ground-based measurement limitations, thereby providing more accurate, spatially refined, and reliable information for climate change adaptation and disaster risk management in Thailand.

2. Materials and Methods

In this research, parameter estimation and long-term precipitation forecasting by employing nonstationary process modeling combined with EVT were investigated. This section presents details of the study area, data preprocessing steps, and the methodology, such as non-stationary process and return level.

2.1 Study area

Thailand is located between latitudes 5°37′N and 20°27′N, and longitudes 97°22′E and 105°37′E. The country is divided into six regions: Northern region (9 provinces, marked in gray), Central region (22 provinces, light brown), Northeastern region (20 provinces, red), Eastern region (7 provinces, yellow), Western region (5 provinces, brown), and Southern region (14 provinces, pink).

Northernmost point: Latitude 20°27"30”N, Mae Sai District, Chiang Rai Province.

Southernmost point: Latitude 5°37′N, Betong District, Yala Province.

Easternmost point: Longitude 105°37"30”E, Phibun Mangsahan District, Ubon Ratchathani Province.

Westernmost point: Longitude 97°22"E, Mae Lan Noi District, Mae Hong Son Province (Fig. 1).

Study area presenting the six regions of Thailand.
Fig. 1.
Study area presenting the six regions of Thailand.

2.2 Data preparation

Data was obtained from the NOAA Physical Sciences Laboratory satellite (https://psl.noaa.gov/), consisting of daily accumulated precipitation data as grid point data with a spatial resolution of 0.50 degrees, covering January 2012 to March 2024. It includes 208 grid points: 36 in the Northern region, 31 in the Central region, 63 in the Northeastern region, 18 in the Eastern region, 25 in the Western region, and 35 in the Southern region.

2.3 Methods

2.3.1 Generalized extreme value distribution

The Generalized Extreme Value (GEV) distribution, first introduced by A.F. Jenkinson in 1955, was used for analyzing extreme values. This method is appropriate for data collected over regular time intervals, such as weekly, monthly, quarterly, or yearly periods. The data selection process involves selecting the highest value within each time period, based on the following concept:

Let Xi (i = 1, 2, …, n) represent independent variables with the same distribution function, F ( x ) and define X(n) = max (X1, X2, …, Xn). In the GEV distribution, three parameters are considered: μ (Location parameter), σ (Scale parameter), and ξ (Shape parameter).

In 1978, Galambos developed the Cumulative Distribution Function (CDF) of the GEV distribution for < x < as follows:

(1)
G ( x ; μ , σ , ξ ) = exp 1 + ξ x μ σ 1 ξ

As defined on 1 + ξ x μ σ > 0 when < μ , ξ < and σ > 0

If ξ = 0 or ξ 0 , we obtain

(2)
G ( x ; μ , σ , ξ ) = exp exp x μ σ

From eq. (1), if ξ < 0 , the GEV distribution is called the Weibull Distribution, and if ξ > 0 , it is called the Fréchet Distribution. From eq. (2), when ξ 0 , it is called a Gumbel Distribution. The probability distribution function (PDF) of the GEV distribution is as follows:

(3)
g ( x ; μ , σ , ξ ) = 1 σ 1 + ξ x μ σ 1 ξ 1 exp 1 + ξ x μ σ 1 ξ ; ξ 0 1 σ exp x μ σ 1 exp exp x μ σ ; ξ = 0

2.3.2 The GEV distribution under a non-stationary process

Most meteorological, insurance, or economic data analyzed for extreme values are influenced by other variables, such as changes over time or underlying trends, making the data use a non-stationary process. This type of analysis differs from the analysis of data under a stationary process. The characteristics of data under a non-stationary process have been shown in Fig. 2.

Maximum daily exchange rate of Euro to Thai Baht from 2009 to 2023.
Fig. 2.
Maximum daily exchange rate of Euro to Thai Baht from 2009 to 2023.

The GEV distribution model under a non-stationary process, used to describe the distribution of Xt at time t (where t = 1, 2, .., m), is as follows (Guayjarernpanishk et al., 2021): X t GEV μ ( t ) , σ ( t ) , ξ ( t )

The possible models of non-stationary process are as follows:

Model 1 : μ , σ and ξ are constants.

Model 2 : μ ( t ) = β 0 + β 1 t , where σ and ξ are constants.

Model 3 : μ ( t ) = β 0 + β 1 t + β 2 t 2 , where σ and ξ are constants.

Model 4 : μ ( t ) = β 0 + β 1 exp ( β 2 t ) , where σ and ξ are constants.

Model 5 : σ ( t ) = exp ( β 0 + β 1 t ) , where μ and ξ are constants.

Model 6 : μ ( t ) = β 0 + β 1 t and σ ( t ) = exp ( β 0 + β 1 t ) , where ξ is a constant.

Model 7 : μ ( t ) = β 0 + β 1 t + β 2 t 2 and σ ( t ) = exp ( β 0 + β 1 t ) , where ξ is a constant.

Model 8 : μ ( t ) = β 0 + β 1 exp ( β 2 t ) and σ ( t ) = exp ( β 0 + β 1 t ) , where ξ is a constant.

If the result of the parameter analysis follows Model 1, the data used for analysis can be considered stationary. However, if the result follows other models, the data can be classified as non-stationary.

2.3.3 Parameter estimation of the GEV distribution using a non-stationary process

The steps for estimating the GEV distribution parameters using the maximum likelihood estimation (MLE) method are as follows (Guayjarernpanishk et al., 2021):

  1. Construct the likelihood function of the probability distribution function of the GEV distribution, which results in L ( β ) = t = 1 m g ( x t ; μ ( t ) , σ ( t ) , ξ ( t ) ) ,

where β is a vector of parameters β i and g ( x t ; μ ( t ) , σ ( t ) , ξ ( t ) ) is the probability distribution function of the GEV distribution, with μ ( t ) , σ ( t ) and ξ ( t ) representing the parameters at x t .

  • 2)

    Construct the log-likelihood function of the probability distribution function of the GEV distribution obtained in step 1 and equation (3) for t = 1, 2, ..., m as follows:

  • 2.1)

    In the case when ξ 0

    (4)
    l ( β ) = t = 1 m log σ ( t ) + 1 + 1 ξ ( t ) log 1 + ξ ( t ) x t μ ( t ) σ ( t ) + 1 + ξ ( t ) x t μ ( t ) σ ( t ) 1 ξ ( t )

Defined on 1 + ξ ( t ) x t μ ( t ) σ ( t ) > 0

  • 2.2)

    In the case when ξ = 0

    (5)
    l ( β ) = t = 1 m log σ ( t ) + x t μ ( t ) σ ( t ) + exp x μ σ

Examples of the GEV distribution under a non-stationary process:

When μ ( t ) = β 0 + β 1 exp ( β 2 t ) , σ ( t ) and ξ ( t ) are constants, we obtain: l μ ( t ) , σ , ξ = t = 1 m log σ + 1 + 1 ξ log 1 + ξ x t ( β 0 + β 1 e β 2 t ) σ + 1 + ξ x t ( β 0 + β 1 exp ( β 2 t ) ) σ 1 ξ

  • 3)

    Estimate the parameters μ ( t ) , σ ( t ) , and ξ ( t ) , ( μ ^ (t), σ ^ (t), ξ ^ (t)) ), by calculating a partial derivative of the functions obtained in step 2, as follows: l μ ( t ) μ ( t ) , σ ( t ) , ξ ( t ) = 0 , l σ ( t ) μ ( t ) , σ ( t ) , ξ ( t ) = 0 and l ξ ( t ) μ ( t ) , σ ( t ) , ξ ( t ) = 0

2.3.4 Model selection

The purpose of model selection is to identify the most appropriate model for the data being analyzed. Assessing model fit is crucial when multiple models result from the analysis. When the data follows a single stationary process, comparing different models to identify the most appropriate one is commonly done using Diagnostic Plots, including the Quantile Plot, Probability Plot, Density Plot, and Return Level Plot. Alternatively, the Akaike Information Criterion (AIC) or Bayesian Information Criterion (BIC) can be used.

When selecting an appropriate model for data under a stationary or non-stationary process, using the MLE method generally results in multiple related models being obtained. A model with this characteristic is referred to as a Nested Model. The statistic used for testing in this context is the Deviance Statistic, denoted as D (Guayjarernpanishk et al., 2021).

Define M 0 and M 1 as the initial model and the model to be compared, respectively, with the condition that M 0 M 1 . The following Hypothesis testing:

H0: The initial model is appropriate.

H1: The model to be compared is appropriate.

Using the statistic D, D is defined as follows:

(6)
D ( 0 , 1 ) = 2 { l 1 ( M 1 ) l 0 ( M 0 ) }

where l 0 ( M 0 ) and l 1 ( M 1 ) are the maximum log-likelihood values ​​of the models M 0 and M 1 , respectively.

From equation (6), we can see that the statistic D converges in distribution to a chi-square distribution with degrees of freedom equal to k ( χ k 2 ) , where k is the difference in the number of parameters between models M 1 and M 0 . Model M 0 will be rejected at the level of significance ( α ) if D > c α , where c α is the quantile at ( 1 α ) of χ k 2 . Therefore, if the model M 0 is rejected, it implies that the model M 1 better explains the variability in the data than the model M 0 . The test using statistic D is performed one pair at a time. Once an appropriate model is identified, it is tested against other models until the most appropriate one is found.

Therefore, based on the possible models of non-stationary processes above, when M i represents the i-th model, we obtain M 1 M 3 M 7 , M 1 M 2 M 6 , M 1 M 4 M 8 and M 1 M 5 . The sequential pairwise comparison steps are shown in Fig. 3. Additionally, the flowchart outlining the steps for pairwise comparison in the case M 1 M 2 M 6 has been presented in Fig. 4.

The sequential pairwise comparison steps.
Fig. 3.
The sequential pairwise comparison steps.
The steps for pairwise comparison in the case M 1 ⊂ M 2 ⊂ M 6 .
Fig. 4.
The steps for pairwise comparison in the case M 1 M 2 M 6 .

Once the most appropriate model is obtained, it will be used for return level estimation at time T years as follows (Guayjarernpanishk et al., 2021):

(7)
R ^ T = μ ^ (t) σ ^ (t) ξ ^ (t) 1 log 1 1 T ξ ^ (t) ;ξ0 μ ^ (t) σ ^ (t)log log 1 1 T ;ξ=0

3. Results

The data used in this research consisted of the daily maximum accumulated precipitation from 208 points, between January 2009 and March 2024. Descriptive statistics of the top two daily maximum accumulated precipitation in each region of Thailand have been shown in Table 1.

Table 1. Descriptive statistics of the top two daily maximum accumulated precipitation.
Regions Latitude Longitude Provinces Average daily precipitation Month/Year of the daily maximum precipitation Daily maximum precipitation
Northern 20.125 99.625 Chiang Rai 25.347 May 2022 93.656
18.625 100.625 Nan 20.773 Aug 2010 91.529
Central 16.625 99.125 Uthai Thani 24.845 Aug 2019 142.86
15.125 99.125 Uthai Thani 24.065 Aug 2019 127.12
Western 16.125 98.625 Tak 25.843 Aug 2019 153.39
15.625 98.625 Tak 24.838 Aug 2019 141.92
Eastern 12.625 101.625 Rayong 23.502 Sep 2011 80.513
14.125 101.625 Prachinburi 18.662 Sep 2013 76.849
Northeastern 15.125 105.625 Ubon Ratchathani 27.480 Feb 2022 123.93
15.625 105.625 Ubon Ratchathani 28.745 Sep 2013 113.1
Southern 8.625 99.625 Nakhon Si Thammarat 31.409 Jan 2012 167.4
6.125 102.125 Narathiwat 32.674 Dec 2009 113.86

As shown in Table 1, the points with the highest accumulated precipitation over the past 15 years, exceeding 100 mm, were located in the Central, Western, North-eastern, and Southern regions. The highest accumulated precipitation was recorded in Nakhon Si Thammarat Province, Southern region, in January 2012, while the second highest was in Tak Province, Western region, in August 2019.

In statistical modeling using EVT, the GEV distribution was applied by setting a monthly block time and analyzing the daily maximum accumulated precipitation data for each month. This analysis utilized the method for calculating extreme values under a non-stationary process across eight models, with data from 208 grid points. The results showed that Model 2 was the appropriate model for three grid points, while Model 1 was the appropriate model for the remaining 205 grid points. The estimated parameters for the appropriate model at the grid points with the top two daily maximum accumulated precipitation values in each region have been presented in Table 2, and the estimated parameter values for the appropriate model for Model 2 at all 3 grid points have been shown in Table 2.

Table 2. The appropriate model for daily maximum accumulated precipitation at the points where the most precipitation was recorded in each region.
Regions Latitude Longitude Model Provinces Estimated parameter values
μ σ ξ
Northern 20.125 99.625 1 Chiang Rai 16.173 15.282 0.017
18.625 100.625 1 Nan 13.218 12.597 0.018
Central 16.625 99.125 1 Uthai Thani 14.110 14.084 0.179
15.125 99.125 1 Uthai Thani 14.550 13.932 0.103
Western 16.125 98.625 1 Tak 13.528 14.488 0.260
15.625 98.625 1 Tak 14.206 14.224 0.166
Eastern 12.625 101.625 1 Rayong 17.466 12.156 -0.089
14.125 101.625 1 Prachinburi 12.704 10.448 -0.008
Northeastern 15.125 105.625 1 Ubon Ratchathani 17.432 14.409 0.111
15.625 105.625 1 Ubon Ratchathani 18.661 15.343 0.076
Southern 8.625 99.625 1 Nakhon Si Thammarat 20.945 13.962 0.147
6.125 102.125 1 Narathiwat 20.477 13.896 0.249

Once the appropriate model for each grid point was obtained, it was used to estimate return levels for time periods of 2, 5, 10, 25, and 50 years. The return levels of daily maximum precipitation in areas that previously experienced the highest precipitation in each region, according to Model 1 in Table 2, and those for Model 2 in Table 3.

Table 3. The appropriate model for Model 2.
Regions Latitude Longitude Provinces Estimated parameter values
μ 1 μ 2 σ ξ
Southern 5.625 101.125 Yala 17.357 0.028 9.149 0.0914
Eastern 13.625 101.625 Chachoengsao 14.007 -5.00 x10-5 10.603 -0.091
Northern 19.125 101.125 Nan 14.802 -7.620x10 -5 13.074 -0.052
Table 5. Return levels of daily maximum precipitation-based on Model 2.
Regions latitude Longitude Provinces Return levels (millimeters)
2 years 5 years 10 years 25 years 50 years
Southern 5.625 101.125 Yala 25.928 37.227 45.377 56.508 65.412
Eastern 13.625 101.625 Chachoengsao 17.819 28.865 35.577 43.430 48.833
Northern 19.125 101.125 Nan 19.535 33.653 42.553 53.313 60.959

Tables 4 and 5 present the return levels of daily maximum precipitation over a period of T years with a probability of 1/T. For example, a return level of 2 years indicates that within a 2-year period, daily maximum precipitation at the forecasted level is expected to occur at least once, with a probability of 1/2 or 0.5. The forecasted values from all 208 grid points for return levels of 2, 5, 10, 25, and 50 years were used to create contour graphs through GIS Kriging interpolation, with the results presented in Fig. 5.

Table 4. Return levels of daily maximum precipitation in areas with the highest recorded precipitation in each region, based on Model 1.
Regions Latitude Longitude Provinces Return levels (millimeters)
2 years 5 years 10 years 25 years 50 years
Northern 20.125 99.625 Chiang Rai 21.791 39.383 51.213 66.373 77.775
18.625 100.625 Nan 17.850 32.372 42.154 54.706 64.159
Central 16.625 99.125 Uthai Thani 19.445 38.335 53.119 74.866 93.552
15.125 99.125 Uthai Thani 19.754 37.143 49.820 67.300 81.412
Western 16.125 98.625 Tak 19.100 40.114 57.859 85.856 111.580
15.625 98.625 Tak 19.582 38.439 53.030 74.273 92.345
Eastern 12.625 101.625 Rayong 21.849 34.536 42.260 51.310 57.548
14.125 101.625 Prachinburi 16.527 28.283 36.010 45.707 52.855
Northeastern 15.125 105.625 Ubon Ratchathani 22.822 40.954 54.282 72.795 87.849
15.625 105.625 Ubon Ratchathani 24.364 43.039 56.319 74.221 88.360
Southern 8.625 99.625 Nakhon Si Thammarat 26.202 44.370 58.17 77.934 94.476
6.125 102.125 Narathiwat 25.809 45.737 62.382 88.381 112.039
Estimated values for the return levels of daily maximum precipitation in the following years: (a) 2 years; (b) 5 years; (c) 10 years; (d) 25 years, and (e) 50 years.
Fig. 5.
Estimated values for the return levels of daily maximum precipitation in the following years: (a) 2 years; (b) 5 years; (c) 10 years; (d) 25 years, and (e) 50 years.

Fig. 5 indicates that the southern region of Thailand experiences higher daily maximum precipitation compared to other regions. Provinces along the eastern coast, adjacent to the Gulf of Thailand, receive higher daily maximum precipitation than those along the western coast, adjacent to the Andaman Sea. The northeastern region bordering Laos records higher daily maximum precipitation than other parts of the country.

4. Discussion

After statistical modeling with EVT using the GEV distribution under a non-stationary process across eight models, Model 2 was found to be the appropriate model for 3 grid points, while Model 1 was the appropriate model for the remaining 205 grid points. The forecast of daily maximum precipitation recurrence also indicates that the southern region of Thailand is expected to experience higher daily maximum precipitation compared to other regions. This can be attributed to the influence of monsoons from three different directions. From October to January, the southern region is influenced by the Northeast Monsoon, which carries water vapor and moisture from the South China Sea and the Gulf of Thailand to the mainland. This causes heavy precipitation along the eastern coast before it reaches the western coast. In February, the Southeast Monsoon begins to carry water vapor, humidity, and heat from the equator through the Gulf of Thailand to the mainland. This monsoon affects the southern region from February to April, coinciding with the summer season, resulting in high temperatures and precipitation. When the rainy season arrives, the Southeast Monsoon shifts direction to become the Southwest Monsoon, which brings water vapor and moisture from the Indian Ocean and the Andaman Sea to the mainland. This monsoon affects the southern region from May to September, causing heavy precipitation to begin on the western coast before spreading to the eastern coast. Due to the influence of monsoons from all three directions, the southern region experiences heavy precipitation nearly year-round.

In the northeastern region, areas bordering Laos receive more precipitation than other parts of the region. This is due to the rainy season, which lasts from May to October each year and is influenced by the Southwest Monsoon. However, the Dong Phayayen and San Kamphaeng mountain ranges block the winds and precipitation from the southwest monsoon, resulting in lower precipitation in areas located behind the mountains. Consequently, provinces located in front of these mountain ranges and bordering Laos receive more precipitation than other areas in the region.

5. Conclusion

This study demonstrates that nonstationary GEV models, particularly Model 2, provide a robust framework for forecasting extreme daily precipitation in Thailand. The findings can support sustainable water resource planning and disaster risk management. For future research, the integration of the r-largest method with EVT will be investigated to expand the applicability of extreme value models in real-world hydrological forecasting.

Acknowledgments

This research was supported by the Fundamental Fund of Khon Kaen University and the National Science, Research and Innovation Fund (NSRF), and Mahasarakham University. The authors would like to thank the editor and the referees for their valuable feedback and insightful suggestions.

CRediT authorship contribution statement

Pannarat Guayjarernpanishk and Nipaporn Chutiman: Concept and design; Pannarat Guayjarernpanishk, Monchaya Chiangpradit, and Butsakorn Kong-ied: Methodology; Pannarat Guayjarernpanishk: Software development, formal analysis, and project administration; Monchaya Chiangpradit, Nipaporn Chutiman, and Butsakorn Kong-ied: Validation; Monchaya Chiangpradit and Nipaporn Chutiman: Investigation; Butsakorn Kong-ied: Resources and data curation; Nipaporn Chutiman and Pannarat Guayjarernpanishk: Manuscript preparation (original draft); Monchaya Chiangpradit and Butsakorn Kong-ied: Manuscript editing, review, and visualization; Pannarat Guayjarernpanishk and Nipaporn Chutiman: Supervision; Pannarat Guayjarernpanishk: Funding acquisition. All authors have read and agreed to the published version of the manuscript.

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Declaration of Generative AI and AI-assisted technologies in the writing process

The authors confirm that there was no use of artificial intelligence (AI)-assisted technology for assisting in the writing or editing of the manuscript and no images were manipulated using AI.

Funding

This research was supported by the Fundamental Fund of Khon Kaen University and the National Science, Research and Innovation Fund (NSRF).

References

  1. , , , , , . Rainfall forecasting in arid regions in response to climate change using ARIMA and remote sensing. Geomatics, Natural Hazards and Risk 2024:15. https://doi.org/10.1080/19475705.2024.2347414
    [Google Scholar]
  2. , , , . Return level estimation for extreme rainfall over the iberian peninsula: comparing methodologies. First International Electronic Conference on the Hydrological Cycle Sciforum.net 2017:4837.
    [Google Scholar]
  3. , , , , . Time series analysis of Assam rainfall using SARIMA and ARIMA. In: Smart Innovation, systems and technologies" ," Smart Computing Techniques and Applications Smart Innovation, Systems and Technologies" ," Smart Computing Techniques and Applications. Singapore: Springer Singapore; p. :357-364. https://doi.org/10.1007/978-981-16-0878-0_35
    [Google Scholar]
  4. , . Rainfall time series forecasting using ARIMA model. 2023 International conference on artificial intelligence and applications (ICAIA) alliance technology conference (ATCON-1) Bangalore, India, pp. 1-5. https://doi.org/10.1109/icaia57370.2023.10169493
  5. , , , , , . LH-moments of the wakeby distribution applied to extreme rainfall in Thailand.
  6. , , , , . Probability analysis and rainfall forecasting using ARIMA model. MAUSAM. 2023;74:1081-1092. https://doi.org/10.54302/mausam.v74i4.805
    [Google Scholar]
  7. , , . Changes in extremes rainfall events in present and future climate scenarios over the teesta river Basin, India. Sustainability. 2023;15:4668. https://doi.org/10.3390/su15054668
    [Google Scholar]
  8. , , , , , . Analysis extreme weather event of thailand with spatial modeling theory of extreme values: Case study in Mekong, Chi, Mun River Basin. Adv Applications Math Sci. 2019;19:129-138.
    [Google Scholar]
  9. , . The asymptotic theory of extreme order statistics 1978
  10. , , , . Climate forecasting models for precise management using extreme value theory. Civ Eng J. 2023;9:1753-1767. https://doi.org/10.28991/cej-2023-09-07-014
    [Google Scholar]
  11. , , , . Extreme value analysis: Non–stationary Process. j.kmutnb. 2021;32 https://doi.org/10.14416/j.kmutnb.2021.05.041
    [Google Scholar]
  12. , . Times series data analysis: The Holt-Winters model for rainfall prediction In West Java. App Sci Def. 2024;2:1-8. https://doi.org/10.58524/app.sci.def.v2i1.325
    [Google Scholar]
  13. . The frequency distribution of the annual maximum (or minimum) values of meteorological elements. Quart J Royal Meteoro Soc. 1955;81:158-171. https://doi.org/10.1002/qj.49708134804
    [Google Scholar]
  14. , , , , . Short-term and long-term rainfall forecasting using ARIMA model. Int J Environ Sci Dev. 2023;14:292-298. https://doi.org/10.18178/ijesd.2023.14.5.1447
    [Google Scholar]
  15. , . Modeling extreme precipitation data in a mining area. Math Geosci. 2024;56:1405-1437. https://doi.org/10.1007/s11004-023-10126-1
    [Google Scholar]
  16. , . ARIMA-Based forecasting of monthly rainfall in Mandi district, Himachal Pradesh. Water Supply. 2024;24:3226-3237. https://doi.org/10.2166/ws.2024.200
    [Google Scholar]
  17. . Application of extreme value theory for maximum rainfall at nakhon ratchasima province. Thai Journal of Mathematics, Special Issue: Annual Meeting in Mathematics. 2023;2022:41-53.
    [Google Scholar]
  18. , , . Rainfall study based on ARIMA-RBF combined model. J Phys Conf Ser. 2022;2294:012029. https://doi.org/10.1088/1742-6596/2294/1/012029
    [Google Scholar]
  19. , , . Monthly precipitation prediction in Luoyang city based on EEMD-LSTM-ARIMA model. Water Sci Technol. 2023;87:318-335. https://doi.org/10.2166/wst.2022.425
    [Google Scholar]
Show Sections