Artificial intelligence models for prediction of monthly rainfall without climatic data for meteorological stations in Ethiopia

Abebe, Wondmagegn Taye; Endalie, Demeke

doi:10.1186/s40537-022-00683-3

Research
Open access
Published: 03 January 2023

Artificial intelligence models for prediction of monthly rainfall without climatic data for meteorological stations in Ethiopia

Wondmagegn Taye Abebe¹ &
Demeke Endalie²

Journal of Big Data volume 10, Article number: 2 (2023) Cite this article

5143 Accesses
10 Citations
Metrics details

Abstract

Global climate change is affecting water resources and other aspects of life in many countries. Rainfall is the most significant climate element affecting the livelihood and well-being of the majority of Ethiopians. Rainfall variability has a great impact on agricultural production, water supply, transportation, the environment, and urban planning. Because all agricultural activities and subsequent national crop production hinge on the amount and distribution of rainfall, accurate monthly and seasonal predictions of this rainfall are vital for agricultural planning. Rainfall prediction is also useful for governmental, non-governmental, and private agencies in making long-term decisions and planning in numerous areas such as farming, early warning of potential hazards, drought mitigation, disaster prevention, and insurance policy. Artificial Intelligence (AI) has been widely used in almost every area, and rainfall prediction is one of them. In this study, we attempt to investigate the use of AI-based models to predict monthly rainfall at 92 Ethiopian meteorological stations. The applicability of Artificial Neural Networks (ANNs) and Adaptive Neuro-Fuzzy Inference System (ANFIS) models in predicting long-term monthly precipitation was investigated using geographical and periodicity component (longitude, latitude, and altitude) data collected from 2011 to 2021. The experimental results reveal that the ANFIS model outperforms the ANN model in all assessment criteria across all testing stations. The Nash–Sutcliffe efficiency coefficients were 0.995 for ANFIS and 0.935 for ANN over testing stations.

Introduction

Global climate change is affecting water resources and several other aspects of life in many countries. Studies on climate change due to global warming have achieved high importance over the past few years [1, 2]. Ref. [3] stated that, global warming has recently attracted considerable attention from researchers, and it may cause changes in rainfall patterns, a rise in seawater level, and impacts on plants, wildlife, and humans. The magnitude of climatic variations, including temperature and rainfall, differs in several parts of the world [4]. Consequently, some arid regions are expected to experience droughts while others may be affected by heavy rainfall [5]. As a result, the prediction of climatic variables has considerably increased all over the world [6]. Climate predictions are fundamentally probabilistic statements about the future climate conditions on timescales ranging from seasons to decades or longer, and on spatial scales ranging from local to regional to global. Such predictions may provide some statistics on the seasonal or annual mean difference together with a degree of its probability of occurrence [7].

Rainfall is the most significant climate element affecting the livelihood and well-being of the majority of Ethiopians. The rainy season, or Kiremt, (from June to September), supports 85–95% of the country’s food production [8]. A study made by [9] on rainfall cyclicity over selected stations in Ethiopia indicates that there is a periodic tendency in the annual rainfall series. Several regions of Ethiopia receive rainfall throughout the year, but in some regions there is a seasonal and spatial variation of rainfall, which is the main factor affecting irrigation development.

Rainfall variability has a great impact on agricultural production, water supply, transportation, the environment, urban planning, and the lives of people in general. Its variability is the main cause of the frequent droughts and floods. Ethiopia is one of the countries whose economy is mainly dependent on rain-fed agriculture and also faces periodic floods and drought. Current climate variability is imposing a substantial challenge on Ethiopia [10]. Because all agricultural activities and subsequent national crop production hinge on the amount and distribution of rainfall, accurate monthly and seasonal predictions of this rainfall are vital for agricultural planning [8]. Rainfall is caused by a variety of meteorological conditions and the mathematical model for it is nonlinear. Due to this, accurate prediction of rainfall is challenging [11]. In addition to this, variation in rainfall timing and quantity makes rainfall prediction a challenging task for meteorological scientists [12], and many weather forecasters and experts devote themselves to improving the accuracy of their predictions [13].

Rainfall as a stochastic variable significantly differs in space and time concerning the general pattern of atmospheric circulation and local factors. Weather forecasts, especially rainfall prediction, pose complex tasks because they depend on numerous parameters to predict the dependent variables like temperature, humidity, wind speed, and direction, which change from time to time and their calculation varies with the geographical location along with their atmospheric variables [13]. Even though rainfall is a complex nonlinear phenomenon and its distribution varies in time and space, there are many studies in the literature showing that it is predictable [14, 15].

Predicting rainfall is an essential requirement to support water resources management, especially when it is related to climate change. Hence, climate change affects the pattern of rainfall and the prediction of rainfall with a good and accurate method is crucial to anticipate the impact [16]. Timely, actionable, and reliable climate prediction plays a vital role in decision making for individual users, users in a variety of sectors, and national development planning to help the management of development opportunities and risks, and for adaptation and mitigation. Demand for climate information for decision and policy making is growing as the private and public sectors increasingly recognize the significance and value of such information for building climate resilience and mitigating and adapting to climate change. Several users are looking for tailored and actionable climate information on a wide range of timescales, from past, current, and future climates. Their needs are broad, including long-term decisions and planning, early warning of potential hazards, and managing risks arising from climate variability and change [17]. Therefore, to get efficient and accurate results for forecasting rainfall, methods have been developed. Among them, a statistical model has been broadly used to make predictions of rainfall [18].

The attempt to predict statistics of rainfall several months in advance needs the predictor’s engagement with the theory of climate systems, consideration of trade-offs between physical-based dynamical methods and empirically grounded statistical methods, and selection of appropriate models that are generalizable and provide the best fit to recent observations [19].

AI has been widely used in almost every area, and weather prediction is one of them. Rainfall prediction is one of the most widely used research areas as many lives and property damage occur due to this. Intense rainfall has numerous impacts on society and on our daily life, from cultivation to disaster measures [20]. Weather prediction methods based on ANNs and ANFIS have been investigated intensively in recent years [6]. Different studies indicate that models based on AI can be applied for the identification of nonlinear systems in various fields of engineering, and can be used for rainfall prediction [21, 22]. Therefore, this study aims to apply ANN and ANFIS models to predict the monthly rainfall of meteorological stations in Ethiopia.

Weather predictions are identified as major areas requiring further progress in climate research and have thus been selected as one of the World Climate Research Program (WCRP) Grand Challenges [23]. Reliable predictions of climate variables are required on short and long time scales to reduce potential risks and damage that result from weather and climate extremes [24]. Precise and timely weather prediction is a major challenge for national meteorological agencies all over the world.

Weather prediction models are important for developing countries like Ethiopia, where most of the agriculture depends on rainfall. It is a major concern to identify any trends for weather parameters to deviate from their periodicity, which would affect the economy of the country. This fear has been aggravated due to the threat of global warming and the greenhouse effect. The impact of extreme weather phenomena on society is growing more and more costly, causing infrastructure damage, injury, and the loss of life. Therefore, there is a need for accurate weather forecasts today more than ever before, not only as a defense against hazardous weather but also in planning the day-to-day operations of private enterprises and governments, and by individuals to enhance their quality of life [25].

Rainfall prediction and early warning systems are the most important services for an agricultural country like Ethiopia [26]. Meteorological data is periodically gathered by the Ethiopian meteorology agency. However, due to the lack of appropriate data analysis tools, the available data cannot be practically used to alleviate the problems faced by planners, policymakers, and decision-makers. In Ethiopia, agriculture is the backbone of the economy. Irrigation facilities are still not so good in the country and most agriculture depends upon the rain [27]. A reliable rainfall prediction results in the occurrence of a dry period for a long time or heavy rain that affects both the crop yield as well as the economy of the country, so early rainfall prediction is very crucial. Rainfall forecasting models have been applied in many sectors, such as agriculture [28] and water resources management [29]. Rainfall prediction involves a combination of statistical models, observation, and knowledge of trends and patterns. Using these methods, reasonably accurate forecasts can be made. The main aim of this study is to apply AI-based models for the prediction of monthly rainfall in Ethiopia. The contribution of this study is summarized as follows:

1.
Develop a model to predict monthly rainfall of the study area using ANN and ANFIS.
2.
To evaluate model performance using different statistical evaluation criteria and observed values and select the best fit model.

This paper is organized as follows: related works are described in “Related works” Section. Our method of monthly rainfall prediction models is defined in “Methods and materials” Section. The experimental findings of the study are defined in “Results and discussions” Section. Finally, “Conclusion” Section contains the conclusion of the study.

Related works

Rainfall prediction is important in water resource engineering, management, and planning. There are difficulties in the accurate prediction of rainfall because of the complexity of physical processes, especially for long-term prediction. As a result, many efforts have been made to develop appropriate methods to predict rainfall, which can be classified into dynamical methods [30], statistical methods [31], soft computing methods [32], and numerical weather prediction methods [33].

Many researchers worldwide have attempted to accurately predict the spatial and temporal distribution of rainfall using various techniques such as simple linear regression and ANNs [18, 34]. However, the accuracy of prediction obtained by some of these techniques could not achieve a satisfactory level because of the complex and nonlinear nature of rainfall.

Several studies have indicated that they are still inaccurate methods to predict rainfall because weather data is non-linear [18]. However, in some cases, the statistical method is also able to produce good and accurate predictions. Along with the development of computing technology, many researchers are trying to make predictions using the ANN method in the field of hydrology.

In recent years, different researchers have been applying soft computing techniques such as ANNs, ANFIS, and Support Vector Machines (SVM) in different research areas [35, 36]. Among numerous soft computing methods, ANNs are promising tools based on their ability to model nonlinear processes. The ANN algorithm is an inductive, data-driven approach that can model both linear and non-linear systems without the need to make pre-assumptions. It is the most popular approach for rainfall prediction [37].

Different researchers apply ANNs to generating short-term predictions of rainfall. ANNs can be easily adapted to provide spatial predictions, areal average precipitation, or any other precipitation-related parameters that might be useful for hydrologic forecasting [38]. ANNs have been applied for quantitative precipitation forecast, predicting monthly rainfall and temperature using geographical information of stations [3], for prediction of rainfall time series coupled with data preprocessing methods [39], and for flood forecasting by comparing the performance of ANNs with Auto Regressive Moving Average (ARMA) and nearest neighbor methods [40]. The results indicated that the use of ANNs provided a substantial enhancement in flood forecasting accuracy.

Several previous works have applied soft-computing approaches to overcome prediction difficulty, mainly based on neural computation approaches. These approaches have several advantages over global numerical models: they are much simpler and faster to train; they can be applied to data from a specific point of measurement (a specific area in a river basin, for example); and their performance is competitive compared to global techniques [32].

More recently, ANNs have been applied to model and forecast precipitation in Athens, Greece [41], to forecast precipitation during the summer monsoon season in India using El Niño South Oscillation (ENSO) indices [42], and a neural computation approach is applied to the short-term forecasting of thunderstorm rainfall [43].

Data-driven modeling, which aims to apply AI techniques to extract the data patterns in historical variables to forecast future events, has proven to be a very popular and successful forecasting and prediction tool. Most recently, a massive development has been accomplished by several researchers in the field of hydrology; for instance, sediment transport modeling [44], water level [45], groundwater simulation [46], rainfall pattern analysis [47], and water irrigation prediction [48].

There are numerous categories of data-driven models, including ANNs and ANFISs. They are used for rainfall and temperature analyses, and these models may perform non-linear regression using various optimization techniques [5]. Data-driven models are simple to use and require less time and effort when compared to Global Circulation Models (GCMs) [49]. These models can efficiently address the non-linearity of systems due to their parallel architecture. ANN in particular is considered a modern technique to address signals in engineering fields and has also been used as a calculation tool to solve certain problems concerning water resources. Other types of data-driven models, such as fuzzy logic and genetic algorithms, cannot be used for long-term predictions due to their logical assumptions [5]. They can be used in a hybrid approach with ANN models, to optimize the weights and bias values during the iteration process. However, ANN and ANFIS are trained based on a database and have the ability to make long-term predictions.

The ANFIS model has a great ability to integrate the power of a fuzzy logic system with the numeric power of a neural system adaptive network in modeling numerous processes. As stated by [50], the advantage of fuzzy rule-base methods such as ANFIS is that they include all of the causes that are not included in the idealized model, whereas they exclude some of the causes that are taken into account in physically-based models [6].

Various identification methods, such as Grid Partitioning (GP) and Subtractive Clustering (SC), can be applied in the ANFIS model, and different researchers have applied this method for different purposes. Some of them are [51] compared ANFIS-GP, ANFIS-SC, and ANFIS with the Gustafson–Kessel Clustering (GKC) method for rainfall-discharge modeling; [52] introduced the hybrid model of ANFIS and wavelet transform for precipitation forecasting; [53] applied ANFIS-GP for investigation of the influence of lag time on the event-based rainfall-runoff process; [35] compared the performance of the ANFIS-GP and ANFIS-SC in streamflow prediction (the results from the studies indicated that the ANFIS-SC has slightly better accuracy than the ANFIS-GP in streamflow estimation); [54] applied ANFIS and Gene Expression Programming (GEP) with wavelet to forecast precipitation for two stations in Turkey; [55] applied ANNs and ANFIS-GP for spatial prediction of monthly air temperature using geographical inputs; [56] examined the performance of ARMA, ANNs, ANFIS, SVR, and genetic programming for forecasting monthly discharge time series. The best performance was achieved by ANFIS, SVM, and genetic programming during the training and validation period; [57] introduced a model that integrated SVM and a multi-objective genetic algorithm to predict hourly typhoon rainfall. The proposed model provided an accurate forecast of hourly rainfall and improved the long lead-time forecasts.

But many studies do not employ spatial modeling of long-term monthly rainfall predictions by ANNs and ANFIS, which uses geographical information of stations as an input. To the best knowledge of the authors, there is no published work in the literature that uses ANNs and ANFIS for predicting long-term monthly rainfall in the study area. This gave motivation to the present study. In this paper, the applicability of ANNs and ANFIS models is investigated for predicting long-term monthly rainfall using the geographical and periodicity components as input data.

Methods and materials

Artificial neural networks

The ANN is an engineering concept of knowledge in the field of AI designed by adopting the human nervous system. Wherein the main processing of the human nervous system is composed of the brain’s nerve cells as the basic unit of information processing. In the concept of ANN, the basic unit of information processing (neurons) serves to process information in parallel and immediately. Furthermore, the process of training the ANN has many types and uses, including perceptron, backpropagation, Self-Organizing Map (SOM), and delta.

ANN, as the most general AI method, is the collection of some neurons with a specific structure formed based on the relationships between neurons in different layers [6]. A neural network is a computing system made up of several simple and highly interconnected nodes or processing elements called neurons. The goal of neural networks is to map a set of input patterns onto a corresponding set of output patterns. The neural networks achieve this mapping by first training the neurons to be suitable for a given series of patterns. Then, the neural network applies this model to a new input pattern to predict the appropriate output pattern [58].

There are many kinds of neural networks depending on their structure, function, or training method. In this study, multiple-layer feed-forward neural networks are applied for rainfall prediction using geographical information and a periodicity component. The structure to be considered here includes one input layer, a hidden layer, and an output layer. For each layer, some neurons are related by weighted connections. The number of neurons for the input and output layers is equal to the numbers of input and output variables, but the number of neurons in the hidden layer will be selected by a trial-and-error procedure.

The weights and bias of connected neurons should be determined before applying the ANN model. In this matter, the model should be trained using a dataset. The backpropagation method is utilized for the training of networks and among various training algorithms, Levenberg–Marquardt, gradient descent, gradient descent with adaptive learning rate, gradient descent with momentum, adaptive learning rate, and scaled conjugate gradient are used. For all training algorithms, the tangent sigmoid transfer function is used in the hidden layers and the purelin transfer function in the output layer.

A typical neural network propagates information in the feedforward direction using Eq. 1.

$${b}_{j}=f(\sum_{i=1}^{n}{(w}_{ij}{a}_{i})-{T}_{j})$$

(1)

where a_i is the input vector, b_j is the output vector, w_ij is a weight factor between two nodes, T_j is the internal threshold, and f is a transfer function.

The backpropagation learning algorithm is based on a generalized delta-rule accelerated by a momentum term. To improve the performance of the neural network, both the weight factors and the internal threshold values are adjusted using Eqs. 2 and 3.

$${w}_{ij}^{new}={w}_{ij}^{old}+\eta .\sum_{p}{\delta }_{pj}{O}_{pi}+\alpha .\Delta {w}_{ij}^{old}$$

(2)

$${T}_{j}^{new}={T}_{j}^{old}+\eta .\sum_{p}{\delta }_{pj}+\alpha .\Delta {T}_{j}^{old}$$

(3)

where, $\eta$ is the learning rate, α is the momentum coefficient, $\Delta$w is the previous weight factor change, $\Delta$T is the previous threshold value change, O is the output, $\delta$ is the gradient-descent correction term, and p stands for the pattern.

Despite its theoretical simplicity, the neural network model has excellent performance for a wide range of applications and has developed into a powerful and versatile tool in recent years [58]. The ANN method was selected for this study because it is the most popular data-driven method in hydrological applications.

Adaptive Neuro-Fuzzy inference system

ANFIS is an effective AI model that combines neural networks and fuzzy logic capabilities [6]. ANFIS utilizes a feed-forward network for searching for fuzzy decision rules to perform well on a given problem. With considering a given input–output dataset, ANFIS creates a Fuzzy Inference System (FIS) for which Membership Function (MF) parameters are adjusted using either a back-propagation algorithm or a combination of a back-propagation algorithm and a least-squares method.

By using a first-order Takagi–Sugeno fuzzy model, Eqs. (4) and (5) present a typical rule set with two fuzzy if/then rules.

$$Rule\,1:if\,x\,is {A}_{1}\,and\,y\,is\,{B}_{1}\,then\,{f}_{1}={p}_{1}x+{q}_{1}y+{r}_{1}$$

(4)

$$Rule\,2:if\,x\,is\,{A}_{2}\,and\,y\,is\,{B}_{2}\,then\,{f}_{2}={p}_{2}x+{q}_{2}y+{r}_{2}$$

(5)

where, A₁(LOW), A₂(LOW) and B₁(HIGH), B₂(MEDIUM) are the MFs for inputs x(LAT) and y(LON), respectively, and p₁, q₁, r_1, and p₂, q₂, r₂ are the parameters of the output function. The system consists of five layers. The relationship between the input and output of each layer is described as follows:

Layer 1: Every node i in this layer is an adaptive node with a node output defined by;

$${O}_{i}^{1}={\mu }_{{A}_{i}}\left(LAT\right)$$

(6)

where, LAT is the input to the node; Ai is a fuzzy set associated with this node, identified by the shape of the MF in this node, and can be any appropriate function that is continuous and piecewise differentiable such as a Gaussian function. Supposing a Gaussian function as an MF, Ai can be computed as;

$${\mu }_{{A}_{i}}\left(LAT\right)=exp\left\lfloor\frac{1}{2}{\left\{\right(x-{c}_{i})/{\sigma }_{i}\}}^{2}\right\rfloor$$

(7)

where, {ci, σi} are parameter sets that are called to as premise (antecedent) parameters.

Layer 2: Every node in this layer is a fixed node, which multiplies the incoming signals and output product. For instance,

$${O}_{i}^{2}={w}_{i}={\mu }_{{A}_{i}}\left(LAT\right).{\mu }_{{B}_{i}}\left(LON\right), i=\mathrm{1,2},\dots$$

(8)

Each output node describes the firing strength of a rule.

Layer 3: Every node in this layer computes the ratio of the i^th rule’s firing strength to the sum of all rule’s firing strengths as follows:

$${O}_{i}^{3}=\overline{{w }_{i}}=\frac{{w}_{i}}{{w}_{1}+{w}_{2}}, i=\mathrm{1,2},\dots$$

(9)

The output of this layer is referred normalized firing strengths.

Layer 4: Node i in this layer calculate the contribution of the i^th rule towards the model output as described follows:

$${O}_{i}^{4}=\overline{{w }_{i}}{f}_{i}=\overline{{w }_{i}}({p}_{i}LAT+{q}_{i}LON+{r}_{i})$$

(10)

where, $\overline{{w }_{i}}$ is the output of layer 3 and {pi, qi, ri} is the parameter set that is called as consequent parameters.

Layer 5: This layer calculates the overall output as the summation of all incoming signals.

$${O}_{i}^{5}=\sum_{i}\overline{{w }_{i}}{f}_{i}=\frac{\sum_{i}{w}_{i}{f}_{i}}{\sum_{i}{w}_{i}}$$

(11)

The ANFIS method was also selected in this study because it is commonly used in hydrological applications.

Data collection

In this paper, the applicability of ANNs and ANFIS models was investigated for predicting long-term monthly rainfall using the geographical and periodicity components (longitude, latitude, and altitude) as input data. The rainfall data from 92 meteorological stations within the study area (Ethiopia) (Fig. 1) was collected from Climate Prediction Centre (CPC) and used for training, evaluating, and testing the performance of the models. The acquired data is global unified gauge-based rainfall data for 11 years (2011–2021). Sample geographical information about study areas used by this study is depicted as shown in Table 1.

Table 1 Sample geographical information and data format

Full size table

The performance of the trained network is verified by determining the error between the predicted value and the real value. Before training the neural network, all the data points for the patterns are normalized to be less than 1.

Evaluation metrics

For model development, different statistical evaluation criteria, including Root Mean Square Error (RMSE), Nash–Sutcliffe model efficient coefficient (E), Mean Absolute Error (MAE), Mean Absolute Percentage Error (MAPE), and coefficient of determination (R²) are applied to assess the model performance. The RMSE, E, MAE, MAPE, and R² are used to evaluate the performance of the model, and they can be calculated as follows.

$$RMSE=\sqrt{\frac{1}{n}\sum_{i=1}^{n}({R}_{p}-{R}_{o})}\dots \dots \dots \dots$$

(12)

$$E=1-\frac{\sum_{i=1}^{n}{\left({R}_{o}-{R}_{p}\right)}^{2}}{{\sum }_{i=1}^{n}{\left({R}_{o}-\overline{{R }_{o}}\right)}^{2}}\dots \dots \dots$$

(13)

$$MAE=\frac{\left|{R}_{o}-{R}_{p}\right|}{N}\left(100\right)\dots \dots \dots$$

(14)

$$MAPE=\frac{1}{n}\sum_{i=1}^{n}\left|\frac{{R}_{o}-{R}_{p}}{{R}_{o}}\right|\dots \dots \dots$$

(15)

$${R}^{2}=1-\frac{Unexpected\,variaton}{Total\,variation}\dots \dots \dots$$

(16)

where, n is the number of the dataset, $\overline{{R }_{o}}$ is the mean of observed monthly rainfall, and R_p and R_o denote the rainfall values generated by different models and observed monthly rainfall values, respectively.

Results and discussions

All experiments in this study were conducted with a device having the Windows 10 operating system, a core i7, and 16 GB of RAM. A grid search strategy was used to compute the optimal hyper-parameter values of both ANFIS and ANN. Within the input parameter values indicated in Table 2, the ANFIS model produced a better predictive outcome.

Table 2 The values of hyper-parameters used in ANFIS

Full size table

The hyper parameters for the ANN model were 0.5 dropout, sigmoid activation function, 100 epoch, Adam optimizer, and batch size of 16.

Model training and performance evaluation

We partitioned the dataset into three sections with an 80%, 10%, and 10% split ratio for training, validating, and testing the model, respectively. ANFIS and ANN training and validation losses using the dataset gathered from 92 weather stations in Ethiopia are depicted as shown in Figs. 2 and 3.

As demonstrated in Figs. 2 and 3, the ANFIS model learns the patterns of the input variables in order to predict rainfall. The ANFIS validation loss overlaps the training loss around the 3^rd epoch, whereas the ANN model validation loss overlaps the training loss around the 20^th epoch. As a result, the ANFIS model learns the pattern in the training data faster than the ANN model. We examine the performance of the two models on the testing dataset after training and validating them. The result is depicted as shown in Figs. 4 and 5 for ANFIS and ANN, respectively. The actual rainfall value used in the graphs below has been normalized to reduce the impact of outliers on the learning process.

Figure 4 shows how the rainfall value predicted with the ANFIS is related to the actual rainfall value. The predicted and actual rainfall values are remarkably similar, indicating that the ANFIS is approximately 100 percent accurate in its prediction. In the majority of the months used for testing, the rainfall value predicted by ANN is lower than the actual monthly rainfall, as shown in Fig. 5.

We also compare and contrast the two models’ performance using other evaluation metrics like mean absolute error, R-square, root mean square error, and mean absolute percentage error. Figures 6 and 7 illustrate a comparison of the two models’ results using those evaluation metrics.

The R² of the ANFIS is 0.9992, while the R² of the ANN is 0.9383, according to the graphs in Figs. 6 and 7. This means that when compared to ANN, ANFIS improves the R² of the prediction by 0.0609 values. MAE of 0.0028, RMSE of 0.0033, and MAPE of 0.28 are the errors generated by ANFIS, while MAE of 0.03, RMSE of 0.032, and MAPE of 2.8 are errors produced by ANN. In all of the evaluation criteria utilized in this study, the results show that the ANN model produces more errors than ANFIS. We also compare and contrast these two predictive models with the Nash–Sutcliffe model efficient coefficient (E). The value of E is 0.9954 and 0.935 for ANFIS and ANN, respectively. The value E for the ANFIS model is 0.9954, which is nearly equal to 1, which means the model is a perfect match between the model and the observed data.

We have tested the ANFIS and ANN models on the rainfall prediction for the nine stations from station ID 84 to 92. At these stations, ANFIS performs better than ANN. Therefore, we recommend researchers use the ANFIS model for applications that require rainfall prediction without climatic data.

Conclusion

The prediction accuracy of ANFIS and ANN models was investigated in the prediction of monthly rainfall using meteorological stations in Ethiopia. Longitude, latitude, and altitude data from 92 weather stations for 11 years running, from 2011 to 2021, were used for this study. We conducted an experiment using weather station data from Ethiopia’s 92 stations to evaluate and compare the ANFIS and ANN predictive models. We used different evaluation metrics to evaluate these models, and the experimental result shows the ANFIS model performs better than the ANN model. In general, the ANFIS model was found to be better than the other models in long-term monthly rainfall prediction. It gave the best prediction accuracy of the nine stations.

Availability of data and materials

The data used for this study will be made available upon request to the authors.

References

Krysanova V, et al. Intercomparison of regional-scale hydrological models and climate change impacts projected for 12 large river basins worldwide: a synthesis. Environ Res Lett. 2017. https://doi.org/10.1088/1748-9326/aa8359.
Article Google Scholar
Zhang H, Zhang LL, Li J, An RD, Deng Y. Climate and hydrological change characteristics and applicability of GLDAS data in the Yarlung Zangbo River basin, China. Water. 2018. https://doi.org/10.3390/w10030254.
Article Google Scholar
Bilgili M, Sahin B. Prediction of long-term monthly temperature and rainfall in Turkey. Energy Sources Part A Recover Util Environ Eff. 2010;32(1):60–71. https://doi.org/10.1080/15567030802467522.
Article Google Scholar
Fenta Mekonnen D, Disse M. Analyzing the future climate change of Upper Blue Nile River basin using statistical downscaling techniques. Hydrol Earth Syst Sci. 2018;22(4):2391–408. https://doi.org/10.5194/hess-22-2391-2018.
Article Google Scholar
Alotaibi K, Ghumman AR, Haider H, Ghazaw YM, Shafiquzzaman M. Future predictions of rainfall and temperature using GCM and ANN for arid regions: a case study for the Qassim region, Saudi Arabia. Water. 2018. https://doi.org/10.3390/w10091260.
Article Google Scholar
Kisi O, Sanikhani H. Prediction of long-term monthly precipitation using several soft computing methods without climatic data. Int J Climatol. 2015;35(14):4139–50. https://doi.org/10.1002/joc.4273.
Article Google Scholar
Li JP, Ding RQ. Weather forecasting: seasonal and interannual weather prediction. Encycl Atmos Sci Second Ed. 2015;6:303–12. https://doi.org/10.1016/B978-0-12-382225-3.00463-1.
Article Google Scholar
Segele Z. Ensemble-based empirical prediction of Ethiopian monthly-to-seasonal monsoon rainfall. 2015, pp. 1–6.
Admassu S. Rainfall variation and its effect on crop production in Ethiopia and its effect on crop production in Ethiopia of science in civil engineering. Addis Ababa University; 2004.
Takele R, Gebretsidik S. Prediction of Long-term pattern and its extreme event frequency of rainfall in Dire Dawa Region, Eastern Ethiopia. J Climatol Weather Forecast. 2015;03(01):1–15. https://doi.org/10.4172/2332-2594.1000130.
Article Google Scholar
Kashiwao T, Nakayama K, Ando S, Ikeda K, Lee M, Bahadori A. A neural network-based local rainfall prediction system using meteorological data on the Internet: a case study using data from the Japan Meteorological Agency. Appl Soft Comput J. 2017;56:317–30. https://doi.org/10.1016/j.asoc.2017.03.015.
Article Google Scholar
Aakash P, Kinjal M, Mithila S. Machine learning techniques for sentiment analysis: a review. Innov Inform Embed Commun Syst. 2017;8(3):27–32.
Google Scholar
Du J, Liu Y, Yu Y, Yan W. A prediction of precipitation data based on support vector machine and particle swarm optimization (PSO-SVM) algorithms. Algorithms. 2017. https://doi.org/10.3390/a10020057.
Article MATH Google Scholar
Azadi S, Sepaskhah AR. Annual precipitation forecast for west, southwest, and south provinces of Iran using artificial neural networks. Theor Appl Climatol. 2012;109(1–2):175–89. https://doi.org/10.1007/s00704-011-0575-9.
Article Google Scholar
Amiri MA, Conoscenti C, Mesgari MS. Improving the accuracy of rainfall prediction using a regionalization approach and neural networks. Kuwait J Sci. 2018;45(4):66–75.
Google Scholar
Manton MJ, et al. Trends in extreme daily rainfall and temprature in South East Asia and the South Pacific: 1961–1998. Int J Climatol. 2007;21:269–84. https://doi.org/10.1002/joc.610.
Article Google Scholar
WMO. Use of climate predictions to manage risks. 2016.
Mislan, Haviluddin, Hardwinarto S, Sumaryono, Aipassa M. Rainfall monthly prediction based on artificial neural network: a case study in Tenggarong station, East Kalimantan Indonesia. Procedia Comput Sci. 2015;59:142–51. https://doi.org/10.1016/j.procs.2015.07.528.
Article Google Scholar
Badr HS, Zaitchik BF, Guikema SD. Application of statistical models to the prediction of seasonal rainfall anomalies over the Sahel. J Appl Meteorol Climatol. 2014;53(3):614–36. https://doi.org/10.1175/JAMC-D-13-0181.1.
Article Google Scholar
Refonaa J, Lakshmi M, Abbas R, Raziullha M. Rainfall prediction using regression model. Int J Recent Technol Eng. 2019;8(2 Special Issue 3):543–6. https://doi.org/10.35940/ijrte.B1098.0782S319.
Article Google Scholar
Hung NQ, Babel MS, Weesakul S, Tripathi NK. An artificial neural network model for rainfall forecasting in Bangkok, Thailand. Hydrol Earth Syst Sci. 2009;13(8):1413–25. https://doi.org/10.5194/hess-13-1413-2009.
Article Google Scholar
Abhishek K, Kumar A, Ranjan R, Kumar S. A rainfall prediction model using artificial neural network. Proc. 2012 IEEE Control System Graduate Research Colloquium, ICSGRC 2012; 2012. p. 82–87. https://doi.org/10.1109/ICSGRC.2012.6287140.
Sillmann J, et al. Understanding, modeling and predicting weather and climate extremes: challenges and opportunities. Weather Clim Extrem. 2017;18(April):65–74. https://doi.org/10.1016/j.wace.2017.10.003.
Article Google Scholar
IPCC. Climate change: the physical science basis summary for policymakers. 2013.
Tibebu E. Application of data mining for weather forecasting. Addis Ababa University; 2015.
Endalie D, Haile G, Taye W. Deep learning model for daily rainfall prediction: case study of Jimma, Ethiopia. Water Supply. 2022;22(3):3448–61. https://doi.org/10.2166/WS.2021.391.
Article Google Scholar
Hirani D, Mishra N. A survey on rainfall prediction techniques. Int J Comput Appl. 2016;6(2):28–42. https://doi.org/10.3389/fnhum.2014.00445.
Article Google Scholar
Wei H, Li JL, Liang TG. Study on the estimation of precipitation resources for rainwater harvesting agriculture in semi-arid land of China. Agric Water Manag. 2005;71(1):33–45. https://doi.org/10.1016/j.agwat.2004.07.002.
Article Google Scholar
Kisi O, Cimen M. A wavelet-support vector machine conjunction model for monthly streamflow forecasting. J Hydrol. 2011;399(1–2):132–40. https://doi.org/10.1016/j.jhydrol.2010.12.041.
Article Google Scholar
Claußnitzer A, Névir P. Analysis of quantitative precipitation forecasts using the dynamic state index. Atmos Res. 2009;94(4):694–703. https://doi.org/10.1016/j.atmosres.2009.08.013.
Article Google Scholar
Chardon J, Hingray B, Favre AC. An adaptive two-stage analog/regression model for probabilistic prediction of small-scale precipitation in France. Hydrol Earth Syst Sci. 2018;22(1):265–86. https://doi.org/10.5194/hess-22-265-2018.
Article Google Scholar
Ortiz-García EG, Salcedo-Sanz S, Casanova-Mateo C. Accurate precipitation prediction with support vector classifiers: a study including novel predictive variables and observational data. Atmos Res. 2014;139:128–36. https://doi.org/10.1016/j.atmosres.2014.01.012.
Article Google Scholar
Park Y, Buizza R, Leutbecher M. TIGGE: preliminary results on comparing and combining ensembles. Q J R Meteorol Soc. 2008. https://doi.org/10.1002/qj.334.
Article Google Scholar
Dubey AD. Artificial neural network models for rainfall prediction in Pondicherry. Int J Comput Appl. 2015;120(3):30–5. https://doi.org/10.5120/21210-3910.
Article Google Scholar
Sanikhani H, Kisi O. River flow estimation and forecasting by using two different adaptive neuro-fuzzy approaches. Water Resour Manag. 2012;26(6):1715–29. https://doi.org/10.1007/s11269-012-9982-7.
Article Google Scholar
Taormina R, Chau KW, Sethi R. Artificial neural network simulation of hourly groundwater levels in a coastal aquifer system of the Venice lagoon. Eng Appl Artif Intell. 2012;25(8):1670–6. https://doi.org/10.1016/j.engappai.2012.02.009.
Article Google Scholar
Nayak PC, Sudheer KP, Rangan DM, Ramasastri KS. A neuro-fuzzy computing technique for modeling hydrological time series. J Hydrol. 2004;291(1–2):52–66. https://doi.org/10.1016/j.jhydrol.2003.12.010.
Article Google Scholar
Cheng C, Chau K, Sun Y, Lin J. Long-term prediction of discharges in Manwan reservoir using artificial neural network models. Lect Notes Comput Sci. 2005;3498(III):1040–5.
Article MATH Google Scholar
Wu CL, Chau KW, Li YS. Predicting monthly streamflow using data-driven models coupled with data-preprocessing techniques. Water Resour Res. 2009;45(8):1–23. https://doi.org/10.1029/2007WR006737.
Article Google Scholar
Toth E, Brath A, Montanari A. Comparison of short-term rainfall prediction models for real-time flood forecasting. J Hydrol. 2000;239(1–4):132–47. https://doi.org/10.1016/S0022-1694(00)00344-9.
Article Google Scholar
Nastos PT, Moustris KP, Larissi IK, Paliatsos AG. Rain intensity forecast using artificial neural networks in Athens, Greece. Atmos Res. 2013;119:153–60. https://doi.org/10.1016/j.atmosres.2011.07.020.
Article Google Scholar
Shukla RP, Tripathi KC, Pandey AC, Das IML. Prediction of Indian summer monsoon rainfall using Niño indices: a neural network approach. Atmos Res. 2011;102(1–2):99–109. https://doi.org/10.1016/j.atmosres.2011.06.013.
Article Google Scholar
Manzato A. Sounding-derived indices for neural network based short-term thunderstorm and rainfall forecasts. Atmos Res. 2007;83(2–4):349–65. https://doi.org/10.1016/j.atmosres.2005.10.021.
Article Google Scholar
Afan HA, Keshtegar B, Mohtar WHMW, El-Shafie A. Harmonize input selection for sediment transport prediction. J Hydrol. 2017;552:366–75. https://doi.org/10.1016/j.jhydrol.2017.07.008.
Article Google Scholar
Chang FJ, Chen PA, Lu YR, Huang E, Chang KY. Real-time multi-step-ahead water level forecasting by recurrent neural networks for urban flood control. J Hydrol. 2014;517:836–46. https://doi.org/10.1016/j.jhydrol.2014.06.013.
Article Google Scholar
Luo Q, Wu J, Yang Y, Qian J, Wu J. Multi-objective optimization of long-term groundwater monitoring network design using a probabilistic Pareto genetic algorithm under uncertainty. J Hydrol. 2016;534:352–63. https://doi.org/10.1016/j.jhydrol.2016.01.009.
Article Google Scholar
Chang FJ, Tsai MJ. A nonlinear spatio-temporal lumping of radar rainfall for modeling multi-step-ahead inflow forecasts by data-driven techniques. J Hydrol. 2016;535:256–69. https://doi.org/10.1016/j.jhydrol.2016.01.056.
Article Google Scholar
Zhang J, Li Y, Zhao Y, Hong Y. Wavelet-cointegration prediction of irrigation water in the irrigation district. J Hydrol. 2017;544(November):343–51. https://doi.org/10.1016/j.jhydrol.2016.11.040.
Article Google Scholar
Saymohammadi S, Zarafshani K, Tavakoli M, Mahdizadeh H, Amiri F. Prediction of climate change induced temperature & precipitation: the case of Iran. Sustainability. 2017. https://doi.org/10.3390/su9010146.
Article Google Scholar
Hundecha Y, Bardossy A, Werner HW. Development of a fuzzy logic-based rainfall-runoff model. Hydrol Sci J. 2001;46(3):363–76. https://doi.org/10.1080/02626660109492832.
Article Google Scholar
Vernieuwe H, Georgieva O, De Baets B, Pauwels VRN, Verhoest NEC, De Troch FP. Comparison of data-driven Takagi–Sugeno models of rainfall—discharge dynamics. J Hydrol. 2005;302(1–4):173–86. https://doi.org/10.1016/j.jhydrol.2004.07.001.
Article Google Scholar
Partal T, Kişi Ö. Wavelet and neuro-fuzzy conjunction model for precipitation forecasting. J Hydrol. 2007;342(1–2):199–212. https://doi.org/10.1016/j.jhydrol.2007.05.026.
Article Google Scholar
Talei A, Chua LHC. Influence of lag time on event-based rainfall-runoff modeling using the data driven approach. J Hydrol. 2012;438–439:223–33. https://doi.org/10.1016/j.jhydrol.2012.03.027.
Article Google Scholar
Kisi O, Shiri J. Precipitation forecasting using wavelet-genetic programming and wavelet-neuro-fuzzy conjunction models. Water Resour Manag. 2011;25(13):3135–52. https://doi.org/10.1007/s11269-011-9849-3.
Article Google Scholar
Kisi O, Shiri J. Prediction of long-term monthly air temperature using geographical inputs. Int J Climatol. 2014;34(1):179–86. https://doi.org/10.1002/joc.3676.
Article Google Scholar
Wang WC, Chau KW, Cheng CT, Qiu L. A comparison of performance of several artificial intelligence methods for forecasting monthly discharge time series. J Hydrol. 2009;374(3–4):294–306. https://doi.org/10.1016/j.jhydrol.2009.06.019.
Article Google Scholar
Lin GF, Jhong BC, Chang CC. Development of an effective data-driven model for hourly typhoon rainfall forecasting. J Hydrol. 2013;495:52–63. https://doi.org/10.1016/j.jhydrol.2013.04.050.
Article Google Scholar
Jung SK, McDonald K. Visual gene developer: a fully programmable bioinformatics software for synthetic gene optimization. BMC Bioinform. 2011;12(1):340. https://doi.org/10.1186/1471-2105-12-340.
Article Google Scholar

Download references

Acknowledgements

The academic staffs of Jimma Institute of Technology, Jimma University, Ethiopia conducted this study. The authors would like to express their gratitude to the institute for its assistance with various resources as well as its assistance during the research process.

Funding

This study received no outside funding.

Author information

Authors and Affiliations

Faculty of Civil and Environmental Engineering, Jimma Institute of Technology, Jimma, Ethiopia
Wondmagegn Taye Abebe
Faculty of Computing and Informatics, Jimma Institute of Technology, Jimma, Ethiopia
Demeke Endalie

Authors

Wondmagegn Taye Abebe
View author publications
You can also search for this author in PubMed Google Scholar
Demeke Endalie
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Both authors wrote and reviewed the main manuscript text. Both authors read and approved the final manuscript.

Corresponding author

Correspondence to Wondmagegn Taye Abebe.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they do not have any conflicts of interest with regard to this work.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Abebe, W.T., Endalie, D. Artificial intelligence models for prediction of monthly rainfall without climatic data for meteorological stations in Ethiopia. J Big Data 10, 2 (2023). https://doi.org/10.1186/s40537-022-00683-3

Download citation

Received: 21 June 2022
Accepted: 25 December 2022
Published: 03 January 2023
DOI: https://doi.org/10.1186/s40537-022-00683-3

Artificial intelligence models for prediction of monthly rainfall without climatic data for meteorological stations in Ethiopia

Abstract

Introduction

Related works

Methods and materials

Artificial neural networks

Adaptive Neuro-Fuzzy inference system

Data collection

Evaluation metrics

Results and discussions

Model training and performance evaluation

Conclusion

Availability of data and materials

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords