The potential of hybrid evolutionary fuzzy intelligence model for suspended sediment concentration prediction

The potential of hybrid evolutionary fuzzy intelligence model for suspended sediment concentration prediction

Catena 174 (2019) 11–23 Contents lists available at ScienceDirect Catena journal homepage: www.elsevier.com/locate/catena The potential of hybrid e...

2MB Sizes 1 Downloads 52 Views

Catena 174 (2019) 11–23

Contents lists available at ScienceDirect

Catena journal homepage: www.elsevier.com/locate/catena

The potential of hybrid evolutionary fuzzy intelligence model for suspended sediment concentration prediction Ozgur Kisia, Zaher Mundher Yaseenb, a b

T



Faculty of Natural Sciences and Engineering, Ilia State University, Tbilisi, Georgia Sustainable Developments in Civil Engineering Research Group, Faculty of Civil Engineering, Ton Duc Thang University, Ho Chi Minh City, Viet Nam

ARTICLE INFO

ABSTRACT

Keywords: Suspended sediment concentration Hybrid intelligence model River discharge Eel River basin

Providing a robust and reliable prediction model for suspended sediment concentration (SSC) is an essential task for several environmental and geomorphology prospective including water quality, river bed engineering sustainability, and aquatic habitats. In this research, a novel hybrid intelligence approach based on evolutionary fuzzy (EF) approach is developed to predict river suspended sediment concentration. To demonstrate the modeling application, one of the highly affected rivers located in the north-western part of California is selected as a case study (i.e., Eel River). Eel River is considered as one of the most polluted river due to the streamside land sliding, owing to the highly stochastic water river discharge. Thus, the predictive model is constructed using discharge information as it is the main trigger for the SSC amount. The prediction conducted on different locations of the stream (i.e., up-stream and down-stream stations). Three different well-established integrative fuzzy models are developed for the validation purpose including adaptive neuro-fuzzy inference system coupled with subtractive clustering (ANFIS-SC), grid partition (ANFIS-GP), and fuzzy c-means (ANFIS-FCM) models. The predictive models evaluated based on several numerical indicators and two-dimension graphical diagram (i.e., Taylor diagram) that vividly exhibits the observed and predicted values. The attained results evidenced the predictability of the EF model for the SSC over the other models. The discharge information provided an excellent input attributes for the predictive models. In summary, the discovered model showed an outstanding data-intelligence model for the environmental perspective and particularly for Eel River. The methodology is highly qualified to be implemented as a real-time prediction model that can provide a brilliant approach for the river engineering sustainability.

1. Introduction 1.1. Background and research significant Sediment transport in the river considered as one of the significant processes that affect various characteristics of the river system including water health and quality, river geography, channel navigability, and several other water engineering aspects (Sivakumar and Jayawardena, 2002). As a fact, understanding the exact amount of the transported sediment at a particular river is highly essential for hydraulic engineering perspective. This is owing to its importance to the submerged structure in the river and generally on water resources projects (Chang, 2008; Martinez et al., 2009). Nevertheless, the prediction of the suspended sediment is an extremely complicated phenomenon, this is because sediment process is influenced by multiple metrological and hydrological variables in a specific catchment (Frings



and Kleinhans, 2008). In order to comprehend the transported sediment amount through rivers that usually governed by several hydrological and morphological variables acting in two different scales spatial and temporal (Yaseen et al., 2015); river sediment transport can be modeled either by physical modeling which is required a several efforts and information or conceptually using intelligence models (Kisi, 2015). Intelligence models have been found an appeal in accordance to their advantages (e.g., simplicity, accurate models, less required information, and less time consuming) (Yaseen et al., 2015). 1.2. State-of-the-art Over the past, a few decades, the issue of the complicity in the determination of the dynamism, high non-linearity, and non-stationary of suspended sediment loads have been overcome through the

Corresponding author. E-mail address: [email protected] (Z.M. Yaseen).

https://doi.org/10.1016/j.catena.2018.10.047 Received 19 May 2018; Received in revised form 22 October 2018; Accepted 29 October 2018 0341-8162/ © 2018 Elsevier B.V. All rights reserved.

Catena 174 (2019) 11–23

O. Kisi, Z.M. Yaseen

sophistication of the computational and artificial intelligence (AI) models. This is due to the limitations indication of the empirical formulations to solve this kind of hydraulic engineering problem (Greimann et al., 1999; Li et al., 2010; Ribbe and Holloway, 2001; Ruessink, 2000; Zhong et al., 2011). Empirical formulations showed a noticeable drawback in the generalization implementation on the sediment transport simulation. This is because sediment transport phenomenon is incorporated with several variables in nature. The major advantages of AI models are their ability to handle the non-linear mechanism, operating large-scale data volumes and mimicking the human brain phenomenon in solving complex problems (Russell and Norvig, 1995). In water resource engineering, sediment transport computing is widely known as a major issue. The sediment load prediction relies not only on the discharge of water into rivers but also relies on other features of the river itself that are prone to seasonal changes (Stott, 2006). A significant increase has been noted in a number of data-based approaches used in hydrologic modeling, especially in the computation of the suspended sediments. The data-based modeling techniques through AI have shown the capability of solving and handling the complexity and the noise problem of data (Yaseen et al., 2016). In modeling sediment transport to be specific, various AI-based techniques have been employed. Among many approaches, couple models have shown noticeable progress in modeling sediment transport such as the traditional artificial neural network (Afan et al., 2014; Agarwal, 2009; Doğan et al., 2007; Huang et al., 2012; Nagy et al., 2002; Tfwala and Wang, 2016; Van Maanen et al., 2010), the theory of fuzzy logic (Bakhtyar et al., 2008; Dogan, 2005; Doğan et al., 2007; Kabiri-Samani et al., 2011; Mianaei and Keshavarzi, 2010), the application of support vector regression (Azamathulla et al., 2010; Buyukyildiz and Kumcu, 2017; Ebtehaj and Bonakdari, 2016; Batt, 2013; Kisi, 2012; Misra et al., 2009; Wei, 2009; Zounemat-Kermani et al., 2016), the employment of evolutionary computing (Altunkaynak, 2009; Aytek and Kişi, 2008; Jaiyeola, 2015; Kizhisseri et al., 2006; Kumar et al., 2014), and most recently the complementary model of wavelet-AI models (Ebtehaj et al., 2016; Goyal, 2014; Liu et al., 2013; Partal and Cigizoglu, 2008; Rajaee, 2011). Despite the extensive researches on the employment of soft computing models, scholars are still seeking for more robust, reliable and effective models that are able to mimic this complex stochastic problem of the suspended sediment transport.

At all stages of the process, from the initial deformation of sediment to all types of arrival sediment depositions, a multiple variability interacting on the sediment transport phenomena that initiate the complexity, non-linear and dynamic characteristics (Kleinhans and van Rijn, 2002; van Rijn, 1984). The literature had been reported several empirical formulations to comprehend the actual phenomena of sediment flux transport (Greimann et al., 1999; Li et al., 2010; Ribbe and Holloway, 2001; Ruessink, 2000); however, several conclusions evidenced the extreme complexity to candidate a universal formulation for sediment simulation (Fischer et al., 2017; Sivakumar and Wallender, 2005). Hence, exploring more reliable and accurate modeling strategy for this problem is always the motivation for engineers and designers (Alcayaga et al., 2018; Javernick et al., 2018). Nevertheless, the necessity of accurate sediment transport modeling is essential for wide range of engineering applications such as (Afan et al., 2016): i. ii. iii. iv. v.

River engineering sustainability. Design the dead storage for reservoir. Computing the degradation around the bridge piers. Channels design. Assessment of the environmental impact of water quality.

In the United States, the Eel River which drains the coast range of north-western California records the highest average suspended sediment yield per drainage area among all the river of its size and is not affected by active glaciers or volcanic eruptions (Hill et al., 2000; Mackey et al., 2009; Warrick, 2014). Several factors such as the widespread tectonic deformation of the base rocks, landscaping advancements in recent times, high seasonal rainfall, and human ground surface disruption in the last century contribute the high rate of erosion and sediment transport. It is not surprising that the basin boost of most unusual geomorphologic characteristics. The process of sediment transportation on hill-slopes and in channels is closely related, and consequently, there is a significant contribution of high-magnitude and low-frequency climatic events to the formation of channels in this area than in most of other areas. Recently, a new era of soft computing models devoted on the integration of natural-inspired meta-heuristic evolutionary optimization algorithms for the sake of enhancing the predictability of stand-alone AI models within the field of water resources engineering (Yaseen et al., 2017a, 2017b). The motivation of these optimization algorithms is to tune the internal deterministic parameters of AI models based on the concept of natural mechanism behavior (Fahimi et al., 2016). The current study is committed on the implementation of coupled fuzzy logic approach with the genetic algorithm (GA) optimizer. GA optimizer is featured by the potential overcoming the problem of solving the global and local optima effectively. Hence, new intelligence model called evolutionary fuzzy (EF) model is developed to predict sediment transport process.

1.3. Research motivation The surface water quality can be degraded due to the presence of excess sediment in streams and reservoirs, thereby, increasing the cost of treating water for drinking and other usages (Vigiak et al., 2016, 2017). A significant quantity of nutrients, metal, and other hazardous contaminants which can be dangerous to human health might be transported through sediments into the river (Pan et al., 2018). Excess sediments can result in fish mortality in the riverine areas during the embryonic stage of their lives; can also restrict the growth of periphyton and limit primary productivity. Additionally, reservoir storage capacity may be lost to increased sediment loads, thereby, reducing the capability of reservoirs to deliver their services, including hydropower, water supply, and flood control (Afan et al., 2016). Human activities such as deforestation, land use change, agricultural activities, overgrazing, and mismanagement of rivers contribute significantly to the accumulation of sediments in water bodies (Li et al., 2017). It is, therefore, necessary to have a better understanding of the sediment transportation processes, sediment yield, and sediment effects on reservoirs, channels and shallow harbors for appropriate policy and decision making in relation to water resources management (Afan et al., 2016; Wei et al., 2016). There is a need for reliable and accurate models for a perfect understanding of the temporal variations in sediment yield and for the prediction of suspended sediment dynamics at the watershed scale. Several natural processes influence the sediment dynamics in river basins.

1.4. Research objectives Eel River suspended sediment concentration is mainly generated due to the high-intensity monsoon rainfall events. Over the last decades, the river region experienced several attracts of erosion proceedings. This is emphasized in the current study to focus on the implementation of hybrid data-intelligence models for the environmental perspective. The main objectives of this research are: i) investigating new hybrid data-intelligence model called evolutionary fuzzy model for suspended sediment concentration, ii) inspecting the predictability of the proposed model on one of the most affected river in north-western of California (i.e., Eel River), iii) building the predictive model using the historical river discharge as this hydrological variable is the main contributor to the percentage of the SSC, and iv) authenticating the EF model against the predominate integrated version of fuzzy model utilized for suspended sediment transport. 12

Catena 174 (2019) 11–23

O. Kisi, Z.M. Yaseen

Fig. 1. Eel River morphology map presenting the studied metrological stations.

2. The investigated case study

2.2. Basin sediment characteristics

2.1. Site description

The high rate of suspended sediments discharged in the Eel River is a result of the combined effects of high sediment concentrations (on the average of 3000 ppm over discharge at Scotia) and the high discharge rates (Holeman, 1968; Nolan et al., 1987). Gullying and mass movement which are encouraged by human activities around the erodible region contribute significantly to the generation of fine sediment which can be easily moved down to stream channels (Lisle, 1982). Increased precipitation events can trigger the surface erosion of exposed grounds as a result of active earth flows, over-grazing, timber harvesting, and road construction. The movement of particles into channels can also be promoted by an increased soil moisture and erosion of the toes of streamside slides and earth flows. However, the high annual precipitation events in a basin do not guarantee a denser protective cover of vegetation than in basins with a lower precipitation. The relatively low level of precipitation encountered in the winter can be utilized for growing plants, and the basin is adequately vegetated naturally except on the steep hill-slopes along down-cutting channels. Based on this, the rate of sediment discharge in the Coast Range increases with annual precipitation compared to the other areas. Table 1 tabulated the statistical properties of the discharge and suspended sediment for both the stations. The statistical variables presented are maximum, mean and minimum values (xmax, xmean, and xmin), standard deviation (Sx), variation coefficient (Cv), skewness coefficient (Csx). It can be observed from the presented information; the river discharge and sediment have a highly skewed distribution (ranges

Daily of discharge and suspended sediment concentration data belong to two different stations are being used in this study. One of the stations is located on the up-stream of the Eel River in California which is named as Dos Rios with a drainage area 1368 km2. The second stream monitoring station is in the down-stream at Scotia with a drainage area 8063 km2. Both stations are maintained by the US Geological Survey (USGS). Fig. 1 demonstrates the Eel River map with the station's location. The span of the utilized historical data to construct the predictive models covers the period (1966–1977). Fig. 2a, b, c, and d show the actual pattern of the discharge and SSC at the up- and down-stream stations. Through these figures, it can be comprehended the clear correlation between the peak discharge values and the amount of the SSC magnitudes. Eel River has the most dynamic river system in California and this is due to the unsteady geology of the site and major Pacific storms effect. The stream flow is very inconsistent through the year whereby it has average flows over January and February but in August and September the discharge is increased by 100 times. Because of the frequent landslides in the region, the river carries the high suspended sediment load. In this area, the increment in suspended sediment amount is being observed.

13

Catena 174 (2019) 11–23

O. Kisi, Z.M. Yaseen

(a) 1800

(c)

1600

10000

Discharge (m3/s)

9000 1400

Discharge (m3/s)

1200 1000 800 600

8000 7000 6000 5000 4000 3000

400 2000 200

1000

0

01-Oct-1966 18-Jan-1967 07.May.67 24-Aug-1967 11-Dec-1967 29.Mar.68 16-Jul-1968 02-Nov-1968 19-Feb-1969 08-Jun-1969 25-Sep-1969 12-Jan-1970 01.May.70 18-Aug-1970 05-Dec-1970 24.Mar.71 11-Jul-1971 28-Oct-1971 14-Feb-1972 02-Jun-1972 19-Sep-1972 06-Jan-1973 25-Apr-1973 12-Aug-1973 29-Nov-1973 18.Mar.74 05-Jul-1974 22-Oct-1974 08-Feb-1975 28.May.75 14-Sep-1975 01-Jan-1976 19-Apr-1976 06-Aug-1976 23-Nov-1976 12.Mar.77 29-Jun-1977

01-Oct-1966 13-Jan-1967 27-Apr-1967 09-Aug-1967 21-Nov-1967 04.Mar.68 16-Jun-1968 28-Sep-1968 10-Jan-1969 24-Apr-1969 06-Aug-1969 18-Nov-1969 02.Mar.70 14-Jun-1970 26-Sep-1970 08-Jan-1971 22-Apr-1971 04-Aug-1971 16-Nov-1971 28-Feb-1972 11-Jun-1972 23-Sep-1972 05-Jan-1973 19-Apr-1973 01-Aug-1973 13-Nov-1973 25-Feb-1974 09-Jun-1974 21-Sep-1974 03-Jan-1975 17-Apr-1975 30-Jul-1975 11-Nov-1975 23-Feb-1976 06-Jun-1976 18-Sep-1976 31-Dec-1976 14-Apr-1977 27-Jul-1977

0

Date

Date

(b)

(d)

700000

7000000

Suspended Sediment (mg/l)

Suspended Sediment (mg/l)

6000000 600000 500000 400000 300000 200000 100000

5000000 4000000 3000000 2000000 1000000 0

01-Oct-1966 18-Jan-1967 07.May.67 24-Aug-1967 11-Dec-1967 29.Mar.68 16-Jul-1968 02-Nov-1968 19-Feb-1969 08-Jun-1969 25-Sep-1969 12-Jan-1970 01.May.70 18-Aug-1970 05-Dec-1970 24.Mar.71 11-Jul-1971 28-Oct-1971 14-Feb-1972 02-Jun-1972 19-Sep-1972 06-Jan-1973 25-Apr-1973 12-Aug-1973 29-Nov-1973 18.Mar.74 05-Jul-1974 22-Oct-1974 08-Feb-1975 28.May.75 14-Sep-1975 01-Jan-1976 19-Apr-1976 06-Aug-1976 23-Nov-1976 12.Mar.77 29-Jun-1977

01-Oct-1966 18-Jan-1967 07.May.67 24-Aug-1967 11-Dec-1967 29.Mar.68 16-Jul-1968 02-Nov-1968 19-Feb-1969 08-Jun-1969 25-Sep-1969 12-Jan-1970 01.May.70 18-Aug-1970 05-Dec-1970 24.Mar.71 11-Jul-1971 28-Oct-1971 14-Feb-1972 02-Jun-1972 19-Sep-1972 06-Jan-1973 25-Apr-1973 12-Aug-1973 29-Nov-1973 18.Mar.74 05-Jul-1974 22-Oct-1974 08-Feb-1975 28.May.75 14-Sep-1975 01-Jan-1976 19-Apr-1976 06-Aug-1976 23-Nov-1976 12.Mar.77 29-Jun-1977

0

Date

Date

Fig. 2. Discharge and suspended sediment concentration pattern for the up-stream investigated metrological station (a and b), discharge and suspended sediment concentration pattern for the down-stream investigated metrological station (c and d).

are 7.11–14.5 and 8.05–19.5 for the down-stream and up-stream stations). This is clearly obvious over the validation and testing phases of the data set. On the other hand, the ratio (xmax/xmean) of the SSC over the validation and testing data are high. The exhibited statistics evidenced the highly stochastic and redundant between the river flow discharge and the suspended sediment concentration.

simulate human thinking abilities. The fuzzy theory depends on the thought of relatively obtained membership degrees (MDs) range [0,1]. The sets in FL are able to map obscure or questionable data, regularly observed in our life. The FL inference system is illustrated in Fig. 3a. It has 3 parts: i) a fuzzy rule base that includes the IF-THEN rules linked the inputs to the output through the membership functions (MFs), ii) a database comprising MFs utilized in FL rules; iii) a system of inference which combines some rules to identify the relationships between inputs and outputs to acquire a feasible output. Inputs-outputs data are taken into consideration as comprising vague features in the fuzzification and as a result, data are partitioned in sub-sets characterized via linguistic terms such as high and low and MFs are then obtained. A single value is calculated using the linguistic output in defuzzification (Nayak et al., 2005). The part between IF and THEN and the part after THEN are

3. The applied methodology 3.1. Fuzzy logic set (FL) FL approach is utilized in various research areas (Zadeh, 1965). The theory of FL involves a system having linguistic structures e.g. “big”, “small”, “moderate”, “little” etc. Theory of classical binary set defines crisp events whereas fuzzy logic possesses an inference system that can Table 1 The daily statistical parameters of data set for the stations. Set

Station

Training

Upstream (11472150) Downstream (11477000) Upstream (11472150) Downstream (11477000) Upstream (11472150) Downstream (11477000)

Validation

Test

Basin area (km2) 1368 8063 1368 8063 1368 8063

Data type

xmax

xmean

xmin

xmax xmean

Sx

Flow (m3 s−1) Sediment (Mg l−1) Flow (m3 s−1) Sediment (Mg l−1) Flow (m3 s−1) Sediment (Mg l−1) Flow (m3 s−1) Sediment (Mg l−1) Flow (m3 s−1) Sediment (Mg l−1) Flow (m3 s−1) Sediment (Mg l−1)

1270 318,000 7560 4,930,000 1590 661,000 9170 6,230,000 680 86,500 5270 2,870,000

32.7 2790 266 60,966 33.3 2706 296 46,210 12.1 561 131 18,432

0.05 0 2.07 0.23 0.08 0 2.32 0 0 0 0.71 0.06

38.8 114 28.4 80.9 47.7 244 31.0 135 56.1 154 40.3 156

92.8 17,076 625 288,396 95.5 25,108 693 303,083 49.1 4780 408 143,023

14

Cv (Sx/xmean) 2.83 6.12 2.35 4.73 2.86 9.28 2.34 6.55 4.05 8.52 3.12 7.76

Csx 6.31 10.8 5.01 8.50 8.02 19.5 6.32 14.4 8.05 13.1 7.11 14.5

Catena 174 (2019) 11–23

O. Kisi, Z.M. Yaseen

(a)

(b) (i)

1

1

0

1

0

0

(ii)

1

0

1

0

0

1

(iii)

1

1

0

1

0

1

(iv)

1

0

1

0

0

0

(c) (i)

1

0

1

0

0

(ii)

1

1

0

1

1

0

1

Fig. 3. (a) The general fuzzy inference system procedure, (b) bit-string crossover of parents (i) and (ii) to form offspring (iii) and (iv), (c) bit-flipping mutation of parents (ii) to form offspring (ii), d) the evolutionary fuzzy model flow chart steps, and e) adaptive neuro-fuzzy inference system architecture.

referred as antecedent and consequent, respectively. In the present study, Gaussian fuzzy MFs were used in all fuzzy based models. Let assume that we have 3 input variables containing 2 MFs in the antecedent phase and thus supposed to be there are 23 rules in the fuzzy rule base. The increment in subsets quantity might bring better precision. For this situation, nevertheless, the rule base will be bigger and hard to build (Şen, 1998). Expect that our model has 2 inputs, and each has 2 MFs with “heavy” and “light” labels and 1 output. In this case, we will have 4 rules as: Rule 1: IF x1 is heavy and x2 is heavy THEN y1 Rule 2: IF x1 is heavy and x2 is light THEN y2 Rule 3: IF x1 is light and x2 is heavy THEN y3 Rule 4: IF x1 is light and x2 is light THEN y4where x1 and x2 stand for 1st and 2nd inputs and (yi, …, y4) are linear equations or constants. In the current study, the final output, y was calculated by the following equation: 4

y=

n=1 4

optimization issues (Sivanandam and Deepa, 2008). The main principles of the genetic algorithm are chromosomes population initialization, crossover, mutation and reproduction (Goldberg et al., 1989). Three main steps are comprised in GA followed the established research (Solomatine and Ostfeld, 2008), the steps can be explained as followed: ➢ The production of the preliminary population: GA produces an arrangement of strings, with each string or chromosome including an arrangement of parameters for optimization. ➢ Strings' fitness computation: GA checks each string's fitness (i.e., evaluation of goal function). ➢ Production of a next generation: New generation of chromosomes is initiated by employing the crossover and selection processes. The selection process is used to select the suitable chromosomes from the previous population based on its fitness value. Bit-string crossover is one of the important reproduction operators utilized (Fig. 3b). Two strings (parents) are utilized in this operator and next individuals are produced by exchanging a sub- grouping between the two strings. Bit-flipping mutation is another important operator (Fig. 3c). A single gene (or bit) in the chromosome is flipped in mutation operator to form another offspring chromosome. In GAs, all operators are restricted to handle the chromosome in a parallel mode. For example, 2 genes having the same location on 2 chromosomes might be swapped with the original parents; however, not merged, and that is depending on their values. Individuals are generally selected to be the parents in accordance with the magnitude of the goal or fitness, the formed offspring change the parents (Sivanandam and Deepa, 2008). GA is an effective technique with respect to seek the ideal solution to complicated issues, for instance, the selection of the MFs nearly

wn yn

n=1

wn

(1)

the output value, y, could be easily calculated using Eq. (1) for any set of inputs after determining the fuzzy rule base (Şen, 1998). Detailed information related to FL theory could be obtained from the related books (Kosko, 1993; Ross, 2010). 3.2. Genetic algorithm Genetic algorithms (GAs) firstly reported by (Holland, 1975). GAs have been recently utilized as an effective method for solving 15

Catena 174 (2019) 11–23

O. Kisi, Z.M. Yaseen

(d) Randomly generation of initial population

Finding optimum parameters of the MFs

Estimating SSC with fuzzy inference method

Calculating objective function (RMSE)

Selecting chromosomes to pass the next generation

Obtaining new chromosomes with crossover and mutation

Yes No

Is objective function optimized?

Is iteration number reached?

No

Yes Solution with optimized parameters

End

Fig. 3. (continued)

3.3. EF approach

cannot be justified optimally (Shoorehdeli et al., 2006). The fundamental distinctions between GAs and traditional optimization techniques are:

In the current study, the evolutionary fuzzy models were obtained by combining two approaches, fuzzy logic, and GA. The best values of the fuzzy model's parameters (e.g., parameters of MFs or linear equations) were computed by utilizing GA. The mechanism of the evolutionary fuzzy model is exhibited in Fig. 3d. GA is employed for minimization of the goal error between the model's results and corresponding observed ones. Mean square error (MSE) is utilized in the present study as a fitness in GA.

➢ In GAs, parameters sets are coded, not the individual parameters. ➢ In GAs, a local minimum is investigated from a population instead of a single point. ➢ In GAs, the information of goal function is utilized, not auxiliary information. ➢ In GAs, the rule of probabilistic evolution is used instead of deterministic ones (Goldberg, 1989).

MSE =

The optimal solutions are investigated by GA. The investigated problem is changed over the binary form and crossover and mate are applied to the solutions with a predetermined evaluation statistic to give the minimum. The detailed GAs theory may be acquired from Ahmed and Sarma (2005) and Wang (1991).

1 N

N

(yiobserved i=1

yimodel ) 2

(2)

where N is the quantity of data in training. In this study, the goal function (Eq. (2)) is optimized via modifying the MFs parameters of each input and output. This issue (optimizing the fuzzy MFs) is very complicated for the scheme of supervised learning. GA, nevertheless, uses a scheme of non-supervised learning and may be effectively employed in the solution of such problems (Goldberg, 1989; Özger, 2009). 16

Catena 174 (2019) 11–23

O. Kisi, Z.M. Yaseen

Fig. 3. (continued)

3.4. Adaptive neuro-fuzzy inference system model (ANFIS)

validation and testing of the applied models, respectively. The proposed sediment predictive model tested using various input attributes combination that based on the antecedent flow discharge values (as the river flow is the main surface hydrology component stimulus the movement and the concentration of the sediment particles). The input combination structure tabulated in Table 2 that constructed in accordance to the correlation statistics between sediment and river flow discharge. This procedure implemented with harmony with several researches conducted in the literature of the same research field (Afan et al., 2017; Rai and Mathur, 2008; Senthil Kumar et al., 2012; Van Maanen et al., 2010). The presented information of the correlation in Table 2 showed that including current and three antecedent discharge records gave the yield point, in which the fourth day exhibits less correlation with the targeted SSC value. The input combinations are presented as follows:

ANFIS, as a generic approximator, was firstly presented by Jang (1996). It has a system structure involving various nodes linked via directional connections. Every node is portrayed by a node function comprising constant or alterable parameters (Jang et al., 1997). Imagine a FL inference system comprising 3 inputs (x, y, and z) and 1 output (f) and its rules include 2 TS (Takagi and Sugeno) type fuzzy rules (Takagi and Sugeno, 1985):

Rule 1: If x is A1, y is B1 and z is C1 then f1 = p1 x + q1 y + r1 z + s1 (3)

Rule 2: If x is A2, y is B2 and z is C2 then f2 = p2 x + q2 y + r2 z + s2 (4) where f1, f2 demonstrate the function of output for the 1st and 2nd rules. An identical ANFIS structure is illustrated in Fig. 3e. The output of ANFIS includes linear or constant functions. More theory about ANFIS can be obtained from Jang (1996). In the current study, we used three different ANFIS methods, ANFIS with subtractive clustering (ANFIS-SC), ANFIS with grid partition (ANFIS-GP), and ANFIS with fuzzy c-means (ANFIS-FCM). The GP is an input partitioning method and frequently used in the literature (Jang, 1996). In SC method, data are partitioned into clusters and less number of fuzzy rules are obtained compared to GP method. An iterative method used in FCM with the aim of computing cluster centers by minimizing the square error. For the detailed information for these methods, the readers are referred to (Kisi et al., 2017).

(5)

Model 1:

SSC(t ) = Q(t )

Model 2:

SSC(t ) = Q(t ) , Q(t

1)

Model 3:

SSC(t ) = Q(t ) , Q(t

1) ,

Q (t

2)

Model 4:

SSC(t ) = Q(t ) , Q(t

1) ,

Q (t

2) ,

(6) (7)

Q(t

(8)

3)

The proposed and the comparable predictive models were evaluated using four different quantitative statistical metrics including root mean square error (RMSE), mean absolute error (MAE), Nash-Sutcliffe coefficient (NSE) and Willmott's index of agreement (WI). The Table 2 The cross-correlations between discharge and SSC in the upstream and downstream stations.

3.5. Input variables structure and prediction indicators In the up- and down-stream stations, the data periods October 01, 1966–September 30, 1971, October 01, 1971–September 30, 1974 and October 01, 1974–September 30, 1977 were utilized for training,

Upstream St Downstream St

17

Qt

Qt-1

Qt-2

Qt-3

Qt-4

Qt-5

0.884 0.873

0.545 0.524

0.353 0.358

0.353 0.358

0.310 0.317

0.275 0.259

Catena 174 (2019) 11–23

O. Kisi, Z.M. Yaseen

mathematical description can be explained as follows (Chai and Draxler, 2014; Nash and Sutcliffe, 1970; Willmott et al., 2012):

1 N

RMSE =

MAE =

1 N

NSE = 1

WI = 1

outperformed the other models (i.e., ANFIS-GP, ANFIS-SC and ANFISFCM) for the up-stream and down-stream stations using the evaluation metrics (i.e., RMSE, MAE, NSE, and WI). For the up-stream station, the optimal antecedent river flow information identified for the proposed (EF) and the comparable (ANFISGP, ANFIS-SC, and ANFIS-FCM) models are (Model 3, Model 4, Model 1 and Model 3). Whereas, the down-stream station attained the best results are (Model 3, Model 4, Model 1 and Model 1). The optimal training (RMSE-MAE-NSE-WI) for the EF are (4579-983-0.928-0.9999) “up-stream station” and (75041-19719-0.932-0.9999) “down-stream station”. Whereas, the testing phase of the modeling attained (RMSEMAE-NSE-WI) values (2583-413-0.708-0.9989) “up-stream station” and (42714-9032-0.911-0.9997) “down-stream station”. The testing phase is the most significant phase that matter for the evaluation of engineering concern. The enhancement of the prediction accuracies through this phase (i.e., RMSE-MAE) using the EF over the other models (i.e., ANFIS-GP, ANFIS-SC and ANFIS-FCM) are ((25.1–6%), (29.3–35.2%), and (10.8–0.4%)) and ((8.8–23.6%), (31–45.2%), and (38.3–12.1%)) for up-stream and down-stream stations. The EF has superior accuracy with respect to the NSE and WI statistics. WI has an advantage over the RMSE and R2 which describe squared differences between measured and predicted SSC and neglect the low values. In WI, however, the ratio of mean square error is considered instead (Diop et al., 2018; Willmott, 1984). It is evident from the Table 5 that the best EF (Model 3 for both stations) have the highest NSE values, 0.708 and 0.911, which indicate ‘good’, and, ‘very good’ efficiencies of the models (Moriasi et al., 2007). The general evaluation of the proposed evolutionary fuzzy model seemed to perform the SSC prediction more precisely than the comparable models (ANFIS-GP, ANFIS-SC, and ANFISFCM). It is important to state that, the proposed and the validation models exhibited no consistency in the input information for the best prediction outcome. Indeed, this is not surprising as the data-intelligent models act differently from one case to another in accordance with the learning process of the model. Also, that is relying on the provided historical information from one case to another. The other important reason for this is the highly complex behavior of discharge-sediment phenomenon (please see the highly skewed distributions of discharge and SSC in Table 1 together with their significantly high xmax/xmax ratio). Moreover, the high different ranges (xmax–xmax) and distributions between

n

Sf )2

(So

(9)

t=1

n

So

Sf

(10)

t=1 N i=1 N i=1

(So

Sf ) 2

(So

So ) 2

N i=1 N i=1

(|Sf

(So

,

NSE

Sf ) 2

So | + |So

So |)2

1

,0

(11)

WI

1

(12)

where the So and Sf are the observed and predicted values of SSC data. Whereas, So and Sf are the mean value of the observed and predicted values of SSC data. 4. Modeling results and discussion The motivation of the current research is to develop new robust intelligent predictive models called evolutionary fuzzy model (EF) for environmental river engineering application (i.e., sediment transport prediction). Sediment concentration prediction as stated in the introductory section can profit significant knowledge to multiple application of water resources engineering and planning including reservoir operation, river topology mechanism, water quality percentage and others. The new EF model was evaluated against the predominant family of the fuzzy logic models such as ANFIS integrated with GP, SC and FCM. The modeling of the SSC established based on river flow discharge information for two different coordinated stations on Eel River upstream and down-stream. In quantitative visualization, Tables 3, 4 and 5 illustrate the training, validation, and testing performance metrics in addition to the optimum parameters of the all developed models (EF, ANFIS-GP, ANFIS-SC and ANFIS-FCM) for up-stream and down-stream stations, respectively. Based on the visualized prediction performance of the daily SSC using discharge information, it can be observed that EF model

Table 3 The training and validation performances of the EF, ANFIS-GP, ANFIS-SC and ANFIS-FCM models in suspended sediment prediction for up-stream station. Models

EF (2,gauss,10000)a EF (2,gauss,1000) EF (3,gauss,5000) EF (3,gauss,5000) ANFIS-GP (3,gauss,100)b ANFIS-GP (2,gauss,70) ANFIS-GP (2,3,3,gauss,10)c ANFIS-GP (2,2,3,3,gauss,100) ANFIS-SC (0.5,50)d ANFIS-SC (1100) ANFIS-SC (1100) ANFIS-SC (1100) ANFIS-FCM (7,30)e ANFIS-FCM (8,00) ANFIS-FCM (5,40) ANFIS-FCM (5,40) a b c d e

Inputs

Model Model Model Model Model Model Model Model Model Model Model Model Model Model Model Model

Training

1 2 3 4 1 2 3 4 1 2 3 4 1 2 3 4

Validation

RMSE (Mg/l)

MAE (Mg/l)

5482 4876 4579 4436 17,297 17,294 17,240 3411 5813 4275 3490 2363 4764 4091 4387 3863

1378 1190 983 1095 2790 2800 2769 742 2148 1483 906 593 961 812 1017 857

NSE

WI

RMSE (Mg/l)

MAE (Mg/l)

NSE

WI

0.897 0.918 0.928 0.932 −0.027 −0.026 −0.020 0.960 0.884 0.937 0.958 0.981 0.922 0.943 0.934 0.949

0.9998 0.9999 0.9999 0.9999 0.9972 0.9972 0.9972 0.9999 0.9998 0.9999 0.9999 0.9999 0.9999 0.9999 0.9999 0.9999

9704 9783 9973 12,232 25,242 25,240 25,171 12,364 9753 9562 36,653 7,981,014 6965 7364 8998 14,628

1604 1415 1259 1610 2705 2716 2684 1508 1780 1497 3061 346,190 1119 1021 1247 1625

0.850 0.848 0.842 0.762 −0.012 −0.012 −0.006 0.757 0.849 0.855 −1.133 −1011 0.923 0.914 0.871 0.660

0.9990 0.9990 0.9990 0.9985 0.9897 0.9896 0.9897 0.9986 0.9990 0.9991 0.9910 0.5385 0.9995 0.9995 0.9992 0.9978

2 Gaussian MFs for each input and 10,000 generations. 3 Gaussian MFs for each input and 100 iterations. 2,3,3 Gaussian MFs for the inputs Qt, Qt-1, Qt-2, respectively and 10 iterations. 0.5 radii value and 50 iterations. 7 clusters and 30 iterations. 18

Catena 174 (2019) 11–23

O. Kisi, Z.M. Yaseen

Table 4 The training and validation performances of the EF, ANFIS-GP, ANFIS-SC and ANFIS-FCM models in suspended sediment prediction for down-stream station. Models

Inputs

EF (2,gauss,20000)a EF (5,gauss,1000) EF (2,gauss,50,000) EF (4,gauss,5000) ANFIS-GP (3,gauss,100)b ANFIS-GP (3,gauss,100) ANFIS-GP (3,gauss,100) ANFIS-GP (2,gauss,100) ANFIS-SC (1100)c ANFIS-SC (1100) ANFIS-SC (0.3,70) ANFIS-SC (1100) ANFIS-FCM (2100)d ANFIS-FCM (8,10) ANFIS-FCM (7,10) ANFIS-FCM (5,80) a b c d

Model Model Model Model Model Model Model Model Model Model Model Model Model Model Model Model

Training

1 2 3 4 1 2 3 4 1 2 3 4 1 2 3 4

Validation

RMSE (Mg/l)

MAE (Mg/l)

NSE

WI

RMSE (Mg/l)

MAE (Mg/l)

NSE

WI

85,898 63,122 75,041 72,437 294,692 294,692 294,692 64,740 90,461 64,107 53,369 39,785 87,062 79,485 87,877 336,012

23,182 16,028 19,719 19,282 60,966 60,966 60,966 20,388 35,472 21,007 17,070 12,203 24,860 17,499 19,304 56,333

0.911 0.952 0.932 0.937 −0.045 −0.045 −0.045 0.950 0.902 0.951 0.966 0.981 0.909 0.924 0.907 −0.358

0.9999 0.9999 0.9999 0.9999 0.9982 0.9982 0.9982 0.9999 0.9999 0.9999 0.9999 0.9999 0.9999 0.9999 0.9999 0.9988

164,866 168,184 214,455 182,543 306,449 306,449 306,449 207,596 156,225 377,626 360,274 1,382,684 157,881 222,667 167,361 747,048

32,127 31,977 40,579 34,471 46,210 46,210 46,210 39,893 37,656 42,714 47,307 120,462 34,649 29,080 32,601 86,449

0.704 0.692 0.499 0.637 −0.023 −0.023 −0.023 0.530 0.734 −0.554 −0.414 −19.83 0.728 0.460 0.695 −5.081

0.9991 0.9990 0.9985 0.9989 0.9943 0.9943 0.9943 0.9986 0.9992 0.9958 0.9965 0.9727 0.9992 0.9985 0.9991 0.9894

2 Gaussian MFs for each input and 10,000 generations. 3 Gaussian MFs for each input and 100 iterations. 1 radii value and 100 iterations. 2 clusters and 100 iterations.

the training and validation/test data bring additional difficulties to the applied models. Scatter plot generated between the observed SSC values and predicted values over the testing phase of the modeling for both investigated stations (see Fig. 4a and b). The determination coefficient (R2) demonstrated an excellent correlation to the ideal line of the 45° for the EF. To have a more detailed inspection of the four data-intelligence predictive models, a combination of three metrics including standard deviation, correlation, and root mean square error is constructed to be visualized as a Taylor diagram (Fig. 5) (Taylor, 2001). The main point of this diagram is to summarize multiple performance metrics in one combination and statistically quantify the degree of similarity between the observed (actual suspended sediment concentration) and the predicted values. Based on the graphical presentation, the EF model located closer to the benchmark observed record in comparison with ANFIS-GP, ANFIS-SC, and ANFIS-FCM models. It can be seen very obvious for the up-stream station and with slight difference between EF and ANFIS-SC. This is not surprising as ANFIS-SC model showed the

excellent performance of prediction in the last decade within the field of hydrology and environment (Kisi et al., 2015; Kisi and ZounematKermani, 2014; Orouji and Haddad, 2013; Sanikhani et al., 2012). The analyzed data of the suspended sediment of the two stations in up-stream and down-stream of the Eel River, it is very clear the SSC amount values of the down-stream station is highly greater than the upstream. This is owing to the uplifting of the streamside land sliding which presents the main source of the sediment. Here the uplifting of the soil originally materialized due to the high event of river flow discharge. It is worth it to validate the current research results with the established literature. Rajaee et al. (2009) applied ANFIS and neural networks (NN) for daily suspended sediment (SS) concentration of Little Black River and Salt River, USA. Previous streamflow and SS values were used as inputs to the applied models. Authors used simple two data separation rule, training, and testing. Skewness coefficients of the SS data used vary from 4.63 to 6.69. ANFIS model attained R2 value of 0.697. Senthil Kumar et al. (2012) modeled SS concentration of Kasol, Sutlej basin, India using various AI models including: NN, radial basis

Table 5 The testing performances of the EF, ANFIS-GP, ANFIS-SC and ANFIS-FCM models in suspended sediment prediction for the up-stream and down-stream stations. Models

EF EF EF EF ANFIS-GP ANFIS-GP ANFIS-GP ANFIS-GP ANFIS-SC ANFIS-SC ANFIS-SC ANFIS-SC ANFIS-FCM ANFIS-FCM ANFIS-FCM ANFIS-FCM

Inputs

Model Model Model Model Model Model Model Model Model Model Model Model Model Model Model Model

Up-stream

1 2 3 4 1 2 3 4 1 2 3 4 1 2 3 4

Down-stream

RMSE (Mg/l)

MAE (Mg/l)

2588 2654 2583 2685 4810 4808 4772 3449 2719 3271 3655 20,679 3482 3060 2898 3510

503 445 413 476 561 585 558 444 1339 976 638 1879 434 393 415 426

NSE

WI

RMSE (Mg/l)

MAE (Mg/l)

NSE

WI

0.707 0.691 0.708 0.684 −0.014 −0.013 0.003 0.479 0.676 0.531 0.415 −17.74 0.469 0.590 0.632 0.460

0.9990 0.9989 0.9989 0.9989 0.9918 0.9914 0.9917 0.9982 0.9994 0.9989 0.9982 0.9712 0.9980 0.9985 0.9987 0.9981

47,773 44,593 42,714 45,493 144,141 144,141 144,141 48,915 50,972 65,172 122,783 358,188 50,489 72,319 75,641 337,632

10,285 8414 9032 10,149 18,432 18,432 18,432 11,023 22,568 15,352 17,019 29,063 11,007 9573 11,154 28,816

0.888 0.903 0.911 0.899 −0.017 −0.017 −0.017 0.883 0.873 0.792 0.262 −5.278 0.875 0.744 0.720 −4.578

0.9996 0.9996 0.9997 0.9996 0.9929 0.9929 0.9929 0.9996 0.9997 0.9993 0.9978 0.9834 0.9996 0.9991 0.9990 0.9865

19

Catena 174 (2019) 11–23

O. Kisi, Z.M. Yaseen

Fig. 4. Scatter plot graphical presentation between the observed and the predictive intelligence models, a) the upstream investigated metrological station b) the downstream investigated metrological station.

function (RBF), fuzzy logic, M5 model tree and REPTree. Input variables were antecedent values of rainfall, streamflow, and SS. The best model (NN) provided the NSE of 0.91. Vafakhah (2012) developed neuro-fuzzy, ANN and cokriging based models for daily SS estimation.

Various input combinations based on the current and past daily rainfall and streamflow data obtained from the Kojor forest watershed near the Caspian Sea. The achieved results of the model's predictability displayed a value of NSE 0.64, 0.79 and 0.85 for the best neuro-fuzzy, 20

Catena 174 (2019) 11–23

O. Kisi, Z.M. Yaseen

attributes for the developed model. This is very significant for limited hydrological data catchments. Nearly 90% of the conducted literature studies were modeled using sediment information in addition to other hydrological variables as an input attribute. Indeed, this is not recommended practically due to the measurement of SS is a very difficult task from the engineering point of view. Also, the SS value of the current day should be measured and applied as an input to estimate tomorrow's value. However, streamflow can be measured or calculated from river stage data much easier than the SS. To conclude the discussion section, the proposed EF demonstrates an adequate predictive model for the investigated environmental problem over the other version of the integrated ANFIS models. The main merit of the EF in its integration with the nature-inspired genetic algorithm. On the contrary with ANFIS model, the gradient descent algorithm is used to define the MFs, in which several disadvantages can be faced such as tapped in the local minima (Piotrowski and Napiorkowski, 2011). Note that GA is a combination of stochastic and direct searching solution that never tapped in the local optima problem (David and Greental, 2014; Garousi-Nejad et al., 2016). 5. Conclusion and remarks In this research, a new hybrid intelligence model called evolutionary fuzzy is proposed to predict SSC at El River, one of the very big and problematic rivers in northern California. Different data-intelligence models of the same family of the fuzzy logic set (ANFIS-SC, ANFIS-GP, and ANFIS-FCM) were developed for validation purpose of the EF. The prediction models built using the information of river flow discharge as this hydrological process is the main casual correlated variable to the SSC. The modeling carried out on two different locations of the river (i.e., up-stream and down-stream). The results indicated the potential of the evolutionary fuzzy predictive model to mimic the SSC pattern with a high level of accuracy over the comparable models. The predictability effectiveness of the EF attributed to several aspects of the environmental concerns. It is highly recommended to integrate the applicability of this evolutionary predictive model with the typical monitoring expert system of the Eel River. The current findings of this research can be expanded for future research by employing more hydrological or climatological or even morphological information to the predictive model. In addition, other water quality parameters that are associated with the percentage of the SSC can be included in the modeling for inspection.

Fig. 5. Tayler diagram exhibition denoting the prediction skill between the data-intelligence models and the observed record of SSC a) the upstream investigated metrological station b) the downstream investigated metrological station.

References

ANN, and cokriging models, respectively. Another study conducted on modeling daily SS of Kopili River, India, using NN, RBF, least square support vector regression (LS-SVR), multi-linear regression (MLR), Classification and Regression Tree (CART) and M5 model tree (Kumar et al., 2016). The scholars included various combinations of current and past rainfall, streamflow, SS data in the model inputs. They used two data separation rule, calibration and validation. LS-SVR attained the best prediction results with NSE value equal to 0.89. Most of the studies in the related literature used simple two data-splitting rule, training and testing. Throughout the numerical statistic performance of the literature studies, it is very obvious that the proposed modeling strategy of the evolutionary fuzzy model successfully estimates SS of Eel River using only the current and antecedent streamflow values. In the present study, three data splitting procedures (i.e., training, validation and testing), were applied. Here, the optimal model was designated using training and validation data sets where the validation phase was used for controlling the overfitting learning process. The testing data phase was performed as independent dataset which is not known by the developed model. The other advantage of the present study is the use of the individual streamflow information as input

Afan, H.A., El-Shafie, A., Yaseen, Z.M., Hameed, M.M., Wan Mohtar, W.H.M., Hussain, A., 2014. ANN based sediment prediction model utilizing different input scenarios. Water Resour. Manag. 29, 1231–1245. https://doi.org/10.1007/s11269-014-0870-1. Afan, H.A., El-Shafie, A., Mohtar, W.H.M.W., Yaseen, Z.M., 2016. Past, present and prospect of an Artificial Intelligence (AI) based model for sediment transport prediction. J. Hydrol. https://doi.org/10.1016/j.jhydrol.2016.07.048. Afan, H.A., Keshtegar, B., Mohtar, W.H.M.W., El-Shafie, A., 2017. Harmonize input selection for sediment transport prediction. J. Hydrol. 552, 366–375. https://doi.org/ 10.1016/j.jhydrol.2017.07.008. Agarwal, A., 2009. Forecasting of runoff and sediment yield using artificial neural networks. J. Water Resour. Prot. 01, 368–375. https://doi.org/10.4236/jwarp.2009. 15044. Ahmed, J.A., Sarma, A.K., 2005. Genetic algorithm for optimal operating policy of a multipurpose reservoir. Water Resour. Manag. 19 (2), 145–161. Alcayaga, H.A., Mao, L., Belleudy, P., 2018. Predicting the geomorphological responses of gravel-bed rivers to flow and sediment source perturbations at the watershed scale: an application in an Alpine watershed. Earth Surf. Process. Landf. 43, 894–908. https://doi.org/10.1002/esp.4278. Altunkaynak, A., 2009. Sediment load prediction by genetic algorithms. Adv. Eng. Softw. 40, 928–934. https://doi.org/10.1016/j.advengsoft.2008.12.009. Aytek, A., Kişi, Ö., 2008. A genetic programming approach to suspended sediment modelling. J. Hydrol. 351, 288–298. https://doi.org/10.1016/j.jhydrol.2007.12.005. Azamathulla, H.M., Ghani, A.A., Chang, C.K., Hasan, Z.A., Zakaria, N.A., 2010. Machine learning approach to predict sediment load - a case study. Clean Soil Air Water 38. https://doi.org/10.1002/clen.201000068.

21

Catena 174 (2019) 11–23

O. Kisi, Z.M. Yaseen Bakhtyar, R., Ghaheri, A., Yeganeh-Bakhtiary, A., Baldock, T.E., 2008. Longshore sediment transport estimation using a fuzzy inference system. Appl. Ocean Res. 30, 273–286. https://doi.org/10.1016/j.apor.2008.12.001. Batt, H.A., 2013. Relevance vector machine models of suspended fine sediment transport in a shallow lake - I: data collection. Environ. Eng. Sci. 30, 681–688. https://doi.org/ 10.1089/ees.2012.0487. Buyukyildiz, M., Kumcu, S.Y., 2017. An estimation of the suspended sediment load using adaptive network based fuzzy inference system, support vector machine and artificial neural network models. Water Resour. Manag. 31, 1343–1359. https://doi.org/10. 1007/s11269-017-1581-1. Chai, T., Draxler, R.R., 2014. Root mean square error (RMSE) or mean absolute error (MAE)? -arguments against avoiding RMSE in the literature. Geosci. Model Dev. 7, 1247–1250. https://doi.org/10.5194/gmd-7-1247-2014. Chang, H.H., 2008. River morphology and river channel changes. Trans. Tianjin Univ. 14, 254–262. https://doi.org/10.1007/s12209-008-0045-3. David, O.E., Greental, I., 2014. Genetic algorithms for evolving deep neural networks. In: Proceedings of the 2014 Conference Companion on Genetic and Evolutionary Computation, pp. 1451–1452. https://doi.org/10.1145/2598394.2602287. Diop, L., Bodian, A., Djaman, K., Yaseen, Z.M., Deo, R.C., El-Shafie, A., Brown, L.C., 2018. The influence of climatic inputs on stream-flow pattern forecasting: case study of Upper Senegal River. Environ. Earth Sci. 77, 182. https://doi.org/10.1007/s12665018-7376-8. Dogan, E., 2005. Suspended sediment load estimation in lower Sakarya River by using artificial neural networks, fuzzy logic and neuro-fuzzy models. Electron. Lett. Sci. Eng. 1, 22–32. Doğan, E., Yüksel, İ., Kişi, Ö., 2007. Estimation of total sediment load concentration obtained by experimental study using artificial neural networks. Environ. Fluid Mech. 7, 271–288. https://doi.org/10.1007/s10652-007-9025-8. Ebtehaj, I., Bonakdari, H., 2016. A comparative study of extreme learning machines and support vector machines in prediction of sediment transport in open channels. Int. J. Eng. Trans. B Appl. Int. J. Eng. J 29. Ebtehaj, I., Bonakdari, H., Shamshirband, S., Mohammadi, K., 2016. A combined support vector machine-wavelet transform model for prediction of sediment transport in sewer. Flow Meas. Instrum. 47, 19–27. https://doi.org/10.1016/j.flowmeasinst. 2015.11.002. Fahimi, F., Yaseen, Z.M., El-Shafie, A., 2016. Application of soft computing based hybrid models in hydrological variables modeling: a comprehensive review. Theor. Appl. Climatol. 1–29. https://doi.org/10.1007/s00704-016-1735-8. Fischer, S., Pietroń, J., Bring, A., Thorslund, J., Jarsjö, J., 2017. Present to future sediment transport of the Brahmaputra River: reducing uncertainty in predictions and management. Reg. Environ. Chang. 17, 515–526. https://doi.org/10.1007/s10113-0161039-7. Frings, R.M., Kleinhans, M.G., 2008. Complex variations in sediment transport at three large river bifurcations during discharge waves in the river Rhine. Sedimentology 55, 1145–1171. https://doi.org/10.1111/j.1365-3091.2007.00940.x. Garousi-Nejad, I., Bozorg-Haddad, O., Loáiciga, H.A., 2016. Modified firefly algorithm for solving multireservoir operation in continuous and discrete domains. J. Water Resour. Plan. Manag., 04016029. https://doi.org/10.1061/(ASCE)WR.1943-5452. 0000644. Goldberg, D.E., 1989. Genetic Algorithms in Search, Optimization, and Machine Learning. Addison Wesleyhttps://doi.org/10.1007/s10589-009-9261-6. Goldberg, D.E., Korb, B., Deb, K., 1989. Messy genetic algorithms: motivation, analysis, and first results. Engineering 3, 493–530. Goyal, M.K., 2014. Modeling of sediment yield prediction using M5 model tree algorithm and wavelet regression. Water Resour. Manag. 1991–2003. https://doi.org/10.1007/ s11269-014-0590-6. Greimann, B.P., Muste, M., Holly, F.M., 1999. Two-phase formulation of suspended sediment transport. J. Hydraul. Res. 37, 479–500. https://doi.org/10.1080/00221686. 1999.9628264. Hill, P.S., Milligan, T.G., Geyer, W.R., 2000. Controls on effective settling velocity of suspended sediment in the Eel River flood plume. Cont. Shelf Res. 20, 2095–2111. https://doi.org/10.1016/S0278-4343(00)00064-9. Holeman, J.N., 1968. The sediment yield of major rivers of the world. Water Resour. Res. 4, 737–747. https://doi.org/10.1029/WR004i004p00737. Holland, J., 1975. Adaptation in Natural and Artificial Systems. Univ. Michigan Press, Ann Arbor. Huang, J.Z., Ge, X.P., Yang, X.F., Zheng, B., Wang, D.S., 2012. Remobilization of heavy metals during the resuspension of Liangshui River sediments using an annular flume. Chin. Sci. Bull. 57, 3567–3572. https://doi.org/10.1007/s11434-012-5370-1. Jaiyeola, A.T., 2015. Modelling streamflow-sediment relationship using genetic programming. Adv. Energy Environ. Sci. Eng. 41, 124–129. Jang, J.-S., 1996. Input selection for ANFIS learning. In: Proceedings of the Fifth IEEE International Conference on Fuzzy Systems, pp. 1493–1499 1996. Jang, J.-S.R., Sun, C.-T., Mizutani, E., 1997. Neuro-Fuzzy and Soft Computing: A Computational Approach to Learning and Machine Intelligence. Prentice Hall. Javernick, L.A., Redolfi, M., Bertoldi, W., 2018. Evaluation of a numerical model's ability to predict bed load transport observed in braided river experiments. Adv. Water Resour. 115, 207–218. https://doi.org/10.1016/j.advwatres.2018.03.012. Kabiri-Samani, A.R., Aghaee-Tarazjani, J., Borghei, S.M., Jeng, D.S., 2011. Application of neural networks and fuzzy logic models to long-shore sediment transport. Appl. Soft Comput. J. 2880–2887. https://doi.org/10.1016/j.asoc.2010.11.021. Kisi, O., 2012. Modeling discharge-suspended sediment relationship using least square support vector machine. J. Hydrol. 456–457, 110–120. https://doi.org/10.1016/j. jhydrol.2012.06.019. Kisi, O., 2015. Streamflow Forecasting and Estimation Using Least Square Support Vector Regression and Adaptive Neuro-Fuzzy Embedded Fuzzy C-means Clustering. pp.

5109–5127. https://doi.org/10.1007/s11269-015-1107-7. Kisi, O., Zounemat-Kermani, M., 2014. Comparison of two different adaptive neuro-fuzzy inference systems in modelling daily reference evapotranspiration. Water Resour. Manag. 28, 2655–2675. https://doi.org/10.1007/s11269-014-0632-0. Kisi, O., Sanikhani, H., Zounemat-Kermani, M., Niazi, F., 2015. Long-term monthly evapotranspiration modeling by several data-driven methods without climatic data. Comput. Electron. Agric. 115, 66–77. https://doi.org/10.1016/j.compag.2015.04. 015. Kisi, O., Demir, V., Kim, S., 2017. Estimation of long-term monthly temperatures by three different adaptive neuro-fuzzy approaches using geographical inputs. J. Irrig. Drain. Eng. 143, 1–18. https://doi.org/10.1061/(ASCE)IR.1943-4774.0001242. Kizhisseri, A.S., Simmonds, D., Rafiq, Y., Borthwick, M., 2006. An evolutionary computation approach to sediment transport modelling. In: Coastal Dynamics 2005 Proceedings of the Fifth Coastal Dynamics International Conference, https://doi.org/ 10.1061/40855(214)81. Kleinhans, M.G., van Rijn, L.C., 2002. Stochastic prediction of sediment transport in sandgravel bed rivers. J. Hydraul. Eng. 128, 412–425. https://doi.org/10.1061/(ASCE) 0733-9429(2002)128:4(412). Kosko, B., 1993. Fuzzy Thinking: The New Science of Fuzzy Logic. Hyperion, New York. Kumar, B., Jha, A., Deshpande, V., Sreenivasulu, G., 2014. Regression model for sediment transport problems using multi-gene symbolic genetic programming. Comput. Electron. Agric. 103, 82–90. https://doi.org/10.1016/j.compag.2014.02.010. Kumar, D., Pandey, A., Sharma, N., Flügel, W.-A., 2016. Daily suspended sediment simulation using machine learning approach. Catena 138, 77–90. https://doi.org/10. 1016/j.catena.2015.11.013. Li, Y., Yu, X.-P., Lin, M., 2010. Numerical study on suspended sediment transport under waves. Shuidonglixue Yanjiu yu Jinzhan/Chin. J. Hydrodyn. Ser. A 25. https://doi. org/10.3969/j.issn.1000-4874.2010.02.012. Li, X., Chen, H., Jiang, X., Yu, Z., Yao, Q., 2017. Impacts of human activities on nutrient transport in the Yellow River: the role of the water-sediment regulation scheme. Sci. Total Environ. 592, 161–170. https://doi.org/10.1016/j.scitotenv.2017.03.098. Lisle, T.E., 1982. Effects of aggradation and degradation on riffle-pool morphology in natural gravel channels, northwestern California. Water Resour. Res. 18, 1643–1651. Liu, Q.J., Shi, Z.H., Fang, N.F., De Zhu, H., Ai, L., 2013. Modeling the daily suspended sediment concentration in a hyperconcentrated river on the Loess Plateau, China, using the Wavelet-ANN approach. Geomorphology 186, 181–190. https://doi.org/ 10.1016/j.geomorph.2013.01.012. Mackey, B.H., Roering, J.J., McKean, J.A., 2009. Long-term kinematics and sediment flux of an active earthflow, Eel River, California. Geology 37, 803–806. https://doi.org/ 10.1130/G30136A.1. Martinez, J.M., Guyot, J.L., Filizola, N., Sondag, F., 2009. Increase in suspended sediment discharge of the Amazon River assessed by monitoring network and satellite data. Catena 79, 257–264. https://doi.org/10.1016/j.catena.2009.05.011. Mianaei, S.J., Keshavarzi, A.R., 2010. Prediction of riverine suspended sediment discharge using fuzzy logic algorithms, and some implications for estuarine settings. Geo-Mar. Lett. 30, 35–45. https://doi.org/10.1007/s00367-009-0149-3. Misra, D., Oommen, T., Agarwal, A., Mishra, S.K., Thompson, A.M., 2009. Application and analysis of support vector machine based simulation for runoff and sediment yield. Biosyst. Eng. 103, 527–535. https://doi.org/10.1016/j.biosystemseng.2009. 04.017. Moriasi, D.N., Arnold, J.G., Van Liew, M.W., Binger, R.L., Harmel, R.D., Veith, T.L., 2007. Model evaluation guidelines for systematic quantification of accuracy in watershed simulations. Trans. ASABE 50, 885–900. https://doi.org/10.13031/2013.23153. Nagy, H.M., Watanabe, K., Hirano, M., 2002. Prediction of sediment load concentration in rivers using artificial neural network model. J. Hydraul. Eng. 128, 588–595. https:// doi.org/10.1061/(ASCE)0733-9429(2002)128:6(588). Nash, J.E., Sutcliffe, J.V., 1970. River flow forecasting through conceptual models part I-a discussion of principles. J. Hydrol. 10, 282–290. https://doi.org/10.1016/00221694(70)90255-6. Nayak, P.C., Sudheer, K.P., Ramasastri, K.S., 2005. Fuzzy computing based rainfall-runoff model for real time flood forecasting. Hydrol. Process. 19, 955–968. https://doi.org/ 10.1002/hyp.5553. Nolan, K.M., Lisle, T.E., Kelsey, H.M., 1987. Bankfull discharge and sediment transport in northwestern California. In: Eros. Sediment. Pacific Rim (Proceedings Corvallis Symp). 165. pp. 439–449. Orouji, H., Haddad, O., 2013. Modeling of water quality parameters using data-driven models. J. Environ. Eng. 139, 947–957. https://doi.org/10.1061/(ASCE)EE.19437870.0000706. Özger, M., 2009. Comparison of fuzzy inference systems for streamflow prediction. Hydrol. Sci. J. 54, 261–273. https://doi.org/10.1623/hysj.54.2.261. Pan, Y., Chen, J., Zhou, H., Tam, N.F.Y., 2018. Changes in microbial community during removal of BDE-153 in four types of aquatic sediments. Sci. Total Environ. 613–614, 644–652. https://doi.org/10.1016/j.scitotenv.2017.09.130. Partal, T., Cigizoglu, H.K., 2008. Estimation and forecasting of daily suspended sediment data using wavelet-neural networks. J. Hydrol. 358, 317–331. https://doi.org/10. 1016/j.jhydrol.2008.06.013. Piotrowski, A.P., Napiorkowski, J.J., 2011. Optimizing neural networks for river flow forecasting - evolutionary computation methods versus the Levenberg-Marquardt approach. J. Hydrol. 407, 12–27. https://doi.org/10.1016/j.jhydrol.2011.06.019. Rai, R.K., Mathur, B.S., 2008. Event-based sediment yield modeling using artificial neural network. Water Resour. Manag. 22, 423–441. https://doi.org/10.1007/s11269-0079170-3. Rajaee, T., 2011. Wavelet and ANN combination model for prediction of daily suspended sediment load in rivers. Sci. Total Environ. 409, 2917–2928. https://doi.org/10. 1016/j.scitotenv.2010.11.028. Rajaee, T., Mirbagheri, S.A., Zounemat-Kermani, M., Nourani, V., 2009. Daily suspended

22

Catena 174 (2019) 11–23

O. Kisi, Z.M. Yaseen sediment concentration simulation using ANN and neuro-fuzzy models. Sci. Total Environ. 407, 4916–4927. https://doi.org/10.1016/j.scitotenv.2009.05.016. Ribbe, J., Holloway, P.E., 2001. A model of suspended sediment transport by internal tides. Cont. Shelf Res. 21, 395–422. https://doi.org/10.1016/S0278-4343(00) 00081-9. van Rijn, L.C., 1984. Sediment transport, part II: suspended load transport. J. Hydraul. Eng. 110, 1613–1641. https://doi.org/10.1061/(ASCE)0733-9429(1987) 113:9(1187). Ross, T.J., 2010. Fuzzy logic with engineering applications: third edition. In: Fuzzy Logic With Engineering Applications: Third Edition, https://doi.org/10.1002/ 9781119994374. Ruessink, B.G., 2000. An empirical energetics-based formulation for the cross-shore suspended sediment transport by bound infragravity waves. J. Coast. Res. 16, 482–493. Russell, S.J., Norvig, P., 1995. Artificial intelligence: a modern approach. Neurocomputing. https://doi.org/10.1016/0925-2312(95)90020-9. Sanikhani, H., Kisi, O., Nikpour, M.R., Dinpashoh, Y., 2012. Estimation of daily pan evaporation using two different adaptive neuro-fuzzy computing techniques. Water Resour. Manag. 26, 4347–4365. https://doi.org/10.1007/s11269-012-0148-4. Şen, Z., 1998. Fuzzy algorithm for estimation of solar irradiation from sunshine duration. Sol. Energy 63, 39–49. https://doi.org/10.1016/S0038-092X(98)00043-7. Senthil Kumar, A.R., Ojha, C.S.P., Goyal, M.K., Singh, R.D., Swamee, P.K., 2012. Modeling of suspended sediment concentration at Kasol in India using ANN, fuzzy logic, and decision tree algorithms. J. Hydrol. Eng. 17, 394–404. https://doi.org/10.1061/ (ASCE)HE.1943-5584.0000445. Shoorehdeli, M.a., Teshnehlab, M., Sedigh, a.K., 2006. A novel training algorithm in ANFIS structure. In: 2006 Am. Control Conf, https://doi.org/10.1109/ACC.2006. 1657525. Sivakumar, B., Jayawardena, A.W., 2002. An investigation of the presence of low-dimensional chaotic behaviour in the sediment transport phenomenon. Hydrol. Sci. J. 47, 37–41. https://doi.org/10.1080/02626660209492943. Sivakumar, B., Wallender, W.W., 2005. Predictability of river flow and suspended sediment transport in the Mississippi River basin: a non-linear deterministic approach. Earth Surf. Process. Landf. 30, 665–677. https://doi.org/10.1002/esp.1167. Sivanandam, S.N., Deepa, S.N., 2008. Introduction to genetic algorithms. Vasa. https:// doi.org/10.1007/978-3-540-73190-0\_2. Solomatine, D.P., Ostfeld, A., 2008. Data-driven modelling: some past experiences and new approaches. J. Hydroinf. 10, 3. https://doi.org/10.2166/hydro.2008.015. Stott, T., 2006. Impacts of constructing a rural cycle way on suspended sediment transport processes. Catena 68, 16–24. https://doi.org/10.1016/j.catena.2006.04.018. Takagi, T., Sugeno, M., 1985. Fuzzy identification of systems and its applications to modeling and control. In: IEEE Trans. Syst. Man Cybern. SMC-15, pp. 116–132. https://doi.org/10.1109/TSMC.1985.6313399. Taylor, K.E., 2001. Summarizing multiple aspects of model performance in a single diagram. J. Geophys. Res. Atmos. 106, 7183–7192. https://doi.org/10.1029/ 2000JD900719. Tfwala, S.S., Wang, Y.M., 2016. Estimating sediment discharge using sediment rating curves and artificial neural networks in the Shiwen River, Taiwan. Water

(Switzerland) 8. https://doi.org/10.3390/w8020053. Vafakhah, M., 2012. Comparison of cokriging and adaptive neuro-fuzzy inference system models for suspended sediment load forecasting. Arab. J. Geosci. 6, 3003–3018. https://doi.org/10.1007/s12517-012-0550-5. Van Maanen, B., Coco, G., Bryan, K.R., Ruessink, B.G., 2010. The use of artificial neural networks to analyze and predict alongshore sediment transport. Nonlinear Process. Geophys. 17, 395–404. https://doi.org/10.5194/npg-17-395-2010. Vigiak, O., Malagó, A., Bouraoui, F., Grizzetti, B., Weissteiner, C.J., Pastori, M., 2016. Impact of current riparian land on sediment retention in the Danube River Basin. Sustain. Water Qual. Ecol. 8, 30–49. https://doi.org/10.1016/j.swaqe.2016.08.001. Vigiak, O., Malagó, A., Bouraoui, F., Vanmaercke, M., Obreja, F., Poesen, J., Habersack, H., Fehér, J., Grošelj, S., 2017. Modelling sediment fluxes in the Danube River Basin with SWAT. Sci. Total Environ. 599–600, 992–1012. https://doi.org/10.1016/j. scitotenv.2017.04.236. Wang, Q.J., 1991. The genetic algorithm and its application to calibrating conceptual rainfall‐runoff models. Water Resour. Res. 27 (9), 2467–2471. Warrick, J.A., 2014. Eel River margin source-to-sink sediment budgets: revisited. Mar. Geol. 351, 25–37. https://doi.org/10.1016/j.margeo.2014.03.008. Wei, G., 2009. Sediment classification based on least-squares support vector machine and phase-plane analysis. In: 5th International Conference on Natural Computation. 2009. ICNC, pp. 560–564. https://doi.org/10.1109/ICNC.2009.610. Wei, Y., Jiao, J., Zhao, G., Zhao, H., He, Z., Mu, X., 2016. Spatial-temporal variation and periodic change in streamflow and suspended sediment discharge along the mainstream of the Yellow River during 1950–2013. Catena 140, 105–115. https://doi.org/ 10.1016/j.catena.2016.01.016. Willmott, C.J., 1984. On the evaluation of model performance in physical geography. In: Spatial Statistics and Models, pp. 443–446. Willmott, C.J., Robeson, S.M., Matsuura, K., 2012. A refined index of model performance. Int. J. Climatol. 32, 2088–2094. https://doi.org/10.1002/joc.2419. Yaseen, Z.M., El-Shafie, A., Jaafar, O., Afan, H.A., Sayl, K.N., 2015. Artificial intelligence based models for stream-flow forecasting: 2000–2015. J. Hydrol. 530, 829–844. https://doi.org/10.1016/j.jhydrol.2015.10.038. Yaseen, Z.M., Jaafar, O., Deo, R.C., Kisi, O., Adamowski, J., Quilty, J., El-Shafie, A., 2016. Stream-flow forecasting using extreme learning machines: a case study in a semi-arid region in Iraq. J. Hydrol. https://doi.org/10.1016/j.jhydrol.2016.09.035. Yaseen, Z.M., Ebtehaj, I., Bonakdari, H., Deo, R.C., Mehr, A.D., Mohtar, W.H.M.W., Diop, L., El-shafie, A., Singh, V.P., 2017a. Novel approach for streamflow forecasting using a hybrid ANFIS-FFA model. J. Hydrol. 554, 263–276. Yaseen, Z.M., Ghareb, M.I., Ebtehaj, I., Bonakdari, H., Siddique, R., Heddam, S., Yusif, A.A., Deo, R., 2017b. Rainfall pattern forecasting using novel hybrid intelligent model based ANFIS-FFA. Water Resour. Manag. 32 (1), 105–122. Zadeh, L., 1965. Fuzzy sets. Inf. Control. https://doi.org/10.1109/2.53. Zhong, D., Wang, G., Sun, Q., 2011. Transport equation for suspended sediment based on two-fluid model of solid/liquid two-phase flows. J. Hydraul. Eng. 137, 530–542. https://doi.org/10.1061/(ASCE)HY.1943-7900.0000331. Zounemat-Kermani, M., Kişi, Ö., Adamowski, J., Ramezani-Charmahineh, A., 2016. Evaluation of data driven models for river suspended sediment concentration modeling. J. Hydrol. 535, 457–472. https://doi.org/10.1016/j.jhydrol.2016.02.012.

23