Accurate prediction of water dewpoint temperature in natural gas dehydrators using gene expression programming approach

Accurate prediction of water dewpoint temperature in natural gas dehydrators using gene expression programming approach

Accepted Manuscript Accurate prediction of water dewpoint temperature in natural gas dehydrators using Gene Expression Programming approach Alireza R...

732KB Sizes 1 Downloads 117 Views

Accepted Manuscript Accurate prediction of water dewpoint temperature in natural gas dehydrators using Gene Expression Programming approach

Alireza Rostami, Amin Shokrollahi PII: DOI: Reference:

S0167-7322(17)32685-5 doi: 10.1016/j.molliq.2017.08.045 MOLLIQ 7753

To appear in:

Journal of Molecular Liquids

Received date: Revised date: Accepted date:

19 June 2017 8 August 2017 11 August 2017

Please cite this article as: Alireza Rostami, Amin Shokrollahi , Accurate prediction of water dewpoint temperature in natural gas dehydrators using Gene Expression Programming approach, Journal of Molecular Liquids (2017), doi: 10.1016/ j.molliq.2017.08.045

This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

ACCEPTED MANUSCRIPT Accurate Prediction of Water Dewpoint Temperature in Natural Gas Dehydrators Using Gene Expression Programming Approach Alireza Rostamia2, Amin Shokrollahib1 a

Department of Petroleum Engineering, Petroleum University of Technology (PUT), Ahwaz, Iran Young Researchers and Elites Club, North Tehran Branch, Islamic Azad University, Tehran, Iran

PT

b

RI

Abstract

AC C

EP T

ED

MA

NU

SC

Triethylene Glycol (TEG) is one of the most frequently applied liquid desiccants in natural gas dehydration. For this purpose, minimum concentration of TEG is necessary to achieve the outlet gas water dewpoint specification. Several empirical models have been developed to predict the equilibrium established between water vapor and TEG; however, the existing models requires numerous input parameters needing to longer and more complicated calculations. Thus, in present study, the most recent and powerful mathematical strategy known as, Gene Expression Programming (GEP), was applied to develop an easy-to-use and a simple mathematical expression for prognostication equilibrium water dewpoint temperature as a function of contactor temperature and TEG purity in water. For absolute pressures up to 10300 kpa (10.3 MPa), equilibrium water dewpoint temperature is nearly independent of operating pressure. For this, 167 datapoints were collected from the open literature, then the database was divided into the two sets of training (about 80% of the databank) and test (about 20% of the databank). Comprehensive error analysis and diverse graphical illustration were employed to show the performance of the developed empirically derived correlation in this study. The results of the GEP model was also compared with the previously published correlation available in literature, which leads to this consequence that the proposed model in this study is a more efficient approach with higher simplicity than the prementioned literature correlation. GEP-based model gives an acceptable agreement with experimental databank reported in the open literature in terms of determination coefficient (R2) of 0.9820 and Root Mean Square Error (RMSE) of 3.5018. With respect to cumulative frequency analysis, it is understood that about 80% of the whole databank used in this study have estimation error equal or less than 0.2. The results of sensitivity analysis approved that the TEG purity is the most affecting variable on the natural gas dehydration. At last, it can be concluded that the suggested GEP-based empirically-derived correlation is an efficient tool for experts working in gas industry to have a quick check on equilibrium water dewpoint temperature at various TEG concentrations and temperatures.

Keywords: Natural gas dehydration; Triethylene glycol; Equilibrium water dewpoint temperature; Gene expression programming; Correlation; Error analysis.

1

ACCEPTED MANUSCRIPT 1

Introduction

During natural gas transmission and processing, the existence of water vapor has been identified as the main causes of serious operational difficulties when they appear at outlet of subterranean hydrocarbon reservoirs [1]. The hydrate related issues are corrosion, decrease in heating value of natural gas, capacity reduction of transmission lines caused by the presence of free water, and

PT

plugging of lines as a consequence of gas hydrates formation [1]. Therefore, safe operation of

RI

gas industry requires the proper implementation of the so-called technology of dehydration

SC

processes. A number of techniques such as liquid and solid desiccants, can be used for inhibiting gas hydrate formation; however, liquid desiccants are more commonly applied to fulfill the

NU

economic aspect of dehydration process [2].

MA

In the midst of all absorption liquids, glycols are of great significance where 15 to 49 oC depression of dewpoint is desirable [3]. In a countercurrent mass transfer system, Triethylene

ED

Glycols (TEG) have been most commonly applied inside a contactor so that the suitable water content in the gas stream will be achieved [4]. An example of a dehydration system by utilization

EP T

of TEG is illustrated in Fig. 1 [2]. In a countercurrent system to the gas flow, the regenerated TEG solution is pumped to the contactor top tray in order to absorb water vapors. The glycol,

AC C

which is water-rich, is taken out of the absorber column (contactor), then most of the soluble gas will be flashed off by passing over the reflux condenser coil and rich-lean heat exchanger. Afterwards, TEG will flow toward regenerator, in which the absorbed water is removed from glycol by the use of distillation process at nearby atmospheric condition. This lean TEG will also pass through the surge drum, and then will be cooled by going over the lean-rich exchanger and glycol cooler before TEG recirculation to the contactor [5]. At first step of simulating dehydration process, it is highly demanded to establish the minimum TEG concentration for optimizing the process [6]. For this, a number of investigations have been implemented on natural gas processes by TEG dehydration system so as to develop empirically2

ACCEPTED MANUSCRIPT derived and equilibrium correlations for estimating equilibrium water dewpoint temperature [713]. In the main, the proposed correlations by Parrish et al. [7], Rosman [11] and Worley [10] work acceptably and are appropriate for plenty of TEG system designs; even though, along with literature [1], the main drawback of aforementioned published correlations is their limited ability for accurate estimation of equilibrium water vapor concentration above the TEG solution. At

PT

unlimited dilution, equilibrium water vapor concentration was determined by the correlations

RI

extended in the work of Won [13] and Parish et al. [7]. By means of extrapolating data at lower

SC

concentrations the authors presented other correlations in 100% TEG purity. For pressures equal or less than 2000 psi (13.8 MPa), the pressure impact on water-TEG system equilibrium can be

NU

ignored. More recently, Bahadori and Vuthaluru [2] developed an equilibrium polynomial equation for water dewpoint prediction as a function of contactor temperature and TEG purity in

MA

water. They divided the TEG purity in water into three ranges, and reported three sets of tuned coefficients for their proposed correlation. Despite the considerable efforts conducted in

ED

literature for simulation of dehydration process, there is still a large requisite for development of strong models estimating water dewpoint temperature with high accuracy. Afterwards, Ahmadi

EP T

et al. [1] proposed an Artificial Neural Network (ANN) model for predicting water dewpoint temperature in equilibrium condition with respect to the TEG purity in water and contactor

AC C

temperature. Using Particle Swarm Optimization (PSO) and Back-Propagation (BP) algorithms, the authors have trained their smart strategy, and consequently understood that ANN-PSO gives more precise estimates than that of ANN-BP. Lately, two new types of ANN-based numerical schemes termed as Multilayer Perceptron (MLP) and Radial Basis Function (RBF) networks, was established in the work of Tatar et al. [14] with the same databank used by Ahmadi et al. [1]. Then, it was found out the MLP results is more accurate than the RBF technique in prognosticating water formation temperature in dehydration process.

3

ACCEPTED MANUSCRIPT In this work, a new evolutionary algorithm termed as, Gene Expression Programming (GEP), was utilized to construct an empirically derived correlation to predict water dewpoint temperature in equilibrium condition as a function of TEG purity in water and contactor temperature for the first time in literature. Based on the previous researches focusing on this mathematical strategy, GEP scheme has been successfully applied in wide varieties of chemical

PT

and petroleum engineering areas [15-19]. For this, a comprehensive databank [7, 12] was

RI

collected from the open literature for modeling. Then, several statistical quality measures like

SC

Root Mean Square Error (RMSE) and determination coefficient (R2), and graphical illustrations including cross plot, error distribution diagrams and cumulative frequency figure were employed

NU

to show the supremacy of the proposed GEP-based empirically derived correlation. Moreover, the performance of the GEP-based model was also compared with the formerly published

MA

correlation of Bahadori and Vuthaluru [2]. Finally, the sensitivity analysis was implemented to

2

Data Collection

ED

explore the impact of each input variable on the output of the model.

EP T

A bulk of previous investigations concerning on mathematical and numerical modeling approaches, has been proved that a large database including wide ranges of operational

AC C

conditions is necessary for implementing a comprehensive modeling [20-24]. So, a wide-ranging databank was adopted from the existing open literature [7, 12] to develop the GEP-based empirically-derived model in this study. The utilized databank consists of 167 datapoints of equilibrium water dewpoint temperature as a function of TEG purity in water and contactor temperature. Table 1 shows the operational ranges of the used databank as well as their standard deviation and average values. As can be seen, contactor temperature varies in the range of 10 to 75 oC, TEG purity in water from 90 to 99.997%, and equilibrium water dewpoint temperature between -78 to 20 oC.

4

ACCEPTED MANUSCRIPT 3

Gene Expression Programming (GEP)

3.1 Background One of the latest strategies, which has been proposed for creating powerful equations, is known as Gene Expression Programming (GEP). In GEP [25] scheme, the disadvantages of Genetic Programming (GP) [26] and Genetic Algorithm (GA) approaches have been modified, to explore

PT

the best solution for highly nonlinear engineering and scientific problems by applying advanced

RI

regression strategy. In this powerful mathematical scheme, no specific equation format will be

SC

supposed, and it is the major function of computer program in order to determine the optimum relationship by testing all the predefined algebraic operators between the input and output

NU

parameters. The model, which gives the lowest prediction error defined in the program or software will be given to the user [27]. GEP [25] mathematical strategy has two key components

MA

comprising of chromosome and Expression Tree (ET). Similar to biological genes, chromosomes have the role of putting into code the candidate solution, which will be converted to the real

ED

candidate solution (i.e., ET) [17, 28]. It should be noted that the genetic operations in GEP

EP T

strategy are applied on chromosomes, directly. Reproduction of almost always valid ETs obtained in GEP is mainly due to genes’ novel arrangement, reproduction approach, chromosomes structural organization, and translation process into ET [28]. As a results of

AC C

literature studies, the convergence speed and calculation efficiency of GEP is two to four orders of magnitude higher than that of GP technique in order to establish a symbolic equation during regression analysis [29]. Each gene is composed of a head comprising of functions and terminals, and a tail containing terminals [28]. The tail length (t) can be computed as a function of the head length (h) and the largest arity function (n), as follows:

t  h (n 1) 1

(1)

A typical chromosome with two genes composed of four operators (i.e., ×, +, /, terminals a, b, c, is indicated in Fig. 2 [17]. 5

) and three

ACCEPTED MANUSCRIPT 3.2 Correlation Development By implementation of highly complicated GEP [25] computation, an accurate, symbolic and easy-to-use equation will be produced showing the optimum behavior of the studied phenomenon. According to the preceding investigations, the equilibrium water dewpoint temperature (Td) has a functionality of contactor temperature (T) and TEG purity in water (W) [1,

RI

building the GEP-based empirically-derived model, as follows:

PT

2]. Therefore, the abovementioned parameters can be selected as the correlating variables for

(2)

SC

Td  f (T ,W )

To construct a new model for effective estimation of water dewpoint temperature in equilibrium

NU

with TEG, the following procedure is advised [30]:

MA

At first, the program randomly generates chromosomes to initialize a particular number of population individuals. By using the following Objective Function (OF), a fitness value will be

100 N Tdiexp Tdiest N  Tdiexp i

(3)

EP T

OF(i ) 

ED

computed for each population:

in which, the superscripts exp and est, and the symbol N specify, respectively, the real

AC C

equilibrium water dewpoint temperature value, the estimated values by the established GEPbased empirically-derived correlation, and the number of the data points used in modeling [29]. On the basis of these fitness values, subsequent modifications will be applied. Afterwards, the chosen population individuals will be processed by the application of selection environment, confrontation of the genomes expression, and reconstruction with modification. Finally, the procedures mentioned above will be repeated to fulfill the defined criterion for convergence during GEP modeling.

6

ACCEPTED MANUSCRIPT 4

Results and Discussions

A GEP mathematical strategy has been established employing the before mentioned calculation procedure. The total databank has been split into the two group of training (nearly 80% of the entire databank) and test (nearly 20% of the entire databank) for building the model and testing the prediction capability of the model. By the use of a random computer-based selection, the

PT

process of databank division was performed. For this, a lot of databank divisions with different

RI

distributions was implemented, then the division with homogeneous feature was accepted

SC

inhibiting the data local gathering in possible problem area. The utilized technique in this study can determine the effective correlating variables leading to development of sufficiently precise

NU

model with respect to the input parameters (T, W). Hence, the researchers can present a number of independent variables as the input data for a typical problem and achieve the influencing

MA

parameters on the target output parameter. The ultimate correlation obtained in this study is as follows:

A2  A3 

EP T

A1 T 

(4)

1  27.0885513811173 100.033063778086 W

AC C

where

ED

Td  A1  A2  A3

(5)

1.80123526661289 1.23630193779173T W 3.218777155

(6)

1 W ln(ln( )) 37.3855344958176

(7)

In above equations, Td is computed in oC, T in oC, and W in weight percent (%). During optimization process, the number of digits in these equations are determined by conducting simultaneous sensitivity analysis between the estimates of GEP and target values of equilibrium water dewpoint temperature. Table 2 reports the settings of implemented GEP scheme in this 7

ACCEPTED MANUSCRIPT study such as head size, numbers of genes and chromosomes, mutation and inversion coefficients, Insertion Sequence (IS) transposition and Root IS (RIS) transposition, one- and twopoint recombination coefficients, and used algebraic operators.

4.1 Evaluation of the Developed Model

PT

4.1.1 Analytical Analysis For assessing the capability, numerous statistical approaches like statistical quality measures and

RI

graphical analyses are implemented. The main statistical parameters used in this study are

SC

Average Absolute Deviation (AAD), Average Relative Deviation (ARD), Root Mean Square Error (RMSE), Standard Deviation (SD), and determination coefficient (R2). The definitions of

NU

the applied error parameters in this study are indicated in Table 3. The comprehensive error

MA

analysis of the GEP-based empirically-derived model for three ranges of TEG purity in water including 90 to 99%, 99 to 99.9% and 99.9 to 99.997%, are fully studied in Table 4. For the

ED

whole ranges of TEG purity in water, the values of AAD, ARD, AARD, RMSE, and SD, are, respectively, less than 3.0, 0.3, 0.5, 4.0, and 1.1. Moreover, the R2 values are greater than 0.94

EP T

showing the prediction acceptability of the developed model (i.e., Eq. (4)). As shown, the principal parameters of AAD, AARD, ARD, and SD demonstrate that for the highest TEG purity

AC C

range (i.e., 99.9 to 99.997% TEG) the proposed GEP-based correlation gives the best result. A different way of examining Eq. (4) is conducted by detailed analysis of the databank used in this study for various operational conditions, which is indicated in Table 5. In this table, prediction range of equilibrium water dewpoint temperatures are compared with the corresponding experimental range as well as their estimation errors. In nearly all operating conditions, the estimation range of GEP model has a satisfactory agreement with that of experimental range of equilibrium water dewpoint temperatures with AAD<4.5, RMSE<5.5 and R2>0.99.

4.1.2 Graphical Analysis

8

ACCEPTED MANUSCRIPT In addition, some graphical tools like cross plot analysis, relative error distribution plot, and cumulative frequency diagram are used to show the superiority of the model in different manners. Cross plot comparison of GEP estimates with target values of equilibrium dewpoint temperatures for different ranges of TEG purity, are illustrated in Fig. 3. According to this figure, datapoints are mainly accumulated around the unit slope line. This is due to the high

PT

determination coefficient (R2>0.98) sufficiently low estimation error (e.g., AAD<3.0) resulting

RI

in the good fit of predicted outputs with target values. Thus, GEP estimates will give the least

SC

possible deviation from 45o line, which is illustrative of ideal fitness. Simultaneous exhibition of estimated values by Eq. (4) and measured datapoints versus the data index are shown in Fig. 4.

NU

This figure confirms the good coincidence of predicted datapoints with the experimental line. Fig. 5 describes the relative error distribution of GEP model versus experimental equilibrium

MA

dewpoint temperature in subplot Fig. 5(a), versus TEG purity in water in subplot Fig. 5(b), and versus contactor temperature in subplot Fig. 5(c). The higher concentration of the dataset around

ED

the zero horizontal line in this plot is another indication for good performance of Eq. (4) for

processes.

EP T

efficient prediction of equilibrium condition existing between TEG-water system in dehydration

The performance of Eq. (4) is analyzed through the calculating RMSE parameter for different

AC C

ranges of contactor temperature from 10 to 75 oC Fig. 6. It is interesting to see that in all ranges the RMSE value is less than 5. The highest accuracy of GEP-based empirically derived model is obtained when contactor temperature varies from 20 to 30 oC. Calculation of AAD for individual TEG purity in water leads to the plot of Fig. 7. At the worst condition, this diagram demonstrates that the GEP-based model gives the AAD value less than 7. Moreover, the developed correlation is incredibly efficient at TEG purities of 99.9, 99.95, 99.97, and 99.98%. For various TEG concentrations, equilibrium water dewpoint temperature against contactor temperature is displayed in Fig. 8. An increase in contactor temperature leads to the incremental growth of 9

ACCEPTED MANUSCRIPT dewpoint temperature; however, the water dewpoint will be decreased by increasing TEG purity. In all TEG purities, GEP prediction lines show satisfactory agreement with the measured datapoint. This trend analysis conducted in Fig. 8 can give the readerships a good view of the GEP effectiveness for predicting the water-TEG equilibrium in this study. Fig. 9 represents the cumulative frequency of the databank versus the absolute relative error for

PT

different TEG purity ranges. For example, about 80% of GEP estimates have estimation error

RI

equal or less than 0.2 for the total TEG purity range. This figure also shows that increasing TEG

SC

purity leads to the higher cumulative frequency. The more cumulative frequency means that higher portion of the databank has estimation errors equal or less than a typical value verifying

NU

the greater model capability in accurate output prediction.

MA

4.2 Comparison with Previously Developed Correlation The performance of the GEP-based empirically-derived correlation is compared with the recently

ED

published correlation of Bahadori and Vuthaluru [2] in Fig. 10. For the total range of TEG purity, Eq. (4) gives R2>0.98, although the R2 value for Bahadori and Vuthaluru [2] is less than

EP T

0.53. It means that correlation of Bahadori and Vuthaluru [2] is much less accurate than the proposed Eq. (4) in this study. Moreover, Bahadori and Vuthaluru [2] divided the TEG purity

AC C

into the three ranges and reported three sets of tuned coefficient; however, a unique GEP-based empirically-derived model is proposed for the entire range of TEG purity in this study. Clearly, the GEP model in this study is much simpler than the that of Bahadori and Vuthaluru [2] with less number of constant values in its equation.

4.3 Sensitivity Analysis In this section, the so-called sensitivity analysis is executed to find the quantitative impact of TEG purity on water and contactor temperature on the equilibrium water dewpoint temperature. The relative dependency value for each variable is computed by equation of correlation

10

ACCEPTED MANUSCRIPT coefficient (R) inserted in Table 3 and known as Pearson technique [31]. The results of sensitivity analysis are illustrated in Fig. 11. As can be seen, TEG purity in water has a negative effect and contactor temperature has a positive impact on the trend of equilibrium water dewpoint temperature. Totally, the equilibrium water dewpoint temperature has more dependency to TEG purity in water because of its higher relative impact value calculated by

Conclusions

RI

5

PT

Pearson technique [31].

SC

In this modeling study, an empirically-derived correlation is developed by means of state-of-theart mathematical strategy known as, Gene Expression Programming (GEP), for straightforward

NU

determination of equilibrium water dewpoint temperature of 167 datapoints at various contactor

MA

temperature (10 to 75 oC) and TEG purity in water (90 to 99.997%) as correlating variables, which were undertaken from the open literature. About 80% and 20% of the whole database

ED

were assigned, respectively, for developing and testing the model. The most commonly used statistical quality measures like Average Absolute Deviation (AAD), Root Mean Square Error

EP T

(RMSE) and Standard Deviation (SD), as well as, graphical tools like cross plot comparison and relative error distribution plot, were utilized in order to conduct a comprehensive error analysis

AC C

in this study. The relative impact of each variable is detected quantitatively by applying Pearson technique [31]. As a result, it is found out the proposed GEP-based empirically-derived correlation is simpler and more accurate than the published correlation by Bahadori and Vuthaluru [2] with lower numbers of tuning parameters for the whole range of TEG purity in water. The GEP model gives satisfactory agreement with the target values with determination coefficient (R2) of 0.9820 and Root Mean Square Error (RMSE) of 3.5018. Moreover, it is demonstrated that the equilibrium water dewpoint temperature has more dependency on the TEG purity than the contactor temperature. To this end, the developed tool in this study can be of immense practical value for reliable and rapid estimate of equilibrium water dewpoint 11

ACCEPTED MANUSCRIPT temperature accurately for engineers dealing with natural gas dehydration in academic researches

AC C

EP T

ED

MA

NU

SC

RI

PT

and industrial processes.

12

ACCEPTED MANUSCRIPT References

AC C

EP T

ED

MA

NU

SC

RI

PT

[1] M.A. Ahmadi, R. Soleimani, A. Bahadori, A computational intelligence scheme for prediction equilibrium water dew point of natural gas in TEG dehydration systems, Fuel, 137 (2014) 145-154. [2] A. Bahadori, H.B. Vuthaluru, Rapid estimation of equilibrium water dew point of natural gas in TEG dehydration systems, Journal of Natural Gas Science and Engineering, 1 (2009) 68-71. [3] G. Processors, Suppliers Association (GPSA), Gas Processors and Suppliers Association Engineering Data Book, DOI (2004). [4] A. Bahadori, New model predicts solubility in glycols, Oil & gas journal, 105 (2007) 50None. [5] A. Bahadori, New model calculates solubility of light alkanes in triethylene gycol, Petroleum Chemistry, 49 (2009) 171-179. [6] A. Bahadori, Y. Hajizadeh, H. Vuthaluru, M. Tade, S. Mokhatab, Novel approaches for the prediction of density of glycol solutions, Journal of Natural Gas Chemistry, 17 (2008) 298-302. [7] W. Parish, K. Won, M. Baltau, Phase behaviour of the triethylene glycol-water system and dehydration/regeneration design for extremely low dew point requirement, 65 th Annual Convention of the GPSA, TX, 1986. [8] F. Townsend, Vapor-Liquid Equilibrium Data for Diethylene-and Triethylene Glycol-Water, Natural Gas Systems, Proc. Gas Hydrate Control Conference”, LS Reid, ed., Norman, OK, 1953, pp. 57-77. [9] F.R. Scauzillo, Equilibrium ratios of water in the water-triethylene glycol-natural gas system, Journal of Petroleum Technology, 13 (1961) 697-702. [10] M. Worley, Super Dehydration with Glycols, Proceedings of the Gas Conditioning Conference, The University of Oklahoma, Norman, 1967. [11] A. Rosman, Water equilibrium in the dehydration of natural gas with triethylene glycol, Society of Petroleum Engineers Journal, 13 (1973) 297-306. [12] M. Herskowitz, M. Gottlieb, Vapor-liquid equilibrium in aqueous solutions of various glycols and polyethylene glycols. 1. Triethylene glycol, Journal of Chemical and Engineering Data, 29 (1984) 173-175. [13] K. Won, F. Daniel, Thermodynamic Basis of the Glycol Dew-Point Chart and Its Application to Dehydration, PROCEEDINGS OF THE ANNUAL CONVENTION-GAS PROCESSORS ASSOCIATION, GAS PROCESSORS ASSOCIATION, 1994, pp. 108-108. [14] T. Afshin, B.-H. Ali, M. Hossein, N. Saeid, B. Meysam, L. Moonyong, B. Alireza, N.-M. Adel, Prediction of water formation temperature in natural gas dehydrators using radial basis function (RBF) neural networks, Natural Gas Industry B, 3 (2016) 173-118. [15] A. Rostami, H. Ebadi, M. Arabloo, M.K. Meybodi, A. Bahadori, Toward genetic programming (GP) approach for estimation of hydrocarbon/water interfacial tension, Journal of Molecular Liquids, 230 (2017) 175-189. [16] A. Kamari, M. Sattari, A.H. Mohammadi, D. Ramjugernath, Rapid method for the estimation of dew point pressures in gas condensate reservoirs, Journal of the Taiwan Institute of Chemical Engineers, 60 (2016) 258-266. [17] A. Kamari, M. Arabloo, A. Shokrollahi, F. Gharagheizi, A.H. Mohammadi, Rapid method to estimate the minimum miscibility pressure (MMP) in live reservoir oil systems during CO2 flooding, Fuel, 153 (2015) 310-319. [18] F. Gharagheizi, A. Eslamimanesh, M. Sattari, A.H. Mohammadi, D. Richon, Development of corresponding states model for estimation of the surface tension of chemical compounds, AIChE Journal, 59 (2013) 613-621.

13

ACCEPTED MANUSCRIPT

AC C

EP T

ED

MA

NU

SC

RI

PT

[19] F. Gharagheizi, P. Ilani-Kashkouli, N. Farahani, A.H. Mohammadi, Gene expression programming strategy for estimation of flash point temperature of non-electrolyte organic compounds, Fluid Phase Equilibria, 329 (2012) 71-77. [20] A. Shokrollahi, M. Arabloo, F. Gharagheizi, A.H. Mohammadi, Intelligent model for prediction of CO2 – Reservoir oil minimum miscibility pressure, Fuel, 112 (2013) 375-384. [21] M. Mesbah, E. Soroush, A. Shokrollahi, A. Bahadori, Prediction of phase equilibrium of CO2/cyclic compound binary mixtures using a rigorous modeling approach, The Journal of Supercritical Fluids, 90 (2014) 110-125. [22] M. Bahari, A. Rostami, E. Joonaki, M. Ali, Investigation of a Novel Technique for Decline Curve Analysis in Comparison with the Conventional Models, International Journal of Computer Applications, 99 (2014) 1-11. [23] A. Rostami, M.A. Anbaz, H.R.E. Gahrooei, M. Arabloo, A. Bahadori, Accurate estimation of CO 2 adsorption on activated carbon with multi-layer feed-forward neural network (MLFNN) algorithm, Egyptian Journal of Petroleum, DOI (2017) 621-1110. [24] A. Rostami, M. Masoudi, A. Ghaderi-Ardakani, M. Arabloo, M. Amani, Effective Thermal Conductivity Modeling of Sandstones: SVM Framework Analysis, International Journal of Thermophysics, 37 (2016) 1-15. [25] C. Ferreira, Gene expression programming: a new adaptive algorithm for solving problems, Complex Systems, 13 (2001) 87-129. [26] J.R. Koza, Genetic programming: on the programming of computers by means of natural selection, MIT press, Cambridge, 1992. [27] A. Rostami, M. Arabloo, H. Ebadi, Genetic programming (GP) approach for prediction of supercritical CO 2 thermal conductivity, Chemical Engineering Research and Design, 122 (2017) 164-175. [28] L. Teodorescu, D. Sherwood, High energy physics event selection with gene expression programming, Computer Physics Communications, 178 (2008) 409-419. [29] C. Ferreira, Gene Expression Programming: Mathematical Modeling by an Artificial Intelligence 2nd Edition ed., Springer-Verlag New York, Inc.2006. [30] C. Ferreira, Gene expression programming: mathematical modeling by an artificial intelligence, Springer2006. [31] N.S. Chok, Pearson's Versus Spearman's and Kendall's Correlation Coefficients for Continuous Data, Graduate School of Public Health, University of Pittsburgh, 2010.

14

ACCEPTED MANUSCRIPT Table Captions Table 1. Specification of the databank applied for modeling Table 2. Optimized parameters of the proposed GEP model Table 3. Definitions of statistical quality measures applied for error analysis conducted in this

PT

study

RI

Table 4. Comprehensive error investigation of the proposed GEP tool estimating equilibrium

SC

water dewpoint temperature for different ranges of TEG purity in water

NU

Table 5. The performance evaluation of the developed GEP model for prediction of the

AC C

EP T

ED

MA

equilibrium water dewpoint temperature in different operational conditions

15

ACCEPTED MANUSCRIPT

Symbol

Unit

Min.

Avg.

Max.

STDEVa

TEG purity in water

W

%

90

98.983

99.997

2.128

Contactor temperature

T

o

C

10

39.174

75

19.220

Equilibrium water dewpoint temperature

Td

o

C

-78

-32.531

25.660

20

STDEV refers to the standard deviation, which can be calculated as follows:





SC

a

RI

Parameter

PT

Table 1

1

NU

2 2  1 N STDEV   xi x    N 1 i 1 

AC C

EP T

ED

MA

where, x is a typical data sample and N shows the number of dataset.

16

ACCEPTED MANUSCRIPT

Number of chromosomes

3

Head size

7

Number of genes

3

Linking function

+

Generations without change

2000

Fitness function

RMSE

Mutation

0.00138

Inversion

0.00546

IS transposition

0.00546

RIS transposition

0.00546

One-point recombination

0.00277

NU

Two-point recombination

0.00277 0.00277

MA

Gene recombination Gene transposition

0.00277

Constants per gene

10

+, -, *, /, EXP, X2, INV, LN

AC C

EP T

ED

Operators used:

RI

Value

SC

GEP algorithm parameters

PT

Table 2

17

ACCEPTED MANUSCRIPT Table 3 Error Minimum deviation Maximum deviation

Formula D min  min(Td iexp Td est i ) max exp D  max(Td i Td est i )

Average deviation

AD 

T N

1 N

i 1 N

1 N 1 ARD  N

AAD 

Average absolute deviation

exp di

T

exp di

i 1





Td est i

Td iexp Td est  i    exp Td i i 1   exp est 1 N  T T AARD   d i exp d i N i 1  Td i

1 N

PT

T N

SC

Average absolute relative deviation

N

RI

Average relative deviation

MSE 

Mean square error

i 1

NU

1 RMSE   N

exp di

exp di

i 1



2

Td est i

T N

  

1 2 est 2 di

 

T

1

 1 N T exp T est 2 2 di di SD       exp  N 1 i 1  Td i    SD CV  

MA

Root mean square error

Standard deviation

 T N

R

EP T

ED

Coefficient of variation

Correlation coefficient

Td est i

  T  .T N

 T 

 Td iexp Td . Td esti Td i 1 N

exp di

i

i 1 2 N

est di

d

i





2

d



2



2

N  N  exp est   Td i Td    Td i Td    i 1  R 2   i 1

AC C

Determination coefficient

T N i

18

exp di

  2 N

Td . Td est i Td i



2

ACCEPTED MANUSCRIPT Table 4 For 90 to 99% TEG

For 99 to 99.9% TEG

For 99.9 to 99.997% TEG

N

46

40

81

Td min

-30

-52.5

-78

Td

-0.8646

-25.1537

-52.4928

Td max

20

4.5

-22.5

D min

-6.6163

0.6662

-10.0929

D max

5.5941

15.8997

AD

-0.4700

2.8834

AAD

2.8943

2.8834

ARD

0.2187

0.0348

AARD

0.4927

MSE

11.5358

RMSE

3.3964

SD

0.8485

CV R

MA

RI

NU

SC

2.5917

-1.5771 2.5544 0.0334 0.0508

13.7290

12.1595

ED

0.3957

3.4870

1.0592

0.0697

-0.9814

-0.0421

-0.0013

0.9733

0.9893

0.9749

0.9788

0.9504

EP T

3.7053

AC C

R2

PT

Parameter

0.9473

19

ACCEPTED MANUSCRIPT Table 5 W, %

T, oC

Td, exp, o C

Td, est, oC

AAD

RMSE

R2

6

90

10–37

-6–20

-0.2300– 16.3996

2.7587

3.2646

0.9993

7

95

10–45

-12–19

-5.383– 16.1732

3.3242

3.9972

0.9992

10

97

10–55

-18–19.5

-11.5316– 16.1843

3.2135

3.6246

0.9987

11

98

10–60

-22 – 17.5

-17.4481– 13.3474

2.5967

12

99

10–70

-30–14.5

-28.2991– 8.6555

10

99.5

10–55

-37.5– -6 -38.1662– -10.4503

14

99.8

10–75

-46.5– -5.5

14

99.9

10–75

-52.5– -14

14

99.95

10–75

14

99.97

10–75

14

99.98

14

99.99

13 Overall

RI

2.9004

2.8083

3.0910

0.9994

2.8372

2.8955

0.9991

-54.7781– -14.7440

1.7396

1.8017

0.9992

-59– -22.5

-61.0462– -21.0121

1.0498

1.2260

0.9988

-63– -28

-65.5917– -25.5576

1.4213

1.6585

0.9994

10–75

-66.5– -32.5

-68.9505– -28.9164

1.7427

1.9820

0.9991

10–75

-72– -39.5

-73.7016– -33.6675

2.4696

3.0071

0.9993

NU

SC

0.9997

-48.3534– -8.3202

MA

ED

EP T

2.6420

0.9993

3.3046

AC C

14

PT

N

99.995

10–75

-77– -44

-76.9412– -36.9071

4.3544

5.3053

0.9984

99.997

10–70

-78– -51.5

-78.4742– -41.5196

4.4222

5.3640

0.9978

90– 99.997

10–75

-78–20

-78.4742– 16.3995

2.7786

3.5018

0.9820

20

ACCEPTED MANUSCRIPT Figure Captions Fig. 1. Typical TEG-Natural Gas Dehydration System [2] Fig. 2. A typical two-gene chromosome with its corresponding mathematical expression [17] Fig 3. Comparison of GEP estimates with experimental values of equilibrium water dewpoint

PT

temperature

RI

Fig. 4. Experimental and GEP predicted values of equilibrium water dewpoint temperature

SC

versus data index

NU

Fig. 5. Distribution of relative deviation for GEP model predictions when relative deviation is drawn versus various parameters of: (a) Experimental equilibrium water dewpoint temperature,

MA

(b) TEG purity in water, and (c) Contactor temperature

Fig. 6. Root mean square error (RMSE) of GEP model for various contactor temperature ranges

ED

Fig. 7. Average absolute deviation (AAD) of GEP model for various range of TEG purity in

EP T

water

Fig. 8. Comparison of the experimental, GEP prediction values of equilibrium water dewpoint

AC C

temperature versus contactor temperature for TEG purity in water of 98 to 99.98% Fig. 9. Comparison of the determination coefficent (R2) of the developed GEP model with the previously published correlation of Bahadori and Vuthaluru in diverse ranges of TEG purity in water Fig. 10. Cumulative frequency versus the absolute relative deviation for the developed GEP tool in this study for different operational conditions Fig. 11. Relative dependency of equilibrium water dewpoint temperature to contactor temperature and TEG purity in water 21

MA

NU

SC

RI

PT

ACCEPTED MANUSCRIPT

AC C

EP T

ED

Fig. 1. Typical TEG-Natural Gas Dehydration System [2]

22

RI

PT

ACCEPTED MANUSCRIPT

AC C

EP T

ED

MA

NU

SC

Fig. 2. A typical two-gene chromosome with its corresponding mathematical expression [17]

23

ACCEPTED MANUSCRIPT

20

PT

-10

-40

Unit slope line

RI

GEP for 90 to 99% TEG -70

GEP for 99 to 99.9% TEG

SC

Estimated water dewpoint temperature, °C

50

GEP for 99.9 to 99.997% TEG

-100

-70

-40

-10

NU

-100

20

50

Experimental water dewpoint temperature, °C

AC C

EP T

ED

MA

Fig 3. Comparison of GEP estimates with experimental values of equilibrium water dewpoint temperature

24

ACCEPTED MANUSCRIPT

20

Experimental data

GEP for 90 to 99% TEG

GEP for 99 to 99.9% TEG

GEP for 99.9 to 99.997% TEG

0

PT

-20

-40

RI

-60

-80 0

20

40

60

80

SC

Equilibrium water dewpoint temperature, °C

40

100

120

140

160

180

NU

Data index

AC C

EP T

ED

MA

Fig. 4. Experimental and GEP predicted values of equilibrium water dewpoint temperature versus data index

25

ACCEPTED MANUSCRIPT

(a) Zero line GEP for 90 to 99% TEG GEP for 99 to 99.9% TEG GEP for 99.9 to 99.997% TEG

3

4

2 1 0 -1

3 2 1 0 -1

-2

-2

-3

-3

-4

-4 -100

-70

-40

-10

20

90 91 92 93 94 95 96 97 98 99 100

50

TEG purity in water, %

SC

Experimental water dewpoint temperature, °C

(c)

NU

6

Zero line GEP for 90 to 99% TEG GEP for 99 to 99.9% TEG GEP for 99.9 to 99.997% TEG

5

MA

4 3 2 1 0

ED

Relative deviation

PT

4

Zero line GEP for 90 to 99% TEG GEP for 99 to 99.9% TEG GEP for 99.9 to 99.997% TEG

5

Relative deviation

5

Relative deviation

(b) 6

RI

6

-1 -2

EP T

-3 -4

0

10

20

30

40

50

60

70

80

Contactor temperature, °C

AC C

Fig. 5. Distribution of relative deviation for GEP model predictions when relative deviation is drawn versus various parameters of: (a) Experimental equilibrium water dewpoint temperature, (b) TEG purity in water, and (c) Contactor temperature

26

ACCEPTED MANUSCRIPT

5 4.5

3.5

PT

3 2.5

RI

2 1.5

SC

Root mean square error (RMSE)

4

1

NU

0.5 0 10.0-20.0

20.0-30.0

30.0-40.0

40.0-50.0

50.0-60.0

60.0-75.0

MA

Contactor temperature, °C

AC C

EP T

ED

Fig. 6. Root mean square error (RMSE) of GEP model for various contactor temperature ranges

27

ACCEPTED MANUSCRIPT

6 5

PT

4

RI

3 2

SC

Average absolute deviation (AAD)

7

1

NU

0

MA

TEG purity in water, %

AC C

EP T

ED

Fig. 7. Average absolute deviation (AAD) of GEP model for various range of TEG purity in water

28

ACCEPTED MANUSCRIPT

GEP, TEG=99.98% GEP, TEG=99.9% GEP, TEG=98% Data, TEG=99.95% Data, TEG=99%

GEP, TEG=99.95% GEP, TEG=99% Data, TEG=99.97% Data, TEG=99.8%

10

RI

PT

-10

SC

-30

-50

-70 10

20

30

40

MA

0

NU

Equilibrium water dewpoint temperature,°C

30

GEP, TEG=99.97% GEP, TEG=99.8% Data, TEG=99.98% Data, TEG=99.9% Data, TEG=98%

50

60

70

80

Contactor temperature,°C

AC C

EP T

ED

Fig. 8. Comparison of the experimental, GEP prediction values of equilibrium water dewpoint temperature versus contactor temperature for TEG purity in water of 98 to 99.98%

29

ACCEPTED MANUSCRIPT

1 0.9 0.8

PT

0.6

RI

0.5 0.4

SC

Cumulative frequency

0.7

Total

0.3

NU

For TEG=90 to 99%

0.2

MA

0.1 0 0

0.1

0.2

0.3

For TEG=99 to 99.9% For TEG=99.9 to 99.997%

0.4

0.5

ED

Absolute relative deviation

AC C

EP T

Fig. 9. Cumulative frequency versus the absolute relative deviation for the developed GEP tool in this study for different operational conditions

30

ACCEPTED MANUSCRIPT GEP

Bahadori and Vuthaluru

1

0.8 0.7

PT

0.6 0.5 0.4

RI

Determination coefficient (R2)

0.9

0.3

SC

0.2

0 90-99

99-99.9

NU

0.1

99.9-99.997

Overall

MA

TEG purity in water, %

AC C

EP T

ED

Fig. 10. Comparison of the determination coefficent (R2) of the developed GEP model with the previously published correlation of Bahadori and Vuthaluru in diverse ranges of TEG purity in water

31

ACCEPTED MANUSCRIPT 1 TEG purity in water 0.8

Contactor temperature

0.4

PT

0.2 0 -0.2

RI

Relative dependency factor

0.6

SC

-0.4

-0.8

NU

-0.6

99-99.9%

99.9-99.997%

Overall range

-0.504777569

-0.309224616

-0.452625726

-0.66078668

Contactor temperature

0.610834711

0.855966602

0.895686359

0.258201195

MA

90-99%

TEG purity in water

ED

Range of TEG purity in water, %

AC C

EP T

Fig. 11. Relative dependency of equilibrium water dewpoint temperature to contactor temperature and TEG purity in water

32

ACCEPTED MANUSCRIPT Research Highlights 

A new efficient correlation based on GEP algorithm for modeling the natural gas dehydration process has been introduced



For assessing the capability of developed correlation statistical quality measurement and graphical analyses are implemented The relative impact of each variable is detected quantitatively by applying Pearson

EP T

ED

MA

NU

SC

RI

PT

technique

AC C



33