Neural network modeling based double-population chaotic accelerated particle swarm optimization and diffusion theory for solubility prediction

Neural network modeling based double-population chaotic accelerated particle swarm optimization and diffusion theory for solubility prediction

Journal Pre-proof Neural network modeling based double-population chaotic accelerated particle swarm optimization and diffusion theory for solubility ...

4MB Sizes 0 Downloads 14 Views

Journal Pre-proof Neural network modeling based double-population chaotic accelerated particle swarm optimization and diffusion theory for solubility prediction Mengshan Li, Suyun Lian, Fan Wang, Yanying Zhou, Bingsheng Chen, Lixin Guan, Yan Wu

PII:

S0263-8762(20)30004-6

DOI:

https://doi.org/10.1016/j.cherd.2020.01.003

Reference:

CHERD 3953

To appear in:

Chemical Engineering Research and Design

Received Date:

29 July 2019

Revised Date:

2 December 2019

Accepted Date:

2 January 2020

Please cite this article as: Li M, Lian S, Wang F, Zhou Y, Chen B, Guan L, Wu Y, Neural network modeling based double-population chaotic accelerated particle swarm optimization and diffusion theory for solubility prediction, Chemical Engineering Research and Design (2020), doi: https://doi.org/10.1016/j.cherd.2020.01.003

This is a PDF file of an article that has undergone enhancements after acceptance, such as the addition of a cover page and metadata, and formatting for readability, but it is not yet the definitive version of record. This version will undergo additional copyediting, typesetting and review before it is published in its final form, but we are providing this version to give early visibility of the article. Please note that, during the production process, errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain. © 2020 Published by Elsevier.

Neural Network modeling based double-population chaotic accelerated particle swarm optimization and diffusion theory for

of

Solubility prediction

ro

Mengshan LI*, Suyun LIAN, Fan WANG, Yanying ZHOU, Bingsheng CHEN, Lixin GUAN, Yan WU

-p

College of Physics and Electronic Information, Gannan Normal University, Ganzhou, 341000, Jiangxi,

lP

re

CHINA;

ur na

Correspondence to: Mengshan LI (E-mail: [email protected])

Jo

Graphical abstract

1

of ro -p re lP

Highlights

A novel method is proposed for solubility prediction.



An improved PSO algorithm based on the diffusion theory is proposed.



RBF ANN trained by improved APSO is proposed.



Proposed method has a broad application perspective

Jo

ur na



ABSTRACT

Solubility is as a key chemical and physical property. Solubility prediction methods are applied in diverse fields including preparation synthesis and modifications of 2

materials. To overcome the shortcomings of existing solubility prediction methods, taking the mass transfer of two-phase system as an example, a solubility prediction model based on the diffusion theory and hybrid artificial intelligence method was proposed in this paper. An improved double-population chaotic

of

accelerated particle swarm optimization (APSO) algorithm combined diffusion theory was developed according to the particle evolution utilizing diffusion energy.

ro

The developed algorithm was applied in the training of parameters of the radial

-p

basis function artificial neural network and then a model for predicting solubility

re

was developed. The experimental results of supercritical carbon dioxide solubility in 8 polymers were consistent with the predicted values by the model, indicating

lP

the high prediction accuracy. The average relative deviation, squared correlation

ur na

coefficient, and root mean square error were respectively 0.0036, 0.9970, and 0.0152, displaying its higher comprehensive performance. The model may also be

Jo

applied in other physicochemical fields.

ABBREVIATIONS

PVT

Pressure, Volume, Temperature

ANN

Artificial Neural Network 3

RBF

Radial Basis Function

PSO

Particle Swarm Optimization

CAPSO

Chaotic accelerated PSO

Poly(butylene succinate).

PBSA

Poly(butylene succinate-co-adipate).

PS

Polystyrene.

re

Polypropylene.

lP

PP

-p

PBS

Carboxylated polyesters

PLLA

Poly(l-lactide)

Poly(D,l-lactide-co-glycolide) High-density polyethylene

Jo

HDPE

ur na

CPEs

PLGA

ARD R2

MSE

ro

of

DP-DT-CAPSO Double population CAPSO based on diffusion theory

Average Relative Deviation

Squared Correlation Coefficient Mean Square Error 4

RMSEP

Root Mean Square Error of Prediction

Keywords: Neural Network; double-population; particle swarm optimization;

ro

of

diffusion theory; Solubility prediction

1. Introduction

-p

Solubility of supercritical carbon dioxide (SCCO2) in polymeric compounds is

re

important for the modifications, synthesis, and preparation of new materials[1-6].

lP

Under high-temperature and high-pressure supercritical conditions, a solubility experiment is time-consuming and costly and it is not easy to obtain experimental

ur na

data. Therefore, it is necessary to design a prediction model with high accuracy. The solubility of SCCO2 in polymers is affected by polarity of molecules, density, temperature, and pressure. These factors show the complicated non-linear

Jo

associations with the solubility and the relationships among these factors are also complex. Therefore, traditional prediction methods based on thermodynamic equation of state or other empirical equations cannot provide the satisfactory prediction accuracy[7-12]. Ziaee et al.[13] predicted carbon dioxide solubility in different polymers with support vector machine and indicated that the proposed 5

models were efficient. Artificial neural network (ANN) possesses the ability of self-organizing, non-linear processing, and fault tolerance and can overcome the challenges of prediction[14-23]. Bakhbakhi et al.[24] and Lashkarbolooki et al.[25] compared the performances of ANN and the thermodynamic equation of

of

state in solubility prediction and showed that the ANN method possessed the higher prediction accuracy than the thermodynamic equation of state.

ro

Gharagheizi [26] showed that the ANN model had a higher prediction accuracy of

-p

the SCCO2 solubility in compounds. Similarly, through SCCO2 solubility prediction

re

experiments, Eslamimanesh et al.[27] demonstrated a high performance of ANN. Pahlavanzadeh et al.[28] predicted the solubility based on ANN model and

lP

Deshmukh-Mather method and demonstrated that ANN model was more

ur na

effective than conventional thermodynamic equations. The accuracy and reliability of ANN are dependent on the adopted training algorithm and the essence of training is to optimize the model structure and

Jo

variables. Several algorithms have been developed to optimize ANN training, including ant colony algorithm[29, 30], particle swarm optimization algorithm (PSO)[31-33], simulated annealing algorithm, and genetic algorithm[34]. With the online strategy and PSO algorithm, Liu et al.[35] trained the fuzzy neural network 6

and made a successful prediction of the melt flow rate (MFR). Lazzus et al.[36] used PSO algorithm to predict the phase equilibrium data of SCCO2 and achieved the better prediction results. Khajeh et al.[37] proposed the solubility prediction method of ANN and ANFIS and realized the higher prediction accuracy compared combined Kent-Eisenberg model with

of

to traditional methods. Hussain et al.[38]

ANN, proposed a hybrid neural network solution model, and achieved the better

ro

performance in terms of prediction. LI et al. [39-45] combined the chaos theory

-p

with the PSO and clustering methods, proposed several solubility prediction

re

models through improving ANN training algorithm, and achieved the higher

lP

prediction accuracy.

Previous studies mainly focused on improving the model itself without

ur na

considering the solubility element. Under static conditions, the solution process is basically a diffusion process in which solute molecules are adsorbed onto the surface. This process is driven by thermal movement of molecules and dependent

Jo

on the gradient of temperature or density between two media. Solute molecules migrate from the denser medium to the less dense medium continuously to produce an equilibrium at which the movement of solute molecules stops. In a two-phase system of polymer/SCCO2, at the interfacial film layer, due to interface 7

force, CO2 molecules attached to the interface spread to the interfacial film and are dissolved in the melted polymer. PSO algorithm may be used to analyze the solution process. Firstly, particles in PSO algorithm resemble those in solution and the similarity can be simulated by assigning the associated properties of solute

of

molecules in solution to the particles of the algorithm. Secondly, the solution process is a mass transfer process caused by the transport of material and it is

ro

closely related to the diffusion theory. Based on the diffusion theory and solubility

-p

and particle evolution algorithms, in order to simulate the molecular diffusion

re

process, we explored the movement of CO2 in the diffusion process and simulated the diffusion process based on the movement of evolution particles. Currently,

lP

the PSO algorithm was seldom used to analyze the migration of particles,

ur na

particularly in thermodynamics. Therefore, in this investigation, the diffusion theory was introduced into the PSO algorithm to improve it. The solution process of SCCO2 in polymer was simulated with the PSO algorithm and then an improved

Jo

PSO algorithm was proposed combined the diffusion theory and then applied in ANN training. Finally, a hybrid artificial intelligence prediction model of SCCO2 solubility in polymers was obtained.

8

2. Model theory 2.1. Standard PSO algorithm In 1995, Eberhart and Kennedy described the particle swarm optimization (PSO) as an evolution algorithm[46]. The movement of the particle is illustrated by the

k

k

k

k

(1)

k

v i, d =  v i, d + c 1 (p i, d - x i, d ) + c 2 (p g, d - x i, d )

k +1 i, d

k

k +1

i, d

i, d

= x +v

(2)

-p

x

ro

k +1

of

following formulas:

k

p i,d

and

k

p g,d

and

k

x i,d

are respectively the velocity and position of

are the position of the individual and global extrema.

ur na

particle i;

k

v i,d

lP

are learning factors;

re

where i = 1, …… , m (m indicates particle number);  is inertia weight; C1 and C2

2.2. CAPSO Algorithm

Jo

In recent years, an improved PSO variant with the extremely high global convergence performance, accelerated particle swarm optimization (APSO), has been widely concerned among scholars. In the APSO algorithm, inertial weight factors or cognitive factors are not considered and global exploration factors are utilized to improve the algorithm. The algorithm considers the global search 9

exploration factors for particle update. In the entire search process, particles are only restrained by global extremum, so the search velocity is accelerated. Position update formula is provided as: k +1

k

k

x i, d = (1 - C 2 )x i, d + C 2 pg, d

(3)

 C1 r

of

where C1r is a random number and can allow the algorithm to escape from the

ro

local optimum.

-p

Compared to the standard PSO algorithm, APSO adopts two variables C1 and C2 to

re

reduce randomness during iteration. C1 is expressed as a monotonically decreasing function: C1   t , 0    1 . Thus, in the APSO algorithm, C2 is the key

lP

determinant of the algorithm performance and its range is [0.2, 0.7]. When C2 is 1,

ur na

the particles can converge to the current global extreme at any time and show no change and this global extremum barely represents the actual global extremum. Conversely, when C2 is 0, the search velocity of the algorithm is extremely slow.

Jo

Hence, it is important to optimize C2 by analyzing the performance of APSO algorithm.

However, in the APSO algorithm, the premature convergence problem still exists and some extrema may be avoided. According to the characteristics of learning 10

factor C2 in APSO, it can be described by chaotic mapping, and chaos theory can be used to optimize the parameter C2. Therefore, the classical logistic equation can be applied to achieve evolution and optimization of chaotic parameters. The used iterative formula is provided as follows: k 1 i

= 4x (1  x ) k

k

i

i

(4)

k

0 < xi  1

of

x

ro

2.3. Double-population CAPSO algorithm with diffusion theory

-p

In the application of CAPSO, a particle is designed as a potential solution to the actual problem and the optimal solution is updated by the iteration of the

re

algorithm. There are two phases in the system: SCCO2 melt and polymer melt. The

lP

molecular energy required to overcome the barrier from the original position to other locations, which is called the diffusion energy. The molecules move with the

ur na

diffusion energy, and then achieve mass transfer. The diffusion velocity of a molecule is proportional to the movement speed and the temperature of the system determines the movement speed of the molecule. In the molecular

Jo

system, the system temperature is a statistical variable, which is also proportional to the average kinetic energy of the molecule. Temperature is proportional to the speed of molecular movement.

11

Based on the thermodynamic diffusion theory, a novel CAPSO algorithm, called DP-DT-CAPSO algorithm, was proposed in this paper. In the DP-DT-CAPSO algorithm, the molecules movement is replaced by the particles movement to simulate the molecular force field on particles. The diffusion temperature is

of

determined by the thermal motion of the molecules. The algorithm assumes that the particle has a population temperature, which represents the average

ro

molecular temperature of the system. The related concepts and definitions are

-p

respectively described below.

lP

Definition 1: Particle diffusion energy

re

2.3.1. Related concepts and definitions

1 E  mv 2 2

ur na

In CAPSO algorithm, the kinetic energy of the particle population is defined as: (5)

where v and m are respectively the speed and mass of the particles. The particle

Jo

with kinetic energy can spread to a new position when it can overcome the movement barrier and the kinetic energy is called the particle diffusion energy. If the mass of all particles is 1, the particle diffusion energy

Qi 

1 n 2 Vij 2 j 1

(6) 12

Qi

can be defined as:

where

is the speed of the particles.

Vij

Definition 2: Center of mass of the population The convergence probability of PSO algorithm is 100% when all the particles move toward a center. Therefore, in the entire population, a population center is set

x m m i

(7)

i

ro

X cen 

of

and the particles move toward the center, the formula is updated as follows:

where

X cen

-p

i

is the center of mass of the population; m is the size of the

lP

re

population ; xi is the position.

Definition 3: Distance to center of mass di

ur na

The distance to center of mass di  xi

Xc

is expressed as: (8)

e n

Jo

Definition 4: Particle diffusion probability The particle diffusion probability indicates the proportion of successful diffusion of particles that meet the diffusion conditions and is depending on both energy and temperature. Particle diffusion probability 13

Pi

is defined as:



Qi

1 e T Pi  d

(9)

where T is temperature.

d

is the distance difference between the particles to

the centroids. (10)

of

B d  xi  X cenA  xi  X cen

ro

2.3.2. DP-DT-CAPSO algorithm

The SCCO2/polymer solution system, which have two-phase, one is the gas phase

-p

and the other is the melt phase. Therefore, we used two populations to model

re

the two-phase system. Population A is the gas molecular population and

lP

Population B is the polymer melt molecular population. The operation and function of the two phases are basically the same in the algorithm.

ur na

Taking Population A as an example, in the execution of the DP-DT-CAPSO algorithm, the particles diffusion energy in Population A is first calculated and then the particles diffusion probability is obtained. The particles with higher

Jo

diffusion probability are copied to the candidate Population A, the particle carried the largest diffusion probability is used to update the worst fitness particles in the candidate Population B. Then, the global extremum is obtained. The algorithm is executed as follows: 14

Step 1: Initialization. Population A and Population B are initialized and then the maximum number of iterations are set. The values of some parameters such as population size, particle fitness value, and particle position are randomly set. Step 2: Fitness value calculation. The fitness value of the population particles is

of

calculated.

ro

Step 3: Extremum value update. According to the fitness value of the particle, the

-p

extremum of the individual particle in the population is updated.

re

Step 4: Termination condition calculation. When the number of iterations reaches the maximum value or the convergence accuracy meets the criteria, go to Step 11;

lP

otherwise, go to the next Step.

ur na

Step 5: the diffusion probability of population particles is calculated. Step 6: Candidate population. If the particles diffusion probability is greater than the generated random number, the particles are copied into the candidate

Jo

population.

Step 7: Information sharing and diffusion between population particles. Taking Population A as an example, the particles with the highest diffusion probability are selected from A and the particles with the worst fitness values among the 15

Candidate Population B are replaced. After particle information sharing, the mass transfer is completed. Step 8: Update the global extremum. When the particle's fitness value is better, the global extremum is updated to the current particle.

of

Step 9: Iterative process data are saved. Various groups and candidate sets and

ro

extreme values are saved.

-p

Step 10: Iterative update. Particle attributes and population attributes are

lP

Step 11: The final result is saved.

re

updated.

2.4. Artificial neural network trained by the DP-DT-CAPSO

ur na

Among many artificial neural networks, RBF ANN has unique advantages and is one of the widely concerned models. It has a three-layer structure. In this paper, Gaussian function was selected as the activation function: xk  ci

Jo gi ( xk )  exp(

 i2

2

(11)

)

16

where xk (1  k  n) is the output vector of the K-th iteration; ci (1  i  c) is the center of the basis function;  i is the expansion factor; n is the number of samples. In the training process of RBF ANN network, three parameters are mainly

of

optimized: the basis function center, the extension constant, and the connection

ro

weight. The network is trained through the iterations of ci ,  i , wi . The output of

-p

RBF ANN network is defined as: c

O( xk )   wi gi ( xk )

(12)

re

i 1

lP

where wi is the connection weight of the i-th hidden node. In this paper, we used the DP-DT-CAPSO algorithm to train the three parameters

ur na

ci ,  i , wi The structure of the particle is defined as: . y  f (wh,o ,  h,o , ci )

(13)

Jo

where wh,o (1  h  c) (1  o  p) is the weight;  h,o is the extension factor between the hidden node and output node.

17

A prediction model based on RBF ANN and DP-DT-CAPSO algorithm applied in the prediction of SCCO2 solubility in polymers is developed, called DP-DT-CAPSO RBF ANN. 3. Model establishment

of

3.1. Experimental data

ro

Firstly, 327 sets of data of 8 common polymers in industrial production are used

-p

to establish the model database and the data sources are provided in Table 1.

re

In order to obtain the better performance, in this paper, we divided the experimental data into three subsets randomly. The training set (including 70% of

lP

the data) was used for the model training. The verification set (including 15% of the data) was used to optimize the trained model. The testing set (containing 15%

ur na

of the data) was used to test the the model. Table 2 shows the data distribution statistics of each polymer. 3.2. Model evaluation

Jo

The average relative deviation (ARD), root mean square error of prediction (RMSEP), and squared correlation coefficient (R2) were used to evaluate the performance of the model in this paper.

18

ARD 

1 N y i  yi  N i 1 yi

RMSEP 

1 N

(14)

N

 ( yi  yi )2

(15)

i=1

2

N    ( yi  yave )( yi  yave )   R 2  N i=1 N  ( yi  yave )2  ( yi  yave )2

of

i=1

ro

i=1

(16)

yi

and yave are the experimental value and

re

and the predicted average value;

-p

where N is the sample number; yi and yave are respectively the predicted value

3.3. Model structure

lP

the average data of experimental value.

ur na

The proposed model uses a three-layer structure composed of the input layer, the hidden layer and the output layer. The input layer contains two nodes of temperature and pressure. The output layer consists of one node indicating the

Jo

solubility of SCCO2 in the polymer. In order to see the effect of the nodes in the hidden layer on the performance of the model, the number of nodes in the hidden layer is optimized by the heuristic method. The number of nodes is changed from 3 to 13. Therefore, 11 models are obtained. Fig. 1 shows the error curves with different nodes in the hidden layers. 19

As shown in Fig. 1, as the number of nodes increases, the error firstly decreases and then increases. When the number of nodes in the hidden layer is 6, the error is the smallest. Therefore, in this paper, the number of nodes in the hidden layer is 6.

of

4. Results and discussion

ro

4.1. Results of the proposed model

Experiments were performed in Windows 7 SP1 64-bit OS (4.00 GB of memory

-p

and Intel (R) Core ™ i5-4460 processor). Through Matlab 2010a software

re

programing, a three-layer model with the 2-6-1 structure, called DP-DT-CAPSO

lP

RBF ANN, was developed. The proposed model was used to calculate the SSCO2 solubility in 8 polymers. In the training set, Fig. 2 draws the experimental values

ur na

and the predicted values of the proposed model. As shown in Fig. 2, in the training set, the predicted data of 8 polymers are close to the experimental data and the distribution of the predicted data is similar to

Jo

the ideal prediction line. The predicted values were consistent with the experimental values, indicating that the DP-DT-CAPSO RBF ANN model was well trained. Fig. 3 shows the data distribution of the predicted and experimental values in the validation set. The prediction data of the model were close to the 20

experimental data (Fig. 3). The vertical distance from the data point to the straight line indicated that the DP-DT-CAPSO RBF ANN model had the higher prediction accuracy, and had a better correlation. The distribution of the predicted data of the model was close to that of the

of

experimental data (Fig. 4), indicating that the prediction performance was good.

ro

The prediction points in the testing set were relatively more dispersed than

others, indicating that the accuracy in the testing set was slightly lower. Table 3

-p

shows the model performance values for each data set.

re

It can be seen from Table 3 that the model has a better prediction performance,

lP

higher accuracy and correlation in the three sets. Table 4 shows the related data of the model of predicting the SCCO2 solubility in 8 kinds of polymers. The data

ur na

showed that the prediction accuracy of the model was basically the same among 8 polymers. The correlation coefficient was also above 0.99. The model showed

Jo

the better prediction ability and better scalability. 4.2. Comparison with other models The above confirmed that the higher performance of the proposed model in predicting the SCCO2 solubility in polymers. In order to further explore the performance, it is necessary to carry out the comparison between the current 21

work and previous works, especially those ones based on the artificial intelligence and ANN. Therefore, the model proposed in this paper was compared with three types of models used PSO and RBF ANN, such as RBF ANN, PSO BP ANN PSO RBF ANN. The convergence curves of these compared models are shown in Fig. 5.

of

As shown in Fig.5, the models of DP-DT-CAPSO RBF ANN, PSO RBF ANN, PSO BP

ro

ANN and RBF ANN tended to be stable after 200, 400, 300 and 700 times,

respectively. The model proposed in this paper has faster convergence speed. The

-p

prediction accuracy of the models decreased as the following order: DP-DT-CAPSO

re

RBF ANN > PSO RBF ANN > PSO BP ANN > RBF ANN. The accuracy of the

lP

DP-DT-CAPSO RBF ANN model was close to 0. The proposed model is better in terms of convergence accuracy and convergence speed. Fig. 6 shows the

model.

ur na

correlation between predicted and experimental values for each comparison

As shown in Fig. 6, various models have different prediction data distributions and

Jo

the prediction values of the proposed model are closer to the experimental data. In terms of prediction accuracy, the models decreased as the following order: DP-DT-CAPSO RBF ANN > PSO RBF ANN > PSO BP ANN > RBF ANN. The

22

DP-DT-CAPSO RBF ANN model had the highest prediction accuracy. Table 5 provides the statistical parameters of each model. In terms of the ARD and RMSEP data, the prediction accuracy of DP-DT-CAPSO RBF ANN model was better. In term of the correlation coefficient, the

of

DP-DT-CAPSO RBF ANN model was also significantly better than other models.

ro

The above data indicated that the proposed model exhibited the better

performance in terms of accuracy and correlation. Table 6 gives the computation

-p

time of various models.

re

Execution time of DP-DT-CAPSO RBF ANN model was close to that of RBF ANN

lP

model and shorter than that of PSO BP ANN and PSO RBF ANN. In the intelligent algorithm-based model, the introduction of the intelligent algorithm increased

ur na

calculation time. The diffusion theory was introduced in the particle swarm optimization algorithm in this paper. The thermodynamic parameters were recalculated in the iteration, thus resulting in longer calculation time of the model.

Jo

Compared with other models, this model also showed acceptable calculation time.

4.3. Discussion 4.3.1. Correlation of the model 23

The above experiments confirmed the advantages of the DP-DT-CAPSO RBF ANN model in terms of calculation accuracy, correlation and calculation time. Numerous experiments showed that solubility was proportional to the pressure of the system and inversely proportional to temperature. Fig. 7 shows the solubility

of

curves of the proposed model under different temperatures and pressures.

ro

The solubility of SCCO2 in polymers (PP, HDPE, CPEs, PS, PBS and PBSA) increased with the increase in pressure and decreased with the increase in temperature (Fig.

-p

7). The solubility of SCCO2 in polymers (PLLA and PLGA) increased with the

re

increase in pressure. With the increase in temperature, the solubility SCCO2 in

lP

PLLA and PLGA decreased firstly and then became stable. The predicted trend was consistent with the experimental results.

ur na

4.3.2. Stability of the model

According to Table 4 and Table 5, the proposed model has the better prediction ability of SCCO2 solubility in the 8 polymers, displaying the higher accuracy and

Jo

correlation. There is no abnormal prediction point and the model is the better stability. This paper proposes a model by combining thermodynamic diffusion theory with artificial intelligence, such as swarm intelligence algorithms. Compared with previous studies, it has several obvious characteristics. 1) In terms 24

of prediction accuracy and correlation, Table 4 and Table 5 show that the model in this paper has obvious advantages. 2) Empirical correlation exists in all prediction models based on artificial intelligence algorithms. In this paper, through the separation of the training set, verification set and test set, the

of

interference of empirical correlation is minimized. 3) From the efficiency of the intelligent model, according to the calculation time in Table 6, it can be seen that

ro

the efficiency of various models based on intelligent algorithms is better.

-p

4.3.3. Fault tolerance and the anti-jamming ability of the model

re

After the model is trained with many normal data, invalid data or abnormal data

lP

can be avoided by the model. The trained model displays the better fault tolerance and strong anti-jamming ability.

ur na

4.3.4. Expansibility of the model

Solubility prediction experiments confirmed the higher prediction performance of the model. In addition, the model can be extended to other applications such as

Jo

the prediction of other chemical and chemical properties, such as data prediction, data processing and optimization of experimental parameters, quantitative structure-activity relationship, and computer-aided design. The model has good scalability. 25

5. Conclusions In this paper, an improved model was proposed based on diffusion theory, particle swarm optimization and artificial neural network and used to predict the solubility of SCCO2 in 8 polymers. Experimental results showed that the

of

DP-DT-CAPSO-RBF-ANN model was better than other comparison models in terms of prediction accuracy, correlation and calculation time. The model has good

ro

stability and scalability and can be applied in prediction, fitting, and processing of

-p

various types of the data of chemical, physical, biological, and information fields.

ur na

Declaration of interests

lP

methods and prediction models.

re

In the future, we will further explore more efficient theoretical calculation

Jo

☒ The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

ACKNOWLEDGEMENTS

The authors gratefully acknowledge the support from the National Natural Science Foundation of China (Grant Numbers: 51663001).

26

REFERENCES

[1] K. Gong, Panuganti, S.R., Chapman, W.G., Study of solubility and swelling ratio in polymer-CO2 systems using the PC-SAFT equation of state, J. Appl. Polym.

of

Sci. 134 (2017). [2] K.A. Rabaeh, Al-Ajaleen, M.S., Abuzayed, M.H., Aldweri, F.M., Eyadeh, M.M.,

ro

High dose sensitivity of N-(isobutoxymethyl)acrylamide polymer gel

-p

dosimeters with improved monomer solubility using acetone co-solvent,

re

Nuclear Instruments & Methods in Physics Research Section B-Beam Interactions with Materials and Atoms. 442 (2019) 67-72.

lP

[3] T.V. Duncan, Pillai, K., Release of Engineered Nanomaterials from Polymer

ur na

Nanocomposites: Diffusion, Dissolution, and Desorption, Acs Appl. Mater. Interfaces. 7 (2015) 2-19.

[4] S. Khatsee, Daranarong, D., Punyodom, W., Worajittiphon, P., Electrospinning

Jo

polymer blend of PLA and PBAT: Electrospinnability-solubility map and effect of polymer solution parameters toward application as antibiotic-carrier mats, J. Appl. Polym. Sci. 135 (2018).

[5] M. Ota, Sato, Y., Smith, R.L., Inomata, H., Predictive dimensionless solubility (pDS) model for solid solutes in supercritical CO2 that requires only 27

pure-component physical properties, Chem. Eng. Res. Des. 136 (2018) 251-261. [6] G. Guerin, Molev, G., Pichugin, D., Rupar, P.A., Qi, F., Cruz, M., Manners, I., Winnik, M.A., Effect of Concentration on the Dissolution of One-Dimensional

of

Polymer Crystals: A TEM and NMR Study, Macromolecules. 52 (2019) 208-216.

ro

[7] J. Sun, Wang, H., Chen, M., Ye, J., Jiang, B., Wang, J., Yang, Y., Ren, C., Solubility

-p

measurement of hydrogen, ethylene, and 1-hexene in polyethylene films

re

through an intelligent gravimetric analyzer, J. Appl. Polym. Sci. 134 (2017). [8] A. Baghban, Sasanipour, J., Habibzadeh, S., Zhang, Z.E., Sulfur dioxide solubility

lP

prediction in ionic liquids by a group contribution - LSSVM model, Chem. Eng.

ur na

Res. Des. 142 (2019) 44-52.

[9] J. Koszinowski, Diffusion and solubility of n-alkanes in polyolefines (Retraction of vol 32, pg 4765, 1986), J. Appl. Polym. Sci. 133 (2016).

Jo

[10] A. Aftab, Shariff, A.M., Garg, S., Lal, B., Shaikh, M.S., Faiqa, N., Solubility of CO2 in aqueous sodium beta-alaninate: Experimental study and modeling using Kent Eisenberg model, Chem. Eng. Res. Des. 131 (2018) 385-392.

[11] T. Cowen, Karim, K., Piletsky, S.A., Solubility and size of polymer nanoparticles, Polym. Chem. 9 (2018) 4566-4573. 28

[12] P. Zhou, Kasuga, T., Dissolution behavior of Mg/Si-doped vaterite particles in biodegradable polymer composites, Express Polymer Letters. 12 (2018) 171-179. [13] H. Ziaee, Hosseini, S.M., Sharafpoor, A., Fazavi, M., Ghiasi, M.M., Bahadori, A.,

of

Prediction of solubility of carbon dioxide in different polymers using support vector machine algorithm, J. Taiwan Inst. Chem. Eng. 46 (2015) 205-213.

ro

[14] J.A. Lazzús, Cuturrufo, F., Pulgar-Villarroel, G., Salfate, I., Vega, P., Estimating

-p

the Temperature-Dependent Surface Tension of Ionic Liquids Using a Neural

re

Network-Based Group Contribution Method, Ind. Eng. Chem. Res. 56 (2017) 6869-6886.

lP

[15] J.A. Lazzus, Neural network-particle swarm modeling to predict thermal

ur na

properties, Math. Comput. Model. 57 (2013) 2408-2418. [16] E. Hosaini-Alvand, Mirshekar, H., Khorasani, M.T., Parvazinia, M., Joorabloo, A., Fabricating and robust artificial neural network modeling nanoscale

Jo

polyurethane fiber using electrospinning method, J. Appl. Polym. Sci. 134 (2017).

[17] M.S. Giri Nandagopal, Selvaraju, N., Prediction of Liquid–Liquid Flow Patterns in a Y-Junction Circular Microchannel Using Advanced Neural Network Techniques, Ind. Eng. Chem. Res. 55 (2016) 11346-11362. 29

[18] B. Mehdizadeh, Movagharnejad, K., A comparison between neural network method and semi empirical equations to predict the solubility of different compounds in supercritical carbon dioxide, Fluid. Phase. Equilibr. 303 (2011) 40-44.

of

[19] S.L. Qiu, Liu, Q.H., Zhou, S.J., Wu, C.J., Review of Artificial Intelligence Adversarial Attack and Defense Technologies, App.l Sci-Basel. 9 (2019).

ro

[20] M. Lashkarbolooki, Bayat, M., Prediction of surface tension of liquid normal

-p

alkanes, 1-alkenes and cycloalkane using neural network, Chem. Eng. Res. Des.

re

137 (2018) 154-163.

[21] M.S. Li, Zhang, H.J., Chen, B.S., Wu, Y., Guan, L.X., Prediction of pKa Values for

ur na

Rep. 8 (2018) 3991.

lP

Neutral and Basic Drugs based on Hybrid Artificial Intelligence Methods, Sci.

[22] J.A. Lazzus, Prediction of flammability limit temperatures from molecular structures using a neural network-particle swarm algorithm, J. Taiwan Inst.

Jo

Chem. Eng. 42 (2011) 447-453. [23] K. Yao, Herr, J.E., Toth, D.W., McKintyre, R., Parkhill, J., The TensorMol-0.1 model chemistry: a neural network augmented with long-range physics, Chem. Sci. 9 (2018) 2261-2269.

30

[24] Y. Bakhbakhi, Neural network modeling of ternary solubilities of 2-naphthol in supercritical CO2: A comparative study, Math. Comput. Model. 55 (2012) 1932-1941. [25] A.Z. Hezave, Raeissi, S., Lashkarbolooki, M., Estimation of Thermal

of

Conductivity of Ionic Liquids Using a Perceptron Neural Network, Ind. Eng. Chem. Res. 51 (2012) 9886-9893.

ro

[26] F. Gharagheizi, Eslamimanesh, A., Mohammadi, A.H., Richon, D., Artificial

-p

Neural Network Modeling of Solubilities of 21 Commonly Used Industrial Solid

re

Compounds in Supercritical Carbon Dioxide, Ind. Eng. Chem. Res. 50 (2011) 221-226.

lP

[27] A. Eslamimanesh, Gharagheizi, F., Mohammadi, A.H., Richon, D., Artificial

ur na

Neural Network modeling of solubility of supercritical carbon dioxide in 24 commonly used ionic liquids, Chem. Eng. Sci. 66 (2011) 3039-3044. [28] H. Pahlavanzadeh, Nourani, S., Saber, M., Experimental analysis and modeling

Jo

of CO2 solubility in AMP (2-amino-2-methyl-1-propanol) at low CO2 partial pressure using the models of Deshmukh-Mather and the artificial neural network, J. Chem. Thermodyn. 43 (2011) 1775-1783.

[29] S. Al-Shihabi, Arafeh, M., Barghash, M., An improved hybrid algorithm for the set covering problem, Comput. Ind. Eng. 85 (2015) 328-334. 31

[30] J.A. Lazzus, Hybrid Particle Swarm-Ant Colony Algorithm to Describe the Phase Equilibrium of Systems Containing Supercritical Fluids with Ionic Liquids, Communications in Computational Physics. 14 (2013) 107-125. [31] N. Kalaiarasi, Dash, S.S., Padmanaban, S., Paramasivam, S., Morati, P.K.,

of

Maximum Power Point Tracking Implementation by Dspace Controller Integrated Through Z-Source Inverter Using Particle Swarm Optimization

ro

Technique for Photovoltaic Applications, App.l Sci-Basel. 8 (2018).

-p

[32] M.S. Li, Zhang, H.J., Liu, L., Chen, B.S., Guan, L.X., Wu, Y., A Quantitative

re

Structure-Property Relationship Model Based on Chaos-Enhanced Accelerated Particle Swarm Optimization Algorithm and Back Propagation

lP

Artificial Neural Network, App.l Sci-Basel. 8 (2018) 1121.

ur na

[33] L.V. Pavao, Borba Costa, C.B., Ravagnani, M.A.S.S., Heat Exchanger Network Synthesis without stream splits using parallelized and simplified simulated Annealing and Particle Swarm Optimization, Chem. Eng. Sci. 158 (2017)

Jo

96-107.

[34] S. Arefi-Oskoui, Khataee, A., Vatanpour, V., Modeling and Optimization of NLDH/PVDF Ultrafiltration Nanocomposite Membrane Using Artificial Neural Network-Genetic Algorithm Hybrid, ACS Combinatorial Science. 19 (2017) 464-477. 32

[35] X.G. Liu, Zhao, C.Y., Melt index prediction based on fuzzy neural networks and PSO algorithm with online correction strategy, Aiche J. 58 (2012) 1194-1202. [36] J.A. Lazzus, Ponce, A., Chilla, L., Application of particle swarm optimization to

of

model the phase equilibrium of complex mixtures, Fluid. Phase. Equilibr. 317 (2012) 132-139.

ro

[37] A. Khajeh, Modarress, H., Prediction of solubility of gases in polystyrene by

-p

Adaptive Neuro-Fuzzy Inference System and Radial Basis Function Neural

re

Network, Expert. Syst. Appl. 37 (2010) 3070-3074.

[38] M.A. Hussain, Aroua, M.K., Yin, C.Y., Rahman, R.A., Ramli, N.A., Hybrid neural

lP

network for prediction of CO2 solubility in monoethanolamine and

ur na

diethanolamine solutions, Korean J. Chem. Eng. 27 (2010) 1864-1867. [39] M.S. Li, Huang, X.Y., Liu, H.S., Liu, B.X., Wu, Y., Prediction of the gas solubility in polymers by a radial basis function neural network based on chaotic

Jo

self-adaptive particle swarm optimization and a clustering method, J. Appl. Polym. Sci. 130 (2013) 3825-3832.

[40] M.S. Li, Huang, X.Y., Liu, H.S., Liu, B.X., Wu, Y., Deng, X.Z., Solubility prediction of gases in polymers using fuzzy neural network based on particle swarm

33

optimization algorithm and clustering method, J. Appl. Polym. Sci. 129 (2013) 3297-3303. [41] Y. Wu, Liu, B.X., Li, M.S., Tang, K.Z., Wu, Y.B., Prediction of CO2 Solubility in Polymers by Radial Basis Function Artificial Neural Network Based on Chaotic

of

Self-adaptive Particle Swarm Optimization and Fuzzy Clustering Method, Chin. J. Chem. 31 (2013) 1564-1572.

ro

[42] M. Li, Huang, X., Liu, H., Liu, B., Wu, Y., Wang, L., Solubility prediction of

-p

supercritical carbon dioxide in 10 polymers using radial basis function artificial

re

neural network based on chaotic self-adaptive particle swarm optimization and K-harmonic means, RSC. Adv. 5 (2015) 45520-45527.

lP

[43] M. Li, Wu, W., Chen, B., Wu, Y., Huang, X., Solubility prediction of gases in

ur na

polymers based on an artificial neural network: a review, RSC. Adv. 7 (2017) 35274-35282.

[44] M.S. Li, Huang, X.Y., Liu, H.S., Liu, B.X., Wu, Y., Ai, F.R., Solubility Prediction of

Jo

Gases in Polymers based on Chaotic Self-adaptive Particle Swarm Optimization Artificial Neural Networks, Acta Chim. Sinica. 71 (2013) 1053-1058.

[45] M.S. Li, Huang, X.Y., Liu, H.S., Liu, B.X., Wu, Y., Xiong, A.H., Dong, T.W., Prediction of gas solubility in polymers by back propagation artificial neural 34

network based on self-adaptive particle swarm optimization algorithm and chaos theory, Fluid. Phase. Equilibr. 356 (2013) 11-17. [46] J. Kennedy, Eberhart, R., Particle swarm optimization, in: 1995 IEEE International Conference on Neural Networks Proceedings, Proceedings of

of

ICNN'95 - International Conference on Neural Networks, IEEE Australia Council, Perth, 1995, pp. 1942-1948.

ro

[47] A. Khajeh, Modarress, H., Mohsen-Nia, M., Solubility prediction for carbon

-p

dioxide in polymers by artificial neural network, Iran. Polym. J. 16 (2007)

re

759-768.

[48] Y. Sato, Fujiwara, K., Takikawa, T., Sumarno, Takishima, S., Masuoka, H.,

lP

Solubilities and diffusion coefficients of carbon dioxide and nitrogen in

ur na

polypropylene, high-density polyethylene, and polystyrene under high pressures and temperatures, Fluid. Phase. Equilibr. 162 (1999) 261-276. [49] Z.G. Lei, Ohyabu, H., Sato, Y., Inomata, H., Smith, R.L., Solubility, swelling

Jo

degree and crystallinity of carbon dioxide-polypropylene system, J. Supercrit. Fluid. 40 (2007) 452-461.

[50] D.C. Li, Liu, T., Zhao, L., Yuan, W.K., Solubility and Diffusivity of Carbon Dioxide in Solid-State Isotactic Polypropylene by the Pressure-Decay Method, Ind. Eng. Chem. Res. 48 (2009) 7117-7124. 35

[51] E. Aionicesei, Skerget, M., Knez, Z., Measurement of CO2 solubility and diffusivity in poly(L-lactide) and poly(D,L-lactide-co-glycolide) by magnetic suspension balance, J. Supercrit. Fluid. 47 (2008) 296-301. [52] M. Skerget, Mandzuka, Z., Aionicesei, E., Knez, Z., Jese, R., Znoj, B., Venturini,

of

P., Solubility and diffusivity of CO2 in carboxylated polyesters, J. Supercrit. Fluid. 51 (2010) 306-311.

ro

[53] Y. Sato, Yurugi, M., Fujiwara, K., Takishima, S., Masuoka, H., Solubilities of

-p

carbon dioxide and nitrogen in polystyrene under high temperature and

re

pressure, Fluid. Phase. Equilibr. 125 (1996) 129-138.

[54] S. Hilic, Boyer, S., Padua, A., Grolier, J., Simultaneous measurement of the

lP

solubility of nitrogen and carbon dioxide in polystyrene and of the associated

ur na

polymer swelling, J. Polym. Sci. B-Polym. Phys. 39 (2001) 2063-2070. [55] Y. Sato, Takikawa, T., Takishima, S., Masuoka, H., Solubilities and diffusion coefficients of carbon dioxide in poly(vinyl acetate) and polystyrene, J.

Jo

Supercrit. Fluid. 19 (2001) 187-198. [56] Y. Sato, Takikawa, T., Sorakubo, A., Takishima, S., Masuoka, H., Imaizumi, M., Solubility and diffusion coefficient of carbon dioxide in biodegradable polymers, Ind. Eng. Chem. Res. 39 (2000) 4813-4819.

36

of ro -p re

Jo

ur na

mean squared error

lP

Fig. 1. Correlations between the number of nodes of in the hidden layer and

37

of ro -p

Jo

ur na

lP

re

Fig. 2 Experimental values and predicted values in the training set.

Fig. 3 Experimental values and predicted values in the validation set.

38

of ro

-p

Fig. 4. Relationship between predicted values and experimental data in the

Jo

ur na

lP

re

testing set.

Fig. 5 Curve of mean squared error versus iteration number

39

of ro -p

Jo

ur na

lP

re

Fig. 6 Predicted values VS. experimental data.

40

of ro -p re lP ur na Jo

Fig. 7. Solubility variations with pressure and temperature

41

Table 1 Sources and statistics of experimental data in this work T (K)

P (MPa)

Solubility(g/g)

Data points

References

PP

313.20-483.70

7.400-24.910

0.03950-0.26170

67

[47-50]

PLLA

308.00-323.00

9.620-31.460

0.16520-0.43010

27

[51]

HDPE

433.15-473.20

10.731-18.123

0.00551-0.12296

20

[47, 48]

CPEs

306.00-344.00

10.150-31.020

0.09840-0.63660

56

[52]

PS

338.22-473.15

7.540-44.410

0.02641-0.16056

70

[47, 53-55]

PBS

323.15-453.15

8.008-20.144

0.04534-0.17610

31

PBSA

323.15-453.16

7.870-20.128

0.04763-0.17411

29

PLGA

308.00-323.00

10.140-31.470

0.09030-0.29630

27

Total

306.00-483.70

7.400-44.410

0.00551-0.63660

327

PP

47

PLLA

19

HDPE

14

CPEs PS PBS PBSA

ro

[47, 56]

-p

[51]

Testing

Total

10

10

67

4

4

27

3

3

20

38

9

9

56

50

10

10

70

21

5

5

31

21

4

4

29

19

4

4

27

Jo

PLGA

Validation

lP

Training

[47, 56]

ur na

Polymer

re

Table 2 Distribution of experimental data

of

Polymer

42

Table 3 Values of ARD, R2, and RMSEP for different subsets. ARD

R2

RMSEP

Training

0.0036

0.9973

0.0147

Validation

0.0035

0.9971

0.0152

Testing

0.0037

0.9966

0.0157

Average

0.0036

0.9970

0.0152

of

Subset

ro

Table 4 Values of ARD, R2, and RMSEP for different polymers in the testing set. Polymer

ARD

R2

PP

0.0031

0.9972

PLLA

0.0037

0.9966

0.0138

HDPE

0.0042

0.9967

0.0137

CPEs

0.0037

PS

0.0037

PBS

re

-p

RMSEP 0.0147

0.0152

0.9964

0.0146

0.0042

0.9967

0.0146

PBSA

0.0036

0.9966

0.0152

PLGA

0.0033

0.9968

0.0146

Jo

ur na

lP

0.9973

43

Table 5 Statistical parameters of various models CPEs

PS

PBS

PBSA

PLGA

Average

RBF ANN

0.0112

0.0096

0.0113

0.0104

0.0106

0.0121

0.0111

0.0098

0.0108

PSO BP ANN

0.0078

0.0076

0.0086

0.0081

0.0068

0.0089

0.0088

0.0075

0.0080

PSO RBF ANN

0.0065

0.0064

0.0072

0.0056

0.0064

0.0078

0.0076

0.0063

0.0067

DP-DT-CAPSO RBF ANN

0.0023

0.0031

0.0042

0.0037

0.0034

0.0041

0.0038

0.0031

0.0035

RBF ANN

0.9589

0.9613

0.9621

0.9558

0.9642

0.9616

0.9643

0.9621

0.9613

PSO BP ANN

0.9853

0.9842

0.9823

0.9813

0.9811

0.9911

0.9813

0.9821

0.9836

PSO RBF ANN

0.9875

0.9876

0.9871

0.9856

0.9845

0.9842

0.9852

0.9834

0.9856

DP-DT-CAPSO RBF ANN

0.9972

0.9963

0.9966

0.9971

RBF ANN

0.0664

0.0589

0.0687

0.0921

PSO BP ANN

0.0482

0.0472

0.0399

PSO RBF ANN

0.0421

0.0423

0.0421

DP-DT-CAPSO RBF ANN

0.0149

0.0133

0.0137

ro

of

HDPE

0.9968

0.9962

0.9966

0.9967

0.9967

0.0887

0.0961

0.0952

0.0872

0.0817

-p

RMSEP

PLLA

0.0467

0.0466

0.0512

0.0514

0.0488

0.0475

0.0442

0.0475

0.0468

0.0476

0.0472

0.0450

0.0142

0.0146

0.0149

0.0147

0.0144

re

R2

PP

lP

ARD

Model

0.0145

Table 6 Computation time of various models

RBF ANN PSO BP ANN PSO RBF ANN

ur na

Model

Computation Time(S) 32 41 43 27

Jo

DP-DT-CAPSO RBF ANN

44