Journal Pre-proof Neural network modeling based double-population chaotic accelerated particle swarm optimization and diffusion theory for solubility prediction Mengshan Li, Suyun Lian, Fan Wang, Yanying Zhou, Bingsheng Chen, Lixin Guan, Yan Wu
PII:
S0263-8762(20)30004-6
DOI:
https://doi.org/10.1016/j.cherd.2020.01.003
Reference:
CHERD 3953
To appear in:
Chemical Engineering Research and Design
Received Date:
29 July 2019
Revised Date:
2 December 2019
Accepted Date:
2 January 2020
Please cite this article as: Li M, Lian S, Wang F, Zhou Y, Chen B, Guan L, Wu Y, Neural network modeling based double-population chaotic accelerated particle swarm optimization and diffusion theory for solubility prediction, Chemical Engineering Research and Design (2020), doi: https://doi.org/10.1016/j.cherd.2020.01.003
This is a PDF file of an article that has undergone enhancements after acceptance, such as the addition of a cover page and metadata, and formatting for readability, but it is not yet the definitive version of record. This version will undergo additional copyediting, typesetting and review before it is published in its final form, but we are providing this version to give early visibility of the article. Please note that, during the production process, errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain. © 2020 Published by Elsevier.
Neural Network modeling based double-population chaotic accelerated particle swarm optimization and diffusion theory for
of
Solubility prediction
ro
Mengshan LI*, Suyun LIAN, Fan WANG, Yanying ZHOU, Bingsheng CHEN, Lixin GUAN, Yan WU
-p
College of Physics and Electronic Information, Gannan Normal University, Ganzhou, 341000, Jiangxi,
lP
re
CHINA;
ur na
Correspondence to: Mengshan LI (E-mail:
[email protected])
Jo
Graphical abstract
1
of ro -p re lP
Highlights
A novel method is proposed for solubility prediction.
An improved PSO algorithm based on the diffusion theory is proposed.
RBF ANN trained by improved APSO is proposed.
Proposed method has a broad application perspective
Jo
ur na
ABSTRACT
Solubility is as a key chemical and physical property. Solubility prediction methods are applied in diverse fields including preparation synthesis and modifications of 2
materials. To overcome the shortcomings of existing solubility prediction methods, taking the mass transfer of two-phase system as an example, a solubility prediction model based on the diffusion theory and hybrid artificial intelligence method was proposed in this paper. An improved double-population chaotic
of
accelerated particle swarm optimization (APSO) algorithm combined diffusion theory was developed according to the particle evolution utilizing diffusion energy.
ro
The developed algorithm was applied in the training of parameters of the radial
-p
basis function artificial neural network and then a model for predicting solubility
re
was developed. The experimental results of supercritical carbon dioxide solubility in 8 polymers were consistent with the predicted values by the model, indicating
lP
the high prediction accuracy. The average relative deviation, squared correlation
ur na
coefficient, and root mean square error were respectively 0.0036, 0.9970, and 0.0152, displaying its higher comprehensive performance. The model may also be
Jo
applied in other physicochemical fields.
ABBREVIATIONS
PVT
Pressure, Volume, Temperature
ANN
Artificial Neural Network 3
RBF
Radial Basis Function
PSO
Particle Swarm Optimization
CAPSO
Chaotic accelerated PSO
Poly(butylene succinate).
PBSA
Poly(butylene succinate-co-adipate).
PS
Polystyrene.
re
Polypropylene.
lP
PP
-p
PBS
Carboxylated polyesters
PLLA
Poly(l-lactide)
Poly(D,l-lactide-co-glycolide) High-density polyethylene
Jo
HDPE
ur na
CPEs
PLGA
ARD R2
MSE
ro
of
DP-DT-CAPSO Double population CAPSO based on diffusion theory
Average Relative Deviation
Squared Correlation Coefficient Mean Square Error 4
RMSEP
Root Mean Square Error of Prediction
Keywords: Neural Network; double-population; particle swarm optimization;
ro
of
diffusion theory; Solubility prediction
1. Introduction
-p
Solubility of supercritical carbon dioxide (SCCO2) in polymeric compounds is
re
important for the modifications, synthesis, and preparation of new materials[1-6].
lP
Under high-temperature and high-pressure supercritical conditions, a solubility experiment is time-consuming and costly and it is not easy to obtain experimental
ur na
data. Therefore, it is necessary to design a prediction model with high accuracy. The solubility of SCCO2 in polymers is affected by polarity of molecules, density, temperature, and pressure. These factors show the complicated non-linear
Jo
associations with the solubility and the relationships among these factors are also complex. Therefore, traditional prediction methods based on thermodynamic equation of state or other empirical equations cannot provide the satisfactory prediction accuracy[7-12]. Ziaee et al.[13] predicted carbon dioxide solubility in different polymers with support vector machine and indicated that the proposed 5
models were efficient. Artificial neural network (ANN) possesses the ability of self-organizing, non-linear processing, and fault tolerance and can overcome the challenges of prediction[14-23]. Bakhbakhi et al.[24] and Lashkarbolooki et al.[25] compared the performances of ANN and the thermodynamic equation of
of
state in solubility prediction and showed that the ANN method possessed the higher prediction accuracy than the thermodynamic equation of state.
ro
Gharagheizi [26] showed that the ANN model had a higher prediction accuracy of
-p
the SCCO2 solubility in compounds. Similarly, through SCCO2 solubility prediction
re
experiments, Eslamimanesh et al.[27] demonstrated a high performance of ANN. Pahlavanzadeh et al.[28] predicted the solubility based on ANN model and
lP
Deshmukh-Mather method and demonstrated that ANN model was more
ur na
effective than conventional thermodynamic equations. The accuracy and reliability of ANN are dependent on the adopted training algorithm and the essence of training is to optimize the model structure and
Jo
variables. Several algorithms have been developed to optimize ANN training, including ant colony algorithm[29, 30], particle swarm optimization algorithm (PSO)[31-33], simulated annealing algorithm, and genetic algorithm[34]. With the online strategy and PSO algorithm, Liu et al.[35] trained the fuzzy neural network 6
and made a successful prediction of the melt flow rate (MFR). Lazzus et al.[36] used PSO algorithm to predict the phase equilibrium data of SCCO2 and achieved the better prediction results. Khajeh et al.[37] proposed the solubility prediction method of ANN and ANFIS and realized the higher prediction accuracy compared combined Kent-Eisenberg model with
of
to traditional methods. Hussain et al.[38]
ANN, proposed a hybrid neural network solution model, and achieved the better
ro
performance in terms of prediction. LI et al. [39-45] combined the chaos theory
-p
with the PSO and clustering methods, proposed several solubility prediction
re
models through improving ANN training algorithm, and achieved the higher
lP
prediction accuracy.
Previous studies mainly focused on improving the model itself without
ur na
considering the solubility element. Under static conditions, the solution process is basically a diffusion process in which solute molecules are adsorbed onto the surface. This process is driven by thermal movement of molecules and dependent
Jo
on the gradient of temperature or density between two media. Solute molecules migrate from the denser medium to the less dense medium continuously to produce an equilibrium at which the movement of solute molecules stops. In a two-phase system of polymer/SCCO2, at the interfacial film layer, due to interface 7
force, CO2 molecules attached to the interface spread to the interfacial film and are dissolved in the melted polymer. PSO algorithm may be used to analyze the solution process. Firstly, particles in PSO algorithm resemble those in solution and the similarity can be simulated by assigning the associated properties of solute
of
molecules in solution to the particles of the algorithm. Secondly, the solution process is a mass transfer process caused by the transport of material and it is
ro
closely related to the diffusion theory. Based on the diffusion theory and solubility
-p
and particle evolution algorithms, in order to simulate the molecular diffusion
re
process, we explored the movement of CO2 in the diffusion process and simulated the diffusion process based on the movement of evolution particles. Currently,
lP
the PSO algorithm was seldom used to analyze the migration of particles,
ur na
particularly in thermodynamics. Therefore, in this investigation, the diffusion theory was introduced into the PSO algorithm to improve it. The solution process of SCCO2 in polymer was simulated with the PSO algorithm and then an improved
Jo
PSO algorithm was proposed combined the diffusion theory and then applied in ANN training. Finally, a hybrid artificial intelligence prediction model of SCCO2 solubility in polymers was obtained.
8
2. Model theory 2.1. Standard PSO algorithm In 1995, Eberhart and Kennedy described the particle swarm optimization (PSO) as an evolution algorithm[46]. The movement of the particle is illustrated by the
k
k
k
k
(1)
k
v i, d = v i, d + c 1 (p i, d - x i, d ) + c 2 (p g, d - x i, d )
k +1 i, d
k
k +1
i, d
i, d
= x +v
(2)
-p
x
ro
k +1
of
following formulas:
k
p i,d
and
k
p g,d
and
k
x i,d
are respectively the velocity and position of
are the position of the individual and global extrema.
ur na
particle i;
k
v i,d
lP
are learning factors;
re
where i = 1, …… , m (m indicates particle number); is inertia weight; C1 and C2
2.2. CAPSO Algorithm
Jo
In recent years, an improved PSO variant with the extremely high global convergence performance, accelerated particle swarm optimization (APSO), has been widely concerned among scholars. In the APSO algorithm, inertial weight factors or cognitive factors are not considered and global exploration factors are utilized to improve the algorithm. The algorithm considers the global search 9
exploration factors for particle update. In the entire search process, particles are only restrained by global extremum, so the search velocity is accelerated. Position update formula is provided as: k +1
k
k
x i, d = (1 - C 2 )x i, d + C 2 pg, d
(3)
C1 r
of
where C1r is a random number and can allow the algorithm to escape from the
ro
local optimum.
-p
Compared to the standard PSO algorithm, APSO adopts two variables C1 and C2 to
re
reduce randomness during iteration. C1 is expressed as a monotonically decreasing function: C1 t , 0 1 . Thus, in the APSO algorithm, C2 is the key
lP
determinant of the algorithm performance and its range is [0.2, 0.7]. When C2 is 1,
ur na
the particles can converge to the current global extreme at any time and show no change and this global extremum barely represents the actual global extremum. Conversely, when C2 is 0, the search velocity of the algorithm is extremely slow.
Jo
Hence, it is important to optimize C2 by analyzing the performance of APSO algorithm.
However, in the APSO algorithm, the premature convergence problem still exists and some extrema may be avoided. According to the characteristics of learning 10
factor C2 in APSO, it can be described by chaotic mapping, and chaos theory can be used to optimize the parameter C2. Therefore, the classical logistic equation can be applied to achieve evolution and optimization of chaotic parameters. The used iterative formula is provided as follows: k 1 i
= 4x (1 x ) k
k
i
i
(4)
k
0 < xi 1
of
x
ro
2.3. Double-population CAPSO algorithm with diffusion theory
-p
In the application of CAPSO, a particle is designed as a potential solution to the actual problem and the optimal solution is updated by the iteration of the
re
algorithm. There are two phases in the system: SCCO2 melt and polymer melt. The
lP
molecular energy required to overcome the barrier from the original position to other locations, which is called the diffusion energy. The molecules move with the
ur na
diffusion energy, and then achieve mass transfer. The diffusion velocity of a molecule is proportional to the movement speed and the temperature of the system determines the movement speed of the molecule. In the molecular
Jo
system, the system temperature is a statistical variable, which is also proportional to the average kinetic energy of the molecule. Temperature is proportional to the speed of molecular movement.
11
Based on the thermodynamic diffusion theory, a novel CAPSO algorithm, called DP-DT-CAPSO algorithm, was proposed in this paper. In the DP-DT-CAPSO algorithm, the molecules movement is replaced by the particles movement to simulate the molecular force field on particles. The diffusion temperature is
of
determined by the thermal motion of the molecules. The algorithm assumes that the particle has a population temperature, which represents the average
ro
molecular temperature of the system. The related concepts and definitions are
-p
respectively described below.
lP
Definition 1: Particle diffusion energy
re
2.3.1. Related concepts and definitions
1 E mv 2 2
ur na
In CAPSO algorithm, the kinetic energy of the particle population is defined as: (5)
where v and m are respectively the speed and mass of the particles. The particle
Jo
with kinetic energy can spread to a new position when it can overcome the movement barrier and the kinetic energy is called the particle diffusion energy. If the mass of all particles is 1, the particle diffusion energy
Qi
1 n 2 Vij 2 j 1
(6) 12
Qi
can be defined as:
where
is the speed of the particles.
Vij
Definition 2: Center of mass of the population The convergence probability of PSO algorithm is 100% when all the particles move toward a center. Therefore, in the entire population, a population center is set
x m m i
(7)
i
ro
X cen
of
and the particles move toward the center, the formula is updated as follows:
where
X cen
-p
i
is the center of mass of the population; m is the size of the
lP
re
population ; xi is the position.
Definition 3: Distance to center of mass di
ur na
The distance to center of mass di xi
Xc
is expressed as: (8)
e n
Jo
Definition 4: Particle diffusion probability The particle diffusion probability indicates the proportion of successful diffusion of particles that meet the diffusion conditions and is depending on both energy and temperature. Particle diffusion probability 13
Pi
is defined as:
Qi
1 e T Pi d
(9)
where T is temperature.
d
is the distance difference between the particles to
the centroids. (10)
of
B d xi X cenA xi X cen
ro
2.3.2. DP-DT-CAPSO algorithm
The SCCO2/polymer solution system, which have two-phase, one is the gas phase
-p
and the other is the melt phase. Therefore, we used two populations to model
re
the two-phase system. Population A is the gas molecular population and
lP
Population B is the polymer melt molecular population. The operation and function of the two phases are basically the same in the algorithm.
ur na
Taking Population A as an example, in the execution of the DP-DT-CAPSO algorithm, the particles diffusion energy in Population A is first calculated and then the particles diffusion probability is obtained. The particles with higher
Jo
diffusion probability are copied to the candidate Population A, the particle carried the largest diffusion probability is used to update the worst fitness particles in the candidate Population B. Then, the global extremum is obtained. The algorithm is executed as follows: 14
Step 1: Initialization. Population A and Population B are initialized and then the maximum number of iterations are set. The values of some parameters such as population size, particle fitness value, and particle position are randomly set. Step 2: Fitness value calculation. The fitness value of the population particles is
of
calculated.
ro
Step 3: Extremum value update. According to the fitness value of the particle, the
-p
extremum of the individual particle in the population is updated.
re
Step 4: Termination condition calculation. When the number of iterations reaches the maximum value or the convergence accuracy meets the criteria, go to Step 11;
lP
otherwise, go to the next Step.
ur na
Step 5: the diffusion probability of population particles is calculated. Step 6: Candidate population. If the particles diffusion probability is greater than the generated random number, the particles are copied into the candidate
Jo
population.
Step 7: Information sharing and diffusion between population particles. Taking Population A as an example, the particles with the highest diffusion probability are selected from A and the particles with the worst fitness values among the 15
Candidate Population B are replaced. After particle information sharing, the mass transfer is completed. Step 8: Update the global extremum. When the particle's fitness value is better, the global extremum is updated to the current particle.
of
Step 9: Iterative process data are saved. Various groups and candidate sets and
ro
extreme values are saved.
-p
Step 10: Iterative update. Particle attributes and population attributes are
lP
Step 11: The final result is saved.
re
updated.
2.4. Artificial neural network trained by the DP-DT-CAPSO
ur na
Among many artificial neural networks, RBF ANN has unique advantages and is one of the widely concerned models. It has a three-layer structure. In this paper, Gaussian function was selected as the activation function: xk ci
Jo gi ( xk ) exp(
i2
2
(11)
)
16
where xk (1 k n) is the output vector of the K-th iteration; ci (1 i c) is the center of the basis function; i is the expansion factor; n is the number of samples. In the training process of RBF ANN network, three parameters are mainly
of
optimized: the basis function center, the extension constant, and the connection
ro
weight. The network is trained through the iterations of ci , i , wi . The output of
-p
RBF ANN network is defined as: c
O( xk ) wi gi ( xk )
(12)
re
i 1
lP
where wi is the connection weight of the i-th hidden node. In this paper, we used the DP-DT-CAPSO algorithm to train the three parameters
ur na
ci , i , wi The structure of the particle is defined as: . y f (wh,o , h,o , ci )
(13)
Jo
where wh,o (1 h c) (1 o p) is the weight; h,o is the extension factor between the hidden node and output node.
17
A prediction model based on RBF ANN and DP-DT-CAPSO algorithm applied in the prediction of SCCO2 solubility in polymers is developed, called DP-DT-CAPSO RBF ANN. 3. Model establishment
of
3.1. Experimental data
ro
Firstly, 327 sets of data of 8 common polymers in industrial production are used
-p
to establish the model database and the data sources are provided in Table 1.
re
In order to obtain the better performance, in this paper, we divided the experimental data into three subsets randomly. The training set (including 70% of
lP
the data) was used for the model training. The verification set (including 15% of the data) was used to optimize the trained model. The testing set (containing 15%
ur na
of the data) was used to test the the model. Table 2 shows the data distribution statistics of each polymer. 3.2. Model evaluation
Jo
The average relative deviation (ARD), root mean square error of prediction (RMSEP), and squared correlation coefficient (R2) were used to evaluate the performance of the model in this paper.
18
ARD
1 N y i yi N i 1 yi
RMSEP
1 N
(14)
N
( yi yi )2
(15)
i=1
2
N ( yi yave )( yi yave ) R 2 N i=1 N ( yi yave )2 ( yi yave )2
of
i=1
ro
i=1
(16)
yi
and yave are the experimental value and
re
and the predicted average value;
-p
where N is the sample number; yi and yave are respectively the predicted value
3.3. Model structure
lP
the average data of experimental value.
ur na
The proposed model uses a three-layer structure composed of the input layer, the hidden layer and the output layer. The input layer contains two nodes of temperature and pressure. The output layer consists of one node indicating the
Jo
solubility of SCCO2 in the polymer. In order to see the effect of the nodes in the hidden layer on the performance of the model, the number of nodes in the hidden layer is optimized by the heuristic method. The number of nodes is changed from 3 to 13. Therefore, 11 models are obtained. Fig. 1 shows the error curves with different nodes in the hidden layers. 19
As shown in Fig. 1, as the number of nodes increases, the error firstly decreases and then increases. When the number of nodes in the hidden layer is 6, the error is the smallest. Therefore, in this paper, the number of nodes in the hidden layer is 6.
of
4. Results and discussion
ro
4.1. Results of the proposed model
Experiments were performed in Windows 7 SP1 64-bit OS (4.00 GB of memory
-p
and Intel (R) Core ™ i5-4460 processor). Through Matlab 2010a software
re
programing, a three-layer model with the 2-6-1 structure, called DP-DT-CAPSO
lP
RBF ANN, was developed. The proposed model was used to calculate the SSCO2 solubility in 8 polymers. In the training set, Fig. 2 draws the experimental values
ur na
and the predicted values of the proposed model. As shown in Fig. 2, in the training set, the predicted data of 8 polymers are close to the experimental data and the distribution of the predicted data is similar to
Jo
the ideal prediction line. The predicted values were consistent with the experimental values, indicating that the DP-DT-CAPSO RBF ANN model was well trained. Fig. 3 shows the data distribution of the predicted and experimental values in the validation set. The prediction data of the model were close to the 20
experimental data (Fig. 3). The vertical distance from the data point to the straight line indicated that the DP-DT-CAPSO RBF ANN model had the higher prediction accuracy, and had a better correlation. The distribution of the predicted data of the model was close to that of the
of
experimental data (Fig. 4), indicating that the prediction performance was good.
ro
The prediction points in the testing set were relatively more dispersed than
others, indicating that the accuracy in the testing set was slightly lower. Table 3
-p
shows the model performance values for each data set.
re
It can be seen from Table 3 that the model has a better prediction performance,
lP
higher accuracy and correlation in the three sets. Table 4 shows the related data of the model of predicting the SCCO2 solubility in 8 kinds of polymers. The data
ur na
showed that the prediction accuracy of the model was basically the same among 8 polymers. The correlation coefficient was also above 0.99. The model showed
Jo
the better prediction ability and better scalability. 4.2. Comparison with other models The above confirmed that the higher performance of the proposed model in predicting the SCCO2 solubility in polymers. In order to further explore the performance, it is necessary to carry out the comparison between the current 21
work and previous works, especially those ones based on the artificial intelligence and ANN. Therefore, the model proposed in this paper was compared with three types of models used PSO and RBF ANN, such as RBF ANN, PSO BP ANN PSO RBF ANN. The convergence curves of these compared models are shown in Fig. 5.
of
As shown in Fig.5, the models of DP-DT-CAPSO RBF ANN, PSO RBF ANN, PSO BP
ro
ANN and RBF ANN tended to be stable after 200, 400, 300 and 700 times,
respectively. The model proposed in this paper has faster convergence speed. The
-p
prediction accuracy of the models decreased as the following order: DP-DT-CAPSO
re
RBF ANN > PSO RBF ANN > PSO BP ANN > RBF ANN. The accuracy of the
lP
DP-DT-CAPSO RBF ANN model was close to 0. The proposed model is better in terms of convergence accuracy and convergence speed. Fig. 6 shows the
model.
ur na
correlation between predicted and experimental values for each comparison
As shown in Fig. 6, various models have different prediction data distributions and
Jo
the prediction values of the proposed model are closer to the experimental data. In terms of prediction accuracy, the models decreased as the following order: DP-DT-CAPSO RBF ANN > PSO RBF ANN > PSO BP ANN > RBF ANN. The
22
DP-DT-CAPSO RBF ANN model had the highest prediction accuracy. Table 5 provides the statistical parameters of each model. In terms of the ARD and RMSEP data, the prediction accuracy of DP-DT-CAPSO RBF ANN model was better. In term of the correlation coefficient, the
of
DP-DT-CAPSO RBF ANN model was also significantly better than other models.
ro
The above data indicated that the proposed model exhibited the better
performance in terms of accuracy and correlation. Table 6 gives the computation
-p
time of various models.
re
Execution time of DP-DT-CAPSO RBF ANN model was close to that of RBF ANN
lP
model and shorter than that of PSO BP ANN and PSO RBF ANN. In the intelligent algorithm-based model, the introduction of the intelligent algorithm increased
ur na
calculation time. The diffusion theory was introduced in the particle swarm optimization algorithm in this paper. The thermodynamic parameters were recalculated in the iteration, thus resulting in longer calculation time of the model.
Jo
Compared with other models, this model also showed acceptable calculation time.
4.3. Discussion 4.3.1. Correlation of the model 23
The above experiments confirmed the advantages of the DP-DT-CAPSO RBF ANN model in terms of calculation accuracy, correlation and calculation time. Numerous experiments showed that solubility was proportional to the pressure of the system and inversely proportional to temperature. Fig. 7 shows the solubility
of
curves of the proposed model under different temperatures and pressures.
ro
The solubility of SCCO2 in polymers (PP, HDPE, CPEs, PS, PBS and PBSA) increased with the increase in pressure and decreased with the increase in temperature (Fig.
-p
7). The solubility of SCCO2 in polymers (PLLA and PLGA) increased with the
re
increase in pressure. With the increase in temperature, the solubility SCCO2 in
lP
PLLA and PLGA decreased firstly and then became stable. The predicted trend was consistent with the experimental results.
ur na
4.3.2. Stability of the model
According to Table 4 and Table 5, the proposed model has the better prediction ability of SCCO2 solubility in the 8 polymers, displaying the higher accuracy and
Jo
correlation. There is no abnormal prediction point and the model is the better stability. This paper proposes a model by combining thermodynamic diffusion theory with artificial intelligence, such as swarm intelligence algorithms. Compared with previous studies, it has several obvious characteristics. 1) In terms 24
of prediction accuracy and correlation, Table 4 and Table 5 show that the model in this paper has obvious advantages. 2) Empirical correlation exists in all prediction models based on artificial intelligence algorithms. In this paper, through the separation of the training set, verification set and test set, the
of
interference of empirical correlation is minimized. 3) From the efficiency of the intelligent model, according to the calculation time in Table 6, it can be seen that
ro
the efficiency of various models based on intelligent algorithms is better.
-p
4.3.3. Fault tolerance and the anti-jamming ability of the model
re
After the model is trained with many normal data, invalid data or abnormal data
lP
can be avoided by the model. The trained model displays the better fault tolerance and strong anti-jamming ability.
ur na
4.3.4. Expansibility of the model
Solubility prediction experiments confirmed the higher prediction performance of the model. In addition, the model can be extended to other applications such as
Jo
the prediction of other chemical and chemical properties, such as data prediction, data processing and optimization of experimental parameters, quantitative structure-activity relationship, and computer-aided design. The model has good scalability. 25
5. Conclusions In this paper, an improved model was proposed based on diffusion theory, particle swarm optimization and artificial neural network and used to predict the solubility of SCCO2 in 8 polymers. Experimental results showed that the
of
DP-DT-CAPSO-RBF-ANN model was better than other comparison models in terms of prediction accuracy, correlation and calculation time. The model has good
ro
stability and scalability and can be applied in prediction, fitting, and processing of
-p
various types of the data of chemical, physical, biological, and information fields.
ur na
Declaration of interests
lP
methods and prediction models.
re
In the future, we will further explore more efficient theoretical calculation
Jo
☒ The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
ACKNOWLEDGEMENTS
The authors gratefully acknowledge the support from the National Natural Science Foundation of China (Grant Numbers: 51663001).
26
REFERENCES
[1] K. Gong, Panuganti, S.R., Chapman, W.G., Study of solubility and swelling ratio in polymer-CO2 systems using the PC-SAFT equation of state, J. Appl. Polym.
of
Sci. 134 (2017). [2] K.A. Rabaeh, Al-Ajaleen, M.S., Abuzayed, M.H., Aldweri, F.M., Eyadeh, M.M.,
ro
High dose sensitivity of N-(isobutoxymethyl)acrylamide polymer gel
-p
dosimeters with improved monomer solubility using acetone co-solvent,
re
Nuclear Instruments & Methods in Physics Research Section B-Beam Interactions with Materials and Atoms. 442 (2019) 67-72.
lP
[3] T.V. Duncan, Pillai, K., Release of Engineered Nanomaterials from Polymer
ur na
Nanocomposites: Diffusion, Dissolution, and Desorption, Acs Appl. Mater. Interfaces. 7 (2015) 2-19.
[4] S. Khatsee, Daranarong, D., Punyodom, W., Worajittiphon, P., Electrospinning
Jo
polymer blend of PLA and PBAT: Electrospinnability-solubility map and effect of polymer solution parameters toward application as antibiotic-carrier mats, J. Appl. Polym. Sci. 135 (2018).
[5] M. Ota, Sato, Y., Smith, R.L., Inomata, H., Predictive dimensionless solubility (pDS) model for solid solutes in supercritical CO2 that requires only 27
pure-component physical properties, Chem. Eng. Res. Des. 136 (2018) 251-261. [6] G. Guerin, Molev, G., Pichugin, D., Rupar, P.A., Qi, F., Cruz, M., Manners, I., Winnik, M.A., Effect of Concentration on the Dissolution of One-Dimensional
of
Polymer Crystals: A TEM and NMR Study, Macromolecules. 52 (2019) 208-216.
ro
[7] J. Sun, Wang, H., Chen, M., Ye, J., Jiang, B., Wang, J., Yang, Y., Ren, C., Solubility
-p
measurement of hydrogen, ethylene, and 1-hexene in polyethylene films
re
through an intelligent gravimetric analyzer, J. Appl. Polym. Sci. 134 (2017). [8] A. Baghban, Sasanipour, J., Habibzadeh, S., Zhang, Z.E., Sulfur dioxide solubility
lP
prediction in ionic liquids by a group contribution - LSSVM model, Chem. Eng.
ur na
Res. Des. 142 (2019) 44-52.
[9] J. Koszinowski, Diffusion and solubility of n-alkanes in polyolefines (Retraction of vol 32, pg 4765, 1986), J. Appl. Polym. Sci. 133 (2016).
Jo
[10] A. Aftab, Shariff, A.M., Garg, S., Lal, B., Shaikh, M.S., Faiqa, N., Solubility of CO2 in aqueous sodium beta-alaninate: Experimental study and modeling using Kent Eisenberg model, Chem. Eng. Res. Des. 131 (2018) 385-392.
[11] T. Cowen, Karim, K., Piletsky, S.A., Solubility and size of polymer nanoparticles, Polym. Chem. 9 (2018) 4566-4573. 28
[12] P. Zhou, Kasuga, T., Dissolution behavior of Mg/Si-doped vaterite particles in biodegradable polymer composites, Express Polymer Letters. 12 (2018) 171-179. [13] H. Ziaee, Hosseini, S.M., Sharafpoor, A., Fazavi, M., Ghiasi, M.M., Bahadori, A.,
of
Prediction of solubility of carbon dioxide in different polymers using support vector machine algorithm, J. Taiwan Inst. Chem. Eng. 46 (2015) 205-213.
ro
[14] J.A. Lazzús, Cuturrufo, F., Pulgar-Villarroel, G., Salfate, I., Vega, P., Estimating
-p
the Temperature-Dependent Surface Tension of Ionic Liquids Using a Neural
re
Network-Based Group Contribution Method, Ind. Eng. Chem. Res. 56 (2017) 6869-6886.
lP
[15] J.A. Lazzus, Neural network-particle swarm modeling to predict thermal
ur na
properties, Math. Comput. Model. 57 (2013) 2408-2418. [16] E. Hosaini-Alvand, Mirshekar, H., Khorasani, M.T., Parvazinia, M., Joorabloo, A., Fabricating and robust artificial neural network modeling nanoscale
Jo
polyurethane fiber using electrospinning method, J. Appl. Polym. Sci. 134 (2017).
[17] M.S. Giri Nandagopal, Selvaraju, N., Prediction of Liquid–Liquid Flow Patterns in a Y-Junction Circular Microchannel Using Advanced Neural Network Techniques, Ind. Eng. Chem. Res. 55 (2016) 11346-11362. 29
[18] B. Mehdizadeh, Movagharnejad, K., A comparison between neural network method and semi empirical equations to predict the solubility of different compounds in supercritical carbon dioxide, Fluid. Phase. Equilibr. 303 (2011) 40-44.
of
[19] S.L. Qiu, Liu, Q.H., Zhou, S.J., Wu, C.J., Review of Artificial Intelligence Adversarial Attack and Defense Technologies, App.l Sci-Basel. 9 (2019).
ro
[20] M. Lashkarbolooki, Bayat, M., Prediction of surface tension of liquid normal
-p
alkanes, 1-alkenes and cycloalkane using neural network, Chem. Eng. Res. Des.
re
137 (2018) 154-163.
[21] M.S. Li, Zhang, H.J., Chen, B.S., Wu, Y., Guan, L.X., Prediction of pKa Values for
ur na
Rep. 8 (2018) 3991.
lP
Neutral and Basic Drugs based on Hybrid Artificial Intelligence Methods, Sci.
[22] J.A. Lazzus, Prediction of flammability limit temperatures from molecular structures using a neural network-particle swarm algorithm, J. Taiwan Inst.
Jo
Chem. Eng. 42 (2011) 447-453. [23] K. Yao, Herr, J.E., Toth, D.W., McKintyre, R., Parkhill, J., The TensorMol-0.1 model chemistry: a neural network augmented with long-range physics, Chem. Sci. 9 (2018) 2261-2269.
30
[24] Y. Bakhbakhi, Neural network modeling of ternary solubilities of 2-naphthol in supercritical CO2: A comparative study, Math. Comput. Model. 55 (2012) 1932-1941. [25] A.Z. Hezave, Raeissi, S., Lashkarbolooki, M., Estimation of Thermal
of
Conductivity of Ionic Liquids Using a Perceptron Neural Network, Ind. Eng. Chem. Res. 51 (2012) 9886-9893.
ro
[26] F. Gharagheizi, Eslamimanesh, A., Mohammadi, A.H., Richon, D., Artificial
-p
Neural Network Modeling of Solubilities of 21 Commonly Used Industrial Solid
re
Compounds in Supercritical Carbon Dioxide, Ind. Eng. Chem. Res. 50 (2011) 221-226.
lP
[27] A. Eslamimanesh, Gharagheizi, F., Mohammadi, A.H., Richon, D., Artificial
ur na
Neural Network modeling of solubility of supercritical carbon dioxide in 24 commonly used ionic liquids, Chem. Eng. Sci. 66 (2011) 3039-3044. [28] H. Pahlavanzadeh, Nourani, S., Saber, M., Experimental analysis and modeling
Jo
of CO2 solubility in AMP (2-amino-2-methyl-1-propanol) at low CO2 partial pressure using the models of Deshmukh-Mather and the artificial neural network, J. Chem. Thermodyn. 43 (2011) 1775-1783.
[29] S. Al-Shihabi, Arafeh, M., Barghash, M., An improved hybrid algorithm for the set covering problem, Comput. Ind. Eng. 85 (2015) 328-334. 31
[30] J.A. Lazzus, Hybrid Particle Swarm-Ant Colony Algorithm to Describe the Phase Equilibrium of Systems Containing Supercritical Fluids with Ionic Liquids, Communications in Computational Physics. 14 (2013) 107-125. [31] N. Kalaiarasi, Dash, S.S., Padmanaban, S., Paramasivam, S., Morati, P.K.,
of
Maximum Power Point Tracking Implementation by Dspace Controller Integrated Through Z-Source Inverter Using Particle Swarm Optimization
ro
Technique for Photovoltaic Applications, App.l Sci-Basel. 8 (2018).
-p
[32] M.S. Li, Zhang, H.J., Liu, L., Chen, B.S., Guan, L.X., Wu, Y., A Quantitative
re
Structure-Property Relationship Model Based on Chaos-Enhanced Accelerated Particle Swarm Optimization Algorithm and Back Propagation
lP
Artificial Neural Network, App.l Sci-Basel. 8 (2018) 1121.
ur na
[33] L.V. Pavao, Borba Costa, C.B., Ravagnani, M.A.S.S., Heat Exchanger Network Synthesis without stream splits using parallelized and simplified simulated Annealing and Particle Swarm Optimization, Chem. Eng. Sci. 158 (2017)
Jo
96-107.
[34] S. Arefi-Oskoui, Khataee, A., Vatanpour, V., Modeling and Optimization of NLDH/PVDF Ultrafiltration Nanocomposite Membrane Using Artificial Neural Network-Genetic Algorithm Hybrid, ACS Combinatorial Science. 19 (2017) 464-477. 32
[35] X.G. Liu, Zhao, C.Y., Melt index prediction based on fuzzy neural networks and PSO algorithm with online correction strategy, Aiche J. 58 (2012) 1194-1202. [36] J.A. Lazzus, Ponce, A., Chilla, L., Application of particle swarm optimization to
of
model the phase equilibrium of complex mixtures, Fluid. Phase. Equilibr. 317 (2012) 132-139.
ro
[37] A. Khajeh, Modarress, H., Prediction of solubility of gases in polystyrene by
-p
Adaptive Neuro-Fuzzy Inference System and Radial Basis Function Neural
re
Network, Expert. Syst. Appl. 37 (2010) 3070-3074.
[38] M.A. Hussain, Aroua, M.K., Yin, C.Y., Rahman, R.A., Ramli, N.A., Hybrid neural
lP
network for prediction of CO2 solubility in monoethanolamine and
ur na
diethanolamine solutions, Korean J. Chem. Eng. 27 (2010) 1864-1867. [39] M.S. Li, Huang, X.Y., Liu, H.S., Liu, B.X., Wu, Y., Prediction of the gas solubility in polymers by a radial basis function neural network based on chaotic
Jo
self-adaptive particle swarm optimization and a clustering method, J. Appl. Polym. Sci. 130 (2013) 3825-3832.
[40] M.S. Li, Huang, X.Y., Liu, H.S., Liu, B.X., Wu, Y., Deng, X.Z., Solubility prediction of gases in polymers using fuzzy neural network based on particle swarm
33
optimization algorithm and clustering method, J. Appl. Polym. Sci. 129 (2013) 3297-3303. [41] Y. Wu, Liu, B.X., Li, M.S., Tang, K.Z., Wu, Y.B., Prediction of CO2 Solubility in Polymers by Radial Basis Function Artificial Neural Network Based on Chaotic
of
Self-adaptive Particle Swarm Optimization and Fuzzy Clustering Method, Chin. J. Chem. 31 (2013) 1564-1572.
ro
[42] M. Li, Huang, X., Liu, H., Liu, B., Wu, Y., Wang, L., Solubility prediction of
-p
supercritical carbon dioxide in 10 polymers using radial basis function artificial
re
neural network based on chaotic self-adaptive particle swarm optimization and K-harmonic means, RSC. Adv. 5 (2015) 45520-45527.
lP
[43] M. Li, Wu, W., Chen, B., Wu, Y., Huang, X., Solubility prediction of gases in
ur na
polymers based on an artificial neural network: a review, RSC. Adv. 7 (2017) 35274-35282.
[44] M.S. Li, Huang, X.Y., Liu, H.S., Liu, B.X., Wu, Y., Ai, F.R., Solubility Prediction of
Jo
Gases in Polymers based on Chaotic Self-adaptive Particle Swarm Optimization Artificial Neural Networks, Acta Chim. Sinica. 71 (2013) 1053-1058.
[45] M.S. Li, Huang, X.Y., Liu, H.S., Liu, B.X., Wu, Y., Xiong, A.H., Dong, T.W., Prediction of gas solubility in polymers by back propagation artificial neural 34
network based on self-adaptive particle swarm optimization algorithm and chaos theory, Fluid. Phase. Equilibr. 356 (2013) 11-17. [46] J. Kennedy, Eberhart, R., Particle swarm optimization, in: 1995 IEEE International Conference on Neural Networks Proceedings, Proceedings of
of
ICNN'95 - International Conference on Neural Networks, IEEE Australia Council, Perth, 1995, pp. 1942-1948.
ro
[47] A. Khajeh, Modarress, H., Mohsen-Nia, M., Solubility prediction for carbon
-p
dioxide in polymers by artificial neural network, Iran. Polym. J. 16 (2007)
re
759-768.
[48] Y. Sato, Fujiwara, K., Takikawa, T., Sumarno, Takishima, S., Masuoka, H.,
lP
Solubilities and diffusion coefficients of carbon dioxide and nitrogen in
ur na
polypropylene, high-density polyethylene, and polystyrene under high pressures and temperatures, Fluid. Phase. Equilibr. 162 (1999) 261-276. [49] Z.G. Lei, Ohyabu, H., Sato, Y., Inomata, H., Smith, R.L., Solubility, swelling
Jo
degree and crystallinity of carbon dioxide-polypropylene system, J. Supercrit. Fluid. 40 (2007) 452-461.
[50] D.C. Li, Liu, T., Zhao, L., Yuan, W.K., Solubility and Diffusivity of Carbon Dioxide in Solid-State Isotactic Polypropylene by the Pressure-Decay Method, Ind. Eng. Chem. Res. 48 (2009) 7117-7124. 35
[51] E. Aionicesei, Skerget, M., Knez, Z., Measurement of CO2 solubility and diffusivity in poly(L-lactide) and poly(D,L-lactide-co-glycolide) by magnetic suspension balance, J. Supercrit. Fluid. 47 (2008) 296-301. [52] M. Skerget, Mandzuka, Z., Aionicesei, E., Knez, Z., Jese, R., Znoj, B., Venturini,
of
P., Solubility and diffusivity of CO2 in carboxylated polyesters, J. Supercrit. Fluid. 51 (2010) 306-311.
ro
[53] Y. Sato, Yurugi, M., Fujiwara, K., Takishima, S., Masuoka, H., Solubilities of
-p
carbon dioxide and nitrogen in polystyrene under high temperature and
re
pressure, Fluid. Phase. Equilibr. 125 (1996) 129-138.
[54] S. Hilic, Boyer, S., Padua, A., Grolier, J., Simultaneous measurement of the
lP
solubility of nitrogen and carbon dioxide in polystyrene and of the associated
ur na
polymer swelling, J. Polym. Sci. B-Polym. Phys. 39 (2001) 2063-2070. [55] Y. Sato, Takikawa, T., Takishima, S., Masuoka, H., Solubilities and diffusion coefficients of carbon dioxide in poly(vinyl acetate) and polystyrene, J.
Jo
Supercrit. Fluid. 19 (2001) 187-198. [56] Y. Sato, Takikawa, T., Sorakubo, A., Takishima, S., Masuoka, H., Imaizumi, M., Solubility and diffusion coefficient of carbon dioxide in biodegradable polymers, Ind. Eng. Chem. Res. 39 (2000) 4813-4819.
36
of ro -p re
Jo
ur na
mean squared error
lP
Fig. 1. Correlations between the number of nodes of in the hidden layer and
37
of ro -p
Jo
ur na
lP
re
Fig. 2 Experimental values and predicted values in the training set.
Fig. 3 Experimental values and predicted values in the validation set.
38
of ro
-p
Fig. 4. Relationship between predicted values and experimental data in the
Jo
ur na
lP
re
testing set.
Fig. 5 Curve of mean squared error versus iteration number
39
of ro -p
Jo
ur na
lP
re
Fig. 6 Predicted values VS. experimental data.
40
of ro -p re lP ur na Jo
Fig. 7. Solubility variations with pressure and temperature
41
Table 1 Sources and statistics of experimental data in this work T (K)
P (MPa)
Solubility(g/g)
Data points
References
PP
313.20-483.70
7.400-24.910
0.03950-0.26170
67
[47-50]
PLLA
308.00-323.00
9.620-31.460
0.16520-0.43010
27
[51]
HDPE
433.15-473.20
10.731-18.123
0.00551-0.12296
20
[47, 48]
CPEs
306.00-344.00
10.150-31.020
0.09840-0.63660
56
[52]
PS
338.22-473.15
7.540-44.410
0.02641-0.16056
70
[47, 53-55]
PBS
323.15-453.15
8.008-20.144
0.04534-0.17610
31
PBSA
323.15-453.16
7.870-20.128
0.04763-0.17411
29
PLGA
308.00-323.00
10.140-31.470
0.09030-0.29630
27
Total
306.00-483.70
7.400-44.410
0.00551-0.63660
327
PP
47
PLLA
19
HDPE
14
CPEs PS PBS PBSA
ro
[47, 56]
-p
[51]
Testing
Total
10
10
67
4
4
27
3
3
20
38
9
9
56
50
10
10
70
21
5
5
31
21
4
4
29
19
4
4
27
Jo
PLGA
Validation
lP
Training
[47, 56]
ur na
Polymer
re
Table 2 Distribution of experimental data
of
Polymer
42
Table 3 Values of ARD, R2, and RMSEP for different subsets. ARD
R2
RMSEP
Training
0.0036
0.9973
0.0147
Validation
0.0035
0.9971
0.0152
Testing
0.0037
0.9966
0.0157
Average
0.0036
0.9970
0.0152
of
Subset
ro
Table 4 Values of ARD, R2, and RMSEP for different polymers in the testing set. Polymer
ARD
R2
PP
0.0031
0.9972
PLLA
0.0037
0.9966
0.0138
HDPE
0.0042
0.9967
0.0137
CPEs
0.0037
PS
0.0037
PBS
re
-p
RMSEP 0.0147
0.0152
0.9964
0.0146
0.0042
0.9967
0.0146
PBSA
0.0036
0.9966
0.0152
PLGA
0.0033
0.9968
0.0146
Jo
ur na
lP
0.9973
43
Table 5 Statistical parameters of various models CPEs
PS
PBS
PBSA
PLGA
Average
RBF ANN
0.0112
0.0096
0.0113
0.0104
0.0106
0.0121
0.0111
0.0098
0.0108
PSO BP ANN
0.0078
0.0076
0.0086
0.0081
0.0068
0.0089
0.0088
0.0075
0.0080
PSO RBF ANN
0.0065
0.0064
0.0072
0.0056
0.0064
0.0078
0.0076
0.0063
0.0067
DP-DT-CAPSO RBF ANN
0.0023
0.0031
0.0042
0.0037
0.0034
0.0041
0.0038
0.0031
0.0035
RBF ANN
0.9589
0.9613
0.9621
0.9558
0.9642
0.9616
0.9643
0.9621
0.9613
PSO BP ANN
0.9853
0.9842
0.9823
0.9813
0.9811
0.9911
0.9813
0.9821
0.9836
PSO RBF ANN
0.9875
0.9876
0.9871
0.9856
0.9845
0.9842
0.9852
0.9834
0.9856
DP-DT-CAPSO RBF ANN
0.9972
0.9963
0.9966
0.9971
RBF ANN
0.0664
0.0589
0.0687
0.0921
PSO BP ANN
0.0482
0.0472
0.0399
PSO RBF ANN
0.0421
0.0423
0.0421
DP-DT-CAPSO RBF ANN
0.0149
0.0133
0.0137
ro
of
HDPE
0.9968
0.9962
0.9966
0.9967
0.9967
0.0887
0.0961
0.0952
0.0872
0.0817
-p
RMSEP
PLLA
0.0467
0.0466
0.0512
0.0514
0.0488
0.0475
0.0442
0.0475
0.0468
0.0476
0.0472
0.0450
0.0142
0.0146
0.0149
0.0147
0.0144
re
R2
PP
lP
ARD
Model
0.0145
Table 6 Computation time of various models
RBF ANN PSO BP ANN PSO RBF ANN
ur na
Model
Computation Time(S) 32 41 43 27
Jo
DP-DT-CAPSO RBF ANN
44