Journal Pre-proof The cytotoxicity of nanomaterials: Modeling multiple human cells uptake of functionalized magneto-fluorescent nanoparticles via nano-QSAR Ronghua Qi, Yong Pan, Jiakai Cao, Zhenhua Jia, Juncheng Jiang PII:
S0045-6535(20)30368-4
DOI:
https://doi.org/10.1016/j.chemosphere.2020.126175
Reference:
CHEM 126175
To appear in:
ECSN
Received Date: 2 October 2019 Revised Date:
4 February 2020
Accepted Date: 9 February 2020
Please cite this article as: Qi, R., Pan, Y., Cao, J., Jia, Z., Jiang, J., The cytotoxicity of nanomaterials: Modeling multiple human cells uptake of functionalized magneto-fluorescent nanoparticles via nanoQSAR, Chemosphere (2020), doi: https://doi.org/10.1016/j.chemosphere.2020.126175. This is a PDF file of an article that has undergone enhancements after acceptance, such as the addition of a cover page and metadata, and formatting for readability, but it is not yet the definitive version of record. This version will undergo additional copyediting, typesetting and review before it is published in its final form, but we are providing this version to give early visibility of the article. Please note that, during the production process, errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain. © 2020 Published by Elsevier Ltd.
Author Contributions Data curation, Ronghua Qi; Formal analysis, Ronghua Qi and Jiakai Cao; Funding acquisition, Yong Pan; Methodology, Ronghua Qi and Jiakai Cao; Software, Ronghua Qi; Validation, Jiakai Cao and Yong Pan; Writing–original draft, Ronghua Qi and Yong Pan; Writing–review & editing, Zhenhua Jia, Juncheng Jiang and Yong Pan.
1
The cytotoxicity of nanomaterials: Modeling multiple human cells
2
uptake of functionalized magneto-fluorescent nanoparticles via
3
nano-QSAR
4
Ronghua Qi 1, Yong Pan1,*, Jiakai Cao1, Zhenhua Jia 2, Juncheng Jiang 1
5
1. Jiangsu Key Laboratory of Hazardous Chemicals Safety and Control, College of Safety Science
6
and Engineering, Nanjing Tech University, Nanjing, 210009, China.
7
2. Institute of Advanced Synthesis, School of Chemistry and Molecular Engineering, Nanjing Tech
8
University, Nanjing, 210009, China.
9
* Corresponding author: Yong Pan
10
Tel.: +86-25-58139873.
11
E-mail address:
[email protected] (Y. Pan).
12
Postal address: 30 South Puzhu Road, Jiangbei new district, Nanjing 211816, P.R.China.
13
Declarations of interest: None
14
I
1
The cytotoxicity of nanomaterials: Modeling multiple human cells
2
uptake of functionalized magneto-fluorescent nanoparticles via
3
nano-QSAR
4
Ronghua Qi 1, Yong Pan1,*, Jiakai Cao1, Zhenhua Jia 2, Juncheng Jiang 1
5
1. Jiangsu Key Laboratory of Hazardous Chemicals Safety and Control, College of Safety Science
6
and Engineering, Nanjing Tech University, Nanjing, 210009, China.
7
2. Institute of Advanced Synthesis, School of Chemistry and Molecular Engineering, Nanjing Tech
8
University, Nanjing, 210009, China.
9
* Corresponding author. Tel.: +86-25-58139873.
10
E-mail address:
[email protected] (Y. Pan).
11
Abstract
12
The vast majority of nanomaterials have attracted an upsurge of interest since their discovery and
13
considerable researches are being carried out about their adverse outcomes for human health and
14
the environment. In this study, two regression-based quantitative structure-activity relationship
15
models for nanoparticles (nano-QSAR) were established to predict the cellular uptakes of 109
16
functionalized magneto-fluorescent nanoparticles to pancreatic cancer cells (PaCa2) and human
17
umbilical vein endothelial cells (HUVEC) lines, respectively. The improved SMILES-based
18
optimal descriptors encoded with certain easily available physicochemical properties were
19
proposed to describe the molecular structure characteristics of the involved nanoparticles, and the
20
Monte Carlo method was used for calculating the improved SMILES-based optimal descriptors.
21
Both developed nano-QSAR models for cellular uptake prediction provided satisfactory statistical
22
results, with the squared correlation coefficient (R2) being 0.852 and 0.905 for training sets, and
1
23
0.822 and 0.885 for test sets, respectively. Both models were rigorously validated and further
24
extensively compared to literature models. Predominant physicochemical features responsible for
25
cellular uptake were identified by model interpretation. The proposed models could be reasonably
26
expected to provide guidance for synthesizing or choosing safer, more suitable surface modifiers
27
of desired properties prior to their biomedical applications.
28
Keywords
29
Cellular uptake; Nanoparticles; Cytotoxicity; Nano-QSAR; The improved SMILES-based optimal
30
descriptors
31
1. Introduction
32
With the development of nanotechnology (Abbasi et al., 2017; Mortazavi-Derazkola et al., 2017;
33
Zinatloo-Ajabshir et al., 2017; Khojasteh et al., 2019), the design and synthesis of nanoscale
34
materials with novel and extraordinary properties have been extensively advancing for over half a
35
century (Salavati-Niasari, 2004; Salavati-Niasari et al., 2006; Davar et al., 2010; Yousefi et al.,
36
2011; Mir et al., 2012; Mohandes and Salavati-Niasari, 2014; Safardoust-Hojaghan and
37
Salavati-Niasari, 2017), leading to a veritable explosion of nanotechnology in the fields of
38
electronics (Duttaa et al., 2018), catalysis (Wu et al., 2018), adsorbent (Ardekani et al., 2017;
39
Bazrafshan et al., 2017; Sadeghfar et al., 2018; Dil et al., 2019), biosensor(Bahrani et al., 2018),
40
biomedicine (El-Sayed et al., 2006; Bajpai et al., 2018), and other industrial application (Auffan et
41
al., 2009; Dong et al., 2014; Jiang et al., 2015). Metal oxide nanoparticles (MNPs) are one of the
42
most popular used nanomaterials due to their unique small-size effect, high surface reactivity, high
43
surface-to-volume ratio, and quantum-size effect. However, concerns had also been raised about
44
the potential adverse impacts on the ecosystem and living organisms when MNPs interact with
2
45
biological systems (Mattsson et al., 2016). MNPs may intrude into the human organism through
46
multiple entries such as inhalation, skin absorption and ingestion due to either intentional or
47
unintentional exposure to them. Once MNPs have entered into the human body, nanoparticles are
48
wrapped by biomacromolecules, including sugars, proteins, lipids, and nucleic acids (Shang et al.,
49
2014.), followed by cellular uptake.
50
The cellular uptake of MNPs depends on their physicochemical features, thus the intracellular
51
targeting and trafficking could be optimized by fine-tuning the MNP’s physicochemical properties,
52
which helps to synthesize efficient and novel nanomaterials (Petros et al., 2010; Pridgen et al.,
53
2015). The increased cellular uptake to normal cells may contribute to an enhanced interaction of
54
MNPs with subcellular organelles, which in turn directly induces tissue damage caused by
55
overloading and thus reinforces the cytotoxicity. The dominant reason is that the cytotoxicity
56
degree of the MNPs has some cellular absorption dependence to a great extent (Geys et al., 2008).
57
The high toxicity of MNPs appears as higher cellular uptake and vice versa (Singh and Ramarao,
58
2013). Accordingly, for the cells around treated area, lower cellular uptake is needed to exhibit low
59
toxicity, while for specific targeting of cancer cells, higher cellular uptake is required to cure
60
diseases. Consequently, owing to the desired features of functionalized MNPs and their ability for
61
cellular penetration for specific targeting of cancer cells, the MNPs have been widely employed in
62
biomedical applications for DNA structure detecting, drug-delivery carriers, molecular imaging,
63
gene delivery, fluorescent biological labels and so on (Agasti et al., 2010; Tian et al., 2016;
64
Oroojalian et al., 2017; Gai et al., 2018; Liu et al., 2019).
65
In general, experimental in vivo toxicological studies and in vitro short-term cell-based assays
66
are the basic methods to perform the preliminary evaluation of the toxic effects (Pan et al., 2016).
3
67
However, considering a great number of newly functionalized nanoparticles are widely applicable
68
for the biological fields as well as the high cost and long periodicity required for determining their
69
absorption amount in different cell lines, it is impossible to measure the toxicity for all the
70
available nanoparticles. Consequently, there is a strong demand to utilize computational in silico
71
methods flexibly to obtain certain missing data of corresponding substances. In particular, the
72
Quantitative Structure-Activity Relationship (QSAR) approach is one of the most prospective
73
methods that could predict their bioactivity accurately in a simple and efficient way prior to their
74
synthesis (Kar and Roy, 2010). QSAR is primarily based on the assumption that there is a certain
75
correlation between the variation in biological activity of chemicals and the changes in their
76
molecular structures. More recently, a great-increasing number of investigations showed that it is
77
really urgent and necessary to extend the traditional QSAR paradigm to nano-sized materials and
78
to develop “nano-QSAR” models for relating the properties of interest to structure information of
79
newly synthesized nanoparticles as well as providing theoretical guidance for designing
80
functionalized nanoparticles with expected features (Le et al., 2012; Winkler et al., 2013).
81
Current studies on biological interaction and potential risks of the omnipresent exposure to
82
MNPs are not consistent with the rapid developments of nanotechnology and are still very limited
83
(Cheng et al., 2013), but the concerns towards the potential impact of these MNPs on organisms
84
are growing. Recently, various nano-QSAR studies have been conducted on cellular uptake
85
predictions for 109 functionalized magneto-fluorescent MNPs in different cell lines. All MNPs
86
possessed one superparamagnetic core and dextran coating but bearing different biological surface
87
modifications (Weissleder et al., 2005). Fourches et al. (2010) firstly developed nano-QSAR
88
models to describe the cellular uptake to pancreatic cancer cells (PaCa2) via employing kNN
4
89
modeling approach as well as MOE molecular descriptors with the average squared correlation
90
coefficient for the test set being of 0.72. Subsequently, Kar et al. (2014) developed a more accurate
91
cellular uptake model with 6 simply computable and interpretable descriptors for the same cell line
92
by employing the partial least squares (PLS) regression method. Winkler et al. (2014) generated
93
four nano-QSAR models to predict the uptake of PaCa2 and human umbilical vein endothelial
94
cells (HUVEC) nanoparticle by employing both linear and nonlinear methods. Ojha et al. (2019)
95
established nano-QSAR models as well as quantitative inter cell line uptake specificity modeling
96
(QICLUS) using only 2D descriptors on the cellular uptake prediction of 109 MNPs towards
97
PaCa2, HUVEC, and human macrophage cells (U937) lines, respectively. Toropov et al. (2013)
98
also developed reliable nano-QSAR models by employing the SMILES-based optimal descriptors
99
that molecular descriptors were computed directly from each SMILES (Simplified Molecular
100
Input-Line Entry System) attribute fragment of chemicals in view of nanoparticles uptake in
101
PaCa2 for five described random events. However, in most existing studies, the effect of the
102
molecular descriptors on cytotoxicity is directly extracted from nano-QSAR models, except for
103
Toropov et al. (2013)’s work. Different from traditional molecular descriptors (i.e.,
104
quantum-chemical descriptors, physicochemical descriptors), the SMILES-based optimal
105
descriptors proposed by Toropova and Toropov, have been successfully applied to predict the
106
toxicity of organic and nanometer materials (Toropov et al., 2011, 2013; Toropov and Toropova,
107
2015; Toropova and Toropov, 2013, 2017; Toropova et al., 2015, 2016; Trinh et al., 2018; Choi et
108
al., 2019). In 2016, on the basis of the SMILES-based optimal descriptors, our group (Pan et al.,
109
2016) proposed the improved SMILES-based optimal descriptors by integrating codes of certain
110
basic physicochemical properties into the traditional SMILES to improve the characterization of
5
111
nanostructure information. The resulted models developed based on the improved descriptors
112
showed obvious superiority comparing with traditional models and could also imply the direct
113
effect of descriptors on cytotoxicity (Pan et al., 2016).
114
In this study, the improved SMILES-based optimal descriptors encoded with several easily
115
available physicochemical properties were employed to describe the molecular structure
116
characteristics of 109 functionalized magneto-fluorescent MNPs. Then, for the first time, two
117
nano-QSAR models for predicting cellular uptakes of 109 diverse MNPs with surface modifiers in
118
HUVEC and PaCa2 lines were developed respectively, following the Organization for Economic
119
Cooperation and Development (OECD) principles for QSAR modeling. Both developed models
120
were then rigorously validated utilizing internal and external validation procedures and their
121
applicability domains were also defined. The resulting models in this study would be expected to
122
provide guidance for synthesizing or choosing safe and suitable organic coatings with desired
123
characteristics for MNPs prior to their biomedical applications.
124
2. Materials and methods
125
2.1. Data set
126
The experimental values of the cellular uptake of 109 nanoparticles on one predominant metal
127
core platform but bearing different organic coatings (small organic molecules) were taken from the
128
previous literature (Weissleder et al., 2005) and presented in Table S1 (ESI†). All newly
129
synthesized nanoparticles in the dataset shared the same-structure core of 3-nm (Fe2O3)n(Fe3O4)m
130
monocrystalline by multivalent attachment of different small molecules. In order to test the
131
absorption of these MNPs in different cell lines, fluorescein isothiocyanate (FITC) molecules were
132
appended to their surfaces to make MNPs magneto-fluorescent. The MNPs were filtered against
6
133
three different types of cell and two different physiological states of a given cell type, including
134
human pancreatic ductal adenocarcinoma cells (PaCa2), primary resting human macrophages
135
(RestMph), granulocyte macrophage colony stimulating factor-stimulated human macrophages
136
(GMCSF_Mph), human macrophage-like cell line (U937), and human umbilical vein endothelial
137
cells (HUVEC). Among the five tested cell lines, only HUVEC and PaCa2 lines revealed
138
substantial diversity in cellular uptake of the different MNPs. Non-significant variation in cellular
139
uptake was highlighted in other three macrophage or macrophage-like cell lines. Thus in this study,
140
the data of HUVEC uptake and PaCa2 uptake were employed for the model development. The
141
selected endpoint (cellular uptake) is expressed as the decadic logarithm of the concentration (pM)
142
of nanoparticles per cell (log10[NP]/cell) (Fourches et al., 2010), which varies from 2.23 to 4.44
143
for PaCa2 line, and 2.15 to 4.76 for HUVEC line, respectively.
144
2.2. Dataset splitting
145
Division of training and test sets is a necessary step in the development of reliable predictive
146
nano-QSAR models. Prior to the QSAR modeling, all the 109 nanoparticles in the entire dataset
147
were randomly split into a training set with 87 data (80% of the dataset) and a test set with 22 data
148
(20% of the dataset). The training set was employed to develop the nano-QSAR models, while the
149
test set was used to evaluate the ability of the developed models on predicting the invisible data.
150
2.3. Molecular descriptors calculation
151
The calculation of molecular descriptors for the involved nanoparticles is crucial in the
152
nano-QSAR modeling. Since all the nanoparticles have the same metal core and different organic
153
surface modifiers belonging to the class of anhydride (mostly cyclic), amine and amino acid, their
154
differences in cellular uptake are mostly caused by the diversities of molecular species on the
7
155
nanoparticle surface. Accordingly, each nanoparticle could be represented by its structure of
156
organic surface modification.
157
In this study, based on our previous proposed descriptors, the global SMILES attributes (NOSP),
158
which are defined as indicators of the presence or the absence of four chemical atoms: nitrogen,
159
oxygen, sulfur, and phosphorus (Toropov et al., 2011), were considered and employed to further
160
improve the SMILES-based optimal descriptors (DCW), which are described as following: DCWthreshold, N = CWS + CWSS + CWC + CWCC + CWNOSP
(1)
161
Where, Sk and SSk are local SMILES attributes obtained from the SMILES; CW(Sk) and
162
CW(SSk) are the correlation weights of the attributes. Similarly, Ck and CCk are codes of k-th
163
structure features; CW(Ck) and CW(CCk) are the correlation weights for these features. NOSP are
164
global SMILES attributes which are extracted from the SMILES notation.
165
Considered here codes of features can be defined by the following scheme:
166
(I) Standardization of feature Xk based on the formula NormX =
minX + X minX + maxX
(2)
167
(II) After normalized calculation, each physicochemical feature was classified into one category
168
between categories 1 and 9 according to the corresponding scale (Toropova et al., 2013). (Fig. S2,
169
ESI†). Alphabet characters of A, B, D, E, M were expressed as molecular weight, mass percentage
170
of carbon atoms, topological polar surface area, ring count and atom count, respectively.
171
The local SMILES attributes of Sk (or Ck) and SSk (or CCk) could be represented as: "ABDEM":"A","B","D","E","M"; "ABDEM":"AB","BD","DE","EM";
172
The assigned codes were binary combinations of capital letters and numbers (i.e., A0, A1,… )
8
173
and the randomly selected letters couldn’t repeat the same alphabet characters for atoms in
174
SMILES structures.
175
The optimal descriptors are then calculated with the correlation weights of active
176
physicochemical features by the Monte Carlo optimization. The Monte Carlo optimization method
177
was applied to find the preferable values of threshold T* and epoch N* corresponding to the
178
maximum correlation coefficients for the test set (Toropov et al., 2011). The threshold and Nepoch
179
are parameters of the Monte Carlo optimization. The threshold is a tool to define two classes of
180
molecular features: noise and active. The Nepoch is the number of epochs of the Monte Carlo
181
optimization.
182 183
The nano-QSAR model for predicting cellular uptake can be represented by the following equation: log-. /NP0/cell = C0 + C1 × DCWT ∗ , N ∗
(3)
184
Where, C0 and C1 are the intercept and slope.
185
All calculations were performed in CORAL software, which can be available from
186
http://www.insilico.eu/coral for free.
187
In this study, All the SMILES came from the previous report (Fourches et al., 2010). A total of
188
five different common physicochemical properties, named as molecular weight (MW), mass
189
percentage of carbon atoms (C%), topological polar surface area (TPSA), ring count (RC), and
190
atom count (AC), were employed as codes to develop the improved SMILES-based optimal
191
descriptors. Among the five selected physicochemical properties, the molecular weight (MW) has
192
been proved to be an important parameter for effectively altering the physicochemical properties
193
of different nano-drug carriers related to intracellular drug uptake (Feng et al., 2015; Liu et al.,
9
194
2016; Zhang et al., 2017), the topological polar surface area (TPSA) is widely used in virtual
195
screening of drug targets (Monge et al., 2006; Prasanna and Doerksen, 2009), while ring count
196
(RC), atom count (AC) and mass percentage of carbon atoms (C%) are the most commonly used
197
constitutional properties for indicating the bioactivity and cytotoxicity of nanomaterials to a
198
certain degree (Jia et al., 2005; Zhang et al., 2017; Malik and Kakkar, 2018). Both MW and C%
199
were calculated with Chemdraw (version 14.0), while TPSA, RC and AC were calculated using
200
Chemicalize (https://chemaxon.com/products/chemicalize).
201
Pearson correlation coefficients between each pair of physicochemical features (including MW,
202
C%, TPSA, RC and AC) were calculated to examine whether there was any special colinearity
203
between these parameters, and the results were shown in Fig. S1 (ESI†). The resulting highest
204
correlation coefficient (0.57) was between MW and AC, due to the fact that the more atoms would
205
lead to the higher molecular weights for small organic molecules.
206
2.4. Model validation
207
The best way to measure the validity of a model is how well it could predict the properties of
208
new MNPs that were not employed to generate it. Any developed QSAR model needs to be
209
reasonably validated according to the OECD QSAR validation principles (OECD, 2014). In this
210
study, we employed various validation strategies to verify the fitness, robustness, and predictivity
211
of the proposed nano-QSAR models.
212
The most commonly used statistical parameters, the squared correlation coefficient (R2),
213
standard error (s), and root-mean-square error (RMSE), were used to evaluate the fitness of the
214
resulting nano-QSAR models, while both the internal and external validations were used to
215
indicate the predictivity. Here, the robustness and internal predictive ability of both models were
10
216
indicated by the leave-one-out cross-validation parameter (Q2LOO), which was known as the most
217
reliable internal validation method. The external validation is necessary and essential in
218
determining both the generalizability and the true predictivity of the nano-QSAR models for
219
newly synthesized MNPs. Here, the external validation was performed by randomly dividing the
220
entire available data set into a training set and an external test set. The training set is used for the
221
model development, and the test set is used for external validation. Also, both models were tested
222
on a sufficiently large number of MNPs that were not used in the model development (20% of the
223
dataset) to avoid obtaining the QSAR results by chance or obtaining non-general conclusions
224
(Tropsha et al., 2003). Moreover, Y-randomization test was performed to further ensure the model
225
reliability and robustness of the developed models (Rücker et al., 2007), while the external
226
validation criteria proposed by Golbraikh and Tropsha (2002), Roy and Roy (2008) were also
227
employed to verify the acceptability of the developed models.
228
2.5. Applicability domain (AD)
229
According to the OECD principle 3 (OECD, 2014), a nano-QSAR model should have a defined
230
domain of applicability (AD). As nano-QSAR models are probably to be employed for reliably
231
estimating the properties of newly designed MNPs, the extent to which the newly synthesized
232
MNP belonging to the AD of the model requires reasonable consideration. Here, we analyzed the
233
AD of both models by the leverage approach to detect the existence of influential nanoparticles in
234
the training set, and to verify the prediction reliability for MNPs in the test set. The leverage value
235
hi is defined as hi = xiT(XTX)-1xi, where xi is a row vector of descriptors for a particular ith MNP
236
and X is the n × m matrix of m model descriptors for n training set MNPs. A hi value greater than
237
the warning leverage value h* implies that the structure of the MNP considerably differs from
11
238
those used for the calibration. The h* value can be calculated as, h*= 3(p+1)/n, where p is the
239
number of variables used in the model, n is the data size of the training set (Puzyn et al., 2011).
240
3. Results and Discussion
241
3.1. Computational results: nano-QSAR models
242
Based on the cellular uptake data of 109 magneto-fluorescent MNPs with surface modifiers in
243
HUVEC and PaCa2 lines, two statistically significant nano-QSAR models using uptake by single
244
cell line as the dependent variable were developed, respectively. The corresponding nano-QSAR
245
models were presented as following:
246
Model 1. HUVEC model log-. /NP0/cell = −3.1057±0.0341 + 0.0954±0.0005 × DCW1,15 B
n = 87, R = 247
0.852, QBEFF
(4) = 0.845, s = 0.237, F = 489, p < 0.0001
Model 2. PaCa2 model log-. /NP0/cell = −3.0330±0.0328 + 0.0966±0.0005 × DCW1,20 B
n = 87, R = 248 249
0.905, QBEFF
(5) = 0.899, s = 0.131, F = 812, p < 0.0001
Where, n is the number of MNPs in the training set; R2 is the squared correlation coefficient; Q2LOO is the cross-validated R2; s is standard error; F is Fischer ratio; p is p-value.
250
Subsequently, both nano-QSAR models were utilized to predict the cellular uptake values of the
251
MNPs in the test set for external validation. The predicted cellular uptakes were shown in Table S1
252
(ESI†), while the main statistical parameters were presented in Table 1. Both nano-QSAR models
253
for cellular uptake prediction of the 109 MNPs yielded high correlations (R2) between the
254
experimental and predicted cellular uptake endpoints in the respective training (>0.852) and test
255
(>0.822) sets. A value of R2 in the test set exceeding 0.6 would render a model with good
12
256
predictive capability (Alexander et al., 2015). Furthermore, the difference of R2 values between
257
the training and test set (Table 1) for both models was below 0.3, a standard for good predictivity
258
of the model (Eriksson et al., 2003). The RMSE values were not only low but also as similar as
259
possible for the training and external test sets, indicating that the proposed nano-QSAR models
260
have predictive ability (low values) and generalization performance (similar values). Both the
261
good fit and high predictive capability have been verified by an excellent agreement between the
262
experimentally determined and model (nano-QSAR) predicted MNPs cellular uptake values in the
263
respective training and test sets (Fig. 1(a)-(b)).
264
Table 1 Comparisons of statistical parameters between the presented and literature models
Cell lines
HUVEC
PaCa2
Training set
Models
R
2
Test set
RMSE
n
R
2
RMSE
n
Ojha et al. (2019)
0.782
0.299
87
0.704
-
22
Epa et al. (2012)
0.55
0.38
87
0.72
0.30
21
Winkler et al.
linear
0.74
0.34
87
0.63
0.36
21
(2014)
nonlinear
0.70
0.30
87
0.66
0.33
21
Basant and Gupta (2016)
0.973
0.100
83
0.966
0.104
21
Model 1
0.852
0.235
87
0.822
0.241
22
Ojha et al. (2019)
0.814
0.198
87
0.893
-
22
Epa et al. (2012)
0.64
0.26
87
0.62
0.32
21
Winkler et al.
linear
0.76
0.19
87
0.79
0.24
21
(2014)
nonlinear
0.77
0.15
87
0.54
0.28
21
Kar et al. (2014)
0.806
0.20
89
0.879
0.12
20
Fourches et al. (2010)
-
-
87
0.72
0.18
22
Toropov et al. (2013)
0.76
0.19
91
0.86
0.14
18
Singh and Gupta (2014)
0.945
0.13
87
0.897
0.18
22
Basant and Gupta (2016)
0.974
0.067
83
0.944
0.109
21
Ghorbanzadeh et al. (2012)
0.934
0.121
90
0.943
0.214
19
Model 2
0.905
0.130
87
0.885
0.140
22
265
The developed models were also satisfied with the external validation criteria proposed by
266
Golbraikh and Tropsha (2003), Roy and Roy (2008) as given in Table S2 (ESI†). All of these
267
statistical parameters are within the acceptable limit.
268
3.2. Model validation and results analysis 13
269
In order to avoid the existence of “correlation-by-chance”, and to prove the statistical
270
significance of both established nano-QSAR models, the Y-randomization test was carried out
271
additionally on the same dataset for 10 times for each model. As expected, all the newly calculated
272
R2 values, were much lower than the ones calculated when the dependent variables were not
273
scrambled (Table S3, ESI†). Thus, the Y-randomization test clearly showed that these models were
274
not obtained by chance, and confirmed the significance of the nano-QSAR models.
275
Furthermore, a visual analysis of the residual plots between the experimental and predicted
276
values for the resulting nano-QSAR models were calculated and shown in Fig. 1(c)-(d). As most
277
of the calculated residuals are randomly distributed on both sides of the zero baseline without
278
obvious regularity, it could be reasonably concluded that no systematic errors exist in the
279
development of both nano-QSAR models.
280
All the results discussed above indicated the satisfactory stability and predictivity of the
281
developed nano-QSAR models after carrying out various rigorous model validation strategies,
282
which could be reasonably employed to predict the cytotoxicity of MNPs within their ADs.
283
Considering the limitation of existing experimental data and the complexity of current cytotoxicity
284
tests for MNPs, it was difficult to further improve the model predictions beyond the current
285
results.
286
To our delighted, this work showed that the MNPs cellular uptake to both HUVEC and PaCa2
287
lines could be successfully predicted by the QSAR approach using the improved SMILES-based
288
optimal descriptors. Once properly developed, the nano-QSAR models could be expected to
289
predict the cellular uptake for new MNPs or for other MNPs for which experimental values are
290
unknown, using only SMILES structures as well as some basic physicochemical properties. The
14
291
SMILES could be obtained from freely accessible software like ChemSketch, while the involved
292
physicochemical properties can be generated by inputting SMILES structures into software such
293
as Chemicalize, or directly obtained from structural formula.
5.0
5.0
Training set Test set
Predicted values of log(UP)
Predicted values of log(UP)
5.5
4.5 4.0 3.5 3.0
(a)
2.5 2.0 2.0
2.5
3.0
3.5
4.0
4.5
5.0
4.5 4.0 3.5 3.0 2.5 2.0 2.0
5.5
1.0
2.5
3.0
3.5
4.0
4.5
0.0
-0.5
0.5
Training set Test set
0.0
-0.5
(c) -1.0 2.0
2.5
3.0
3.5
4.0
4.5
5.0
(d) -1.0 2.0
5.5
Experimental values of log(UP)
3.5
4.0
4.5
5.0
3
Standardized residuals
Standardized residuals
3.0
4
3
Training set Test set
2 1 0 -1
(e)
-2 -3
294
2.5
Experimental values of log(UP)
4
-4 0.00
5.0
1.0
Training set Test set Residual values
Residual values
(b) Experimental values of log(UP)
Experimental values of log(UP)
0.5
Training set Test set
Training set Test set
2 1 0 -1 -2
(f)
-3
h*=0.069 0.02
0.04
0.06
0.08
Leverages
-4 0.00
h*=0.069 0.02
0.04
0.06
0.08
Leverages
295
Fig. 1 (a) Plots of experimental versus predicted cellular uptake values to HUVEC line, (b) Plots of experimental
296
versus predicted cellular uptake values to PaCa2 line, (c) Plots of the residuals for predicting cellular uptake to
297
HUVEC line, (d) Plots of the residuals for predicting cellular uptake to PaCa2 line, (e) Williams plots describing
298
applicability domains of nano-QSAR model to HUVEC line and (f) Williams plots describing applicability
299
domains of nano-QSAR model to PaCa2 line.
15
300
3.3. Applicability domain analysis
301
Once a nano-QSAR model is developed, the applicability domain (AD) of the developed model
302
should be defined, since any nano-QSAR model could properly evaluate the cytotoxicity of newly
303
synthetic MNP, only if the MNP lies within its AD.
304
In this study, the plots of the standardized residuals versus leverages (Williams plot) for both
305
models were shown in Fig. 1(e)-(f), which indicated that all the 109 magneto-fluorescent MNPs
306
were located in a squared area within ± 3 standard deviation units and a warning leverage
307
h*=0.069. That is to say, there were no obvious outliers in both the structural similarity axis and
308
the cellular uptake predictions to the corresponding cell line.
309
3.4. Comparisons with other literature models
310
The proposed models for predicting MNPs cellular uptake values in different cell lines were
311
based on the almost completely same datasets as the literature reported. Considering the
312
comparative assessment of statistical parameters of all these models as presented in Table 1, one
313
can conclude that the proposed models calculated based on the improved SMILES-based optimal
314
descriptors are generally better than or at least comparable to those reported models. More other
315
specific details of the model comparisons are further discussed as below.
316
Comparisons between model 1 and other reported models based on the same dataset
317
(cellular uptake to HUVEC)
318
Firstly, it should be noted that Epa et al.’s (2012), Winkler et al. ’s (2014) and Ojha et al.’s
319
(2019) works were developed based on different sets of interpretable 2D DRAGON descriptors,
320
Basant and Gupta ’s (2016) work used 1D and 2D theoretical descriptors calculated by PaDEL
321
program, while our model 1 employed the improved SMILES-based optimal descriptors. All types
16
322
of descriptors are conceptually simple and easy to calculate. However, the 2D DRAGON
323
descriptors have definite physical meanings, which are more interpretable than the improved
324
SMILES-based optimal descriptors. Basant and Gupta (2016) employed decision tree boost (DTB)
325
approach, which may result in the risk of overfitting, while our model 1 was a multiple linear
326
regressions (MLR) model, which is always considered to be more transparent and easier to apply.
327
Regarding the model applicability domain, the analysis of AD is missing in both Epa et al.’s (2012)
328
and Winkler et al. ’s (2014) works, and the analysis in Ojha et al.’s (2019) work is not enough.
329
Ojha et al. (2019) only checked AD of the test set by using DModX (distance to model X)
330
approach, but no further explanation was given for the training set. In this study, the AD of model
331
1 was analyzed in detail and visually defined by the Williams plot.
332
Comparisons between model 2 and other reported models based on the same dataset
333
(cellular uptake to PaCa2)
334
More nano-QSAR models are available in literature concerning cellular uptake to PaCa2
335
comparing to cellular uptake to HUVEC. Winkler et al. (2014) employed 19 interpretable
336
mechanistically 2D DRAGON descriptors as the input parameters for nano-QSAR modeling,
337
which implied the possible risk of overfitting for the resulting model. Meanwhile, only five easily
338
obtained physicochemical parameters and SMILES structures were involved in the model 2
339
developed in this study. It is well known that QSAR modeling aims at finding optimum
340
quantitative relationship between the molecular structures and desired properties with as less
341
descriptors as possible. The fewer descriptors are employed, the more robust the nano-QSAR
342
models will be considered.
343
It is worth mentioning that the traditional SMILES-based optimal descriptors were used to
17
344
characterize the structural characteristics of the nanoparticles by Toropov et al. (2013). While in
345
this work, additional physicochemical properties were included in SMILES structures to improve
346
the characterization of nanostructure information, leading to an obvious better calibration ability
347
for the training set (0.905 vs. 0.76), as well as an improved predictive ability for the test set (0.885
348
vs. 0.86).
349
Moreover, Fourches et al. (2010) employed the k nearest neighbors (kNN) modeling approach,
350
Basant and Gupta (2016) employed decision tree boost (DTB) approach, Singh and Gupta (2014)
351
employed decision tree forest (DTF) and DTB implementing bagging and boosting techniques,
352
and Ghorbanzadeh et al. (2012) used multilayered perceptron neural network (MLP-NN)
353
modeling technique to develop reliable but non-intuitive nonlinear models. Also, Ghorbanzadeh et
354
al.’s (2012) Singh and Gupta’s (2014), and Basant and Gupta’s (2016) models were superior to
355
model 2 in terms of statistical parameters for the test set. However, our model 2 was a multiple
356
linear regressions (MLR) model, which is considered to be more transparent and easier to apply.
357
As well-known, the neural network method always suffers some disadvantages, such as
358
overtraining, overfitting, and reproducibility of results. Also, the decision tree approach may result
359
in the risk of overfitting.
360
With regards to the model applicability domain, the analysis of AD is absent in Epa et al.’s
361
(2012), Winkler et al.’s (2014) and Toropov et al.’s (2013) works, and Ojha et al. (2019) only
362
checked the AD for the test set by employing DModX approach ignoring the AD for the training
363
set. In this study, the AD of model 2 was analyzed in details and visually defined by the Williams
364
plot.
365
Consequently, both the proposed models can be considered to predict the cellular uptake
18
366
endpoint of MNPs to HUVEC and PaCa2 lines with some obvious superiority in comparison to
367
the literature models. However, it should also be stated that the proposed models have some
368
limitations: The employed physicochemical properties are selected based on literature researches,
369
but these empirical parameters may be not necessarily the optimal parameters for this study, which
370
need to be further studied. In addition, the resulting models could only be used to predict the
371
cellular uptake of metal oxide nanoparticles in HUVEC and PaCa2 lines.
372
3.5. Mechanistic interpretation
373
From the developed nano-QSAR models, it can be highlighted that a lower value of the optimal
374
descriptor (DCW) resulted in a lower value of cellular uptake endpoint (i.e., lower cytotoxicity).
375
The optimal descriptor was the sum of the correlation weight (CW) of all attribute fragments of
376
improved SMILES-based optimal descriptors. Thus, the higher correlation weights always
377
correspond to the higher cytotoxicity. 4.0 HUVEC
Average absolute CW
3.5
PaCa2
3.0 2.5 2.0 1.5 1.0 0.5 0.0
378
MW
C%
TPSA
RC
AC
379
Fig. 2 Average absolute correlation weights (CW) value of each physicochemical feature
380
in both generated nano-QSAR models.
381
In this study, a total of five easily obtained physicochemical properties were combined in the
382
SMILES structures to improve the structure characterization of MNPs. Lower CW led to lower
383
cellular uptake value and thus lower cytotoxicity because positive CW implied the promotion of
19
384
an increase in cellular uptake. Among the five employed parameters, AC had the highest average
385
absolute CW value in both generated nano-QSAR models as plotted in Fig. 2. That is to say, AC
386
was found to have the most influential effects on cellular uptake of the MNPs to both cell lines.
387
Based on the correlation weight tables (Table S4-S5, ESI†), it can be concluded that, for HUVEC
388
line, the correlation weights of low MW group A1-A6 (smaller than 3.0) are lower than the
389
weights of high MW group A7-A9 (larger than 3.0), suggesting that the high MW group is more
390
toxic than low MW group. The correlation weights of C% decreases first and then increases with
391
the increase of C%, suggesting that the cytotoxicity is decreased and then enhanced with the
392
increase of C%. All the correlation weights of TPSA are low, excluding D4 and D9, implying that
393
the changes of TPSA have no significant effect on cytotoxicity. The correlation weights of low RC
394
group E0-E3 (larger than 3.0) are higher than the weights of high RC group E4-E9 (smaller than
395
3.0), suggesting that the low RC group is more toxic than high RC group. The correlation weights
396
showed a lowering trend with increase of AC, excluding M1, M5 and M6, suggesting that the
397
cytotoxicity was enhanced in a nanoparticle with low atom count. While for PaCa2 line, all the
398
correlation weights of MW are low excluding A4, implying that the changes of MW have no
399
significant effect on cytotoxicity. The correlation weights of lower C% group B2-B3 (larger than
400
3.0) are higher than the weights of higher C% group B4-B9 (smaller than 3.0) except for B6,
401
suggesting that the lower C% group exerted more toxic effects than higher C% group. All the
402
correlation weights of TPSA are low excluding D3, implying that the changes of TPSA may have
403
no significant effect on cytotoxicity. The correlation weights of lower RC group E0-E4 (larger
404
than 3.0) are higher than the weights of higher RC group E5-E9 (smaller than 3.0), suggesting that
405
the lower RC group exerted more toxic effects than higher RC group. However, it should be noted
20
406
that there are no obvious rules in the numerical changes of the correlation weights for AC.
407
Table 2 global SMILES Attributes (NOSP) chosen according to two Criteria: 1) these are not rare for models
408
of the both objects of interest; and 2) their presence in the molecules has a clear effect. HUVEC
global SMILES
PaCa2
attribute
CW
NSsa
CW
NSsa
NCvb
1
NOSP01000000
8.605
43
9
5.563
43
9
2
NOSP01100000
1.578
1
1
1.152
1
1
3
NOSP10000000
0.873
17
7
2.511
17
7
4
NOSP11000000
-2.097
26
4
2.653
26
4
NCv
b
a
NSs is the number of substances which containing a structural attribute in the training set;
b
NCv is the number of substances which containing a structural attribute in the test set.
409
Table 2 presents four mentioned global SMILES attributes involved in both models (the cellular
410
uptake to HUVEC and PaCa2 lines), which were chosen according to the following criteria: 1)
411
these attributes are not rare for models of the both endpoints; and 2) their presence in the
412
molecules has a clear effect (attributes with positive correlation weights are promoters of endpoint
413
increase and attributes with negative correlation weights are promoters of endpoint decrease)
414
(Toropov et al., 2011). It could be noted that the attributes1, 2, 3 are promoters of the increase for
415
cellular uptake of both HUVEC and PaCa2 lines, whereas the attribute 4 is a promoter of the
416
decrease of HUVEC cellular uptake and promoter of the increase of PaCa2 cellular uptake. It
417
could be concluded that the analysis of the presence of attribute 4 in the molecular structure would
418
be helpful to search for similarities and differences in the mechanisms of the cellular uptake to
419
HUVEC and PaCa2 lines.
420
4. Conclusions
421
In this study, the improved SMILES-based optimal descriptors were employed for nano-QSAR
422
modeling on the cytotoxicity of 109 MNPs to HUVEC and PaCa2 lines. Two robust and predictive
423
nano-QSAR models have been successfully developed and rigorously validated. Furthermore, the
21
424
predominant nano-structure characteristics responsible for cellular uptake of MNPs were
425
identified, and differences in the mechanisms of their cytotoxicity to different types of cells were
426
also discussed. The main findings are summarized below:
427
1) The improved SMILES-based optimal descriptors for 109 MNPs can be calculated directly
428
from SMILES and some easily available physicochemical properties without any complicated
429
calculation process. These reproducible descriptors could efficiently encode the cellular
430
uptake of MNPs leading to models with satisfactory statistical quality as well as certain
431
interpretability.
432
2) Two statistically significant and robust nano-QSAR models for predicting the cellular uptake
433
of MNPs to both HUVEC and PaCa2 lines have been successfully developed. When
434
compared with the previously reported models, the resulting models were considered to be
435
conceptually simple, easy to apply, and with acceptable model interpretability.
436
3) For HUVEC line, the atom count and molecular weight were found to be the most significant
437
factors for the cellular uptake, while for PaCa2 line, the atom count and mass percentage of
438
carbon atoms were the most important factors for the cellular uptake. The presented
439
nano-QSAR models revealed the differences in the mechanisms of cytotoxicity of MNPs to
440
HUVEC and PaCa2 lines.
441
4) This study could provide a new way for predicting the cytotoxicity of unmeasured or novel
442
surface modifiers of MNPs for which experimental values are unknown. The developed
443
models could also be used to improve the safety design and virtual screening of desired MNPs
444
prior to their biomedical applications.
445
Acknowledgements
22
446
This research was supported by National Natural Science Fund of China (No. 51974165,
447
81803274), and Natural Science Fund of Jiangsu Higher Education Institutions of China (No.
448
18KJA620002).
449
Appendix A. Supplementary materials
450
Supplementary information.docx: This file provides supplementary tables and figures.
451
23
452
References
453
Abbasi, A., Hamadanian, M., Salavati-Niasari, M., Mortazavi-Derazkola, S., 2017. Facile
454
size-controlled preparation of highly photocatalytically active ZnCr2O4 and ZnCr2O4/Ag
455
nanostructures for removal of organic contaminants. J. Colloid Interf. Sci. 500, 276–284.
456
Agasti, S.S., Rana, S., Park, M.-H., Kim, C.K., You, C.-C., Rotello, V.M., 2010. Nanoparticles for
457
detection and diagnosis. Adv. Drug Deliver. Rev. 62, 316–328.
458
Alexander, D.L., Tropsha, A., Winkler, D.A., 2015. Beware of R2: Simple, unambiguous
459
assessment of the prediction accuracy of QSAR and QSPR models. J. Chem. Inf. Model. 55,
460
1316–1322.
461
Ardekani, P.S., Karimi, H., Ghaedi, M., Asfaram,A., Purkait, M.K., 2017. Ultrasonic assisted
462
removal of methylene blue on ultrasonically synthesized zinc hydroxide nanoparticles on
463
activated carbon prepared from wood of cherry tree: Experimental design methodology and
464
artificial neural network. J. Mol. Liq. 229, 114–124.
465
Auffan, M., Rose, J., Bottero, J.-Y., Lowry, G.V., Jolivet, J.-P., Wiesner, M.R., 2009. Towards a
466
definition of inorganic nanoparticles from an environmental, health and safety perspective.
467
Nat. Nanotechnol. 4, 634–641.
468
Bahrani, S., Razmi, Z., Ghaedi, M., Asfaram, A., Javadian, H., 2018. Ultrasound-accelerated
469
synthesis of gold nanoparticles modified choline chloride functionalized graphene oxide as a
470
novel sensitive bioelectrochemical sensor: Optimized meloxicam detection using CCD-RSM
471
design and application for human plasma sample. Ultrason. Sonochem. 42, 776–786.
472
Bajpai, V.K., Shukla, S., Kang, S.-M., Hwang, S.K., Song, X., Huh, Y.S., Han Y.-K., 2018.
473
Developments of cyanobacteria for nano-marine drugs: relevance of nanoformulations in
24
474
cancer therapies. Mar. Drugs. 16, 179–201.
475
Basant, N., Gupta, S., 2016. Modeling uptake of nanoparticles in multiple human cells using
476
structure- activity relationships and intercellular uptake correlations. Nanotoxicology. 11, 20–
477
30..
478
Bazrafshan, A.A., Ghaedi, M., Hajati, S., Naghiha, R., Asfaram, A., 2017. Synthesis of
479
ZnO-nanorod-based materials for antibacterial, antifungal activities, DNA cleavage and
480
efficient ultrasound-assisted dyes adsorption. Ecotox. Environ. Safe. 142, 330–337.
481 482
Cheng, L.-C., Jiang, X., Wang, J., Chen, C., Liu, R.-S., 2013. Nano-bio effects: interaction of nanomaterials with cells. Nanoscale. 5, 3547–3569.
483
Choi, J.-S., Trinh, T.X., Yoon, T.-H., Kim, J., Byun, H.-G., 2019. Quasi-QSAR for predicting the
484
cell viability of human lung and skin cells exposed to different metal oxide nanomaterials.
485
Chemosphere. 217, 243–249.
486
Davar, F., Salavati-Niasari, M., Mir, N., Saberyan, K., Monemzadeh, M., Ahmadi, E., 2010.
487
Thermal decomposition route for synthesis of Mn3O4 nanoparticles in presence of a novel
488
precursor. Polyhedron. 29, 1747–1753.
489
Dil, E.A., Ghaedi, M., Asfaram, A., Mehrabi, F., Sadeghfar, F., 2019. Efficient adsorption of Azure
490
B onto CNTs/Zn:ZnO@Ni2P-NCs from aqueous solution in the presence of ultrasound wave
491
based on multivariate optimization. J. Ind. Eng. Chem. 74, 55–62.
492 493
Dong, J., Zink, J.I., 2014. Taking the temperature of the interiors of magnetically heated nanoparticles. ACS Nano. 8, 5199–5207.
494
Duttaa, T., Kim, K.-H., Deep, A., Szulejko, J.E., Vellingiri, K., Kumar, S., Kwon, E.E., Yun, S.-T.,
495
2018. Recovery of nanomaterials from battery and electronic wastes: A new paradigm of
25
496
environmental waste management. Renew. Sust. Energ. Rev. 82, 3694–3704.
497
El-Sayed, I.H., Huang, X., El-Sayed, M.A., 2006. Selective laser photo-thermal therapy of
498
epithelial carcinoma using anti-EGFR antibody conjugated gold nanoparticles. Cancer Lett.
499
239, 129–135.
500 501
Epa, V.C, Burden, F.R., Tassa, C., Weissleder, R., Shaw, S., Winkler, D.A., 2012. Modeling biological activities of nanoparticles. Nano Lett. 12, 5808–5812.
502
Eriksson, L., Jaworska, J., Worth, A.P., Cronin, M.T., McDowell R.M., Gramatica, P., 2003.
503
Methods for reliability and uncertainty assessment and for applicability evaluations of
504
classification- and regression-based QSARs. Environ. Health Persp. 111, 1361–1375.
505 506 507 508
Feng, S., Nie, L., Zou, P., Suo, J., 2015. Effects of drug and polymer molecular weight on drug release from PLGA-mPEG microspheres. J. Appl. Polym. Sci. 132, 41431–41438. Fourches, D., Pu, D., Tassa, C., Weissleder, R., Shaw, S.Y., Mumper, R.J., Tropsha, A., 2010. Quantitative nanostructure - activity relationship modeling. ACS Nano. 4, 5703–5712.
509
Gai, X., Jiang, Z., Liu, M., Li, Q., Wang, S., Li, T., Pan, W., Yang, X., 2018. Therapeutic effect of
510
a novel nano-drug delivery system on membranous glomerulonephritis rat model induced by
511
cationic bovine serum. AAPS PharmSciTech. 19, 2195–2202.
512
Geys, J., Nemmar, A., Verbeken, E., Smolders, E., Ratoi, M., Hoylaerts, M.F., Nemery, B., Hoet,
513
P.H., 2008. Acute toxicity and prothrombotic effects of quantum dots: impact of surface
514
charge. Environ. Health Persp. 116, 1607–1613.
515
Ghorbanzadeh, M., Fatemi, M.H., Karimpour, M., 2012. Modeling the cellular uptake of
516
magnetofluorescent nanoparticles in pancreatic cancer cells: A quantitative structure activity
517
relationship study. Ind. Eng. Chem. Res. 51, 10712–10718.
26
518
Golbraikh, A., Tropsha, A., 2002. Beware of q2!. J. Mol. Graph. Model. 20, 269–276.
519
Jia, G., Wang, H., Yan, L., Wang, X., Pei, R., Yan, T., Zhao, Y., Guo, X., 2005. Cytotoxicity of
520
carbon nanomaterials: single-wall nanotube, multi-wall nanotube, and fullerene. Environ. Sci.
521
Technol. 39, 1378–1383.
522 523 524
Jiang, S., Gao, Q., Chen, H., Roco, M.C., 2015. The roles of sharing, transfer, and public funding in nanotechnology knowledge-diffusion networks. J. Assoc. Inf. Sci. Tech. 66, 1017–1029. Kar, S., Gajewicz, A., Puzyn, T., Roy, K., 2014. Nano-quantitative structure-activity relationship
525
modeling
526
magnetofluorescent engineered nanoparticles in pancreatic cancer cells. Toxicol. in Vitro. 28,
527
600–606.
528 529
using
easily computable
and
interpretable
descriptors for
uptake
of
Kar, S., Roy, K., 2010. Predictive toxicology using QSAR: a perspective. J. Indian Chem. Soc. 87, 1455–1515.
530
Khojasteh, H., Safajo, H., Mortazavi-Derazkola, S., Salavati-Niasari, M., Heydaryan, K., Yazdani,
531
M., 2019. Economic procedure for facile and eco-friendly reduction of graphene oxide by
532
plant extracts; a comparison and property investigation. J. Clean. Prod. 229, 1139–1147.
533
Le, T., Epa, V.C., Burden, F.R., Winkler, D.A., 2012. Quantitative structure-property relationship
534
modeling of diverse materials properties. Chem. Rev. 112, 2889–2919.
535
Liu, M., Tang, F., Yang, Z., Xu, J., Yang, X., 2019. Recent progress on gold-nanocluster-based
536
fluorescent probe for environmental analysis and biological sensing. J. Anal. Methods Chem.
537
2019, 1–10.
538
Liu, R., Wang, Y., Ma, Y., Wu, Y., Guo, Y., Xu, L., 2016. Effects of the molecular weight of PLGA
539
on degradation and drug release in vitro from an mPEG-PLGA nanocarrier. Chem. Res. Chin.
27
540 541 542
Univ. 32, 848–853. Malik, P., Kakkar, R., 2018. Effects of increasing number of rings on the ion sensing ability of CdSe quantum dots: a theoretical study. J. Nanopart. Res. 20, 114–128.
543
Manganelli, S., Leone, C., Toropov, A.A., Toropova, A.P., Benfenati, E., 2016. QSAR model for
544
cytotoxicity of silica nanoparticles on human embryonic kidney cells. Mater. Today Proc. 3,
545
847–854.
546
Mattsson, K., Adolfsson, K., Ekvall, M.T., Borgström, M.T., Linse, S., Hansson, L.-A., Cedervall,
547
T., Prinz, C.N., 2016. Translocation of 40 nm diameter nanowires through the intestinal
548
epithelium of daphnia magna. Nanotoxicology. 10, 1160–1167.
549
Mir, N., Salavati-Niasari, M., Davar, F., 2012. Preparation of ZnO nanoflowers and Zn glycerolate
550
nanoplates using inorganic precursors via a convenient rout and application in dye sensitized
551
solar cells. Chem. Eng. J. 181–182, 779–789.
552
Mohandes, F., Salavati-Niasari, M., 2014. Freeze-drying synthesis, characterization and in vitro
553
bioactivity of chitosan/graphene oxide/hydroxyapatite nanocomposite. RSC Adv. 4, 25993–
554
26001.
555
Monge, A., Arrault, A., Marot, C., Morin-Allory, L., 2006. Managing, profiling and analyzing a
556
library of 2.6 million compounds gathered from 32 chemical providers. Mol. Divers. 10,
557
389–403.
558
Mortazavi-Derazkola, S., Salavati-Niasari, M., Khojasteh, H., Amiri, O., Ghoreishi, S.M., 2017.
559
Green synthesis of magnetic Fe3O4/SiO2/HAp nanocomposite for atenolol delivery and in
560
vivo toxicity study. J. Clean. Prod. 168, 39–50.
561
Ojha, P.K., Kar, S., Roy, K., Leszczynski, J., 2019. Toward comprehension of multiple human
28
562
cells uptake of engineered nano metal oxides: quantitative inter cell line uptake specificity
563
(QICLUS) modeling. Nanotoxicology. 13, 14–34.
564 565
Organisation for Economic Co-operation and Development (OECD), 2014. Guidance document on the validation of (quantitative) structure-activity relationship [(Q) SAR] models.
566
Oroojalian, F., Rezayan, A.H., Mehrnejad, F., Nia, A.H., Shier, W.T., Abnous, K., Ramezani, M.,
567
2017. Efficient megalin targeted delivery to renal proximal tubular cells mediated by
568
modified-polymyxin B-polyethylenimine based nano-gene-carriers. Mat. Sci. Eng. C. 79,
569
770–782.
570
Pan, Y., Li, T., Cheng, J., Telesca, D., Zink, J.I., Jiang, J., 2016. Nano-QSAR modeling for
571
predicting the cytotoxicity of metal oxide nanoparticles using novel descriptors. RSC Adv. 6,
572
25766–25775.
573 574 575 576 577 578
Petros, R.A., DeSimone, J.M., 2010. Strategies in the design of nanoparticles for therapeutic applications. Nat. Rev. Drug Discov. 9, 615–627. Prasanna, S., Doerksen, R.J., 2009. Topological polar surface area: A useful descriptor in 2D-QSAR. Curr. Med. Chem. 16, 21–41. Pridgen, E.M., Alexis, F., Farokhzad, O.C., 2015. Polymeric nanoparticle drug delivery technologies for oral delivery applications. Expert Opin. Drug Del. 12, 1459–1473.
579
Puzyn, T., Rasulev, B., Gajewicz, A., Hu, X., Dasari, T.P., Michalkova, A., Hwang, H.-M.,
580
Toropov, A., Leszczynska, D., Leszczynski, J., 2011. Using nano-QSAR to predict the
581
cytotoxicity of metal oxide nanoparticles. 6, 175–178.
582 583
Roy, P.P., Roy, K., 2008. On some aspects of variable selection for partial least squares regression models. QSAR Comb. Sci. 27, 302–314.
29
584 585 586
Rücker, C., Rücker, G., Meringer, M., 2007. Y-randomization and its variants in QSPR/QSAR. J. Chem. Inf. Model. 47, 2345–2357. Sadeghfar, F., Ghaedi, M., Asfaram, A., Jannesar, R., Javadian, H., Pezeshkpour, V., 2018.
587
Polyvinyl
alcohol/Fe3O4@carbon
nanotubes
nanocomposite:
Electrochemical-assisted
588
synthesis, physicochemical characterization, optical properties, cytotoxicity effects and
589
ultrasound-assisted treatment of aqueous based organic compound. J. Ind. Eng. Chem. 65,
590
349–362.
591
Safardoust-Hojaghan, H., Salavati-Niasari, M., 2017. Degradation of methylene blue as a pollutant
592
with N-doped graphene quantum dot/titanium dioxide nanocomposite. J. Clean. Prod. 148,
593
31–36.
594
Salavati-Niasari, M., 2004. Zeolite-encapsulation copper(II) complexes with 14-membered
595
hexaaza macrocycles: synthesis, characterization and catalytic activity. J. Mol. Catal.
596
A-Chem. 217, 87–92.
597
Salavati-Niasari, M., Davar, F., 2006. In situ one-pot template synthesis (IOPTS) and
598
characterization of copper(II) complexes of 14-membered hexaaza macrocyclic ligand
599
“3,10-dialkyl-dibenzo-1,3,5,8,10,12-hexaazacyclotetradecane’’. Inorg. Chem. Commun. 9,
600
175–179.
601 602 603 604 605
Shang, L., Nienhaus, K., Nienhaus, G.U., 2014. Engineered nanoparticles interacting with cells: size matters. J. Nanobiotechnol. 12, 5–15. Singh, K.P., Gupta, S., 2014. Nano-QSAR modeling for predicting biological activity of diverse nanomaterials. RSC Adv. 4, 13215–13230. Singh, R.P., Ramarao, P., 2013. Accumulated polymer degradation products as effector molecules
30
606
in cytotoxicity of polymeric nanoparticles. Toxicol. Sci. 136, 131–143.
607
Tian, Y., Guo, R., Jiao, Y., Sun, Y., Shen, S., Wang, Y., Lu, D., Jiang, X., Yang, W., 2016. Redox
608
stimuli-responsive hollow mesoporous silica nanocarriers for targeted drug delivery in cancer
609
therapy. Nanoscale Horiz. 1, 480–487.
610 611
Toropov, A.A., Toropova, A.P., 2015. Quasi-QSAR for mutagenic potential of multi-walled carbon-nanotubes. Chemosphere. 124, 40–46.
612
Toropov, A.A., Toropova, A.P., Benfenati, E., Gini, G., Leszczynska, D., Leszczynski, J., 2011.
613
SMILES-based QSAR approaches for carcinogenicity and anticancer activity: Comparison of
614
correlation weights for identical SMILES attributes. Anti-Cancer Agent. Me. 11, 974–982.
615
Toropov, A.A., Toropova, A.P., Puzyn, T., Benfenati, E., Gini, G., Leszczynska, D., Leszczynski, J.,
616
2013. QSAR as a random event: Modeling of nanoparticles uptake in PaCa2 cancer cells.
617
Chemosphere. 92, 31–37.
618
Toropova, A.P., Toropov, A.A., 2013. Optimal descriptor as a translator of eclectic information
619
into the prediction of membrane damage by means of various TiO2 nanoparticles.
620
Chemosphere. 93, 2650–2655.
621 622
Toropova, A.P., Toropov, A.A., 2017. Nano-QSAR in cell biology: Model of cell viability as a mathematical function of available eclectic data. J. Theor. Biol. 416, 113–118.
623
Toropova, A.P., Toropov, A.A., Rallo, R., Leszczynska, D., Leszczynski, J., 2015. Optimal
624
descriptor as a translator of eclectic data into prediction of cytotoxicity for metal oxide
625
nanoparticles under different conditions. Ecotox. Environ. Safe. 112, 39–45.
626
Toropova, A.P., Toropov, A.A., Veselinović, A.M., Veselinović, J.B., Leszczynska, D., Leszczynski,
627
J., 2016. Monte carlo−based quantitative structure−activity relationship models for toxicity of
31
628
organic chemicals to daphnia magna. Environ. Toxicol. Chem. 35, 2691–2697.
629
Trinh, T.X., Choi, J.-S., Jeon, H., Byun, H.-G., Yoon, T.-H., Kim, J., 2018. Quasi-SMILES-based
630
nano-quantitative structure−activity relationship model to predict the cytotoxicity of
631
multiwalled carbon nanotubes to human lung cells. Chem. Res. Toxicol. 31, 183–190.
632
Tropsha, A., Gramatica, P., Gombar, V.K., 2003. The importance of being earnest: Validation is the
633
absolute essential for successful application and interpretation of QSPR models. QSAR
634
Comb. Sci. 22, 69–77.
635
Weissleder, R., Kelly, K., Sun, E.Y., Shtatland, T., Josephson, L., 2005. Cell-specific targeting of
636
nanoparticles by multivalent attachment of small molecules. Nat. Biotechnol. 23, 1418–1423.
637
Winkler, D.A., Burden, F.R., Yan, B., Weissleder, R., Tassa, C., Shaw, S., Epa, V.C., 2014.
638
Modelling and predicting the biological effects of nanomaterials. SAR QSAR Environ. Res.
639
25, 161–172.
640
Winkler, D.A., Mombelli, E., Pietroiusti, A., Tran, L., Worth, A., Fadeel, B., McCall, M.J., 2013.
641
Applying quantitative structure-activity relationship approaches to nanotoxicology: Current
642
status and future potential. Toxicology. 313, 15–23.
643 644
Wu, J., Li, S., Wei, H., 2018. Multifunctional nanozymes: enzyme-like catalytic activity combined with magnetism and surface plasmon resonance. Nanoscale Horiz. 3, 367–382.
645
Yousefi, M., Gholamian, F., Ghanbari, D., Salavati-Niasari, M., 2011. Polymeric nanocomposite
646
materials: Preparation and characterization of star-shaped PbS nanocrystals and their
647
influence on the thermal stability of acrylonitrile–butadiene–styrene (ABS) copolymer.
648
Polyhedron. 30, 1055–1060.
649
Zhang, M., Ma, Y., Wang, Z., Han, Z., Gao, W., Gu, Y., 2017. Optimizing molecular weight of
32
650
octyl chitosan as drug carrier for improving tumor therapeutic efficacy. Oncotarget. 8, 64237–
651
64249.
652 653
Zhang, M.-Q., Zhao, Y.-X., Liu, Q.-Y., Li, X.-N., He, S.-G., 2017. Does each atom count in the reactivity of vanadia nanoclusters? J. Am. Chem. Soc. 139, 342–347.
654
Zinatloo-Ajabshir, S., Mortazavi-Derazkola, S., Salavati-Niasari, M., 2017. Nd2O3 nanostructures:
655
Simple synthesis, characterization and its photocatalytic degradation of methylene blue. J.
656
Mol. Liq. 234, 430–436.
657
33
Highlights
The improved SMILES-based optimal descriptors are used to describe nanostructures
Two QSAR models are developed to predict cellular uptake of MNPs to two types of cells
Predominant structure features responsible for MNPs’ cellular uptake are identified
A new way for predicting the cellular uptake of new MNP to target cells is provided
Declaration of interests ☒ The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper. ☐The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: