Journal Pre-proof GPU-accelerated smoothed particle hydrodynamics modeling of granular flow Jian-Yu Chen, Fue-Sang Lien, Chong Peng, Eugene Yee PII:
S0032-5910(19)30851-4
DOI:
https://doi.org/10.1016/j.powtec.2019.10.017
Reference:
PTEC 14765
To appear in:
Powder Technology
Received Date: 5 June 2019 Revised Date:
1 October 2019
Accepted Date: 4 October 2019
Please cite this article as: J.-Y. Chen, F.-S. Lien, C. Peng, E. Yee, GPU-accelerated smoothed particle hydrodynamics modeling of granular flow, Powder Technology (2019), doi: https://doi.org/10.1016/ j.powtec.2019.10.017. This is a PDF file of an article that has undergone enhancements after acceptance, such as the addition of a cover page and metadata, and formatting for readability, but it is not yet the definitive version of record. This version will undergo additional copyediting, typesetting and review before it is published in its final form, but we are providing this version to give early visibility of the article. Please note that, during the production process, errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain. © 2019 Published by Elsevier B.V.
Head on collision of granular jets
Granular impact with wave structure
GPU-accelerated smoothed particle hydrodynamics modeling of granular flow Jian-Yu Chena , Fue-Sang Lienb , Chong Pengc,∗, Eugene Yeeb a School
of Mechanical Engineering, Nanjing University of Science and Technology, 200 Xiaolingwei Street, Xuanwu District, Nanjing, 210094, China b Department of Mechanical and Mechatronics Engineering, University of Waterloo, 200 University Avenue West, Waterloo, Ontario, N2L 3G1, Canada c Institut f¨ ur Geotechnik, Universit¨at f¨ur Bodenkultur, Feistmantelstrasse 4, 1180 Vienna, Austria
Abstract In this paper, the dynamical behavior of granular media is investigated numerically using smoothed particle hydrodynamics (SPH) with an elastoplastic and a phase-change constitutive model. A GPU-accelerated SPH code for granular flow modeling is implemented and validated using four test cases: (1) the collapse of an axisymmetric sand pile; (2) the impact force of sand on a rigid wall; (3) the head-on collisions of dense granular jets; and, (4) the impact of a granular jet with a wave structure on the granular film. The results of SPH simulations with the two models are compared to available experimental and numerical results. The validation showed that the dynamics of granular media are well predicted using the SPH methodology with an appropriate material model. An SPH simulation of granular flow run on a rather outdated GPU was already about 160 times as efficient as the same simulation run on a mainstream single CPU. Keywords: elastoplastic, CUDA, GPU, granular flow, phase change, SPH
1
1. Introduction
2
Various geotechnical problems associated with large deformations, such as landslides and granular impacts, are
3
difficult to address using conventional grid-based methods. More specifically, the large deformations that characterize
4
this class of problems frequently result in severe mesh distortions that provoke instabilities in the numerical algorithm,
5
which in turn cause the simulation to terminate prematurely. Owing to these difficulties, we will investigate the ap-
6
plication of an alternative mesh-free methodology for the simulation of granular impact. To this purpose, smoothed
7
particle hydrodynamics (SPH) implemented using the compute unified device architecture (CUDA) programming
8
interface for parallel computing on a graphics processing unit (GPU) is used to address the problem of large defor-
9
mations that generally characterize three-dimensional (3D) granular impact problems. The rationale for using SPH
10
that harnesses the power of a GPU for the simulation of granular impact are as follows: (1) the Lagrangian and
11
meshless property of the SPH methodology facilitates the simulation of granular flow and impact problems involving ∗ Corresponding
author Email address:
[email protected] (Chong Peng )
Preprint submitted to Powder Technology
October 5, 2019
12
large deformations as compared to the more conventional grid-based methods; and, (2) the implementation of SPH
13
using CUDA allows the parallelizable parts of the computation to off-loaded to a GPU to give a high floating-point
14
computing performance.
15
The SPH methodology was originally proposed by Gingold and Monaghan [1] and Lucy [2] and applied to address
16
various astrophysics problems. Subsequently, this methodology was applied to problems in other areas of scientific
17
endeavor (e.g., explosive detonations, geomechanics problems). Bui et al. [3, 4] report some seminal work concerning
18
the implementation of an elastoplastic constitutive model for the simulation of the elastic and plastic behavior of soil
19
within the SPH framework. Peng et al. [5, 6] introduced the hypoplastic constitutive model into the SPH methodology
20
and applied it to the simulation of granular flow. This effort demonstrated that the hypoplastic constitutive model
21
can provide good predictions in granular flow problems involving large deformations. Fan et al. [7] and Fan and
22
Li [8] simulated landmine detonations using three different soil constitutive models within the SPH methodology in
23
conjunction with the peridynamics method. Chen and Lien [9] and Chen et al. [10] applied the SPH methodology with
24
an elastoplastic and a hypoplastic constitutive model to investigate phenomoneology associated with the detonation
25
of landmines and its effect on structures.
26
Recently, a number of experiments and simulations related to granular impacts have been conducted using a mesh-
27
free methodology. This work concerning granular material impacts is of significant importance for various industrial
28
applications including ink-jet printing and inpinging jet milling. Pacheco-V´azquez and Ruiz-Su´arez [11] carried out
29
impact cratering experiments using granular projectiles with different porosities. Ellowitz [12] investigated the phe-
30
nomenon of two-dimensional (2D) head-on collisions of dense granular jets using discrete particle simulations . Shi
31
et al. [13] studied the dynamic behavior of a dense granular jet impacting on a circular target using the Discrete Ele-
32
ment Method (DEM). To date, most of the modeling effort concerning granular impact modeling has been conducted
33
using the DEM method as described in Cundall and Strack [14]. While accurate, the DEM method involves solving
34
Newton’s equation of motion for each grain individually. In consequence, this method is computationally prohibitive
35
for problems involving a large physical domain [15]. In comparison to DEM, the SPH methodology is advantageous
36
(and hence, preferred) for the simulation of granular flows for two reasons. Firstly, it is straightforward to implement
37
different constitutive models for the representation of the mechanical behavior of granular materials within the SPH
38
framework. Secondly, the material domain in the SPH methodology is discretized using particles (discrete moving
39
Lagrangian elements) that carry the relevant physical variables (properties) of the flow. Unlike DEM, these particles
40
are not physical grains of the material and, as a result, their size can be chosen in accordance to the problem that needs
41
to be addressed. Indeed, for certain problems, the size of the SPH particles can be much larger than the physical grains
42
of the material, which can result in a significant reduction in the computational cost of SPH simulations per number
43
of particles (in comparison to DEM).
44
Although numerous investigations of the simulation of granular flows have been conducted, the application of the
45
SPH methodology to the modeling and simulation of 3D granular flow is still in its infancy. In this paper, the phase-
46
change constitutive model will be incorporated into the SPH framework and used to investigate various phenomena 2
47
associated with granular flow for the first time. This material constitutive model was recently proposed by Dunatunga
48
and Karmrin [15] who used it to investigate 2D granular flow problems within the material point method (MPM)
49
framework. In comparison with the more commonly used rate-independent elastoplastic and Mohr–Coulomb models,
50
the phase-change constitutive model is more suitable for addressing problems involving dense zones and rapid flows.
51
Furthermore, the effects of particle size have also been incorporated into the phase-change constitutive model [15].
52
The search for the neighbors of a particle and the calculation of the rates of change of various physical variables
53
are extremely time-consuming operations in the SPH methodology. As a consequence, the parallelization of the
54
SPH methodology is necessary, especially for simulations of 3D granular dynamic behavior that necessarily involve
55
large numbers of particles. To this purpose, it is noted that CUDA, which is a parallel computing platform and
56
programming interface for GPUs developed by NVIDIA, is especially well-suited for the efficient implementation
57
of the SPH methodology. The implementation of the SPH methodology on GPUs was first described by Harada
58
et al. [16]. Subsequently, Crespo et al. [17] reported GPU-accelerated SPH simulations for the investigation of
59
complex free surface flows. These researchers showed that their SPH simulations on a single GPU were two orders
60
of magnitude faster than that conducted on a single-core CPU. Recently, GPU-accelerated SPH models have been
61
applied to the simulation of particle-fluid flows and to the solution of the shallow water equations [18, 19], and the
62
computational efficiency of these models has been further improved through the incorporation of a adaptive particle
63
splitting and merging methodology [20]. Furthermore, a GPU-accelerated DEM methodology has been implemented
64
to tackle the challenges associated with modelling powder compaction [21, 22].
65
In this paper, the CUDA programming interface has been used for the implementation of a new SPH code for sim-
66
ulation of granular flows on a GPU. The in-house parallelized SPH code has been implemented in C++ and designed
67
specifically for the computationally efficient and accurate simulation of 3D granular impact problems involving large
68
numbers of particles. To the best of our knowledge, this is the first time that the phase-change constitutive model has
69
been incorporated in the SPH methodology and used to investigate the phenomenology associated with 3D granular
70
impact problems. Furthermore, phenomena associated with 3D head-on collisions and the impact of granular jets with
71
a wave structure on a granular film are modeled and simulated within the SPH framework for the first time.
72
The paper is organized into six sections. Section 2 contains a brief introduction of the SPH methodology. Section
73
3 describes the phase-change and elastoplastic constitutive models. Section 4 presents the computational details of
74
our implementation of the SPH methodology on a GPU. The two constitutive models mentioned above are validated
75
using four test cases in Section 5: namely, the collapse of an axisymmetric sand pile, the impact force of sand on
76
a rigid wall, head-on collisions of dense granular jets, and the impact of a granular jet with a wave structure on a
77
granular film. Finally, the key results and novelty of the research are summarized in Section 6.
3
78
2. Fundamentals of the SPH method
79
2.1. Function approximations in SPH
80
The derivation of the SPH methodology can be divided into two steps. The first step involves a kernel approxi-
81
mation, whereby all the physical variables are approximated using an integral transform involving the convolution of
82
the physical variable with a chosen kernel function. To this purpose, the kernel approximation of a function f (x) at
83
position x = (x, y, z) in a physical domain Ω is h f (x)i =
84
85
Z Ω
f (x0 )W(x − x0 , h)dx0 ,
(1)
where W is the kernel function and h is the smoothing length. The kernel approximation of the gradient of the function ∇ f (x) can also be obtained in the same manner to give Z h∇ f (x)i = − f (x0 )∇W(x − x0 , h)dx0 . Ω
(2)
86
The Wendland C2 function is used as the kernel function W for the SPH simulations conducted herein. This kernel
87
function assumes the form
88
89
q 4 1 − 2 (2q + 1) 0 ≤ q < 2 ; W(q, h) = αd 0 q≥2,
(3)
where q is the Euclidean distance r normalized by the kernel width h so q = r/h. The coefficient αd = 7 4πh2 in a two-dimensional space and αd = 21 16πh3 in a three-dimensional space. In SPH, the kernel representation for a
90
function and its gradient as exhibited above can be approximated by a Riemann summation over the particles used in
91
the simulation, so h f (xi )i =
N X
f (x j )Wi j
j=1 92
and h∇ f (xi )i = −
N X j=1
mj , ρj
f (x j )∇Wi j
(4)
mj , ρj
(5)
93
where ρ j and m j are the density and mass of particle j, respectively, and N is the number of particles. For notational
94
convenience, W(xi − x j , h) is abbreviated as Wi j .
95
The second step of the SPH methodology involves solving the equations governing granular impact; namely, the
96
continuity and momentum equations. Within the SPH framework, these equations can be discretized with the kernel
97
approximation, respectively, as follows: N X ∂Wi j dρi m j vαi j · = , (a) dt ∂xiα j=1 N σαβ + σαβ ∂W X i j ij dvαi = m + Πi j δαβ , (b) j dt β ρi ρ j ∂x j=1
(6)
i
98
where ρi , vi , pi , mi are the density, velocity, pressure, and mass of particle i; d( · ) dt denotes the time derivative of a
99
physical quantity; and, vαi j = vαi −vαj . The term Πi j in the momentum equation is the Monaghan-type artificial viscosity. 4
100
2.2. Artificial viscosity
101
In order to smooth potential unphysical oscillations, to prevent unphysical particle-particle penetrations, and to
102
stabilize the numerical solutions, a Monaghan-type artificial viscosity is incorporated into the momentum equation as
103
follows [23]: −αci j φi j +βφ2i j ρi j Πi j = 0
(7)
vij · xij ≥ 0 ,
+ c j ), ρi j =
+ ρ j ), hi j =
+ h j ), vij = vi − vj , and
104
where φi j = hi j vij · xij
105
xij = xi − xj . Here, ci is the speed of sound associated with the particle i; α and β are constant coefficients that have
106
values of about 1.0; and, ϕ = 0.1hi j is applied in order to prevent any unphysical penetration of two particles.
107
2.3. Neighboring particle search
|xij |2 + ϕ2 , ci j =
vij · xij < 0 ;
1 2 (ci
1 2 (ρi
1 2 (hi
108
Owing to the fact that calculation of any physical variable at a given position in the SPH methodology involves the
109
summation over nearest neighbor particles to that position, a critical issue related to the computational performance
110
of SPH involves the formulation of an efficient algorithm for searching for the neighboring particles at a specified lo-
111
cation in a problem domain. A number of different approaches has been proposed for determination of a dynamically
112
evolving list of neighbouring particles required for the computation of sums in SPH. In this paper, the cell-linked list
113
(CLL) is used for the search of these neighborhood sets or lists (see Fig. 1) [24]. In the CLL approach for a neighbor-
114
hood search, the computational domain is divided into cells with a side length of 2h. A particle in the simulation is
115
referenced with respect to the cell that it belongs to. In consequence, this action results in a list of particles that have
116
been reordered with respect to the uniform grid of cells used in the discretization of the computational domain. For the
117
calculation of the rates of change of a physical variable in SPH, each particle i in a particular cell of the computational
118
domain only needs to query the cells adjacent to this particular cell in order to find all the nearest neighbor particles
119
j of particle i. More specifically, if the distance between particle i and particle j is less than 2h (cut-off limit), then
120
particle j is a genuine neighbor of the given particle i.
121
2.4. Boundary conditions and impact force calculation
122
Because SPH is a Lagrangian method, the imposition of a specific boundary condition is problematic. A number
123
of approaches has been proposed for the specification of boundary conditions in SPH [25, 26, 27]. In this paper,
124
a methodology involving the introduction of dummy particles is used for the imposition of the no-slip boundary
125
condition along a solid (wall) surface [28] (see Fig. 2). The velocity of the solid wall boundary vw is assigned to the
126
velocity of the dummy particles vd (viz., vd = vw ). For the calculation of the rate of change of a physical quantity
127
associated with a real particle κ, the components of the stress tensor of the dummy particles are extrapolated using the
5
128
values of the neighboring granular particles, so N X
σidj =
σiκj W(rdκ ) − (gi − aid )
κ=1
N X
i ρκ rdκ W(rdκ )δi j
κ=1 N X
,
(8)
W(rdκ )
κ=1 129
where subscripts d and κ are used to denote the dummy and granular particles, respectively. Moreover, ad is the
130
prescribed acceleration of the dummy particles; g is the gravitational acceleration; rdκ is the displacement vector
131
from the granular particle κ to the dummy particle d; rdκ is the Euclidean distance between the granular particle κ
132
and the dummy particle d, so rdκ = ||rdκ ||; δi j is the Kronecker delta function (equivalent to the identity matrix in
133
three dimensions); and, N denotes the number of granular particles within the support domain (taken to be a spherical
134
region with a radius that is some multiple k of the kernel smoothing length h) of the dummy particle d.
135
When calculating the impact force of a granular material on the boundary, we first use the extrapolated stresses
136
of the dummy particles given above to determine the interaction force on each dummy particle through application of
137
the momentum equation as follows [29]: Fdα = md
X
mi
σαβ + σαβ i
d
ρd ρi
i
+ Πdi δαβ
∂W
di
∂xdβ
.
(9)
138
Following this calculation, the impact force FW exerted on the solid wall boundary can be calculated as the summation
139
of the forces Fd on the dummy particles, so FW =
XX d
md mi nα
σαβ + σαβ i
d
ρd ρi
i
+ Πdi δαβ
∂W
di
∂xdβ
,
(10)
140
where the subscripts i and d denote the granular and dummy particles, respectively; and, n is the normal vector to the
141
solid wall boundary.
142
2.5. Physical variables update
143
The evolution of the various physical variables is obtained by integrating equation (6) with respect to the time t.
144
To this purpose, a second-order predictor-corrector scheme is used to update the physical variables. For the prediction
145
step, the position xi , velocity vi , and density ρi associated with particle i are updated as follows: t+ ∆t2
xi
= xti +
∆t t v , 2 i
(11)
= vit +
∆t t a , 2 i
(12)
146
t+ ∆t2
vi 147
and t+ ∆t2
ρi
= ρti + 6
∆t dρi 2 dt
!t .
(13)
The stress tensor associated with particle i can be updated in a similar manner, so !t ∆t dσi t+ ∆t2 t = σi + . σi 2 dt
(14)
For the correction step, the various physical variables are updated in accordance to
148
t+ ∆t2
= xti + ∆tvi xt+∆t i
,
(15)
,
(16)
149
t+ ∆t2
vit+∆t = vit + ∆tai 150
and
!t+ ∆t dρi 2 , (17) = + ∆t dt where ai is the acceleration of particle i, which is calculated using equation (6(b)). By contrast, in the correction step, ρit+∆t
ρti
the stress tensor is updated using the value of the stress tensor obtained at end to the prediction step (half-time step) in order to simplify the computation and minimize the storage required in the GPU memory. As a consequence, !t+ ∆t ∆t dσi 2 t+ ∆t2 t+∆t . (18) σi = σi + 2 dt 151
Finally, the Courant-Friedrichs-Lewy (CFL) condition is imposed on this explicit time integration scheme in order
152
to guarantee the stability of this second-order predictor-corrector scheme. To this purpose, the time step for the
153
integration is chosen as ! 1/2 h ∆t = 0.2 min , h/c , (19) max ||ai || where c is the speed of sound for the granular material. The speed of sound is determined in accordance to c =
154
155
(E/ρ)1/2 , where E is Young’s modulus of the granular material.
156
3. Constitutive equations for the modeling of granular impact
157
158
A constitutive equation is required in order to calculate the stress tensor σ and to close the governing equations. Towards this objective, the stress tensor is decomposed into two parts as follows: σαβ = −pδαβ + S αβ ,
(20)
where −p and S are the pressure and the deviatoric stress tensor, respectively. Normally, the pressure −p is calculated using an equation of state (EOS), while the deviatoric stress S is obtained through a constitutive equation. However, in the simulations conducted herein, the stress tensor σ is calculated directly using a constitutive equation and, subsequently, the pressure is obtained from the mean stress without having to use an EOS. More specifically, p=−
σαα 1 = − σ11 + σ22 + σ33 , 3 3
(21)
159
where σ11 , σ22 , and σ33 are the components of the normal stress tensor in x, y and z directions, respectively. Both the
160
phase-change and elastoplastic constitutive models are adapted for use in our SPH simulations in order to update the
161
stress tensor σ. 7
162
3.1. Phase-change constitutive model
163
The phase-change constitutive model proposed by Dunatunga and Kamrin [15] is implemented within the SPH
164
framework for the first time. This constitutive model allows the granular media to change through several com-
165
mon phases (viz., the grains can behave like a solid, liquid or gas). The granular matter is considered to be elasto-
166
viscoplastic when the media is sufficiently dense and is treated as a disconnected material when the density of the
167
material is smaller than a critical value. In order to calculate an objective rate of change for the stress tensor σ, ˚ the
168
Jaumann frame of reference is used: σ˚ = σ˙ + σ ω ˙ − ωσ ˙ ,
169
170
(22)
where σ˙ is the stress rate tensor and ω ˙ is the spin tensor. Now, the components of the trial stress tensor σαβ tr can be calculated as ! ! 1 αβ γγ αγ βγ γβ αγ αβ γγ αβ σαβ = ∆t σ ω ˙ + σ ω ˙ + 2G ε ˙ − δ ε ˙ + K ε ˙ δ + σαβ , tr 3
(23)
where ε˙ is the strain rate tensor. The shear modulus G and the elastic bulk modulus K are determined as follows: K=
E , 3(1 − 2ν)
(24)
G=
E , 2(1 + ν)
(25)
and
171
where E is Young’s modulus and ν is Poisson’s ratio.
172
Using the trial stress tensor determined above, the procedure for updating the stress consists of the following steps.
173
1. If the particle density ρn+1 is less than the critical value ρc , then set the stress tensor of the particle to σ n+1 = 0
174
175
for the next time step. 2. For all other cases, if the trial pressure ptr is negative, then σ n+1 = 0 and ptr = − 31 tr σtr . 1/2 3. Otherwise, the stress tensor is computed as follows. If τtr ≤ S 0 where τtr = 21 (Str : Str ) and S 0 = µ s ptr , the granular material is in the elastic regime, so set σ n+1 = σtr . If τtr > S 0 , the granular material is in the plastic flow regime, and τn+1 is determined from
τn+1
2
− Bτn+1 + H = 0 .
(26)
This quadratic equation for τn+1 can be solved explicitly to give τn+1 =
2H B+
B2
− 4H
1/2 ,
(27)
where B and H are obtained from B = S 2 + τtr + α and H = S 2 τtr + S 0 α. Furthermore, S 2 and α are determined from S 2 = µ2 ptr and α = ξG∆tptr , where ξ is a parameter of the granular media, µ s is the friction coefficient at zero shear rate, and µ2 is the maximum value of the friction coefficient. The stress tensor at the next time step is given by σ n+1 =
τn+1 Str − ptr I . τtr 8
(28)
176
3.2. Elastoplastic constitutive model
177
The classical elastoplastic constitutive model has also been incorporated into SPH framework to model the gran-
178
ular impact. The results obtained from this constitutive model will be compared with those obtained from the phase-
179
change constitutive model. In the elastoplastic constitutive model, the plastic deformation occurs only if the Drucker-
180
Prager criteria is satisfied. This criteria is given by f (I1 , J2 ) = J21/2 − ϕφ I1 − kc = 0 ,
(29)
181
where I1 = σαα and J2 = 21 S αβ S αβ are the first invariant of the stress tensor and the second invariant of the deviatoric
182
stress tensor, respectively. The constants ϕφ and kc in the Drucker-Prager model are calculated from the cohesion c
183
and the internal friction φ as ϕφ =
184
tan φ 9 + 12 tan2 φ
(30)
and 3c
kc =
(31) . 9 + 12 tan2 φ 1/2 The Jaumann frame of reference is used also with respect to the elastoplastic constitutive model. The final stress-
185
186
1/2 ,
strain relationship of the granular flow model evolves in accordance to dσαβ 1 αβ γγ G αβ αγ βγ γβ αγ αβ γγ αβ i αβ = σi ω ˙ i + σi ω ˙ i + 2G ε˙ i − δi ε˙ i + K ε˙ i δi − λ˙ i 9K sin ψδ + 1/2 S i , dt 3 J2
(32)
187
where ψ is the dilatancy angle. The parameters K and G can also be obtained from Eqs (24) and (25), respectively.
188
The plastic multiplier λ˙ i is determined from λ˙ i =
189
1/2 αβ αβ 3ϕφ K ε˙ γγ S i ε˙ i i + G/J2 . 27ϕφ K sin ψ + G
The strain rate tensor and spin tensor are approximated within the SPH framework as follows: N 1 X m j α ∂Wi j αβ β ∂Wi j ε˙ i = v ji β + v ji α , 2 j=1 ρ j ∂xi ∂x
(33)
(34)
i
190
and ω ˙ αβ i
N
1 X mj = 2 j=1 ρ j
α ∂Wi j β ∂Wi j v ji − v ji α . β ∂xi ∂xi
(35)
191
The reader is referred to Bui et al. [3] for a more detailed description of the stress-strain relationship in the context of
192
the elastoplastic constitutive model.
193
4. GPU implementation
194
4.1. GPU solution procedure
195
The application of SPH for the simulation of granular flow in three dimensions is computationally intensive owing
196
to the large number of particles required for such simulations. As a consequence, to enable these types of simulations 9
197
to complete in a reasonable execution time, it is imperative to accelerate the computational workload by parallelization
198
of the SPH simulations on a graphics processing unit (GPU). The parallelization of the SPH methodology on a GPU
199
is depicted in Fig. 3. There are a number of steps involved in the implementation of the SPH methodology on a GPU.
200
Firstly, the data describing the initial configuration of the granular impact is loaded into the central processing unit
201
(CPU). Next, this information is transferred from the CPU to the GPU. The GPU is used to facilitate the following
202
operations: building the neighborhood lists and searching these neighborhood sets (which necessarily involve dynam-
203
ically changing lists of neighboring particles), determination of the particle interactions, and update of the physical
204
variables which involve the computation of sums over the dynamically evolving neighborhood sets. At the end of
205
this computational process, the information computed by the GPU is transferred from the GPU to the CPU and the
206
relevant numerical data is saved. All the SPH simulations of granular flow reported herein were conducted on a PC
207
with a single NVIDIA Quadro K620 graphics processor. The performance of the in-house SPH code used for our
208
simulations, such as the accuracy and computational efficiency, is described in detail in [29, 30].
209
4.2. Pseudocode for determination of particle interactions on GPU
210
The computation of the time development of the physical variables is a compute-intensive process that must be
211
parallelized in order to obtain a good computational performance for an SPH simulation. In consequence, the use of
212
the CUDA programming interface in our in-house SPH code is a key aspect in the acceleration of the computational
213
workload on a GPU. A pseudocode for the algorithm used for the computation of the particle interactions on a GPU
214
is summarized in Algorithm 1 below. The implementation utilizes the CUDA platform (viz., the parallel computing
215
platform and application programming interface developed by NVIDIA for their GPUs).
216
5. Numerical cases
217
5.1. Axisymmetric collapse of dry sand
218
We first test the phase-change and elastoplastic constitutive models for SPH simulations of the 3D axisymmetric
219
collapse of dry sand. Lube et al. [31] conducted experiments of the axisymmetric collapse of columns of dry sand
220
that was initially contained within a cylindrical column resting on a flat surface (see Fig. 4). The nature and mode of
221
collapse of these granular columns is determined by the initial column height-to-halfwidth ratio a = h0 /d0 .
222
Four different cases involving different ratios of a were simulated using our SPH code and the results of these
223
simulations were compared to the experimental results in order to validate the two constitutive models. Table 1 sum-
224
marizes the basic information for each simulation. The parameters for the elastoplastic and phase-change constitutive
225
models are listed in Table 2. These parameters were used for all subsequent SPH simulations reported herein, unless
226
noted otherwise.
227
The magnitude of the velocity of the granular collapse at two different ratios a = 0.5 and 4.0, obtained using the
228
phase-change constitutive model, is shown in Fig. 5. It can be seen that in the final phase of the granular collapse, 10
229
the top surface of the shorter and stouter sand column is flatter than that of the taller and more slender sand column.
230
Qualitatively, it is noted that the overall general shape of the sand pile in the final phase (steady state) of collapse
231
predicted by the simulation is similar to that obtained from the experimental measurements conducted by Lube et
232
al. [31].
233
The stress distributions of the sand collapse using the elastoplastic constitutive model are exhibited in Fig. 6.
234
The stress in vertical direction yy is proportional to the depth of the sand. This is consistent with the vertical stress
235
distribution of the sand confined in a cylindrical column and subjected to a gravitational force. Furthermore, the
236
numerical results for the runout distance d∞ and the final height h∞ obtained using the phase-change and elastoplastic
237
constitutive models are summarized in Figs 7 and 8, respectively. Superimposed on these two figures are some
238
experimental measurements of d∞ and h∞ obtained by Lube et al. [31]. A perusal of Figs 7 and 8 shows that the
239
numerical predictions for d∞ and h∞ are generally in good conformance with the experimental results. Finally, it is
240
noted that the quality of the current numerical simulations of 3D granular collapse obtained using the SPH framework
241
with both the phase-change and elastoplastic constitutive models are comparable to the 2D sand collapse simulations
242
obtained by Dunatunga and Kamrin [15] using the MPM framework with the phase-change constitutive model.
243
We close this subsection with an evaluation of the computational efficiency of our GPU-accelerated SPH code.
244
To this purpose, we will measure the compututational efficiency of the implementation of our GPU-accelerated SPH
245
code in terms of frames per second (FPS), which is the number of computational time steps executed in one second
246
of wall-clock time (see Table 3 which summarizes the FPS and total wall-clock execution time T for four test cases
247
involving the simulation of the collapse of a sand pile with an initial aspect ratio of a = 0.5). The total physical time
248
of the simulation for all the test cases summarized in Table 3 is fixed at a value 0.1 s. A careful examination of this
249
table shows that the FPS decreases as the number of particles used in the simulation increases. More specifically, it
250
is seen that for the case that uses nearly 500,000 particles in the simulation, the FPS achieved (≈ 6.8 time steps per
251
second) is still very good. As mentioned previously, the simulations reported here were conducted on a PC with a
252
single NVIDIA Quadro K620 graphics processor. Note that this graphic card was released in 2014, five years ago. It
253
is an outdated card and only has 384 cuda cores. It is expected that the computational performance reported here will
254
be significantly improved on a more powerful NVIDIA graphics processor, such as the NVIDIA GeForce RTX 2080
255
Ti.
256
In order to characterize the acceleration in computational performance provided by the use of a GPU, the SPH
257
simulations summarized in Table 3 were also conducted on a PC with a single CPU (AMD Opteron Processor 6320).
258
The speed-up in computational performance between the GPU and CPU is defined as Sp =
FPS on a single GPU . FPS on a single CPU
(36)
259
Note that the time required for the input of the data files and the output of the simulation results is excluded from the
260
execution time. The speed-up in the computational performance of the SPH code executed on a single GPU versus
261
that on a single CPU is displayed in Fig. 9 for five test cases (four of which have been summarized in Table 3). It 11
262
is seen that a speed-up of up to about a factor of 160 is achieved as the number of particles N used in the simulation
263
increases to about 750,000.
264
We can compare our GPU-accelerated SPH implementation with an open-source SPH code. To this purpose, it
265
is noted that a granular flow cannot be simulated directly using either the open-source code SPHysics [32] or the
266
code developed by Liu and Liu [33] owing to the fact that there is no constitutive model for soil in these codes.
267
In consequence, we have implemented a soil constitutive model in the serial open-source SPH code of Liu and Liu
268
[33] and used it as the basis for comparison with the computational efficiency of our GPU-accelerated SPH code
269
implementation. Towards this objective, we found that the computational efficiency of our GPH-accelerated SPH
270
code is 170 times faster that the serial open-source code of Liu and Liu [33] when 170,000 particles were used
271
in the simulation. This speedup suggests that our GPU-accelerated SPH implementation has achieved a high parallel
272
computational efficiency, in spite of the fact that the simulations reported herein were conducted with a rather outdated
273
graphics card.
274
5.2. Impact force of sand on a rigid wall
275
Having completed the validation of the phase-change and elastoplastic constitutive models in the SPH framework
276
using a case study involving the axisymmetric collapse of dry sand, this numerical framework will now be applied
277
to investigate the phenomenon of granular impacts. Moriguchi et al. [34] conducted experimental measurements of
278
the impact force exerted on a rigid wall resulting from the impingement of dry Toyoura fine sand released from an
279
inclined flume. The sand was initially confined inside a box at the top of the flume. This material was released through
280
a side door, and the sand impacted a rigid wall at the bottom of the flume. The impact force exerted by the sand on
281
the rigid wall was measured as a function of time using a sensor attached to the wall barrier.
282
In this subsection, we will conduct a 3D numerical simulation of this experiment using SPH with the phase-change
283
constitutive model. The initial configuration for the simulation is shown in Fig. 10. The length and height of the sand
284
box are 50 cm and 30 cm, respectively. The total mass of the sand contained initially in the box is 50 kg. The density
285
of the granular material (sand) is ρ0 = 1340 kg m−3 and the critical density is ρc = 500 kg m−3 . The initial particle
286
spacing used in the simulation is 10 mm. A total of 123,012 particles were used in the SPH simulation. Three layers
287
of dummy particles are used to impose the no-slip boundary condition along the solid wall of the incline. The total
288
physical time simulated using the SPH methdology was 3 s. The computational (wall-clock) time required to complete
289
this simulation was 72 minutes.
290
The results for the simulation of granular impact down a 45◦ inclined flume obtained using SPH with the phase-
291
change constitutive model is presented in Fig. 11. The sand moves downward gradually owing to the gravitational
292
force. The magnitude of the velocity (speed) of the sand increases from zero to almost 4.5 m s−1 during this movement.
293
After about 0.70 s after the initial release, the sand impacts the rigid wall and stops almost immediately after the
294
impact. In consequence, the sand at the bottom of the flume accumulates at the rigid wall and this pile of sand grows
295
in size with increasing time. At the end of the process, the pile of sand reaches the top of the wall barrier and flows 12
296
out of the flume. This dynamic behavior of the sand obtained from the numerical simulations is in general agreement
297
with both the experimental measurements reported by Moriguchi et al. [34] and the simulations conducted by Neto
298
and Borja [35]. The dynamic behavior of the sand obtained in the SPH framework using the elastoplastic constitutive
299
model is similar to that obtained using the phase-change constitutive model. However, it is noted that using the
300
elastoplastic constitutive model, the sand is predicted to impact the rigid wall at 0.65 s after the initial release, which
301
occurs earlier than the prediction provided by the phase-change constitutive model.
302
303
304
The impact force exerted by the sand on the rigid wall is calculated using equation (10). This equation required the specification of the normal vector n = (nα ). For the current case, the vector normal to the rigid wall at the bottom √ √ of the flume for an inclination angle of 45◦ is n = 22 , 0, 22 . Fig. 12 exhibits the predicted time variation of the
305
impact force obtained from our SPH simulations using both the phase-change and elastoplastic constitutive models.
306
The predicted peak impact force obtained using the phase-change constitutive model is 192 N. This predicted value
307
for the peak impact force is larger than the measured value of the peak impact force obtained by Moriguchi et al. [34].
308
More specifically, the overprediction of the peak impact force is about 3.2%. It is noted that the prediction of the
309
peak impact force obtained from our SPH simulations with the phase-change constitutive model is significantly better
310
that obtained from either our SPH simulations with the elastoplastic constitutive model or the numerical simulations
311
conducted by Neto and Borja [35]. In the latter two cases, the predicted peak value of the impact force was about
312
212 N which significantly overpredicts the measured value of the peak impact force.
313
After the rapid initial increase to its peak value, the impact force decreases gradually to value of about 160 N.
314
This decrease occurs because some of the sand flows out of the bottom boundary and the velocity of the sand is
315
reduced after its initial impact with the rigid wall. In general, it is noted that SPH simulations with the phase-change
316
constitutive model is capable of capturing the dynamical behavior of the impact of the sand against the rigid wall.
317
Furthermore, these simulations give generally good predictions for the time variation of the impact force. The slight
318
overestimation of peak impact force is probably due to a force contribution from the soil viscosity and the boundary
319
implementation used here. The effects of the soil viscosity, which has the tendency to slow down the granular flow
320
and reduce the impact force, has not been incorporated in our numerical simulations. In addition, it is expected once
321
a more appropriate model for the calculation of the stress tensor of the boundary particles has been developed, its
322
inclusion in our numerical simulation will lead probably to more accurate predictions of the impact force.
323
5.3. Head-on collisions of dense granular jets
324
In this subsection, we investigate the 3D head-on collisions of dense granular jets. A study of the physical mech-
325
anisms involved in the collisions of granular jets will enhance our understanding of the loads applied to structures by
326
the impact of granular matter on them at high velocity. Ellowitz [12] conducted 2D simulations of head-on collisions
327
of dense granular jets having equal speeds using DEM. Here, we extend this effort to consider fully 3D simulations
328
of dense granular jet collisions using the SPH framework. To this purpose, two granular jets having different sizes
329
will be considered in our simulations: namely, the diameter of the first jet is taken to be 2R and that of the second 13
330
jet is taken to be 2kR, where 0 ≤ k ≤ 1. The centers of these two jets are assumed to be aligned, and their initial
331
velocities are both v0 in magnitude. These two jet velocities are in opposite directions, so that the two jets undergo
332
a head-on collision. The ejecta angle ψ0 , which is the maximum angle between the tangent line of the surface of the
333
jet to the horizontal direction, and the velocity u0 of the impact center point at the interface of the two jets are used to
334
characterize the impact process (see Fig. 13). The initial configurations used for three different simulation scenarios
335
for head-on collisions of the two granular jets are summarized in Table 4.
336
Fig. 14 shows the predicted results obtained using SPH with the phase-change constitutive model for the head-on
337
collision of two granular jets with the initial configuration specified by scenario 1 (cf. Table 4). It is noted that the
338
ejecta angle of the granular jet with the larger diameter increases during the impact process, and its configuration after
339
the impact is similar to the 2D geometry described in Ellowitz [12]. The impact center drifts at a constant speed of
340
about u0 = 0.14 m s−1 , which is consistent with the simulations described in [12]. Fig. 15 exhibits the stress and strain
341
distributions obtained using SPH with the phase-change constitutive model at a time of 0.02 s. A persual of this figure
342
shows that the stress in the horizontal direction assumes the largest values within the impact area. The shear strain
343
and stress in the yz direction are antisymmetric with respect to horizontal x direction.
344
The effect of different velocities of the two jets (as characterized by the coefficient k) on the velocity of the impact
345
center has also been studied. To this purpose, the initial configurations for the colliding granular jets are summarized
346
in scenarios 2 and 3 (cf. Table 4). The constant velocity u0 of the impact center is 0 m s−1 for scenario 2 where the
347
radii of the two jets were equal (viz., k = 1) and 3.96 m s−1 for scenario 3 where the radius of one jet is 1/4 that of
348
the other jet (viz., k = 0.25). It seen that a smaller value for k results in a larger value for the velocity u0 of the impact
349
center, a result that is in conformance with the numerical simulations conducted by Ellowitz [12]. The results of our
350
numerical simulations have shown that the application of the SPH methodology with the phase-change constitutive
351
model provides a good alternative scheme to study the impact of two granular jets.
352
In order to demonstrate the ability of the proposed SPH methodology to capture phase changes in the collision
353
of two granular jets, we set the initial density of the disconnected granular jets to have a value of ρ = 1200 kg
354
m−3 . This value for the initial jet density is less than the critical density which has a value of ρc = 1500 kg m−3 .
355
Otherwise, initial conditions for this case are exactly the same as those of scenario 3. The results of the simulation
356
of the impact of the two granular jets for this case obtained using SPH with the phase-change constitutive model
357
are displayed in Fig. 16. An examination of this figure shows that the two disconnected jets undergoes a process of
358
densification after they collide with the each other. This simulation highlights the ability of the SPH methodology
359
to model large inhomogeneous deformations of the granular material. These results suggest that the effects of the
360
granular particle size and fracture on the interaction of high-velocity granular slugs can be investigated using the
361
proposed SPH methodology.
14
362
5.4. The impact of granular jet with a wave structure on a granular film
363
An experimental study of the liquid-like wave structures on a granular film resulting from a granular jet impact
364
has been described by Shi et al. [36]. In this study, the initial configuration of the particles is a sinusoidal wave. In
365
particular, the motion of the center particles at the jet exit is characterized by y = A0 sin (2π f0 t) ,
(37)
366
where A0 and f0 are the amplitude and frequency of the initial granular jet, respectively, and t is the time. In this
367
subsection, we simulate the liquid-like wave structure on a granular film using the SPH methodology with both the
368
phase-change and elastoplastic constitutive models.
369
To this purpose, the wave structure generated by an amplitude of A0 = 0.002 m and a frequency of f0 = 160 Hz
370
for the initial granular jet has been investigated using the SPH methdology. The initial particle spacing is 0.002 m and
371
800,000 particles were used in this simulation. The initial velocity of the granular jet is 4.0 m s−1 . Figure 17 exhibits
372
the experimental wave structure on the granular film measured by Shi et al. [36] and that obtained from the SPH
373
simulations using both the elastoplastic and phase-change constitutive models. It is noted that the wave structures
374
obtained from the two simulations are in very good qualitative agreement with the experimental results.
375
The reason for the wave structures seen in Fig.
17 is that the initial configuration of the granular media is
376
antisymmetric with respect to vertical z-direction and the periodic oscillation of the resulting disturbance results in
377
the generation of wave structure. Furthermore, the effects of the amplitude and the frequency of the disturbance
378
in the granular jet on the wave structure have also been investigated using SPH similations. The results of these
379
investigations are summarized in Fig. 18. In particular, Fig. 18 (a1–a4) exhibit the wave structures for a frequency of
380
f0 = 160 Hz for a series of increasing values of the amplitude A0 : namely, for A0 = 0.0005 m, 0.001 m, 0.002 m,
381
and 0.003 m. Similarly, Fig. 18 (b1–b4) display the wave structure for an amplitude of A0 = 0.002 m for a series
382
of increasing frequencies f0 : namely, for f0 = 80 Hz, 120 Hz, 160 Hz, and 240 Hz. A careful examination of these
383
results shows that the wave structure disappears when the amplitude of the disturbance decreases below a certain
384
value as the disturbance then becomes too small to generate a wave structure (cf. Fig. 18 (a1)). The wavelength of the
385
wave structure on the granular film increases with a decrease in the frequency of the disturbance as is evident from a
386
perusal Fig. 18 (b1–b4). The results reported here demonstrate that the SPH methodology used in conjunction with
387
the phase-change and elastoplatic constitutive models can be used to predict the nature of the wave structures on a
388
granular film.
389
6. Conclusion
390
Several investigations were conducted in the application of a GPU-accelerated SPH methodology for the 3D
391
simulation of granular impact problems involving a large number of particles. To this purpose, the elastoplastic
392
and phase-change constitutive models have been incorporated in the SPH code. The CUDA parallel programming 15
393
interface for an NVIDIA GPU has been used to improve the computational efficiency of the SPH computations for
394
granular flows. To demonstrate the feasibility of the proposed approach, SPH simulations were conducted for four
395
different cases involving granular flow. The results of these simulations have been compared to available experimental
396
data and to numerical data obtained from alternative simulation methdologies.
397
The key conclusions of this study are as follows:
398
• Fully 3D granular impact problems involving large deformations can be addressed successfully using the SPH
399
methodology with either the phase-change or the elastoplastic constitutive models. To the best of our knowl-
400
edge, this is the first time that the phase-change constitutive model has been implemented within SPH frame-
401
work and successfully used for the simulation of 3D granular impact problems.
402
• The SPH methodology can be used to efficiently simulate granular impact problems with a large number of
403
particles on readily available commodity GPUs. The 3D impact of a granular jet with a wave structure on the
404
granular film and the head-on collisions of dense granular jets have been successfully simulated using the SPH
405
methodology for the first time.
406
407
• An SPH simulation is about 160 times as computationally efficient as the same simulation run on a single CPU even with a seriously outdated graphic card.
408
It is expected that a number of alternative constitutive models (e.g., Mohr-Coulomb model) will be implemented
409
in the future in order to investigate the influence and impact of the constitutive models in obtaining high-fidelity SPH
410
simulations of the dynamical behavior of granular media. Finally, our current implementation of the SPH methodology
411
for granular flow can be extended so that it can be executed either on multiple GPUs on a single computer or on
412
multiple GPUs on different computational nodes.
413
Acknowledgements
414
The first author is supported by the China Scholarship Council (No. 201506030072) and the Natural Sciences
415
and Engineering Research Council of Canada (NSERC). This work was made possible by the facilities of the Shared
416
Hierarchical Academic Research Computing Network (SHARCNET) and Compute/Calcul Canada.
417
418
419 420
References [1] R. A. Gingold, J. J. Monaghan, Smoothed particle hydrodynamics: Theory and application to non-spherical stars, Monthly Notices of the Royal Astronomical Society 181 (3) (1977) 375–389.
421
[2] L. B. Lucy, A numerical approach to the testing of the fission hypothesis, The Astronomical Journal 82 (1977) 1013–1024.
422
[3] H. H. Bui, R. Fukagawa, K. Sako, S. Ohno, Lagrangian meshfree particles method (SPH) for large deformation and failure flows of geo-
423
material using elastic–plastic soil constitutive model, International Journal for Numerical and Analytical Methods in Geomechanics 32 (12)
424
(2008) 1537–1570.
16
425
[4] H. H. Bui, K. Sako, R. Fukagawa, J. C. Wells, SPH-based numerical simulations for large deformation of geomaterial considering soil-
426
structure interaction, in: Proceeding of the 12th International Conference of International Association for Computer Methods and Advances
427
in Geomechanics, IACMAG, 2008, pp. 570–578.
428
[5] C. Peng, W. Wu, H. S. Yu, Hypoplastic constitutive model in SPH, in: Computer Methods and Recent Advances in Geomechanics: Pro-
429
ceedings of the 14th International Conference of International Association for Computer Methods and Recent Advances in Geomechanics,
430 431 432 433 434 435 436 437 438 439 440 441 442
IACMAG, 2014, pp. 1863–1868. [6] C. Peng, X. G. Guo, W. Wu, Y. Q. Wang, Unified modelling of granular media with smoothed particle hydrodynamics, Acta Geotechnica 11 (6) (2016) 1231–1247. [7] H. F. Fan, G. L. Bergel, S. F. Li, A hybrid peridynamics – SPH simulation of soil fragmentation by blast loads of buried explosive, International Journal of Impact Engineering 87 (2016) 14–27. [8] H. F. Fan, S. F. Li, A peridynamics-SPH modeling and simulation of blast fragmentation of soil under buried explosive loads, Computer Methods in Applied Mechanics and Engineering 318 (2017) 349–381. [9] J. Y. Chen, F. S. Lien, Simulations for soil explosion and its effects on structures using SPH method, International Journal of Impact Engineering 112 (2018) 41–51. [10] J. Y. Chen, C. Peng, F. S. Lien, Simulations for three-dimensional landmine detonation using the SPH method, International Journal of Impact Engineering 126 (2019) 40–49. [11] F. Pacheco-V´azquez, J. Ruiz-Su´arez, Impact craters in granular media: Grains against grains, Physical Review Letters 107 (21) (2011) 218001.
443
[12] J. Ellowitz, Head-on collisions of dense granular jets, Physical Review E 93 (1) (2016) 012907.
444
[13] Z. H. Shi, W. F. Li, Y. Wang, H. F. Liu, F. C. Wang, DEM study of liquid-like granular film from granular jet impact, Powder Technology 336
445
(2018) 199–209.
446
[14] P. A. Cundall, O. D. Strack, A discrete numerical model for granular assemblies, Geotechnique 29 (1) (1979) 47–65.
447
[15] S. Dunatunga, K. Kamrin, Continuum modelling and simulation of granular flows through their many phases, Journal of Fluid Mechanics
448 449 450 451 452 453 454 455 456 457 458 459 460 461 462
779 (2015) 483–513. [16] T. Harada, S. Koshizuka, Y. Kawaguchi, Smoothed particle hydrodynamics on GPUs, in: Computer Graphics International, Vol. 40, SBC Petropolis, 2007, pp. 63–70. [17] A. C. Crespo, J. M. Dominguez, A. Barreiro, M. G´omez-Gesteira, B. D. Rogers, GPUs, a new tool of acceleration in CFD: Efficiency and reliability on smoothed particle hydrodynamics methods, PloS One 6 (6) (2011) e20685. [18] Y. He, A. E. Bayly, A. Hassanpour, F. Muller, K. Wu, D. Yang, A GPU-based coupled SPH-DEM method for particle-fluid flow with free surfaces, Powder technology 338 (2018) 548–562. [19] X. Xia, Q. Liang, A GPU-accelerated smoothed particle hydrodynamics (SPH) model for the shallow water equations, Environmental modelling and software 75 (2016) 28–43. [20] Q. Xiong, B. Li, J. Xu, GPU-accelerated adaptive particle splitting and merging in SPH, Computer Physics Communications 184 (7) (2013) 1701–1707. [21] N. Govender, D. N. Wilke, S. Kok, Blaze-DEMGPU: Modular high performance DEM framework for the GPU architecture, SoftwareX 5 (2016) 62–66. [22] Y. He, T. J. Evans, A. B. Yu, R. Y. Yang, A GPU-based DEM for modelling large scale powder compaction with wide size distributions, Powder Technology 333 (2018) 219–228.
463
[23] J. J. Monaghan, An introduction to SPH, Computer Physics Communications 48 (1) (1988) 89–96.
464
[24] J. M. Dom´ınguez, A. J. Crespo, M. G´omez-Gesteira, J. C. Marongiu, Neighbour lists in smoothed particle hydrodynamics, International
465
Journal for Numerical Methods in Fluids 67 (12) (2011) 2026–2042.
466
[25] J. J. Monaghan, Simulating free surface flows with SPH, Journal of Computational Physics 110 (2) (1994) 399–406.
467
[26] L. D. Libersky, A. G. Petschek, T. C. Carney, J. R. Hipp, F. A. Allahdadi, High strain Lagrangian hydrodynamics: A three-dimensional SPH
17
468 469 470 471 472 473 474 475 476 477 478 479 480
code for dynamic material response, Journal of Computational Physics 109 (1) (1993) 67–75. [27] S. Adami, X. Y. Hu, N. A. Adams, A generalized wall boundary condition for smoothed particle hydrodynamics, Journal of Computational Physics 231 (21) (2012) 7057–7075. [28] C. Peng, W. Wu, H. S. Yu, C. Wang, A SPH approach for large deformation analysis with hypoplastic constitutive model, Acta Geotechnica 10 (6) (2015) 703–717. [29] L. Zhan, C. Peng, B. Y. Zhang, W. Wu, Three-dimensional modeling of granular flow impact on rigid and deformable structures, Computers and Geotechnics 112 (2019) 257–271. [30] C. Peng, S. Wang, W. Wu, H. S. Yu, C. Wang, J. Y. Chen, LOQUAT: An open-source GPU-accelerated SPH solver for geotechnical modeling, Acta Geotechnica. [31] G. Lube, H. E. Huppert, R. S. J. Sparks, M. A. Hallworth, Axisymmetric collapses of granular columns, Journal of Fluid Mechanics 508 (2004) 175–199. [32] M. G´omez-Gesteira, A. J. Crespo, B. D. Rogers, R. A. Dalrymple, J. M. Dominguez, A. Barreiro, SPHysics–development of a free-surface fluid solver–part 2: Efficiency and test cases, Computers and Geosciences 48 (2012) 300–307.
481
[33] G. R. Liu, M. B. Liu, Smoothed particle hydrodynamics: a meshfree particle method, World scientific, 2003.
482
[34] S. Moriguchi, R. I. Borja, A. Yashima, K. Sawada, Estimating the impact force generated by granular flow on a rigid obstruction, Acta
483 484 485 486
Geotechnica 4 (1) (2009) 57–71. [35] A. H. F. Neto, R. I. Borja, Continuum hydrodynamics of dry granular flows employing multiplicative elastoplasticity, Acta Geotechnica 13 (5) (2018) 1027–1040. [36] Z. Shi, W. Li, H. Liu, F. Wang, Liquid-like wave structure on granular film from granular jet impact, AIChE Journal 63 (8) (2017) 3276–3285.
18
Algorithm 1 Computation of all particle interactions for particle i Output: Update the physical variables for particle i 1:
global void ParticleInteraction cuk(. . . );
2:
index = blockIdx.x ∗ blockDim.x + threadIdx.x;
3:
if (index < particle number) then
4:
Find the neighbour particles;
5:
Calculate the rate of change of the physical quantities;
6:
Update the physical variables;
19
Table 1: Summary of the initial configuration used for four different simulations of the 3D axisymmetric collapse of dry sand.
Case
h0 (mm)
d0 (mm)
a = h0 /d0
Initial particle spacing (mm)
Number of particles
1
0.10
0.20
0.5
4.00
482,430
2
0.20
0.20
1.0
5.00
451,969
3
0.40
0.20
2.0
6.25
530,109
4
0.80
0.20
4.0
8.00
475,329
Table 2: The parameters of the elastoplastic and phase-change constitutive models for the sand collapse problem [15].
Elastoplastic constants
Phase-change parameters
ρ0 (kg m−3 )
E (MPa)
ν
φ (◦ )
ψ (◦ )
c (kPa)
ρc (kg m−3 )
ξ (m1/2 kg−1/2 )
µs
µ2
2450
20.0
0.30
37.0
0.0
0.0
1500
1.1233
0.6419
0.8435
Table 3: Computational efficiency of the GPU-accelerated SPH code for simulation of sand collapse with an aspect ratio of a = 0.5.
Case
Initial particle spacing (mm)
Number of particles
FPS (s−1 )
T (s)
1
6.25
170,221
22.32
127
2
5.00
286,333
12.19
288
3
4.00
482,430
6.78
653
4
3.00
752,187
3.96
1324
Table 4: Initial configuration for three different simulations of head-on collisions of dense granular jets.
Scenario
R (mm)
k
v0 (m s−1 )
Initial particle spacing (mm)
Number of particles
1
0.10
0.50
10.0
4.00
369,000
2
0.10
1.00
10.0
4.00
592,800
3
0.10
0.25
10.0
4.00
314,400
20
487
488
489
List of figures: Figure 1. Sketch of the cell-linked list method for determination of the potential neighbors of a particle of interest (highlighted in blue).
490
Figure 2. Sketch of the dummy particles used to represent a solid wall boundary.
491
Figure 3. Flow diagram depicting the implementation of a 3D SPH methodology on a GPU.
492
Figure 4. Initial setup of the experiment and simulation for the 3D axisymmetric collapse of dry sand.
493
Figure 5. Predictions of time development of 3D sand collapse obtained using the phase-change model for a = 0.5
494
495
496
497
498
499
500
501
502
(a1–a4) and 4.0 (b1–b4). Figure 6. The predicted stress distributions obtained in the SPH framework using the elastoplastic constitutive model for a = 0.5. Figure 7. Predicted and experimental non-dimensionalized runout distance d∞ versus initial aspect ratio a. Experimental results are taken from Lube et al. [31]. Figure 8. Predicted and experimental non-dimensionalized runout distance h∞ versus initial aspect ratio a. Experimental results are taken from Lube et al. [31]. Figure 9. The speed-up on a single GPU compared to that on a single CPU of SPH simulations as a function of the number of particles N used in the simulation.
503
Figure 10. The initial configuration of sand on an inclined flume. All the dimensions shown here are in cm.
504
Figure 11. The magnitude of the velocity of granular impact on 45◦ inclined flume obtained using SPH with the
505
phase-change constitutive model at six different times: namely, at 0.2 s (a1), at 0.6 s (a2), at 1.0 s (a3), at 1.4 s (a4), at
506
1.8 s (a5), and at 2.2 s (a6).
507
Figure 12. Time variation of the impact force for a flume inclination of 45◦ .
508
Figure 13. A sketch of the ejecta angle and the velocity u0 of the impact center.
509
Figure 14. The magnitude of the velocity arising from the head-on collision of two dense granular jets predicted
510
511
512
513
514
using SPH with the phase-change constitutive model for an initial configuration given by scenario 1. Figure 15. The stress and strain distributions arising from head-on granular impact obtained using SPH with the phase-change constitutive model. Figure 16. Numerical simulation of the process of densification of two colliding granular jets obtained using SPH with the phase-change constitutive model.
515
Figure 17. Wave structures on a granular film obtained from (a) an experimental study [13] and from SPH sim-
516
ulations with (b) the phase-change and (c) the elastoplastic constitutive models. The numerical simulations were
517
conducted with initial amplitude of A0 = 0.002 m and a frequency of f0 = 160 Hz.
518
Figure 18. Wave structures on a granular film obtained at an initial frequency of f0 = 160 Hz and for amplitudes
519
of A0 = 0.0005 m (a1), 0.001 m (a2), 0.002 m (a3), and 0.003 m (a4) and at an initial amplitude of A0 = 0.002 m and
520
for frequencies of f0 = 80 Hz (b1), 120 Hz (b2), 160 Hz (b3), and 240 Hz (b4).
21
Highlight • A GPU-accelerated SPH code has been developed to simulate granular flows and impact. • An elastoplastic and a phase-change models were used to model granular materials. • Four cases have been tested to validate the GPU-accelerated SPH. • The numerical results were compared against a variety of published data. • A case run on a outdated GPU was 160 times more efficient than run on a single CPU.
Declaration of interests ☒ The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper. ☐The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: