Survey and analysis of multiresolution methods for turbulence data

Computers and Fluids 125 (2016) 39–58 Contents lists available at ScienceDirect Computers and Fluids journal homepage: www.elsevier.com/locate/compf...

Download PDF

4MB Sizes 0 Downloads 169 Views

Report

Full Text

Computers and Fluids 125 (2016) 39–58

Contents lists available at ScienceDirect

Computers and Fluids journal homepage: www.elsevier.com/locate/compfluid

Survey and analysis of multiresolution methods for turbulence data Jesus Pulido a,b,∗, Daniel Livescu b, Jonathan Woodring b, James Ahrens b, Bernd Hamann a a b

Institute for Data Analysis and Visualization, Department of Computer Science, University of California, Davis, CA 95616, USA Los Alamos National Laboratory, Los Alamos, NM 87544, USA

a r t i c l e

i n f o

Article history: Received 12 February 2015 Revised 12 September 2015 Accepted 3 November 2015 Available online 10 November 2015 Keywords: Turbulence Wavelet Curvelet Surfacelet B-spline wavelet

a b s t r a c t This paper compares the effectiveness of various multi-resolution geometric representation methods, such as B-spline, Daubechies, Coiﬂet and Dual-tree wavelets, curvelets and surfacelets, to capture the structure of fully developed turbulence using a truncated set of coeﬃcients. The turbulence dataset is obtained from a Direct Numerical Simulation of buoyancy driven turbulence on a 5123 mesh size, with an Atwood number, A = 0.05,and turbulent Reynolds number, Ret = 1800,and the methods are tested against quantities pertaining to both velocities and active scalar (density) ﬁelds and their derivatives, spectra, and the properties of constant density surfaces. The comparisons between the algorithms are given in terms of performance, accuracy, and compression properties. The results should provide useful information for multi-resolution analysis of turbulence, coherent feature extraction, compression for large datasets handling, as well as simulations algorithms based on multi-resolution methods. The ﬁnal section provides recommendations for best decomposition algorithms based on several metrics related to computational eﬃciency and preservation of turbulence properties using a reduced set of coeﬃcients. © 2015 Elsevier Ltd. All rights reserved.

1. Introduction Most datasets encountered in physical applications, similar to most natural images, present lower dimensional structures whose detection, extraction, and characterization are active areas of research. The search for more eﬃcient algorithms to detect and manipulate such structures has led to the development of a multitude of multi-resolution geometric representations, such as curvelet and surfacelet transforms. The curvelet [1] and surfacelet [2] transforms perform spatial partitioning in Fourier space at multiple resolutions by creating bands using discrete frequency tiling that store localized directional coeﬃcients. One area which has seen signiﬁcant interest in the application of such methods is ﬂuid turbulence. While turbulence is a strongly multi-scale phenomenon with a large range of dynamically relevant spatio-temporal scales, coherent structures are almost always present, due to initial or boundary conditions, injection mechanisms, or arising from internal dynamics. The characterization of these structures, which interact nonlinearly as they are advected by the background ﬂow and signiﬁcantly alter the local topology, is one of

∗ Corresponding author at: Institute for Data Analysis and Visualization, Department of Computer Science, University of California, Davis, CA 95616, USA. Tel.: +12092612674. E-mail addresses: [email protected] (J. Pulido), [email protected] (D. Livescu), [email protected] (J. Woodring), [email protected] (J. Ahrens), [email protected] (B. Hamann).

http://dx.doi.org/10.1016/j.compﬂuid.2015.11.001 0045-7930/© 2015 Elsevier Ltd. All rights reserved.

the fundamental open questions in the study of turbulence. One of the earliest applications of compression algorithms to turbulence is done in Ref. [3]. The focus was on comparing the coherent vortex simulation (CVS) decomposition based on a orthogonal wavelet basis with the Proper Orthogonal Decomposition or Fourier ﬁltering, as applied to a forced homogeneous isotropic turbulence Direct Numerical Simulations (DNS) dataset. It is shown that CVS ﬁltering, which is local in both physical and spectral spaces, can separate the coherent vortex tubes from the incoherent background ﬂow. The latter is structureless, has an equipartition energy spectrum, a Gaussian velocity probability distribution function (PDF), and an exponential vorticity PDF. On the other hand, the Fourier basis does not extract the coherent vortex tubes cleanly and leaves organized structures in the residual high wavenumber modes whose PDFs are stretched exponentials for both the velocity and vorticity. More recently, curvelets have been brieﬂy evaluated by Ma et al. [4] in comparison to the classical wavelet transform. In their work, multi-scale geometric analysis is systematically applied to turbulent ﬂows in two and three dimensions using curvelets. The analysis is based on the constrained minimization of a total variation functional representing the difference between the data and its representation in the curvelet space. Constrained multi-scale minimization results in a minimum loss of the geometric ﬂow features and the extraction of the coherent structures with their edges and geometry properly preserved, which is signiﬁcant for turbulence modeling. The results of this work show that curvelets are very effective in edge and geometry preservation in turbulence data when compared to traditional

40

J. Pulido et al. / Computers and Fluids 125 (2016) 39–58

wavelets under a speciﬁc series of tests and wavelet coeﬃcient numbers. One goal of the present paper is to expand those tests to the full range of coeﬃcients for reconstruction of turbulence data as well as include tests for quantities dependent on an active scalar ﬁeld. These methods have been primarily applied to the quantities directly related to the velocity ﬁelds although, recently, the curvelet transform has been used to examine the multi-scale structure of scalar ﬁelds, such as mass concentration [5]. These ﬁelds are, in general, rougher than the advecting velocity and can present unmixed patches in many practical applications (e.g. non-premixed combustion). Thus, representation methods which are designed to capture surfaces or edges, such as surfacelets or curvelets, appear naturally more suited to capture these ﬁelds. For example, Ref. [5] introduces a curvelet-based multi-scale methodology for the study of the nonlocal geometry of eddy structures in turbulence data. The dataset is from a 5123 DNS of passive scalar mixing in isotropic turbulence and the curvelet transform is used to extract, characterize and classify structures pertaining to the passive scalar. The classiﬁcation is based on differential-geometry properties, such as shape index, curvedness, and stretching parameter, which deﬁne the geometrical signature of the surfaces of constant scalar value. These properties are discussed with respect to their relation to the dynamical behavior of passive scalar stirring and mixing. Another goal of the present paper is to compare the curvelet transform with other representation methods for their ability to preserve these properties with a reduced set of coeﬃcients. Accurate simulations of turbulent ﬂows require solving all the dynamically relevant scales of motions. This technique, called DNS (see above), has been successfully applied to a variety of simple ﬂows, however most practical ﬂows would require mesh sizes outside the range of the most powerful supercomputers for the foreseeable future. The resolution requirements can be improved, especially in problems with localized features, by employing an adaptive mesh strategy. However, such approaches often introduce directional bias and use lower order discretization methods, which decreases the accuracy. Adaptive mesh strategies based on wavelet decompositions have been proposed with explicit error control and higher order discretization schemes [6–8]. While these methods extend the range of DNS applicability, accurately solving all the ﬂow scales still imposes severe limitations on the ﬂows which can be simulated. One of the avenues being explored for simulating, with feasible meshes, ﬂows with large range of scales, as encountered in most practical applications, explores coherent/incoherent decompositions allowed by multi-resolution geometric representations [3,4,9,10]. This approach relies on the ability of such methods to represent the coherent part of the ﬂow with a signiﬁcantly reduced set of coeﬃcients (e.g. 1−5%of the coeﬃcients to represent the whole ﬂow) and model the incoherent part using simplifying models (e.g. assume Gaussian statistics). For such applications, the accuracy and computational eﬃciency of the algorithms are both important. So far, only the curvelet, Dual-tree, and orthogonal wavelet transforms have been used in this context and no comprehensive comparison between these transforms has been made. Classical large eddy simulation (LES) approaches for computing turbulence represent the ﬂow on a coarse mesh in either real or spectral spaces and model the sub-grid contributions. While ﬁnding an optimal function set basis to represent turbulence remains an outstanding open question, the representation methods discussed in this paper may offer a better framework for modeling the sub-grid terms in an LES type approach than spectral or physical space based ﬁlters, due to their localization in both spectral and real spaces. Here, we rely on this locality to denote the coherent/incoherent decomposition as applied directly to the primary variables, as opposed to CVS-type decompositions. This is along the lines of the SCALES approach [9] and offers easy generalizations to complex ﬂows and direct connection to LES-type approaches.

In addition, there is a signiﬁcant cost associated with the storage of the data generated by turbulence simulations. Eﬃcient lossy algorithms can take advantage of the coherent/incoherent decompositions of the ﬂow ﬁeld and signiﬁcantly reduce the archival requirements. Data retrieval can be optimized by extracting only the coherent structures in the data for faster data visualization and analysis at multiple levels of resolution. By reducing the retrieval and transmission cost, projects such as the Johns Hopkins Turbulence Database (JHTDB) can be improved by reducing the amount of data processed and transmitted to a client [11]. By only sending structures at a resolution relevant for analysis, the reduced cost can allow for real-time remote data visualization and analysis of large datasets. The focus of this paper is the comparison of new and existing methods used in analysis (feature identiﬁcation, extraction, and analysis) and simulations (based on coherent/incoherent decompositions) of turbulence. These methods include second-generation wavelets such as Haar, biorthogonal B-spline, Daubechies, Coiﬂets, Dual-tree, and newer methods such as curvelets and surfacelets. In order to make the comparisons as comprehensive as possible, a ﬂow has been selected in which the turbulence is accompanied by mixing between initially segregated different density materials (see description below) which are subjected to a constant acceleration. The large scales of the ﬂow are anisotropic and the interfaces between the two materials become highly corrugated. The methods considered are compared in their ability to capture the structures of both velocity and density ﬁelds. It is hoped that this analysis will help both the simulations of turbulent ﬂows using multi-resolution geometrical representations as well as further the study of turbulence physics using such methods.

1.1. Direct numerical simulation dataset The dataset used in this paper is from a DNS of homogeneous buoyancy driven turbulence on a 5123 periodic grid. The simulation used the variable-density version of the petascale CFDNS code [12] to solve the incompressible Navier–Stokes equations for two miscible ﬂuids with different densities, in a triply periodic domain. These equations are obtained from the fully compressible Navier–Stokes equations with two species with different molar masses in the limit c → ∞ (c is the speed of sound) such that the individual densities of the two ﬂuids remain constant [13–15]. The two ﬂuids are initialized as random blobs, consistent with the homogeneity assumption. The ﬂow starts from rest, with only a small amount of dilatational velocity necessary to satisfy the divergence condition and turbulence is generated as the two ﬂuids start moving in opposite directions due to differential buoyancy forces. However, as the ﬂuids become molecularly mixed, the buoyancy forces decrease and at some point the turbulence starts decaying. For comparison between the different compression algorithms, density and velocity ﬁelds are used at the time when the turbulent kinetic energy is maximum. At this time, the turbulent Reynolds number is Ret = 1800and the turbulence is fully developed. The rest of the nondimensional parameters characterizing the ﬂow are Atwood number, A = 0.05,Schmidt number, Sc = 1and Froude number, F r = 1. A similar dataset [16], on a 10243 mesh, covering the whole range of turbulence evolution, from buoyancy driven increase in turbulent kinetic energy to buoyancy mediated turbulence decay, has been recently added to the Johns Hopkins Turbulence databases [11]. The rest of the paper is organized as follows. In Section 2, a background is given of the geometric representation methods considered in this paper. Section 3 summarizes the software used, thresholding techniques, and properties for all of the different methods. The testing methodologies are described and the results are quantiﬁed in Section 4. Finally, Section 5 presents the conclusions with the recommendations of the best schemes suited for the representation and

J. Pulido et al. / Computers and Fluids 125 (2016) 39–58

preservation of a variety of quantities relevant to fully developed turbulence in the presence of an active scalar ﬁeld. 2. Background and methods Wavelets are the generalization of the Fourier transform by using bases that represent both location and spatial frequency [17]. A more fundamental background on wavelets can be found in Appendix A. This section gives a brief overview of each of the representation methods used for comparisons. All methods considered in this paper are the discrete versions of their respective continuous signals. Wavelets used in this paper are second-generation wavelets in implementation [18]. 2.1. Orthogonal wavelets and B-spline wavelets The orthogonal Haar wavelet is the simplest transform and is best known for its top hat, piece wise signal and simplistic representation. These simple wavelets can decompose a discrete signal into two half signals represented by a 0 or 1 per each step as seen in Fig. A.21a in the Appendix. Due to their simplicity, they are the fastest to compute. Farge [19] performed an initial analysis on the use of wavelets, including Haar wavelets, to characterize coherent and incoherent turbulent ﬂow parts. The orthogonal family of Daubechies wavelets are similar in construction to Haar wavelets but requiring vanishing higher order moments in the mother signal. Their construction and signals are described in [17] and shown for completeness in Fig. A.21 in the Appendix. As the Daubechies family increases in order, more distortions are produced in the original mother signal. For our analysis, the lower order 3rd , 5th, and 7th families of Daubechies wavelets are considered. As one goes to even higher order families of wavelets, the wider signals may cause overlapping in the analysis and introduce a loss in quality as it is shown later. Goldstein et al. [9,10] used the 5th family of Daubechies wavelets for adaptive eddy simulations. The orthogonal Coiﬂet family of wavelets were ﬁrst proposed by Beylkin et al. [20], then implemented by Daubechies [21] and Tian et al. [22]. As with Daubechies wavelets, Coiﬂets require vanishing moments in the original reconstruction signal. Unlike Daubechies wavelets, Coiﬂets focus on vanishing moments in their scaling function in order to further improve the convergence of the original signal. For this analysis, the ﬁrst three orders are used: Coiﬂet-6, Coiﬂet-12 and Coiﬂet-16. Their respective signals can be seen in Fig. A.21 in the Appendix. Deriaz et al. [23] and Roussel et al. [24] have used Coiﬂet12 for coherent vortex extraction in homogeneous turbulence. Farge et al. [3] used Coiﬂets for CVS decomposition to separate coherent vortex tubes from the incoherent background ﬂow in turbulence. Biorthogonal B-spline wavelets are the natural extension to the Haar wavelet; similar in usage and implementation but differ in basis functions [25]. B-spline wavelets contain basis functions representing sinusoidal signals of varying magnitudes. The construction of both the decomposition and synthesis ﬁlters depend on two control parameters that select the family order and the number of discrete interpolation points to interpret a continuous wave. At the lowest order, constant B-spline wavelets share the same basis function as the Haar wavelets and will therefore refer to this family as Haar wavelets for the remainder of this paper. The ﬁrst six families of B-spline wavelets are considered for this paper: constant (Haar, 1st degree), linear (2nd degree), quadratic (3rd degree), cubic (4th degree), quartic (5th degree), and quintic (6th degree). Family characteristics are further described in Appendix A.1. Biorthogonal wavelets in their lowest order have had some limited use in scientiﬁc computing. The lack of orthogonality in their higher order basis functions and associated computation cost has hindered their wider adoption so far. Adding to the slightly higher compute cost, B-spline wavelets are also not as easily parallelized compared to

41

orthogonal wavelets. Parallelization is possible by using the secondgeneration construction of wavelets based on a lifting scheme [26]. Roussel et al. [24] used biorthogonal (constant) Harten wavelets for the extraction of coherent vortices and found that the biorthogonal nature of the wavelets may cause problems in modeling background noise after extraction. Their results showed an improvement by using orthogonal Coiﬂet-12 over Harten wavelets. These results are expected due to Harten signal being fundamentally similar to the Haar signal as part of the constant family of B-spline wavelets, as seen in Fig. A.22b in the Appendix. Thus, an increase in control points does not generally produce improvements in reconstruction in contrast to increasing the family order. Nevertheless, while one expects a much better performance for higher-order B-spline wavelets, these methods have yet to be tested extensively for turbulent ﬂows. For simple applications, orthogonal wavelets have overshadowed B-spline wavelets; however, higher order transforms such as Bsplines may perform better in applications requiring the preservation of structures. For example, Bertram et al. [27,36] have shown that B-splines do very well in representing smooth surfaces and curved structures with a minimal set of coeﬃcients. For the purpose of this paper, the lack of orthogonality is less important than the better preservation of various turbulence quantities. 2.2. Dual-tree wavelet transform The Dual-tree complex wavelet transform (DTWT) is an enhancement of the discrete wavelet transform [28]. The DTWT implements shift invariance and directionality in multiple dimensions. The DTWT performs two discrete wavelet transform operations to construct a tree-like structure. Two paths are branched off, each representing the real and imaginary components of the transform. The real component is utilized to represent the majority of the energy in a dataset where the imaginary component captures small-scale details. The DTWT addresses issues present in real wavelets, those being oscillations, shift variance, aliasing, and lack of directionality. The redundancy of having two branches provides additional information for analysis, approximate shift-invariance, and a perfect signal reconstruction unlike the discrete wavelet transform. The DTWT contains ﬁlters designed to represent many properties, such as approximate half-sample delays, orthogonal or bi-orthogonal signals, ﬁnite support, vanishing moments/good stop-band, and linear-phase ﬁlters [29]. The ﬁlters considered in this paper include the Farras ﬁlter for the initial scale, and Kingsman’s Q-shift ﬁlter for the subsequent scales. The DTWT has been extensively used by Nguyen et al. [30,31] for signal ﬁltering on Galerkin methods. 2.3. Curvelets The curvelet transform is a ﬁlter-based wave-signal decomposition algorithm that is designed to represent signals by their edges [1]. Curvelets are a non-adaptive method for multi-scale signal representation. Curvelets are derived from ridgelets, which are linear features that represent different lengths with respect to scale in different positions and as many orientations as can be deﬁned for a dataset. Fig. 1 demonstrates the frequency tiling for the 2D and 3D discrete transform, the latter being used for this paper. The frequency tiling is deﬁned in Fourier space, where certain ranges of Fourier coeﬃcients are deﬁned as the directional components of the curvelet transform. The curvelet transform extends the bi-directionality of wavelets to multiple directions where basis functions are localized to each direction. Within each direction, the degree of localization varies per scale allowing multi-resolution extraction of signals. Due to the large number of orientations, positions and scales available, the computation of this transform is expensive compared to any of the simpler wavelets. The result of a decomposition

42

J. Pulido et al. / Computers and Fluids 125 (2016) 39–58

a

b

Fig. 1. Discrete Curvelet. The continuous curvelet transform is discretized in Fourier space for each dimension ω. Directional wedges are darkened in both ﬁgures where (a) 2-D frequency tiling is performed along with (b) 3-D frequency tiling. Figure is reproduced from [1].

produces a large number of complex coeﬃcients where their amplitudes refer to the underlying structure of the original dataset. So far, curvelets have not been proven to have an orthogonal basis function. The discrete implementation aims to decompose a dataset based on the described band scheme and sub-components. For 3D datasets, this scheme introduces modest levels of data redundancy in order to properly encode directional/angular data. One of the major incentives to consider curvelets is their ability to express orientation versus its predecessors. Although curvelets have proven to be superior in the ﬁeld of 2D image processing, similar areas have not been fully explored in 3D turbulence. Directional structures embedded in the velocity gradient tensor or related to constant active scalar surfaces present in 3D turbulent ﬂows may lend themselves in structure and orientation to the curvelet transform. Opposite to wavelets, curvelets take a bottom-up approach in extracting coeﬃcients. The largest, coarsest coeﬃcients are ﬁrst extracted in the ﬁrst bands of the decomposition and the smallest, ﬁnest coeﬃcients are in the highest bands. Curvelets have been used for geometric analysis of turbulent data by Ma et al. [4] and Bermejo-Moreno and Pullin [5,34]. 2.4. Surfacelets The surfacelet transform uses a combination of directional ﬁlter banks and contourlets in order to represent signal singularities that lie on smooth surfaces. In order to analyze multi-dimensional signals for multiple scales and directions, the directional ﬁlter bank (DFB) was proposed by Bamberger and Smith for 2-D signals [32]. The DFB is a tree-structured decomposition that creates 2k sub-bands with

directional partitioning as in Fig. 2(a). The value k is deﬁned with respect to the number of directions to be extracted. Do and Vetterli later constructed contourlets by combining the DFB with a Laplacian pyramid structure [33]. Previously restricted to 2D, these directional ﬁlter banks were extended into higher dimensions creating the N-dimensional directional ﬁlter bank (NDFB) by Lu and Do [2]. The surfacelet transform combines the extension of contourlets to 3-D space and the NDFB in order to represent surface-like singularities in multidimensional data. The NDFB uses a frequency partitioning scheme that resembles rectangular-based pyramids radiating out from the origin in different directions and multiple tiles as seen in Fig. 2(b). Similar to curvelets, the surfacelet partitioning is deﬁned in Fourier space. Each direction is represented by a number as seen in Fig. 2(a) within Fourier space. The inclusion of higher dimension ﬁlter banks allows surfacelets to have the following distinct properties: directional decomposition and construction, angular resolution, perfect reconstruction and small redundancy [2]. One of the features of surfacelets is the frequency partitioned directional ﬁlter bank with multiple levels in a tree-like structure. As opposed to curvelets, a single directional signal is spread out across its domain requiring the extraction of a lesser number of coeﬃcients. The introduction of this ﬁlter bank allows aliasing to occur for each band in the frequency domain as opposed to being restricted to aliasfree bands in curvelets. Due to the alias-free restriction, curvelets may require much more bands and coeﬃcients to capture certain frequencies. By utilizing the tree-structured NDFB, aliasing can be removed in surfacelets by combining multiple levels and overall producing a

Fig. 2. Discrete Surfacelet. Surfacelet frequency tiling of the N-dimensional ﬁlter bank (NDFB) in Fourier space bounded by π on each dimension ω. (a) Discrete 2D tiling for each numbered angular direction. (b) Discrete 3D tiling highlighting a single angular direction. Figure is reproduced from [2].

J. Pulido et al. / Computers and Fluids 125 (2016) 39–58

less redundant extraction of coeﬃcients compared to curvelets. The surfacelet transform has not been previously tested in the context of turbulence. The ability of identifying and preserving rough surfaces could make this transform well suited for representing material ﬁelds in turbulent ﬂow datasets. 2.4.1. Orthogonality Orthogonality is an attractive feature in mathematical transforms since it infers energy preservation. The wavelet families that are fully orthogonal in this analysis are the Haar, Daubechies and Coiﬂet families. The B-spline wavelets lose orthogonality due to their signal and become biorthogonal. Dual-tree wavelets contain two biorthogonal ﬁlter sets with slightly different frequency responses. When combined, the two biorthogonal ﬁlters can become near orthogonal through the use of long ﬁlters leading to a higher computation penalty. Curvelets contain an orthogonal wavelet component during their directional, ﬁnest-scale decomposition but overall are not orthogonal. Candes et al. [1] claim that curvelets behave similar to an orthogonal decomposition but are unable to prove it. Surfacelets, like curvelets, are only orthogonal at their lowest directional component but not in their entirety. The issue of needing orthogonal wavelets is not a priority in our analysis since the preservation of turbulence structures is prioritized over full preservation of energy. In Ref. [24], it is stated that a drawback of using non-orthogonal wavelet methods is a loss in energy. It is shown below that non-orthogonal methods still preserve a large portion of energy and do better in representing smooth structures compared to orthogonal methods that lack directionality support. 2.4.2. Complexity and redundancy Complexities are unknown for higher-order methods such as curvelets and surfacelets but they are much higher compared to the other wavelets. Since they are based on discretizations of continuous signals, Haar, B-spline, Daubechies and Coiﬂet wavelets are of linear complexity O(N) where N is the total number of points in the data. The eﬃciency of any method can be asserted by observing its redundancy. Redundancy represents a combination of computational effort and space consumption. Redundancy can also be seen in the number of coeﬃcients extracted and comparable to the ratio between the total number of elements in a dataset and the number of coeﬃcients extracted. The number of coeﬃcients extracted and ratios for a sample 5123 dataset are shown in Table 1. While Haar, B-spline, Daubechies and Coiﬂet wavelets have a redundancy near 1, surfacelets and Dual-tree wavelets have a redundancy of about 3 to 4 in the 3-D scale. Curvelets also have a

Table 1 Coeﬃcients. Number of extracted coeﬃcients and ratio compared to the total grid points of a 5123 dataset. Method

# of coeﬃcients extracted

Ratio

Source grid Haar Linear B-spline Quadratic B-spline Cubic B-spline Quartic B-spline Quintic B-spline Daubechies-3 Daubechies-5 Daubechies-7 Coiﬂet-6 Coiﬂet-12 Coiﬂet-18 Dual-Tree Surfacelet Curvelet

134,217,728 134,217,728 138,553,029 136,538,751 142,978,473 145,512,802 152,517,105 138,553,029 142,978,473 147,717,100 138,553,029 145,512,802 152,517,105 536,870,720 459,538,432 553,095,117

1.00 1.00 1.03 1.02 1.07 1.08 1.14 1.03 1.07 1.10 1.03 1.08 1.14 4.00 3.42 4.12

43

redundancy between 4 to 5 if they use wavelets for their ﬁnest scale, or 40 if they use curvelets again. Curvelets and surfacelets are more redundant due to their native multi-resolution storage components and Dual-tree wavelets due to the capture of directional features. Although this is the case, the curvelet, surfacelet, and Dual-tree wavelet implementations with the lowest redundancy are used for the course of this paper. To formulate a fair comparison, a number of four directional bands for curvelets is used and matched by surfacelets. For further tests which measure the amount of the energy acquired per each band, the number of extracted bands is increased to better measure the energy distribution per band for both methods. Our redundancy evaluations are similar to those presented in Ref. [2]. 2.4.3. Coeﬃcients and their storage components Haar, B-spline, Daubechies and Coiﬂet wavelets have no speciﬁc multi-band coeﬃcient extraction scheme and therefore typically decompose N3 coeﬃcients, where N is the grid size in each direction. The implementations used in this paper support wavelet signal extrapolation on boundary data through symmetrization. This results in numbers of coeﬃcients slightly larger than a ratio of 1.0 compared to the original dataset but also in much more accurate representations of the boundaries. Curvelets and surfacelets require multiband storage schemes and, therefore, they extract many more coefﬁcients, as suggested by their redundancy amounts. Similarly, Dualtree wavelets are also very redundant since they also extract four sets of directional coeﬃcients per scale. Curvelet coeﬃcients are stored with respect to their extraction based on their frequency partitioning per scale and direction. Surfacelet coeﬃcients are similarly stored but differ in partitioning schemes in scale and direction compared to Curvelets. Table 1 shows the number of extracted coeﬃcients in the system for our sample dataset. 3. Implementation and thresholding techniques For this paper, the discrete variants of all of the methods are utilized. The original software provided by the respective authors of each method is used, except for B-spline, Daubechies and Coiﬂet wavelets. Although some methods such as curvelets provide parallel and out-of-core methods to increase the computation speed which may be beneﬁcial in high performance computing environments, the single threaded implementations of all methods is used for fair compute benchmarks. Haar Wavelets, B-spline, Daubechies and Coiﬂet wavelets are tested through the GNU Scientiﬁc Library (GSL). The GSL was modiﬁed to add support for 3D wavelet decomposition, as well as expand the variety of wavelets supported. All methods use their C++ variants and are compiled in Windows 7 Professional 64-bit using Microsoft Visual Studio 2008. The term thresholding is used to summarize the act of selecting distinct coeﬃcients, after a decomposition operation, using a certain criterium. In general, the magnitude of these coeﬃcients for all transforms represents the energy for that speciﬁc component. Typical thresholding is done by considering a set of coeﬃcients where all values are sorted by magnitude. Once sorted, a subset of coeﬃcients may be chosen by percentage or by a speciﬁc number. Hard thresholding is deﬁned by the selection of only speciﬁc coeﬃcients and removing the rest. This approach is utilized for the extent of this paper when comparing the different representation methods. Hard thresholding is used either a priori or, after domain-speciﬁc knowledge of the data is gained, the coeﬃcients are thresholded for speciﬁc magnitudes above a certain value. For this paper, the decomposition of the original turbulent ﬁelds into coherent/incoherent parts is targeted in addition to the ability of the multi-resolution methods to reproduce turbulence structures with a minimal set of coeﬃcients. The ansatz that the coherent, nonuniversal part of the ﬂow can be captured with a reduced set of coeﬃcients and, thus, does not require the representation and preservation

44

J. Pulido et al. / Computers and Fluids 125 (2016) 39–58

a

b

Fig. 3. Total kinetic energy and density variance. The ratios of total kinetic energy (a) and density variance from the original data compared to the compressed ﬁelds by percentage of coeﬃcients. Large gains in accuracy occur up to 1% coeﬃcients, when all methods capture more than 98% of the kinetic energy and 96% of the density variance.

of the highest frequency, smallest coeﬃcients is tested. In representations and analysis of developed turbulence, the hope is that these coeﬃcients correspond to some universal properties (often understood and describable as white noise). Specialized optimal values for thresholding are only available for orthogonal methods and Gaussian noise [3,24]. For example, the maximum quadratic error, E || fˆ − f ||22,N /N,between the denoised signal, fˆ,and the original signal, f, can be minimized using the minimax

threshold [43]. Here, ||v||22,N denotes the usual lN2 norm and N is the resolution. For large data samples, this threshold approaches its upper bound deﬁned by = (2σ lnN)1/2 ,where σ is the variance of the noise. For turbulence data, however, the noise is likely not identically distributed Gaussian (both spatial correlation and non-Gaussian PDF are likely present) and the noise level is not known a priori. In order to compare the methods considered, the hard thresholding of the coeﬃcients based on the percentage of coeﬃcients in the system is used. The minimax threshold was derived for selected orthogonal methods and approximated to be below 0.5% coeﬃcients. Due to the large differences in the ratio of captured kinetic energy between methods, a larger percentage amount was selected for a fairer comparison. As in Ref. [3], comparisons are made after the selection of 3% coeﬃcients. However, unlike CVS type decompositions, where the vorticity ﬁeld is decomposed, thresholded and then reconstructed, while the coherent and incoherent velocity parts are obtained from the corresponding vorticity parts via the Bio-Savart law, here, the thresholding is performed on the decompositions of the primary variables, velocity and density. The motivation, described in the Introduction, follows from the applications considered in the comparison of the methods: LES-type simulations of more general ﬂows than incompressible single-ﬂuid turbulence and turbulence data compression for storage, analysis and visualization. Fig. 3 compares the amounts of kinetic energy and density variance captured by several of the methods, as a function of the percentage of the coeﬃcients. The ratios to the kinetic energy and density variance in original ﬁelds show that, even at low percentage of coeﬃcients, both quantities can be well captured. Large gains in accuracy occur up to 1% coeﬃcients, when all methods capture more than 98% of the kinetic energy and 96% of the density variance; the further increase in the number of coeﬃcients above 1% results in smaller accuracy gains. Both surfacelets and curvelets are able to more accurately capture the kinetic energy below extreme percentages of 0.4% coeﬃcients but are then overtaken by other methods. The 3% coefﬁcient mark ensures that the kinetic energy captured exceeds 99%

and density variance exceeds 98% for all methods. The remaining 97% coeﬃcients are also computed to estimate the residual coherence or departure from a Guassian, white noise. Hard thresholding at equivalent percentage of coeﬃcients is a good indicator of the potential of the different transforms considered in this paper. For Haar, B-spline, Daubechies and Coiﬂet wavelets, which do not have explicitly deﬁned multi-bands, the coeﬃcients can be ordered and thresholded without ambiguity. However, in general, transforms with multi-band support decompose into a series of coefﬁcients for multiple bands. These bands are further subdivided into a larger series of angular directions. These transforms also extract real and complex coeﬃcients. These features add additional layers of complexity and expand the capabilities of thresholding coeﬃcients. The same number of bands and number of directions are extracted when comparing both surfacelets and curvelets. More advanced thresholding schemes were tested for these multiband methods, including the localization of features by scale and direction. Due to the difference in directional representation of each method, to perform a fair comparison, coeﬃcients are thresholded by their largest magnitudes as a whole per direction and bands. Although thresholding coeﬃcients per individual band or per angle may be more intuitive in ﬁnding speciﬁc features, reconstruction behavior is more consistently obtained when considering all coeﬃcients as a whole. Each method has various parameters that change their functionality such as number of bands to decompose and variables related to the behavior of their ﬁlters. More modern, higherorder methods such as curvelets and surfacelets contain band systems with multiple levels of redundancy in order to capture angularity based on their ﬁlter type. Due to each method having different ﬁlters, basis functions and lack of complete orthogonality, the relation between coeﬃcients from different methods with respect to a given frequency is not trivial. Such a relation can be estimated by performing a power spectrum analysis in Fourier space, which is also covered in the results section. 4. Results and discussion 4.1. Test types and deﬁnitions A series of tests are performed that compare the compressed datasets to the originals for quantities related to the velocity and density ﬁelds. For the velocity ﬁeld, the comparison metrics used are the |2 ), as well PDF of the velocity ﬁeld itself and of enstrophy ( ≡ |ω

J. Pulido et al. / Computers and Fluids 125 (2016) 39–58

as enstrophy peak signal-to-noise ratio (PSNR) and mean square error (MSE) and the PDF of the unbiased measure of the state of the deviatoric strain rate tensor, s∗ [39]. For the density ﬁeld, the comparison metrics used are the density PSNR and MSE, density power spectrum, coherent/incoherent representations of the density ﬁeld itself and iso-density surface corresponding to the pure light ﬂuid. The pure light ﬂuid iso-density surface is also characterized using several differential geometry tools: surface area, mean and Gaussian curvatures as well as the surface signature. Previous comparisons related to the coherent/incoherent decompositions of turbulent ﬁelds have been restricted to orthogonal and biorthogonal (Harten) wavelets as applied to velocity and enstrophy PDFs, kinetic energy spectrum and the PDF of helicity and Lamb vector (e.g. [24]). While CVS type decompositions have been mainly used to investigate the coherent/incoherent parts of the ﬂow, here the decomposition is performed at the level of the primary variables, velocity and density. This approach is not entirely new, for example it is the base of the SCALES method [9] and has also been used for the analysis of a passive scalar [5]. Here, the results presented address a more complex ﬂow and encompass more representation methods, with more quantities related to the velocity ﬁeld (e.g. the state of the deviatoric strain rate tensor) and, for the ﬁrst time, quantities related to an active scalar ﬁeld and its constant property surfaces. For all tests, the strongest 3% coeﬃcients are referred to as deﬁning the coherent part and the weakest 97% coeﬃcients as deﬁning the incoherent (residual) part of the respective turbulent quantity. A subset of methods are compared looking at the progression of reconstruction for the number and percentage of coeﬃcients. It is noted that the relationship between percentage and number of coeﬃcients is not the same for all methods as some methods such as curvelets and surfacelets may extract many more coeﬃcients compared to the B-spline wavelets. The subset of methods include Haar, cubic B-spline, Daubechies-5, Coiﬂet-12, Dual-tree wavelets, surfacelets, and curvelets. For clarity, Dual-tree wavelets are only added in a subset of the ﬁgures, especially where they excel beyond the other methods; nevertheless, they are added in all the tables. These methods were selected as representative for each class of methods considered. Finally, the last set of tests evaluate the performance and memory requirements of each of the methods. The performance is evaluated based on the runtime of each transform during its decomposition phase for a single scalar ﬁeld. Only the single-threaded variant of each of the methods is considered. The hardware used to perform these tests contains an Intel Core i7 930 processor running at 3.80GHz and 48GB of DDR3 RAM. 4.1.1. Deﬁnitions and discussion on the quantities used for comparisons The deﬁnitions and short explanations for the various test metrics and turbulence quantities used in this paper are given below. PSNR in Eq. (1) is the primary measurement to evaluate pointwise accuracy. The signal-to-noise-ratio (SNR) was used in Ref. [38] in a similar analysis to compare the effects of compression on data. PSNR is used in this paper rather than SNR since it better reproduces the behavior of the representation methods over all percentages of coeﬃcients thresholded. MSE is superior in showing the behavior at lower percentages with the eventual drop-off to zero due to numerical precision.

PSNR = 10 · log10

MAX 2 MSE

,

(1)

where MAX is the maximum possible value in the range of the data and MSE is the mean square error. ω ≡ ∇ × u ) PDFs have been been used Velocity ( u) and vorticity ( to ascertain the ability of orthogonal and B-spline wavelets to decompose turbulence ﬁelds into coherent and incoherent parts in Ref. [24]. This was accomplished by comparing the PDFs corresponding to

45

the ﬁelds reconstructed using the largest 3% of the coeﬃcients with the PDFs of the original signal and the PDFs corresponding to the ﬁelds reconstructed using the remaining 97% of the coeﬃcients with ) a Gaussian. While these quantities, together with the helicity ( u·ω ), are useful in assessing the representation and Lamb vector ( u×ω of turbulence with a limited number of coeﬃcients, they do not fully address the local structure of turbulence and, thus, the incoherence of the turbulence data. Therefore, here, we also consider the quantity s∗ , which uniquely deﬁnes the state of the deviatoric strain rate tensor for incompressible ﬂows [39]:

s∗ =

√ −3 6αβγ

(α 2 + β 2 + γ 2 )3/2

,

(2)

where α , β and γ are the eigenvalues of the deviatoric strain rate + (∇ u )T ),or by components, Si j = 1/2(∂ ui /∂ x j + tensor, S ≡ 1/2(∇ u ∂ u j /∂ xi ). For the ﬂow considered here, the divergence of the velocity is not zero, but it is small at the time when the ﬂow is being analyzed. In addition, the enstrophy PSNR and MSE are also shown. The pure light, pure heavy and mixed ﬂuid regions are deﬁned as all the points with density values satisfying the relations [14]:

ρ ≤ ρ1 + 0.05(ρ2 − ρ1 ),

(3)

ρ ≥ ρ2 − 0.05(ρ2 − ρ1 ),

(4)

0.45(ρ2 − ρ1 ) ≤ ρ ≤ 0.55(ρ2 − ρ1 ),

(5)

where ρ 1 and ρ 2 are densities of the two pure ﬂuids. For comparing the different representation methods, in this paper we focus on the surface bounding the pure light ﬂuid regions, which is deﬁned by the formula ρ(x, y, z) = ρl ≡ ρ1 + 0.05(ρ2 − ρ1 ). This surface is visualized using the coherent and incoherent parts of the density ﬁeld. In addition, the ability of the representation methods to capture the surface area, mean and Gaussian curvatures and surface signature with a reduced set of coeﬃcients is also discussed. The area of the surface deﬁned by ρ(x, y, z) = ρl can be calculated using Theorem 1.2.4. from Ref. [40], by replacing the Borel measurable nonnegative function with the delta function, δ(ρ(x, y, z) − ρl ),to obtain:

SAρl =

|∇ρ|δ(ρ − ρl ) dv.

(6)

If one further assumes homogeneity of the ﬂow, so that averages can be calculated as volume averages, then SAρl =

|∇ρ||ρ =ρl f (ρl )V,which is the product of the conditional average of the density gradient magnitude and density PDF at f (ρ(x, y, z)) = f (ρl ),and the total volume of the ﬂow. For a uniform discrete representation of the data, as obtained from DNS, the surface area formula becomes

SAρl =

||∇ρ|| (2π )3 , n(ρmax − ρmin )

(7)

where n is the ratio between the total number of points, N, and the number of bins, nb , used to calculate the density PDF. The discretization errors becomes small for large values of n, nb . Following Ref. [5], the surface signature is deﬁned as the joint PDF of the absolute value of the shape index, S, and curvedness, C:

−2

S = C=

π

tan−1

2H 2 − K,

H √ H2 − K

,

(8) (9)

where H and K are the mean and Gaussian curvatures. Since the same surface is considered for all the representation methods and this surface may be multiply connected, the curvedness is not normalized using the surface area and volume as done in Ref. [5].

46

J. Pulido et al. / Computers and Fluids 125 (2016) 39–58

for each of the classes considered. All methods yield symmetrical residual velocity PDFs; however, Haar and curvelet transforms lead to exponential shapes, while the others are closer to Gaussian shapes. The Kurtosis values are: 6.13 (Haar), 6.44 (cubic B-spline), 4.62 (surfacelet), 7.10 (curvelet), 5.79 (Daubechies-5), and 6.10 (Coiﬂet-12). In addition, the ﬁeld’s extrema are the smallest for the cubic B-spline, with relatively close values for the other high order wavelet methods. The surfacelet transform has slightly larger extrema, but still in the range of the higher order wavelets, while the curvelets and Haar wavelets have much larger extrema. Thus, while there is no bias towards positive or negative values, the residual still retains large velocity values.

3

10

2

PDF

10

Haar Cubic Surfacelet Curvelet Daub−5 Coiflet−12 Gaussian

1

10

0

10

−1

10 −0.03

0

0.03

Velocity Fig. 4. Residual velocity PDF. The PDF is obtained from the remaining 97% coeﬃcients in the vertical velocity transform. Two Gaussian signals matching the variances of the curvelet and B-spline wavelet results are plotted.

4.2. Quantities related to the velocity ﬁeld In this section, quantities related to velocity ﬁeld are compared for the representation methods discussed, with respect to their ability to separate the ﬂow into coherent and incoherent parts and represent the turbulence structure with a reduced set of coeﬃcients. Some of the quantities considered here (velocity and enstrophy PDFs) have been investigated in previous studies; however, only the orthogonal (Coiﬂet) and bi-orthogonal (Harten) wavelets have been explored. Here, we calculate these quantities as a consistency check with the previous studies and also for comparison with the additional representation methods considered. In addition, several new metrics are examined to better assess the capturing of the local turbulence structure. 4.2.1. Velocity probability density function The velocity PDF itself is Gaussian in most fully developed turbulent ﬂows and this is reﬂected in the PDF of the ﬁelds obtained from the largest 3% of the coeﬃcients obtained for all methods considered here (not shown). Fig. 4 compares the PDFs of the residual vertical velocity PDF for a subset of the methods, which includes one method

a

4.2.2. Vorticity peak signal-to-noise ratio and mean square error Velocity PDF, while useful as a ﬁrst metric to assess the residual part of the velocity ﬁeld, does not contain any spatial information. The next quantity considered for assessing the representation methods is the vorticity. Fig. 5 shows the PSNR for enstrophy (vorticity square) as a function of the percentage of coeﬃcients retained. In this case, the signal is obtained from three lossy scalar components, as each velocity direction is separately represented, then each vorticity component is calculated from the reconstructed velocities. The cubic B-spline wavelets are observed to have the highest overall reconstruction accuracy spanning all percentages, including below 10% coeﬃcients (Fig. 6). Table 2 shows several numerical results for 3% of the coeﬃcients retained (coherent part). All the B-splines methods above linear show consistently high values for PSNR and low values for MSE and maximum error (L∞ norm), with the best results obtained for cubic Bsplines. Overall, all wavelets methods give close results (except the Haar wavelets, which is expected), with slightly worse values obtained for Daubechies and Coiﬂets (in this order). The surfacelets yield results within the range of Coiﬂets (but worse than higher order Coiﬂets) and curvelets show the largest point-wise departure from the original signal (in all three metrics considered in Table 2) for 3% of coeﬃcients retained. Interestingly, for each family there is an optimal order which yields the best point-wise representation, so that further increasing the order of the method starts to deteriorate the results. Thus, the discrete signal in lower families tends to not have enough representation to capture the original ﬁeld while the higher order representations within each family introduce oscillations which reduce the overall quality. The main focus for the remaining tests will be on the subset of representation methods, consisting of Haar, cubic B-splines,

b

Fig. 5. Enstrophy PSNR as a function of the percentage of coeﬃcients (a) and number of coeﬃcients (b) retained for reconstruction. Higher PSNR values represent a more accurate point-wise reconstruction. Cubic B-splines have the highest quality through all percentages and number of coeﬃcients.

J. Pulido et al. / Computers and Fluids 125 (2016) 39–58

Fig. 6. Enstrophy MSE variation with the percentage of coeﬃcients retained for reconstruction. Lower MSE values represent a more accurate point-wise reconstruction. Cubic B-splines have the lowest MSE at very low percentages of coeﬃcients, but are matched by Daubechies-5 and Coiﬂet-12 wavelets at higher percentages.

47

Fig. 7. Residual vorticity PDF. The PDF is obtained from the remaining 97% coeﬃcients in the vertical vorticity transform. Two Gaussian signals matching the variances of the curvelet and B-spline wavelet results are plotted.

Daubechies-5, Coiﬂet-12, Dual-tree wavelets, surfacelets and curvelets. The 3% thresholding for the velocity representation captures 94.4%, 99.4%, 99.6%, 99.6%, 98.1%, 97.7%, and 95.4% (respectively) of the total enstrophy for this subset. Within each class of the methods, the total vorticity captured using this thresholding generally increases as the order of the method increases, but decreases using the highest methods. 4.2.3. Vorticity probability density function Vorticity PDF, computed for the remaining 97% coeﬃcients, is shown in Fig. 7. Similar to velocity, all methods yield symmetrical residual PDFs. Haar and curvelet transforms lead to wide tails shapes, while the others have sharper shapes, but still different than Gaussian. The Kurtosis values are: 7.415 (Haar), 9.27 (Cubic B-spline), 8.72 (Surfacelet), 23.9 (Curvelet), 8.26 (Daubechies-5), and 8.70 (Coiﬂet12). The ﬁeld’s extrema are the smallest for the cubic B-spline, with relatively close values for the other high order wavelet methods. The moderate increase in family order for B-splines achieved the smallest extrema compared to constant B-splines (Harten-3) used in Ref. [24]. The surfacelet transform has slightly larger extrema, but still in the range of the higher order wavelets, while the curvelets and Haar

Table 2 Enstrophy comparison at 3% coeﬃcients. The methods with the most accurate properties are represented in bold. Cubic splines have the best point-wise characteristics for enstrophy representation compared to any other method using these metrics. Method Haar Linear B-spline Quadratic B-spline Cubic B-spline Quartic B-spline Quintic B-spline Daubechies-3 Daubechies-5 Daubechies-7 Coiﬂet-6 Coiﬂet-12 Coiﬂet-18 Dual-Tree Surfacelet Curvelet

PSNR 48.786 60.691 63.692 64.030 63.645 62.099 61.437 63.010 61.784 57.632 61.865 61.504 60.132 59.319 52.929

MSE

Max error −8

1.9921x10 1.3756x10−8 0.6893x10−8 0.6377x10−8 0.6969x10−8 0.9947x10−8 1.1586x10−8 0.8065x10−8 1.0695x10−8 2.7824x10−8 1.0499x10−8 1.1407x10−8 1.5648x10−8 1.8867x10−8 8.2165x10−8

5.1897x10−2 0.9685x10−2 0.8786x10−2 0.7181x10−2 0.7374x10−2 1.1021x10−2 0.7198x10−2 0.8216x10−2 1.2053x10−2 1.2457x10−2 0.9154x10−2 1.0614x10−2 1.3191x10−2 1.1105x10−2 4.8001x10−2

Fig. 8. PDF of s∗ corresponding to the remaining 97% coeﬃcients. Although all methods yield a near-horizontal PDF, indicating a quasi-Gaussian white noise residual, quadratic B-spline wavelets are observed to have residual signal closest to a Gaussian.

wavelets have much larger extrema. Similar to velocity, while there is no bias towards positive or negative values, the residual still retains large vorticity events. 4.2.4. The state of the deviatoric strain rate tensor Previous studies, while examining several wavelet transforms for their ability to represent quantities related to the velocity ﬁeld, have not investigated in detail the local structure of turbulence. The state of the strain rate tensor can identify the tendency of the ﬂow to organize in certain types of structures (e.g. stable-focus/stretching or unstable-node/saddle/saddle) characteristic of fully developed turbulence [41,42] and may be a better indication of the residual coherence than velocity or vorticity PDFs. One quantity which can uniquely deﬁne the state of the deviatoric strain rate tensor, s∗ , Eq. (2), was ﬁrst derived in Ref. [39]. For most fully developed turbulent ﬂows, the PDF of s∗ peaks at 1, while it becomes ﬂat for a Gaussian signal. Therefore, a structureless incoherent part of the ﬂow should exhibit a ﬂat s∗ PDF; while the departure from a ﬂat PDF indicates that the residual is not structureless. Fig. 8 shows the PDF of s∗ for the velocity ﬁeld reconstructed from the remainder 97% of the coeﬃcients and the skewness and kurtosis of the PDFs are tabulated in Table 3. A ﬂat PDF has

48

J. Pulido et al. / Computers and Fluids 125 (2016) 39–58 Table 3 Statistical properties of residual s∗ . The methods with the best properties are represented in bold. Skewness and kurtosis are computed for the remaining 97% coeﬃcients for all methods. A ﬂat PDF (corresponding to a Gaussian noise residual) has a kurtosis of 1.8. All methods have skewness close to zero, while quadratic B-spline wavelets have the kurtosis closest to 1.8. Method

Skewness

Kurtosis

Haar Linear B-spline Quadratic B-spline Cubic B-spline Quartic B-spline Quintic B-spline Daubechies-3 Daubechies-5 Daubechies-7 Coiﬂets-6 Coiﬂets-12 Coiﬂets-18 Dual-Tree Surfacelet Curvelet

−0.018671 0.000594 −0.010178 −0.000261 −0.000031 −0.000402 −0.002019 −0.000803 −0.000893 0.000221 0.000523 −0.000581 0.005532 −0.007940 −0.010240

1.7837 1.7502 1.7970 1.7622 1.7774 1.7675 1.7605 1.7627 1.7773 1.7485 1.7566 1.7663 1.8299 1.9495 1.9269

skewness zero and kurtosis equal to 1.8. We observe that for most methods, the remaining signal is mostly Gaussian noise in the remaining 97% coeﬃcients. The method closest to a ﬂat s∗ PDF is the quadratic B-splines, while surfacelets and curvelets present the largest departures from a ﬂat PDF.

4.3. Quantities related to the density ﬁeld Previous comparisons between representation methods have been focused on quantities related to the velocity ﬁeld only. Since

many ﬂows occur in the presence of passive or active scalars, here we are also comparing the representation methods with respect to their potential to separate the scalar ﬁeld into coherent and incoherent parts and represent the structure of the ﬁeld with a reduced set of coeﬃcients. In this ﬂow, the scalar (density) is an active scalar, since the differential buoyancy force which drives the turbulence is generated by the variations of density. Since density comparisons have not been performed before, visualizations of the density ﬁeld itself and constant density surfaces are also shown, together with the tools considered above, namely PSNR, MSE and PDFs. The ability of the representation methods to reproduce the density power spectrum with a limited number of coeﬃcients is also discussed. For the surfacelet and curvelet transforms, the power spectra are calculated for each band separately, which shows the wavenumbers captured by each band, as well as the overlap between bands. In order to characterize the representation of constant density surfaces, surface area, mean and Gaussian curvatures, as well as the surface signature are also calculated. 4.3.1. Density visualization The subset of the methods described above are visually compared by using the center 2D slice of the dataset (Fig. 9) and a log-difference map comparing this slice to the original (Fig. 10) for the density ﬁeld reconstructed from the 3% largest coeﬃcients. Except the Haar wavelets, which approximate curved variations by a staircase-like variation, the rest of the wavelet methods reconstruct the density ﬁeld reasonably well. The closest results with the original dataset is obtained using cubic B-splines see also Fig. 10), followed by Daubechies-5, Dual-tree, surfacelets and curvelets (in this order). Both curvelet and surfacelet transforms show relatively large differences between the original ﬁeld and the ﬁeld reconstructed for the 3% percent largest coeﬃcients. The residual parts based on the remaining 97% coeﬃcients are also compared and visualized in Fig. 11. Similar to Fig. 10, Cubic B-spline

Fig. 9. Density visualizations. 2D center slices for a subset of the methods are compared at 3% coeﬃcients. From left to right: original, Haar, cubic B-spline, Daubechies-5, Dual-tree wavelets, curvelets, and surfacelets. The white iso-line represents the boundary of the pure-light ﬂuid (blue). Haar wavelets perform noticeably worse and have strong artifacts. The rest of the methods are able to better reconstruct the original ﬁeld. (For interpretation of the references to color in this ﬁgure legend, the reader is referred to the web version of this article)

J. Pulido et al. / Computers and Fluids 125 (2016) 39–58

49

Fig. 10. Density logarithmic difference between the original 2D center slice for methods at 3% coeﬃcients and plotted in a logarithmic scale. From left to right: Haar, cubic B-spline, Daubechies-5, Dual-tree wavelets, curvelets and surfacelets. Haar wavelets have the most visible error, and cubic B-spline wavelets the least.

wavelets perform the best, as they exhibit the least amount of residual density in their remaining coeﬃcients. The Haar wavelet reconstruction contains the most remaining structures. Daubechies-5 and Dual-tree wavelets contain slightly more residual density than cubic B-splines. Relative to cubic B-splines, curvelet and surfacelet coeﬃcients contain larger amounts of remaining structures. Between the two multi-band methods, the curvelets reconstruction leads to the least structureless residual density ﬁeld. 4.3.2. Density peak signal-to-noise ratio and mean square error Fig. 12 shows the PSNR variation as a function of the percentage and number of coeﬃcients, while Fig. 13 shows the MSE variation for the subset of the methods deﬁned above. As the number or percentage of coeﬃcients are increased, the amount of increase in accuracy is similar across all methods tested. Numerical results for all methods at 3% coeﬃcients are presented in Table 4. Similar to vorticity PSNR and MSE, at low percentages of coeﬃcients used in reconstruction, cubic B-spline wavelets are superior by having the highest PSNR and lowest MSE. The higher-order B-spline wavelets (quadratic, cubic, and quartic), in general, out-perform the rest of the methods tested. When comparing methods within each individual family at 3% coeﬃcients, there is an observed parabolic behavior in reconstruction quality where the best reconstruction is achieved by using the middle members. These results are consistent to those obtained for vorticity, since the lower families tend to not have enough representation to capture the original density ﬁeld, while the higher families introduce oscillations which reduce the overall quality of the representation. Surfacelets have the beneﬁt of a PSNR that is close to B-splines and a low point-wise maximum error while achieving multi-band functionality. As expected, curvelets perform better than Haar wavelets but not as well as the remaining methods. When increasing the

number of coeﬃcients, curvelets do not surpass the PSNR quality of the alternative methods mainly due to the high levels of redundancy needed to support energy separation in bands and directional coeﬃcients. By decreasing the number of bands, curvelets can be made to extract a smaller number of coeﬃcients causing a left-ward shift of the PSNR plot. Even though this will increase the pointwise accuracy of the curvelet transform for a given percentage of coeﬃcients, it will still be lower than that obtained using cubic Bspline wavelets. Daubechies wavelets generally perform better than Coiﬂets, but both are still below B-spline wavelets. When considering the accuracy for percentages of coeﬃcients above 20%, the highest member of B-spline wavelets family tested, quintic, is the most accurate. When instead comparing total number of coeﬃcients, which is more relevant for compression and storage purposes, cubic B-spline wavelets once again provide the most accurate reconstruction using the least number of coeﬃcients. The relative ranking in reconstruction quality from previous tests is observed again for quadratic, cubic, and quartic B-splines. These methods once again show slight improvement in accuracy when the number of coeﬃcients is increased. Finally, due to the number of bands extracted causing high redundancy levels, the PSNR results for curvelets and surfacelets are generally worse than the rest of the methods. After a certain point, surfacelets manage to overtake the accuracy results of Haar wavelets despite extracting signiﬁcantly more coeﬃcients. When comparing both band-based methods, surfacelets perform better than curvelets. 4.3.3. Residual density probability density function Density PDF, computed for the remaining 97% coeﬃcients, is shown in Fig. 14. Similar to velocity, all methods yield symmetrical residual PDFs. Haar and curvelet transforms lead to widest tails, while

50

J. Pulido et al. / Computers and Fluids 125 (2016) 39–58

Fig. 11. Residual density visualization. The remaining density residing in the weakest 97% coeﬃcients is shown for Haar, Cubic B-spline, Daubechies-5 and Dual-tree wavelets, curvelets and surfacelets (left to right). Cubic B-spline wavelets show the weakest structures in the remaining density. Curvelets and surfacelets extract many more redundant coeﬃcients, and their remaining structures are clearly visible.

a

b

Fig. 12. Density ﬁeld PSNR as a functions of the percentage of coeﬃcients (left) and number of coeﬃcients (right). Higher PSNR values represents a more accurate point-wise reconstruction. The cubic B-splines provide the best accuracy at the lowest percentages and number of coeﬃcients compared to other methods. Although not shown, for larger percentages of coeﬃcients retained (i.e. > 20%), within each class, higher order methods such as quintic B-splines perform slightly better.

the other methods have narrower shapes, but still wider than a Gaussian. The Kurtosis values are: 8.00 (Haar), 6.63 (Cubic B-spline), 4.38 (Surfacelet), 5.53 (Curvelet), 5.79 (Daubechies-5), and 6.20 (Coiﬂet12). The ﬁeld’s extremas are the smallest for the cubic B-spline, with relatively close values for the other high order wavelet methods. The surfacelet transform has slightly larger extremas, but still in the range of the higher order wavelets, while the curvelets and Haar wavelets have much larger extremas.

4.3.4. Density power spectrum and multi-band analysis In order to examine how the coherent and incoherent part of the density are distributed across scales, Figs. 15 and 16 compare the power spectrum at coherent 3% coeﬃcients, and the remaining 97% coeﬃcients. All methods capture the large and part of the inertial range scales, and the results depart from the original spectrum at small scales. The wavelet methods introduce small scale noise, which translates into increased values at higher wave-numbers. Within

J. Pulido et al. / Computers and Fluids 125 (2016) 39–58

51

Table 4 Density comparison at 3% coeﬃcients. The methods with the most accurate properties are represented in bold. The cubic B-splines have the Cubic splines have the best point-wise characteristics for enstrophy representation compared to any other method using these metrics. Although not as accurate from the point of view of PSNR and MSE, quartic B-splines exhibit a more consistent maximum point-wise error throughout the entire dataset. Method

Fig. 13. Density ﬁeld MSE variation with the percentage of coeﬃcients. Lower MSE values represent a more accurate point-wise reconstruction. The error becomes indistinguishable for more than 10% coeﬃcients used in the reconstruction. The results show that cubic B-splines have the lowest MSE at very low percentages of coeﬃcients.

Fig. 14. Residual density PDF. PDF for the remaining 97% coeﬃcients. Two Gaussian signals matching the variances of the curvelet and B-spline wavelet results are plotted.

Haar Linear B-spline Quadratic B-spline Cubic B-spline Quartic B-spline Quintic B-spline Daubechies-3 Daubechies-5 Daubechies-7 Coiﬂet-6 Coiﬂet-12 Coiﬂet-18 Dual-Tree Surfacelet Curvelet

PSNR 30.114 37.940 41.095 41.282 40.553 39.753 38.029 39.858 39.142 34.760 38.860 38.830 39.020 38.169 33.075

MSE

Max Error −5

1.0791x10 1.7807x10−6 8.6113x10−7 8.2476x10−7 9.7552x10−7 1.1729x10−6 1.7442x10−6 1.1449x10−6 1.3502x10−6 3.7028x10−6 1.4406x10−6 1.4505x10−6 1.3884x10−6 1.6892x10−6 5.4580x10−6

5.0793x10−2 1.9407x10−2 1.4169x10−2 1.1944x10−2 1.1424x10−2 1.4073x10−2 1.5653x10−2 1.5414x10−2 1.6532x10−2 2.2010x10−2 1.4867x10−2 1.5929x10−2 1.4690x10−2 1.3875x10−2 2.8111x10−2

each class, higher order methods capture better the high frequency features of the density ﬁeld. The Dual-tree wavelet transform is able to provide the closest reconstruction, using 3% of the coeﬃcients, to the original spectrum. The same order of B-spline family is able to achieve the next closest reconstruction. On the other hand, the surfacelet and curvelet transforms tend to smooth out the high frequency components and the resulting spectrum is lower than the original spectrum at high wave-numbers. Both methods also capture a large amount of energy in the remaining 97% coeﬃcients due to their high redundancy. To compare the surfacelet and curvelet band systems with respect to the wavenumber coverage, a number of 6 bands are chosen, where band 0 represents the largest features and band 5 represents the smallest. As previously explained, a larger number of bands are used in this set of the calculations to better distribute the energy and better measure the strengths of each method with respect to the coherent/incoherent decomposition and capture of the ﬂow structure within each band. Except band 0, the curvelet bands are narrower than the surfacelet bands, providing more localization in wavenumber space (Fig. 17). This can be seen, quantitatively, from Table 5 which measures the contribution of each band to the total density variance. As proposed by Pullin et al. [5], the integral velocities for each band despite the different length of scales sum up to

Fig. 15. Density power spectrum for the coherent density ﬁeld. The k−5/3 variation is also plotted for comparison. A close-up for higher wavenumbers is provided in b).

52

J. Pulido et al. / Computers and Fluids 125 (2016) 39–58

Fig. 16. Density power spectrum for the residual density. The k2 variation corresponding to a Gaussian signal is also plotted. A close-up for higher wavenumbers is provided in b).

Table 5 Band contribution to density variance.

a

Band

Surfacelet ρ 2 i / ρ 2

Curvelet ρ 2 i / ρ 2

Original 0 1 2 3 4 5

1.0000000 0.6785391 0.2080657 0.0861972 0.0237471 0.0032868 0.0001637

1.0000000 0.8000469 0.1447320 0.0465525 0.0080909 0.0005679 0.0000095

virtually 1 although regions overlap. This shows both methods, although overly redundant, are able to preserve energy well when factoring in all bands. From the point of view of ﬂow analysis, the narrower curvelet bands may allow better representation of speciﬁc ﬂow structures associated with certain frequencies. However, the wider bands surfacelets provide lead to better overall representation of the data in terms of percentage of coeﬃcients as seen in Fig. 12 above.

b

Fig. 17. Density power spectrum band-based decomposition using six bands for (a) curvelets and (b) surfacelets. Higher level surfacelet bands are wider compared to those of curvelets.

4.3.5. Pure light ﬂuid isosurface visualization So far, the performance of the representation methods has been investigated with respect to describing various turbulent ﬁelds and their separation into coherent and incoherent parts. However, in turbulence problems involving scalar mixing, e.g. pollutant dispersion or non-premixed combustion, accurately capturing constant property surfaces is also very important. Fig. 18 visualizes isosurfaces of pure light ﬂuid obtained using the reconstruction based on 3% coefﬁcients. Although the wavelet methods may have higher point-wise PSNR, the curvelet and surfacelet isosurfaces are much smoother and very near to the original dataset. Both curvelets and surfacelets sacriﬁce point-wise accuracy in exchange for superior curved structure representation. Among the other methods, the Haar wavelets have the coarsest representation of isosurfaces, as expected. The representation becomes very blocky and it has the highest visible difference compared to the original isosurfaces. The coarsening effect is ampliﬁed when derivatives are calculated making the Haar wavelets unsuitable for the preservation of constant property surfaces. Although cubic B-splines, Daubechies, and Coiﬂets are able to reproduce smoother surfaces than Haar wavelets, there are visible ripples in the isosurface visualization related to the sinusoidal-like reconstruction signal that these methods are representing (seen in Fig. A.21 in the Appendix). In comparison, curvelets, surfacelets and Dual-tree wavelets, which are designed to preserve smooth structures, perform much better. The exchange of point-wise accuracy for smooth structure preservation

J. Pulido et al. / Computers and Fluids 125 (2016) 39–58

53

Fig. 18. Density Isosurface comparison. Isosurfaces representing the pure light ﬂuid are constructed using the largest 3% coeﬃcients. From left to right: original, Haar, cubic B-spline, Daubechies-5, Dual-tree wavelets, curvelet, and surfacelet transforms. Haar wavelet isosurfaces are extremely blocky. Surfacelets and curvelets provide the smoothest representation, with cubic B-splines, Daubechies, and Dual-tree having noticeable ripples in their isosurfaces.

by these methods can be seen by comparing Fig. 10, showing the log-difference of the 2D slice, and Fig. 18, showing the pure light ﬂuid isosurface. Surfacelets are able to provide the smoothest reconstruction at 3% coeﬃcients with the lowest difference to the original visualization. 4.3.6. Pure light ﬂuid isosurface area, mean and Gaussian curvatures In addition to qualitative results by visually comparing the smoothness of isosurfaces, quantitative assessments can also be made using differential geometry tools. In this section, the surface area and average mean and Gaussian curvatures for the light and fully mixed isosurfaces obtained using the largest 3% of the coeﬃcients are compared with the original ﬁeld, while in the next section the surface signature is discussed. The averages are calculated as sums over all points on the surface divided by the number of points. The deﬁnitions and formulas used are given in Section 4.1.1. Table 6 shows the ratios of this quantities to the original dataset for all methods considered. Values closer to 1 indicate better quality of the reconstruction. In general, the surface area is relatively well captured by all methods (with larger departures from the original for the light ﬂuid isosurface), while the average mean and Gaussian curvatures can be signiﬁcantly under- or over-predicted without an overall winner. Even though surfacelets and curvelets show smoothest surfaces, they can still miss both the surface area and average curvatures. At the time during the ﬂow evolution when the methods are compared, there is relatively little pure light ﬂuid left, while the fully mixed ﬂuid occupies a larger volume. Consequently, the surface area of the mixed ﬂuid isosurface is overall better captured by all methods. The large blockiness in the Haar wavelets representation of constant property surfaces can be clearly seen in the large departures from 1 in all entries of Table 6. For the pure light ﬂuid isosurface, again, within

each class, the middle members provide the best quality reconstruction. Thus, lower families tend to not have enough representation to capture all the folds in the original surfaces, while the higher families introduce oscillations which affect the curvature of the surface. Such a trend is not seen for the fully mixed ﬂuid isosurface, which has more folds and wrinkles due to the larger volume encompassed. In this case, the oscillations introduced by the higher order methods may help capture more of the details of the surface when they overlap to the surface wrinkles; however, such a match cannot not lead to a consistent trend. 4.3.7. Surface signature of the pure light, pure heavy and mixed ﬂuids isosurfaces The surface signature can provide details on the structure of the surface [5]. Following the deﬁnition given in Section 4.1.1, Fig. 19 compares the surface signature properties of the original dataset and a subset of the methods. There is a large disparity between the original dataset, the Haar wavelets and the remaining schemes. To start, the original surface signatures characterize all coarse and ﬁne features in the dataset. The dark region near the center of the 2D histogram of the original surface signature corresponds to the ﬁne features of the isosurfaces. None of the methods is able to reproduce the ﬁne portion of the original dataset. In general, all higher order methods over-predict the amount of small scale features of the pure ﬂuids isosurfaces and under-predict this amount for the mixed ﬂuid isosurface. This is consistent with the representation of the average mean and Gaussian curvatures discussed in the previous section. Again, at the time in the ﬂow evolution when the comparison are made, the amount of mixed ﬂuid is much larger than either pure ﬂuids and, consequently, the mixed ﬂuid isosurface presents more folds and wrinkles. The

54

J. Pulido et al. / Computers and Fluids 125 (2016) 39–58 Table 6 Isosurface area and average curvatures. Methods closest to 1.0 are in bold. Ratios of surface area and average mean and Gaussian curvatures for the pure light and fully mixed ﬂuids isosurfaces reconstructed using 3% coeﬃcients to the original dataset. Pure light ﬂuid

Fully mixed ﬂuid

Method

Surface area

Mean curvature

Gaussian curvature

Surface area

Mean curvature

Gaussian curvature

Haar Linear B-spline Quadratic B-spline Cubic B-spline Quartic B-spline Quintic B-spline Daubechies-3 Daubechies-5 Daubechies-7 Coiﬂet-6 Coiﬂet-12 Coiﬂet-18 Dual-Tree Surfacelet Curvelet

1.346 1.173 1.143 1.014 0.954 0.988 1.080 1.024 1.017 1.199 1.053 1.015 0.987 0.921 1.183

0.523 0.298 0.453 1.192 2.867 2.607 1.745 2.710 3.985 1.355 2.164 3.073 2.643 2.862 2.005

219.030 0.577 0.829 1.078 1.346 1.259 0.094 1.356 1.505 1.062 1.051 1.250 1.085 1.994 1.126

0.991 0.983 0.971 0.995 0.995 0.994 0.991 0.994 0.991 1.001 0.998 0.999 0.965 0.957 0.920

0.627 0.914 0.880 0.981 0.947 0.968 1.020 0.264 1.060 0.960 0.970 0.903 0.900 0.995 0.891

11.483 0.655 0.835 0.625 0.682 0.661 0.590 0.605 0.609 0.531 0.613 0.574 0.653 0.931 0.574

Table 7 Performance. Compute time in seconds, memory usage in megabytes (MB) and Eﬃciency as compute time per coeﬃcient in seconds ∗ 10−7 . The simplest methods are faster to compute compared to the higher order methods. Surfacelets have the best eﬃciency by extracting the most coeﬃcients for their measured compute time. Method

Compute (s)

Memory (MB)

Eﬃciency

Haar Linear B-spline Quadratic B-spline Cubic B-spline Quartic B-spline Quintic B-spline Daubechies-3 Daubechies-5 Daubechies-7 Coiﬂet-6 Coiﬂet-12 Coiﬂet-18 Dual-Tree Surfacelet Curvelet

16.390 17.282 17.601 22.320 24.304 30.373 18.866 21.863 25.342 20.130 25.221 30.045 79.667 31.666 77.660

1065 1072 1080 1092 1106 1179 1057 1090 1125 1059 1113 1152 4096 4928 8214

1.2211 1.2474 1.2891 1.5611 1.6703 1.9914 1.3616 1.5291 1.7156 1.4528 1.7333 1.9699 1.4839 0.6891 1.4041

oscillations introduced by the representation methods discussed will over-predict these features for the pure ﬂuids isosurfaces and underpredict for the mixed ﬂuid isosurface. In addition, the departure from the original dataset can not be attributed to Gaussian white noise. These results underline the speciﬁc problem of representing turbulence ﬁelds with non-physics based, universal basis functions. While in many problems the details of the small scales are not important, there are instances, e.g. reaction fronts in non-premixed combustion, when the lack of accurate representation of the scalar ﬁelds can lead to serious errors. For these cases, physics based sub-grid modeling can remedy the situation; however, ﬁnding optimal such models is still an outstanding open question.

4.4. Performance and memory usage Performance is evaluated by measuring the runtime of each transform during its decomposition stage for a single scalar ﬁeld from the dataset. Only the time taken during decomposition is presented as the synthesis operation generally takes the same amount of time. As before, the single-threaded variant of all methods is evaluated. Memory usage is deﬁned as the storage space consumed for the decomposed coeﬃcients and does not include the space used to hold the input of the original dataset.

The results in Table 7 represent the compute time required only for the forward transform of a targeted 3% coeﬃcients, averaged over the series of ﬁve runs. The amount of effort required in extracting the strongest 3% coeﬃcients is evaluated. During this evaluation, it is considered that the multi-band transforms can be performed without the need to extract all coeﬃcients. The term ‘Eﬃciency’ is deﬁned as the amount of effort needed to compute one coeﬃcient. Effort is a function of time taken over the total number of coeﬃcients extracted. Due to their simplistic nature, the lowest-order wavelets perform the fastest and require the lowest amount of storage. The increase in wavelet complexity and family order coincides with the increase of compute time as well as number of coeﬃcients extracted and memory usage. Surfacelets provide the best of compromises in terms of compute time per coeﬃcient, overall compute time, and the ability to extract multiple bands from the data at a comparable cost to other methods. The curvelet transform takes the longest amount of time to compute due to the extraction of many coeﬃcients. When requesting the same number of bands as surfacelets, curvelets take much longer to compute and extract more coeﬃcients. In terms of eﬃciency, the surfacelet transform is the clear winner. Although the curvelet transform may take long to compute, it does performs remarkably well for the amount of coeﬃcients extracted compared to the simpler B-splines and Daubechies wavelets. The curvelet transform implementation provided by the respective authors contains a message passing interface (MPI), out-of-core C++ implementation where an almost linear speedup is observed in relation to the number of additional processors used when increased by powers of two. Although costly in performance, the MPI implementation makes the curvelet transform readily usable for large scientiﬁc computing applications where distributed compute nodes can be used to offset their high compute cost and memory requirements. 5. Conclusions We have compared the effectiveness of various multi-resolution representation methods, including B-spline, Daubechies, Coiﬂet and Dual-tree wavelets, curvelets and surfacelets, to evaluate these methods’ ability to capture the structure of fully developed turbulence using a truncated set of coeﬃcients. The methods were tested by considering quantities pertaining to both velocity and active scalar (density) ﬁelds and their derivatives, spectra, and the properties of constant density surfaces. Previous comparisons related to such decompositions of turbulent ﬁelds were restricted to orthogonal and biorthogonal (Harten) wavelets as applied to velocity and enstrophy PDFs, the kinetic energy spectrum and the PDF of helicity and Lamb vector [24] and the curvelet transform as a multi-scale analysis tool of

J. Pulido et al. / Computers and Fluids 125 (2016) 39–58

55

Fig. 19. Surface signatures. Surface signatures described as a joint probability density function of the shape index, S, and curvedness, C, together with their marginal PDFs. From top to bottom: original, Haar, cubic B-spline, Daubechies-5 wavelets, curvelets and surfacelets. From left to right: pure light, fully mixed and pure heavy ﬂuid isosurfaces.

turbulence [4,5]. While CVS-type decompositions have been mainly used to investigate the coherent/incoherent decomposition of ﬂow, here the decomposition is performed at the level of the primary variables, velocity and density. This approach is not entirely new, for example, it is the base of the SCALES method [9] and has also been used

for the analysis of a passive scalar [5]. The results presented address a more complex ﬂow and encompass more representation methods, with more quantities related to the velocity ﬁeld (e.g., the state of the deviatoric strain rate tensor, which may be a better measure of the local ﬂow coherence/incoherence than the velocity and vorticity PDFs)

56

J. Pulido et al. / Computers and Fluids 125 (2016) 39–58 Table 8 Summary. Compilation of the best respective methods for each test performed. Test type

Best method

Runner-up

Velocity PDF Vorticity PSNR and MSE Vorticity PDF Strain rate tensor Density visualization Density PSNR and MSE Density PDF Density power spectrum Isosurface Visualization Curvature quantities Surface signatures Performance

Cubic B-spline Cubic B-spline Cubic B-spline Quadratic B-spline Surfacelets Cubic B-spline Cubic B-spline Dual-Tree Surfacelets Cubic B-spline All methods except Haar Surfacelets

Daubechies-5 Quadratic B-spline Quadratic B-spline Cubic B-spline Cubic B-spline Quadratic B-spline Daubechies-5 Quintic B-Spline Curvelets Surfacelets – Haar

and, for the ﬁrst time, quantities related to an active scalar ﬁeld and its constant property surfaces. In addition, comparisons between the algorithms are given in terms of performance, accuracy, and compression properties. The results provide useful information for multi-resolution analysis of turbulence, coherent feature extraction, compression for large datasets handling, as well as simulations algorithms based on multiresolution methods. A list of recommended methods for each test is provided in Table 8. In general, any method is superior to Haar wavelets as well as the constant members of the wavelet classes considering our series of tests. On the other hand, the results show that increasing the family order above a certain value is not always the ideal solution towards higher accuracy. At 3% coeﬃcients, large structures within the ﬂow are well preserved and at that percentage it is not necessary to go to very high order methods. Since the results are very similar across the higher orders of B-splines, it is not recommended to go above cubic B-splines, sacriﬁcing more compute time and questionable accuracy gains. Daubechies wavelets are generally overshadowed by B-splines so their use is not recommended unless orthogonal properties must be preserved. Although Coiﬂets have a few advantages to Daubechies in derived surface quantities and preservation of kinetic energy, their representations of curved surfaces and scalar quantities are not ideal. Dual-tree wavelets contain properties that make them useful for curved surface preservation and energy separation, especially above 3% coeﬃcients, but at the cost of high redundancy, point-wise accuracy, and a large compute cost. Based on their overall performance, the cubic B-spline wavelet is recommended as the general method for the turbulence data applications considered here. Surfacelets and curvelets have speciﬁc applications and advantages where they are able to identify speciﬁc features at different scales in turbulence, taking full advantage of the multi-scale interface of these methods but further analysis is required. Both surfacelets and curvelets provide superior representation of smooth surfaces compared to any other method for general applications in turbulence. Surfacelets are recommended over curvelets since they are much more eﬃcient in computation, reconstruct more accurately at the same number of coeﬃcients, and capture curved surfaces closest to the original data even compared to all the methods tested. The selection process described in this paper should be useful in several areas, including multi-resolution analysis of turbulence, coherent feature extraction, compression for large datasets handling, as well as simulations algorithms based on multi-resolution methods. While some of the algorithms discussed have already been used for simulations algorithms based on multi-resolution methods and others for multi-resolution analysis of turbulence, this survey offers a comprehensive view of most of the methods which are candidates for multi-resolution analysis and computation of turbulence. In addition, with the continuous increase in the computational platform speed and size, we would like to stress the emerging importance

of compression algorithms for large dataset handling. For example, multi-resolution visualization for large datasets [37] and simulation applications for in-situ analysis and visualization can be facilitated through the use of these algorithms. Projects such as the JHTDB [11] can be improved to support multi-resolution analysis and visualization to both reduce storage and transmission costs as well as create faster data visualizations of interesting structures. Opportunities for remote visualization are also available through the application of these methods at the data and image level. Acknowledgments We would like to thank our colleagues John Patchett and Curt Canada in the Computer, Computational, and Statistical Sciences division for their feedback and support, as well as the Institute for Next Generation Visualization and Analysis (INGVA). Los Alamos National Laboratory is operated by Los Alamos National Security, LLC for the US Department of Energy National Nuclear Security Administration under Contract No. DE-AC52-06NA25396. Computational resources were provided by the Los Alamos National Laboratory Institutional Computing Program. Daniel Livescu acknowledges the support of the U.S. Department of Energy through the LANL/LDRD Program for this work. Appendix A. Wavelet construction and their signals Wavelet signals in general have several vanishing moments. This property allows for a sparse but accurate representation of an input dataset with only a small number of coeﬃcients. A signal is decomposed through multiple steps involving “folds” at the largest scale until it reaches the smallest data scale. For the 3-D Wavelet Transform, an input data of size X (length) by Y (width) by Z (height) is processed by utilizing the 1-D signal decomposition step in the multiple dimensions. After a single iteration of the 1-D decomposition for each dimension, the result consists of a series of seven high-pass coeﬃcients and one set of pure low-pass coeﬃcients shown in Fig. A.20. Any region that has full or partial high-pass coeﬃcients such as WHLL and WHHH can be used to describe the original data at that resolution. The low-pass coeﬃcients WLLL are recursively computed by halving the size of each dimension per iteration until the data can no longer be halved further. This method can be described as a top-down approach, where the small-scale features in the data are captured initially and for each iteration, larger features are described. The ﬁnest coeﬃcients, smallest in magnitude are extracted ﬁrst and in the end, the coarsest, largest in magnitude coeﬃcients are extracted. The continuous representations of the wave signals we use here can be found in Fig. A.21. A1. B-spline wavelet families When increasing the family order for B-spline wavelets, an improvement is expected in the representation of non-rigid surfaces. The change in signal behavior by increasing the family order can be seen in Fig. A.21. An increase in the number of control points within each family generally introduces a large number of oscillations in the data and reduces the overall quality of a reconstruction. The effect of increased control points on the constant family of signals can be seen in Fig. A.22. As a result, the B-spline wavelets chosen for this paper were tested with the least number of control parameters per family. Rather than increasing the number of control point parameters, an increase in family order enhances the complexity of the signal and improves the quality of a reconstruction. The discrete versions of these signals were used in form of ﬁlters as described by Shensa [35]. By considering a sinusoidal signal instead of the top-hat, a wavelet transform can capture superior directionality of a dataset.

J. Pulido et al. / Computers and Fluids 125 (2016) 39–58

57

Fig. A.20. Signal Decomposition. Decomposition of a wavelet for dataset X using a single step in three dimensions with the remaining coarse (WL ) and ﬁne (WH ) coeﬃcients. For each subsequent step, the low-pass coeﬃcients WLLL are iterated upon.

Fig. A.21. Wavelet mother reconstruction signals. As the family order increases, signals in the frequency domain become more complex creating more vanishing moments. The following signals represent biorthogonal wavelets: (a) Haar, constant (b) linear B-spline (c) quadratic B-spline (d) cubic B-spline (e) quartic B-spline and (f) quintic B-spline. Orthogonal signals are also shown in comparison as (g) Daubechies-3 (h) Daubechies-5 (i) Daubechies-7 (j) Coiﬂet-6 (k) Coiﬂet-12 and (l) Coiﬂet-18.

Fig. A.22. Constant biorthogonal B-splines and control points. As the number of control parameters increases, signals become more complex. The reconstruction wavelet functions presented are subsets of the constant biorthogonal wavelet family using (a) 1 control point, Haar (b) three control points, Harten and (c) ﬁve control points.

58

J. Pulido et al. / Computers and Fluids 125 (2016) 39–58

References [1] Candes E, Demanet L, Donoho D, Ying L. Fast discrete curvelet transforms. 2005. [2] Lu YM, Do MN. Multidimensional directional ﬁlter banks and surfacelets. IEEE Trans Image Process 2007:918–31. [3] Farge M, Schneider K, Pellegrino G, Wray A, Rogallo R. Cvs decomposition of 3d homogeneous turbulence using orthogonal wavelets. Proc Summer Program 2000:305. [4] Ma J, Hussaini MY, Vasilyev OV, Le Dimet F. Multiscale geometric analysis of turbulence by curvelets. Phys Fluids 2009;21(7). [5] Bermejo-Moreno I, Pullin DI. On the non-local geometry of turbulence. J Fluid Mech 2008;603:101–35. [6] Vasilyev OV, Paolucci S. A fast adaptive collocation algorithm for multidimensional PDEs. J Comput Phys 1997;138:16–56. [7] Vasilyev OV, Bowman C. Second generation wavelet collocation method for the solution of partial differential equations. J Comput Phys 2000;165:660–93. [8] Paolucci S, Zikoski ZJ, Wirasaet D. WAMR: an adapative method for the simulation of compressible reacting ﬂow. Part I. accuracy and eﬃciency of the algorithm. J Comput Phys 2014;272:814–41. [9] Goldstein D, Vasilyev OV. Stochastic adaptive large eddy simulation method. Phys Fluids 2004;16:2497. [10] Goldstein DE, Vasilyev OV, Kevlahan NK. Cvs and scales simulations of 3d isotropic turbulence. J Turbul 2005. [11] Perlman E, Burns R, Li Y, Meneveau C. Data exploration of turbulence simulations using a database cluster. In: Proceedings of the ACM/IEEE Conference on Supercomputing; 2007. [12] Livescu D, Mohd-Yusof J, Petersen MR, Grove JW. CFDNS: a computer code for direct numerical simulation of turbulent ﬂows. Tech. Rep. Los Alamos National Laboratory; 2009. LA-CC-09-100. [13] Livescu D, Ristorcelli JR. Buoyancy-driven variable-density turbulence. J Fluid Mech 2007;591:43–71. [14] Livescu D, Ristorcelli JR. Variable-density mixing in buoyancy-driven turbulence. J Fluid Mech 2008;605:145–80. [15] Livescu D. Numerical simulations of two-ﬂuid turbulent mixing at large density ratios and applications to the rayleigh-taylor instability. Phil Trans R Soc A 2013;371:20120185. [16] Livescu D, Canada C, Kalin K, Burns R, staff I, Pulido J. Homogeneous buoyancy driven turbulence data set. Tech. Rep. Los Alamos National Laboratory; 2014. LAUR-14-20669, available at http://turbulence.pha.jhu.edu/docs/README-HBDT. pdf. [17] Daubechies I. Ten lectures on wavelets. PA: SIAM 1992. [18] Sweldens W. The lifting scheme: a construction of second generation wavelets. SIAM J Math Anal 1998;29(2):511–46. [19] Farge M. Wavelet transforms and their applications to turbulence. Annu Rev Fluid Mech 1992;24(1):395–458. [20] Beylkin G, Coifman RR, Rokhlin V. Fast wavelet transforms and numerical algorithm. Pure Appl Math 1991;44:141–83. [21] Daubechies I. Orthonormal bases of compactly supported wavelets ii. variations on a theme. SIAM J Math Anal 1993;24(2). [22] Tian J, Wells RO Jr. Vanishing moments and wavelet approximation. Tech. Rep. Computational Math. Lab., Rice Univ; 1995.

[23] Deriaz E, Domingues MO, Perrier V, Schneider K, Farge M. Divergence-free wavelets for coherent vortex extraction in 3d homogeneous isotropic turbulence. ESAIM: Proc 2007;16:146–63. [24] Roussel O, Schneider K, Farge M. Coherent vortex extraction in 3d homogeneous turbulence: comparison between orthogonal and biorthogonal wavelet decompositions. J Turbul 2005:N11. [25] Cohen A, Daubechies I, Feauveau JC. Biorthogonal bases of compactly supported wavelets. Commun Pure Appl Math 1992;45(5):485–500. [26] González P, Cabaleiro JC, Pena TF. Parallel computation of wavelet transforms using the lifting scheme. J Supercomput 2001;18(2):141–52. [27] Bertram M, Duchaineau MA, Hamann B, Joy KI. Generalized b-spline subdivisionsurface wavelets for geometry compression. IEEE Trans Vis Comput Graph 2004;10(3):326–38. [28] Kingsbury NG. Complex wavelets for shift invariant analysis and ﬁltering of signals. Appl Comput Harmon Anal 2001;10(3):234–53. [29] Salesnick IW, Baraniuk RG, Kingsbury NG. The dual-tree complex wavelet transform. Signal Process Mag, IEEE 2005;22(6):123–51. [30] Nguyen van yen RN, Farge M, Schneider K. Wavelet regularization of a fouriergalerkin method for solving the 2d incompressible euler equations. ESAIM: Proc 2009;29:89–107. [31] Pereira RM, Nguyen van yen R, Farge M, Schneider K. Wavelet methods to eliminate resonances in the galerkin-truncated burgers and euler equations. Phys Rev E 2013;87(3):033017. [32] Bamberger RH, Smith MJT. A ﬁlter bank for the directional decomposition of images: theory and design. IEEE Trans Signal Process 1992;40(4):882–93. [33] Do MN, Vetterl M. The contourlet transform: an eﬃcient directional multiresolution image representation. IEEE Trans Image Process 2005;14(12):2091–106. [34] Yang Y, Pullin DI, Bermejo-Moreno I. Multiscale geometric analysis of lagrangian structures in isotropic turbulence. J Fluid Mech 2010;654:233–70. [35] Shensa MJ. The discrete wavelet transform: wedding the a trous and mallat algorithms. IEEE Trans on Signal Process 1992;40(10):2464–82. [36] Bertram M. Multiresolution modeling for scientiﬁc visualization , Ph.D. thesis. Center for Image Processing and Integrated Computing, University of California, Davis; 2000. [37] Wiley DF, Childs HR, Hamann B, Joy KI, Max NL. Best quadratic spline approximation for hierarchical visualization. In: Data Visualisation 2002, Eurographics Association, Aire-la-Ville, Switzerland; 2002. p. 133–40. [38] Woodring J, Mniszewski SM, Brislawn CM, DeMarle DE, Ahrens JP. Revisiting wavelet compression for large-scale climate data using jpeg 2000 and ensuring data precision. In: Rogers D, Silva CT, editors. LDAV. IEEE; 2011. p. 31–8. [39] Lund TS, Rogers MM. An improved measure of strain state probability in turbulent ﬂows. Phys Fluids 1994;6(5):1838–47. [40] Maz’ja V. Sobolev spaces. Berlin, Germany: Springer-Verlag; 1985. [41] Ryu J, Livescu D. Turbulence structure behind the shock in canonical shockvortical turbulence interaction. J Fluid Mech 2014;756:R1. [42] Perry AE, Chong MS. A description of eddying motions and ﬂow patterns using critical-point concepts. Annu Rev Fluid Mech 1987;19:125. [43] Donoho DL, Johnstone IM. Ideal spatial adaptation by wavelet shrinkage. Biometrika 1994;81:425–55.

Survey and analysis of multiresolution methods for turbulence data

Survey and analysis of multiresolution methods for turbulence data

Recommend Documents