Thermodynamic modeling of hydrous-melt–olivine equilibrium using exhaustive variable selection

Thermodynamic modeling of hydrous-melt–olivine equilibrium using exhaustive variable selection

Journal Pre-proof Thermodynamic modeling of hydrous-melt–olivine equilibrium using exhaustive variable selection Kenta Ueki, Tatsu Kuwatani, Atsushi ...

2MB Sizes 0 Downloads 21 Views

Journal Pre-proof Thermodynamic modeling of hydrous-melt–olivine equilibrium using exhaustive variable selection

Kenta Ueki, Tatsu Kuwatani, Atsushi Okamoto, Shotaro Akaho, Hikaru Iwamori PII:

S0031-9201(19)30169-4

DOI:

https://doi.org/10.1016/j.pepi.2020.106430

Reference:

PEPI 106430

To appear in:

Physics of the Earth and Planetary Interiors

Received date:

19 June 2019

Revised date:

29 November 2019

Accepted date:

17 January 2020

Please cite this article as: K. Ueki, T. Kuwatani, A. Okamoto, et al., Thermodynamic modeling of hydrous-melt–olivine equilibrium using exhaustive variable selection, Physics of the Earth and Planetary Interiors(2018), https://doi.org/10.1016/ j.pepi.2020.106430

This is a PDF file of an article that has undergone enhancements after acceptance, such as the addition of a cover page and metadata, and formatting for readability, but it is not yet the definitive version of record. This version will undergo additional copyediting, typesetting and review before it is published in its final form, but we are providing this version to give early visibility of the article. Please note that, during the production process, errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

© 2018 Published by Elsevier.

Journal Pre-proof

1

Thermodynamic modeling of hydrous-melt–olivine equilibrium using exhaustive variable selection Kenta Ueki1,* ORCID: 0000-0002-5936-3856 Tatsu Kuwatani1,2 ORCID: 0000-0001-7161-9458 Atsushi Okamoto3 ORCID: 0000-0001-5757-6279 Shotaro Akaho4 ORCID: 0000-0002-4623-2718 Hikaru Iwamori1,5,6 ORCID: 0000-0001-5913-6410

Research Institute for Marine Geodynamics, Japan Agency for Marine-Earth Science and

oo f

1

Technology, 2-15 Natsushima-cho, Yokosuka 237-0061, Japan 2

PRESTO, Japan Science and Technology Agency (JST), 4-1-8 Honcho, Kawaguchi,

pr

Saitama 332-0012, Japan

Graduate School of Environmental Studies, Tohoku University, Sendai 980-8579, Japan

4

Human Informatics Research Institute, National Institute of Advanced Industrial Science and

e-

3

5

Pr

Technology, 1-1-1 Umezono, Tsukuba, Ibaraki 305-8568, Japan Earthquake Research Institute, The University of Tokyo, 1-1-1 Yayoi, Bunkyo-ku, Tokyo

6

al

113-0032, Japan

Department of Earth and Planetary Sciences, Tokyo Institute of Technology, 2-12-1

rn

Ookayama, Meguro-ku, Tokyo 152-8550, Japan

Jo u

*Corresponding author: Kenta Ueki ([email protected]) Tel.: +81-46-867-9628

Keywords: Thermodynamics ‧ Machine learning ‧ Hydrous melt ‧ Mantle melting ‧ thermodynamic equilibrium model ‧ Basalt Highlights A machine learning method for petrological thermodynamic modeling is presented The method selects key variables from experimental data on hydrous-melt systems A robust thermodynamic model of the hydrous-melt–olivine equilibrium is constructed Model calculations reproduce measured effect of H2 O on olivine liquidus temperature Improved understanding of the underlying mechanisms of hydrous-melt systems

Journal Pre-proof

2

Abstract Water in silicate melt influences the phase relations of a hydrous- melt system. Given the importance of water in silicate melts, a quantitative thermodynamic understanding of the nonideality of hydrous melt is necessary to properly model natural magmatic processes. This paper presents a novel method for quantitative thermodynamic modeling of hydrous- melt– olivine equilibrium. Specifically, a machine learning method, exhaustive variable selection (ES), is used to model the non- ideality of hydrous melts. Using the ES method, we quantitatively validate the predictive capacities of all possible combinations of variables and then adopt the combination with the highest predictive capacity as the optimal model

oo f

equation. The ES method allows us to obtain the underlying thermodynamic relationship of the hydrous- melt–olivine system, such as the relative importance of different variables to the thermodynamic equilibrium, as well as to construct a robust and generalized model. We show

pr

that the combination of a linear term and a squared term of the total water concentration of

e-

melt is significant for describing the hydrous- melt–olivine equilibrium. This result is interpreted in terms of the microstructural changes related to the dissociation of water in

Pr

silicate melt. Calculations using the optimal model reproduce the experimentally determined effects of water on the olivine liquidus and the distribution coefficient for Mg between

al

olivine and hydrous melt. Our study demonstrates that the ES method yields a thermodynamic equilibrium model that captures the essential thermodynamic relationship

1 Introduction

Jo u

rn

explaining the high-dimensional and complex experimental data.

Understanding the phase relations of hydrous melt is fundamental to ascertaining the processes of mantle melting under hydrous conditions and the differentiation of hydrous magmas in arc settings (e.g., Tatsumi, 1989; Sisson and Grove, 1993; Kawamoto, 1996; Grove et al., 2003). Melting experiments have revealed that water drastically changes the phase relations of hydrous- melt systems (e.g., Kushiro et al., 1968; Kushiro, 1969; Green, 1973; Mysen and Boettcher, 1975a, b; Médard and Grove, 2008; Green et al., 2010, 2014). Such experiments are performed by varying the pressure, temperature, melt water concentration, and bulk composition of the system (Fig. 1). Thermodynamic modeling allows such discrete experimental data to be integrated and a model calculation to be conducted under various conditions (e.g., Ghiorso and Sack, 1995; Ghiorso et al., 2002; Gualda et al., 2012; Ueki and Iwamori, 2013, 2014; Jennings and Holland, 2015).

Journal Pre-proof

3

Natural magmatic systems are open systems in which the bulk chemical composition and total energy of the system evolve with melting, solidification, and melt migration (e.g., Langmuir et al., 1977). The relationships between pressure (P), temperature (T), bulk composition, phase stability, and phase properties (composition, fraction, and volume), and the mass and energy balances between phases need to be considered to properly investigate a dynamic and open-system magma environment. Thermodynamic calculations can yield consistent relationships between mass and energy balances. Various equations from an ideal mixing model (e.g., Ueki and Iwamori, 2013) and from more complex models that consider the interactions between various end- member components (e.g., Ghiorso and Sack, 1995)

oo f

have been proposed to model the thermodynamic relationship between the composition of silicate melt and its Gibbs free energy. Ueki and Iwamori (2013, 2014) demonstrated that a relatively simple thermodynamic equation for silicate melt could reproduce the melting phase

pr

relation of anhydrous alkaline-absent spinel lherzolite. However, hydrous melts have not

e-

been considered in some recent thermodynamic models of mantle melting (e.g., Ueki and Iwamori, 2013, 2014; Jennings and Holland, 2015) because of the difficulty of performing

Pr

such modeling.

As the detailed microstructural and chemical properties of hydrous melt under high

al

pressure and temperature are complicated, the experimental and theoretical understandings of the structure- free energy relation (i.e., the thermodynamic configurations and equations for

rn

modeling) are correspondingly poorly constrained. Consequently, despite its importance, the hydrous melt has

not yet been adequately

Jo u

phase relation of multi-component

thermodynamically modeled. Experimental studies on the microstructure (e.g., Mysen, 2007, 2014; Yamada et al., 2011) and first-principles molecular dynamics simulations (Mookherjee et al., 2008; Karki and Stixrude, 2010; Karki et al., 2010) have shown that water in silicate melt acts to depolymerize the melt via the reaction

(e.g.,

Wasserburg, 1957; Stolper, 1982). Ueki and Iwamori (2016) argued that a non- ideal interaction that considered the dissociation of water in silicate melt would be required to model the phase relation of a hydrous- melt system on the basis that such a system shows strongly non- linear relationships between melt compositions and phase relations (e.g., Hirschmann et al., 1998; Médard and Grove, 2008). To model the thermodynamic behavior of hydrous melt in continuous pressure–temperature–composition space, the appropriate model equation and parameters for composition-related non-ideality must be developed. Machine learning (ML) techniques (i.e., the science of using computers for the automatic detection of patterns in data; e.g., Bishop, 2006) can be used to understand the relationships

Journal Pre-proof

4

between variables and construct predictive models. ML techniques are being rapidly introduced to petrological and geochemical studies. For example, ML has been used for discrimination and feature extraction using labeled geochemical data (e.g., Kuwatani et al., 2014; Petrelli and Perugini, 2016; Ueki et al., 2018) and for pattern recognition in highdimensional geochemical data (e.g., Iwamori and Albaréde, 2008; Ueki and Iwamori, 2017; Yoshida et al., 2018). The present paper reports the novel results of an ML-based model-selection approach to the thermodynamic modeling of hydrous melt. “Model selection” refers to the ML task of selecting a statistical model that best reflects the various possible underlying mechanisms that

oo f

might have produced the patterns in the data. Using the ML method of exhaustive variable selection (ES), we are able to construct a predictive model from high-dimensional and complex data, and understand the physical mechanisms and processes underlying the data

pr

(e.g., Nakamura et al., 2017).

e-

The objective of the present study is to model the thermodynamic equilibrium of hydrous basaltic melt and olivine under conditions of the uppermost upper mantle. A large number of

Pr

possible equations are expected for modeling multicomponent silicate melts. For example, to select an equation that properly models the thermodynamic relationships among the multiple

al

components in silicate melt, we need to evaluate which components, including their interactions, are thermodynamically more important. In this study, we use an ML-based

rn

model-selection approach, in which we evaluate a set of candidate model equations for the

Jo u

equilibrium of hydrous basaltic melt and olivine and select the optimal model equation. The compiled experimental data set (experimental temperature, pressure, and melt and olivine compositions) is used as input data to determine the optimal equations for the thermodynamic model and its parameters. We use the leave-one-out cross-validation technique to select the optimal equation from the various possible equations. The leave-one-out cross- validation method leaves out one data case from the data set (i.e., validation data) and performs multiple linear regression on the remaining data to construct the model. Using the validation data, we evaluate how well the model can predict unknown data; i.e., how well the model predicts the equilibrium between olivine and melt for the temperature, pressure, and melt and olivine compositions of the validation data. This procedure is repeated for all data points in the data set, and the mean of the squared residuals between the model prediction (difference in molar Gibbs free energy between melt and olivine) and the validation data (= equilibrium state) is considered as the predictive capacity of the model equation for the unknown data (e.g., Bishop, 2006). The total number of candidate model equations is 1024 in the present study.

Journal Pre-proof

5

We evaluate all possible model equations on the basis of the predictive capacities of the equations and select the optimal model equation. Through the modeling of this fundamental system for magma generation, we demonstrate that the ML approach used in this study is a robust method that allows a generalized model of complex magma system to be constructed. The ML method also allows an understanding of the underlying thermodynamic relationship through, for example, quantifying the relative importance of different explanatory variables. The ML method and the constructed model should act as reference points for the future development of multi-component and multiphase melting thermodynamic models (Ueki and Iwamori, 2013, 2014) as well as for the

oo f

modeling of various geological and geochemical processes, mechanisms, and conditions, including metamorphic reactions (e.g., Okamoto and Toriumi, 2004) and ultrahigh-pressure

pr

phase relations such as metal–melt equilibria (e.g., Walter and Cottell, 2013).

e-

2 Basic thermodynamics

We model the thermodynamic equilibrium between hydrous basaltic melt and olivine at a

Pr

pressure range corresponding to spinel lherzolite stability (~1.0–2.5 GPa). Specifically, we model the thermodynamic relationship between the silicate components, water concentration,

al

pressure, and temperature of melt and the composition of olivine (Mg/(Mg + Fe)). Olivine is the primary constituent mineral of the upper mantle and is also the primary crystalline phase

rn

derived from basaltic melt. Therefore, the thermodynamic modeling of this system yields

Jo u

essential constraints on hydrous-mantle melting (e.g., Hirschmann et al., 2009) and on the differentiation of hydrous basaltic magma (e.g., Sisson and Grove, 1993). To model the hydrous basaltic melt–olivine equilibrium, we consider the following reaction: Mg2 SiO4 melt = Mg2 SiO 4 ol where superscript

melt

(1)

denotes the silicate melt phase, and superscript

ol

denotes the olivine

phase. The molar Gibbs free energy of the melt phase is modeled on the basis of the thermodynamic equation formulated by Ueki and Iwamori (2013, 2014), in which the difference in molar Gibbs free energy ( ) between the melt ( solid end-member component ( The

) is expressed as

of the reaction in Eq. 1 is written as (2)

(

) and the corresponding ).

Journal Pre-proof where

6

is the difference in molar Gibbs free energy between pure melt and the

corresponding pure solid end- member component (see Appendix A for details). Consequently,

is dependent on temperature and pressure, and is independent of

melt and olivine compositions.

denotes the reaction quotient (e.g., Putirka, 2017). As a

reaction proceeds, the reaction quotient changes to reduce the total Gibbs free energy of the system. When an equilibrium state is achieved (

),

is equal to the equilibrium

constant, and Eq. 2 can be written as (

(3)

is the activity.

oo f

where

)

By assuming the dissociation of the forsterite end- member in melt as Mg2 SiO 4 melt = 2MgOmelt + SiO 2 melt ,

in Eq. 3 can be rewritten as (4)

is the molar fraction of end- member component i in a phase, and

composition-related non-ideality.

is a

e-

where

)

pr

(

corresponds to an ideal solution.

Pr

In the present study, we focus on . Selection of a non- ideality model of silicate melt is critical to constructing a model that adequately reproduces the experimentally determined

al

melting phase equilibria (e.g., Beattie, 1993; Putirka, 2017). It has been suggested that nonideal interactions between H2 O and other melt end- member components are required to

rn

reproduce the non- linear relationship between the total water concentration of a melt (i.e., the total amount of dissolved water in the melt) and the phase equilibria (e.g., Médard and Grove,

Jo u

2008; Ueki and Iwamori, 2016). The immiscibility between silicate melt and aqueous fluid (e.g., Hunt and Manning, 2012) also indicates the non- ideal mixing behavior of water in silicate melt. The

term (Eq. 4), which describes the relationship between the total water

concentration of melt and the Gibbs free energy, needs to be considered in hydrous- melt thermodynamics. To develop such a numerical model for melt–olivine equilibrium, we use an ML approach to formulate a robust equation that describes the relationship between melt composition and Gibbs free energy.

3 Variable selection and model optimization using an ML approach 3.1 Basic configuration of the thermodynamic model We select and optimize the variables in , the composition-related non-ideality (Eq. 4), that adequately capture the underlying thermodynamic relationship of the hydrous- melt– olivine thermodynamic system. For this purpose, we first present a basic equation for variable

Journal Pre-proof

7

selection. We enumerate all possible terms in the basic equation and then select the essential terms using an ML approach. We consider an equation based on a symmetric regular solution as the basic equation. The composition-related non-ideality of multi-component silicate melt can be thermodynamically modeled using a symmetric regular solution type equation (e.g., Ghiorso et al., 1983). The composition-related non- ideality of an n end- member component symmetric regular solution is written as follows (e.g., Ghiorso et al., 1983): (5) (J/mol) represents the interaction parameters of end-member component i.

oo f

where

The maximum number of unknown thermodynamic parameters (

) in Eq. 5 is

because symmetric interactions between two different end- member

pr

components are considered in this equation. The model-selection approach determines the variables. Consequently, the total number of

e-

optimal combination of variables from the possible equations in this case is

. Such a large number of unknown parameters would

Pr

make the computational cost too high to adopt the ML method employed in this study (i.e., the problem of combinatorial explosion). In addition, a large dataset would be required to

al

obtain generalized and robust results when optimizing a large number of parameters (e.g., Bishop, 2006). Therefore, we employ a simplified equation based on Eq. 5 as the basic

rn

equation for the non-ideality model of this study.

Jo u

Carmichael (2002) analyzed

of experimental hydrous melts, and found the

linear relationship between the molar fractions of major-element oxide end- member components (

and

) and

range of basalt to andesite. In particular,

of hydrous melts in the compositional was found to strongly affect

. Their

results suggest that the linear term (first term of the right-hand side of Eq. 5) could satisfactorily model the effects of major-element oxide end- member components on the nonideality of mafic silicate melts. Following Carmichael (2002), we consider the following equation as the basic equation of the non-ideality model developed in the present study: (6) Eq. 6 is a simplified equation obtained by omitting the second term of the right-hand side of Eq. 5. As a result, the maximum number of unknown thermodynamic parameters (

) is

10 in this study. The total number of possible model equations is reduced to 210 with the

Journal Pre-proof

8

simplified equation. As shown in section 4, this simplified equation properly models the thermodynamic relationship between hydrous melt and olivine. Although only SiO 2 and MgO are involved in the chemical reaction considered here (Eq. 1), alkalis (Na2 O and K2 O) also have substantial effects on the thermodynamic properties of mafic silicate melts (e.g., Kushiro, 1975; Carmichael, 2002; Hirschmann et al., 1998; Putirka et al., 2007), meaning that other end-member components (in addition to SiO 2 and MgO) may also need to be considered in modeling the hydrous-melt–olivine equilibrium. Therefore, the importance of the various oxide end- member components in the thermodynamic equation should be quantified when modeling the thermodynamic behavior of silicate melt.

oo f

Consequently, we consider SiO 2 –Al2 O3 –FeO–MgO–CaO–Na2O–K2 O as the major-element oxide end-member components of melt.

To describe dissociated water in silicate melt, we consider two terms in addition to total

pr

water concentration in the melt (e.g., Stolper, 1982; Nowak and Behrens, 1995). Hunt and

e-

Manning (2012) performed thermodynamic modeling of the SiO 2 –H2 O system by considering the molecular H2 O, bridging oxygens, and OH group attached to a silicate

Pr

polymer in silicate melt and reproduced experimentally derived SiO 2 solubility in the aqueous fluid, hydrous melting of quartz, and the critical endpoint between silicate melt and

al

aqueous fluid. Their results based on the SiO 2 –H2 O system suggest that thermodynamic modeling considering the speciation of water may improve the model’s ability to reproduce

rn

the thermodynamic behavior of hydrous silicate melt.

Jo u

The dissociation reaction of water in silicate melt is written as follows (Stolper, 1982): (7)

where H2 Omolecular, O0 , and OH denote the molecular H2 O, bridging oxygens, and OH group attached to a silicate polymer in silicate melt, respectively. The mass balance of the reaction in Eq. 7 is written as ⁄ where

(8)

denotes the concentrations of the species in a silicate melt, and H2 Ototal denotes the

total water concentration of a silicate melt. With a fixed equilibrium constant of the reaction in Eq. 7,

is proportional to

, and

is proportional to

(Stolper, 1982). Bridging oxygens are consumed in the reaction of Eq. 7, meaning that the amount of bridging oxygen decreases in proportion to Consequently, in addition to the linear term (

(Stolper, 1982).

), we consider two terms for H2 O

Journal Pre-proof concentration (

,

9

) to model the dissociation of water.

Consequently, we have

(9) where the linear term (

) represents the total amount of dissolved water in the melt,

the squared term (

) is a proxy thermodynamic term for

with the consumed bridging oxygen, and the square-root term (

) is a proxy

oo f

thermodynamic term for OH (melt). Finally, substituting Eq. 9 into Eqs. 3, 4, and 6, we have (10)

The left-hand side of Eq. 10 can be calculated using experimental results (pressure,

pr

temperature, and melt and olivine compositions) and the non- ideality model of olivine (Sack

e-

and Ghiorso, 1989). Therefore, we can use linear regression to determine W from experimental results (Fig. 1). The maximum number of explanatory variables in the present

Pr

case is 10. On the basis of Eq. 10, various combinations of variables are possible, such as

al

where only the linear term is considered for the

rn

effect of water,

Jo u

where the water-related terms are not considered,

and more sparse equations such as ,

and

. The total number of

such candidate model equations is 1024 (=210 ). We evaluate the various combinations of variables to model the equilibrium of hydrous basaltic melt and olivine using an ML-based model-selection approach. All possible combinations of variables are evaluated, and the optimal combination of end- member components for representing the hydrous-melt–olivine equilibrium is selected.

3.2 Exhaustive variable selection On the basis of the basic configuration (Eq. 9), we implement an ML-based model-selection approach to understand the underlying thermodynamic relationship that explains the data and construct a robust and generalized model of the thermodynamic equilibrium state between

Journal Pre-proof

10

hydrous melt and olivine, in which the most informative set of variables to describe the relationship between melt composition and the molar Gibbs free energy is incorporated. The problem is to find the appropriate combination of variables from the major-element oxide end- member components

,

, and

,

,

,

, and the water-related terms

,

, , and

in Eq. 9. This type of problem can generally be formulated within multiple linear regression, a type of multivariate analysis that is used to construct a multinomial model that relates many

oo f

explanatory variables to the objective variable (e.g., Chatfield and Collins, 1980; Le Maitre, 1982). In the optimization of thermodynamic equilibrium experiments, the multinomial equation can be written as follows:

indicates the objective variable calculated using the experimental results as ;

e-

where

pr

(11)

indicates the number of explanatory terms;

Pr

indicates the explanatory variables, which are the observed composition of melt ( and the squared and square-root terms of water composition, , respectively; and ,

and

indicates each unknown thermodynamic parameter to be

,

,

rn

estimated (i.e.,

al

,

,

,

, and

).

Jo u

In cases where the required explanatory variables have already been specified and where the amount and quality of the observed data are suitable, the unknown parameters can be appropriately estimated, meaning that multiple linear regression can be used to construct a statistical model that has a high predictive capacity (e.g., Chatfield and Collins, 1980). However, in most problems of thermodynamic parameter optimization the parameters are presumably unknown and the variation in the experimental conditions of the observed data (pressure, temperature, and composition) are generally insufficient for properly determining all the thermodynamic parameters that exhibit high predictive capacity in the entire pressure, temperature, and composition range of interest. In the present study, the model equation is unknown, and the P, T, and melt composition of the input experimental data are discrete and heterogeneous. Applying a simple multiple linear regression analysis to such a problem could cause overfitting (e.g., Bishop, 2006); i.e., the model is incorrectly fitted to (noisy) observed data using too many irrelevant and unnecessary variables, which results in an extremely low

Journal Pre-proof

11

predictive capacity of the obtained model when applied to a new dataset. In multiple linear regression analysis, a model that uses irrelevant (redundant) variables can reproduce the input data set used for regression but may not predict unknown data. To construct a model that has high predictive capacity, appropriate combinations of explanatory variables should be determined by removing unnecessary variables. In doing so, an alternative evaluation criterion is required based not only on the reproducibility of observed data (e.g., sum of squared residuals) but also on predictive capacity when applied to new datasets (e.g., Bishop, 2006; Kuwatani et al., 2014). The ES method is the most robust approach for variable selection (Nagata et al., 2015;

oo f

Igarashi et al., 2016). This method automatically extracts the essential variables for describing high-dimensional data and constructs the model with the highest predictive capacity. The ES method has been used in many previous studies in a range of scientific

pr

fields, including psychology (Ichikawa et al., 2014; Nagata et al., 2015), astronomy (Igarashi

e-

et al., 2018), and Earth sciences (Kuwatani et al., 2014; Nakamura et al., 2017). The ES method evaluates the predictive capacity of the constructed model for all combinations of

Pr

explanatory variables. In testing all possible combinations of variables, there are two possible states for each variable: a used state and a discarded state. If this is expanded to all variables,

al

the total number of possible states is 2n , where n is the number of explanatory variables. In this study, we adopted a cross-validation (CV) technique to evaluate the predictive capacity

rn

of the model for unknown data (i.e., the generalization capability of the model). The CV

Jo u

technique can estimate the generalization capability for new datasets using only the available observed dataset. The technique divides the available dataset into two parts: the regression data, which are used to construct the model, and the validation data, which are used to evaluate the generalization capability of the model. By using the ES method with the CV technique, the generalization capability of thermodynamic equations for magmatic equilibria can be objectively evaluated, and the model with the best generalization capability can be identified. We used the ES method for variable selection based on Eq. 10. The maximum number of unknown thermodynamic parameters (

) is 10, meaning that the predictive capacity of the

model for 1024 (= 210 ) combinations of variables was evaluated using the CV technique. In this study, we used the leave-one-out CV technique, whereby M − 1 data (where M = the total number of data in the dataset = 296 in the present study) of the melting experiments were considered as the regression data {

} and

Journal Pre-proof the remaining one case was taken as the validation data

12 . Multivariate linear

regression was applied to the regression data to construct the training models

.

Generalization capability (predictive capacity) was evaluated using the constructed model and the validation data. To obtain the generalization capability, we used the cross- validation error (CVE), defined by the root mean square error which is obtained by repeating the above calculation for all M samples: √

.

(12)

Using this calculation, all possible ways (= M ways) of dividing the original data into the regression data and the validation data were able to be tested, and the root mean squared

oo f

residual between model prediction and the validation data (i.e., CVE) was regarded as the measure of the validity of the model equation (i.e., its generalization capability). Using the

pr

leave-one-out CV technique, we uniquely determined the CVE of each equation. To implement the calculations, we used the R language (R Core Team, 2016), an open-source

e-

programming language for statistical computing. The R source code and the input data are

Pr

provided in Appendix C (Supplementary Materials 1 and 2, respectively).

3.3 Dataset for optimization

al

The dataset for model-selection and parameter optimization comprised previously reported experimental results that were obtained according to the following three criteria:

rn

1. The data describe ultramafic and mafic bulk compositions.

Jo u

2. The system consists of at least the following six major-element oxide components: SiO 2 , Al2 O 3 , Fe2 O 3 or FeO, MgO, CaO, and Na2 O, without CO2 . 3. The system includes olivine and melt ± orthopyroxene ± clinopyroxene ± spinel, without garnet or plagioclase.

Accordingly, 296 individual experimental runs, 72 of which are hydrous experiments, were used for model-selection and optimization during this study. Some of the experimental results were obtained from the Library of Experimental Phase Relations open database of melting experiments (Hirschmann et al., 2008). Anhydrous experiments were obtained from experimental datasets compiled by Ueki and Iwamori (2014). P–T ranges and the total water concentrations of melt are displayed in Fig. 1, and data sources are given in the caption to that figure. The dataset used for the present study is given in Appendix C (Supplementary Material 2). The P–T range of the dataset is 0.8–2.5 GPa and 1050–1600 °C, and the total water concentration of melt ranges from 0 (anhydrous) to 8.96 wt% (Fig. 1). As we focused

Journal Pre-proof

13

on those experiments that were based on mantle systems, almost all experimental melts (294 experimental results) exhibit basaltic to basaltic-andesite compositions. Two experimental melts exhibit andesitic and dacitic compositions. For the water concentration of melt, we used the values reported in each paper; i.e., we did not perform an independent estimation, such as a calculation based on a solubility model (e.g., Newman and Lowenstern, 2002). Fig. 1 shows that the experimental conditions are discrete and heterogeneous. For example, the number of experimental runs decreases with increasing pressure. Both partial- melting and crystallization experiments are included, as are K 2 O-present and K2 O-absent experiments.

oo f

4 Results and discussion

4.1 Model assessment and the relative importance of explanatory variables By investigating the CVE-based ranking of models and comparing the top models, we can

pr

identify informative variables in describing the data, and relationships between variables. Fig.

e-

2 presents the relationship between the CVE (the measure of model generalization capability) and the number of variables used in the equations. The variables used, parameters, and CVEs

Pr

of the best 100 models are shown in Fig. 3 (see Appendix C, Supplementary Material 3, for the full list of models). The variables used, optimized parameters, and CVEs of the top 10

al

model equations are given in Table 1. Residuals obtained by CV analysis (determined as the differences between the values predicted using the constructed model and each validation

) in Fig. B1 of Appendix B. The root mean squared residual of Fig. B1 was regarded as

Jo u

(

in Eq. 12) are plotted against total water concentration in the melt

rn

data; i.e.,

the measure of the validity of the model equation. The non-ideality model with the lowest CVE (i.e., the model with the best generalization capability or optimal model) was obtained using nine variables (SiO 2 , Al2 O3 , FeO, MgO, CaO, Na2 O, K2 O, H2 O, and H2 O2 ) and is written as

. (13) We note that using all 10 variables yields a model with the second-lowest CVE, indicating that using all 10 variables results in overfitting (Fig. 3). Although intuitively it might seem that the most complex model (i.e., the 10-variable model) should yield the highest generalization capability, the use of redundant explanatory variables that explain negligible

Journal Pre-proof

14

variation in the objective variable acts to reduce the generalization capability of models of the hydrous-melt–olivine equilibrium. The ideal-solution model (Eq. 6 = 0) yields the highest CVE; i.e., the worst generalization capability (Fig. 2). SiO 2 , Al2 O3 , FeO, MgO, CaO, and Na2 O are present in all of the top 10 models, and K2 O, in addition to the six major elements, is present in all of the top 5 models (Fig. 3), implying that these seven major elements are particularly informative in describing the phase equilibrium between hydrous basaltic melt and olivine. Also of note, all of the top 100 models used the SiO 2 and MgO terms. FeO, CaO, H2 O, H2 O2 , Al2 O3 , Na2O, H2 O0.5 , and K2O were selected in 79, 74, 65, 64, 63, 63, 53, and 50 of the top 100 models, respectively.

oo f

These results mean that of the major-element end- member components, the SiO 2 , MgO, and FeO concentrations of melt are particularly informative in modeling of the silicate- melt– olivine equilibrium. Although this result might appear to be self-evident, given that we

pr

consider Mg–Fe olivine here, the result should be emphasized because this information was

e-

objectively and automatically defined using the ES method. The CVE values of the top 10 models range from 2.32 to 2.47 kJ, comparable with the experimental uncertainty on the

Pr

enthalpy of fusion for forsterite of ±2.6 kJ (Sugawara, 2005). is the most commonly used water-related term (among in the top 100 models. Conversely,

al

, and

, is

rn

the least commonly adopted of these terms in the top 100 models and is not adopted in the best model. All of the top five models include all seven major elements, and different

Jo u

combinations of water-related terms are present in the top five models. Regarding the parameters of the water-related terms, among the

tends to exhibit the highest absolute values

of the three water-related terms. In contrast,

absolute values of all

exhibits the lowest

(Table 1), which means that of the water-related terms, the square-

root term is the least informative regarding phase equilibrium. Fig. 2 indicates that models using SiO 2 , MgO, FeO, and the water-related terms tend to exhibit better generalization capabilities (i.e., lower CVEs) than those of models not using these terms. Models considering only water-related terms and/or alkaline (Na2 O and K2 O) terms tend to exhibit high CVEs. In addition, SiO 2 -absent models exhibit high CVEs and have poor predictability. The high CVE values of the models using water-related terms exclusively (i.e., models in which

,

, and/or

were used but major oxide end-

components were not) can be interpreted in terms of the input experimental data. Models

Journal Pre-proof

15

considering only the water-related terms would be expected to perform poorly in predicting the results of anhydrous experiments, and the model calculation for the anhydrous condition is the same as the ideal model in this case. In addition, the analytical error on the water concentration might be large compared with those on the major oxide components. Therefore, models using water-related terms exclusively have the lowest predictive capacities of all variable combinations tested here. Generally, concentrations of alkalis (Na2 O and K2 O) in silicate melt have a strong correlation. As the model-selection approach involves highly correlated values, only one variable with less noise tends to be selected from a highly correlated group of variables as a

oo f

significant variable. However, Na2 O and K2 O are present in all of the top five models, meaning that both of these elements have a considerable influence on the thermodynamic equilibrium between silicate melt and olivine (e.g., Kushiro, 1975; Hirschmann et al., 1998).

pr

In contrast, models using exclusively alkaline (Na2 O and K2 O) terms tend to exhibit high

e-

CVEs, and K 2O is the least commonly adopted variable in the top 100 models. Similar to the case of water, both K2 O-present and K2 O-absent experiments were included in the

Pr

experimental dataset. In terms of the thermodynamic equation (Eq. 4), directly contribute to the molar Gibbs free energy of melt through the

term, whereas

al

the concentrations of other oxide end- member components, as well as

the

and

, indirectly . As such,

rn

contribute to the molar Gibbs free energy of melt by the dilution of

and

terms represent the excess contribution of elements to free energy above that term. As we are concerned with ultramafic and mafic systems in

Jo u

contributed by the

this study, only trace amounts of alkaline elements were involved in the experiments, meaning that contributions of their non-ideal excess energy (the

terms) to the total Gibbs

free energy of the system are small in comparison with the contributions made through dilution. In addition, experimental and analytical errors on the trace amounts of concentrations of alkalis might be large compared with those of other major-element concentrations. Consequently, the experimental data on alkalis are relatively noisy, and K2 O and Na2 O are the least and second-least commonly selected major elements in the top 100 models, respectively, and models using exclusively alkaline (Na2 O and K 2O) terms tend to exhibit high CVEs. Our results obtained using the ES method suggest that experimental settings and related errors should be considered when constructing a robust thermodynamic equilibrium model based on the experimental results.

Journal Pre-proof

16

To test reproducibility, anhydrous melt–olivine equilibrium temperatures calculated for a given pressure and melt and olivine compositions using the optimal model are compared with the results of the melting experiments of peridotite at 1–3 GPa of Hirose and Kushiro (1993) in Supplementary Material 4. The results calculated using the optimal model are also compared with those calculated using the olivine–anhydrous melt thermometry of Beattie (1996) in Supplementary Material 4. The experimental results of Hirose and Kushiro (1993) have not been used for parameter calibration and are suitable to test the reproducibility of our optimal model under the anhydrous condition. The olivine–melt thermometry of Beattie (1996) accurately reproduces olivine– anhydrous melt equilibrium temperatures in the

oo f

pressure range of 0.0001 to 15.5 GPa (Putirka, 2007) and is suitable for testing consistency between models for the anhydrous condition. Overall, the equilibrium temperatures derived using the optimal model are consistent with the experimental temperatures and with those

pr

calculated using the thermometry of Beattie (1996). However, the calculation results for 3

e-

GPa exhibit systematically lower temperatures than those of the experimental results and those derived using the approach of Beattie (1996), indicating that the model cannot be

Pr

extrapolated with respect to pressure; the pressure range of our dataset is 0.8–2.5 GPa. To demonstrate the effects of water on liquidus depression, the olivine liquidus depression

al

calculated using the lowest-CVE model (Eq. 13) as a function of the total water concentration of a melt is presented in Fig. 4. The experimental results plotted in this figure (Tatsumi, 1982;

rn

Pichavant et al., 2002; Almeev et al., 2007; Médard and Grove, 2008) are from experiments

Jo u

that were not used for regression and model selection in the present study. As pressure has little influence on the effect of water on liquidus depression (Médard and Grove, 2008), experiments conducted at the pressure range corresponding to the extrapolation of our data set (<0.8 GPa) are also plotted in Fig. 4 for comparison. Liquidus depression lines calculated using the empirical models of Falloon and Danyushevsky (2000), Sugawara (2000), Ariskin and Barmina (2004; “COMAGMAT”), Almeev et al. (2007), and Médard and Grove (2008), and the thermodynamic model pMELTS (Ghiorso et al., 2002), are also plotted for comparison. The liquidus depression was calculated as the difference between experimental wet liquidus and dry liquidus temperatures. The result obtained using a linear model (i.e., calculated using ,

,

,

, and the linear term (

,

,

,

) of the water-related terms) is

also shown in Fig. 4. Calculations were performed using the same melt compositions as those of the plotted experimental results. Representative values of each experiment were used for

Journal Pre-proof

17

pressure and olivine composition in the calculations. For pMELTS and linear- model calculations, the melt composition reported by Pichavant et al. (2002) was used. Although our thermodynamic formulation was a function of melt composition, olivine composition, pressure, and temperature, the liquidus calculations were simplified using fixed olivine composition and pressure to demonstrate the effects of water in our models and to provide a comparison with independent results from previously reported experimental results and models for liquidus depression. Fig. 4 shows that our thermodynamic calculations for the hydrous-melt–olivine equilibrium using the optimal model reproduce the independent experimental results well at all water concentrations; the slope of the liquidus depression

oo f

curve decreases with increasing H2 O content. In contrast, the calculation using only the linear term to represent the effect of water failed to reproduce the non- linear relationship between the total water concentration and olivine liquidus temperature, although it is the fifth-best

pr

model overall (Table 1; Fig. 3). Liquidus temperature decreases monotonically in the result

Pr

4.2 Thermodynamic implications

e-

obtained by the linear model.

Ghiorso et al. (1983) argued that H2 O does not behave like a regular solution component

al

and proposed an additional term to describe the relationship between the total water concentration and the Gibbs free energy of silicate melt. The equation proposed by Ghiorso et

rn

al. (1983) was used for the MELTS (Ghiorso and Sack, 1995; Gualda et al., 2012) and

Jo u

pMELTS (Ghiorso et al., 2002) software programs for the thermodynamic modeling of phase equilibria in magmatic systems. However, MELTS and pMELTS failed to reproduce the relationship between the total water concentration of melt and the liquidus temperature of olivine (Almeev et al., 2007; Médard and Grove, 2008). In addition, some critical factors, such as the dissociation of water in silicate melt (Stolper, 1982) and its interaction with other oxide end- member components in the melt (e.g., Hunt and Manning, 2012), were not considered in the MELTS and pMELTS models. Putirka et al. (2007) argued that the total water concentration of melt affects the

(logarithm of distribution coefficient for Mg

between olivine and melt) for olivine–melt, and therefore those authors parameterized for olivine–melt as a function of temperature, pressure, water, and the Na2 O and K2 O concentrations of melt. Putirka et al. (2007) parameterized the effect of water on

as a

function of the total water concentration of melt. Carmichael (2002, 2004) parameterized the composition-related non- ideal excess energy of hydrous melt as a linear function of

,

Journal Pre-proof , and total alkalines (

18

). Hirschmann et al. (1999, 2009) and Aubaud et al.

(2004) modeled the effect of dissolved water in melt on the peridotite solidus by considering dilution by dissolved water through the

term. In the present study, we adopted a

model-selection method to formulate a model of the non- ideality of hydrous melt. The result produced by the model-selection approach shows that

(linear term) and

(squared term) were retained in the selection process; consequently, we infer that those two terms are important for describing the relationship between the total water concentration and the molar Gibbs free energy of the silicate melt, in addition to the

oo f

term and seven major elements including alkalis (SiO 2 , Al2 O3 , FeO, MgO, CaO, Na2 O, and K2 O).

The optimal model equation selected using the ES approach (Eq. 13) is considered to

pr

describe the underlying melt structure that explains the data; i.e., the relationship between free energy and the microscopic chemical features of hydrous melt. As shown above,

and the amount of bridging

Pr

(squared term) represents

e-

(linear term) represents the effect of the total water concentration, and

oxygen consumed by the dissolution of water. To demonstrate the contribution of the

al

polynomial function to hydrous- melt–olivine equilibrium and to examine the relationship between thermodynamic equilibrium and the microscopic chemical features of hydrous melt,

for olivine–melt was calculated using Eq. 13 as

Jo u

Fig. 5.

values are presented as a function of the total water concentration of melt in

rn

calculated

. The

temperature was varied from 1100 to 1500 °C, the pressure was fixed at 1 GPa, and melt and olivine compositions were fixed as the mean values of the entire experimental dataset. values of the experimental dataset are also plotted in Fig. 5. The experimental

values

exhibit wide variations at lower temperatures (<1200 °C), possibly related to the difficulties in performing low-temperature near-solidus hydrous experiments (e.g., Green et al., 2012). Although some of the experimental between experimental

values exhibit scatter, a parabolic relationship

and the total water concentration of melt is observed in Fig. 5 and

is particularly recognizable at temperatures of 1200–1300 °C. Fig. 5 indicates that our optimal model, which uses both the

and

terms for the effect of

water, captures the parabolic structure of the variation in concentration. Both the experimental and calculated

with changing total water

values decrease with an increasing

Journal Pre-proof

19

total water concentration of melt from 0 to 5 wt%, and then increase with a further increase in the total water concentration to 9 wt%. In contrast, the linear model, in which only of the water-related terms was used, does not reproduce the relationship between and the total water concentration of melt;

decreases monotonically with increasing

total water concentration for the result obtained by the linear model. for anhydrous- melt–olivine depends on the melt composition and, consequently, on the microstructure of melt, and it exhibits a parabolic relationship as a function of NBO/T (nonbridging oxygen per tetrahedrally coordinated cations) (Kushiro and Mysen, 2002; Mibe et al., 2006; Filiberto and Dasgupta, 2011). Kushiro and Mysen (2002) attributed the

oo f

parabolic relationship to the change in the coordination states of Mg in melt associated with depolymerization (Fiske and Stebbins, 1994). Mg2+ in polymerized melts (low NBO/T) is

pr

predominantly four coordinated, and it changes to six coordination with an increase in NBO/T. As a result, Mg in polymerized melts tends to enter olivine in which Mg is s ix

e-

coordinated, and with an increase in six-coordinated Mg in the melt, more Mg enters the melt (Kushiro and Mysen, 2002; Mysen and Dubinsky, 2004; Mysen and Shang, 2005). With

Pr

respect to hydrous melt, Putirka et al. (2007) showed that

is sensitive to the water

concentration of melt. Pu et al. (2017) attributed this dependence of

to the coordination

al

environments of Mg. Other studies have suggested that the reaction between Mg and OH in

rn

melt could affect the partitioning behavior of Mg and mafic minerals (Crabtree and Lange, 2011; Frey and Lange, 2011; Waters and Lange, 2013). NBO/T changes systematically with

Jo u

the total water concentration of melt (e.g., Mysen, 2014). Therefore, the proportions of the different coordination states of Mg could be a function of the total water concentration of melt, and, consequently, melt. The minimum

could also be a function of the total water concentration of

value of mantle peridotite partial melt is located at NBO/T > 2.5

(Kushiro and Mysen, 2002; Mibe et al., 2006), which is more depolymerized than NBO/T values for anhydrous-based compositions of the input data of this study (0.15 to 1.88, calculated according to Jaeger and Drake, 2000). As dissolved water increases NBO/T, the actual NBO/T values of hydrous melts in the input data could be higher than the anhydrousbased calculated values. The minimum

value at ~5 wt% H2 O (Fig. 5) might represent

the NBO/T value corresponding to the maximum proportion of six-coordinated Mg in hydrous melt. As explained in Section 3.1, the amount of consumed oxygen associated with dissociation of water in melt is proportional to

(Stolper, 1982). Accordingly,

Journal Pre-proof we infer that

20

represents the structural change of the silicate melt associated

with the dissociation of water. The relationship between

, the total water concentration, and the microscopic structure

of melt could be further investigated with a detailed experimental analysis of the microscopic structures of Mg-bearing hydrous melts. Furthermore, first-principles calculations (e.g., De Koker and Stixrude, 2009; Taniuchi and Tsuchiya, 2018) may provide quantitative understandings of the detailed relationship between, and the mechanism connecting, thermodynamic properties and microscopic structures, potentially including the water

oo f

speciation, coordination environment, and NBO/T of hydrous silicate melt.

4.3 Machine learning for petrological thermodynamic modeling

pr

We constructed a thermodynamic model based on the selection of important variables by using data for melting experiments. This type of data-driven approach is becoming

e-

increasingly popular in various fields of natural science (Hey et al., 2009; Igarashi et al., 2016; Ueki et al., 2018). Specifically, we used the ES method to select the important

Pr

variables and the CV technique to quantitatively validate the predictive capacity of various equations. On the basis of the predictive capacities of the model equations, we were able to

al

obtain a generalized and robust equation for the hydrous-melt–olivine equilibrium, as well as

rn

to capture the underlying thermodynamic relationship, such as the importance of variables, the most informative combination of variables (Figs. 2 and 3), and the reproducibility of

Jo u

observed data (Appendix B). The proposed method for model-selection using heterogeneous input data could be applied to other petrological and geochemical problems, such as the modeling of metamorphic reactions and of high-pressure experimental data corresponding to lower-mantle conditions.

We simplified the thermodynamic equation and the model settings, and we will expand our modeling approach in the future for application to multi-component and more complex natural magmatic systems. Although we considered a maximum of 10 possible terms, the number could be increased, and the use of non- linear terms could also be further explored. For example, if we consider the full equation of the symmetric regular solution (Eq. 5), 245 combinations of variables must be evaluated because the maximum number of unknown thermodynamic parameters (W) is 45. The maximum number of unknown thermodynamic parameters increases as more complex equations (e.g., asymmetric regular solutions, or temperature- or pressure-dependent non-ideality) are introduced. Furthermore, other aspects,

Journal Pre-proof

21

such as the influence of magma oxygen fugacity (e.g., Moore et al., 1995; Toplis and Carroll, 1995), and of trace components in olivine (e.g., Ca; Libourel, 1999; Aubaud et al., 2004; Gavrilenko et al., 2016), could be important for application to natural magmatic systems. In such cases, the potentially large number of combinations of variables and the size of the input dataset will need to be considered when employing the ES method (i.e., the problem of combinatorial explosion). For such a case, an alternative would be to use a different statistical treatment, such as ℓ 1-norm constraints (e.g., Tibshirani, 1996) or Akaike’s Information Criterion (e.g., Akaike, 1974), to obtain optimal model equations.

oo f

As we used experiments based exclusively on mantle conditions, the investigated compositional range was limited to mafic primary magma compositions. Evolved melt compositions corresponding to natural arc magmas should be incorporated in future modeling

pr

by expanding the compositional range of the input dataset. Also, our present model considers only the forsterite (Mg2 SiO4 ) component in olivine, meaning that we have fewer equations to

e-

constrain genetic or equilibrium conditions based on the composition of natural basaltic

Pr

samples using Eq. 13. By expanding our present model to include the fayalite (Fe2 SiO4 ) endmember component in olivine, we could use the Fe/Mg exchange reaction to constrain the temperature and melt water concentration simultaneously from the magma and olivine

al

compositions. Our results indicate that the ML-based model-selection approach is a robust

rn

and useful approach for constructing a thermodynamic petrological model. The model construction can be expanded to include multiple phases, and could be incorporated into a

Jo u

total energy minimization and bulk conservation algorithm for thermodynamic calculation of multi-component systems (Ueki and Iwamori, 2013, 2014) to model multi-component and multi-phase natural magmatic systems.

5 Summary We presented a novel method and outcome for thermodynamic modeling of the hydrous basaltic melt–olivine equilibrium. Specifically, we applied a machine-learning-based modelselection approach, the exhaustive variable selection method, to formulate a robust and generalized thermodynamic model that describes the underlying thermodynamic relationship of the hydrous basaltic melt–olivine equilibrium. A linear term and a squared term of the total water concentration of melt (

and

) were found to be significant

explanatory variables in the optimal model of the hydrous- melt–olivine equilibrium. This relationship may be attributed in physical terms to the microstructural changes that occur as a

Journal Pre-proof

22

result of the dissociation of water in silicate melt. Calculations using the constructed optimal model reproduced the experimentally determined effects of water on the liquidus temperature of olivine and the distribution coefficient for Mg between olivine and melt. Our results indicate that ML is a useful approach for constructing a petrological numerical model and for analyzing experimental results characterized by high-dimensional and complex data, from which we are able to extract fundamental information, in the present case regarding hydrousmelt–olivine thermodynamics.

oo f

Acknowledgments

We thank Keith Putirka for a constructive and detailed review, and Kei Hirose for careful editorial handling of the manuscript. Maurizio Petrelli and an anonymous reviewer are

pr

thanked for useful and detailed comments on an earlier version of the manuscript. We thank

e-

Taku Tsuchiya, Ryuichi Nomura, and Atsushi Yasumoto for insightful discussions. We gratefully acknowledge grants from The Joint Usage/Research Center programs of the

Pr

Earthquake Research Institute, University of Tokyo, Japan, 2015-B-04 (Geochemical Data Analysis Using Machine Learning) and 2018-B-01 (Data-driven Geoscience: Application to

al

Dynamics in Mobile Belts). K.U. was supported by Japan Society for the Promotion of Science KAKENHI Grant Numbers JP15H05833, JP17H02063, and JP19K04026, and T.K.

rn

was supported by Japan Society for the Promotion of Science KAKENHI Grant numbers

Jo u

JP15K20864 and JP25120005 and by Japan Science and Technology Agency PRESTO Grant Number JPMJPR1676. S.A. and T.K. were supported by Japan Science and Technology Agency CREST Grant No. JPMJCR1761. Some of the experimental data used for the modelselection and parameter optimization were obtained from the Library of Experimental Phase Relations open database of melting experiments (Hirschmann et al., 2008). The R source code and dataset used in this study are provided in Appendix C.

Appendix A: Molar Gibbs free energy of pure solid and melt end-member components Following the formulation by Ueki and Iwamori (2013) and Putirka (2017), the Gibbs free energy of the melt–olivine (ol) reaction is written as * (A1)

+

Journal Pre-proof where

, and

23

are the temperature, enthalpy, and entropy of the fusion of solid

end- member component j at 1 bar, respectively.

denotes the reaction quotient (e.g., Putirka,

2017). When an equilibrium state is achieved (

),

the activity coefficient of end-member component i.

is equal to

, where

is

denotes the difference in molar

Gibbs free energy between the melt and the corresponding solid end- member components. is the difference in molar specific heat between the solid end- member component j and the melt end- member component i, and is written as

.

is the

difference in molar volume between solid end- member component j and melt end- member .

oo f

component i, written as

Although the P–V relation of silicate melt exhibits a non- linear relationship (e.g., Ueki and Iwamori, 2016),

values during melting within the mantle can be assumed to be constant

pr

at the pressures of interest in this study, regardless of temperature (Ueki and Iwamori, 2014).

e-

In addition, the total volume and compressibility of silicate melt can be treated as linear combinations of these parameters of oxide end- member components (Lange and Carmichael, was simply

Pr

1987, 1990) and the H2 O end- member component of the melt. Therefore,

derived by a linear regression between the partial molar volumes of silicate melt and olivine using the equation of state for hydrous melt (Ueki and Iwamori, 2016) and that for olivine used in this study is a function of pressure [P (bar)], as follows:

rn

al

(Berman, 1988).

As noted in our previous study (Ueki and Iwamori, 2014), only a weak pressure dependence ) is obtained.

Jo u

(

Previous experimental studies have shown that excess specific heat related to the mixing of end- member components, including water, in silicate melt is negligible (e.g., Bouhifd et al., 2006; Di Genova et al., 2014). Ueki and Iwamori (2013, 2014) reported that the value of during melting can be assumed to be constant at the temperatures and pressures of interest in the present study. Values of experimental results, with

,

,

, and

= 2174 K (Sugawara, 2005),

and Bottinga, 1985; Gillet et al., 1991; Médard and Grove, 2008), (Sugawara, 2005), and Ghiorso (1989) was used for

were taken from previous = 100 (J/mol-K) (Richet = 138.90 (kJ/mol)

= 63.89 (J/K-mol) (Sugawara, 2005). The model of Sack and .

To summarize the temperature- and pressure-dependent and melt-composition- and olivine-composition- independent terms, we introduced a term

as

Journal Pre-proof *

+

. (A2)

, Eq. A1 can be rearranged into Eq. 2 of the main text.

Jo u

rn

al

Pr

e-

pr

oo f

By introducing the term

24

Journal Pre-proof

25

al

Pr

e-

pr

oo f

Appendix B: Residuals vs total water concentration (mole fraction) of the top 10 models

validation data; i.e.,

rn

Fig. B1. Residuals (difference between values predicted using the constructed model and the in Eq. 12) obtained during CV analysis plotted against

Jo u

the total water concentration in melt (

). Residuals were calculated using Eq. 10

(difference between the right- and left- hand sides of Eq. 10). Each circle represents an independent experimental result.

Conflict of Interest: The authors declare that they have no conflicts of interest.

Journal Pre-proof

26

References Akaike H., 1974. A new look at the statistical model identification. IEEE Autom. Control 19, 716–723. Almeev R., Holtz F., Koepke J., Parat F., Botcharnikov R.E., 2007. The effect of H2 O on olivine crystallization in MORB: experimental calibration at 200 MPa. Am. Mineral. 92, 670–674. Ariskin A.A., Barmina G.S., 2004. COMAGMAT: development of a magma crystallization model and its petrological applications. Geochem. Int. 42, S1. Aubaud C., Hauri E.H., Hirschmann M.M., 2004. Hydrogen partition coefficients between

oo f

nominally anhydrous minerals and basaltic melts. Geophys. Res. Lett. 31, L20611. https://doi.org/10.1029/2004GL021341.

Beattie, P., 1993. Olivine–melt and orthopyroxene–melt equilibria. Contrib. Mineral. Petrol.

pr

115, 103–111.

e-

Berman R.G., 1988. Internally consistent thermodynamic data for minerals in the system Na2 O–K2 O–CaO–MgO–FeO–Fe2 O 3–Al2 O 3 –SiO2 –TiO2 –H2 O–CO2 , J. Petrol. 29, 445–522.

Pr

Bishop C.M., 2006. Pattern recognition and machine learning. Springer, New York. Bouhifd M.A., Whittington A., Roux J., Richet P., 2006. Effect of water on the heat capacity

722.

al

of polymerized aluminosilicate glasses and melts. Geochim. Cosmochim. Acta 70, 711–

rn

Carmichael I.S.E., 2002. The andesite aqueduct: perspectives on the evolution of

143, 641–663.

Jo u

intermediate magmatism in west-central (105–99 W) Mexico. Contrib. Mineral. Petrol.

Carmichael I.S.E., 2004. The activity of silica, water, and the equilibration of intermediate and silicic magmas. Am. Mineral. 89, 1438–1446. Chatfield C., Collins A.J., 1980. Introduction to multivariate analysis. Chapman and Hall, London. Crabtree S.M., Lange R.A., 2010. Complex phenocryst textures and zoning patterns in andesites and dacites: evidence of degassing- induced rapid crystallization? J. Petrol. 52, 3–38. De Koker N., Stixrude L., 2009. Self-consistent thermodynamic description of silicate liquids, with application to shock melting of MgO periclase and MgSiO 3 perovskite. Geophys. J. Int. 178, 162–179. Di Genova D., Romano C., Giordano D., Alletti M., 2014. Heat capacity, configurational heat capacity and fragility of hydrous magmas. Geochim. Cosmochim. Acta 142, 314–333.

Journal Pre-proof

27

Falloon T.J., Danyushevsky L.V., 2000. Melting of refractory mantle at 1.5, 2, and 2.5 GPa under anhydrous and H2 O-undersaturated conditions: implications for the petrogenesis of high-Ca boninites and the influence of subduction components on mantle melting. J. Petrol. 41, 257–258. Falloon T.J., Danyushevsky L.V., Green D.H., 2001. Peridotite melting at 1 GPa: reversal experiments on partial melt compositions produced by peridotite–basalt sandwich experiments. J. Petrol. 42, 2363–2390. Falloon T.J., Green D.H., Danyushevsky L.V., Faul U.H., 1999. Peridotite melting at 1.0 and 1.5 GPa: an experimental evaluation of techniques using diamond aggregates and mineral

oo f

mixes for determination of near-solidus melts. J. Petrol. 40, 1343–1375. Falloon T.J., Green D.H., O’Neill H.S.C., Hibberson W.O., 1997. Experimental tests of low degree peridotite partial melt compositions: implications for the nature of anhydrous near-

pr

solidus peridotite melts at 1 GPa. Earth Planet. Sci. Lett. 152, 149–162.

e-

Filiberto J., Dasgupta R., 2011. Fe2+–Mg partitioning between olivine and basaltic melts: applications to genesis of olivine–phyric shergottites and conditions of melting in the

Pr

Martian interior. Earth Planet. Sci. Lett. 304, 527–537. Fiske P.S., Stebbins J.F., 1994. The structural role of Mg in silicate liquids: a high25

Mg, 23 Na, and

29

Si NMR study. Am. Mineral. 79, 848–861.

al

temperature

Frey H.M., Lange R.A., 2011. Phenocryst complexity in andesites and dacites from the

rn

Tequila volcanic field, Mexico: resolving the effects of degassing vs. magma mixing.

Jo u

Contrib. Mineral. Petrol. 162, 415–445. Gaetani GA., Grove T.L., 1998. The influence of water on melting of mantle peridotite. Contrib. Mineral. Petrol. 131, 323–346. Gavrilenko M., Herzberg C., Vidito C., Carr M.J., Tenner T., Ozerov A., 2016. A calcium- inolivine geohygrometer and its application to subduction zone magmatism. J. Petrol. 57, 1811–1832. Ghiorso M.S., Carmichael I.S.E., Rivers I., Sack R.O., 1983. The Gibbs free energy of mixing of natural silicate liquids: an expanded regular solution approximation for the calculation of magmatic intensive variables. Contrib. Mineral. Petrol. 84, 107–145. Ghiorso M.S., Hirschmann M.M., Reiners P.W., Kress V.C., 2002. The pMELTS: a revision of MELTS for improved calculation of phase relations and major element partitioning related to partial melting of the mantle to 3 GPa. Geochem. Geophys. Geosys. 3, 1030. https://doi.org/10.1029/2001GC00217.

Journal Pre-proof

28

Ghiorso M.S., Sack R.O., 1995. Chemical mass transfer in magmatic processes, IV. A revised and internally consistent thermodynamic model for the interpolation and extrapolation of liquid–solid equilibria in magmatic systems at elevated temperatures and pressures. Contrib. Mineral. Petrol. 119, 197–212. Gillet P., Richet P., Guyot F., Fiquet G., 1991. High- temperature thermodynamic properties of forsterite. J. Geophys. Res. 96, 11805–11816. Green D.H., 1973. Experimental melting studies on a model upper mantle composition at high pressure under water-saturated and water-undersaturated conditions. Earth Planet. Sci. Lett. 19, 37–53.

oo f

Green D.H., Hibberson W.O., Kovács I., Rosenthal A., 2010. Water and its influence on the lithosphere–asthenosphere boundary. Nature 467, 448–451.

Green D.H., Hibberson W.O., Rosenthal A., Kovács I., Yaxley G.M., Falloon T.J., Brink F.,

e-

the upper mantle. J. Petrol. 55, 2067–2096.

pr

2014. Experimental study of the influence of water on melting and phase assemblages in Green D.H., Rosenthal A., Kovács I., 2012. Comment on “The beginnings of hydrous mantle

Pr

wedge melting”, CB Till., TL Grove., AC Withers. Contrib. Mineral. Petrol. 164, 1077– 1081.

2003.

Fractional

al

Grove T.L., Elkins-Tanton L.T., Parman S.W., Chatterjee N., Müntener O., Gaetani G.A., crystallization

and

mantle- melting

controls

on

calc-alkaline

rn

differentiation trends. Contrib. Mineral. Petrol. 145, 515–533.

Jo u

Gualda G.A., Ghiorso M.S., Lemons R.V., Carley T.L., 2012. Rhyolite-MELTS: a modified calibration of MELTS optimized for silica-rich, fluid-bearing magmatic systems. J. Petrol. 53, 875–890.

Hey T., Tansley S., Tolle K.M., 2009. The fourth paradigm: data- intensive scientific discovery (Vol. 1). Microsoft Research, Redmond, WA. Hirose K., 1997. Melting experiments on lherzolite KLB-1 under hydrous conditions and generation of high-magnesian andesitic melts. Geology 25, 42–44. Hirose K., Kawamoto T., 1995. Hydrous partial melting of lherzolite at 1 GPa: the effect of H2 O on the genesis of basaltic magmas. Earth Planet. Sci. Lett. 133, 463–473. Hirose, K., Kushiro, I., 1993. Partial melting of dry peridotites at high pressures: determination of compositions of melts segregated from peridotite using aggregates of diamond. Earth Planet. Sci. Lett. 114, 477–489. Hirschmann M.M., Asimow P.D., Ghiorso M.S., Stolper E.M., 1999. Calculation of peridotite partial melting from thermodynamic models of minerals and melts. III. Controls

Journal Pre-proof

29

on isobaric melt production and the effect of water on melt production. J. Petrol. 40, 831– 851. Hirschmann M.M., Baker M.B., Stolper E.M., 1998. The effect of alkalis on the silica content of mantle-derived melts. Geochim. Cosmochim. Acta 62, 883–902. Hirschmann M.M., Ghiorso M.S., Davis F.A., Gordon S.M., Mukherjee S., Grove T.L., Krawczynski M., Médard E., Till C.B., 2008. Library of Experimental Phase Relations (LEPR): a database and web portal for experimental magmatic phase equilibria data. Geochem. Geophys. Geosys. 9, Q03011. https://doi.org/10.1029/2007GC001894. Hirschmann M.M., Tenner T., Aubaud C., Withers A.C., 2009. Dehydration melting of

oo f

nominally anhydrous mantle: the primacy of partitioning. Phys. Earth Planet. Inter. 176, 54–68.

Hunt J.D., Manning C.E., 2012. A thermodynamic model for the system SiO 2 –H2 O near the

pr

upper critical end point based on quartz solubility experiments at 500–1100 °C and 5–20

e-

kbar. Geochim. Cosmochim. Acta 86, 196–213.

Ichikawa H., Kitazono J., Nagata K., Manda A., Shimamura K., Sakuta R., Okada M.,

Pr

Yamaguchi M.K., Kanazawa S., Kakigi R., 2014. Novel method to classify hemodynamic response obtained using multi-channel fNIRS measurements into two groups: exploring

al

the combinations of channels. Front. Hum. Neurosci., 8, 480. Igarashi Y., Nagata K., Kuwatani T., Omori T., Nakanishi-Ohno Y., Okada M., 2016. Three

rn

levels of data-driven science. J. Phys. 699, 012001.

Jo u

Igarashi Y., Takenaka H., Nakanishi-Ohno Y., Uemura M., Ikeda S., Okada M 2018. Exhaustive search for sparse variable selection in linear regression. J. Phys. Soc. Jpn. 87, 044802.

Iwamori H., Albaréde F., 2008. Decoupled isotopic record of ridge and subduction zone processes in oceanic basalts by independent component analysis. Geochem. Geophys. Geosys. 9, Q04033. https://doi.org/10.1029/2007GC001753. Jaeger, W.L., Drake, M.J., 2000. Metal–silicate partitioning of Co, Ga, and W: dependence on silicate melt composition. Geochim. Cosmochim. Acta 64, 3887–3895. Jennings E.S., Holland T.J.B., 2015. A simple thermodynamic model for melting of peridotite in the system NCFMASOCr. J. Petrol. 56, 869–892. Karki B.B., Bhattarai D., Mookherjee M., Stixrude L., 2010. Visualization-based analysis of structural and dynamical properties of simulated hydrous silicate melt. Phys. Chem. Miner. 37, 103–117.

Journal Pre-proof

30

Karki B.B., Stixrude L., 2010. First-principles study of enhancement of transport properties of

silica

melt

by

water.

Phys.

Rev.

Lett.

104,

215901.

https://doi.org/10.1103/PhysRevLett.104.215901. Kawamoto T., 1996. Experimental constraints on differentiation and H2 O abundance of calcalkaline magmas. Earth Planet. Sci. Lett. 14, 577–589. Kinzler R.J., 1997. Melting of mantle peridotite at pressures approaching the spinel to garnet transition: application to mid-ocean ridge basalt petrogenesis. J. Geophys. Res. 102, 853– 874.

and methods. J. Geophys. Res. 97, 6885–6906.

oo f

Kinzler R.J., Grove T.L., 1992. Primary magmas of mid-ocean ridge basalts: 1. Experiments

Kogiso T., Hirose K., Takahashi E., 1998. Melting experiments on homogeneous mixtures of peridotite and basalt: application to the genesis of ocean island basalts. Earth Planet. Sci.

pr

Lett. 162, 45–61.

pressures. Am. J. Sci. 267, 269–294.

e-

Kushiro I., 1969. The system forsterite–diopside–silica with and without water at high

Pr

Kushiro I., 1975. On the nature of silicate melt and its significance in magma genesis: regularities in the shift of the liquidus boundaries involving olivine, pyroxene, and silica

al

minerals. Am. J. Sci. 275, 411–431.

Kushiro I., Mysen B.O., 2002. A possible effect of melt structure on the Mg–Fe2+ partitioning

rn

between olivine and melt. Geochim. Cosmochim. Acta 66, 2267–2272.

Jo u

Kushiro I., Syono Y., Akimoto S., 1968. Melting of a peridotite nodule at high pressures and high water pressures. J. Geophys. Res. 73, 6023–6029. Kuwatani T., Nagata K., Okada M., Watanabe T., Ogawa Y., Komai T., Tsuchiya N., 2014. Machine- learning techniques for geochemical discrimination of 2011 Tohoku tsunami deposits. Sci. Rep. 4, 7077. https://doi.org/10.1038/srep07077. Lange R.A., Carmichael I.S.E., 1987. Densities of Na2 O–K2 O–CaO–MgO–FeO–Fe2O3 – Al2 O 3 –TiO 2 –SiO 2 liquids: new measurements and derived partial molar properties. Geochim. Cosmochim. Acta 51, 2931–2946. Lange R.L., Carmichael I.S.E., 1990. Thermodynamic properties of silicate liquids with emphasis on density, thermal expansion and compressibility. Rev. Mineral. 24, 25–64. Langmuir C.H., Bender J.F., Bence A.E., Hanson G.N., Taylor S.R., 1977. Petrogenesis of basalts from the FAMOUS area: mid-Atlantic Ridge. Earth Planet. Sci. Lett. 36, 133–156.

Journal Pre-proof

31

Laporte D., Toplis M.J., Seyler M., Devidal J.L., 2004. A new experimental technique for extracting liquids from peridotite at very low degrees of melting: application to partial melting of depleted peridotite. Contrib. Mineral. Petrol. 146, 463–484. Le Maitre R.W., 1982. Numerical petrology. Statistical interpretation of geochemical data. Developments in Petrology, vol. 8. Elsevier, Amsterdam. Libourel G., 1999. Systematics of calcium partitioning between olivine and silicate melt: implications for melt structure and calcium content of magmatic olivines. Contrib. Mineral. Petrol. 136, 63–80. Longhi J., 2002. Some phase equilibrium systematics of lherzolite melting: I. Geochem.

oo f

Geophys. Geosys. 3(3), 1–33. https://doi.org/10.1029/2001GC000204. McDade P., Blundy J.D., Wood B.J., 2003. Trace element partitioning on the Tinaquillo lherzolite solidus at 1.5 GPa. Phys. Earth Planet. Inter. 139, 129–147.

pr

Médard E., Grove T.L., 2008. The effect of H2 O on the olivine liquidus of basaltic melts:

e-

experiments and thermodynamic models. Contrib. Mineral. Petrol. 155, 417–432. Mibe K., Fujii T., Yasuda A., Ono S., 2006. Mg–Fe partitioning between olivine and

Pr

ultramafic melts at high pressures. Geochim. Cosmochim. Acta 70, 757–766. Mitchell A.L., Grove T.L., 2015. Melting the hydrous, subarc mantle: the origin of primitive

al

andesites. Contrib. Mineral. Petrol. 170, 13.

Mookherjee M., Stixrude L., Karki B., 2008. Hydrous silicate melt at high pressure. Nature

rn

452, 983–986.

Jo u

Moore G., Righter K., Carmichael I.S.E., 1995. The effect of dissolved water on the oxidation state of iron in natural silicate liquids. Contrib. Mineral. Petrol. 120, 170–179. Mysen B.O., 2007. The solution behavior of H2 O in peralkaline aluminosilicate melts at high pressure with implications for properties of hydrous melts. Geochim. Cosmochim. Acta 71, 1820–1834.

Mysen B.O, 2014. Water–melt interaction in hydrous magmatic systems at high temperature and pressure. Prog. Earth Planet. Sci. 1, 1–18. Mysen B.O., Boettcher A.L., 1975. Melting of a hydrous mantle: I. Phase relations of natural peridotite at high pressures and temperatures with controlled activities of water, carbon dioxide, and hydrogen. J. Petrol. 16, 520–548. Mysen B.O., Boettcher A.L., 1975. Melting of a hydrous mantle: II. Geochemistry of crystals and liquids formed by anatexis of mantle peridotite at high pressures and high temperatures as a function of controlled activities of water, hydrogen, and carbon dioxide. J. Petrol. 16, 549–593.

Journal Pre-proof

32

Mysen B.O., Dubinsky E.V., 2004. Melt structural control on olivine/melt element partitioning of Ca and Mn. Geochim. Cosmochim. Acta 68, 1617–1633. Mysen B.O., Shang J., 2005. Evidence from olivine/melt element partitioning that nonbridging oxygen in silicate melts are not equivalent. Geochim. Cosmochim. Acta 69, 2861–2875. Nagata K., Kitazono J., Nakajima S., Eifuku S., Tamura R., Okada M., 2015. An exhaustive search and stability of sparse estimation for feature selection problem. IPSJ. Trans. 8, 25– 32. Nakamura K., Yasutaka T., Kuwatani T., Komai T., 2017. Development of a predictive

oo f

model for lead, cadmium and fluorine soil–water partition coefficients using sparse multiple linear regression analysis. Chemosph. 186, 501–509.

Newman S., Lowenstern J.B., 2002. VolatileCalc: a silicate melt–H2 O–CO 2 solution model

pr

written in Visual Basic for Excel. Comput. Geosci. 28, 597–604.

e-

Nowak M., Behrens H., 1995. The speciation of water in haplogranitic glasses and melts determined by in situ near-infrared spectroscopy. Geochim. Cosmochim. Acta 59, 3445–

Pr

3450.

Okamoto A., Toriumi M., 2004. Optimal mixing properties of calcic and subcalcic

al

amphiboles: application of Gibbs’ method to the Sanbagawa schists, SW Japan. Contrib. Mineral. Petrol. 146, 529–545.

and

predictive

modeling.

J.

Geophys.

Res.

109,

B02201.

Jo u

data

rn

Parman S.W., Grove T.L., 2004. Harzburgite melting with and without H2 O: experimental

https://doi.org/10.1029/2003JB002566. Petrelli M., Perugini D., 2016. Solving petrological problems through machine learning: the study case of tectonic discrimination using geochemical and isotopic data. Contrib. Mineral. Petrol. 171, 81.

Pichavant M., Mysen B.O., Macdonald R., 2002. Source and H2 O content of high-MgO magmas in island arc settings: an experimental study of a primitive calc-alkaline basalt from St. Vincent, Lesser Antilles arc. Geochim. Cosmochim. Acta 66, 2193–2209. Pickering-Witter J., Johnston A.D., 2000. The effects of variable bulk composition on the melting systematics of fertile peridotitic assemblages. Contrib. Mineral. Petrol. 140, 190– 211. Pu X., Lange R.A., Moore G., 2017. A comparison of olivine–melt thermometers based on D Mg and D Ni: the effects of melt composition, temperature, and pressure with applications to MORBs and hydrous arc basalts. Am. Mineral. 102, 750–765.

Journal Pre-proof

33

Putirka K., 2017. Geothermometry and Geobarometry. In: White W. (ed.) Encyclopedia of Geochemistry. Encyclopedia of Earth Sciences Series. Springer, Cham. Putirka K.D., Perfit M., Ryerson F.J., Jackson M.G., 2007. Ambient and excess mantle temperatures, olivine thermometry, and active vs. passive upwelling. Chem. Geol. 241, 177–206. R Core Team, 2016. R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna. Richet P., Bottinga Y., 1985. Heat capacity of aluminum- free liquid silicates. Geochim. Cosmochim. Acta 49, 471–486.

oo f

Robinson J.A.C., Wood B.J., Blundy J.D., 1998. The beginning of melting of fertile and depleted peridotite at 1.5 GPa. Earth Planet. Sci. Lett. 155, 97–111. Sack R.O., Ghiorso M.S., 1989. Importance of considerations of mixing properties in

pr

establishing an internally consistent thermodynamic database: thermochemistry of

e-

minerals in the system Mg2 SiO 4 –Fe2 SiO4 –SiO 2 . Contrib. Mineral. Petrol. 102, 41–68. Schwab B.E., Johnston A.D., 2001. Melting systematics of modally variable, compositionally

Pr

intermediate peridotites and the effects of mineral fertility. J. Petrol. 42, 1789–1811. Sisson T.W., Grove T.L., 1993. Experimental investigations of the role of H2 O in calc143–166.

al

alkaline differentiation and subduction zone magmatism. Contrib. Mineral. Petrol. 113,

Jo u

2609–2620.

rn

Stolper E., 1982. The speciation of water in silicate melts. Geochim. Cosmochim. Acta 46,

Sugawara T., 2000. Empirical relationships between temperature, pressure, and MgO content in olivine and pyroxene saturated liquid. J. Geophys. Res. 105, 8457–8472. Sugawara T., 2005. Thermodynamic properties of magmatic liquid: review and future directions. Bull. Volcanol. Soc. Jpn. 50, 103–142. Takahashi E., 1980. Melting relations of an alkali-olivine basalt to 30 kbar, and their bearing on the origin of alkali basalt magmas. Carn. Inst. Wash. Year Book 79, 271–276. Taniuchi T., Tsuchiya T., 2018. The melting points of MgO up to 4 TPa predicted based on ab initio thermodynamic integration molecular dynamics. J. Phys. Cond. Matter 30, 114003. Tatsumi Y., 1982. Origin of high- magnesian andesites in the Setouchi volcanic belt, southwest Japan, II. Melting phase relations at high pressures. Earth Planet. Sci. Lett. 60, 305–317.

Journal Pre-proof

34

Tatsumi Y., 1989. Migration of fluid phases and genesis of basalt magmas in subduction zones. J. Geophys. Res. 94, 4697–4707 Tibshirani R., 1996. Regression shrinkage and selection via the Lasso. J. Roy. Stat. Soc. B 58, 267–288. Toplis M.J., Carroll M.R., 1995. An experimental study of the influence of oxygen fugacity on Fe–Ti oxide stability, phase relations, and mineral–melt equilibria in ferro-basaltic systems. J. Petrol. 36, 1137–1170. Ueki K., Hino H., Kuwatani T., 2018. Geochemical discrimination and characteristics of magmatic tectonic settings: a machine- learning-based approach. Geochem. Geophys.

oo f

Geosys. 19(4), 1327–1347. https://doi.org/10. 1029/2017GC007401. Ueki K., Iwamori H., 2013. Thermodynamic model for partial melting of peridotite by system

energy

minimization.

Geochem.

Geosys.

14,

342–366.

pr

https://doi.org/10.1029/2012GC004143.

Geophys.

relations

of

spinel

lherzolite.

e-

Ueki K., Iwamori H., 2014. Thermodynamic calculations of the polybaric melting phase Geochem.

Pr

https://doi.org/10.1002/2014GC005546.

Geophys.

Geosys.

15,

5015–5033.

Ueki K., Iwamori H., 2016. Density and seismic velocity of hydrous melts at crustal and mantle

conditions.

Geochem.

Geophys.

Geosys.

17,

1799–1814.

al

upper

http://dx.doi.org/10.1002/2015GC006242.

rn

Ueki K., Iwamori H., 2017. Geochemical differentiation processes for arc magma of the

Jo u

Sengan volcanic cluster, Northeastern Japan, constrained from principal component analysis. Lithos 290–291, 60–75. Wagner T.P., Grove T.L. 1998. Melt/harzburgite reaction in the petrogenesis of tholeiitic magma from Kilauea volcano, Hawaii. Contrib. Mineral. Petrol. 131, 1–12. Walter M.J., Cottrell E., 2013. Assessing uncertainty in geochemical models for core formation in Earth. Earth Planet. Sci. Lett. 365, 165–176. Wasserburg G.J., 1957. The effects of H2 O in silicate systems. J. Geol. 65, 15–23. Wasylenki L.E., Baker M.B., Kent A.J.R., Stolper E.M., 2003. Near-solidus melting of the shallow upper mantle: partial melting experiments on depleted peridotite. J. Petrol. 44, 1163–1191. Waters L.E., Lange R.A., 2013. Crystal-poor, multiply saturated rhyolites (obsidians) from the Cascade and Mexican arcs: evidence of degassing- induced crystallization of phenocrysts. Contrib. Mineral. Petrol. 166, 731–754.

Journal Pre-proof

35

Yamada A., Inoue T., Urakawa S., Funakoshi K., Funamori N., Kikegawa T., Irifune T., 2011. In situ X-ray diffraction study on pressure-induced structural changes in hydrous forsterite and enstatite melts. Earth Planet. Sci. Lett. 308, 115–123. Yoshida K., Kuwatani T., Hirajima T., Iwamori H., Akaho S., 2018. Progressive evolution of whole-rock composition during metamorphism revealed by multivariate statistical

Jo u

rn

al

Pr

e-

pr

oo f

analyses. J. Metamorph. Geol. 36, 41–54.

Journal Pre-proof

36

Rank

WSiO2

WAl2 O3

WFeO

WMgO

WCaO

WNa2 O

WK2 O

WH2 O

WH2 O2

2319

1

165702

115981

198584

142204

164325

318651

240832

123461

189732

2326

2

165676

116006

198491

142368

164235

318708

241339

132531

168468

2330

3

164845

118056

198309

143594

164276

319304

243483

195683

2357

4

166833

113856

199559

139773

164586

317045

234356

2376

5

162164

126073

200690

143310

167548

319937

235383

2422

6

172617

117492

199796

134838

152627

299425

123550

181418

2432

7

172620

117486

199815

134807

152651

299422

121646

185885

2436

8

171770

119765

199627

136087

152583

299892

191284

2454

9

173500

115469

200762

132621

153283

298405

2470

10

169080

127119

201786

136056

155965

301073

oo f

CVE (J)

Table 1

pr

ePr al rn Jo u

−2158 −15638

497525

27805

157992

453 −14409

488172 156599

Parameters (J/mol) and cross-validation error (CVE) values of the top 10 (lowest CVE) model equations.

WH2 O0.5

27951

37

e-

pr

oo f

Journal Pre-proof

Pr

Fig. 1

Variations in temperature and pressure (a) and in temperature and the total water

al

concentration of melt (b) for the dataset of experimental results used in this study. The anhydrous-melting experimental results are those reported by Ueki and Iwamori (2014) (i.e.,

rn

Takahashi, 1980; Kinzler and Grove, 1992; Falloon et al., 1997, 1999, 2001; Kinzler, 1997;

Jo u

Kogiso et al., 1998; Robinson et al., 1998; Wagner and Grove, 1998; Falloon and Danyushevsky, 2000; Pickering-Witter and Johnston, 2000; Schwab and Johnston, 2001; Longhi, 2002; McDade et al., 2003; Wasylenki et al., 2003; Laporte et al., 2004; Parman and Grove, 2004), and the hydrous-melting experimental results are those reported by Hirose and Kawamoto (1995), Hirose (1997), Gaetani and Grove (1998), Falloon and Danyushevsky (2000), Laporte et al. (2004), Parman and Grove (2004), and Mitchell and Grove (2015).

38

Pr

e-

pr

oo f

Journal Pre-proof

Fig. 2

al

Bivariate plot of cross- validation error (CVE) values versus the number of variables used in the model equations. Equation types are indicated by circles and colors: the filled red circle

rn

represents the ideal model; filled light- gray circles represent models using water-related terms exclusively; filled dark-gray circles represent models using water-related terms and

Jo u

K2O exclusively; filled dark-blue circles represent models using water- and alkaline (Na2 O and K2 O)-related terms exclusively; filled yellow circles represent models using the SiO 2 , FeO, and MgO terms; open black and open blue circles represent models using and not using the SiO 2 term, respectively; and open green circles represent models not using water-related terms.

39

Pr

e-

pr

oo f

Journal Pre-proof

Fig. 3

al

CVE values, variables, and parameters of the top 100 (lowest CVE) equations. Each row

rn

represents a single equation. The CVE of each equation is shown at the far right. Parameter values in each equation are represented by color according to the scale shown. A blank (white

Jo u

space) in the matrix indicates that a particular variable was not used in that equation.

40

Pr

e-

pr

oo f

Journal Pre-proof

Fig. 4

al

Calculated and experimentally determined olivine liquidus depression as a function of the total water concentration of melt. Liquidus depression was calculated as the difference

rn

between the experimental wet liquidus temperature and the dry liquidus temperature. The black region in the diagram represents the result derived using the optimal model (Eq. 13).

Jo u

Open and closed symbols represent data for just above and just below olivine liquidus conditions, respectively, obtained from the experiments of Tatsumi (1982), Pichavant et al. (2002), Almeev et al. (2007), and Médard and Grove (2008). The experimental results plotted in this figure were obtained from experiments that did not satisfy our data compilation criteria or from studies that lacked the analytical data required for model-selection and parameter optimization; consequently, these data were not used for these tasks in the present study. Calculations were performed using the same melt compositions as those of the plotted experimental results. Representative values of each experiment were used for pressure and olivine composition in the calculations. Therefore, the temperature range of the calculated liquidus depression in Fig. 4 represents the variations in the melt and olivine compositions and pressure conditions of the experiments. Liquidus depression lines were calculated using the empirical models of Falloon and Danyushevsky (2000), Sugawara (2000), Ariskin and Barmina (2004; “COMAGMAT”), Almeev et al. (2007), Médard and Grove (2008), and the

Journal Pre-proof

41

thermodynamic model pMELTS (Ghiorso et al., 2002). The result obtained using the linear model [a model containing only the linear term of the water-related terms (in addition to the element-oxide terms); see text for details] is also shown. For pMELTS and linear- model calculations, the melt composition reported by Pichavant et al. (2002) were used. The results obtained using previous models are indicated by the following abbreviations: Av, Almeev et al. (2007); Ct, COMAGMAT; FD, Fallon and Danyushevsky (2000); MG, Médard and

Jo u

rn

al

Pr

e-

pr

oo f

Grove (2008); pM, pMELTS; Sg, Sugawara (2000).

42

e-

pr

oo f

Journal Pre-proof

Pr

Fig. 5

Calculated and experimentally determined values of the distribution coefficient

al

for melt–olivine plotted against the total water concentration of melt. The solid lines show the results obtained using the optimal model, and the dashed lines show the

rn

results obtained using the linear model (i.e., the model using only the linear term of the

Jo u

water-related terms in addition to the element-oxide terms; see text for details). For melt and olivine compositions, the averaged values of the experimental dataset were used. The pressure was fixed at 1 GPa. Also shown are

values calculated from the experimental

results that were used for model-selection and parameter optimization.

Journal Pre-proof

43

Declaration of interests

☒ The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Jo u

rn

al

Pr

e-

pr

oo f

☐The authors declare the following financial interests/personal relationships which may be considered as potential competing interests:

Journal Pre-proof

44

Author Statement Kenta Ueki: Conceptualization, Methodology, Software, Formal analysis, Investigation, Data Curation, Writing - Original Draft Tatsu Kuwatani: Conceptualization, Methodology, Writing Original Draft Atsushi Okamoto: Supervision Shotaro Akaho: Software, Supervision Hikaru

Jo u

rn

al

Pr

e-

pr

oo f

Iwamori: Writing - Review & Editing, Supervision

Journal Pre-proof

45

Highlights 

A machine learning method for petrological thermodynamic modeling is presented



The method selects key variables from experimental data on hydrous-melt systems



A robust thermodynamic model of the hydrous-melt–olivine equilibrium is constructed Model calculations reproduce measured effect of H2 O on olivine liquidus temperature



Improved understanding of the underlying mechanisms of hydrous-melt systems

Jo u

rn

al

Pr

e-

pr

oo f