Integration of Fuzzy Analytic Hierarchy Process into multi-objective Computer Aided Molecular Design

Integration of Fuzzy Analytic Hierarchy Process into multi-objective Computer Aided Molecular Design

Accepted Manuscript Title: Integration of Fuzzy Analytic Hierarchy Process into Multi-objective Computer Aided Molecular Design Authors: Jecksin Ooi, ...

665KB Sizes 0 Downloads 14 Views

Accepted Manuscript Title: Integration of Fuzzy Analytic Hierarchy Process into Multi-objective Computer Aided Molecular Design Authors: Jecksin Ooi, Michael Angelo B. Promentilla, Raymond R. Tan, Denny K.S. Ng, Nishanth G. Chemmangattuvalappil PII: DOI: Reference:

S0098-1354(17)30416-7 https://doi.org/10.1016/j.compchemeng.2017.11.015 CACE 5959

To appear in:

Computers and Chemical Engineering

Received date: Revised date: Accepted date:

7-7-2017 12-10-2017 14-11-2017

Please cite this article as: Ooi, Jecksin., Promentilla, Michael Angelo B., Tan, Raymond R., Ng, Denny KS., & Chemmangattuvalappil, Nishanth G., Integration of Fuzzy Analytic Hierarchy Process into Multi-objective Computer Aided Molecular Design.Computers and Chemical Engineering https://doi.org/10.1016/j.compchemeng.2017.11.015 This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

Integration of Fuzzy Analytic Hierarchy Process into Multiobjective Computer Aided Molecular Design Jecksin Ooia, Michael Angelo B. Promentillab, Raymond R. Tanb, Denny K. S. Nga, Nishanth

a

IP T

G. Chemmangattuvalappila Department of Chemical and Environmental Engineering/ Centre of Excellence for Green

SC R

Technologies, The University of Nottingham Malaysia Campus, Broga Road, 43500 Semenyih, Selangor, Malaysia b

Centre for Engineering and Sustainable Development Research, De La Salle University,

U

2401 Taft Avenue, 0922 Manila, Philippines

N

HIGHLIGHTS

Novel methodology for multiobjective molecular design

๏‚ท

Integrate Fuzzy Analytic Hierarchy Process in molecular design

๏‚ท

Consideration of different classes of property targets in molecular design

๏‚ท

Application example in solvent design for oil extraction

PT

CC E

Abstract

ED

M

A

๏‚ท

In this paper, a novel Computer Aided Molecular Design (CAMD) framework is developed to solve multi-objective molecular design problems. CAMD can be formulated as a multi-

A

objective optimisation problem when there are multiple target properties to be optimised simultaneously. A major obstacle faced by multi-objective CAMD problems is the difficulty in assigning weighting factors to the target properties, since the relative importance of these factors is not always defined. It is particularly difficult to compare target properties which belong to different categories, such as physicochemical, safety, health and environmental 1

properties, on a common scale. This paper presents a systematic CAMD algorithm built on Fuzzy Analytic Hierarchy Process (FAHP) to deal with the ambiguity involved in evaluating the weights of target properties in multi-objective CAMD problem. Instead of using exact numerical values, FAHP approach expresses the pairwise comparison of target properties through triangular fuzzy numbers, which allow the degree of confidence of decision maker

IP T

to be quantified. Hence, the proposed approach can address the uncertainties arising from

ambiguity involved during value judgement elicitation in multi-objective CAMD problems.

SC R

The solutions generated provide a better balance of performance for a set of identified target

properties. The proposed methodology is illustrated through a case study on designing a

U

better solvent for extracting residual oil from palm pressed fibre.

N

Keywords: Computer Aided Molecular Design (CAMD), Fuzzy Analytic Hierarchy Process

A

(FAHP), multi-objective optimisation, multi-objective molecular design

M

Nomenclature

ED

CAMD = Computer Aided Molecular Design AHP = Analytic Hierarchy Process

PT

FAHP = Fuzzy Analytic Hierarchy Process

CC E

lo g K o c = Soil sorption coefficient

ORCs = Organic Rankine Cycles

A

TFN = Triangular fuzzy number IPCCWN = Inter-plant chilled and cooling water network GCM = Group contribution method C i = Contribution of the first-order group of type-i

2

D

= Contribution of the second-order group of type-j

j

E k = Contribution of the third-order group of type-k

N i = Number of occurrence of the first-order group of type-i

j

= Number of occurrence of the second-order group of type-j

O k = Number of occurrence of the third-order group of type-k

= Value of target property

L

= Lower bound of product specification

U

= Upper bound of product specification

vp

A

vp

ED

M

IEX = Explosiveness based index IFL = Flammability based index

U

p

N

V

SC R

= Target property

p

IP T

M

ISI = Inherent Safety Index

PT

NFPA = National Fire Protection Association

CC E

IOHI = Inherent Occupational Health Index IEL = Exposure limit index

A

IAH = Acute health hazards ๐ผ๐‘†๐ป๐ผ,๐‘ค = Total weighted penalty score for safety and health aspects of molecules I = Binary integer variable wi / w

j

= solution ratio

3

A

= Fuzzy pairwise comparison matrix

aห† i j

= Fuzzy number

Lห† i j

= lower bound of triangular fuzzy number

ij

= modal value of triangular fuzzy number

Uห† i j =

IP T

Mห†

upper bound of triangular fuzzy number

w

SC R

๐›ฟ๐‘ = degree of confidence = optimal priority vector

U

ฮป = consistency index

A

N

GT= Total number of groups selected

M

v i = valence of group i

w m = weighting factor

ED

F w e ig h te d s u m = Overall objective function

= normalised target property operator

CC E

๏ฌ pm

PT

๏ฌ p = target property operator

p m in

๏—

p m ax

A

๏—

= minimum value of target property operator = maximum value of target property operator

PPF = Palm pressed fibre TAGs = Triglycerides ฮด = Hildebrand solubility parameter 4

Rcarotene = difference of Hildebrand solubility parameter between solvent and carotene T b = boiling point

F p = Flashpoint ๏ญ

= Viscosity

rat acute toxicity

L C 5 0 = Fathead

= Bioconcentration factor

M

BCF

= Photochemical oxidation potential

N

PCO

minnow toxicity

U

L D 5 0 = Oral

SC R

= Molecular weight

w

A

M

IP T

ฯƒ = surface tension

ED

lo g K o w = Octanol-water partition coefficient

PEL = Permissible exposure limit

PT

LEL = Lower explosion limit

CC E

UEL = Upper explosion limit

LFL = Lower flammability limit

A

UFL = Upper flammability limit D AB

vA

๏ช

= Diffusivity of oil in solvent B

= Molar volume of oil = Association factor for solvent

5

1. Introduction During the past three decades, Computer Aided Molecular Design (CAMD) has become a successful tool for addressing some challenges in the design and selection of molecules for various applications, such as solvent design, polymer design and refrigerant design (Austin et al., 2016). CAMD is a reverse engineering approach, aimed at generating molecules with a

IP T

set of predefined target properties and using molecular building blocks (Harper and Gani, 2000). CAMD techniques can provide a promising systematic route to design molecules.

SC R

In conventional practice, the target properties of a molecule usually include physical and thermodynamic properties such as boiling point and viscosity. More recently, safety, health

U

and environmental (SHE) aspects have also become important due to progressively stricter

N

SHE regulations in most countries. Chemical industries have thus been prompted to use

A

molecules with desirable product functionality but do not pose major hazards to humans and

M

the environment. When multiple target properties need to be optimised simultaneously, CAMD problem can be formulated as a multi-objective optimisation problem. Such problems

ED

can be solved using different strategies. Preference-based methods and solution-generating methods are the two commonly used categories for solving multi-objective optimisation

PT

problem (Diwekar, 2003). Goal programming is the most common preference-based method,

CC E

whereas the ๏ฅ-constraint method and weighting methods are the two common solutiongenerating methods. In goal programming, the decision maker needs to define an a priori goal for each objective, and subsequently solve the model to minimise the total deviation

A

from goals. On the other hand, the ๏ฅ-constraint method generates a set of Pareto optimal solutions from which a final solution can be selected; however, the method itself does not provide further basis for selecting a final solution from the Pareto set. By comparison, the weighting method can identify a unique optimal solution for actual implementation.

6

In the weighting method, the weight of each objective is specified before solving the multiobjective optimisation as an equivalent single-objective problem. The main drawback of the method is the inherent subjectivity in specifying values of the weighting factor. In addition, it is also difficult to evaluate the relative importance of target properties that belong to different categories, such as physical, safety, health and environmental aspects.. Hence, there will be

IP T

uncertainties arising from ambiguity involved during judgement elicitation. This step becomes problematic in the absence of a systematic procedure to evaluate the relative

SC R

importance of the properties. In such cases, the assigned weights will then provide no insight

into the underlying multi-objective CAMD problem. It is particularly important to address

U

this issue, as good alternatives may be eliminated due to faulty weighting. Moreover, even

N

slightly different weighting factors of objectives can generate different optimal molecules. To

A

overcome this limitation, a procedure that enables the user to systematically evaluate the

M

relative importance between properties must be integrated into the CAMD framework. In this work, Fuzzy Analytic Hierarchy Process (FAHP) approach is proposed to quantify the

involved in the problem.

ED

subjective opinion of decision makers in eliciting the relative importance of each property

PT

FAHP is an extension of the classical Analytic Hierarchy Process (AHP), a decision analysis

CC E

technique which structures the decision-making problem in a hierarchical manner. Both methods are capable of comparing qualitative and quantitative criteria in a systematic manner through a decomposition strategy. In the conventional AHP approach, Saaty (1980)

A

developed a fundamental 9-point scale to help a decision maker to assess pairwise comparison. The intensity of absolute scale assigned can express the relative importance of two criteria. Although the discrete numerical scale of AHP has the benefit of simplicity, in its basic form it cannot successfully manage the uncertainty of subjective opinions involved during the pairwise comparison. This is because the point estimate represented by a single 7

numerical value cannot model the degree of confidence that a decision maker puts in a particular judgement. In order to eliminate this limitation, FAHP approach allows the uncertainty and vagueness of decision makersโ€™ opinion to be captured through fuzzy set theory (Zadeh, 1965). Fuzzy set theory is a well-known framework that can account for uncertainty and vagueness in human reasoning and decision-making. In the FAHP approach,

IP T

a fuzzy scale is used to express the preference or relative importance of target properties,

instead of allocating the exact numerical numbers in pairwise comparison. Hence, FAHP

SC R

approach both relative importance and the confidence level to be captured during pairwise comparison. For these reasons, FAHP approach is a potential solution to address the

U

ambiguity involved in assigning the relative weights of target properties in multi-objective

N

CAMD problem. The following sections present a summary of current state of the art in

A

CAMD and FAHP approach.

M

1.1 Computer Aided Molecular Design (CAMD)

CAMD technique has developed as a promising tool in chemical product design due to its

ED

ability to predict, estimate and design molecules with a set of predefined target properties and

PT

molecular building blocks (Harper and Gani, 2000). As stated by Papadopoulos and Linke (2006), it is important to formulate CAMD problem as a multi-objective optimisation

CC E

problem to capture the relationship between all the physicochemical properties and SHE aspects of a molecule. Different approaches have been proposed for this purpose. For instance, an iterative step-wise methodology has been proposed by Pistikopoulos and Stefanis

A

(1998) to design environmentally benign solvents which can achieve optimal separation performance at minimum cost. In their work, a methodology for environmental impact minimisation (MEIM), which considers environmental properties and economics at the design stage, has been incorporated into CAMD framework. Likewise, Papadopoulos and Linke (2006) have developed a novel multi-objective CAMD methodology for identifying 8

Pareto optimal solvents for liquid-liquid extraction and gas absorption processes based on multiple environmental indices and technical performance metrics. The proposed work allows the solvent design information to be incorporated into process synthesis. However, the proposed methodology does not guarantee that the generated Pareto points are evenly distributed into solvent solution space. In practice, there is a need to identify a unique

IP T

optimum solution on the Pareto front. Papadopoulos et al., (2010) then further applied the

multi-objective CAMD methodology to design and identify the optimal working fluids which

SC R

possess desirable physicochemical characteristic as well as having favourable environmental,

safety and economic performance for a low temperature Organic Rankine Cycles (ORCs).

U

Also, Ng et al., (2014) have proposed a novel methodology to solve a multi-objective

N

molecular design problem by incorporating fuzzy and bilevel optimisation into CAMD

A

techniques. In his work, soil sorption coefficient (log Koc) together with vapour pressure and

M

molar volume of the solvent are optimised simultaneously to design an amine-based solvent which exerts favourable performance and environmental characteristics. Recently,

ED

Papadopoulos et al., (2016) developed a systematic two-stage procedure to determine and choose a set of promising solvents for chemical absorption of carbon dioxide. In their work,

PT

reactivity, thermodynamic behaviour and sustainability criteria have been chosen as performance criteria in the solvent design. In another contribution, a single-stage chemical

CC E

product design framework employing CAMD technique has been proposed by Ten et al., (2016) to design molecules which satisfy a set of desired properties and pose a low hazard

A

risk to the environment and community. The measurement of safety and health indicators are based on the molecular properties that have effects on both aspects. Furthermore, CAMD technique has recently adapted to design sustainable solvents to extract residual oil from palm pressed fibres (PPF) (Khor et al., 2017). In order to design solvents which are safe for the

9

food industry, safety and health properties are optimized simultaneously together with physical properties. 1.2 Fuzzy Analytic Hierarchy Process (FAHP) FAHP is a variant of the widely known AHP, which was first developed by Saaty (1977). AHP approach structures a problem in a hierarchical form, with decision-making objective at

IP T

the top, criteria and/or sub-criteria at the intermediate levels, and the alternatives at the bottom. AHP approach utilises a decomposition strategy based on the principle that it is more

SC R

accurate and easier for a human decision maker to compare two criteria at a time instead of comparing multiple criteria simultaneously. At the same time, in AHP, each layer in the

U

hierarchy is initially solved separately; the final step of the methodology allows the

N

components to be synthesized into a coherent decision. In AHP, the criteria or alternatives are

A

compared in a pairwise manner based on discrete numerical scales which is a fundamental 9-

M

point scale proposed by Saaty (1980). In addition, AHP also has the advantage of allowing the judgement on both qualitative and quantitative criteria to be performed on the same scale

ED

of analysis. However, the process of eliciting the relative importance of two criteria is usually complex and uncertain. It is often difficult to give exact numerical values which reflect the

PT

true character of human thinking in pairwise comparison (Chen and Fan, 2011). As

CC E

previously mentioned, AHP approach utilises discrete numerical scales which makes it alone difficult to handle the fuzziness or ambiguity of subjective opinions that occurr during pairwise comparison. To overcome this limitation, the FAHP method is developed.

A

Fuzzy set theory has proven to be an effective framework for dealing with vagueness and uncertainty (Zadeh, 1965). It was developed to provide a framework that mimics human reasoning for many decision problems (Tseng et al., 2009). Triangular fuzzy numbers (TFNs) are often used due to their simplicity, since their membership functions can be characterized using only three numbers. The FAHP method translates linguistic judgements into TFNs, 10

organized in fuzzy pairwise comparison matrices. FAHP has been widely applied in various fields of application. For example, the extended analysis method proposed by Chang (1996) to derive the crisp weighting factor from TFNs has been applied by Yang et al., (2011) in prioritizing environmental issues in offshore oil and gas operations. Besides, Tan et al., (2014) have proposed a FAHP methodology for process engineering problems which involve

IP T

multiple criteria and alternatives. This methodology can determine a set of crisp weighting factors from scaled fuzzy judgements. In recent years, FAHP approach has also been applied

SC R

in synthesizing inter-plant chilled and cooling water network (IPCCWN) (Leong et al., 2016). In this contribution, a two-stage procedure which involves fuzzy optimisation and FAHP

U

method have been developed for generating alternative IPCCWN designs and selecting the

N

most optimum network design based on qualitative and quantitative criteria. Note that the

A

above examples have shown that FAHP method is well suited to solving complex real-world

M

decision-making problems which involve subjective evaluation. In this work, FAHP approach

2. Methodology

ED

is extended into multi-objective CAMD problems.

PT

The main purpose of this work is to present a novel multi-objective CAMD methodology for systematic allocation of weighting factors to each target property the problem. The

CC E

integration of FAHP approach into multi-objective CAMD framework can account for the inherent vagueness and uncertainty arising from the determination of weighting factors of each property. By using the proposed methodology, a molecule that gives the best balance of

A

performance in terms of physicochemical properties as well as SHE aspects can be designed. The proposed framework is summarised in the following steps, illustrated in Figure 1:

11

Step 1: Determine design objective for CAMD problem

Step 2: Identify and analyze target properties

Step 3: Determine property prediction model

Step 4: Select safety and health indices

Is property prediction model available for target property?

No

IP T

Develop statistical property prediction model

Yes

Perform disjunctive programming

Is the safety and health index score depend on property value?

Yes

U

No

SC R

Step 5: Determine limits for property constraint

A

N

Are there multiple target properties to be optimized?

No

Yes

M

Step 6: Establish hierarchy model

A

CC E

PT

ED

Step 7: Construct fuzzy pairwise comparison matrix

Step 8: Compute optimal priority vectors using NLP formulation

Is consistency index, ฮป value between 0-1?

No

Yes Step 9: Repeat step 7-8 for all hierarchy level and obtain overall weighting factors

Step 10: Molecular design stage

Enumerate molecular structure

Step 11: Define objective function and solve mathematical model

Figure 1: A multi-objective molecular design framework embedded with Fuzzy Analytic Hierarchy Process approach 12

2.1 Identification of design objective In the first step, the identification of design objective for CAMD problem is done by determining the needs of chemical product based on customersโ€™ preferences, or from operating condition of industrial processes. Physical and thermodynamic properties are usually used to define the product specifications. Other than these properties, SHE criteria are

IP T

also considered to ensure that the designed molecule also achieves favourable performance.

Safety and health performance of molecules can be characterised by appropriate indices,

SC R

which will be further illustrated in Section 2.3. The desired target properties and criteria will then be selected as design objectives to ensure that the designed molecules simultaneously

U

achieve the functionality and pose low safety, health and environmental hazard levels.

N

2.2 Determination of property prediction model

A

Once the product specifications and SHE targets are identified, the next step is the estimation

M

of all identified target properties via property prediction models. Group contribution method (GCM) is one of the most common approaches in CAMD problems as only molecular

ED

structure of the pure component is needed to estimate physical and chemical properties

f ( p) ๏€ฝ

๏ƒฅ

N iC i ๏€ซ w ๏ƒฅ M j D j ๏€ซ z ๏ƒฅ O k E k

CC E

i

PT

(Mattei et al., 2014). GCM equation is expressed by Equation (1) (Marrero and Gani, 2001):

j

(1)

k

where C i represents the contribution of the first-order group of type-i that occurs N i times, D

j

the contribution of the second-order group of type-j that occurs M

j

times and E k the

A

contribution of the third-order group of type-k that has O k occurrences in a compound. The left-hand side of Equation (1) is the simple function f ( p ) of the target property p . In this work, first-order groups are applied as only simple-structured molecules are to be considered.

13

Empirical relationships and correlations can be used to estimate properties if no GCM available. Next, the upper and lower bounds of property constraints must be first determined to ensure the generated molecules will show similar physical characteristics as a conventional solvent. The target property ranges are identified based on product specifications, and is usually

IP T

defined by process requirements or customersโ€™ need. The product specification range can be

vp ๏‚ฃ Vp ๏‚ฃ vp

where

U

Vp

๏€ขp ๏ƒŽ P

is the value of target property p,

L

vp

and

U

vp

(2)

represents the lower and upper limit of

U

L

SC R

represented by Equation (2):

A

2.3 Identification of safety and health indices

N

product specification.

M

Due to the growing number of industrial accidents, there is an increased emphasis on process safety in chemical industries. Safety hazards can be minimized by having milder process

ED

operating condition and replacing those hazardous materials with less harmful substances.

PT

Besides, occupational health should not be neglected as health hazards will often lead to chronic diseases after a prolonged exposure. The safety and health impact of a chemical

CC E

product can be assessed by appropriate safety and health indices. Under safety aspects, two safety indices are applied in this work to evaluate the safety

A

characteristics of the molecules, namely explosiveness based index (IEX) and flammability based index (IFL). The index score for IEX is obtained from Inherent Safety Index (ISI) (Heikkilรค, 1999) whereas the scores for IFL are taken from National Fire Protection Association (NFPA) flammability rating (National Fire Protection Association, 2007). On the other hand, health indices such as exposure limit based index (IEL) and acute health hazards (IAH) are selected to assess the health performance of designed molecules. The index score for 14

IEL is taken from Inherent Occupational Health Index (IOHI) (Hassim and Hurme, 2010) whereas the index score for IAH is obtained from the NFPA health hazard rating (National Fire Protection Association, 2007). The penalty score for both safety and health indices are presented in Appendix 6.1. The total weighted penalty score for safety and health aspects of molecules (๐ผ๐‘†๐ป๐ผ,๐‘ค ) can be calculated by summing up all the sub-index values assigned to it,

weighting factor for each sub index will be determined by FAHP approach.

SC R

๐ผ๐‘†๐ป๐ผ,๐‘ค = ๐‘ค๐น๐ฟ ๐ผ๐น๐ฟ + ๐‘ค๐ธ๐‘‹ ๐ผ๐ธ๐‘‹ + ๐‘ค๐ธ๐ฟ ๐ผ๐ธ๐ฟ + ๐‘ค๐ด๐ป ๐ผ๐ด๐ป

IP T

which is dictated by Eq. (3), where wi represents the weighting factor for sub-index i. The

2.4 Problem formulation

(3)

U

Since the assignment of penalty score to a molecule is dependent on the property value, the

N

disjunctive programming can be applied to translate the molecular properties into index

A

scores which imply the hazard level of molecules (Ten et al., 2016b). Disjunctive

M

programming is a modelling approach which applies discontinuous functions to trace the abrupt changes over a certain decision variable (El-Halwagi, 2011). For example, a score of

ED

IA will be assigned to a sub-index score if the corresponding property,

p sw itc h .

๏ƒฌIA Ip ๏€ฝ ๏ƒญ ๏ƒฎ IB

value is equal to or

The sub-index score model is shown by Eq. (4):

CC E

larger than

p

value is smaller

PT

than a certain property value whereas a score of IB is allocated when

p

p ๏€ผ p s w itc h

p ๏‚ณ p s w itc h

(4)

A

Binary integer variables are normally introduced to develop models for such cases which involve discontinuous function. Discontinuous functions can be transformed to the following mixed-integer formulation using a binary integer variable (I): I p ๏€ฝ I A * I ๏€ซ I B * ( I ๏€ญ 1)

(5)

15

Subject to the following condition: ๏ƒฌ 0 p ๏€ผ p s w itc h I ๏€ฝ ๏ƒญ ๏ƒฎ 1 p ๏‚ณ p s w itc h

(6)

To ascertain that the model allocates the correct value to I to fulfil condition (6), the

( p L ๏€ญ p sw itc h ) * I ๏€ผ p ๏€ญ p sw itc h ๏‚ฃ ( p U ๏€ญ p sw itc h ) * (1 ๏€ญ I )

<

pL

p sw itch

and

pU

I ๏ƒŽ ๏ป 0 ,1๏ฝ

(7)

are lower and upper limits respective to any feasible value of

, the term

p ๏€ญ p s w itc h

becomes negative, forcing p

>

. When

p

to have value 1 to satisfy both p sw itch

,

I

is forced to be 0 to again

U

equalities in the constraint (6). On the contrary, when

I

p

SC R

where

IP T

constraints written in Eq. (7) must be included:

N

fulfil both equalities in constraint (6). The illustrative example for the formulation of

A

disjunctive programming will be further discussed in section 3.3.

M

Since there are multiple target properties as well as safety and health indices to be

ED

simultaneously optimised in the CAMD problem, FAHP approach is utilised to systematically determine a weighting factor for each property. The initial step is to model the

PT

decision problem as a hierarchy which is established in the way where the goal of the design problem is located at the top, followed by criteria and/or sub-criteria, and finally with the

CC E

alternatives at the bottom (Saaty, 1980). The relationship between elements of one level with those of the level below can be illustrated with the use of a hierarchical framework.

A

This step is then followed by performing pairwise comparisons based on expert judgement to derive the relative importance of criteria at a particular level. For example, the comparison between two main properties is carried out by asking the question: โ€œHow much more important is property A compared to property B with respect to a satisfaction of the goal of decision problem?โ€ The intensity of importance of one property over the other property

16

within the same level with respect to a common property in the upper level is represented in the form of solution ratios (

wi / w

j

). The solution ratio

wi / w

j

denotes the relative

importance of property in ith row over the property in the jth column with respect to the specific element in the upper level. In a typical AHP problem, the 9-point scale (Saaty, 1980) wi / w

j

.

IP T

is used to translate a linguistically expressed judgement into a numerical ratio

However, to capture the uncertainty and vagueness of judgement during the pairwise

the exact numerical value to approximate the solution ratio aห† i j

represent the value judgement

๏ƒถ ๏ƒท aห† 2 n ๏ƒท ๏ƒท ๏ƒท ๏€ผ 1, 1, 1 ๏€พ ๏ƒธ

aห† n 2

๏€ฉ

(8)

ED Lห† i j

, modal value

Mห†

aห† i j

ij

is the fuzzy judgement which is

and upper bound

Uห† i j ).

For instance,

PT

represented by TFN (lower bound

is identified to be more or less equally important, it is represented by the TFN < 1/1 +

CC E

aห† i j

๏€จ

w h e r e aห† ji ๏€ฝ 1 / aห† ij ๏€ฝ 1 / Lห† ij ,1 / Mห† ij , 1 / Uห† ij

where A is the fuzzy pairwise comparison matrix,

if

, a fuzzy scale is used to

M

๏€ผ 1, 1, 1 ๏€พ

j

U N

aห† 1 n

A

aห† 1 2

wi / w

as TFN. The general form of a FAHP pairwise comparison

matrix can be expressed by Eq. (8). ๏ƒฆ ๏€ผ 1, 1, 1 ๏€พ ๏ƒง aห† 2 1 A ๏€ฝ ๏ƒง ๏ƒง ๏ƒง aห† n 1 ๏ƒจ

SC R

comparison, TFNs are used to extend the standard AHP 9-point scale. Thus, instead of using

๐›ฟ๐‘ ,1 , 1 + ๐›ฟ๐‘ > whereas if

aห† i j

is identified to be more important over the other, it is

represented using the fuzzy scale written in Table 1. Note that ๐›ฟ๐‘ is the degree of confidence

A

of the decision maker, with a lower value suggests a higher degree of confidence, and viceversa. ๐›ฟ๐‘ value of one, two and three indicate the high, moderate and low degree of confidence while eliciting the value judgement (Tan et al., 2014). Table 1: Summary of fuzzy scale (Tan et al., 2016) Fuzzy number,

aห† i j

Linguistic scale for comparison of criteria 17

< 1/1 + ๐›ฟ๐‘ ,1 , 1 + ๐›ฟ๐‘ >

More or less equally important

< max (1, 3 โ€“ ๐›ฟ๐‘ ), 3, min (9, 3 + ๐›ฟ๐‘ )>

Moderately more important

< max (1, 5 โ€“ ๐›ฟ๐‘ ), 5, min (9, 5 + ๐›ฟ๐‘ )>

Strongly more important

< max (1, 7 โ€“ ๐›ฟ๐‘ ), 7, min (9, 7 + ๐›ฟ๐‘ )>

Very strongly more important

< max (1, 9 โ€“ ๐›ฟ๐‘ ), 9, min (9, 9 + ๐›ฟ๐‘ )>

Extremely more important

IP T

The optimal priority vectors (๐‘ค) that approximate the solution ratio in pairwise comparison matrix are then computed using the nonlinear programming (NLP) formulation proposed by

SC R

Promentilla et al., (2014), which is shown in Eq. (9): max ๐œ†;

U

subject to:

N

๐œ†(๐‘€๐‘–๐‘— โˆ’ ๐ฟ๐‘–๐‘— )(๐‘ค๐‘— ) โˆ’ ๐‘ค๐‘– + ๐‘ค๐‘— ๐ฟ๐‘–๐‘— โ‰ค 0;

M

๐œ†(๐‘ˆ๐‘–๐‘— โˆ’ ๐‘€๐‘–๐‘— )(๐‘ค๐‘— ) โˆ’ ๐‘ค๐‘– + ๐‘ค๐‘— ๐‘ˆ๐‘–๐‘— โ‰ค 0;

A

๐œ†(๐‘€๐‘—๐‘– โˆ’ ๐ฟ๐‘—๐‘– )(๐‘ค๐‘– ) โˆ’ ๐‘ค๐‘— + ๐‘ค๐‘– ๐ฟ๐‘—๐‘– โ‰ค 0;

(9a)

(9b) (9c) (9d) (9e)

โˆ‘๐‘›๐‘š=1 ๐‘ค๐‘š = 1; ๐‘ค๐‘š > 0

(9f)

PT

ED

๐œ†(๐‘ˆ๐‘—๐‘– โˆ’ ๐‘€๐‘—๐‘– )(๐‘ค๐‘– ) โˆ’ ๐‘ค๐‘— + ๐‘ค๐‘– ๐‘ˆ๐‘—๐‘– โ‰ค 0;

where ๐ฟ๐‘–๐‘— is the lower bound, ๐‘€๐‘–๐‘— is the modal value lambda, ๐‘ˆ๐‘–๐‘— is the upper bound and ฮป is

CC E

the consistency index, which indicate the degree of satisfaction of all computed weight ratios that would satisfy the initial fuzzy judgments. ฮป value shall range from 0 to 1. ฮป value of 0

A

denotes that the judgements are fulfilled at the boundaries whereas a value of 1 indicates perfect consistency (Tan et al., 2014). Therefore, by maximising ฮป, the most consistent set of weighting factors within the fuzzy bounds elicited from decision maker can be determined. However, if ฮป value falls out of the range, it implies a completely inconsistent pairwise

18

comparison and there will be no solution. In such case, it is necessary to elicit the judgement again. After obtaining the optimal priority vector of each property and sub-property, the final weighting factor of each sub-property, ๐‘ค๐‘š in the overall system are calculated by multiplying

IP T

the priority vector of sub property with the priority vector of its main property. 2.5 Molecular design stage

SC R

The subsequent step includes the selection of potential molecular building blocks based on the nature of the target molecules. Structural constraints are applied to ensure that the

molecule formed is structurally feasible. To design a molecule, the summation for the number

U

of occurrences for all selected groups must be greater than zero. This rule is explained

N

mathematically by Equation (10):

๏ƒฅ

A

GT

Ni ๏€พ 0

(10)

M

i ๏€ฝ1

ED

where N i is the number of occurrences of group i while GT is the total number of groups selected to generate the molecules. Besides, the octet rule of structural feasibility is applied to

GT

N i ๏€จ 2 ๏€ญ vi ๏€ฉ ๏€ฝ 2 g

CC E

๏ƒฅ

PT

ensure that a molecule has no free attachment. This rule is dictated in Equation (11):

(11)

i ๏€ฝ1

where v i is the valence of group i and g is 1, 0, -1 or -2 for acyclic, monocyclic, bicyclic and

A

tricyclic compounds. In this work, simple-structured acyclic compounds and monocyclic compounds are considered. Hence, Equation (11) can be reduced to Equations (12) and (13) for acyclic and monocyclic compound respectively: GT

๏ƒฅ

N i ๏€จ 2 ๏€ญ vi ๏€ฉ ๏€ฝ 2

(12)

i ๏€ฝ1

19

GT

๏ƒฅ

N i ๏€จ 2 ๏€ญ vi ๏€ฉ ๏€ฝ 0

(13)

i ๏€ฝ1

2.6 Optimisation model Based on the identified design objective, CAMD problem is formulated as a multi-objective optimisation problem. Weighted sum method is applied to solve this multi-objective

IP T

optimisation model. Weighted sum method allows multiple objectives to be converted into an aggregated scalar objective function by first allocating each objective function with a

SC R

weighting factor, and then summing up all the contributors to obtain the overall objective function. However, all the target properties and sub-indices are depicted by different

U

measurement units and scales. Thus, normalisation step is important in bringing them to the

N

same magnitude.

A

The normalisation step can be carried out using Equations (14) and (15). Equation (14) is

M

applied to normalise target property or sub-indices that needs to be maximized whereas Equation (15) is used to normalise target property or sub-indices that needs to be minimized.

๏—

๏ฌp ๏€ฝ

๏€ญ๏—

p m ax

p m in

๏€ญ๏—

(14)

p m in

CC E

๏—

p

.

PT

๏ฌ pm

operator,

ED

The normalised target property or sub-indices is then referred as normalised target property

๏—

๏ฌp ๏€ฝ

A

๏—

where ๏ฌ pm

๏—

p m ax

p m ax

p m in

๏€ญ๏—

๏€ญ๏—

and

p

(15)

p m in

๏—

p m ax

are the minimum and maximum value of target property operator.

will now have values ranging from 0 to 1. The consistent set of weighting factor

computed from FAHP approach together with

๏ฌ pm

can then be used to represent the overall

objective function, which is shown in Equation (16): 20

F w e ig h te d s u m ๏€ฝ w 1 ๏ฌ p 1 ๏€ซ w 2 ๏ฌ p 2 ๏€ซ ... ๏€ซ w m ๏ฌ p m

(16)

where F w e ig h te d s u m is the overall objective function and w m is the weighting factor for each normalised target property operator ๏ฌ

pm

.

IP T

The design objective of this work is to maximize F w e ig h te d s u m . The solution with the highest F w e ig h te d s u m value will be the optimal molecule. After identifying a solution from the

SC R

optimisation model, an integer cut procedure is performed to obtain a ranked list of optimal

and near-optimal molecules. Using the proposed methodology, a more precise and consistent weighting factor is assigned to each target property as the relative importance of each target

U

property in molecular design can be systematically elicited.

A

N

3. Case study:

M

3.1 Problem statement

A case study by Khor et al., (2017) is revisited by considering physicochemical properties

ED

and SHE aspects as design criteria in the early decision-making stages. Palm pressed fibre (PPF) is a by-product produced from the palm oil milling process. According to Choo et al.,

PT

(1996), 5-7% of residual oil is retained in PPF after the oil extraction process. Residual oil

CC E

retained in PPF is enriched with 4500 โ€“ 8500 ppm of sterols, 2400 โ€“ 3500 ppm of vitamin E and 4000 โ€“ 6000 ppm of carotenoids (Choo et al., 1996). Vitamin E and carotenoids are commonly utilised in the food industry because of their antioxidant properties. In current

A

practice, solvent extraction is extensively applied to extract residual oil from PPF. Hexane has been the solvent of choice for extracting residual oil due to its low cost and high oil solubility (de Oliveira et al., 2013). Nonetheless, there is a growing interest in identifying alternative solvents to replace hexane because of several major drawbacks. For example, the high boiling point of hexane causes carotenoids to degrade at a faster rate. In addition, its 21

high flammability leads to higher operating cost as additional layers of protection must be installed in the plant. Furthermore, hexane is not suitable for edible oil extraction due to its adverse effects on human health after a long-term exposure. Fugitive emissions of hexane vapor also contribute to the formation of photochemical smog. Due to these limitations, this work focuses on improving the physical performance and SHE characteristics of solvent via

IP T

multi-objective CAMD. Determination of design objective

their respective translated quantitative properties.

SC R

Table 2 shows the summary of desirable qualitative attributes of the designed solvents and

Table 2: The desirable qualitative attributes of designed solvent with their respective translated quantitative properties

M

ED

Low flammability based index IFL

Low boiling point ( T b ) low viscosity ( ๏ญ ) and surface tension ( ๏ณ ) Small difference of Hildebrand solubility parameter (ฮด) between solvent and carotene (Rcarotene) High flashpoint ( F p )

PT

Safety aspects

Quantitative properties

U

N

Ensure good percolation and surface wetting which will lead to high rates of oil extraction (Bockisch, 1998) Ensure both carotene and triglycerides (TAGs) has high solubility in designed solvents

A

Physical attributes

Desirable qualitative attributes Reduce the degradation rates of carotene

Small difference between lower explosion limit (LEL) and upper explosion limit (UEL) Low exposure limit index IEL High permissible exposure limit (PEL) Low acute health hazard index IAH High oral rat LD50 Low aquatic toxicity High fathead minnow LC50 Minimise the formation of photochemical Low photochemical smog oxidation potential (PCO) Reduce accumulation of solvent in one Low soil sorption place coefficient (log Koc) Reduce the tendency of solvent to Low bioconcentration concentrate in aquatic organism factor (BCF)

CC E

Low explosiveness based index IEX

Health Aspects

A

Environmental Aspects

22

3.2 Identification of property prediction models The following step includes the determination of property prediction models for the identified target properties for the design problem. Tb, ฯƒ, ยต, ฮด, Fp, PEL, Mw, LC50, LD50, PCO, log Kow and BCF are properties that can be estimated using GCM equations whereas log K oc, upper

IP T

explosion limit (UEL) and lower explosion limit (LEL) can be calculated through empirical

SC R

correlations. GCM equations for the selected target properties are displayed in Appendix 6.2

lo g K o c can be estimated through the correlation given in terms of octanol-water partition

coefficient, lo g K o w . The correlation is expressed by the following equation (Seth et al., 1999): (17)

N

U

lo g K o c ๏€ฝ 1 .0 3 lo g K o w ๏€ญ 0 .6 1

A

PEL can be calculated using the equation in Table 3 has a unit of mol/m3. However, the value

M

of PEL used for penalty score calculation has the unit of ppm. Thus, Equation. (17) can be

ED

used to convert the unit of PEL into ppm. P E L ' ๏€ฝ V g a s , s td ๏‚ด P E L ๏‚ด 1 0 0 0

P E L ' has

the unit of ppm whereas

PT

where

(18)

V g a s , s td

is the molar volume of gas or vapour at

CC E

standard conditions (1 atm and 298 K), which has the value of 24.45 dm3/mol. From Table A.2, the difference between upper explosion limit (UEL) and lower explosion limit (LEL), which is denoted as S, can be used to calculate the explosiveness value. Both

A

UEL and LEL are used interchangeably with upper flammability limit (UFL) and lower flammability limit (LFL). Note that the limits can be calculated using the following correlations:

LFL ๏€ฝ

100% 1 ๏€ซ 9 .0 4 5 4 C O

(19)

23

100%

UFL ๏€ฝ

CO

is the oxygen stoichiometric coefficient in a reaction (Ma et al., 2013). Consider a

general compound

C x H yO z N w

C x H yO z N w ( g ) ๏€ซ ( x ๏€ซ

4

๏€ญ

z 2

)O 2 ( g ) ๏‚ฎ x C O 2 ( g ) ๏€ซ

y 2

H 2O ( g ) ๏€ซ

w 2

N 2(g )

can then be calculated using the following equation:

Co ๏€ฝ x ๏€ซ

y 4

๏€ญ

SC R

CO

y

undergoes a complete combustion in the air:

z 2

(21)

IP T

where

(20)

1 ๏€ซ 1 .1 8 4 3 C O

(22)

U

The next step is to identify the upper and lower bounds of property constraint for solvent

N

design. Other than the property constraint, upper and lower limits are also introduced to target

A

properties such as Tb, Rcarotene as well as log Koc and BCF. This step is necessary to ensure

M

that the designed molecules will have a better performance than hexane.

ED

Table 3: Upper and lower bound of properties for solvent design Property

Tb (ยฐC) Rcarotene(unit)

CC E

lo g K o c

PT

Fp (K)

log BCF

Lower Bound

Upper Bound

242

-

40

80

-

3.4

-

4.5

-

3.3

A

3.3 Disjunctive optimisation In this section, the application of disjunctive programming on the index scoring is demonstrated for explosiveness based index (IEX). Consider the following criteria:

24

๏ƒฌ1 ๏ƒฏ ๏ƒฏ2 ๏€ฝ ๏ƒญ ๏ƒฏ3 ๏ƒฏ4 ๏ƒฎ

I EX

0 ๏‚ฃ S ๏€ผ 20 20 ๏‚ฃ S ๏€ผ 45

(23)

45 ๏‚ฃ S ๏€ผ 70 70 ๏‚ฃ S ๏€ผ 100

The explosiveness sub-index,

I EX

score may one, two, three or four depending on S value of I EX

functions

IP T

molecule. Binary integer variables have been used to model these functions.

can be translated to the following mixed-integer formulation using three integer variables I E X 2 and I E X 3 ):

SC R

( I EX 1 ,

I EX ๏€ฝ I EX 1 ๏€ซ I EX 2 ๏€ซ I EX 3 ๏€ซ 1

S ๏€ผ 20

๏ƒฌ0 ๏€ฝ ๏ƒญ ๏ƒฎ1

S ๏€ผ 45

๏ƒฌ0 I EX 3 ๏€ฝ ๏ƒญ ๏ƒฎ1

S ๏€ผ 70

A

(27)

In order to model conditions (25), (26) and (27) which allocate the values of

I EX 1 , I EX 2

and

be either 0 or 1 based on S value of molecule, the following constraints are considered:

CC E

I E X 3 to

(26)

M

S ๏‚ณ 45

S ๏‚ณ 70

(25)

ED

2

S ๏‚ณ 20

PT

I EX

N

๏ƒฌ0 I EX 1 ๏€ฝ ๏ƒญ ๏ƒฎ1

U

Subject to the following conditions:

(24)

I E X 1 ๏ƒŽ ๏ป 0 ,1๏ฝ

(28)

(0 ๏€ญ 4 5 ) * (1 ๏€ญ I E X 2 ) ๏€ผ S ๏€ญ 4 5 ๏‚ฃ (1 0 0 ๏€ญ 4 5 ) * I E X 2

I E X 2 ๏ƒŽ ๏ป 0 ,1๏ฝ

(29)

(0 ๏€ญ 7 0 ) * (1 ๏€ญ I E X 3 ) ๏€ผ S ๏€ญ 7 0 ๏‚ฃ (1 0 0 ๏€ญ 7 0 ) * I E X 3

I E X 3 ๏ƒŽ ๏ป 0 ,1๏ฝ

(30)

A

(0 ๏€ญ 2 0 ) * (1 ๏€ญ I E X 1 ) ๏€ผ S ๏€ญ 2 0 ๏‚ฃ (1 0 0 ๏€ญ 2 0 ) * I E X 1

25

3.4 Development of hierarchical model A three-level hierarchical decision structure is constructed for this case study and shown in Figure 2. The designed solvents should attain their functionalities while having favourable safety, health and environmental characteristics. Under physicochemical properties, there are four sub-properties naming boiling point (Tb), surface tension (ฯƒ), viscosity (ยต) and

IP T

Hildebrand solubility parameter (ฮด) difference between solvent and carotenoids (Rcarotene). IFL and IEX are the sub-indices chosen for safety aspects whereas IEL and IAH are the sub-indices

SC R

selected for assessing health criterion. There are four sub-properties under environmental criteria, such as aquatic toxicity (LC50), photochemical oxidation potential (PCO), soil

U

sorption coefficient (log Koc) and bioconcentration factor (BCF).

Physical Properties

ยต

ฮดcarotene

M

ฯƒ

Environmental Aspects

Health Aspects

IFL

IEX

IEL

IAH

LC50

PCO

log KOC

BCF

ED

Tb

Safety Aspects

A

N

Solvent Design for Oil Extraction

Figure 2: Hierarchical decision structure for this case study

PT

3.5 Computation of weighting factors from Decision Makerโ€™s value judgement

CC E

The judgement in eliciting the relative importance of each criterion is identified from supporting literature or theory, which will be discussed in this section. The pairwise comparisons among criteria have been done as in standard AHP. However, the degree of

A

confidence is also being taken into account while eliciting the judgements. For instance, when comparing the relative importance between physical properties and safety aspects, questions asked are: โ€œwhich criterion is more important and how much more important is it to satisfy the requirement in designing a solvent for extracting residual oil from PPF?โ€ In addition, the decision maker can choose the degree of confidence when eliciting judgement, 26

such as having high, moderate or low confidence while assigning the intensity of important to the criteria. In this case study, the safety aspect is given the most priority compared to other aspects. This ensures that the designed solvents are inherently safe, and will not bring many adverse effects to the surrounding community if there is any unintentional release. Moreover, by using a less hazardous material in a process, the operational cost can be reduced as the

IP T

number of protection layers used in a plant can be minimised. Thus, the safety aspect of a

molecule is assumed to be moderately more important than physical properties taken together;

SC R

it is also considered to be strongly more important than environmental and health aspects of

the molecule. On the other hand, physical properties are assumed to be strongly more

U

important than environmental and health aspects of the molecules. In order to ensure that the

N

designed solvent will meet the environmental specifications, constraints are imposed on the environmental properties. Table 4 shows the assessment of the relative importance of main

M

A

criteria with respect to the main objective using a fuzzy scale. Note that the ฮป value is larger than zero, denoting that the judgement is consistent in this pairwise comparison matrix.

ED

Table 4: Fuzzy pairwise comparison matrix of main criteria for solvent design on oil extraction

1

CC E

Physical Properties

PT

Physical Properties

Environmental Aspects Safety Aspects

A

Health Aspects

Environmental Safety Aspects Aspects

Health Aspects

Priority Vector

(3, 5, 7)

(1/5, 1/3, 1)

(3, 5, 7)

0.3105

1

(1/7, 1/5, 1/3)

(1/2, 1, 2)

0.0844

1

(3, 5, 7)

0.5206

1

0.0844

ฮป = 0.3384

27

A similar question is asked when carrying out a pairwise comparisons between physical subproperties: โ€œwhich is more important and how much more important is it with respect to a satisfaction of the physical properties of solvent?โ€ The main goal in this case study is to design a solvent whereby carotene degradation rate will be lower during the solvent recovery process. The degradation rate of carotene can be minimised by lowering the boiling point of

IP T

solvent, Tb. Thus, Tb is assumed to be strongly more important than Rcarotene and moderately more important than surface tension and viscosity property. Surface tension and viscosity are

SC R

assumed to be more or less equally important with a high degree of confidence as both properties are crucial in improving solvent diffusivity. Also, surface tension and viscosity are

U

slightly more important than Rcarotene since it is important to ensure that the solvent is able to

N

first diffuse into the matrix of PPF followed by extracting the residual oil which is trapped in

A

the PPF matrix. High surface tension can also impede the penetration of designed solvent into

M

a matrix of PPF, whereas solvent with low viscosity is preferable since part of the extraction process is governed by capillary flow (Johnson and Lusas, 1983). In addition, the diffusion of

ED

residual oil from the matrix of PPF into the designed solvent can be modelled using Equation (32) (Tzia and Liadakis, 2003). From Equation (32), it clearly shows that diffusion of residual

๏€ญ18

(๏ช M

B

)

0 .5

T

where

D AB

is the diffusivity of oil in solvent B,

the temperature, ๏ญ is the solution viscosity,

A

(32)

๏ญ vA

0 .6

CC E

D AB ๏€ฝ

1 1 7 .3 ๏‚ด 1 0

PT

oil depends on the viscosity of solvent.

M

vA

B

is the molecular weight of solvent, T is

is the molar volume of oil and ๏ช is the

association factor for the solvent. The fuzzy pairwise comparison matrix of physical sub-properties is reported in Table 5. The elicited judgement is consistent since ฮป value is bigger than zero.

28

Table 5: Fuzzy pairwise comparison of physical sub-properties for solvent design on oil extraction Surface tension

Viscosity

Priority Vector

1

(3, 5, 7)

(1, 3, 5)

(1, 3, 5)

0.5213

1

(1/3, 1/2, 1)

(1/3, 1/2, 1)

0.1006

1

(1/2, 1, 2)

0.1891

1

0.1891

Solubility Surface tension Viscosity

SC R

ฮป = 0.8786

IP T

Boiling point

Boiling Point Rcarotene

Similar questions are also asked when performing the pairwise comparison for environmental sub-properties. The assumption made in this case study is that the environmental effect

U

caused by oil extraction from PPF using solvent will be similar to that from fresh plant

N

biomass or oilseeds. Moncada et al., (2016) have performed an environmental assessment of

A

the process of extracting essential oil from rosemary and oregano. Based on their report,

M

solvent extraction technology using hexane has the most harmful effect on the environment

ED

compared to supercritical fluid and water distillation technology. Similar results have been reported when calculating the potential environmental impact (PEI) per kilogram of these

PT

essential oils extracted using hexane. This is is attributed to the potential formation of photochemical smog and contribution to aquatic toxicity. Based on the results obtained for

CC E

rosemary oil, the PEI per kilogram of rosemary oil extracted for aquatic toxicity and photochemical oxidation potential is approximately 0.03 and 0.07, respectively (Moncada et

A

al., 2016). Hence, in order to generate a solvent which will significantly minimise the formation of photochemical smog, it is assumed that photochemical oxidation potential is strongly more important than aquatic toxicity.

On the other hand, aquatic toxicity is

considered to be moderately more important than log Koc and BCF. Besides, it is also important to ensure that the generated solvent will have low tendency to accumulate in one

29

place as degradation products with low biodegradability may persist in the environment. Since log Koc is a measure of the tendency of a solvent to accumulate in one place, while BCF is a measure of the tendency of a solvent to accumulate in aquatic organisms, both log Koc and BCF are assumed to be equally important while designing the molecules. In addition, upper bounds have been applied on both log koc and BCF to ensure that the designed solvent

IP T

is not bio-accumulative. The intensity of importance assigned to each property along with the degree of confidence is shown in Table 6.

log Koc

1

(1/7, 1/5, 1/3)

(2, 3, 4)

1

(5, 7, 9)

PCO log Koc

1

A

BCF

Priority Vector

(2, 3, 4)

0.1827

(5, 7, 9)

0.6583

(1/2, 1, 2)

0.0794

1

0.0794

M

ฮป = 0.3009

BCF

U

PCO

N

LC50

LC50

SC R

Table 6: Fuzzy pairwise comparison of environmental sub properties for solvent design on oil extraction

ED

When comparing the relative importance of flammability and explosiveness index, both indices are assumed to more or less equally important. This is because the flammable range

PT

of a material is indicated by the lower explosion limit (LEL) and the upper explosion limit (UEL). The terms lower flammability limit (LFL) and upper flammability limit (UFL) are

CC E

used interchangeably with LEL and UEL (National Fire Protection Association, 2007). It is equally important to prevent fire and explosion hazards in a plant. Table 7 shows that both flammability and explosiveness have equal relative weights after carrying out the fuzzy

A

pairwise comparison considering the degree of confidence of decision maker. Table 7: Fuzzy pairwise comparison of safety sub-index for solvent design on oil extraction

Flammability Explosiveness

Flammability

Explosiveness

Priority Vector

1

(1/3, 1, 3)

0.5

1

0.5 30

ฮป=1

Exposure limit and acute health hazard indices are assumed to be equally important with moderate confidence. The current permissible exposure limit for hexane in Malaysia is relatively low, at 50ppm TWA. Since hexane has high volatility, it is easily inhaled by a

IP T

human, leading to the risk of nerve when there is an increase in exposure time and level. Thus, while designing the alternative solvent to replace hexane, it is necessary to use a penalty

SC R

score for exposure limit. Hexane has relatively low acute toxicity towards rat through

ingestion where their oral rat LD50 is 28000 mg/kg. However, it is also important to consider

U

acute health hazard of a molecule during the design stage to ensure that the designed

N

molecule has low acute toxicity when being accidentally swallowed. The intensity of

A

importance assigned to each health sub-index along with the degree of confidence is shown in

M

Table 8.

Table 8: Fuzzy pairwise comparison of health sub-index for solvent design on oil extraction

Exposure limit

1

Priority Vector

(1/3, 1, 3)

0.5

1

0.5

CC E

ฮป=1

Acute health hazard

PT

Acute health hazard

ED

Exposure limit

The following steps include the calculation of final weighting factors of each sub-property in the overall system. It can be calculated by multiplying weighting factors of sub-property with

A

the weighting factor of its main property. The final weighting factors of target properties in the overall system for this case study are reported in Table 9. Table 9: Final weighting factors of target properties in the overall system Target Property

Final weighting factor

Boiling point

0.1618 31

Surface tension

0.0587

Viscosity

0.0587

LC50

0.0154

PCO

0.0556

log Koc

0.0067

BCF

0.0067

Flammability

0.2603

Explosiveness

0.2603

Exposure limit

0.0507

Acute health hazard

0.0507

IP T

0.0312

SC R

Solubility

U

3.6 Molecular design stage

N

The following step includes the selection of molecular building block based on the molecular

A

structure of the conventionally used solvents in oil extraction process. The chosen molecular

M

groups for this case study include C, CH, CH2, CH3, OH, COOH, CHO, CH-O, CH3CO, CH3O, CH2O, NH2 and CH2=CH. To ensure the formation of a structurally feasible molecule

ED

without having any free bonds, structural constraints expressed in Equation (10) and Equation (12) or Equation (13) are implemented. Equation (10) and Equation (12) are used for

PT

designing acyclic compounds whereas Equation (10) and Equation (13) are utilised for the

CC E

design of monocyclic compounds. 3.7 Optimisation model The target property models, as well as safety and health indices, are first transformed into

A

their property operators as shown in Appendix 6.3. The property operators are represented by the linear combinations of the number of occurrence for a molecular group of type-i and its corresponding contribution. After determining the property operators, property operators are normalised using Equation (14) or Equation (15) depending on whether the target property is to be minimised or maximised. 32

The overall objective function for this case study can be expressed by Equation (31): F w e ig h te d s u m ๏€ฝ w 1 ๏ฌ T ๏€ซ w 2 ๏ฌ ๏ณ ๏€ซ w 3 ๏ฌ ๏ญ ๏€ซ w 4 ๏ฌ R b

๏€ซ w8๏ฌ BCF ๏€ซ w9๏ฌ I

where w 1 , w 2 , โ€ฆ,

w1 2

๏€ซ w10 ๏ฌ I

F L

c a r o te n e

๏€ซ w5๏ฌ LC

๏€ซ w11๏ฌ I

EX

EL

50

๏€ซ w 6 ๏ฌ P C O ๏€ซ w 7 ๏ฌ lo g K

๏€ซ w12 ๏ฌ I

oc

(31)

AH

are the weighting factors determined from FAHP approach.

IP T

The design objective now is to maximise the value of Fweightedsum. This optimisation model is a mixed integer linear programming (MILP) formulation, which allows the global optimum to

SC R

be readily identified without major computational difficulties. Integer cuts are then introduced to produce alternate optimal and near-optimal solutions.

U

3.8 Results and discussions

N

Both acyclic and monocyclic solvents are generated for this case study. Ten acyclic solvents

A

are listed as solvent A1- A10 whereas the ten monocyclic solvents are listed as solvent B1-

M

B10. The molecular structures of the top ten acyclic and monocyclic solvents are displayed in

CH3

H 3C O

O

H 3C

O

Solvent A1: Dimethoxymethane Solvent A2: Propionaldehyde

O

O

CH3

CC E

Solvent A6

PT

O

O

H 3C

CH3

H 2C O

O

Solvent A3: Acrolein

CH3

H 3C

A

H3C

Solvent B6: 2-(2,2-Dimethylpropryl)oxirane

O

H 3C

Solvent A8: Methyl Acetate

Solvent A9: Butyraldehyde

CH3

CH3

H3C H3C

O

Solvent A10: Ethyl Formate

CH3

O

O

CH3

Solvent B2: methylcyclohexane

O H3C

Solvent B4

Solvent B3

H3C

Solvent B7

O

CH3

CH3

H3C

CH3

H 3C

O

Solvent A7

Solvent B5

CH3

O

O CH3

CH3 Solvent A5: Isobutyraldehyde

Solvent A4: Butanone

O

H 3C CH2

O

H3C

O

CH3

H3C

CH3 Solvent B1: 1,2,4 trimethylcyclopentane

H 3C

H 3C

ED

Figure 3.

H3C

O

CH3 H3C

Solvent B8: epoxyhexane

Solvent B9: 1,1,3 Trimethylcyclopentane

H3C

Solvent B10: methoxycyclopentane

Figure 3: Molecular structure of the best ten acyclic and monocyclic solvents

33

Appendix 6.4 shows the properties of the best ten generated acyclic and monocyclic solvents. The generated solutions will be selected based on their rankings. Nevertheless, it should be noted that the generated ranking for each molecule is not absolute, since there will generally be uncertainties in values of target properties and prediction models. In this case, the rankings of molecules represent the potential solutions from a huge search space. When a molecule is

IP T

positioned at a higher ranking, it means that the molecule has a better potential for a specific application. Thus, in a later stage, the molecule can be selected for further verification

SC R

through experiments.

From the results, solvents A1 and B1 have the highest Fweightedsum values which make them

U

ranked the first among all the generated acyclic and monocyclic solvents. The properties of

N

the designed molecules were then compared with that of hexane. The properties of hexane are

A

shown in Appendix 6.6. The results showed that Tb of top three acyclic solvents (solvent A1

M

to A3) and solvent A5, as well as all the ten monocyclic solvents, are lower than that of hexane. Besides, Rcarotene of all generated solvents is smaller than that of hexane which means

ED

that carotenoids can dissolve better in those generated solvents than in hexane. Based on the results, Solvent A1 and solvent B3 have the lowest Rcarotene values among the generated

PT

acyclic and monocyclic solvent. When compared the generated solutions with the existing

CC E

solution in literature, it is found out that solvent A8 (methyl acetate) has been shown to be a suitable solvent for extracting oil from oilseeds. In addition, all the generated solvents have improved properties in terms of soil sorption as their log koc values are lower than that of

A

hexane. In terms of safety aspects, although all the generated solvents are carrying the same safety sub-index (IFL and IEX) scores as that of hexane, their flashpoint properties, especially for acyclic solvents, are slightly better than that of hexane. Solvent A6 and solvent A10 will be the least flammable compared to the other generated solvents as their Fp values are significantly higher. From the point of view of health aspects, the generated monocyclic 34

solvents exert a better characteristic in terms of the permissible exposure limit. All the generated monocyclic solvents obtained a better IEL score compared to that of hexane. Besides, by comparing the overall ISHI scores, solvent B1 and solvent B2 exhibit better safety and health performance compared to all the generated solvents and hexane. In addition, the results were compared to the case where FAHP method is not used for

IP T

assigning the weighting factor to each property. In such case, each property is assumed to

carry the same weighting factor and the results are displayed in Appendix 6.5. From the

SC R

obtained results, it shows that there is an abrupt change in the ranking of molecules for both

acyclic and monocyclic solvents. For acyclic solvents, solvent A6 and solvent A8 which

U

ranked the 6th and 8th place have become the 2nd and 4th best solution respectively in the case

N

where all the property is assumed to carry the same weighting factor. This may be caused by

A

the weighting factor assigned to the boiling point is significantly different. When using the

M

proposed FAHP method, the relative importance of each property can be known and Tb is carrying a weighting factor of 0.1618. On the other hand, when assuming all the properties

ED

are having an equal weighting factor, a weighting factor of 0.08333 is assigned to Tb. This may be the reason why solvent A6 and A8 which were having significantly higher Tb values

PT

were ranked the 2nd and 4th place. For monocyclic solvents, solvent D5 and D9 have dropped

CC E

out of top ten solvents when proper weighting factor is assigned by using FAHP approach. Thus, it shows that the molecular ranking is sensitive to the weighting factor of each property. It is important to evaluate the relative importance of each property to assign a proper

A

weighting factor to each property as different weighting factor will generate different optimal solutions. 4. Conclusion In this work, a systematic multi-objective CAMD framework using FAHP weighting approach has been developed for the optimal design of molecules with multiple properties. 35

This proposed methodology quantifies the subjective judgement of a decision maker through a fuzzy scale that considers their degree of confidence when eliciting the relative importance of each property. With the use of this methodology, the process of evaluating the relative importance of product properties can be done more systematically. Hence, a more precise relative weighting factor which reflects the preferences of decision-makers can be assigned to

IP T

each criterion. A case study on solvent design for extracting residual oil from palm pressed fibre is solved to illustrate the proposed methodology. The results obtained show the best

properties as well as safety, environmental and health aspects.

SC R

balance of performance for criteria from different categories such as physicochemical

U

The assumption made in FAHP approach is that all the criteria involved are independent of

N

each other. However, in practice the relationship among criteria is usually complex, and there

A

might be interdependencies. Thus, future work can be conducted by extending the multi-

M

objective CAMD methodology to capture such interdependencies among various criteria involved in molecular design. For instance, business or economic aspects may conflict with

ED

SHE considerations. Even then, it remains crucial to consider the cost factor during the

PT

molecular design stage to identify the optimal process-product design. Acknowledgements

CC E

The financial support from the Ministry of Higher Education, Malaysia through the LRGS Grant (LRGS/2013/UKM-UNMC/PT/05) is gratefully acknowledged.

A

5. References

Austin, N.D., Sahinidis, N. V, Trahan, D.W., 2016. Computer-aided molecular design: An introduction and review of tools, applications, and solution techniques. Chem. Eng. Res. Des. 116, 2โ€“26. Bockisch, M. (Ed.), 1998. Chapter 5 - The Extraction of Vegetable Oils, in: Fats and Oils Handbook. AOCS Press, pp. 345โ€“445. Chang, D.-Y., 1996. Applications of the extent analysis method on fuzzy AHP. Eur. J. Oper. Res. 95, 649โ€“655. 36

Chen, S., Fan, J., 2011. Measuring Corporate Social Responsibility Based on a Fuzzy Analytical Hierarchy Process. Int. J. Comput. Netw. Inf. Secur. 3, 13โ€“22. Choo, Y.-M., Yap, S.-C., Ooi, C.-K., Ma, A.-N., Goh, S.-H., Ong, A.S.-H., 1996. Recovered oil from palm-pressed fiber: A good source of natural carotenoids, vitamin E, and sterols. J. Am. Oil Chem. Soc. 73, 599โ€“602. de Oliveira, R.C., de Barros, S.T.D., Gimenes, M.L., 2013. The extraction of passion fruit oil with green solvents. J. Food Eng. 117, 458โ€“463.

IP T

Diwekar, U.M., 2003. Introduction to Applied Optimization. Kluwer Academic Publishers, The Netherlands.

SC R

El-Halwagi, M.M., 2011. Sustainable Design Through Process Integration: Fundamentals and Applications to Industrial Pollution Prevention, Resource Conservation, and Profitability Enhancement. Elsevier Science. Harper, P.M., Gani, R., 2000. A multi-step and multi-level approach for computer aided molecular design. Comput. Chem. Eng. 24, 677โ€“683.

U

Hassim, M.H., Hurme, M., 2010. Inherent occupational health assessment during process research and development stage. J. Loss Prev. Process Ind. 23, 127โ€“138.

N

Heikkilรค, A.M., 1999. Inherent safety in process plant design. An index-based approach. VTT Publ.

A

Johnson, L.A., Lusas, E.W., 1983. Comparison of alternative solvents for oils extraction. J. Am. Oil Chem. Soc. 60, 229โ€“242.

ED

M

Khor, S.Y., Liam, K.Y., Loh, W.X., Tan, C.Y., Ng, L.Y., Hassim, M.H., Ng, D.K.S., Chemmangattuvalappil, N.G., 2017. Computer Aided Molecular Design for alternative sustainable solvent to extract oil from palm pressed fibre. Process Saf. Environ. Prot. 106, 211โ€“223.

PT

Leong, Y.T., Tan, R.R., Aviso, K.B., Chew, I.M.L., 2016. Fuzzy analytic hierarchy process and targeting for inter-plant chilled and cooling water network synthesis. J. Clean. Prod. 110, 40โ€“53. Ma, T., Wang, Q., Larraรฑaga, M.D., 2013. Correlations for estimating flammability limits of pure fuels and fuel-inert mixtures. Fire Saf. J. 56, 9โ€“19.

CC E

Marrero, J., Gani, R., 2001. Group-contribution based estimation of pure component properties. Fluid Phase Equilib. 183โ€“184, 183โ€“208.

A

Mattei, M., Kontogeorgis, G.M., Gani, R., 2014. A comprehensive framework for surfactant selection and design for emulsion based chemical product design. Fluid Phase Equilib. 362, 288โ€“299. Moncada, J., Tamayo, J.A., Cardona, C.A., 2016. Techno-economic and environmental assessment of essential oil extraction from Oregano (Origanum vulgare) and Rosemary (Rosmarinus officinalis) in Colombia. J. Clean. Prod. 112, Part, 172โ€“181. National Fire Protection Association, 2007. Standard System for the Identification of the Hazards of Materials for Emergency Response, National Fire Protection Association, Quincy. Ng, L.Y., Chemmangattuvalappil, N.G., Ng, D.K.S., 2014. A Multiobjective Optimization37

Based Approach for Optimal Chemical Product Design. Ind. Eng. Chem. Res. 53, 17429โ€“17444. Papadopoulos, A.I., Badr, S., Chremos, A., Forte, E., Zarogiannis, T., Seferlis, P., Papadokonstantakis, S., Galindo, A., Jackson, G., Adjiman, C.S., 2016. Computer-aided molecular design and selection of CO2 capture solvents based on thermodynamics, reactivity and sustainability. Mol. Syst. Des. Eng. 3, 1645โ€“1669. Papadopoulos, A.I., Linke, P., 2006. Multiobjective molecular design for integrated processโ€ solvent systems synthesis. AIChE J. 52, 1057โ€“1070.

IP T

Papadopoulos, A.I., Stijepovic, M., Linke, P., 2010. On the systematic design and selection of optimal working fluids for Organic Rankine Cycles. Appl. Therm. Eng. 30, 760โ€“769.

SC R

Pistikopoulos, E.N., Stefanis, S.K., 1998. Design of chemical compoundsOptimal solvent design for environmental impact minimization. Comput. Chem. Eng. 22, 717โ€“733. Promentilla, M.A.B., Aviso, K.B., Tan, R.R., 2014. A Group Fuzzy Analytic Network Process to Prioritize Low Carbon Energy Systems in the Philippines. Energy Procedia 61, 808โ€“811.

U

Saaty, T.L., 1977. A scaling method for priorities in hierarchical structures. J. Math. Psychol. 15, 234โ€“281.

N

Satty, T.L., 1980. The analytical hierarchy process: planning, priority setting, resource allocation. RWS Publ. Pittsbg.

M

A

Seth, R., Mackay, D., Muncke, J., 1999. Estimating the Organic Carbon Partition Coefficient and Its Variability for Hydrophobic Chemicals. Environ. Sci. Technol. 33, 2390โ€“2394.

ED

Tan, J., Low, K.Y., Sulaiman, N.M.N., Tan, R.R., Promentilla, M.A.B., 2016. Fuzzy analytic hierarchy process (FAHP) for multi-criteria selection of microalgae harvesting and drying processes. Clean Technol. Environ. Policy 18, 2049โ€“2063.

PT

Tan, R.R., Aviso, K.B., Huelgas, A.P., Promentilla, M.A.B., 2014. Fuzzy AHP approach to selection problems in process engineering involving quantitative and qualitative aspects. Process Saf. Environ. Prot. 92, 467โ€“475.

CC E

Ten, J.Y., Hassim, M.H., Chemmangattuvalappil, N., Ng, D.K.S., 2016a. A novel chemical product design framework with the integration of safety and health aspects. J. Loss Prev. Process Ind. 40, 67โ€“80. Ten, J.Y., Hassim, M.H., Ng, D.K.S., Chemmangattuvalappil, N.G., 2016b. A molecular design methodology by the simultaneous optimisation of performance, safety and health aspects. Chem. Eng. Sci. 159, 140โ€“153.

A

Tseng, M.L., Lin, Y.H., Chiu, A.S.F., 2009. Fuzzy AHP-based study of cleaner production implementation in Taiwan PWB manufacturer. J. Clean. Prod. 17, 1249โ€“1256. Tzia, C., Liadakis, G., 2003. Extraction Optimization in Food Engineering, Food Science and Technology. CRC Press. Yang, M., Khan, F.I., Sadiq, R., 2011. Prioritization of environmental issues in offshore oil and gas operations: A hybrid approach using fuzzy inference system and fuzzy analytic hierarchy process. Process Saf. Environ. Prot. 89, 22โ€“34. Zadeh, L.A., 1965. Fuzzy sets. Inf. Control 8, 338โ€“353. 38