Exploration of a new consumer test method based on metacognitive certainty

Exploration of a new consumer test method based on metacognitive certainty

Journal Pre-proofs Exploration of a new consumer test method based on metacognitive certainty In-Ah Kim, Ha-Yeon Cho, Michael J. Hautus, Hye-Seong Lee...

925KB Sizes 0 Downloads 8 Views

Journal Pre-proofs Exploration of a new consumer test method based on metacognitive certainty In-Ah Kim, Ha-Yeon Cho, Michael J. Hautus, Hye-Seong Lee PII: DOI: Reference:

S0950-3293(19)30623-8 https://doi.org/10.1016/j.foodqual.2019.103857 FQAP 103857

To appear in:

Food Quality and Preference

Received Date: Revised Date: Accepted Date:

9 August 2019 20 November 2019 20 November 2019

Please cite this article as: Kim, I-A., Cho, H-Y., Hautus, M.J., Lee, H-S., Exploration of a new consumer test method based on metacognitive certainty, Food Quality and Preference (2019), doi: https://doi.org/10.1016/j.foodqual. 2019.103857

This is a PDF file of an article that has undergone enhancements after acceptance, such as the addition of a cover page and metadata, and formatting for readability, but it is not yet the definitive version of record. This version will undergo additional copyediting, typesetting and review before it is published in its final form, but we are providing this version to give early visibility of the article. Please note that, during the production process, errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

© 2019 Published by Elsevier Ltd.

1

Exploration of a new consumer test method based on metacognitive certainty

2 3 4

IN-AH KIM1, HA-YEON CHO1, MICHAEL J. HAUTUS2, and HYE-SEONG LEE1*

5 6 7 8

1Department

of Food Science and Technology, College of Engineering,

Ewha Womans University, Seoul 03760, South Korea 2School

of Psychology, The University of Auckland, New Zealand

9 10 11 12

* Corresponding Author: HYE-SEONG LEE

13

Department of Food Science and Technology, College of Engineering

14

Ewha Womans University, 03760, South Korea

15

Email: [email protected]

16

Tel.: 82-2-3277-6687

17

Fax: 82-2-3277-6687

19

Abstract

20

Successful product development and marketing necessitate a study of the consumer concept

21

of culture-specific or deep-positioned branded food. In this study, a new consumer test

22

method was designed based on an authenticity test and used as a reference frame for the

23

target concept without an upsetting story. The response format of this method included the

24

metacognitive certainty response following the sensory authenticity response using the A-Not

25

A test procedure. The method was applied to study the concept of goso flavor, as perceived

26

by 91 female consumers with three commercial soymilk products, having each consumer

27

evaluate each product 45 times over three days. The repeated responses of sensory

28

authenticity were analyzed as mean scores and signal detection theory (SDT) d-prime (d')

29

values of the product difference. From the metacognitive certainty responses after the sensory

30

authenticity response, a new quantitative group measure of d-prime metacognition (d'MC) was

31

calculated in the SDT context and compared with the other outputs. The measure ranged from

32

negative to positive values, indicating a mismatch to a match for the concept of each product.

33

Data analyses were conducted on both pooled data and segmented data, which was driven

34

from the results of cluster analyses using the mean sensory authenticity scores and SDT C

35

values (estimates of response bias about the concept tested). The results showed that d'MC of

36

each product corresponded to the mean scores and d' with the advantage of easy

37

interpretation. Overall, d'MC can be a useful group measure for studying the consumer concept

38

towards food and beverages.

39

Keywords: authenticity test; concept measurement; metacognition; signal detection theory

40

(SDT); consumer research

41

1. Introduction

42

Measurements of the consumer concept are important for achieving a range of

43

business objectives, such as product development, quality control, and marketing in the food

44

and beverage industry (Lee & O’Mahony, 2005). Consumer concepts can be defined as the

45

constructions and associations between the identity of a particular object and other conceptual

46

associations that allow people to interpret, understand, and assign meaning to a perceptual

47

space (Thomson, Crocker, & Marketo, 2010; Stolzenbach, Bredie, Christensen, & Byrne,

48

2016). Consumers who consume local or ethnic cuisine in their everyday life create a concept

49

of its typical taste or brand. Such sensory concepts of a typical taste of certain food can be

50

obtained without the consumers’ awareness or effort and remain in implicit memories

51

(Frandsen et al., 2003). As heavy users also have built the concept of their most loved brand,

52

the quantitative consumer concept measurement is becoming more critical for a range of

53

well-branded processed foods and foods with strong nationality.

54

The authenticity test has been applied to discriminate different brands or treatments

55

of foods as an affect-based discrimination test method (Mojet & Köster, 1986; Köster, 1998;

56

Frandsen et al., 2003, 2007; Boutrolle et al., 2009). Mojet and Köster (1986) conducted an

57

authenticity test on consumers of a particular Dutch brand beer with different brands of beer

58

by applying an emotional story for the beer brewing process to increase the consumers’

59

involvement in and attention to the test. They found that consumers improved their ability to

60

discriminate among different brands of beer with an implicitly learned reference in the

61

authenticity tests, while consumers did not discriminate among the same set of beers in the

62

repeated preference-ranking test. In the authenticity test (Köster, 1998), there is no right or

63

wrong answer; the consumers’ implicitly learned reference was used to evaluate the subtle

64

differences among the products. The outcome of the authenticity tests, where consumers have

65

to report whether the given product is ‘authentic’ (‘real’ or ‘original’) or not by a comparison

66

with their internal reference, indicates whether the consumers have a stable internal concept

67

for a typical taste or brand of the particular product. Therefore, the authenticity test can be

68

expanded to evaluate the sensory authenticity of national or local foods and beverages.

69

In authenticity tests, an upsetting story regarding the products is presented to

70

consumers to evoke the consumers’ emotional mindset (i.e., a negative affective attitude

71

towards inauthentic products) as emotional priming. Frandsen et al. (2003, 2007) applied

72

authenticity tests for Danish consumers by asking them to identify milk samples as either

73

Danish and foreign from among a set of milk samples that had undergone different treatments

74

(i.e., animal feed types and storage length). Consumers were told the following upsetting

75

story to provoke a suspicious mindset before the test: “We want to investigate the possibility

76

of selling fresh, foreign milk on the Danish market. This will only be possible if the

77

consumers accept the product, and if it tastes like Danish milk. Practically, there are no

78

barriers to get fresh, foreign milk in the Danish shops. Even though the purchase of the

79

product will be cheaper for the retailer, the price will be the same for the consumers”. They

80

reported that after applying the authenticity tests, which relies on affective responses, the

81

consumers’ ability to perceive subtle flavor differences were sharpened.

82

Another application of using a sensory evaluation methodology for measuring the

83

consumer concept about the sensory authenticity may include affective discrimination tests

84

used to test the consumer discriminability of re-formulated products from the original using

85

variants of the reminder designs, including duo-trio tests (Kim, Chae, van Hout, & Lee, 2014;

86

Jeong, Kang, et al., 2016; Kim & Lee, 2015; Kim, Yoon, & Lee, 2015; Kim, Dessirier, van

87

Hout, & Lee, 2015; Kim, Sim, & Lee, 2015, 2016; Jeong, van Hout, Groeneschild, & Lee,

88

2017; Kim, van Hout, Dessirier, & Lee, 2018). In the applications of overall difference tests,

89

subtle sensory differences between products were measured by applying affective reference

90

frames for the original product to evoke the consumers’ internal concept regarding the

91

original product. Therefore, although these methods did not incorporate the negative

92

emotional involvement of the subjects, they were similar to the authenticity tests in terms of

93

an evaluation scheme for comparing the test product to the given original (i.e., reference)

94

product. These sensory discrimination methods using reminder designs were more sensitive

95

to other overall difference test methods using balanced-reference designs, such as same-

96

different and triangle and tetrad methods (Kim, et al., 2014; Jeong, Kang, et al., 2016; Jeong,

97

van Hout, et al., 2017). When the reference does not need to be presented physically but only

98

retained in the affective state of mind with positive framing, the A-Not A test format, which

99

is procedurally the same as authenticity tests, could be used as an alternative format for

100 101

sensory authenticity tests to measure the consumer concept. On the other hand, in addition to the authenticity tests and A-Not A reminder designs,

102

the general test design to study the consumer concept has been to use a range of formats of

103

single-step simple scaling (Lee & O’Mahony, 2005). Simple scaling provides a numerical

104

estimation for each product and allows quantitative comparisons to be made across different

105

products. Thus, it is used widely to study various concepts regarding food and beverages.

106

However, when applied to consumer tests, the pooled data generated by consumers who are

107

not trained would be expected to yield a large boundary variance, leading to lower

108

discrimination sensitivity (Lee & O’Mahony, 2005). In this regard, for both authenticity tests

109

and sensory discrimination tests, Signal Detection Theory (SDT; Green & Swets, 1966;

110

Macmillan & Creelman, 2005) and Thurstonian modeling (TM; Thurstone, 1927; Ennis, 1990)

111

have been applied broadly (i.e., Frandsen et al., 2007; Jeong et al., 2016; Stocks et al., 2017)

112

to measure the discrimination sensitivity independently from the response bias. Therefore,

113

these approaches have been extended to measure various consumer concepts and sensory

114

perceptions, such as softness (Seo, 2013), taste fullness (Kim, 2015; Lee, 2017), product-

115

benefit (Kim, Dessirier, van Hout, & Lee, 2015; Kim, van Hout, Dessirier, & Lee, 2018), and

116

holistic product usage experience (Kim, Hopkinson, van Hout, & Lee, 2017a,b). One of the

117

SDT and TM parameters, the d-prime (d') estimate, can be obtained from the equal-variance

118

normal-distribution model, and it has been accepted broadly as a sensory distance measure.

119

This indicates the difference in the perceived strengths of the sensory or conceptual

120

properties between different products when assuming one product as the signal level and the

121

other as the noise level. Therefore, it always necessitates responses to the reference, which

122

does not always exist in practice, to set a noise distribution, and it cannot provide an

123

independent index for each product like the mean scores of scaling (Kim, Hopkinson, et al.,

124

2017b). Extending the SDT approach to drive quantitative outputs, such as d' measures, based

125

solely on the certainty and solidity of the consumer’s internally built concept regarding the

126

sensory authenticity and computed for each product without the need to provide a reference

127

would be very useful.

128

Therefore, a new consumer test method was designed to measure the consumer

129

concepts, including sensory authenticity, using the A-Not A test design with a reference

130

frame for the target concept without an upsetting story. This test design was modified further

131

by adding the metacognitive certainty response following the sensory authenticity response to

132

explore a means of calculating the consumer group measure for each product in the SDT and

133

TM context. In the present study, the output index proposed as a new quantitative group

134

measure driven from an analysis of the metacognitive certainty responses was called d-prime

135

metacognition (d'MC), and compared with the mean scores for each product and d' estimates

136

for the sensory differences between products, which were calculated from the repeated

137

responses of the sensory authenticity only. Metacognition is the ability to think about our

138

own thoughts and knowledge and to recognize our own cognitive processing, i.e., people can

139

assess retrospectively whether they provide the right decision or not, in perceptual or memory

140

tasks (Fleming & Lau, 2014; Massoni, Gajdos, & Vergnaud, 2014; Grimaldi, Lau, & Basso,

141

2015). Thus, in the present study, it was investigated whether the pooled metacognitive

142

certainty responses, which are a group of consumers’ confidence judgements on the sensory

143

authenticity response to a given sensory concept, can represent the certainty or solidity of

144

consumer concept for each product.

145

The chosen concept studied in the present study was the goso flavor; three

146

commercial soymilk products were evaluated. The goso flavor is one of the representative

147

flavoring ingredients in Korean cuisine (Cha et al., 2010). Therefore, Korean consumers who

148

consume soymilk habitually have built an internal concept of goso flavor for soymilk. As the

149

segmentation issue is very important in a consumer study, data analyses were conducted to

150

analyze pooled and segmented data, which was driven from the results of cluster analyses

151

using the mean sensory authenticity scores and estimates of the response bias about the

152

concept tested (SDT C values).

153 154

2. Materials and Methods

155

2.1. Test products

156

Seven soymilk products commercially available at local supermarkets in South Korea

157

were used. Among these seven products, three (coded as P1, P2, and P3), which represent the

158

salient sensory characteristics of commercial soymilk products, were selected for use in the

159

main test. The remaining four were used in the practice test. The products were presented in

160

counterbalanced order to eliminate any carry-over effects and were served monadically in a

161

random order. All products were stored at room temperature (22.5 ± 0.5°C) and served in 20

162

ml aliquots in disposable plastic cups (3-oz, Lotte Aluminum Co. Ltd., Seoul, South Korea)

163

with 3-digit random numbers on their labels. The plastic cups were wrapped in opaque paper

164

to prevent any bias from the color of the product. All tests were conducted under fluorescent

165

lighting conditions (light intensity 650-720 Lux) in separate sensory booths.

166 167

2.2. Consumers

168

Ninety-one female consumers (age range: 20-34 years) were recruited through an

169

online posting at Ewha Womans University (Seoul, South Korea). The consumers usually

170

consumed soybean products, including soymilk, and had a concept as to what constitutes the

171

typical goso flavor of fresh soybean. All consumers were instructed not to gargle or eat any

172

food except for water for at least one hour before the test.

173

The Institutional Review Board of Ewha Womans University approved the ethical

174

and safety aspects of this study. All consumers provided informed consent after being

175

explained the test procedures, and they received 20,000 won (~US$17) for their participation

176

after completing the test.

177 178

2.3. Experimental design and test procedures

179

The experiment consisted of three stages: 1) delivering the cover story, 2) practice

180

test, and 3) main test. First, to evoke their affective state of mind, all consumers received the

181

following cover story: “We will present three soymilk products that were manufactured by

182

different processes. Thus, you will be able to perceive the goso flavor differently. Please

183

evaluate the degree of perceived goso flavor by recalling the flavor driven by the freshness of

184

soybean”. A practice test was next conducted with the four practice products to enable the

185

consumers to experience a wide range of goso flavors and reinforce their evaluative criteria

186

relevant to the goso flavor, as well as to gain familiarity with the test procedures for the main

187

test. The main test was conducted over three days separated by a one-week interval. On each

188

day, the test was split into five sessions, with five-minute breaks between sessions. In each

189

session, all three products were evaluated three times per consumer. Therefore, from the main

190

test, each consumer provided 45 rating responses (three days x five sessions/day x three times

191

= 45) for each product, as shown in Table 1.

192 193 194

Table 1 SHOULD BE ABOUT HERE (across two columns in a portrait layout)

195 196

A new consumer test method, the metacognitive authenticity test method, which

197

included the metacognitive certainty response following the sensory authenticity response,

198

was first designed, as shown in Fig. 1. To discriminate samples with subtle differences better,

199

the metacognitive authenticity test method was applied to a series of two sequential rating

200

options and used in the practice and main tests (Fig. 2). First, the consumers were instructed

201

to rate the degree of the perceived goso flavor of each product using a 6-point category scale

202

(“not goso”, “slightly not goso”, “unsure but probably not goso”, “unsure but probably goso”,

203

“slightly goso”, and “goso”), which was regarded as the sensory authenticity responses.

204

Subsequently, they were asked to rate the degree of certainty to their response on the previous

205

sensory authenticity response to the goso flavor of each product using a 4-point category

206

scale (“uncertain”, “slightly uncertain”, “slightly certain”, and “certain”), which was regarded

207

as the metacognitive certainty responses. These rating responses were collected on a

208

computer using E-prime 2.0 software (Psychology Software Tools, Inc.).

209

During the tests, the consumers were instructed to rinse their mouths with de-ionized

210

water (22.5 ± 0.5°C) before starting each set of sub-sessions and with lemon-added de-

211

ionized water (22.5 ± 0.5°C) after every sub-session. Re-tasting and changing the response

212

were not permitted. All consumers were asked to swallow the first sip of the product, and

213

they were allowed to expectorate the rest of the stimulus as desired.

214

After completing the evaluation of the main test, a socio-demographic survey was

215

also administered (data were not published). The experiment lasted for approximately one

216

hour.

217

218 219 220 221

Fig. 1 SHOULD BE ABOUT HERE (across two columns in a portrait layout) Fig. 2 SHOULD BE ABOUT HERE (across two columns in a portrait layout)

222 223

2.4. Data analysis

224

2.4.1. Data analysis of pooled data

225

Two sequential rating responses were collected from the metacognitive authenticity

226

test method: sensory authenticity, and metacognitive certainty responses (Fig. 2). The mean

227

scores and d' were calculated using the pooled data of sensory authenticity responses. In

228

addition, a new quantitative group measure of d'MC was calculated using the pooled data of

229

the metacognitive certainty responses.

230

ANOVA was conducted to analyze the sensory authenticity responses by treating the

231

products as a fixed factor and consumers as a random effect, and a Tukey’s test was

232

conducted for multiple pairwise comparisons at a significance level of 5%. XLSTAT

233

software (Addinsoft, 2019) was used for data analysis.

234

Next, a standard SDT approach was also applied to measure the sensory differences

235

between a pair of stimuli using the sensory authenticity responses. The degree of difference

236

in the perceived goso flavor between the two stimuli was calculated in terms of the d'

237

estimate. As shown in Fig. 2a, the signal, and noise distributions were decided based on the

238

response frequency for the authenticity ratings: S1 has higher response frequencies for

239

‘goso(merging the responses to “unsure but probably goso”, “slightly goso” and “goso”)’

240

than S2 in a pair. Thus, S1 was considered a signal distribution, whereas S2 was considered a

241

noise distribution. Therefore, three product pairs were constructed and are denoted as P2 ‒ P1,

242

P2 ‒ P3, and P1 ‒ P3, suggesting that the first and second products were assumed to be the

243

signal and noise distributions, respectively (signal ‒ noise). From this, the d' estimates for

244

each of the three product pairs were calculated assuming normal distributions with equal

245

variance, as described by O’Mahony (1992) or Lee and van Hout (2009).

246

To analyze the metacognitive certainty responses, a new quantitative group measure

247

of d'MC was applied in the context of SDT and TM approaches. In Fig. 2b, for each stimulus,

248

the responses to ‘goso’ were treated as a signal, whereas those to ‘not goso’ were treated as a

249

noise distribution. This is based on the idea of Type 2 sensitivity, where the metacognition is

250

calculated from the consumers’ confidence judgments on both correct and incorrect responses

251

(Galvin, Podd, Drga, & Whitmore, 2003; Higham, Perfect, & Bruno, 2009; Macmillan &

252

Creelman, 2005; Maniscalco & Lau, 2012). Compared to the d' estimates, constructing the

253

response data matrix does not require any physical noise level. Therefore, it can be computed

254

for each stimulus, not for each pair of stimuli. Thus, the d'MC estimates for each soymilk

255

product (P1, P2, and P3) were computed, as were the mean scores.

256

Both d' and d'MC represent a group mean measure calculated from the pooled

257

responses of a group of consumers. A nonparametric approach was used for their calculation

258

because it does not require any assumptions regarding the consumers’ response distributions,

259

and it is applicable for relatively small sample sizes (Macmillan & Creelman, 2005; Lee &

260

van Hout, 2009; Bi et al., 2013; Kim et al., 2017b). Moreover, d' and d'MC were used instead

261

of R-index or other types of nonparametric measure because the concept of d' estimates has

262

recently been accepted widely as a measure of difference in the psychological and sensory

263

science field (Ennis, Rousseau, & Ennis, 2014; Macmillan & Creelman, 2005; Kim et al.,

264

2017b). Based on Green’s area theorem (Green and Swets, 1966), R-index was converted

265

to d' and d'MC using the equation shown in Fig. 2, where Φ-1(.) denotes the inverse standard

266

normal transform. The theorem states that the R-index in the A-Not A test (a monadic

267

test) equals the proportion of correct responses (correctly identifying the signal) by an

268

unbiased observer in a hypothetical paired comparison task. The SDT Assistant program

269

(Hautus, 2014) was used to calculate the d' and d'MC estimates, as well as their estimates of

270

the variance and standard errors through the bootstrap method (bootstrap replications: 10000).

271

The R/S-Plus code, i.e., dstest, developed by the Sensometrics Research and Service and

272

authorized by Bi (2006) was used to test the significance of the differences in multiple d' and

273

d'MC estimates.

274 275

2.4.2. Data analysis of segmentation and segmented data

276

Consumer segments were established based on the concept of the sensory

277

authenticity of the goso flavor of soymilk products using cluster analysis with the following

278

two different types of data of segmentation variables:

279 280

(1) mean sensory authenticity scores – the repeated sensory authenticity responses of each consumer were averaged into the mean sensory authenticity scores

281

(2) SDT C values – estimates of the response bias about the concept tested were

282

calculated into C values using a cumulative link model function in the R-package ordinal

283

(Christensen, Cleaver, & Brockhoff, 2011; Christensen, 2015). Data correction was applied to

284

avoid the problem of zero responses (empty cells) in the middle two categories by adding one

285

to each of two categories in the middle of the response categories, similar to the log-linear

286

correction in the estimation of d' (Hautus, 1995).

287

These two data sets for cluster analysis were arranged in 91 rows and three columns

288

regarding the consumers and products. Agglomerative hierarchical clustering (dissimilarity

289

proximity type, Ward’s method, and Euclidean distance) was firstly applied, and three

290

clusters were established based on the dendrogram plot. K-means cluster analysis was then

291

conducted to classify the consumers based on the criterion of the Determinant (W). XLSTAT

292

software (Addinsoft, 2019) was used for cluster analysis.

293

For the segmented data driven from the results of cluster analysis using the mean

294

sensory authenticity scores, the mean scores were calculated, while for the segmented data

295

driven from the results of the cluster analysis using SDT C values, the d' and d'MC estimates

296

were calculated according to the data analysis process mentioned above.

297 298

3. Results

299

3.1. Comparison of the results obtained from pooled data collected from the metacognitive

300

authenticity test method

301

The mean scores for each product and the d' estimates for each product pair were

302

calculated using the sensory authenticity responses, and are listed in Tables 2 and 3,

303

respectively. The results showed that P2 had the highest score of the three products. P2 ‒ P3

304

had the largest difference compared to the other product pairs, but it was less than one (Table

305

2).

306

The degree of differences in the perceived goso flavor between two products was

307

calculated in terms of the d' estimates, and all three d' estimates were significantly greater

308

than zero (Table 3). This suggests that the product assumed to be signal was perceived to

309

have more goso flavor than the product assumed to be noise based on the 95% confidence

310

interval. For example, d' estimates P2 ‒ P1 and P2 ‒ P3 were significantly greater than zero,

311

suggesting that P2 can be more perceived to have goso flavor than P1 and P3. A comparison

312

of the differences between the d' estimates revealed no significant difference (Table 3). On

313

the other hand, the results showed that the difference between P2 and P3 (P2 ‒ P3) was the

314

largest of the three product pairs and was the same as the results shown in Table 2.

315

A new quantitative group measure of d'MC was also applied to analyze the

316

metacognitive certainty responses for each product, and the results are listed in Table 4. The

317

d'MC estimate can be interpreted differently depending on the significance testing with zero: 1)

318

significantly lower than zero (negative value), indicating a mismatch between sensory

319

perception and concept; or 2) significantly greater than zero, indicating that sensory

320

perception is matched to the concept. Therefore, the results shown in Table 4a suggest that

321

the sensory perception of P2, but not P3, was matched to the concept. One of the advantages

322

of d'MC is that it can be calculated for each product like the mean scores because it does not

323

require a physical noise level. Therefore, when comparing with the results of mean scores

324

shown in Table 2, the d'MC provided similar results to the mean score including the significant

325

differences between products. Moreover, as shown in Table 4b, the differences in the d'MC

326

estimates of the two products were also calculated to compare with the results of the d'

327

estimates (Table 3). The results showed that the difference between P2 and P3 (P2 ‒ P3) was

328

largest among the three product pairs, as shown in Table 3, indicating that the d'MC estimates

329

can predict the results of the difference between two products.

330 331 332

Table 2 SHOULD BE ABOUT HERE (in a single column in a portrait layout)

333 334 335 336

Table 3 SHOULD BE ABOUT HERE (in a single column in a portrait layout)

337 338

Table 4 SHOULD BE ABOUT HERE (across two columns in a portrait layout)

339 340

3.2. Consumer segmentation based on differences in consumers’ concept perception and

341

comparison of results obtained from the segmented data

342

From cluster analysis, three different consumer segments were identified based on

343

the different sensory concepts of goso flavor of three soymilk products. Two different cluster

344

analysis approaches using the mean sensory authenticity scores and SDT C values resulted in

345

identical results, except that only one consumer who classified Group 1 when using the mean

346

sensory authenticity scores and Group 3 when using the C value for cluster analysis. From the

347

results of cluster analysis, the mean scores for the groups driven from the results of cluster

348

analysis were calculated using the mean sensory authenticity scores, as listed in Table 5. The

349

d' and d'MC estimates were also calculated for the groups driven from the results of cluster

350

analysis using the SDT C values and are listed in Tables 6 and 7, respectively.

351

As shown in Table 5, the overall pattern of scores for the consumer segments was

352

different from the results of the overall consumers shown in Table 2, highlighting the

353

importance of consumer segmentation when studying consumer perception on sensory

354

concepts. In addition, P2 had the highest scores of the three products in Group 1 and 3,

355

whereas the consumers in Group 2 gave high scores for P1 and low scores for P2. The

356

product with the lowest score was different in all three groups.

357

The d' estimates for P1 ‒ P3 in Group 1 and P2 ‒ P1 in Group 2 were significantly

358

lower than zero, indicating that the sensory perceptions of the products assumed to be a noise

359

level were perceived to have more goso flavor than those assumed to be a signal level (Table

360

6). Moreover, the d' estimates for each group showed that the order of the size of the

361

differences among the three product pairs was different from the results shown in Table 3.

362

For example, in the case of P2 ‒ P1, the d' estimate was 0.311 in Table 3, suggesting that

363

these two products had only a slight difference in the degree of sensory perception. After

364

cluster analysis, however, the d' estimates of P2 ‒ P1 were more likely to increase in all three

365

groups (Table 6). A comparison with the mean scores in Table 5 revealed some differences.

366

For example, the d' estimates of P2 ‒ P3 in Group 2 were approximately zero, indicating that

367

there was no significant difference between these two products (Table 6), whereas P3 had

368

significantly higher scores than P2 (Table 5).

369

An inspection of the d'MC estimates for each consumer group (Table 7a) revealed the

370

sensory perception of P2 to be matched to the concept in the Groups 1 and 3, whereas it was

371

not matched to the concept in Group 2. On the other hand, P1 was perceived as being

372

matched to the concept by the consumers in Groups 2 and 3, not in Group 1. The results for

373

the d'MC estimates were also compared with the results of mean scores shown in Table 5. The

374

d'MC estimates provided identical results only in the case of Group 1. A comparison of the

375

difference in the d'MC estimates for two products (Table 7b) with the d' estimates in each

376

consumer group (Table 6) revealed similar results to those of d' estimates in terms of the

377

order and direction of size of the difference between the product pairs, as in the pooled data.

378 379 380

Table 5 SHOULD BE ABOUT HERE (in a single column in a portrait layout)

381 382 383

Table 6 SHOULD BE ABOUT HERE (in a single column in a portrait layout)

384 385 386 387

Table 7 SHOULD BE ABOUT HERE (across two columns in a portrait layout)

388

4. Discussion

389

In the present study, as a new consumer concept test method, the metacognitive

390

authenticity test method was designed to include the metacognitive certainty response

391

following the authenticity response from which an SDT-based quantitative consumer concept

392

measure for an individual product could be calculated. This method was then applied

393

successfully to the consumer concept of perceived goso flavor using three commercial

394

soymilk products. It was demonstrated that the pooled metacognitive certainty responses,

395

which are a group of consumers’ confidence judgements on the sensory authenticity response

396

to the concept of goso flavor, could represent the certainty or solidity of such concept for

397

each product. The method differs from the original authenticity test in two aspects: cover

398

story delivery stage, and data collection and data analysis.

399

Regarding the cover story delivery stage, unlike in previous authenticity tests

400

(Frandsen et al., 2003, 2007; Boutrolle et al, 2009), where an upsetting story was used to

401

elicit an emotional response from the subjects, in the new authenticity test method, the cover

402

story was designed using only a reference frame for the target concept without an upsetting

403

story. Stocks et al. (2017) reported that the effect of an authenticity test with an upsetting

404

story on the performance of product discrimination varies according to the product

405

differences (small vs. large). Therefore, a cover story was used only to remind the consumers

406

what they thought of the sensory authenticity of the target concept and did not evoke a

407

negative affective attitude. This was to develop a test procedure capable of being applied to

408

broader business cases and be a more general consumer concept test method. As the present

409

study was the first exploration of the new consumer test method for studying the consumer

410

concept about the perceived goso flavor of commercial soymilk, validation in follow-up

411

studies will be necessary. In future research, more effective methods and test designs that can

412

evoke the consumers’ thinking and attitude for the target concept should be studied for a

413

range of concepts with diverse food and beverage products. The effects of negative versus

414

positive framing of the concept in the test instruction on the sensitivity of the test should also

415

be studied further.

416

Regarding data collection and data analysis, two sequential rating response options

417

were used: sensory authenticity and metacognitive certainty responses, as shown in Fig. 2.

418

More conventional approaches of mean scores and d' estimates were calculated to analyze the

419

sensory authenticity responses. To examine the metacognitive certainty responses, d'MC

420

estimates were calculated in the context of SDT and TM approaches, and the results were

421

compared to find a more applicable group measure of the consumer concept. As shown in

422

Tables 3 and 6, the d' estimates were computed to measure the degree of difference in the

423

perceived goso flavor for two products, indicating that one product was perceived to have

424

more goso flavor than the other. Because it was based on a pair-wise comparison, only the

425

relative distance/difference between two products could be calculated, and a physical noise

426

level was always required. In contrast, the mean scores and d'MC estimate could be calculated

427

for each product. Hence, these two scores allow quantitative comparisons to be made across

428

different products. Previously, Kim et al. (2017b) proposed a novel nonparametric estimate of

429

a group mean output measure of the affect magnitude (d'A) to measure the affect magnitude

430

for each attribute and for each product. This novel group measure can be calculated

431

independently for each product because it does not require a physical noise level. Therefore,

432

it was applied to develop product usage experience profiles of a household surface cleaner

433

(Kim et al., 2017b) and extend it for use as an alternative approach to conventional

434

descriptive analysis (Kim et al., 2018). Like the d'A estimate, a physical noise level is not

435

needed to calculate the d'MC estimate, which can be calculated independently for each product

436

as an absolute measure based on the idea of Type 2 sensitivity. In addition, its range from

437

negative (‒) to positive (+) values for each product indicates ‘mismatch with the concept’ to

438

‘match with the concept’. This provides a clear boundary for the certain consumer concept,

439

goso flavor in this case, by testing the significant difference from zero across a range of

440

product categories, and this boundary can be used as a business action standard. This is one

441

of the main advantages of this new measure compared to the mean scores, which would be

442

more arbitrary and dependent on the product types and categories.

443

Boutrolle et al. (2009) emphasized the importance of performing subgroup analysis

444

(i.e., cluster analysis) to consider the individual response patterns because an analysis with

445

overall data sometimes does not produce sufficient results. Therefore, in their study, the

446

participants received products monadically, but not the reference product, because they had to

447

compare the products with their internal reference. They were then asked to answer whether

448

the product was original or not (i.e., copy). Four different responses, ‘original’ and ‘copy’

449

responses concerning two products (X, Y), were obtained to conduct cluster analysis based

450

on the SDT matrix of consumers’ individual responses. The purpose was to examine not only

451

how the consumers’ response patterns differed among the different consumer groups but also

452

how their responses regarding authenticity differed when the decision criterion they used was

453

measured dependently. In this regard, Kim, van Hout, and Lee (2018) calculated the C values

454

to reflect the extent to which the subjects’ responses were skewed towards satisfaction or

455

dissatisfaction by applying the cumulative link model, which can be applied to obtain the TM

456

parameters, including the C values from the responses of A-Not A with rating categories

457

(Christensen, Cleaver, & Brockhoff, 2011). They highlighted the importance of considering

458

the subjective response bias in terms of providing meaningful information related to a study

459

of consumer perception and behavior.

460

Regarding the importance of the evaluative criterion, the C values, which represent

461

the consumer’s response bias towards the concept of goso flavor, were used for cluster

462

analysis. The results showed that these C values could be used to reveal the consumer

463

segments similar to the mean scores. The resulting study findings also showed that a

464

consideration of the individual response pattern is very important for a better understanding

465

of the consumers’ responses compared to the results of cluster analysis of the overall

466

consumers. These results are consistent with Boutrolle et al. (2009)’s suggestion in that

467

individual response patterns need to be reviewed because different conclusions can be drawn

468

depending on the type of data (pooled vs. subgroup).

469

After clustering the consumer groups based on their response pattern, the d'MC

470

estimates for each group were also calculated to indicate the degree of the perceived goso

471

flavor of each product for each group of consumers. As with the results of pooled data, it

472

provided similar results to the mean scores because it can be calculated for each product

473

individually. In addition, the product that is perceived to be well matched to the concept

474

could be interpreted easily for individuals in a certain consumer group. Moreover, a

475

comparison of the d' estimates results proved that the d'MC estimates could also predict the

476

product differences. Therefore, the use of d'MC estimates avoids the need to measure the

477

relative difference between two products. As a result, d'MC estimates driven from

478

metacognitive certainty responses might be an appropriate concept measure to provide

479

valuable information on the individual scores for each product as well as the relative

480

difference between products with the advantage of data interpretation. Although it is beyond

481

the scope of this paper, further validation of a new concept measure of d'MC is necessary,

482

focusing on the computation/calculation process of d'MC estimate and practical application of

483

the new measure (i.e., number of trials (sample size), application to the predictive model

484

development). Therefore, more theoretical examination and discussion from a statistical

485

perspective for such an approach is required in future studies.

486

Regarding the consumer concept measurement, the general measurement approach

487

has been the scaling method. Previous studies have measured how well consumers fitted a

488

concept statement to given products and their attributes, using a 9-point category scale with

489

labels (Carr, Craig-Petsinger, & Hadlich, 2001). The conceptual attributes of different

490

cheeses were also measured using a 7-point category scale (Bárcenas, Pérez de San Roman,

491

Pérez Elortondo, & Albisu, 2001), and a consumer concept of the refreshingness of

492

toothpaste was measured using a ranking task for a group of consumers (Lee & O’Mahony,

493

2005). Chae, Lee and Lee (2010) examined consumer discrimination between milks with

494

subtle differences using the same-different rating. Although such measures have been used to

495

investigate the consumer concept, response bias arises because of the different uses of the

496

response categories. Such response bias caused by the measurement tool should be separated

497

from the consumers’ natural response bias (response pattern) towards certain or uncertain

498

categories for the consumer concept. In addition, the scaling method produces scores that are

499

strongly dependent on the experimental context (i.e., a range of products and use of a

500

category of scale), producing relative scores (Kim, van Hout, Dessirier, & Lee, 2018). This

501

also yields a large boundary variance in the scores (Lee & O’Mahony, 2005;), making it

502

difficult to interpret the results.

503

For this reason, the SDT C values were used for consumer segmentation, and a novel

504

approach of computing d'MC was applied to measure the consumer concept without needing to

505

determine a reference or noise level product. These results suggest that the novel measure of

506

d'MC driven from the metacognitive certainty response following the authenticity response can

507

be an efficient alternative to the standard difference measure and mean scores for studying

508

the sensory and conceptual attribute of food and beverages. This novel approach of the

509

consumer test method using the repeated two-step ratings and computation of d'MC as a

510

measure of an abstract concept or authentic feel has the potential for a range of consumer

511

measurements and consumer segmentations. The applicability of this model warrants further

512

investigation using various abstract and synthetic consumer concepts under different

513

experimental situations with more realistic constraints.

514

515

Acknowledgements

516

This work was supported by the National Research Foundation of Korea(NRF) grants funded

517

by the Korea government(MSIT)(No. 2012R1A1A1001768; 2015R1A1A1A05001170) and

518

the

Ministry

of

Education(No.

2018R1A6A3A01012357).

519

References

520

Bárcenas, P., Pérez de San Roman, R., Pérez Elortondo, F. J., & Albisu, M. (2001).

521

Consumer preference structures for traditional Spanish cheeses and their relationship

522

with sensory properties. Food Quality and Preference, 12, 269–279.

523 524

Bi, J. (2006). Sensory discrimination tests and measurements: Statistical principles, procedures and tables. Iowa: Blackwell Publishing (Chapter 10).

525

Bi, J., Lee, H.-S., & O’Mahony, M. (2013). Statistical analysis of receiver operating

526

characteristic (ROC) curves for the ratings of the A-Not A and the same-different

527

methods. Journal of Sensory Studies, 28, 34–46.

528

Boutrolle, I., Julien D., Köster, E. P., Aranz, D., & Danzart, M. (2009). Use of a test of

529

perceived authenticity to trigger affective responses when testing food. Food Quality and

530

Preference, 20, 418–426.

531

Carr, B. T., Craig-Petsinger, D., & Hadlich, S. (2001). A case study in relating sensory

532

descriptive data to product concept fit and consumer vocabulary. Food Quality and

533

Preference, 12, 407–412.

534

Cha, S.-M., Chung, L., Chung, S.-J., Kim, K.-O., Han, G.-J., & Lee, S.-R. (2010). Exploring

535

Korean typical tastes, flavors and foods using Delphi technique. Korean Journal of Food

536

and Cookery Science, 26, 155–164.

537

Chae, J.-E., Lee, Y.-M., & Lee, H.-S. (2010). Affective same-different discrimination tests

538

for assessing consumer discriminability between milks with subtle differences. Food

539

Quality and Preference, 21, 427–438.

540

Christensen, R. H. B., Cleaver, G., & Brockhoff, P. B. (2011). Statistical and Thurstonian

541

models for the A-not A protocol with and without sureness. Food Quality and Preference,

542

22, 542–549.

543 544 545 546 547 548 549 550

Christensen, R. H. B. (2015). Ordinal – Regression models for ordinal data. R package version 2015.6–28.. Ennis, D. M. (1990). Relative power of difference testing methods in sensory evaluation. Food Technology, 44, 114–118. Ennis, J. M., Rousseau, B., & Ennis, D. M. (2014). Sensory difference tests as measurement instruments: A review of recent advances. Journal of Sensory Studies, 29, 89–102. Fleming, S. M., & Lau, H. C. (2014). How to measure metacognition. Frontiers in human neuroscience, 8, 443.

551

Frandsen, L., Dijksterhuis, G., Brockhoff, P., Nielsen, J., & Martens, M. (2003). Subtle

552

differences in milk: Comparison of an analytical and an affective test. Food Quality and

553

Preference, 14, 515–526.

554

Frandsen, L., Dijksterhuis, G., Brockhoff, P., Nielsen, J., & Martens, M. (2007). Feelings as a

555

basis for discrimination: Comparison of a modified authenticity test with the same-

556

different test for slightly different types of milk. Food Quality and Preference, 18, 97–

557

105.

558

Galvin, S. J., Podd, J. V., Drga, V., & Whitmore, J. (2003). Type 2 tasks in the theory of

559

signal detectability: Discrimination between correct and incorrect decisions.

560

Psychonomic Bulletin & Review, 10, 843–876.

561

562 563 564

Green, D.M., & Swets, J.A. (1966). Signal detection theory and psychophysics. New York: John Wiley & Sons Ltd. Grimaldi, P., Lau, H., & Basso, M. A. (2015). There are things that we know that we know,

565

and there are things that we do not know we do not know: Confidence in decision-

566

making. Neuroscience & Biobehavioral Reviews, 55, 88-97.

567

Hautus, M. J. (1995). Corrections for extreme proportions and their biasing effects on

568

estimated values of d′. Behavior Research Methods, Instruments, & Computers, 27, 46–

569

51.

570 571

Hautus, M. J. (2014). SDT Assistant [Computer software]. Auckland, New Zealand. Retrieved April 1, 2019. Available from http://www.hautus.org/sdt-assistant.php.

572

Higham, P. A., Perfect, T. J., & Bruno, D. (2009). Investigating strength and frequency

573

effects in recognition memory using Type-2 signal detection theory. Journal of

574

Experimental Psychology: Learning, Memory, and Cognition, 35, 57–80.

575

Jeong, Y.-N., Kang, B.-A., Jeong, M.-J., Song, M.-J., & Lee, H.-S. (2016). Sensory

576

discrimination by consumers of multiple stimuli from a reference: Stimulus configuration

577

in A-Not AR and constant-ref. duo-trio superior to triangle and unspecified tetrad? Food

578

Quality and Preference, 47, 10-22.

579

Jeong, Y.-N., van Hout, D., Groeneschild, C., & Lee, H. S. (2017). Comparative

580

categorization method: Using 2-AFC strategy in constant-reference duo-trio for

581

discrimination of multiple stimuli from a reference. Food Quality and Preference, 62,

582

284-295.

583

584

Kim, H.-J. (2015). Investigation of consumer acceptance measure using two different SDT

585

indirect scaling approaches for sodium reduction. Master’s thesis. Ewha Womans

586

University, Seoul, South Korea.

587

Kim, I.-A., den-Hollander, E., & Lee, H.-S. (2018). Two-step rating-based ‘double-faced

588

applicability’ test for sensory analysis of spread products as an alternative to descriptive

589

analysis with trained panel. Food Research International, 105, 250-260.

590

Kim, I.-A., Hopkinson, A., van Hout, D., & Lee, H.-S. (2017a). A novel two-step rating-

591

based ‘double-faced applicability’ test. Part 1: Its performance in sample discrimination

592

in comparison to simple one-step applicability rating. Food Quality and Preference, 56,

593

189-200.

594

Kim, I.-A., Hopkinson, A., van Hout, D., & Lee, H.-S. (2017b). A novel two-step rating-

595

based ‘double-faced applicability’ test. Part 2: Introducing a novel measure of affect

596

magnitude (d′A) for profiling consumers’ product usage experience based on Signal

597

Detection Theory. Food Quality and Preference, 59, 141-149.

598

Kim, I-A., Yoon, J.-Y., & Lee, H-S. (2015). Measurement of Consumers’ Sensory

599

Discrimination and Preference: Efficiency of Preference-Difference Test Method

600

Utilizing 3-Point Preference Test and Same-Different Test. Food Science and

601

Biotechnology, 24, 1355-1366.

602

Kim, M.-A, Chae, J.-E., van Hout, D., & Lee, H.-S. (2014). Higher performance of constant-

603

reference duo–trio test incorporating affective reference framing in comparison with

604

triangle test. Food Quality and Preference, 32, 113–125.

605

606

Kim, M.-A., Dessirier, J.-M, van Hout, D., & Lee, H.-S. (2015). Consumer context-specific

607

sensory acceptance tests: Effects of a cognitive warm-up on affective product

608

discrimination. Food Quality and Preference 41, 163-171.

609

Kim, M.-A., & Lee, H.-S. (2015). Duo-trio difference-preference test with two replications:

610

Use of psychological biases for measuring meaningful preference. Journal of sensory

611

studies 30, 211-224.

612

Kim, M.-A., Sim, H.-M., & Lee, H.-S. (2015). Affective discrimination methodology:

613

Determination and use of a consumer-relevant sensory difference for food quality

614

maintenance. Food Research International 70, 47-54.

615

Kim, M.-A., Sim, H.-M., & Lee, H.-S. (2016). Idiographic duo–trio tests using a constant-

616

reference based on preference of each consumer: Sample presentation sequence in

617

difference test can be customized for individual consumers to reduce error. Food

618

Research International, 89, 454-462.

619

Kim, M.-A., van Hout, D., Dessirier, J.-M, & Lee, H.-S. (2018). Degree of satisfaction-

620

difference (DOSD) method for measuring consumer acceptance: A signal detection

621

measurement with higher reliability than hedonic scaling. Food Quality and Preference,

622

63, 28-37.

623

Kim, M.-A., van Hout, D., & Lee, H.-S. (2018). Degree of satisfaction-difference (DOSD)

624

method for measuring consumer acceptance: Comparative and absolute measures of

625

satisfaction based on signal detection theory. Food quality and preference, 68, 167-172.

626

Köster, E. P. (1998). Épreuves hédoniques. In F. Depledt, & F. Strigler (Eds.), Évaluation

627

sensorielle: manuel méthodologique (2ème éd., pp. 182–207). Paris, London, New York:

628

Lavoisier Tec & Doc.

629

Lee, H.-S., & O’Mahony, M. (2005). Sensory evaluation and marketing: measurement of a

630 631 632

consumer concept. Food Quality and Preference, 16, 227–235. Lee, H.-S., & van Hout, D. (2009). Quantification of sensory and food quality: the R-index analysis. Journal of Food Science, 74, R57–R64.

633

Lee, S.-Y. (2015). Develeopment of sensory measures for the perceived kokumi of chicken

634

bouillon products (Unpublished master’s thesis). Ewha Womans University, Seoul,

635

South Korea.

636 637

Macmillan, N. A., & Creelman, C. D. (2005). Detection theory: A user’s guide. (2nd ed.). Mahwah, New Jersey: Lawrence Erlbaum Associates.

638

Maniscalco, B., & Lau, H. (2012). A signal detection theoretic approach for estimating

639

metacognitive sensitivity from confidence ratings. Consciousness and Cognition, 21,

640

422–430.

641 642

Massoni, S., Gajdos, T., & Vergnaud, J. C. (2014). Confidence measurement in the light of signal detection theory. Frontiers in psychology, 5, 1455.

643

Mojet, J., & Köster, E. P. (1986). Investigation into the appreciation of three low-alcohol

644

beers. Utrecht University: Report Psychological Laboratory (Confidential research

645

report), Utrecht, Netherlands.

646

O’Mahony, M. (1992). Understanding discrimination tests: A user‐friendly treatment of

647

response bias, rating and ranking R‐index tests and their relationship to signal detection.

648

Journal of Sensory Studies, 7, 1–47.

649

650 651 652

Seo, J.-E. (2013). Development of a measure for the perceived holistic softness of personal products. Master’s thesis. Ewha Womans University, Seoul, South Korea. Stocks, M., Shepherd, D., Lee, H.-S., van Hout, D., & Hautus, M. J. (2017). Cognitive

653

decision strategies adopted by consumers in reminder difference tests: Influence of the

654

authenticity test. Food Research International, 97, 265-271.

655

Stolzenbach, S., Bredie, W. L., Christensen, R. H. B., & Byrne, D. V. (2016). Understanding

656

liking in relation to sensory characteristics, consumer concept associations, arousal

657

potential and “appropriateness for use” using apple juice as an application. Journal of

658

Sensory Studies, 31, 135-142.

659

Thurstone, L. L. (1927). A law of comparative judgment. Psychological Review, 34, 273–286.

660

Thomson, D. M. H., Crocker, C., & Marketo, C. G. (2010). Linking sensory characteristics to

661

emotions: An example using dark chocolate. Food Quality and Preference, 21, 1117-

662

1125.

663

Research Highlights

664



A new consumer test method, the metacognitive authenticity test, was designed.

665



It included the metacognitive certainty response following sensory authenticity response.

666



A new group measure of d'MC was calculated and compared with the mean scores and d'.

667



These comparisons were performed based on both pooled data and segmented data.

668



d'MC corresponded to the mean scores and d' and could be interpreted easily.

669

670

Fig. 1. Illustration of the response classification of the metacognitive authenticity test

671

method for the computation of d' and d'MC estimations based on signal detection theory

672

(SDT): SDT response matrix for computation of (a) d' for S1 and S2 pair, (b) d'MC for S1

673

and (c) d'MC for S2.

674 675

Fig. 2. Illustration of data analysis for computation of d' and d'MC estimations: (a) d' from

676

the sensory authenticity response, and (b) d'MC from the metacognitive certainty response.

677 678

Table 1

679

Number of responses and tasting products for each consumer and across all consumers.

Main test

Type of

Total No. of responses for each product

responses For each consumer

For all consumers (pooled data using data analysis)

: 3 days with a week interval for

Sensory

15 responses x 3 days

authenticity

= 45 responses

45 sensory authenticity responses x 91 consumers = 4,095 responses

response

31

Calculating the mean scores and d' estimates

each product (5 sessions/day)

Metacogniti ve certainty response

15 responses x 3 days

45 metacognitive certainty responses x 91 consumers = 4,095 responses

= 45 responses

: 3 products

Calculating the d'MC estimates

were evaluated Total number

45

responses

of tastings

products

x

3

4095 responses x 3 products = 12,285 tasting

= 135 tasting

680

32

681

Table 2

682

Mean scores for each product (N=91, n=4,095)* Product

Mean score (SD)

P1

3.491b (1.784)

P2

4.048c (1.517)

P3

3.091a (1.513)

683

*N:

684

consumers) for each product

685

a-c

Total No. of consumers, n: Total No. of responses (45 sensory authenticity responses x Total No. of

Different superscripts indicate a significant difference between the mean scores in a column (p < 0.05).

686

33

687

Table 3

688

d' estimates for each product pair (N=91, n=8,190)* Product pair (Signal



d' (95% CI)**

Noise) P2 ‒ P1

0.311a (0.268, 0.355)

P2 ‒ P3

0.629a (0.584, 0.673)

P1 ‒ P3

0.234a (0.190, 0.278)

689

*N:

690

consumers x Two products) for each product pair

691

**

692

a-c

693

test, p < 0.05).

Total No. of consumers, n: Total No. of responses (45 sensory authenticity responses x Total No. of

Values in bold represent a significant difference from zero. Different superscripts indicate a significant difference between the d' estimates in a column (chi-square

694 695 696

34

697

Table 4

698

d'MC estimates for each product (N=91, n=4,095)* (a) For each product

(b) For each product pair Difference

Product

d'MC (95% CI)**

d'MC

between products

P1

0.048b (-0.011, 0.106)

P2 ‒ P1

0.205

P2

0.253c (0.191, 0.314)

P2 ‒ P3

0.471

P1 ‒ P3

0.266

-0.218a

P3

(-0.278,

0.158)

699

*N:

700

consumers) for each product

701

**

702

a-c

703

test, p < 0.05).

-

Total No. of consumers, n: Total No. of responses (45 metacognitive certainty responses x Total No. of

Values in bold represent a significant difference from zero. Different superscripts indicate a significant difference between the d'

704

35

MC

estimates in a column (chi-square

705

Table 5

706

Mean scores for each product for each consumer group Group

Group 1 (N=38, n=1,710)*

Group 2 (N=35, n=1,575)

Group 3 (N=18, n=810)

Product

Mean score (SD)

P1

2.140a (1.306)

P2

4.716c (1.245)

P3

3.406b (1.530)

P1

4.685c (1.311)

P2

2.979a (1.332)

P3

3.097b (1.490)

P1

4.021b (1.497)

P2

4.716c (1.141)

P3

2.415a (1.290)

707

*N:

708

consumers) for each product

709

a-c

710

group (p < 0.05).

Total No. of consumers, n: Total No. of responses (45 sensory authenticity responses x Total No. of

Different superscripts indicate a significant difference between the mean scores in a column within each

711

36

712

Table 6

713

d' estimates for each product pair for each consumer group Product pair Group

(Signal ‒

d' (95% CI)**

Noise) P2 ‒ P1

1.786c (1.705, 1.867)

P2 ‒ P3

0.864b (0.795, 0.934)

Group 1 (N=37, n=3,330)*

P1 ‒ P3

P2 ‒ P1 Group 2 (N=35,

P2 ‒ P3

n=3,150)

Group 3 (N=19, n=1,710)

-0.945a

(-1.107,

-

(-1.313,

-

0.874) -1.236a 1.159) -0.042b

(-0.111,

0.028)

P1 ‒ P3

1.096c (1.021, 1.171)

P2 ‒ P1

0.542a (0.446, 0.638)

P2 ‒ P3

1.860c (1.741, 1.979)

P1 ‒ P3

1.088b (0.985, 1.191)

714

*N:

715

consumers x Two products) for each product pair

716

**

717

a-c

718

group (chi-square test, p < 0.05).

Total No. of consumers, n: Total No. of responses (45 sensory authenticity responses x Total No. of

Values in bold represent a significant difference from zero. Different superscripts indicate a significant difference between the d' estimates in a column within each

719 720

37

721

Table 7

722

d'MC estimates for each product for each consumer group (a) For each product

(b) For each product pair Difference

Group Product

d'MC (95% CI)**

between

d'MC

products

P1 Group 1 (N=37,

P2

n=1,665)* P3

P1 Group 2 P2 (N=35, n=1,575) P3

-0.716a

(-0.844,

-

0.587) 0.578c (0.448, 0.708) -0.091b

(-0.186,

0.003) 0.644b (0.502, 0.786) -0.176a

(-0.275,

-

(-0.236,

-

0.076) -0.140a 0.044)

P2 ‒ P1

1.294

P2 ‒ P3

0.669

P1 ‒ P3

-0.624

P2 ‒ P1

-0.820

P2 ‒ P3

-0.036

P1 ‒ P3

0.784

P1

0.239b (0.104, 0.373)

P2 ‒ P1

0.225

P2

0.463b (0.266, 0.661)

P2 ‒ P3

0.831

P1 ‒ P3

0.606

Group 3 (N=19, n=855)

P3

-0.367a

(-0.541,

-

0.193)

723

*N:

724

consumers) for each product

725

**

726

a-c

727

group (chi-square test, p < 0.05).

Total No. of consumers, n: Total No. of responses (45 metacognitive certainty responses x Total No. of

Values in bold represent a significant difference from zero. Different superscripts indicate a significant difference between the d'

728

38

MC

estimates in a column within each

729 730

Author Statement

731 732

In-Ah Kim: Conceptualization, Methodology, Investigation, Writing - Original Draft,

733

Writing- Reviewing and Editing. Ha-Yeon Cho: Conceptualization, Methodology,

734

Investigation, Writing - Original Draft. Michael J. Hautus: Software, Writing - Reviewing

735

and Editing. Hye-Seong Lee: Supervision, Conceptualization, Methodology, Writing -

736

Original Draft, Writing- Reviewing and Editing.

737

39