Alvarado or RIPASA score for diagnosis of acute appendicitis? A meta-analysis of randomized trials

Alvarado or RIPASA score for diagnosis of acute appendicitis? A meta-analysis of randomized trials

Accepted Manuscript Alvarado or RIPASA score for diagnosis of acute appendicitis? A meta-analysis of randomized trials Maximos Frountzas, Konstantinos...

2MB Sizes 0 Downloads 26 Views

Accepted Manuscript Alvarado or RIPASA score for diagnosis of acute appendicitis? A meta-analysis of randomized trials Maximos Frountzas, Konstantinos Stergios, Dimitra Kopsini, Dimitrios Schizas, Konstantinos Kontzoglou, Konstantinos Toutouzas PII:

S1743-9191(18)31539-5

DOI:

10.1016/j.ijsu.2018.07.003

Reference:

IJSU 4720

To appear in:

International Journal of Surgery

Received Date: 24 March 2018 Revised Date:

9 June 2018

Accepted Date: 6 July 2018

Please cite this article as: Frountzas M, Stergios K, Kopsini D, Schizas D, Kontzoglou K, Toutouzas K, Alvarado or RIPASA score for diagnosis of acute appendicitis? A meta-analysis of randomized trials, International Journal of Surgery (2018), doi: 10.1016/j.ijsu.2018.07.003. This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

ACCEPTED MANUSCRIPT Alvarado or RIPASA score for diagnosis of acute appendicitis? A metaanalysis of randomized trials. Maximos Frountzas1(MF) MD, Konstantinos Stergios1(KS) MD, Dimitra Kopsini1(DK) MD, Dimitrios Schizas2(DS) MD, PhD, Konstantinos Kontzoglou3(KK) MD, PhD,

RI PT

Konstantinos Toutouzas4(KT) MD, PhD

Affiliation

Laboratory of Experimental Surgery and Surgical Research N.S. Christeas,

Athens Medical School, Athens, Greece.

First Department of Surgery, Laiko General Hospital, Athens Medical

School, Athens, Greece. 3

M AN U

2

SC

1

Second Department of Propaedeutic Surgery, Laiko General Hospital,

School of Medicine, Athens Medical School, Athens, Greece. First Department of Propaedeutic Surgery, Hippocration Hospital,

TE D

4

Athens Medical School, Athens, Greece.

AC C

EP

Correspondence: Maximos Frountzas, MD, PhD(c) 39, Mikras Asias str. Athens 11527 – GREECE Phone: +0030 6978880045 E-mail: [email protected],

Disclosure: The authors report no conflict of interest. Source of funding: None to disclose for all authors.

Category: Review (meta-analysis)

Keywords: Alvarado; RIPASA; appendicitis; diagnosis; meta-analysis

ACCEPTED MANUSCRIPT Short title: Alvarado or RIPASA for appendicitis diagnosis? Word counts: abstract: 246, text: 2,729 Number of Figures: 4 Number of Tables: 3

AC C

EP

TE D

M AN U

SC

RI PT

Number of references: 32

ACCEPTED MANUSCRIPT 1

Abstract

2

Backround: The electronic diagnostic tools of acute appendicitis present

4

serious disadvantages, thus some clinical scores have been formed in order

5

to reach the diagnosis easily and safely. Alvarado and RIPASA scores are the

6

most commonly used and the purpose of this meta-analysis is to compare the

7

diagnostic accuracy of these two scoring systems.

8

Method: We searched MEDLINE (1966-2017), Scopus (2004-2017),

9

ClinicalTrials.gov (2008-2017), Google Scholar (2004–2017) and Cochrane

10

Central Register of Controlled Trials CENTRAL (1999-2017) databases. We

11

selected all observational cohort studies that reported diagnostic parameters

12

of Alvarado and RIPASA diagnostic scores on patients with clinical status of

13

acute appendicitis. Statistical meta-analysis was performed with Meta Disc 1.4

14

software.

15

Results: Twelve studies were included in our meta-analysis which enrolled

16

2,161 patients. The sensitivity of RIPASA score was 94% (95% CI, 92% to

17

95%) and the specificity was 55% (95% CI, 51% to 55%). In addition, the area

18

under the Roc Curve (AUC) was 0.9431 and the diagnostic Odds Ratio was

19

24.66 (95% CI, 8.06 to 75.43). The sensitivity of Alvarado score was 69%

20

(95% CI, 67% to 71%) and the specificity was 77% (95% CI, 74% to 80%).

21

Moreover, the AUC was 0.7944 and the diagnostic Odds Ratio was 7.99 (95%

22

CI, 4.75 to 13.43).

23

Conclusion: RIPASA scoring system is more sensitive than Alvarado one,

24

but the low specificity forms the need of a supplementary mean to provide the

25

accurate diagnosis. Nevertheless, the wide and safe use of both tests is

AC C

EP

TE D

M AN U

SC

RI PT

3

1

ACCEPTED MANUSCRIPT recommended in health systems that lack electronic diagnostic tests, such us

27

developing countries or rural hospitals.

28

Introduction

29

Acute appendicitis (AA) is one of the most common surgical emergencies in

30

the world, with 7-12% of the general population being affected (1). Prompt and

31

accurate diagnosis, mainly based on clinical assessment and laboratory

32

results (2), is of paramount importance in order to reduce complications and

33

mortality rates. However, due to the fact that only half of the patients present

34

with the typical periumbilical pain followed by nausea, vomiting and migration

35

of the pain to the right lower quadrant, as well as the diseases mimicking AA,

36

diagnosis of appendicitis is a challenging undertaking (3, 4). Ultrasound, multi-

37

detector CT scan and diagnostic laparoscopy have been important, but costly,

38

supplements to the differential diagnosis of acute abdominal pain.

39

Laparoscopy is a vital tool in the diagnosis and management of lower

40

abdominal pain in young and fertile women (5, 6), while CT scan is perceived

41

as the best diagnostic tool for AA (7), both reducing the false positive cases

42

before the operation.

43

However, ionizing radiation and the associated cancer risk especially for the

44

pediatric patients, limited availability and high cost are major disadvantages of

45

CT scan (8). Surgery is not without risks as well. Negative appendectomy

46

(NA) due to early intervention, perforation due to delayed intervention, wound

47

infection, hernia development and adhesions are some of the drawbacks of

48

appendectomy. Also, the cost of negative appendectomy (NA) to both the

49

patient and the health care system, are important things to consider when

50

treating patients with a possible appendicitis (9). These facts, have made the

AC C

EP

TE D

M AN U

SC

RI PT

26

2

ACCEPTED MANUSCRIPT need for an assisting tool in diagnosis of AA mandatory and this is how clinical

52

scoring systems emerged. Since 1980, more than ten systems have been

53

developed, with Alvarado, RIPASA, Fenyo, Tzakis, Eskelinen and Ohmann

54

being some of them. These scoring systems stratify patients into scaled

55

groups of likelihood to suffer from appendicitis, according to the number of

56

symptoms and signs they present. (10).

57

The Alvarado score was introduced in 1986 and takes into consideration eight

58

parameters that were found to be important in the diagnosis of acute

59

appendicitis: localized tenderness in the right lower quadrant, leukocytosis,

60

migration of pain, leukocyte shift to the left, temperature elevation, nausea-

61

vomiting, anorexia and rebound pain (11). In 2010, a new scoring system

62

called RIPASA score was developed. It was thought to have increased

63

sensitivity and specificity when applied in Indian population and compared to

64

Alvarado’s 8 parameters, it uses 15: age, gender, right iliac fossa (RIF) pain,

65

migration of pain to RIF, nausea and vomiting, anorexia, duration of

66

symptoms (less or more than 48 hours), RIF tenderness, guarding, rebound

67

tenderness, Rovsing's sign, fever, raised white cell count, negative urinalysis

68

and foreign national registration identity card (12).

69

The aim of our study is to compare these two widely and easily used scoring

70

systems, in order to identify which one has the best sensitivity and specificity

71

and leads to the most accurate diagnosis of acute appendicitis.

AC C

EP

TE D

M AN U

SC

RI PT

51

72 73

Material and methods

74

Literature search and data collection

3

ACCEPTED MANUSCRIPT The present study was designed according to the Preferred Reporting Items

76

for Systematic Reviews and Meta-Analyses (PRISMA) guidelines (13).

77

Eligibility criteria were predetermined by the authors. Language or date

78

restrictions were avoided during the literature search. Case reports and review

79

articles were excluded. Two authors (MF, DK) performed the electronic search

80

of articles and tabulated data on duplicated pre-structured forms. The data

81

where then reviewed by a third author (KS) and every discrepancy regarding

82

the evaluation of the methodology, retrieval of articles and statistical analysis

83

was resolved by consensus of three authors (DS, KK, KT).

84

We used the Medline (1966-2017), Scopus (2004-2017), Clinicaltrials.gov

85

(2008-2017), Cochrane Central Register of Controlled Trials CENTRAL

M AN U

SC

RI PT

75

(1999-2017) and Google Scholar (2004–2017) databases in our primary

87

search along with the reference lists of electronically retrieved full-text papers.

88

The date of our last search was on December 20th, 2017. The PRISMA flow

89

chart of study selection could be found on Figure 1. According to that, 142

90

studies excluded selection, due to lack of one diagnostic tool from the

91

comparison or due to reference to a different diagnostic tool; the inadequate

92

data of some studies about statistical values necessary for our analysis was

93

an additional reason for excluding them from selection.

94

The search strategy included the keywords “acute”, “appendicitis”, “diagnosis”,

95

“score”, in combination with Boolean operators (AND, OR, NOT). The stages

96

of article selection are depicted in the PRISMA flow diagram (Figure 1).

97

Quality assessment

98

The methodological quality of the included studies was assessed with the

AC C

EP

TE D

86

4

ACCEPTED MANUSCRIPT QUADAS-2 tool, which comprises 4 domains: patient selection, index test,

100

reference standard, and flow and timing. Each domain is assessed in terms of

101

risk of bias, and the first 3 domains are also assessed in terms of concerns

102

regarding applicability. Signaling questions are included to help judge risk of

103

bias, and the possible answers are “low”, “high” or “unclear”. At the end, a

104

summary of the number of studies that had a low, a high, or an unclear risk of

105

bias or concerns about applicability for each domain is formed, represented by

106

a different color (Figure 2).

107

Statistical analysis

108

Statistical analysis was performed using the Meta Disc 1.4 software.

109

Confidence intervals were set at 95%. Pooled sensitivity, pooled specificity,

110

diagnostic Odds Ratios (OR), Summary Receiver Operating Characteristic

111

(SROC) curves and 95% confidence intervals (CI) for all outcomes were

112

calculated, using the DerSimonian-Laird random effect model due to the

113

significant heterogeneity in the methodological characteristics of included

114

studies (Table 1). Publication bias was not tested due to the gross

115

heterogeneity of included studies, which is a significant confounder that may

116

influence the methodological integrity of these tests.

117

Definitions

118

Alvarado scoring system includes abdominal pain that migrates to the right

119

iliac fossa (1 point), anorexia (loss of appetite) or ketones in the urine (1

120

point), nausea or vomiting (1 point), tenderness in the right iliac fossa (2

121

points), rebound tenderness (1 point), fever of 37.3 °C or more (1 point),

122

leukocytosis > 10,000 (2 points), neutrophilia > 70% (1 point). A score of 5 or

123

6 is compatible with the diagnosis of acute appendicitis. A score of 7 or 8

AC C

EP

TE D

M AN U

SC

RI PT

99

5

ACCEPTED MANUSCRIPT indicates a probable appendicitis, and a score of 9 or 10 indicates a very

125

probable acute appendicitis (14). A score above 7.0 considered as positive for

126

appendicitis. All the included studies used classic Alvarado scoring system,

127

except from one that used the modified Alvarado test and one that used both

128

tests. Nevertheless, all studies considered the same cut-off point (score > 7.0)

129

for diagnosis of acute appendicitis.

130

RIPASA scoring system includes age (less than 40 years = 1 point; greater

131

than 40 years = 0.5 point), gender (male = 1 point; female = 0.5 point), right

132

Iliac fossa (RIF) pain (0.5 point), migration of pain to RIF (0.5 point), nausea

133

and vomiting (1 point), anorexia (1 point), duration of symptoms (less than 48

134

hours = 1 point; more than 48 hours = 0.5 point), RIF tenderness (1 point),

135

guarding (2 points), rebound tenderness (1 point), Rovsing's sign (2 points),

136

fever (1 point), raised white cell count (1 point), negative urinalysis (1 point)

137

and foreign national registration identity card (1 point) (15). A score above 7.5

138

considered as positive for appendicitis.

139

TE D

M AN U

SC

RI PT

124

Results

141

Figure 1 shows the process that our study group followed in order to conclude

142

to the included studies after search of the literature and evaluation of the

143

results. Two studies were excluded, because they referred to Alvarado score

144

by itself (12, 16). Another three studies were excluded, because they were

145

exclusively associated to the RIPASA score (15, 17) and one study excluded

146

due to correlation with an irrelevant scoring system (18).

147

studies were included in our meta-analysis which enrolled 2,161 patients (7,

148

19-29). All patients that completed the inclusion criteria undergone

AC C

EP

140

Finally, twelve

6

ACCEPTED MANUSCRIPT assessment following both RIPASA and Alvarado diagnostic tests. Table 1

150

shows the main characteristics of the studies involved in the present meta-

151

analysis. In addition, Table 2 refers to the demographic data of the patients

152

that took place in our study, as well as other possible diagnoses that

153

confirmed post-operatively. In addition, it presents other means that helped

154

the diagnostic procedure (CT, U/S) other than surgery and histopathologic

155

analysis, which provided the final diagnostic confirmation in all studies.

156

Finally, Table 3 present the statistical parameters of the two tests, which our

157

further analysis was based on.

158

Quality assessment

159

After the quality assessment of the included studies, according to QUADAS-2

160

tool, 25% of the studies present a low risk of bias during the patient selection

161

process, 50% of the studies present a low risk of bias relative to the index

162

test, 15% of the studies have a low risk of bias relative to the reference

163

standard and 45% of the studies present a low risk of bias associated with

164

timing and flow (Figure 2). In addition, 75% of the included studies present low

165

applicability concerns during the patient selection process, 80% of the studies

166

have low applicability concerns about the index test and 25% of the studies

167

present low applicability concerns correlated to the reference standard (Figure

168

2).

169

Outcomes

170

The sensitivity of RIPASA score among the included studies was 94% (95%

171

CI, 92% to 95%) (Fig. 3a). On the other hand, the specificity of RIPASA score

172

was 55% (95% CI, 51% to 55%) (Fig. 3b). In addition, the area under the Roc

173

Curve (AUC) was 0.9431 for the RIPASA score (Fig. 3c). Finally, the

AC C

EP

TE D

M AN U

SC

RI PT

149

7

ACCEPTED MANUSCRIPT diagnostic Odds Ratio for the RIPASA score was 24.66 (95% CI, 8.06 to

175

75.43) (Fig. 3d).

176

The sensitivity of Alvarado score was 69% (95% CI, 67% to 71%) among the

177

included studies (Fig. 4a). The specificity of the same test was 77% (95% CI,

178

74% to 80%) (Fig. 4b). Moreover, the AUC for the Alvarado test was found to

179

be 0.7944 (Fig. 4c). Finally, the diagnostic Odds Ratio of the Alvarado test

180

was 7.99 (95% CI, 4.75 to 13.43) (Fig. 4d).

SC

181

RI PT

174

Discussion

183

Acute appendicitis is the most frequent cause of abdominal pain, excluding

184

the cases, where no cause will be identified (30). Most of the surgeons, at

185

some point of their practice, will face an important dilemma, between

186

performing an unnecessary appendectomy and delaying one, requesting

187

further diagnostic tests and finally operating when perforation occurs. Our

188

study revealed that the RIPASA score has a very high sensitivity but low

189

specificity, lower than the Alvarado one. On the other hand, the Alvarado

190

system presents a low sensitivity. Moreover, the RIPASA presents a higher

191

diagnostic Odds Ratio and a greater AUC. Overall, both systems can help the

192

physicians diagnose acute appendicitis, but the RIPASA one is more sensitive

193

and provides better statistical parameters than the Alvarado one.

194

A wide range of diagnostic tests, such as CT scan and ultrasonography, are

195

today available and present high sensitivity and specificity rates, although

196

their use is importantly limited by different factors, such as the prompt

197

availability, the costs, the ionizing radiation risk of developing cancer, the

198

national guidelines and the common sense among healthcare professionals

AC C

EP

TE D

M AN U

182

8

ACCEPTED MANUSCRIPT that an equivocal scan has nothing to offer, which would change the patient’s

200

management (31). On the other hand, identifying a normal appendix during

201

the laparoscopy, carries the risks of an operation under general anesthesia,

202

the overall cost is high and the surgeon will face again a dilemma between

203

performing or not an appendectomy. At the end of the day most surgeons will

204

remove the appendix, making all the diagnostic work up at this stage looking

205

not very useful. The fact that the complications rate is the same irrespective of

206

the inflammation or not of the appendix, complicates further the patient’s

207

management and makes the implementation of a prediction scoring system

208

necessary (32). Different clinical diagnostic scoring systems have been

209

proposed in the past, the last one by Sammalkorpi et al. (32), but the RIPASA

210

followed the Alvarado Score, as the most frequently used and studied. The

211

Alvarado scoring system has been found in a recent study to have a low

212

negative appendectomy rate of 18.9%, comparing to Izbicki and Christian

213

scoring systems (33). Alvarado and RIPASA are accurate, simple rapid,

214

reliable and of low cost.

215

The main advantage of our study is that it deals with a very common medical

216

emergency, whose incidence is very high in all age groups and the diagnosis

217

is very simple, despite the extremely serious complications that may occur to

218

the misdiagnosed cases. These complications would be eliminated by the

219

accurate and precise diagnosis, which the present study attribute to.

220

Furthermore, our study focuses on two scoring systems that could be

221

available to every physician even at the most remote health units and require

222

very low cost to perform. The extensive search of the literature and the

223

collection of all the available studies that were relative to the subject of our

AC C

EP

TE D

M AN U

SC

RI PT

199

9

ACCEPTED MANUSCRIPT meta-analysis led to precise and useful conclusions, that could directly apply

225

to the clinical practice. On the other hand, the two scoring systems that our

226

study refers to are mainly based on the clinical presentation, and that fact

227

makes the diagnosis difficult for some groups of patients, such as the elderly,

228

the diabetics and the children. In addition, very few of the studies included in

229

our meta-analysis have been originated in western health systems, such us

230

Western Europe or America; the explanation of that is the overuse of

231

electronic diagnostic means in such health systems, like ultrasound and CT.

232

Another disadvantage of our study is that it does not compare the two scoring

233

systems with other diagnostic tests, like CT scan or ultrasonography, due to

234

lack of relevant studies from the literature. Finally, the two scoring systems

235

that are included in our study use a wide number of criteria, which require

236

accurate recall of the ranges and the point system, a fact that may cause

237

confusions and wrong diagnoses at the very noisy and messy environment of

238

the emergency rooms.

239

An ideal prediction tool would aid to avoid unnecessary operations, and

240

missing out real appendicitis which finally perforates, keeping low the negative

241

appendicectomy rate, the missing out of appendiceal perforations and the

242

possibility of perforation in case of conservative treatment. In addition, since

243

the incidence of acute appendicitis depends on the sex, future studies should

244

provide stratified data according to sex, in order a subgroup analysis to

245

become possible and a correlation between the diagnostic value of each

246

scoring system and the sex of each patient to be investigated. In fact, we are

247

still far from having an ideal scoring system and definitely more studies with

248

strict implementation of a scoring system are necessary, in order to obtain

AC C

EP

TE D

M AN U

SC

RI PT

224

10

ACCEPTED MANUSCRIPT more robust evidence. Comparing the available diagnostic tools, we conclude

250

that RIPASA scoring system is more sensitive than Alvarado one, but the lack

251

of high specificity for the RIPASA scoring system, forms the need of a

252

supplementary mean to provide the accurate diagnosis, depending on the

253

clinical case each time. In addition, the wide and safe use of both tests could

254

be extremely crucial in health systems of developing countries, in which

255

electronic diagnostic tests are not widely available, or in distant rural health

256

units, where the available equipment limits the diagnostic potentials.

SC

RI PT

249

257

Acknowledgements

259

We would like to thank Dr. VP for his critical evaluation of the meta-analysis

260

and his useful remarks during the writing of the present manuscript.

261

TE D

262

M AN U

258

References

264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284

1. Chalazonitis AN, Tzovara I, Sammouti E, Ptohis N, Sotiropoulou E, Protoppapa E, et al. CT in appendicitis. Diagnostic and interventional radiology (Ankara, Turkey). 2008;14(1):19-25. 2. Sengupta A, Bax G, Paterson-Brown S. White cell count and C-reactive protein measurement in patients with possible appendicitis. Annals of the Royal College of Surgeons of England. 2009;91(2):113-5. 3. Birnbaum BA, Wilson SR. Appendicitis at the millennium. Radiology. 2000;215(2):337-48. 4. Karamanakos SN, Sdralis E, Panagiotopoulos S, Kehagias I. Laparoscopy in the emergency setting: a retrospective review of 540 patients with acute abdominal pain. Surgical laparoscopy, endoscopy & percutaneous techniques. 2010;20(2):11924. 5. Gaitan HG, Reveiz L, Farquhar C, Elias VM. Laparoscopy for the management of acute lower abdominal pain in women of childbearing age. The Cochrane database of systematic reviews. 2014(5):CD007683. 6. Kadam PD, Chuan HH. Erratum to: Rectocutaneous fistula with transmigration of the suture: a rare delayed complication of vault fixation with the sacrospinous ligament. International urogynecology journal. 2016;27(3):505. 7. Liu W, Wei Qiang J, Xun Sun R. Comparison of multislice computed tomography and clinical scores for diagnosing acute appendicitis. The Journal of international medical research. 2015;43(3):341-9.

AC C

EP

263

11

ACCEPTED MANUSCRIPT

EP

TE D

M AN U

SC

RI PT

8. Brenner DJ, Doll R, Goodhead DT, Hall EJ, Land CE, Little JB, et al. Cancer risks attributable to low doses of ionizing radiation: assessing what we really know. Proceedings of the National Academy of Sciences of the United States of America. 2003;100(24):13761-6. 9. Khairy G. Acute appendicitis: is removal of a normal appendix still existing and can we reduce its rate? Saudi journal of gastroenterology : official journal of the Saudi Gastroenterology Association. 2009;15(3):167-70. 10. McKay R, Shepherd J. The use of the clinical scoring system by Alvarado in the decision to perform computed tomography for acute appendicitis in the ED. The American journal of emergency medicine. 2007;25(5):489-93. 11. Alvarado A. A practical score for the early diagnosis of acute appendicitis. Annals of emergency medicine. 1986;15(5):557-64. 12. Chong CF, Adi MI, Thien A, Suyoi A, Mackie AJ, Tin AS, et al. Development of the RIPASA score: a new appendicitis scoring system for the diagnosis of acute appendicitis. Singapore medical journal. 2010;51(3):220-5. 13. Liberati A, Altman DG, Tetzlaff J, Mulrow C, Gotzsche PC, Ioannidis JP, et al. The PRISMA statement for reporting systematic reviews and meta-analyses of studies that evaluate healthcare interventions: explanation and elaboration. BMJ (Clinical research ed). 2009;339:b2700. 14. Douglas CD, Macpherson NE, Davidson PM, Gani JS. Randomised controlled trial of ultrasonography in diagnosis of acute appendicitis, incorporating the Alvarado score. BMJ (Clinical research ed). 2000;321(7266):919. 15. Butt MQ, Chatha SS, Ghumman AQ, Farooq M. RIPASA score: a new diagnostic score for diagnosis of acute appendicitis. Journal of the College of Physicians and Surgeons--Pakistan : JCPSP. 2014;24(12):894-7. 16. Ohle R, O'Reilly F, O'Brien KK, Fahey T, Dimitrov BD. The Alvarado score for predicting acute appendicitis: a systematic review. BMC medicine. 2011;9:139. 17. Khadda S, Yadav AK, Ali A, Parmar A, Sakrani JK, Beniwal H. CLINICAL STUDY TO EVALUATE THE RIPASA SCORING SYSTEM IN THE DIAGNOSIS OF ACUTE APPENDICITIS. 18. Rathod S, Ali I, Bawa APS, Singh G, Mishra S, Nongmaithem M. Evaluation of Raja Isteri Pengiran Anak Saleha Appendicitis score: A new appendicitis scoring system. Medical Journal of Dr DY Patil University. 2015;8(6):744. 19. Chong CF, Thien A, Mackie AJ, Tin AS, Tripathi S, Ahmad MA, et al. Comparison of RIPASA and Alvarado scores for the diagnosis of acute appendicitis. Singapore medical journal. 2011;52(5):340-5. 20. Ferlengez E, Ferlengez AG, Akbulut H, Kadioglu H. Evaluation of four different scoring systems in the management of acute appendicitis; a prospective clinical study/Akut Apandisit Degerlendirilmesinde Dort Farkli Skorlama Sisteminin Degerlendirilmesi; ileri Donuk Klinik Calisma. The Medical Bulletin of Haseki. 2013;51(1):15-8. 21. Reyes-García N, Zaldívar-Ramírez FR, Cruz-Martínez R, Sandoval-Martínez MD, Gutiérrez-Banda CA, Athié-Gutiérrez C. Precisión diagnóstica de la escala RIPASA para el diagnóstico de apendicitis aguda: análisis comparativo con la escala de Alvarado modificada. Cirujano general. 2012;34(2):101-6. 22. Alnjadat I, Abdallah B. Alvarado versus RIPASA score in diagnosing acute appendicitis. Rawal Med J. 2013;38:147-51. 23. Erdem H, Cetinkunar S, Das K, Reyhan E, Deger C, Aziret M, et al. Alvarado, Eskelinen, Ohhmann and Raja Isteri Pengiran Anak Saleha Appendicitis scores for diagnosis of acute appendicitis. World journal of gastroenterology. 2013;19(47):905762. 24. N N, Mohammed A, Shanbhag V, Ashfaque K, S AP. A Comparative Study of RIPASA Score and ALVARADO Score in the Diagnosis of Acute Appendicitis. Journal of clinical and diagnostic research : JCDR. 2014;8(11):NC03-5.

AC C

285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338

12

ACCEPTED MANUSCRIPT

RI PT

SC

M AN U

TE D EP

367

25. Verma M, Chanchal K, Vashisht M, Goyal K, Yadav P. Comparison of Alvarado and RIPASA scoring systems in diagnosis of acute appendicitis paripex. Indian J Res. 2015;4(8):55-7. 26. Walczak DA, Pawełczak D, Żółtaszek A, Jaguścik R, Fałek W, Czerwińska M, et al. The Value of Scoring Systems for the Diagnosis of Acute Appendicitis. Polish Journal of Surgery. 2015;87(2):65-70. 27. Golden SK, Harringa JB, Pickhardt PJ, Ebinger A, Svenson JE, Zhao Y-Q, et al. Prospective evaluation of the ability of clinical scoring systems and physiciandetermined likelihood of appendicitis to obviate the need for CT. Emerg Med J. 2016:emermed-2015-205301. 28. Singla A, Singla S, Singh M, Singla D. A comparison between modified Alvarado score and RIPASA score in the diagnosis of acute appendicitis. Updates in surgery. 2016;68(4):351-5. 29. Sinnet P, Chellappa PM, Kumar S, Ethirajulu R, Thambi S. Comparative study on the diagnostic accuracy of the RIPASA score over Alvarado score in the diagnosis of acute appendicitis. Journal of Evidence Based Medicine and Healthcare. 2016;3(80):4318-21. 30. Cervellin G, Mora R, Ticinesi A, Meschi T, Comelli I, Catena F, et al. Epidemiology and outcomes of acute abdominal pain in a large urban Emergency Department: retrospective analysis of 5,340 cases. Annals of translational medicine. 2016;4(19):362. 31. Ozkan S, Duman A, Durukan P, Yildirim A, Ozbakan O. The accuracy rate of Alvarado score, ultrasonography, and computerized tomography scan in the diagnosis of acute appendicitis in our center. Nigerian journal of clinical practice. 2014;17(4):413-8. 32. Shogilev DJ, Duus N, Odom SR, Shapiro NI. Diagnosing appendicitis: evidence-based review of the diagnostic approach in 2014. The western journal of emergency medicine. 2014;15(7):859-71.

AC C

339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366

13

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

368

371 372

TE D EP

370

Figure 1. The PRISMA flow chart of study selection.

Figure 2. The methodological assessment of the included studies according to the QUADAS-2 TOOL.

AC C

369

14

ACCEPTED MANUSCRIPT Sensitivity (95% CI)

0,2

0,4 0,6 Sensitivity

Pooled Sensitivity = 0,94 (0,92 to 0,95) Chi-square = 59,97; df = 11 (p = 0,0000) 1 Inconsistency (I-square) = 81,7 %

0,8

M AN U

373

Figure 3a. Pooled sensitivity of the RIPASA test. CI, confidence intervals.

375 376

EP

0,2

0,4 0,6 Specificity

0,8

1

0,81 0,62 0,85 0,25 0,28 0,91 0,74 0,11 0,09 0,65 0,80 0,36

(0,72 - 0,89) (0,52 - 0,71) (0,55 - 0,98) (0,01 - 0,81) (0,14 - 0,45) (0,71 - 0,99) (0,64 - 0,82) (0,00 - 0,48) (0,02 - 0,23) (0,41 - 0,85) (0,28 - 0,99) (0,29 - 0,43)

Pooled Specificity = 0,55 (0,51 to 0,59) Chi-square = 150,42; df = 11 (p = 0,0000) Inconsistency (I-square) = 92,7 %

AC C

0

Specificity (95% CI)

2011; Chong 2012; Alnjadat 2012; Garcia 2012; Ferlengez 2013; Erdem 2014; Nanjundaiah 2015; Liu 2015; Verma 2015; Walczak 2016; Sinnet 2016: Singla 2016; Golden

TE D

374

(0,93 - 1,00) (0,91 - 0,95) (0,81 - 0,97) (0,77 - 0,97) (0,95 - 1,00) (0,92 - 0,98) (0,91 - 0,98) (0,96 - 1,00) (0,76 - 0,95) (0,89 - 0,99) (0,85 - 0,99) (0,68 - 0,86)

SC

0

0,98 0,93 0,91 0,90 1,00 0,96 0,95 1,00 0,88 0,96 0,96 0,78

RI PT

2011; Chong 2012; Alnjadat 2012; Garcia 2012; Ferlengez 2013; Erdem 2014; Nanjundaiah 2015; Liu 2015; Verma 2015; Walczak 2016; Sinnet 2016: Singla 2016; Golden

Figure 3b. Pooled sensitivity of RIPASA test. CI, confidence intervals.

15

ACCEPTED MANUSCRIPT Sensitivity 1

SROC Curve

Symmetric SROC AUC = 0,9431 SE(AUC) = 0,0433 Q* = 0,8814 SE(Q*) = 0,0558

0,9

0,8

RI PT

0,7

0,6

0,5

SC

0,4

0,2

0,1

0

0

0,2

0,4

0,8

1

EP

TE D

Figure 3c. The area under the roc curve (AUC) for the RIPASA test.

AC C 0,01

0,6

1-specificity

377 378 379

M AN U

0,3

1 Diagnostic Odds Ratio

2011; Chong 2012; Alnjadat 2012; Garcia 2012; Ferlengez 2013; Erdem 2014; Nanjundaiah 2015; Liu 2015; Verma 2015; Walczak 2016; Sinnet 2016: Singla 2016; Golden

Diagnostic OR (95% CI) 215,47 (48,28 - 961,63) 22,05 (12,98 - 37,45) 57,20 (9,80 - 333,85) 3,08 (0,26 - 37,09) 61,42 (3,48 - 1.084,40) 252,86 (49,14 - 1.301,02) 55,24 (25,00 - 122,04) 32,29 (1,22 - 855,56) 0,67 (0,16 - 2,78) 39,46 (10,13 - 153,77) 86,00 (6,32 - 1.169,48) 1,93 (1,10 - 3,41)

Random Effects Model Pooled Diagnostic Odds Ratio = 24,66 (8,06 to 75,43) Cochran-Q = 112,74; df = 11 (p = 0,0000) 100,0 Inconsistency (I-square) = 90,2 % Tau-squared = 3,0828

380 381

Figure 3d. The pooled diagnostic odds ratio for the RIPASA test.

16

M AN U

Figure 4a. The pooled sensitivity of the Alvarado test. CI, confidence intervals.

386 387

AC C

EP

TE D

382 383 384 385

SC

RI PT

ACCEPTED MANUSCRIPT

Figure 4b. The pooled specificity of the Alvarado test. CI, confidence intervals.

388 389 390 391

17

ACCEPTED MANUSCRIPT Sensitivity 1

SROC Curve

Symmetric SROC AUC = 0,7944 SE(AUC) = 0,0265 Q* = 0,7310 SE(Q*) = 0,0229

0,9

0,8

RI PT

0,7

0,6

0,5

SC

0,4

0,3

0,1

0

0

0,2

0,4

0,8

1

Figure 4c. The area under the roc curve (AUC) for the Alvarado test.

AC C

EP

TE D

394 395

0,6

1-specificity

392 393

M AN U

0,2

396 397

Figure 4d. The pooled diagnostic odds ratio for the Alvarado test.

18

ACCEPTED MANUSCRIPT

Table 1. Study characteristics No of patients

Inclusion criteria

Exclusion criteria

2011; Chong

Prospective

192

Patients of all age groups presenting with RIF pain and suspicion of AA

2012; Alnjadat

Prospective

600

2012; Garcia

Prospective

70

2012; Ferlengez 2013; Erdem

Prospective

45

All patients >14 years old who underwent appendectomy Patients with abdominal pain suggesting AA N/A

Patients presenting with non-RIF pain and those who had been previously admitted for other complaints but subsequently developed RIF pain during their admission episodes N/A

Prospective

113

2014; Nanjundaiah 2015; Liu

Prospective

206

Retrospective cohort

297

Patients eligible to calculate RIPASA and Alvarado scores + patients with abdominal pain

2015; Verma

Prospective

100

Prospective Prospective

92 109

Patients of all age groups with RIF pain suspected to have AA N/A All patients with RIF pain, vomiting and fever. All patients presenting with RIF pain

SC

N/A N/A

TE D

M AN U

Patients admitted at the hospital due to suspected AA All patients with RIF pain

N/A

RI PT

Type of study

Children <15 years old, pregnant women, patients with RIF mass, h/x urolithiasis, h/x PID Children (<18 years old), pregnant female patients, patients allergic to iodinated contrast material, cases of AA with appendectomy performed >24 h following CT examination N/A

AC C

EP

N/A <12years old, complications of AA (appendicular mass and malignancy, perforated appendix, elective appendectomy) 2016; Singla Prospective 50 Multiple co-morbid diseases, coagulation disorders, adverse anesthetic history, suspected/proven malignancy 2016; Golden Prospective 287 Patients > 11 years old and a CT Patients who were incarcerated, pregnant, post-appendectomy, unable to ordered to evaluate for appendicitis. have intravenous contrast, unable to speak or read English, or who lacked capacity to provide informed consent/assent were excluded. Table 1. The characteristics of the studies that were included in the present meta-analysis. N/A, data were not available; RIF, right iliac fossa; AA, acute appendicitis; PID, pelvic inflammatory disease 2015; Walczak 2016; Sinnet

19

ACCEPTED MANUSCRIPT

Mal e 92

Sex Femal e 100

Confirmed AA Histologically 101

Other Diagnoses

RI PT

Table 2. Patients’ demographics Age

25,1 ± 12,7

2012; Alnjadat

360

240

498

102 NA. Mesenteric lymphadenitis (3), Ruptured ovarian cyst (4), Ectopic pregnancy (4), Crohn’s disease (3), Cecal tumors (3)

36 27

34 18

57 41

N/A N/A

2013; Erdem

Mean: 26,52 26,15 (M) 27,08 (F) 33,8 30,64+12,09 30,2 ± 10,1

62

51

77

N/A

2014; Nanjundaiah 2015; Liu

27,82 ± 9,262 47,9 ± 17,6

127

79

184

Operated: Ruptured Ovarian cyst (3) IBD (2), Carcinoid tumor (1) Non-operated: Urinary system disease (8), Gastroenteritis (4), Mesenteric lymphadenitis (1), IND (1), Gynecologic problem (1) N/A

158

139

187

Yes

2015; Verma

28,10 ± 10,887 38

67

33

91

Crohn’s disease (2) Gastrointestinal perforation (3) Inflammation of caecum and/or ascending colon (9) Adhesive ileus (5) Intussusception (2) Volvulus (4) Right ureter tumor (3) Right ureter calculus (8) Right accessory tumor (12) Right pelvic endometrioma (2) Uterine myoma (3) Appendiceal tumor (3) Gastrointestinal tumor (44) N/A

46

46

N/A (US done) N/A

N/A N/A

N/A

N/A

AC C

EP

TE D

M AN U

SC

2011; Chong

2012; Garcia 2012; Ferlengez

Normal appendix (23), periappendicitis (7)

Performed CT

Mesenteric lymphadenitis (6), Ruptured Ovarian cyst (2), extra-uterine pregnancy (1), acute No pancreatitis (1), necrotic omental fragment (1) 2016; Sinnet 28 40 69 89 N/A N/A 2016; Singla 25,74 32 18 45 N/A N/A 2016; Golden 33 ± 15,2 115 172 94 N/A Yes Table 2. The characteristics of the patients that were included in the present meta-analysis and other diagnoses that were confirmed post-operatively. Data are mean ± SD or median (range) unless otherwise specified. N/A, data were not available; M, male; F, female. Age in years. 2015; Walczak

57

20

ACCEPTED MANUSCRIPT

Table 3. Diagnostic values of the scoring tests

2013; Erdem 2014; Nanjundaiah 2015; Liu 2015; Verma 2015; Walczak 2016; Sinnet 2016; Singla 2016; Golden

85,34% vs 86,25% 92,2% vs 92% 96,3% vs 92,7% Ν/Α

97,37% vs 71,43% 64,9% vs 34,8% 68,8% vs 60% Ν/Α

28% vs 75%

75% vs 88%

96,2% vs 58,9% 95,2% vs 63,1% 100% vs 82,42% 88% vs 85%

90,5% vs 85,7% 73,6% vs 80,9% 11,11% vs 44,44% 9% vs 16%

98,9% vs 97,3% N/A

100% vs 66% 73,1% vs 19,1% N/A

91,92% vs 93,75% 68% vs 74%

95,51% vs 65,16% 95,6% vs 53,3% 78% vs 61%

65% vs 90%

92,39% vs 96,67% 97,7% vs 100% 39% vs 53%

80% vs 100% 36% vs 74%

AUC

PLR

NLR

N/A

N/A

15,7% vs 18,3%

0,9183 vs 0,8651 0,9149 vs 0,7432 0,93 vs 0,89

Ν/Α

Ν/Α

2,4368% vs 2,0880% 5,93% vs 2.908% Ν/Α

0,1105% vs 0,4065% 10,4% vs 15,2% Ν/Α

77% vs 80%

25% vs 12%

N/A

N/A

96,2% vs 58,9%

N/A

N/A

N/A

87,2% vs 69,7%

N/A

0,857 vs 0,818 (?) 0,982 vs 0,849 N/A

N/A

N/A

100% vs 20% 20% vs 29%

92% vs 79%

8,1% vs 6,3%

N/A

N/A

N/A

N/A

N/A

N/A

N/A

N/A

76,47% vs 36,73% 66,7% vs 19,2% 76% vs 79%

89,9% vs 69,73% 94% vs 58%

7,61% vs 3,33%

N/A

N/A

N/A

N/A

N/A

N/A

0,943 vs 0,862 0,960 vs 0,580 0,67 vs 0,0,72

130% vs 220%

50% vs 60%

Ν/Α

RI PT

81.32% vs 87,91% 61,8% vs 68,6% 84,6% vs 69,2% 25% vs 75%

Negative appendicitis rate 14,7% vs 13,8% 7,8% vs 8%

SC

98,02% vs 68,32% 93,2% vs 73,7% 91,2% vs 89,5% 90,2% vs 34,1% 100% VS 82 %

Diagnostic accuracy 91,83% vs 86,51% 91,5% vs 74,3%

M AN U

2012; Ferlengez

NPV

TE D

2012; Garcia

PPV

EP

2012; Alnjadat

Specificity

AC C

2011; Chong

Sensitivity

2% vs 0%

Table 3. The diagnostic parameters of the two scoring tests (RIPASA vs Alvarado) among the included studies. N/A, data were not available; PPV, positive predictive value; NPV, negative predictive value; AUC, area under the roc-curve; PLR, positive likelihood ratio; NLR, negative likelihood ratio.

21

AC C

EP

TE D

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

22

ACCEPTED MANUSCRIPT •

RIPASA test had greater sensitivity than Alvarado test



Alvarado test had greater specificity than RIPASA test



RIPASA test presented greater Area Under the Roc Curve (AUC) than Alvaradi test

RI PT

RIPASA had higher diagnostic Odds Ratio than Alvarado test

AC C

EP

TE D

M AN U

SC



ACCEPTED MANUSCRIPT

PRISMA 2009 Checklist Section/topic

Reported on page #

# Checklist item

Title

RI PT

TITLE 1

Identify the report as a systematic review, meta-analysis, or both.

1

2

Provide a structured summary including, as applicable: background; objectives; data sources; study eligibility criteria, participants, and interventions; study appraisal and synthesis methods; results; limitations; conclusions and implications of key findings; systematic review registration number.

3

Rationale

3

Describe the rationale for the review in the context of what is already known.

4

Objectives

4

Provide an explicit statement of questions being addressed with reference to participants, interventions, comparisons, outcomes, and study design (PICOS).

5

Protocol and registration

5

Indicate if a review protocol exists, if and where it can be accessed (e.g., Web address), and, if available, provide registration information including registration number.

-

Eligibility criteria

6

Specify study characteristics (e.g., PICOS, length of follow-up) and report characteristics (e.g., years considered, language, publication status) used as criteria for eligibility, giving rationale.

6

Information sources

7

Describe all information sources (e.g., databases with dates of coverage, contact with study authors to identify additional studies) in the search and date last searched.

5

Search

8

Present full electronic search strategy for at least one database, including any limits used, such that it could be repeated.

6

Study selection

9

State the process for selecting studies (i.e., screening, eligibility, included in systematic review, and, if applicable, included in the meta-analysis).

6

Structured summary

SC

ABSTRACT

M AN U

INTRODUCTION

AC C

EP

TE D

METHODS

Data collection process

10

Describe method of data extraction from reports (e.g., piloted forms, independently, in duplicate) and any processes for obtaining and confirming data from investigators.

6

Data items

11

List and define all variables for which data were sought (e.g., PICOS, funding sources) and any assumptions and simplifications made.

6

Risk of bias in individual studies

12

Describe methods used for assessing risk of bias of individual studies (including specification of whether this was done at the study or outcome level), and how this information is to be used in any data synthesis.

6

Summary measures

13

State the principal summary measures (e.g., risk ratio, difference in means).

7

Synthesis of results

14

Describe the methods of handling data and combining results of studies, if done, including measures of consistency 2 (e.g., I ) for each meta-analysis.

7

ACCEPTED MANUSCRIPT

PRISMA 2009 Checklist Page 1 of 2

Reported on page #

# Checklist item

RI PT

Section/topic Risk of bias across studies

15

Specify any assessment of risk of bias that may affect the cumulative evidence (e.g., publication bias, selective reporting within studies).

6

Additional analyses

16

Describe methods of additional analyses (e.g., sensitivity or subgroup analyses, meta-regression), if done, indicating which were pre-specified.

-

Study selection

17

Give numbers of studies screened, assessed for eligibility, and included in the review, with reasons for exclusions at each stage, ideally with a flow diagram.

Study characteristics

18

For each study, present characteristics for which data were extracted (e.g., study size, PICOS, follow-up period) and provide the citations.

8

Risk of bias within studies

19

Present data on risk of bias of each study and, if available, any outcome level assessment (see item 12).

9

Results of individual studies

20

For all outcomes considered (benefits or harms), present, for each study: (a) simple summary data for each intervention group (b) effect estimates and confidence intervals, ideally with a forest plot.

9

Synthesis of results

21

Present results of each meta-analysis done, including confidence intervals and measures of consistency.

9

Risk of bias across studies

22

Present results of any assessment of risk of bias across studies (see Item 15).

9

Additional analysis

23

Give results of additional analyses, if done (e.g., sensitivity or subgroup analyses, meta-regression [see Item 16]).

-

Summary of evidence

24

Summarize the main findings including the strength of evidence for each main outcome; consider their relevance to key groups (e.g., healthcare providers, users, and policy makers).

11

Limitations

25

Discuss limitations at study and outcome level (e.g., risk of bias), and at review-level (e.g., incomplete retrieval of identified research, reporting bias).

11

Conclusions

26

Provide a general interpretation of the results in the context of other evidence, and implications for future research.

12

27

Describe sources of funding for the systematic review and other support (e.g., supply of data); role of funders for the systematic review.

-

FUNDING Funding

M AN U

TE D

EP

AC C

DISCUSSION

SC

RESULTS

8

From: Moher D, Liberati A, Tetzlaff J, Altman DG, The PRISMA Group (2009). Preferred Reporting Items for Systematic Reviews and Meta-Analyses: The PRISMA Statement. PLoS Med 6(7): e1000097. doi:10.1371/journal.pmed1000097

For more information, visit: www.prisma-statement.org. Page 2 of 2