Treating a broader range of depressed adolescents with combined therapy

Treating a broader range of depressed adolescents with combined therapy

Accepted Manuscript Treating a broader range of depressed adolescents with combined therapy Simon Foster PhD , Prof. Meichun Mohler-Kuo ScD PII: DOI:...

889KB Sizes 0 Downloads 19 Views

Accepted Manuscript

Treating a broader range of depressed adolescents with combined therapy Simon Foster PhD , Prof. Meichun Mohler-Kuo ScD PII: DOI: Reference:

S0165-0327(18)30384-7 https://doi.org/10.1016/j.jad.2018.08.027 JAD 10024

To appear in:

Journal of Affective Disorders

Received date: Revised date: Accepted date:

23 February 2018 26 July 2018 7 August 2018

Please cite this article as: Simon Foster PhD , Prof. Meichun Mohler-Kuo ScD , Treating a broader range of depressed adolescents with combined therapy, Journal of Affective Disorders (2018), doi: https://doi.org/10.1016/j.jad.2018.08.027

This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

ACCEPTED MANUSCRIPT Highlights 

Response distributions of depressed adolescents were compared across treatments Cognitive-behavioral therapy was no more effective than placebo



Fluoxetine was effective only in the middle range of response levels



Combining cognitive-behavioral therapy and fluoxetine was broadly effective



Combining the two treatments was also effective in low responders

AC

CE

PT

ED

M

AN US

CR IP T



1

ACCEPTED MANUSCRIPT

Treating a broader range of depressed adolescents with combined therapy

Authors:

Meichun Mohler-Kuo 1,2,3, ScD, Prof.

Author affiliations: Epidemiology, Biostatistics and Prevention Institute

AN US

1

University of Zurich Hirschengraben 84

M

8001 Zürich, Switzerland

Swiss Research Institute for Public Health and Addiction associated with the University of

Zurich

PT

Konradstrasse 32

ED

2

CR IP T

Simon Foster 1,2, PhD

La Source, School of nursing sciences

AC

3

CE

8031 Zurich, Switzerland

HES-SO University of Applied Sciences and Arts of Western Switzerland Av. Vinet 30

1004 Lausanne, Switzerland

2

ACCEPTED MANUSCRIPT

Corresponding author:

AC

CE

PT

ED

M

AN US

CR IP T

Simon Foster 1

1

Epidemiology, Biostatistics and Prevention Institute

University of Zurich Hirschengraben 84 8001 Zürich, Switzerland Phone: +41 43 556 40 04 Fax:

+41 41 44 448 11 70

Email: [email protected] 3

ACCEPTED MANUSCRIPT Abstract Background: Traditional statistical analyses of clinical trials encompass the central tendency of outcomes and, hence, are restricted to a treatment‘s average effectiveness. Our aim was to get a more complete picture of the effectiveness of standard treatment options for adolescent depression, by analyzing treatment effects across low, middle, and high levels of response.

CR IP T

Methods: Secondary data analysis was performed of the Treatment for Adolescents with Depression Study (TADS, ClinicalTrials.gov, NCT00006286), a randomized controlled trial comparing fluoxetine (FLX), cognitive-behavioral therapy (CBT), and their combination (COMB) against placebo treating adolescents with major depression (n=439). The

AN US

proportional change from baseline to week 12 in the Children‘s Depression Rating ScaleRevised was used as an index of response. Response levels were analyzed via quantile regression models, thereby estimating treatment effects across the entire response level

M

distribution, adjusted for baseline depression, study site, and patients‘ treatment expectancies. Results: Whereas CBT was no more effective than placebo across response levels, COMB

ED

was more effective than FLX in that its quantile treatment effects were both larger in magnitude and spread out across a broader range of response levels, including the low end of

CE

1.33-1.45).

PT

the response level distribution. Cohen‘s d of the difference was 1.39 (95% confidence interval

Limitations: Ad-hoc analysis using data from a trial that was not originally designed to

AC

accommodate such analysis. Conclusion: The combination of cognitive-behavioral therapy and fluoxetine was more effective than either treatment used alone, not just in average effectiveness, but in the breadth of patients in whom it was effective.

Key words

4

ACCEPTED MANUSCRIPT adolescent; major depressive disorder; second-generation antidepressive agents; cognitive

AC

CE

PT

ED

M

AN US

CR IP T

therapy; combined modality therapy; statistical data interpretation

5

ACCEPTED MANUSCRIPT Introduction Among young people, major depression accounts for a substantial portion of the burden of disease (Gore et al., 2011; Merikangas et al., 2010). Furthermore, depression has a variety of adverse effects, both immediately and in adulthood. In particular, depression during adolescence is associated with an increased risk for depression and other psychopathology in

CR IP T

adulthood (Fergusson and Woodward, 2002; Jones, 2013; Rutter et al., 2006); an increased risk for self-harm and suicidal behaviors (Fergusson and Woodward, 2002; Gould et al., 1998; Hawton et al., 2012; Thapar et al., 2012); an increased risk for reduced social functioning and work difficulties (Costello and Maughan, 2015; Fergusson and Woodward, 2002; Thapar et

AN US

al., 2012); and an increased risk of future adverse birth outcomes among women (NkansahAmankra and Tettey, 2015). It is clear, then, that depression in adolescents needs to be treated

M

swiftly and effectively.

The ‗Treatment for Adolescents with Depression Study‘ (TADS) was a corner-stone clinical

ED

trial that tested the effectiveness of standard treatment options for major depression in adolescents, including treatment with the selective serotonin reuptake inhibitor (SSRI)

PT

fluoxetine, treatment with cognitive-behavioral therapy, and these two treatments combined

CE

(March et al., 2004; Thapar et al., 2012). Statistical analysis of the acute phase indicated superiority of fluoxetine used alone, and of the combined treatment over a pill placebo, as

AC

well as of combined treatment over either fluoxetine or cognitive-behavioral therapy used alone; cognitive-behavioral therapy, however, failed to outperform the placebo (March et al., 2004).

Treatment effects were estimated using standard statistical methods, including mixed-effects and logistic regression (March et al., 2004). Whereas such analyses are indispensable to establishing a treatment‘s general level of effectiveness, they only truly examine outcomes for 6

ACCEPTED MANUSCRIPT each treatment in terms of their central tendency, for example their mean value across all subjects within each treatment arm. However, looking at central tendency fails to reveal whether any detected treatment effect exists only on average or across the entire distribution of outcome values (Cleveland, 1994; Koenker and Hallock, 2001; Wilcox, 2012).

CR IP T

The quantile regression framework allows for a more complete look at the differences between the outcome distributions of two treatment conditions. With quantile regressions, the quantiles of an outcome‘s distribution are modeled as a function of the predictor variable(s), in contrast to the traditional regression framework, wherein only the mean of the outcome is

AN US

modeled (Beyerlein, 2014; Briollais and Durrieu, 2014; Koenker and Hallock, 2001; Petscher and Logan, 2014). This allows for comparing all quantiles of the treatment condition‘s outcome distribution against the corresponding quantiles of the placebo condition‘s outcome

M

distribution. Comparing quantiles is the most informative way of comparing two groups because it shows whether the groups differ in a simple or in a complicated way (Cleveland,

ED

1994; Wilcox, 2012). In the simple case, one group has consistently higher values than the other. In a more complicated case, low, middle, and high outcome values differ by different

PT

amounts. It is possible that the groups differ in an even more complicated way, for example

CE

with one group outperforming the other at lower outcome values, but being outperformed at higher outcome values. Such complicated group differences cannot be uncovered by

AC

comparing the groups‘ central tendencies.

Assessing a treatment‘s effect across the entire outcome range would be especially attractive if the outcome is an index of response (e.g., indexing the degree of change from baseline to the trial‘s end), since such an analysis would show whether a given treatment has a consistent versus differential effect on low, middle, and high response levels. Thus, rather than examining whether the average response level is higher among subjects receiving active 7

ACCEPTED MANUSCRIPT treatment relative to controls, one could examine whether the treatment has some level of effectiveness across the entire range of response levels, including special subject subsets like low and high responders (Beyerlein, 2014; Briollais and Durrieu, 2014; Petscher and Logan, 2014). A treatment that is able to exert a consistent effect across response levels is more

CR IP T

clinically valuable than a treatment with a more restricted breadth of effectiveness.

Our current aim was to gain a more complete picture of TADS treatment effects, by re-

evaluating them across the entire response range. We did so by analyzing patients‘ change in depression severity from baseline to the end of the TADS acute phase as an index of response,

AN US

using quantile regression models (Beyerlein, 2014; Briollais and Durrieu, 2014; Koenker and Hallock, 2001; Petscher and Logan, 2014). This approach allowed us to analyze both central tendency and outcome extremes, thereby providing more insights into the breadth of the

M

treatments‘ effectiveness than traditional regression analyses. Since we are not aware of any comparable previous analyses, the study was exploratory. The research question was whether

CE

Methods

PT

ED

treatment effectiveness varied between low, middle, and high response levels.

Study design

AC

The current study entailed secondary analysis of TADS trial data (ClinicalTrials.gov, NCT00006286). TADS is a randomized controlled trial that was designed to compare the effectiveness of cognitive-behavioral therapy CBT), a specific SSRI called fluoxetine, and the two treatments combined, all versus a pill placebo for the treatment of adolescents with a major depressive disorder. We analyzed data from the acute phase of the trial, which encapsulated the first 12 weeks of treatment. The data was acquired through a limited access data certificate by the first author from the National Institute of Mental Health (NIMH) Data 8

ACCEPTED MANUSCRIPT Archive (NDA) of the United States of America (USA). Details of study design have already been published elsewhere (March et al., 2004; TADS Team, 2003). All patients and at least one of their parents provided written informed consent. The Duke University Medical Center (Durham, NC) and the institutional review boards at each study site approved and monitored the study protocol; TADS was monitored quarterly by the data safety and monitoring board of

CR IP T

the NIMH.

Study population

Detailed descriptions have already been published of TADS-subject eligibility criteria, subject

AN US

recruitment, sample size calculation, and data collection, as well as a detailed description of the TADS sample, including its comparison against both clinical and epidemiological samples (March et al., 2004; TADS Team, 2003, 2005). The final sample consisted of 439 adolescents,

M

recruited at 13 clinical sites, who met the criteria for major depressive disorder, as defined in the Diagnostic and Statistical Manual of Mental Disorders, fourth edition (DSM-IV)

ED

(American Psychiatric Association, 1994). All patients were 12-17 years old, and 54.4% were female. In terms of racial background, 73.8% were Caucasian, 12.5% were African American,

PT

8.9% were Hispanic, and 4.8% were other. Socio-economically, 61.2% reported family

CE

incomes of $40,000 or higher over the preceding 12 months. Almost half (47.8%) had at least

AC

one coexisting DMS-IV disorder.

Interventions

Detailed descriptions of the interventions offered in TADS have been published previously (March et al., 2004; TADS Team, 2003). The acute phase trial included four treatment arms: a pill placebo (PBO); the active drug, fluoxetine (FLX); cognitive-behavioral therapy (CBT); and a combination of FLX and CBT (COMB). Eligible patients were randomly assigned to one of the four treatments in a 1:1:1:1 allocation ratio (PBO: n = 112, FLX: n = 109, CBT: n = 9

ACCEPTED MANUSCRIPT 111, COMB: n = 107), with computerized randomization stratified by study site and subject gender (March et al., 2004).

Outcomes Depression was assessed using the Children‘s Depression Rating Scale-Revised (CDRS-R),

CR IP T

both at baseline and at week 12 by independent evaluators blinded to treatment arm assignments (March et al., 2004). The CDRS-R is a validated, clinician-administered rating scale that integrates information collected interviewing both the adolescent and a parent

(Mayes et al., 2010; Poznanski and Mokros, 1995). The raw summary score was used as a

AN US

measure of depression (Curry et al., 2006; March et al., 2004).

To satisfy our primary study aim, we analyzed the patients‘ response to treatment rather than

M

the summary score itself. As an index of each patient‘s response level, we used his or her proportional change in depression from baseline to the end of acute phase, since this measure

ED

is intuitive and routinely used to define ―response‖ in clinical trials (Ali and Lam, 2011; Emslie et al., 2010; Moller et al., 2012; Tao et al., 2009; Usala et al., 2008). Note that we did

PT

not dichotomize this measure around some threshold (usually 50%); instead, we used raw

CE

proportions to estimate treatment effects across the entire spectrum of response levels. The quantile treatment effect was then the difference between the response levels of the treatment

AC

and the placebo group at a particular quantile. For example, the treatment effect at the 50th quantile (i.e. the median) was the difference between the median of response levels in the treatment group and the median of response level in the placebo group.

Adjustment variables Adjustment variables included baseline depression scores, study site, and patients‘ treatment expectancies. Baseline depression was included to control for regression to the mean in the 10

ACCEPTED MANUSCRIPT response levels (Laird, 1983; Vickers and Altman, 2001), whereas study site was included to control for study site effects (March et al., 2004). The patients‘ treatment expectancies were included because it was impossible to totally blind all the subject groups to their treatment arm (those receiving CBT, either alone or in combination with FLX, invariably knew this) (March et al., 2004). Furthermore, the study did not include a CBT-plus-PBO-arm. As a

CR IP T

result, treatment expectancies could be a confounder of treatment effectiveness (Weersing and Brent, 2006).

Before randomization, patients‘ expectancies as to how much they would improve if

AN US

randomized to each of the three active treatments were assessed separately for FLX, CBT, and COMB (Curry et al., 2006). A single expectancy score was generated by assigning each patient the expectancy score corresponding to the treatment he or she actually received (Curry

M

et al., 2006). Patients in the placebo condition were assigned their expectancy about FLX

ED

(Curry et al., 2006).

Missing values

PT

Missing values for the outcome measure were replaced by the scores predicted by the random

CE

coefficient regression model used in the original TADS analyses. These predicted scores were included in the TADS data set and previously used to deal with missing depression scores

AC

(Curry et al., 2006). According to the initial authors, the predicted-score imputation method provides a less biased estimate of the treatment effect than the last carried forward approach. In TADS, 61 of 439 patients (13.9%) had a missing outcome value.

Missing values for patients‘ treatment expectancies were imputed using the method proposed by Stekhoven and Bühlmann (2012). This method is based on machine learning and was

11

ACCEPTED MANUSCRIPT shown to be comparative to established imputation methods. Missing expectancy values were replaced for 36 of 439 (8.2%) patients.

Statistical analysis The intention-to-treat principle was used for all analyses. We first calculated the quantiles of

CR IP T

each treatment arm‘s response levels and used quantile-quantile-plots to provide unadjusted treatment-placebo-comparisons. In a quantile-quantile-plot, the corresponding quantiles of two distributions are plotted against each other. If the distributions do not differ, the quantiles of the two distributions do not differ and, hence, all values lie on the diagonal of the plot. In

AN US

contrast, if the distributions differ in any way, the values deviate from the diagonal

accordingly. Quantile-quantile-plots thereby provide a comprehensive summary of a

M

treatment-placebo-difference across the entire range of response levels.

We then employed quantile regression models to estimate treatment effects across response

ED

levels. To provide a comprehensive assessment of each treatment effect across the entire range of response levels, we modelled the 1st to the 99th quantiles with intervals of 1 (i.e., the

PT

1st, 2th, 3th, …, 99th quantiles). Assessing such a set of quantiles produces a curve that shows

CE

each treatment effect as a function of the quantiles. The question then is whether the curves of the treatments differ from one another. To quantify differences between the various

AC

treatments‘ curves, we calculated areas under the curve (AUC), and did so for both above(positive area) and below-zero values (negative area) on the y-axis. As a result, curves containing values less than zero were penalized.

To derive confidence intervals (CI) for the AUCs, we used the non-parametric bootstrap procedure, which allowed us to empirically estimate the probability distribution of an arbitrary statistical quantity (Efron and Tibshirani, 1994). Bootstrapping was stratified by 12

ACCEPTED MANUSCRIPT treatment condition (Efron and Tibshirani, 1994) and encompassed 3000 replications. We then used the ―bias-corrected and accelerated‖ method to calculate 95% CIs; this is the recommended method for deriving general-purpose bootstrap CIs (Efron and Tibshirani, 1994).

CR IP T

To formally test for differences between the treatment arms‘ AUCs, we used the samples of paired AUC values generated by the bootstrap procedure. That is, for each bootstrap iteration, an AUC was calculated for CBT, FLX, and COMB, resulting in an estimated AUC for each treatment within that particular bootstrap sample. We compared these paired AUCs via paired

AN US

t-tests (one-tailed), accompanied by Cohen‘s d for paired samples as an effect size measure. Note that, due to the large sample size of 3000 in these tests, the p-values are non-informative and all interpretations of results should be based on the effect sizes. We examined the

M

following comparisons: a) CBT versus FLX

ED

b) CBT versus COMB

PT

c) FLX versus COMB

CE

We estimated two quantile regression models. Model 1 included the treatment effects adjusted for baseline depression. In model 2, we additionally adjusted for study site and patients‘

AC

treatment expectancies. Analyses were conducted using the statistical software program R, version 3.3.2 (R Core Team, 2016). Quantile regressions were estimated via the add-on package ―quantreg‖ (Koenker, 2015). AUCs were estimated using the trapezoid rule, as implemented in the package ―flux‖ (Jurasinski et al., 2014). Bootstrapping was done using the package ―boot‖ (Canty and Ripley, 2016).

13

ACCEPTED MANUSCRIPT Results Table 1 provides the 10th to 90th response level quantiles of each treatment arm. Response levels were rather dispersed. In the placebo condition, the response level on the 10th quantile was 0.067 and as high as 0.61 on the 90th quantile. This corresponds to a reduction of the baseline depression score by 6.7% on the 10th quantile and by 60.1% on the 90th quantile.

CR IP T

Similar ranges of response levels were found for CBT and FLX. In contrast, COMB had clearly higher response levels, achieving a response level of 0.184 – and thus a reduction of the baseline depression score by 18.4% - already on the 10th quantile. COMB also had the

AN US

highest response level across all treatment arms on the 90th quantile, with a level of 0.667.

Figure 1 shows quantile-quantile-plots of all treatment-placebo-comparisons. As can be seen, the PBO vs. CBT response levels hardly differed, as indicated by the fact that the vast

M

majority of quantile values lied on the diagonal of the plot. In contrast, the FLX quantiles were higher than the PBO quantiles at the middle range of response levels, whereas no

ED

differences were present among low and high quantiles. Finally, throughout most of the range of the response level distribution, the COMB quantiles were higher than the PBO quantiles.

PT

Only at the higher end the difference tapered off. Thus, considering the unadjusted quantiles,

CE

only COMB clearly outperformed PBO. FLX showed only some sign of effectiveness in the

AC

middle range of response levels, whereas CBT performed similarly to PBO.

The upper half of Figure 2 depicts the quantile treatment effects estimated for model 1. Every 5th quantile is shown to increase the figure‘s clarity (complete results are available in the online supplementary material, Tables S1-2). The picture gained from the unadjusted analyses provided in table 1 and figure 1 is largely confirmed: 1. CBT was no more effective than placebo across the entire range of response levels.

14

ACCEPTED MANUSCRIPT 2. FLX was effective at the middle range of response levels, however, the effect of FLX was restricted to a rather small range of response levels, spanning from approximately the 24th and 66th quantiles. 3. COMB outperformed PBO across a large range of response levels, spanning from roughly the 2th to the 89th quantiles.

CR IP T

Thus, whereas both FLX and COMB exhibited some level of effectiveness, COMB was effective across a broader range of response levels than FLX. Further adjusting the treatment effects for study site and the patients‘ treatment expectancies (model 2) produced the results in the lower half of Figure 1, where the general picture remains the same, except that the

AN US

adjustments diminished COMB‘s effectiveness among higher-level responders.

Comparing the AUCs confirmed that there was an order of effectiveness among the

M

treatments, with COMB the most effective (table 2). Specifically, the comparisons derived from model 1 indicate that FLX was more effective than CBT (Cohen‘s d: 2.09, 95% CI:

ED

2.03-2.16) and COMB was more effective than both FLX (Cohen‘s d: 2.56, 95% CI: 2.492.63) and CBT (Cohen‘s d: 5.09, 95% CI: 4.99-5.19, table 2). Considering model 2, the

PT

difference between COMB and FLX was less pronounced, but COMB still clearly

CE

outperformed FLX (Cohen‘s d: 1.39, 95% CI: 1.33-1.45), in line with Figure 1. Note that the Cohen‘s d-values correspond to what is commonly considered a ―large effect‖ (Bech, 2015;

AC

Leucht et al., 2012; Weisz et al., 2013; Weisz et al., 2006).

Additional analyses We conducted some additional analyses to examine the robustness of our results. First, we calculated the AUC after truncating the COMB quantile treatment effects at the average of the three highest FLX effects. This was done to assure that the high AUC of COMB was not simply due to a small number of large COMB effects. Second, we tested the AUC comparison 15

ACCEPTED MANUSCRIPT at a stricter level, by calculating the AUCs for the lower bound of the 95% CIs, rather than for the effects themselves. As can be seen in Table 3, these additional analyses fully confirmed the above-reported findings.

Finally, some authors have claimed that proportional change may be a problematic outcome

CR IP T

(Curran-Everett and Williams, 2015; Vickers, 2001). Alternative approaches are to use the absolute change or to use analysis of co-variance (ANCOVA) predicting the end-of-trial

outcome by the treatment conditions, with the outcome‘s baseline values as an adjustment variable (Curran-Everett and Williams, 2015; Laird, 1983; Nash et al., 2014; Senn, 2006).

AN US

We, therefore, repeated our analysis using both the change score and an ANCOVA approach. The results of these analyses are available in the online supplementary material (Figures S1-2,

M

Tables S3-4). They confirmed all the above-reported findings.

ED

Discussion

In the TADS sample, whereas cognitive-behavioral therapy was found to be no more effective

PT

than placebo across response levels, our analyses revealed that combining cognitive-

CE

behavioral therapy and fluoxetine was not only effective, but had a more comprehensive effectiveness than fluoxetine used alone. This was indicated by higher AUCs for the

AC

combined-treatment effects, reflecting simultaneously that these effects were greater in magnitude and spread out across a broader range of response levels than for either treatment used alone. The latter point is of special importance, as it means that the combined-treatment was able to outperform a pill placebo even among low and high responders, whereas fluoxetine alone had a more restricted breadth of effectiveness.

16

ACCEPTED MANUSCRIPT An important question arising from our analysis is: who are those patients with low and high response levels? Some of the high response levels might have been due to measurement errors; i.e., unusually high or low depression scores at baseline or unusually low scores at week 12. Note, however, that we included baseline depression as a covariate to adjust for such cases. In addition, some might be dropouts for which there were no actual end-of-trial data.

CR IP T

Some low and high response levels, on the other hand, are likely to be legitimate. For example, treatment-resistant and chronic depression might give rise to lower levels of

response (Emslie et al., 2010). Some of the low responses might also reflect adverse reactions to the medication, resulting in treatment stoppage or aggravated depression in the medication

AN US

groups. It is also likely that the depression level of some patients would have changed

regardless of their enrollment in the trial; for example, due to spontaneous remission, or secondary to positive or negative life events within the study period, which either facilitated

M

or impeded a given treatment‘s effectiveness, respectively (Wilkinson et al., 2009). Finally,

ED

effect modifiers might have impacted patients‘ levels of response (Curry et al., 2006).

It is unlikely that any of these mechanisms is solely responsible for creating low or high

PT

response levels. Rather, low- and high-level responses are likely mechanistically

CE

multifactorial. The crucial point about these mechanisms is that they are extraneous to the trial – or more generally the clinical care a patient gets – and, hence, not under the treating

AC

clinicians‘ full control. It seems reasonable to assume that there are many such extraneous mechanisms that shape a patient‘s illness course, either because they impede or boost healing. Such mechanisms might then act as limiting factors of what can be achieved by any treatment, thereby creating low and high responders.

If treatment effectiveness is constrained by such limiting factors, our finding that combining cognitive-behavioral therapy and fluoxetine was able to exert an additional positive effect 17

ACCEPTED MANUSCRIPT within low and high responding groups becomes even more clinically relevant, as this treatment apparently was more robust than either therapy used alone. In particular, the finding that the combined treatment had also an effect on low response levels implies that it showed some robustness against factors that hampered recovery. It seems that the treatment was capable of extending the boundary of ―being treatable‖, in contrast to the more constrained

CR IP T

effectiveness of fluoxetine used alone. The combined treatment can therefore be expected to work in a larger set of depressed adolescents and might allow for effectively treating some of the adolescents for whom the drug alone does not provide relief.

AN US

The combination of cognitive-behavioral therapy and fluoxetine was already found to be the most useful treatment in the TADS study, in several regards: a) it was the most effective acute-phase treatment (March et al., 2004); b) its effectiveness was robust against effect

M

modifiers (Curry et al., 2006); and c) it had the best benefit-to-harm ratio (March et al., 2007). Our results provide additional justification for using a combined cognitive-behavioral

ED

therapy-antidepressant medication approach, in that the combination of cognitive-behavioral therapy and fluoxetine was a robust treatment approach that outperformed placebo in a

PT

broader range of depressed adolescents than the corresponding monotherapies. For the

CE

treating clinician, these findings suggest that combining cognitive-behavioral therapy and an antidepressant is more likely to provide some relief for the patient than either therapy alone.

AC

The combined treatment thereby seems to be a powerful option for treating adolescents with moderate to severe depression.

On the other hand, the literature comparing combined depression treatments to corresponding monotherapies for adolescents is inconclusive (Brent et al., 2008; Cox et al., 2014; Dubicka et al., 2010; Goodyer et al., 2008; Hetrick et al., 2011; Ma et al., 2014; Melvin et al., 2006). Similarly, the existing evidence concerning the effectiveness of SSRIs in depressed 18

ACCEPTED MANUSCRIPT adolescents is inconclusive (Bridge et al., 2007; Cheung et al., 2008; Cipriani et al., 2016; Cox et al., 2012; Emslie et al., 2004; Emslie et al., 2008; Findling et al., 2013; Hetrick et al., 2012; Le Noury et al., 2015, 2016; Masi et al., 2010; Thapar et al., 2012; Usala et al., 2008; Wagner et al., 2003; Wagner et al., 2006; Wagner et al., 2004; Whittington et al., 2004), though fluoxetine – the antidepressant used in the TADS trial – is the SSRI that has received

CR IP T

the most consistent empirical support (Cipriani et al., 2016; Masi et al., 2010; Usala et al., 2008). The negative TADS results for cognitive-behavioral therapy were surprising, as cognitive-behavioral therapy has usually been found effective and was deemed a well-

established treatment for adolescent depression (Harrington et al., 1998; Hollon et al., 2005;

AN US

Klein et al., 2007; Masi et al., 2010; Reinecke et al., 1998; Weersing and Brent, 2006; Weersing et al., 2017; Weersing et al., 2009; Weisz et al., 2006).

M

The overall picture is, thus, inconclusive for cognitive-behavioral therapy alone, an antidepressant alone, and the combination in adolescents, meaning that further research is

ED

needed for clarification. We are also not aware of any other analysis comparable to ours. The degree to which our findings are replicable must therefore be left for future research. Note,

PT

however, that TADS was designed to be an effectiveness trial. Accordingly, a sample

CE

covering the full range of treatment-seeking adolescents in the USA was recruited. It was, therefore, suggested that the TADS results should be broadly applicable to youths seeking

AC

treatment for depression in the USA (Hollon et al., 2005; March et al., 2004; TADS Team, 2005). We see no reason why this suggestion should not also apply to our findings, especially considering that our ―average‖ results were in line with the conclusions drawn by the TADS team, based upon their own acute-phase analyses (March et al., 2004). As such, our results might be generalizable, at least to adolescents who are comparable to the TADS sample.

19

ACCEPTED MANUSCRIPT Limitations of the TADS trial have been noted before (Curry et al., 2006; March et al., 2004) and our analyses do not transcend these limitations. Some additional issues should also be noted. First, commonly-used depression rating scales have been criticized for not being psychometrically valid (Bech, 2010, 2015; Fried and Nesse, 2015; Fried et al., 2016) and this applies to the CDRS-R, as well (Isa et al., 2014). Measurement error might, therefore, have

CR IP T

clouded both the previous and our analyses. Second, our analysis focused on effectiveness and did not include any harm-benefit evaluation. Third, estimating fine-grained quantile intervals of 1 was limited by the sample sizes of the TADS treatment arms that ranged from 107 to 112 patients. With these sample sizes, there were only around 1 to 3 additional patients per

AN US

interval. If sample sizes are even smaller, larger intervals (e.g. every 5th or every 10th quantile) are more appropriate. Fourth, as with conventional statistical analyses, the quantile treatment effects apply only probabilistically to individual patients. Thus, although the combined

M

treatment was found to work even at the low end of the response level distribution and to have the most consistent effectiveness, it cannot be inferred that the highest response of a particular

ED

patient will necessarily occur in the combined treatment or that an initially treatment-resistant patient will necessarily show a response when later treated with the combined treatment.

PT

Finally, our analysis was an ad-hoc analysis using data from a trial that was not originally

CE

designed to accommodate such analysis. Our results must, therefore, be replicated and

AC

validated in future studies.

Conclusion

We found that, in the TADS sample, the combination of cognitive-behavioral therapy and the SSRI fluoxetine was more effective than either treatment used alone, not just in average effectiveness, but in the breadth of patients in whom it was effective.

20

ACCEPTED MANUSCRIPT Declarations of interest: None.

Funding source This research did not receive any specific grant from funding agencies in the public,

CR IP T

commercial, or not-for-profit sectors.

Acknowledgements

Data used in the preparation of this manuscript were obtained and analysed from the

AN US

controlled access datasets distributed from the National Institute of Mental Health (NIMH)supported National Database for Clinical Trials (NDCT, https://dataarchive.nimh.nih.gov/ndct/). NDCT is a collaborative informatics system created by the

M

NIMH to provide a national resource to support and accelerate discovery related to clinical trial research in mental health. Dataset identifier: Clinical Trials #2145. This manuscript

ED

reflects the views of the authors and may not reflect the opinions or views of the NIMH or of

PT

the submitters submitting original data to NDCT.

CE

Author statement

AC

Contributors

Conceptualization of study: SF and MMK. Data acquisition and preparation: SF. Statistical analysis: SF. Interpretation of data: SF and MMK. Drafting the manuscript: SF. Revising the manuscript for important intellectual content: MMK. All authors have approved the final manuscript.

Funding source 21

ACCEPTED MANUSCRIPT This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Acknowledgements Data used in the preparation of this manuscript were obtained and analysed from the

CR IP T

controlled access datasets distributed from the National Institute of Mental Health (NIMH)supported National Database for Clinical Trials (NDCT, https://data-

archive.nimh.nih.gov/ndct/). NDCT is a collaborative informatics system created by the

NIMH to provide a national resource to support and accelerate discovery related to clinical

AN US

trial research in mental health. Dataset identifier: Clinical Trials #2145. This manuscript

reflects the views of the authors and may not reflect the opinions or views of the NIMH or of

M

the submitters submitting original data to NDCT.

ED

Declarations of interest:

AC

CE

PT

None.

22

ACCEPTED MANUSCRIPT References Ali, M.K., Lam, R.W., 2011. Comparative efficacy of escitalopram in the treatment of major depressive disorder. Neuropsychiatric disease and treatment 7, 39-49. American Psychiatric Association, 1994. Diagnostic and Statistical Manual of Mental Disorders, 4th Edition (DSM-IV). American Psychiatric Association, Washington, DC.

CR IP T

Bech, P., 2010. Is the antidepressive effect of second-generation antidepressants a myth? Psychological medicine 40, 181-186.

Bech, P., 2015. The responsiveness of the different versions of the Hamilton Depression

Scale. World psychiatry : official journal of the World Psychiatric Association (WPA) 14,

AN US

309-310.

Beyerlein, A., 2014. Quantile regression-opportunities and challenges from a user's perspective. American journal of epidemiology 180, 330-331.

M

Brent, D., Emslie, G., Clarke, G., Wagner, K.D., Asarnow, J.R., Keller, M., Vitiello, B., Ritz, L., Iyengar, S., Abebe, K., Birmaher, B., Ryan, N., Kennard, B., Hughes, C., DeBar, L.,

ED

McCracken, J., Strober, M., Suddath, R., Spirito, A., Leonard, H., Melhem, N., Porta, G., Onorato, M., Zelazny, J., 2008. Switching to another SSRI or to venlafaxine with or without

PT

cognitive behavioral therapy for adolescents with SSRI-resistant depression: the TORDIA

CE

randomized controlled trial. Jama 299, 901-913. Bridge, J.A., Iyengar, S., Salary, C.B., Barbe, R.P., Birmaher, B., Pincus, H.A., Ren, L.,

AC

Brent, D.A., 2007. Clinical response and risk for reported suicidal ideation and suicide attempts in pediatric antidepressant treatment: a meta-analysis of randomized controlled trials. Jama 297, 1683-1696. Briollais, L., Durrieu, G., 2014. Application of quantile regression to recent genetic and -omic studies. Human genetics 133, 951-966. Canty, A., Ripley, B., 2016. boot: Bootstrap R (S-Plus) Functions. R package version 1.3-18.

23

ACCEPTED MANUSCRIPT Cheung, A., Kusumakar, V., Kutcher, S., Dubo, E., Garland, J., Weiss, M., Kiss, A., Levitt, A., 2008. Maintenance study for adolescent depression. Journal of child and adolescent psychopharmacology 18, 389-394. Cipriani, A., Zhou, X., Del Giovane, C., Hetrick, S.E., Qin, B., Whittington, C., Coghill, D., Zhang, Y., Hazell, P., Leucht, S., Cuijpers, P., Pu, J., Cohen, D., Ravindran, A.V., Liu, Y.,

CR IP T

Michael, K.D., Yang, L., Liu, L., Xie, P., 2016. Comparative efficacy and tolerability of antidepressants for major depressive disorder in children and adolescents: a network metaanalysis. Lancet (London, England) 388, 881-890.

Cleveland, W.S., 1994. The elements of graphing data, revised ed ed. Hobart Press, Summit,

AN US

New Jersey.

Costello, E.J., Maughan, B., 2015. Annual research review: Optimal outcomes of child and adolescent mental illness. Journal of child psychology and psychiatry, and allied disciplines

M

56, 324-341.

Cox, G.R., Callahan, P., Churchill, R., Hunot, V., Merry, S.N., Parker, A.G., Hetrick, S.E.,

ED

2014. Psychological therapies versus antidepressant medication, alone and in combination for depression in children and adolescents. The Cochrane database of systematic reviews 11,

PT

CD008324-CD008324.

CE

Cox, G.R., Fisher, C.A., De Silva, S., Phelan, M., Akinwale, O.P., Simmons, M.B., Hetrick, S.E., 2012. Interventions for preventing relapse and recurrence of a depressive disorder in

AC

children and adolescents. The Cochrane database of systematic reviews 11, CD007504. Curran-Everett, D., Williams, C.L., 2015. Explorations in statistics: the analysis of change. Advances in physiology education 39, 49-54. Curry, J., Rohde, P., Simons, A., Silva, S., Vitiello, B., Kratochvil, C., Reinecke, M., Feeny, N., Wells, K., Pathak, S., Weller, E., Rosenberg, D., Kennard, B., Robins, M., Ginsburg, G., March, J., 2006. Predictors and moderators of acute outcome in the Treatment for Adolescents

24

ACCEPTED MANUSCRIPT with Depression Study (TADS). Journal of the American Academy of Child and Adolescent Psychiatry 45, 1427-1439. Dubicka, B., Elvins, R., Roberts, C., Chick, G., Wilkinson, P., Goodyer, I.M., 2010. Combined treatment with cognitive-behavioural therapy in adolescent depression: metaanalysis. The British journal of psychiatry : the journal of mental science 197, 433-440.

CR IP T

Efron, B., Tibshirani, R., 1994. An introduction to the bootstrap. Chapman and Hall/CRC, Boca Raton.

Emslie, G.J., Heiligenstein, J.H., Hoog, S.L., Wagner, K.D., Findling, R.L., McCracken, J.T., Nilsson, M.E., Jacobson, J.G., 2004. Fluoxetine treatment for prevention of relapse of

AN US

depression in children and adolescents: a double-blind, placebo-controlled study. Journal of the American Academy of Child and Adolescent Psychiatry 43, 1397-1405. Emslie, G.J., Kennard, B.D., Mayes, T.L., Nightingale-Teresi, J., Carmody, T., Hughes, C.W.,

M

Rush, A.J., Tao, R., Rintelmann, J.W., 2008. Fluoxetine versus placebo in preventing relapse of major depression in children and adolescents. The American journal of psychiatry 165,

ED

459-467.

Emslie, G.J., Mayes, T., Porta, G., Vitiello, B., Clarke, G., Wagner, K.D., Asarnow, J.R.,

PT

Spirito, A., Birmaher, B., Ryan, N., Kennard, B., DeBar, L., McCracken, J., Strober, M.,

CE

Onorato, M., Zelazny, J., Keller, M., Iyengar, S., Brent, D., 2010. Treatment of Resistant Depression in Adolescents (TORDIA): Week 24 Outcomes. The American journal of

AC

psychiatry 167, 782-791. Fergusson, D.M., Woodward, L.J., 2002. Mental health, educational, and social role outcomes of adolescents with depression. Archives of general psychiatry 59, 225-231. Findling, R.L., Robb, A., Bose, A., 2013. Escitalopram in the treatment of adolescent depression: a randomized, double-blind, placebo-controlled extension trial. Journal of child and adolescent psychopharmacology 23, 468-480.

25

ACCEPTED MANUSCRIPT Fried, E.I., Nesse, R.M., 2015. Depression sum-scores don't add up: why analyzing specific depression symptoms is essential. BMC medicine 13, 72. Fried, E.I., van Borkulo, C.D., Epskamp, S., Schoevers, R.A., Tuerlinckx, F., Borsboom, D., 2016. Measuring Depression Over Time . . . or not? Lack of Unidimensionality and Longitudinal Measurement Invariance in Four Common Rating Scales of Depression.

CR IP T

Psychological assessment 28, 1354-1367. Goodyer, I.M., Dubicka, B., Wilkinson, P., Kelvin, R., Roberts, C., Byford, S., Breen, S., Ford, C., Barrett, B., Leech, A., Rothwell, J., 2008. Cognitive behaviour therapy in adolescents with major depression. Health Technology Assessment 12, 12-15.

AN US

Gore, F.M., Bloem, P.J., Patton, G.C., Ferguson, J., Joseph, V., Coffey, C., Sawyer, S.M., Mathers, C.D., 2011. Global burden of disease in young people aged 10-24 years: a systematic analysis. Lancet (London, England) 377, 2093-2102.

M

Gould, M.S., King, R., Greenwald, S., Fisher, P., Schwab-Stone, M., Kramer, R., Flisher, A.J., Goodman, S., Canino, G., Shaffer, D., 1998. Psychopathology associated with suicidal

ED

ideation and attempts among children and adolescents. Journal of the American Academy of Child and Adolescent Psychiatry 37, 915-923.

PT

Harrington, R., Whittaker, J., Shoebridge, P., Campbell, F., 1998. Systematic review of

CE

efficacy of cognitive behaviour therapies in childhood and adolescent depressive disorder. BMJ (Clinical research ed.) 316, 1559-1563.

AC

Hawton, K., Saunders, K.E., O'Connor, R.C., 2012. Self-harm and suicide in adolescents. Lancet (London, England) 379, 2373-2382. Hetrick, S.E., Cox, G.R., Merry, S.N., 2011. Treatment-resistant depression in adolescents: is the addition of cognitive behavioral therapy of benefit? Psychology research and behavior management 4, 97-112.

26

ACCEPTED MANUSCRIPT Hetrick, S.E., McKenzie, J.E., Cox, G.R., Simmons, M.B., Merry, S.N., 2012. Newer generation antidepressants for depressive disorders in children and adolescents. The Cochrane database of systematic reviews 11, CD004851. Hollon, S.D., Garber, J., Shelton, R.C., 2005. Treatment of depression in adolescents with cognitive behavior therapy and medications: A commentary on the TADS project. Cognitive

CR IP T

and Behavioral Practice 12, 149-155. Isa, A., Bernstein, I., Trivedi, M.H., Mayes, T.L., Kennard, B., Emslie, G., 2014. Childhood Depression Subscales Using Repeated Sessions on Children's Depression Rating Scale –

Revised (CDRS-R) Scores. Journal of child and adolescent psychopharmacology 24, 318-324.

AN US

Jones, P.B., 2013. Adult mental health disorders and their age at onset. The British Journal of Psychiatry 202, s5-s10.

Jurasinski, G., Koebsch, F., Guenther, A., Beetz, S., 2014. flux: Flux rate calculation from

M

dynamic closed chamber measurements, R package version 0.3-0.

Klein, J.B., Jacobs, R.H., Reinecke, M.A., 2007. Cognitive-behavioral therapy for adolescent

ED

depression: a meta-analytic investigation of changes in effect-size estimates. Journal of the American Academy of Child and Adolescent Psychiatry 46, 1403-1413.

PT

Koenker, R., 2015. quantreg: Quantile Regression. R package version 5.26.

CE

Koenker, R., Hallock, K.F., 2001. Quantile Regression. Journal of Economic Perspectives 15, 143-156.

AC

Laird, N., 1983. Further comparative analyses of pretest-posttest research designs. The American Statistician 37, 329-330. Le Noury, J., Nardo, J.M., Healy, D., Jureidini, J., Raven, M., Tufanaru, C., Abi-Jaoude, E., 2015. Restoring Study 329: efficacy and harms of paroxetine and imipramine in treatment of major depression in adolescence. BMJ (Clinical research ed.) 351, h4320. Le Noury, J., Nardo, J.M., Healy, D., Jureidini, J., Raven, M., Tufanaru, C., Abi-Jaoude, E., 2016. Study 329 continuation phase: Safety and efficacy of paroxetine and imipramine in 27

ACCEPTED MANUSCRIPT extended treatment of adolescent major depression. The International journal of risk & safety in medicine 28, 143-161. Leucht, S., Hierl, S., Kissling, W., Dold, M., Davis, J.M., 2012. Putting the efficacy of psychiatric and general medicine medication into perspective: review of meta-analyses. The British journal of psychiatry : the journal of mental science 200, 97-106.

CR IP T

Ma, D., Zhang, Z., Zhang, X., Li, L., 2014. Comparative efficacy, acceptability, and safety of medicinal, cognitive-behavioral therapy, and placebo treatments for acute major depressive disorder in children and adolescents: a multiple-treatments meta-analysis. Current medical research and opinion 30, 971-995.

AN US

March, J., Silva, S., Petrycki, S., Curry, J., Wells, K., Fairbank, J., Burns, B., Domino, M., McNulty, S., Vitiello, B., Severe, J., 2004. Fluoxetine, cognitive-behavioral therapy, and their combination for adolescents with depression: Treatment for Adolescents With Depression

M

Study (TADS) randomized controlled trial. Jama 292, 807-820.

March, J.S., Silva, S., Petrycki, S., Curry, J., Wells, K., Fairbank, J., Burns, B., Domino, M.,

ED

McNulty, S., Vitiello, B., Severe, J., 2007. The Treatment for Adolescents With Depression

64, 1132-1143.

PT

Study (TADS): long-term effectiveness and safety outcomes. Archives of general psychiatry

CE

Masi, G., Liboni, F., Brovedani, P., 2010. Pharmacotherapy of major depressive disorder in adolescents. Expert opinion on pharmacotherapy 11, 375-386.

AC

Mayes, T.L., Bernstein, I.H., Haley, C.L., Kennard, B.D., Emslie, G.J., 2010. Psychometric properties of the Children's Depression Rating Scale-Revised in adolescents. Journal of child and adolescent psychopharmacology 20, 513-516. Melvin, G.A., Tonge, B.J., King, N.J., Heyne, D., Gordon, M.S., Klimkeit, E., 2006. A comparison of cognitive-behavioral therapy, sertraline, and their combination for adolescent depression. Journal of the American Academy of Child and Adolescent Psychiatry 45, 11511161. 28

ACCEPTED MANUSCRIPT Merikangas, K.R., He, J.P., Burstein, M., Swanson, S.A., Avenevoli, S., Cui, L., Benjet, C., Georgiades, K., Swendsen, J., 2010. Lifetime prevalence of mental disorders in U.S. adolescents: results from the National Comorbidity Survey Replication--Adolescent Supplement (NCS-A). Journal of the American Academy of Child and Adolescent Psychiatry 49, 980-989.

CR IP T

Moller, H.J., Bitter, I., Bobes, J., Fountoulakis, K., Hoschl, C., Kasper, S., 2012. Position statement of the European Psychiatric Association (EPA) on the value of antidepressants in the treatment of unipolar depression. European psychiatry : the journal of the Association of European Psychiatrists 27, 114-128.

AN US

Nash, R., Bunce, C., Freemantle, N., Dore, C.J., Rogers, C.A., 2014. Ophthalmic Statistics Note 4: analysing data from randomised controlled trials with baseline and follow-up measurements. The British journal of ophthalmology 98, 1467-1469.

M

Nkansah-Amankra, S., Tettey, G., 2015. Association between depressive symptoms in adolescence and birth outcomes in early adulthood using a population-based sample.

ED

Preventive medicine reports 2, 371-378.

Petscher, Y., Logan, J.A., 2014. Quantile regression in the study of developmental sciences.

PT

Child development 85, 861-881.

CE

Poznanski, E., Mokros, H., 1995. Children‘s Depression Rating Scale-Revised (CDRS-R). WPS, Los Angeles, Calif.

AC

R Core Team, 2016. R: A Language and Environment for Statistical Computing, 3.3.2 ed. R Foundation for Statistical Computing, Vienna. Reinecke, M.A., Ryan, N.E., DuBois, D.L., 1998. Cognitive-behavioral therapy of depression and depressive symptoms during adolescence: a review and meta-analysis. Journal of the American Academy of Child and Adolescent Psychiatry 37, 26-34.

29

ACCEPTED MANUSCRIPT Rutter, M., Kim-Cohen, J., Maughan, B., 2006. Continuities and discontinuities in psychopathology between childhood and adult life. Journal of child psychology and psychiatry, and allied disciplines 47, 276-295. Senn, S., 2006. Change from baseline and analysis of covariance revisited. Statistics in medicine 25, 4334-4344.

CR IP T

Stekhoven, D.J., Bühlmann, P., 2012. MissForest—non-parametric missing value imputation for mixed-type data. Bioinformatics (Oxford, England) 28, 112-118.

TADS Team, 2003. Treatment for Adolescents With Depression Study (TADS): rationale, design, and methods. Journal of the American Academy of Child and Adolescent Psychiatry

AN US

42, 531-542.

TADS Team, 2005. The Treatment for Adolescents With Depression Study (TADS): demographic and clinical characteristics. Journal of the American Academy of Child and

M

Adolescent Psychiatry 44, 28-40.

Tao, R., Emslie, G., Mayes, T., Nakonezny, P., Kennard, B., Hughes, C., 2009. Early

ED

prediction of acute antidepressant treatment response and remission in pediatric major

48, 71-78.

PT

depressive disorder. Journal of the American Academy of Child and Adolescent Psychiatry

CE

Thapar, A., Collishaw, S., Pine, D.S., Thapar, A.K., 2012. Depression in adolescence. Lancet (London, England) 379, 1056-1067.

AC

Usala, T., Clavenna, A., Zuddas, A., Bonati, M., 2008. Randomised controlled trials of selective serotonin reuptake inhibitors in treating depression in children and adolescents: a systematic review and meta-analysis. European neuropsychopharmacology : the journal of the European College of Neuropsychopharmacology 18, 62-73. Vickers, A.J., 2001. The use of percentage change from baseline as an outcome in a controlled trial is statistically inefficient: a simulation study. BMC medical research methodology 1, 6. 30

ACCEPTED MANUSCRIPT Vickers, A.J., Altman, D.G., 2001. Statistics notes: Analysing controlled trials with baseline and follow up measurements. BMJ (Clinical research ed.) 323, 1123-1124. Wagner, K.D., Ambrosini, P., Rynn, M., Wohlberg, C., Yang, R., Greenbaum, M.S., Childress, A., Donnelly, C., Deas, D., 2003. Efficacy of sertraline in the treatment of children and adolescents with major depressive disorder: two randomized controlled trials. Jama 290,

CR IP T

1033-1041. Wagner, K.D., Jonas, J., Findling, R.L., Ventura, D., Saikali, K., 2006. A double-blind,

randomized, placebo-controlled trial of escitalopram in the treatment of pediatric depression. Journal of the American Academy of Child and Adolescent Psychiatry 45, 280-288.

AN US

Wagner, K.D., Robb, A.S., Findling, R.L., Jin, J., Gutierrez, M.M., Heydorn, W.E., 2004. A randomized, placebo-controlled trial of citalopram for the treatment of major depression in children and adolescents. The American journal of psychiatry 161, 1079-1083.

M

Weersing, V.R., Brent, D.A., 2006. Cognitive behavioral therapy for depression in youth. Child and adolescent psychiatric clinics of North America 15, 939-957, ix.

ED

Weersing, V.R., Jeffreys, M., Do, M.T., Schwartz, K.T., Bolano, C., 2017. Evidence base update of psychosocial treatments for child and adolescent depression. Journal of clinical

PT

child and adolescent psychology : the official journal for the Society of Clinical Child and

CE

Adolescent Psychology, American Psychological Association, Division 53 46, 11-43. Weersing, V.R., Rozenman, M., Gonzalez, A., 2009. Core components of therapy in youth:

AC

do we know what to disseminate? Behavior modification 33, 24-47. Weisz, J.R., Kuppens, S., Eckshtain, D., Ugueto, A.M., Hawley, K.M., Jensen-Doss, A., 2013. Performance of evidence-based youth psychotherapies compared with usual clinical care: a multilevel meta-analysis. JAMA psychiatry 70, 750-761. Weisz, J.R., McCarty, C.A., Valeri, S.M., 2006. Effects of psychotherapy for depression in children and adolescents: a meta-analysis. Psychological bulletin 132, 132-149.

31

ACCEPTED MANUSCRIPT Whittington, C.J., Kendall, T., Fonagy, P., Cottrell, D., Cotgrove, A., Boddington, E., 2004. Selective serotonin reuptake inhibitors in childhood depression: systematic review of published versus unpublished data. Lancet (London, England) 363, 1341-1345. Wilcox, R., 2012. Introduction to robust estimation and hypothesis testing, 3. ed. Academic Press, Amsterdam; Boston.

CR IP T

Wilkinson, P., Dubicka, B., Kelvin, R., Roberts, C., Goodyer, I., 2009. Treated depression in adolescents: predictors of outcome at 28 weeks. The British journal of psychiatry : the journal

AC

CE

PT

ED

M

AN US

of mental science 194, 334-341.

32

Quantile

PBO

CBT

FLX

COMB

10th

0.067

0.068

0.040

0.184

20 th

0.135

0.148

0.146

0.312

30 th

0.197

0.214

0.257

0.356

40 th

0.239

0.264

0.378

0.412

50 th

0.298

0.312

0.403

0.439

60 th

0.369

0.364

0.455

0.492

70 th

0.418

0.423

0.515

0.549

80 th

0.498

0.480

0.537

0.612

90 th

0.610

0.560

0.621

0.667

AN US

Table 1. Quantiles of response levels in placebo and treatment conditions.

CR IP T

ACCEPTED MANUSCRIPT

PBO = Placebo; CBT = Cognitive-behavioral therapy; FLX = Fluoxetine; COMB = Combination of CBT and FLX

M

Response levels were defined as the proportional change in depression from baseline to end of acute phase. For example, a response level of 0.067

AC

CE

PT

ED

means that the baseline depression score was reduced by 6.7%.

33

CR IP T

ACCEPTED MANUSCRIPT

Table 2. Areas under the curves of quantile treatment effects and comparisons of the areas under the curves between treatments. condition Model 1

Model 2

Treatments AUC

t-value

df

p-value

Cohen’s d

95% CI

-114.71

2999

< 0.00001

2.09

2.03 - 2.16

0.013 - 0.121 CBT vs. COMB

-278.81

2999

< 0.00001

5.09

4.99 - 5.19

0.080 - 0.177 FLX vs. COMB

-140.16

2999

< 0.00001

2.56

2.49 - 2.63

-136.48

2999

< 0.00001

2.49

2.42 - 2.56

0.023 - 0.129 CBT vs. COMB

-220.76

2999

< 0.00001

4.03

3.94 - 4.12

0.059 - 0.163 FLX vs. COMB

AN US

Treatment

-76.10

2999

< 0.00001

1.39

1.33 - 1.45

95% CI

CBT

0.006

FLX

0.065

COMB

0.131

CBT

0.009

FLX

0.076

COMB

0.113

compared

-0.045 - 0.057 CBT vs. FLX

-0.043 - 0.064 CBT vs. FLX

AUC = Area under the curve; CBT = Cognitive-behavioral therapy; FLX = Fluoxetine; COMB = Combination of CBT and FLX; CI = Confidence interval.

Treatment effects were adjusted for baseline depression severity in model 1 and additionally for study site and patients‘ treatment expectancies in

AC

CE

PT

ED

M

model 2.

34

CR IP T

ACCEPTED MANUSCRIPT

Table 3. Areas under the curves of quantile treatment effects and comparisons of the areas under the curves between treatments: Results of additional analyses. Model

Type of analysis

AUC

95% CI

Treatments compared

t-value

df

p-value

Cohen’s d

95% CI

CBT

0.006

-0.045 - 0.057

CBT vs. FLX

-114.71

2999

< 0.00001

2.09

2.03 - 2.16

FLX

0.065

0.013 - 0.121

CBT vs. COMB truncated

-265.31

2999

< 0.00001

4.84

4.74 - 4.94

COMB truncated

0.111

0.061 - 0.153

FLX vs. COMB truncated

-169.86

2999

< 0.00001

3.10

3.03 - 3.18

CBT lower-CI

-0.073

-0.125 - -0.021

CBT lower-CI vs. FLX lower-CI

-98.16

2999

< 0.00001

1.79

1.73 - 1.85

FLX lower-CI

-0.017

-0.074 - 0.041

CBT lower-CI vs. COMB lower-CI

-270.06

2999

< 0.00001

4.93

4.83 - 5.03

COMB lower-CI

0.054

0.001 - 0.102

FLX lower-CI vs. COMB lower-CI

-141.19

2999

< 0.00001

2.58

2.51 - 2.65

CBT

0.009

-0.043 - 0.064

CBT vs. FLX

-136.48

2999

< 0.00001

2.49

2.42 - 2.56

FLX

0.076

0.023 - 0.129

CBT vs. COMB truncated

-208.36

2999

< 0.00001

3.80

3.72 - 3.89

COMB truncated

0.1

0.047 - 0.146

FLX vs. COMB truncated

-75.62

2999

< 0.00001

1.38

1.32 - 1.44

CBT lower-CI

-0.07

-0.124 - -0.014

CBT lower-CI vs. FLX lower-CI

-124.78

2999

< 0.00001

2.28

2.21 - 2.34

FLX lower-CI

-0.004

-0.061 - 0.052

CBT lower-CI vs. COMB lower-CI

-214.58

2999

< 0.00001

3.92

3.83 – 4.00

COMB lower-CI

0.037

-0.016 - 0.089

FLX lower-CI vs. COMB lower-CI

-77.11

2999

< 0.00001

1.41

1.35 - 1.46

Treatment

truncated a

CI b

Model 2

truncated a

CI b

M

Model 1

AN US

condition

ED

AUC = Area under the curve; CBT = Cognitive-behavioral therapy; FLX = Fluoxetine; COMB = Combination of CBT and FLX; CI = Confidence interval.

Treatment effects were adjusted for baseline depression severity in model 1 and additionally for study site and patients‘ treatment expectancies in

PT

model 2. a

In this analysis, the AUC for COMB was calculated after truncating the COMB-coefficients at the average of the three highest FLX-coefficients.

b

In this analysis, for each treatment, the AUC of the lower bounds of the 95% confidence intervals rather than that of the coefficients themselves

AC

CE

was calculated.

35

CR IP T

ACCEPTED MANUSCRIPT

Figure legends

Figure 1. Quantile-quantile-plots of the response level distributions of all treatments against placebo

CBT = Cognitive-behavioral therapy; FLX = Fluoxetine; COMB = Combination of CBT and FLX.

AN US

A quantile-quantile-plot plots the corresponding quantiles of two distributions against each other, e.g. the 50th quantile against the 50th quantile (i.e. the median of the first distribution against the median of the second distribution). If the two distributions do not differ, then the values at each quantile are the same (e.g. the median response level is the same in PBO and CBT) and hence, lie on the diagonal of the plot. Otherwise, the values lie below or above the diagonal. For example, COMB‘s quantiles are consistently higher than the PBO quantiles throughout most of the range of the response levels, meaning that COMB produced consistently higher response levels than PBO.

Response levels were defined as the proportional change in depression from baseline to end of acute phase. For example, a response level of 0.5

AC

CE

PT

ED

M

means that the baseline depression score was reduced by 50%.

36

M

AN US

CR IP T

ACCEPTED MANUSCRIPT

ED

Figure 2. Treatment effects with 95% confidence intervals as a function of quantiles estimated by model 1 (upper part) and model 2 (lower part)

CBT = Cognitive-behavioral therapy; FLX = Fluoxetine; COMB = Combination of CBT and FLX; CI = Confidence Interval.

PT

The treatment effect is the difference between the response levels of the treatment and the placebo group, adjusted for baseline depression severity in model 1 (upper part of figure) and additionally for study site and patients‘ treatment expectancies in model 2 (lower part of figure). For example, the treatment effect at the 50th quantile is the (adjusted) difference between the median response level in the placebo group and the median response

AC

CE

level in the treatment group.

37

AC

CE

PT

ED

M

AN US

CR IP T

ACCEPTED MANUSCRIPT

38