Journal Pre-proof Variations in stepped-wedge cluster randomized trial design: Insights from the ‘patient-centered care transitions in heart failure’ trial
Rudy R. Unni, Shun Fu Lee, Lehana Thabane, Stuart Connolly, Harriette GC Van Spall PII:
S0002-8703(19)30222-4
DOI:
https://doi.org/10.1016/j.ahj.2019.08.017
Reference:
YMHJ 5969
To appear in:
American Heart Journal
Received date:
22 January 2019
Accepted date:
26 August 2019
Please cite this article as: R.R. Unni, S.F. Lee, L. Thabane, et al., Variations in steppedwedge cluster randomized trial design: Insights from the ‘patient-centered care transitions in heart failure’ trial, American Heart Journal(2019), https://doi.org/10.1016/ j.ahj.2019.08.017
This is a PDF file of an article that has undergone enhancements after acceptance, such as the addition of a cover page and metadata, and formatting for readability, but it is not yet the definitive version of record. This version will undergo additional copyediting, typesetting and review before it is published in its final form, but we are providing this version to give early visibility of the article. Please note that, during the production process, errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
© 2019 Published by Elsevier.
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall Title: Variations in stepped-wedge cluster randomized trial design: Insights from the ‘Patient-Centered Care Transitions in Heart Failure’ trial Rudy R. Unni1, Shun Fu Lee2, Lehana Thabane3, Stuart Connolly2, Harriette GC Van Spall2,3,4*.
of
1. Department of Medicine, University of Ottawa, Ottawa, Ontario, Canada 2. Population Health Research Institute, McMaster University, Hamilton, Ontario,
ro
Canada
-p
3. Department of Health Research Methods, Evidence, and Impact, McMaster
re
University, Hamilton, Ontario, Canada
lP
4. Department of Medicine, McMaster University, Hamilton, Ontario Canada Corresponding Author: Harriette GC Van Spall, M.D, MPH, FRCPC. Population Health Research Institute, 20 Copeland Avenue David Braley Research Institute Suite
C3-117,
Hamilton
na
Building,
ON
L8L
0A3.
Electronic
address:
ur
[email protected]. Phone: (905) 527-4322 Ext. 40309
Jo
Short Title: Variations in stepped-wedge trial design Funding: This study is funded by Canadian Institutes of Health Research and Ontario’s Ministry of Health and Long Term Care Health System Research Fund. Dr. Van Spall receives research salary support from Ontario’s Ministry of Health and Hamilton Health Sciences Career Award. Disclosures: None The senior author is responsible for the design and conduct of the PACT-HF study. The authors are responsible for the present analyses, drafting and editing of the paper, and its final contents.
1
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall Abstract The stepped-wedge (SW) cluster randomized controlled trial (RCT), in which clusters cross over in a randomized sequence from control to intervention is ideal for the implementation and testing of complex health service interventions. In certain cases, however, implementation of the intervention may pose logistical challenges, and variations in SW design may be required.
of
We examine the logistical and statistical implications of variations in SW design, using
ro
the optimization of the Patient-Centered Care Transitions in Heart Failure (PACT-HF)
-p
trial for illustration. We review the following complete SW design variations: a typical
re
SW design; a SW design with multiple clusters crossing over per period to achieve
lP
balanced cluster sizes at each step; hierarchical randomization to account for higherlevel clustering effects; nested sub-studies to measure outcomes requiring a smaller
na
sample size than the primary outcomes; and hybrid SW design, which combines parallel
ur
cluster with SW design to improve efficiency. We also reviewed three incomplete SW design variations in which data is collected in some but not all steps to ease
Jo
measurement burden. These include designs with a learning period that improve fidelity to the intervention, designs with reduced measurements to minimize collection burden, and designs with early and late blocks to accommodate cluster readiness. Variations in SW design offer pragmatic solutions to logistical challenges but have implications to statistical power. Advantages and disadvantages of each variation should be considered before finalizing the design of a SW RCT.
2
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall Introduction The stepped-wedge (SW) cluster randomized controlled trial (RCT) is one in which clusters initially assigned to control cross over unidirectionally to the intervention in a randomized sequence (Figure 1) [1]. This differs from a parallel cluster design in which clusters are randomized to either control or intervention without cross over [2], and from a cluster randomized cross-over design in which clusters initially randomized to
of
intervention or control cross over one or more times to the opposite arm [3]. The SW
ro
design has increased in popularity, particularly in studies testing complex health-care
-p
service interventions [1,4,5]. Relative to a parallel cluster design, advantages of a SW
re
design include delivery of the intervention to all clusters, a staged approach to and associated lead time for implementation, and need for fewer clusters to achieve
lP
statistical power [2]. Relative to a parallel cluster design, limitations of a SWD include
na
methodological complexity, longer trial duration, and susceptibility to temporal effects.
ur
The “Patient-Centered Care Transitions in Heart Failure” (PACT-HF) trial was a multicenter SW trial that evaluated the effectiveness of a transitional care model
Jo
delivered in hospital, at home, and in ambulatory clinics after hospitalization for heart failure (HF) [6]. The primary outcomes were composites of clinical endpoints measured at 3 months and 30 days. Secondary outcomes included patient-reported measures of discharge preparedness, quality of care, and quality of life. Given the complexity of the intervention and the integration required across institutions that typically function independently of each other, we considered variations to the SW design. This paper describes the features and rationale for SW design, variations in SW methodology and implications on statistical power, using the PACT-HF trial for illustration.
3
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall Rationale for a SW trial All clusters receive intervention The SW design is advantageous over parallel cluster designs when it is desirable for all participating clusters to receive the intervention; there may be preliminary evidence that the intervention offers clinical benefit with minimal risk, and there may be a desire to implement the intervention and test its effect on a range of outcomes in real world
of
settings [2,7]. Other types of cluster randomized cross-over designs can also facilitate
ro
delivery of the intervention in all clusters, but not all clusters receive the intervention at
-p
the end of the trial; because as all clusters in a SW design are receiving the intervention
re
by trial’s end, there is no incremental work to implement the intervention following the
lP
trial [3].
Decision-makers in the hospitals enrolled in PACT-HF preferred a SW over a parallel
na
cluster design so that all hospitals received the intervention. Provincial policy initiatives
ur
at the time provided incentives for hospitals to implement interventions aimed at
Jo
reducing readmissions in HF. The transitional care services comprising the PACT-HF intervention were shown in prior explanatory RCTs to decrease readmissions among individuals hospitalized for HF [8,9]. The goal of the PACT-HF trial was to assess whether combining these services and implementing them in hospitals improved a range of clinical, cost, and patient-reported outcomes. Because study outcomes were not available for several months after the trial’s end, there would have been insufficient justification to implement the intervention among control sites had a parallel cluster design been used. Sequential implementation allows lead-time 4
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall The sequential implementation of a SW design can facilitate the lead-time and focused attention required for preparation of each cluster to initiate the intervention. This is of benefit in complex health service interventions [2,10]. The PACT-HF model involved the delivery of transitional care services across 10 hospital clusters and their regional home care agencies and ambulatory clinics. The intervention required training of personnel and integration of services across institutions that typically function independently of
of
each other in a publicly funded health care system. The complexity of the work was
ro
simplified by initiating the intervention in a single cluster at each step; the lead-time at
-p
each cluster provided by the SW design was required to reliably redesign the
re
ambulatory services, train personnel, and deliver services. While a modified parallel cluster design would have also allowed sequential implementation, this would have only
lP
pertained to the intervention group and all hospitals would not have received the
na
intervention[11].
ur
Fewer clusters are required
Jo
A SW trial typically requires fewer clusters than a parallel cluster RCT to achieve acceptable statistical power, making it ideal in situations where a limited number of clusters are available for trial participation [12,13]. Each cluster in a SW trial is exposed to both control and intervention. Thus, each cluster acts as its own control and reduces the impact of clustering effects (the correlation between any two individuals of the same cluster). This results in a gain in statistical power that is most notable when the clustering effects are large [13,14]. This gain can be attenuated or absent if clustering effects are small. [15]. This is in contrast to a parallel cluster design, where solely intercluster treatment effects can be examined, and the maximum power attained with few clusters cannot be overcome with increased cluster sizes[12]. The complexity of 5
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall services in the PACT-HF trial made it undesirable to recruit a large number of clusters, and the desired number of clusters would have provided insufficient statistical power for a parallel cluster RCT to answer the primary research question. General Principles of SW Design Design pattern and step length
of
The steps in a SW design mark the periods when clusters sequentially cross over
ro
unidirectionally from control to intervention, resulting in two ‘stepped wedges’ of
-p
treatment allocation (Figure 1). Outcomes are measured at baseline before intervention
re
and at each crossover step, including after the final cluster crosses over to receive the
lP
intervention.
Step length is an important consideration. For a complex intervention requiring a period
na
of time after implementation for fidelity to be achieved, measurements recorded
ur
immediately after crossover may not accurately reflect the intervention’s effect. Increasing the length of each step to allow for a learning or adjustment period could
Jo
more reliably reflect the intervention’s effectiveness [14,16]. However, the resultant increase in trial duration may increase study costs, increase susceptibility to temporal effects (see “Limitations of SW Design” below), delay intervention delivery to participating clusters, and potentially deter participation. The PACT-HF clinical trial had a total duration of 1 year, with 11 steps that each lasting 1 month in duration. This was based on sample size estimations, the intervention lead time required per site, and feasibility of collecting outcomes (provincial databases report outcomes on a monthly basis).
6
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall
Cross-sectional vs. cohort SW design The PACT-HF trial was a cross-sectional SW design [14], in which unique trial participants – patients discharged after hospitalization for HF – were recruited at each time step. Treatment allocation to control or intervention depended on when the patients
of
were hospitalized relative to their hospital’s crossover date [17]. Measurements were
ro
recorded from different participants at each step. Patients were recruited during their
-p
index hospitalization and censored if rehospitalized, so only included in the analysis
re
once.
A ‘cohort’ SW trial describes a protocol where individuals are identified at the beginning
lP
of the trial and are measured repeatedly as they are exposed to both control and
na
intervention [2,17,18]. Cohort SW trials can add measurement burden on participants if the measures are collected directly from them (versus from databases that track their
ur
outcomes), increasing the risk of participants leaving the trial prior to completion. The
Jo
induced correlation between measurements drawn from the same participant in a cohort SW design must be accounted for in analysis [18]. Additionally, a cohort SW trial may not be ideal when achieving an outcome that may change a patient’s response to treatment during the remainder of the trial [17]; in such cases, the patient may need to be censored, and this needs to be accounted for in sample size calculations to avoid ‘healthy survivor bias’. Sample size calculations for cross-sectional SW
designs are often applied
inappropriately to cohort designs, and further efforts are required to characterize this design, determine its applications, and validate statistical analytic approaches [4,17]. 7
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall
Sample Size Calculation and Power Randomizing clusters instead of individuals in cluster RCTs results in a loss of statistical power because responses within a cluster are more similar than responses of different
𝜎𝑏2 2 2) (𝜎𝑏 +𝜎𝑤
(1)
ro
𝜌=
of
clusters; this can be quantified by the intra-cluster correlation (ICC) denoted as 𝜌:
-p
where 𝜎𝑏2 is the between-cluster variance, and 𝜎𝑤2 is the within-cluster variance [19].
re
This measure describes the proportion of total variance attributable to clustering. The
lP
ICC, number of clusters, and trial duration are used in sample size calculations for cluster RCTs [20].
na
Hussey and Hughes [1] were first to describe power formulae for the SW design in
ur
2007. From this work, Woertman et al. [21] derived a now widely-used approach to
Jo
calculate a single inflation factor or “design effect” denoted as 𝐷𝐸𝑠𝑤 . This is multiplied by the sample size required for an equivalent individually randomized trial to produce the sample size for a SW [17–20], given by: 𝐷𝐸𝑠𝑤 = (𝑡 + 1)
1+𝜌(𝑡𝑚+𝑚−1) 3(1−𝜌) 1+𝜌(
𝑡𝑚 +𝑚−1) 2
∙
1 𝑡
2(𝑡− )
(2)
where 𝑡 is the number of steps, 𝜌 is the ICC, and 𝑚 is the number of subjects within a period in a cluster. The statistical power of a parallel cluster trial drops as the ICC increases, requiring additional clusters to retain power; increasing cluster size is of limited utility [18]. In 8
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall contrast, the statistical power of a SW trial is relatively insensitive to the value of ICC because clusters act as their own control [10,18]. In the absence of temporal trends and when the ICC and cluster sizes are large, trials with the SW design can be more statistically efficient than those with parallel cluster designs [15,21,22]. For the first primary outcome of PACT-HF, the design effect approach was used to estimate that a sample size of 320 patients per hospital across 10 hospitals and 11
of
steps would provide ≥ 80% statistical power to detect a 25% difference in the first
ro
primary outcome. This assumed a typical SW design and an annual event rate of 28%
-p
in the control group. With the same assumptions, a parallel cluster design would have
re
provided a statistical power of only 43% to detect the same difference in primary
na
Variations in SW trial design
lP
outcome.
ur
Complete versus incomplete SW trial design. In a ‘complete’ SW design, data is
Jo
collected from each cluster at every step, requiring more resources but optimizing statistical power [10]. In an ‘incomplete’ SW design, data from each cluster is collected during some but not all steps, reducing the measurement burden, data collection and also the statistical power – unless adjustments are made to the duration of each step, number of clusters, or cluster size. In the following sections, we evaluate potential variations of complete and incomplete SW designs and assess their efficiency by estimating their statistical power using the PACT-HF trial as an example (Table 1). As outlined in the supporting information (S1 Appendix), estimated power was calculated
9
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall using a statistical modelling approach based on Hussey and Hughes’ power formulae (alpha < 0.05, 2-tailed for all estimations) [1].
Complete typical SW design In a complete ‘typical’ SW trial (Figure 1), the number of steps is equal to one greater
of
than the total number of clusters, and data is collected at each step. The sample size
ro
and power estimations assume balanced cluster sizes, fixed step durations, and
-p
intervention fidelity without a required learning period. Given the number and size of
re
clusters in PACT-HF, a complete typical SW design was estimated to yield a statistical power of 82% to demonstrate a 25% change in the primary composite outcome (alpha <
na
lP
0.05, 2-tailed).
ur
Complete SW design with grouping multiple clusters per crossover
Jo
Randomization can be stratified by cluster size in order to facilitate balanced cluster sizes across steps between control and intervention. This may ease the burden of implantation with a small number of steps particularly when the number of clusters is large for a small trade off in power. Figure 2 illustrates this design variation, with a modified PACT-HF trial and two clusters crossing over to intervention per step. The number of steps is reduced to six and statistical power for the primary composite outcome is modestly reduced from 82% to 80%.
10
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall Complete SW design with multi-level clustering Multi-level clustering may be considered when clusters are not independent because higher-level factors influence outcomes within them [10]. This design is depicted in Figure 3, in which the intervention is implemented sequentially across 4 geographic regions, each containing multiple hospital sites that share transitional care service
of
providers and resources. Randomization in this model would first occur at the level of the region, and then at the level of the hospital. This would allow for the practical
ro
deployment of regional resources to clusters, while minimizing the contamination that
-p
may occur due to a prolonged delay in crossover between some clusters and others.
re
Analysis is complex however, and limited progress has been made in fully
na
lP
characterizing this variation [10].
ur
Complete SW design with unequal cluster size per period Statistical power calculations for SW trials are often estimated under the assumption of
Jo
equal cluster size which can sometimes be impossible to achieve in practice [23]. While significant imbalance in cluster sizes may lead to loss of statistical power, allowing for unequal cluster sizes in the estimation of statistical power could encourage recruitment of diverse clusters and improve generalizability of results. For example, smaller community hospital sites as well as larger academic ones could both be used to examine an intervention’s effect. The coefficient of variation of cluster size (CV) is a measure for the degree of variation in cluster sizes. It is defined as the ratio of the standard deviation of cluster size to the
11
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall mean cluster size. It can be incorporated into the design effect (see equation (2) in “Sample Size Calculation and Power” above) to account for unequally sized clusters, by multiplying the cluster size per period per cluster 𝑚, by (1+CV2) [24]. In the PACT-HF trial, it was estimated that 320 patients per cluster during the trial would provide a statistical power of 82% to detect a 25% change in the primary outcome. In the setting of imbalanced cluster size with a CV of 0.42, an average cluster size of 377 patients per
of
cluster would be required to retain a power of 82%. Without an increase in average
ro
cluster size, the statistical power to detect a change in the primary outcome would
re
-p
decrease to 77%.
lP
Complete SW design with nested sub-study
na
A nested sub-study may be considered when a study includes secondary aims or substudies that require a smaller sample size than the primary aim. This approach can
ur
reduce measurement burden. The secondary outcomes of the PACT-HF clinical
Jo
included patient reported measures that required only 8 clusters to yield 90% power to detect a 5% difference in outcomes after accounting for dropout, loss of follow-up, and death rates before data collection. Investigating these secondary outcomes in all 10 hospitals would result in overpowering. A nested sub-study was thus designed for collection of these outcomes within the existing SW trial (Figure 4).
Complete SW combined with parallel cluster design (hybrid SW design)
12
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall A ‘hybrid’ SW trial combines a parallel cluster and complete SW design, such that a balanced proportion of clusters are randomized to either intervention or control without crossing over; and the remainder of the clusters cross over unidirectionally in a randomized sequence as in typical SW design. The inclusion of clusters that do not cross over helps reduce the potential bias of temporal changes across the trial. Consequently, a hybrid SW trial is more statistically efficient than a complete SW trial
of
[25]. One obvious drawback to a hybrid design is that some clusters will not receive
ro
intervention during the trial. Just as with parallel cluster designs, this may deter cluster
-p
enrollment if all clusters wish to receive a potentially beneficial intervention.
re
Varying combinations of parallel cluster and SW design can be examined. Using the
lP
PACT-HF trial’s sample size of 10 hospitals, possibilities included: (1) a 20:80 hybrid combination, with a parallel design involving 2 sites and a SW design involving 8 sites,
na
as seen in Figure 5; (2) 40:60 hybrid combination; and (3) a 60:40 hybrid combination.
ur
The lower the proportion of SW design clusters in a hybrid design, the lower the statistical power. The estimated power for each of these hybrid SW design
Jo
combinations is 85%, 85% and 80%, respectively.
Incomplete SW design with a learning period In an incomplete SW trial with a ‘learning period’, data is not collected during the initial step(s) after crossover to intervention at each site [10]. This is to allow for the time and site-specific optimization that may be required before the intervention is delivered with fidelity, and before outcomes reflect the effectiveness of the intervention. An incomplete design can allow participating clusters to focus resources on uptake of the intervention 13
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall without simultaneously collecting data. Figure 6 shows an incomplete SW trial with a ‘1step’ learning period. Outcomes are collected at baseline in each cluster, but not during the first crossover step – where there is learning, adjustment to, and efforts to improve fidelity to the intervention. Measurement of outcomes restarts at the second time period after crossover, as the next cluster begins its crossover step. Similar to a complete SW design, all clusters receive intervention for the final time step. The total number of time
of
steps in the trial is greater by 1 than an equivalent complete SW design, potentially
ro
increasing the duration of the trial. Under the same assumptions for the PACT-HF trial
-p
outlined above, the estimated statistical power for primary outcomes for the incomplete
lP
re
design with learning period would drop from 82% to 74%.
na
Incomplete SW design with limited measurements prior to and after crossover Another iteration of incomplete SW trial includes limited measurements per cluster, with
ur
the intention of reducing the measurement burden and costs at participating clusters
Jo
[10]. Figure 7 shows a modified PACT-HF study design pattern in which data is measured during one time step per cluster prior to crossover and two time steps after. This protocol reduces the total and simultaneous data collection required. However, the estimated loss in power with a fixed number of clusters is significant with such omission of data. If the design outlined in Figure 7 was utilized for the PACT-HF trial, holding sample size and cluster number constant, the estimated power would have dropped from 82% in a complete design to 52%. This reduction in power could be attenuated by increasing the number of before and after measurements, the sample size, and the number of clusters. 14
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall
Incomplete SW design with early and late blocks For a typical SW design, clusters must be ready to deliver the intervention and adhere to measurement and reporting protocols prior to randomization. Cluster specific-factors such as scheduling conflicts and lack of resources may limit the willingness of some
of
clusters to cross over from usual care to intervention early in the study. We devised a
ro
possible option to address this limitation by stratifying the participating sites into two
-p
equal blocks - representing ‘early’ and ‘late’ implementation blocks - and randomizing the sequence of crossover within each block. Figure 8 is an illustration of this
re
incomplete design using the PACT-HF trial. Half of the participating clusters can be
lP
randomized within the early block and the other half randomized within the late block – akin to two separate SW trials run in sequence. Analysis should include adjustment of
na
outcomes for the early or late block to address bias, as allocation to each block is not
ur
random and the late block may include sites with different characteristics than the early
Jo
block. Using the PACT-HF trial as an example with the above assumptions, this design would reduce the statistical power from 82% (typical complete SW design) to 75%, assuming the sample size is constant per cluster.
Limitations of SW trials Prolonged exposure to intervention A SW trial can prolong the exposure of patients to potential risks of intervention because of its longer duration and limited precision compared to randomization at the 15
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall individual level [26,27]. In the setting of adverse effects, de-implementation in a SW design may be more complex than in a randomized RCT or parallel cluster trial [22]. Additionally, due to the greater number of measurements in control than in treatment arms early in a SW design, interim analyses are unreliable unlike a parallel or individually randomized RCT [22].
For these reasons, a SW design is not
recommended when there is clinical equipoise. Measured outcomes take longer to
of
establish due to the longer trial duration and this needs consideration [7,13,28,29].
ro
Parallel cluster designs may also provide information on an intervention’s efficacy faster
-p
than an equivalent SW design, which could accelerate systems-wide implementation or
re
trial discontinuation.
lP
Complexity and measurement burden
The complexity of a SW protocol requires preparation and consensus between
na
investigators and all stakeholders. Stakeholders from each cluster should ideally be
ur
involved in the trial design to ensure that clusters are able to meet training,
Jo
implementation, measuring, and reporting protocols. Planning must be undertaken to ensure adherence to crossover schedules, fidelity to the protocol, and prevention of contamination[30]. In the PACT-HF trial, the sequence and date of crossover from control to intervention were concealed from participating clusters until three months prior to their intervention phase. Training sessions and resources for the intervention were provided to clusters only in the month immediately preceding crossover to prevent inadvertent implementation of the intervention prior to the intervention phase. Investigators must consider the local socio-economic, political, cultural, and health-care infrastructure contexts into which a SW trial is to be implemented. The measurement 16
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall burden on clusters, researchers and individuals from the repeated measurements in a SW should also be considered during the planning stages. However, the amount of measurement burden depends on the type of SWD and the outcomes chosen. In PACTHF, there was no measurement burden for the primary clinical outcomes as these were obtained from administrative datasets. Each patient had the same number of measures
of
for the primary outcomes.
ro
Temporal effects must be considered
-p
Temporal trends must be considered in the planning of a SW design [31]. Because clusters in a SW design begin the trial as control and sequentially crossover to
re
intervention, earlier measurements are made up primarily of control cluster data, while
lP
those made near the end of the trial are mostly sourced from intervention cluster. External factors influencing cluster behavior or measurements (such as new health
na
system policies or regulations affecting some or all clusters) can thus add bias if their
ur
impact arises during a SW trial. Temporal or seasonal trends in the data should be
Jo
accounted for in the analysis as a fixed-effects variable [1,32]. Delayed effects of intervention, and heterogeneity of intervention effect across the steps also contribute to complexity of statistical modelling [33]. PACT-HF included time as a fixed effect in its analysis. Additionally, unlike in parallel cluster designs, extending data collection after completion of the initially planned trial protocol to meet recruitment targets, will only result in further intervention data. This should be considered during the planning stages of a SW trial. Understanding this limitation, we recruited an extra cluster in the event that any of our assumptions to estimate statistical power were not met.
17
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall Rationale of selected design for PACT-HF trial Given the challenges posed by organizing and executing a multicentre study of a complex health service intervention alongside multiple community and hospital stakeholders, we considered each SW design variation and its implications. For the final design we chose the complete typical SW design involving 10 clusters to adequately
of
power the study for the primary outcomes and allow all sites to receive the intervention. Randomizing one cluster to cross over per time step allowed for focused direction of
ro
resources at each site prior to the intervention phase and limited the risk of pooling
-p
together dissimilar sites. We included a nested sub-study involving a subset of 8
re
hospital sites for the secondary outcomes to conserve resources and limit measurement
na
lP
burden.
ur
Conclusions and future directions
The SW trial is a type of cluster RCT that can be used to evaluate cluster-level
Jo
interventions with a sequential implementation approach. It is ideal for the assessment of health services interventions when there is some evidence that benefits of the intervention outweigh risks, when all clusters wish to receive the intervention, and when there is limited cluster availability. Variations in SW design methodology can be considered for logistic or statistical reasons, but further research is warranted to facilitate accurate estimation of statistical power [34,35].
18
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall Table 1: Summary of SW design methodological implications and estimated statistical power for the primary outcome.
METHODOLOGICAL IMPLICATIONS
Multi-level clustering
re
Unequal cluster sizes (CV = 0.60)
na
40:60 Hybrid SW 60:40 Hybrid SW Incomplete SW
lP
20:80 Hybrid SW
Jo
ur
Learning period (1 time step)
Reduced measurements (1 time step prior crossover, 2 time steps after crossover)
82% 80% (2 clusters cross over/time step)
of
Grouped Cluster Crossover
Standard design Reduction in crossover points and time steps, balanced crossover Includes nonindependency between clusters May increase recruitment Some clusters do not receive intervention. Increased efficiency. “ “ “ Allows extended optimization of intervention. Increases fidelity of data
ro
Complete SW Typical SW
-p
SW DESIGN VARIATION
ESTIMATED STATISTICAL POWER FOR PRIMARY OUCOME OF PACTHF*
Reduces measurement burden
--77% 85% 85% 80%
74%
52%
Accommodates varying 75% cluster readiness SW, Stepped-wedge (cluster randomized trial); PACT-HF, Patient-Centered Care Transitions in Heart Failure; CV, Coefficient of Variation. Early and Late Blocks
Statistical power for Multi-level clustering was not completed due to complexity of analysis. *
Power estimations are calculated assuming 320 index patients per cluster, 10 clusters, a control event rate of 28% at 1 year, and an ICC of 0.01. All power calculations were estimated to demonstrate a 25% change in the primary composite outcome, and with α < 0.05, 2-tailed. The number of time steps and measurements are modified between variations. 19
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall Figures:
Time Steps
Clusters (Hospital Site) 2
3
4
5
6
7
8
9
10
11
1
0
1
1
1
1
1
1
1
1
1
1
2
0
0
1
1
1
1
1
1
1
1
1
3
0
0
0
1
1
1
1
1
1
1
1
4
0
0
0
0
1
1
1
1
1
1
1
5
0
0
0
0
0
1
1
1
1
1
1
6
0
0
0
0
0
1
1
1
1
1
7
0
0
0
0
8
0
0
0
0
9
0
0
0
0
10
0
0
0
ro
of
1
0
0
1
1
1
1
0
0
0
0
1
1
1
0
0
0
0
0
1
1
0
0
0
0
0
0
0
1
na
lP
0
re
-p
0
Jo
ur
Figure 1. Complete typical SW design: The PACT-HF design (complete, typical SW) for the clinical outcomes, where “0” represents sites not exposed to the intervention and “1” represents sites exposed to the intervention. There are 10 hospital sites, 11 steps, 55 control cross-sections, and 55 intervention cross-sections. The clear cells represent the control wedge and the shaded cells represent the intervention wedge. There are 11*10 = 110 observations for the clinical outcomes.
20
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall
Time Steps 2
3
4
5
6
1
0
1
1
1
1
1
2
0
1
1
1
1
1
3
0
0
1
1
1
1
4
0
0
1
1
1
1
5
0
0
0
1
1
1
6
0
0
0
1
1
1
7
0
0
0
0
1
1
8
0
0
0
0
1
1
9
0
0
0
0
0
1
10
0
0
0
0
1
-p re
lP na
0
of
1
ro
Clusters (Hospital Site)
Jo
ur
Figure 2. Complete SWD with grouped cluster crossover. An alternative PACT-HF complete SW design, with 2 “grouped” clusters crossing over at each time step. “0” represents sites not exposed to the intervention and “1” represents sites exposed to the intervention. The clear cells represent the control wedge and the shaded cells represent the intervention wedge. There are 10 hospital sites, 6 steps, 30 control cross-sections, and 30 intervention cross-sections. There are 6*10 = 60 observations for the clinical outcomes.
21
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall
5
6
7
8
9
10
11
1
0
1
1
1
1
1
1
1
1
1
1
2
0
0
1
1
1
1
1
1
1
1
1
3
0
0
0
1
1
1
1
1
1
1
1
1
0
0
0
0
1
1
1
1
1
1
1
2
0
0
0
0
0
1
1
1
1
1
1
3
0
0
0
0
0
0
1
1
1
1
1
1
0
0
0
0
0
0
0
1
1
1
1
2
0
0
0
0
0
0
0
0
1
1
1
3
0
0
0
0
0
0
0
0
0
1
1
1
0
0
0
0
0
0
0
0
1
0
0
of
4
ro
4
3
-p
3
2
re
2
1
lP
1
Time Steps
na
Region
Clusters (Hospital Site)
Jo
ur
Figure 3. Complete SW design with multi-level clustering. An alternative PACT-HF complete SW with multi-level clustering at the level of region and hospital site. “0” represents sites not exposed to the intervention and “1” represents sites exposed to the intervention. The clear cells represent the control wedge and the shaded cells represent the intervention wedge. There are 4 regions, 10 hospital sites, 11 steps, 55 control cross-sections, and 55 intervention cross-sections.
22
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall
Time Steps 2
3
4
5
6
7
8
9
10
11
1
0
1
1
1
1
1
1
1
1
1
1
2
0
0
1
1
1
1
1
1
1
1
1
3
0
0
0
1
1
1
1
1
1
1
1
4
0
0
0
0
1
1
1
1
1
1
1
5
0
0
0
0
0
1
1
1
1
1
1
6
0
0
0
0
0
1
1
1
1
1
7
0
0
0
0
8
0
0
0
0
9
0
0
0
0
10
0
0
0
-p 0
0
0
1
1
1
1
0
0
0
0
1
1
1
0
0
0
0
0
1
1
0
0
0
0
0
0
1
re
lP
0
na
0
of
1
ro
Clusters (Hospital Site)
Jo
ur
Figure 4. Complete SW design with nested sub-study. The PACT-HF study design pattern of a complete SW with a nested sub-study for the patient-reported outcomes. “0” represents sites not exposed to the intervention and “1” represents sites exposed to the intervention. The clear cells represent the control wedge and the shaded cells represent the intervention wedge. The cells within bolded borders represent the 8 sites (of a total of 10 possible sites) collecting the secondary patient-centered outcomes. There are 11*10 = 110 observations for the clinical outcomes, and 9*8 = 72 observations for the patient-centered outcomes.
23
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall
Time Steps 2
3
4
5
6
7
8
9
1
1
1
1
1
1
1
1
1
1
2
0
1
1
1
1
1
1
1
1
3
0
0
1
1
1
1
1
1
1
4
0
0
0
1
1
1
1
1
5
0
0
0
0
1
of
1
1
1
1
1
6
0
0
0
0
1
1
1
1
7
0
0
0
0
0
0
1
1
1
8
0
0
0
0
0
0
0
1
1
9
0
0
0
0
0
0
0
0
1
10
0
0
0
0
0
0
0
re
lP
0
0
na
0
ro
1
-p
Clusters (Hospital Site)
Jo
ur
Figure 5. Complete SW combined with parallel cluster design (hybrid SW design). An alternative PACT-HF study design pattern of a Hybrid 20:80 design with 2 parallel clusters and 8 clusters in a modified SW. Clusters 1 and 10 are parallel clusters exposed to only control or intervention throughout the trial. Clusters 2 through 9 are arranged in a modified complete typical SW. “0” represents sites not exposed to the intervention and “1” represents sites exposed to the intervention. The clear cells represent the control wedge and the shaded cells represent the intervention wedge. There are 9*8 = 72 observations in the modified SW, and 9*2 = 18 observations in the parallel design component.
24
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall
Time Steps 2
3
4
5
6
7
8
9
10
11
12
1
0
.
1
1
1
1
1
1
1
1
1
1
2
0
0
.
1
1
1
1
1
1
1
1
1
3
0
0
0
.
1
1
1
1
1
1
1
1
4
0
0
0
0
.
1
1
1
1
1
1
1
5
0
0
0
0
0
.
1
1
1
1
1
6
0
0
0
0
0
0
.
1
1
1
1
7
0
0
0
0
0
ro
1
0
.
1
1
1
1
8
0
0
0
0
0
0
0
0
.
1
1
1
9
0
0
0
0
10
0
0
0
of
1
re
Clusters (Hospital Site)
-p
0
0
0
0
0
0
.
1
1
lP
1
0
0
0
0
0
.
1
0
na
0
Jo
ur
Figure 6. Incomplete SW design with a learning period. An alternative PACT-HF study design pattern of an ‘Incomplete SW’ with a learning period. “0” represents sites not exposed to the intervention and “1” represents sites exposed to the intervention. “.” represents sites during the implementation period where measurements are not collected. There is an increased number of time steps (12). The total number of observations is the same as in an equivalent complete SW of 10*11 = 110.
25
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall
Time Steps 2
3
4
5
6
7
8
9
10
11
12
1
0
1
1
.
.
.
.
.
.
.
.
.
2
.
0
1
1
.
.
.
.
.
.
.
.
3
.
.
0
1
1
.
.
.
.
.
.
.
4
.
.
.
0
1
1
.
.
.
.
.
.
5
.
.
.
.
0
1
.
.
.
.
.
6
.
.
.
.
.
0
1
.
.
.
.
7
.
.
.
.
.
ro
1
.
1
1
.
.
.
8
.
.
.
.
.
.
.
0
1
1
.
.
9
.
.
.
.
10
.
.
.
of
1
re
Clusters (Hospital Site)
-p
0
.
.
.
.
0
1
1
.
lP
1
.
.
.
.
0
1
1
.
na
.
Jo
ur
Figure 7. Incomplete SW design with limited measurements prior to and after crossover. There are 10 clusters/hospital sites, 12 steps, 10 control cross-sections, and 20 intervention cross-sections. “0” represents sites not exposed to the intervention and “1” represents sites exposed to the intervention. The clear cells represent the control wedge and the shaded cells represent the intervention wedge. Each cluster is measured over one time period prior to crossover, and over two time periods after crossover. There are 3*10 = 30 total observations.
26
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall
Time Steps
Clusters (Hospital Site) 4
5
6
7
8
9
10
11
12
1
0
1
1
1
1
1
.
.
.
.
.
.
2
0
0
1
1
1
1
.
.
.
.
.
.
3
0
0
0
1
1
1
.
.
.
.
.
.
4
0
0
0
0
1
1
.
.
.
.
.
.
5
0
0
0
0
0
1
.
.
.
.
.
.
6
.
.
.
.
.
7
.
.
.
.
.
8
.
.
.
.
9
.
.
.
.
10
.
.
.
0
ro
1
1
1
1
1
.
0
0
1
1
1
1
.
.
0
0
0
1
1
1
.
.
0
0
0
0
1
1
.
0
0
0
0
0
1
.
re
.
-p
of
3
lP
Late Block
2
.
na
Early Block
1
Jo
ur
Figure 8. Incomplete SW with early and late blocks. “0” represents sites not exposed to the intervention and “1” represents sites exposed to the intervention. The clear cells represent the control wedge and the shaded cells represent the intervention wedge. The first 5 clusters are stratified to the ‘Early’ block and receive intervention in the first half of the design. The second 5 clusters are stratified to the ‘Late’ block and receive intervention in the second half of the design. There are 5*6*2=60 total observations.
27
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall References 1.
Hussey MA, Hughes JP. Design and analysis of stepped wedge cluster randomized trials. Contemp Clin Trials. 2007;28(2):182–91.
2.
Hemming K, Haines TP, Chilton PJ, Girling AJ, Lilford RJ. The stepped wedge cluster randomised trial: rationale, design, analysis, and reporting. BMJ [Internet]. Feb
6;350(feb06
1):h391–h391.
http://www.bmj.com/cgi/doi/10.1136/bmj.h391
from:
Parienti J-J, Kuss O. Cluster-crossover design: A method for limiting clusters level
ro
3.
Available
of
2015
2018
Mar
19];28(3):316–23.
Available
from:
re
[cited
-p
effect in community-intervention studies. Contemp Clin Trials [Internet]. 2007 May
http://www.ncbi.nlm.nih.gov/pubmed/17110172 Barker D, McElduff P, D’Este C, Campbell MJ. Stepped wedge cluster
lP
4.
na
randomised trials: a review of the statistical methodology used and available.
Available
ur
BMC Med Res Methodol [Internet]. 2016 Dec 6 [cited 2017 Aug 23];16(1):69. from:
0176-5 5.
Jo
http://bmcmedresmethodol.biomedcentral.com/articles/10.1186/s12874-016-
Beard E, Lewis JJ, Copas A, Davey C, Osrin D, Baio G, et al. Stepped wedge randomised controlled trials: systematic review of studies published between 2010 and 2014. Trials [Internet]. 2015 Aug 17 [cited 2018 Jun 26];16:353. Available from: http://www.ncbi.nlm.nih.gov/pubmed/26278881
6.
Van Spall HGC, Lee SF, Xie F, Ko DT, Thabane L, Ibrahim Q, et al. Knowledge to action: Rationale and design of the Patient-Centered Care Transitions in Heart Failure (PACT-HF) stepped wedge cluster randomized trial. Am Heart J [Internet]. 28
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall 2018
May
[cited
2018
May
24];199:75–82.
Available
from:
http://www.ncbi.nlm.nih.gov/pubmed/29754670 7.
Freedman B. Equipoise and the Ethics of Clinical Research. N Engl J Med [Internet]. 1987 Jul 16 [cited 2017 Oct 27];317(3):141–5. Available from: http://www.ncbi.nlm.nih.gov/pubmed/3600702
8.
Van Spall HGC, Rahman T, Mytton O, Ramasundarahettige C, Ibrahim Q, Kabali
of
C, et al. Comparative effectiveness of transitional care services in patients
ro
discharged from the hospital with heart failure: a systematic review and network
-p
meta-analysis. Eur J Heart Fail [Internet]. 2017 Feb [cited 2017 Sep 9]; Available
9.
re
from: http://doi.wiley.com/10.1002/ejhf.765
Takeda A, Taylor SJ, Taylor RS, Khan F, Krum H, Underwood M. Clinical service
lP
organisation for heart failure. In: Taylor SJ, editor. Cochrane Database of
[cited
2017
na
Systematic Reviews [Internet]. Chichester, UK: John Wiley & Sons, Ltd; 2012 May
26].
Available
from:
Hemming K, Lilford R, Girling AJ. Stepped-wedge cluster randomised controlled
Jo
10.
ur
http://doi.wiley.com/10.1002/14651858.CD002752.pub3
trials: A generic framework including parallel and multiple-level designs. Stat Med. 2015;34(2):181–96. 11.
Kotz D, Spigt M, Arts ICW, Crutzen R, Viechtbauer W. Use of the stepped wedge design cannot be recommended: A critical appraisal and comparison with the classic cluster randomized controlled trial design. J Clin Epidemiol [Internet]. 2012 Dec
[cited
2019
Jul
24];65(12):1249–52.
Available
from:
https://linkinghub.elsevier.com/retrieve/pii/S0895435612001710 12.
Hemming K, Girling AJ, Sitch AJ, Marsh J, Lilford RJ. Sample size calculations for 29
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall cluster randomised controlled trials with a fixed number of clusters. BMC Med Res Methodol [Internet]. 2011 Jun 30 [cited 2017 Nov 26];11:102. Available from: http://www.ncbi.nlm.nih.gov/pubmed/21718530 13.
Taljaard M, Teerenstra S, Ivers NM, Fergusson DA. Substantial risks associated with few clusters in cluster randomized and stepped wedge designs. Clin Trials [Internet].
2016
[cited
2017
Nov
28];13(4):459–63.
Available
from:
Copas AJ, Lewis JJ, Thompson JA, Davey C, Baio G, Hargreaves JR. Designing
ro
14.
of
http://journals.sagepub.com/doi/pdf/10.1177/1740774516634316
-p
a stepped wedge trial: three main designs, carry-over effects and randomisation
re
approaches. Trials [Internet]. 2015 Aug 17 [cited 2017 Jun 27];16:352. Available from: http://www.ncbi.nlm.nih.gov/pubmed/26279154 de Hoop E, Woertman W, Teerenstra S. The stepped wedge cluster randomized
lP
15.
na
trial always requires fewer clusters but not always fewer measurements, that is, participants than a parallel cluster randomized trial in a cross-sectional design. J
ur
Clin Epidemiol [Internet]. 2013 Dec [cited 2018 Apr 25];66(12):1428. Available
16.
Jo
from: http://www.ncbi.nlm.nih.gov/pubmed/24021612 Kristunas CA, Hemming K, Eborall HC, Gray LJ. The use of feasibility studies for stepped-wedge cluster randomised trials: protocol for a review of impact and scope. BMJ Open [Internet]. 2017 Aug 1 [cited 2018 Jun 26];7(7):e017290. Available from: http://www.ncbi.nlm.nih.gov/pubmed/28765139 17.
Brown CA, Lilford RJ. The stepped wedge trial design: a systematic review. BMC Med Res Methodol [Internet]. 2006 Nov 8 [cited 2017 Aug 29];6:54. Available from: http://www.ncbi.nlm.nih.gov/pubmed/17092344
18.
Baio G, Copas A, Ambler G, Hargreaves J, Beard E, Omar RZ. Sample size 30
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall calculation for a stepped wedge trial. Trials [Internet]. 2015 Aug 17 [cited 2017 Sep 12];16:354. Available from: http://www.ncbi.nlm.nih.gov/pubmed/26282553 19.
Killip S, Mahfoud Z, Pearce K. What is an intracluster correlation coefficient? Crucial concepts for primary care researchers. Ann Fam Med [Internet]. 2004 [cited
2017
May
27];2(3):204–8.
Available
from:
http://www.ncbi.nlm.nih.gov/pubmed/15209195 Campbell MK, Grimshaw JM, Elbourne DR. Intracluster correlation coefficients in
of
20.
ro
cluster randomized trials: empirical insights into how should they be reported.
-p
BMC Med Res Methodol [Internet]. 2004 Dec 28 [cited 2018 Feb 20];4(1):9. from:
re
Available
http://bmcmedresmethodol.biomedcentral.com/articles/10.1186/1471-2288-4-9 Woertman W, de Hoop E, Moerbeek M, Zuidema SU, Gerritsen DL, Teerenstra S.
lP
21.
na
Stepped wedge designs could reduce the required sample size in cluster randomized trials. J Clin Epidemiol [Internet]. 2013 Jul 1 [cited 2017 Nov
de Hoop E, van der Tweel I, van der Graaf R, Moons KGM, van Delden JJM,
Jo
22.
ur
26];66(7):752–8. Available from: http://www.ncbi.nlm.nih.gov/pubmed/23523551
Reitsma JB, et al. The need to balance merits and limitations from different disciplines when considering the stepped wedge cluster randomized trial design. BMC Med Res Methodol [Internet]. 2015 Dec 30 [cited 2017 Dec 6];15(1):93. Available
from:
http://bmcmedresmethodol.biomedcentral.com/articles/10.1186/s12874-0150090-2 23.
Kristunas C, Morris T, Gray L. Unequal cluster sizes in stepped-wedge cluster randomised trials: a systematic review. BMJ Open [Internet]. 2017 Nov 15 [cited 31
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall 2018
Feb
23];7(11):e017151.
Available
from:
http://www.ncbi.nlm.nih.gov/pubmed/29146637 24.
Eldridge SM, Ashby D, Kerry S. Sample size for cluster randomized trials: effect of coefficient of variation of cluster size and analysis method. Int J Epidemiol [Internet]. 2006 Oct 1 [cited 2018 Feb 25];35(5):1292–300. Available from: http://www.ncbi.nlm.nih.gov/pubmed/16943232 Thompson JA, Fielding K, Hargreaves J, Copas A. The optimal design of stepped
of
25.
ro
wedge trials with equal allocation to sequences and a comparison to other trial
-p
designs. Clin Trials [Internet]. 2017 Dec 10 [cited 2018 May 28];14(6):639–47.
26.
re
Available from: http://www.ncbi.nlm.nih.gov/pubmed/28797179 van der Tweel I, van der Graaf R. Issues in the Use of Stepped Wedge Cluster
16
[cited
2018
Apr
25];13(9):23–4.
Available
from:
na
Sep
lP
and Alternative Designs in the Case of Pandemics. Am J Bioeth [Internet]. 2013
http://www.ncbi.nlm.nih.gov/pubmed/23952827 Fatemi Y, Jacobson RM. The Stepped Wedge Cluster Randomized Trial and Its
ur
27.
Jo
Potential for Child Health Services Research: A Narrative Review. Acad Pediatr [Internet]. 2015 Mar [cited 2018 Jun 27];15(2):128–33. Available from: http://linkinghub.elsevier.com/retrieve/pii/S1876285914003908 28.
Hargreaves JR, Copas AJ, Beard E, Osrin D, Lewis JJ, Davey C, et al. Five questions to consider before conducting a stepped wedge trial. Trials [Internet]. 2015
Dec
17
[cited
2018
Jun
8];16(1):350.
Available
from:
http://trialsjournal.biomedcentral.com/articles/10.1186/s13063-015-0841-8 29.
Bonell CP, Hargreaves J, Cousens S, Ross D, Hayes R, Petticrew M, et al. Alternatives to randomisation in the evaluation of public health interventions: 32
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall design challenges and solutions. J Epidemiol Community Health [Internet]. 2011 Jul
1
[cited
2018
Jun
27];65(7):582–7.
Available
from:
http://www.ncbi.nlm.nih.gov/pubmed/19213758 30.
Prost A, Binik A, Abubakar I, Roy A, De Allegri M, Mouchoux C, et al. Logistic, ethical, and political dimensions of stepped wedge trials: critical review and case
http://www.ncbi.nlm.nih.gov/pubmed/26278521
Mdege ND, Man M-S, Taylor (nee Brown) CA, Torgerson DJ. Systematic review
ro
31.
of
studies. Trials [Internet]. 2015 Aug 17 [cited 2018 Jun 26];16:351. Available from:
-p
of stepped wedge cluster randomized trials shows that design is particularly used
re
to evaluate interventions during routine implementation. J Clin Epidemiol [Internet]. 2011 Sep [cited 2018 Jun 27];64(9):936–48. Available from:
Hemming K, Taljaard M, Forbes A. Analysis of cluster randomised stepped wedge
na
32.
lP
http://linkinghub.elsevier.com/retrieve/pii/S089543561000435X
4;18(1):101.
ur
trials with repeated cross-sectional samples. Trials [Internet]. 2017 Dec Available
from:
33.
Jo
http://trialsjournal.biomedcentral.com/articles/10.1186/s13063-017-1833-7 Hughes JP, Granston TS, Heagerty PJ. Current issues in the design and analysis of stepped wedge trials. Contemp Clin Trials [Internet]. 2015 Nov [cited 2017 Aug 22];45:55–60.
Available
from:
http://linkinghub.elsevier.com/retrieve/pii/S1551714415300434 34.
Taljaard M, Hemming K, Shah L, Giraudeau B, Grimshaw JM, Weijer C. Inadequacy of ethical conduct and reporting of stepped wedge cluster randomized trials: Results from a systematic review. Clin Trials J Soc Clin Trials [Internet]. 2017
Aug
8
[cited
2018
Jun
26];14(4):333–41.
Available
from: 33
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall http://www.ncbi.nlm.nih.gov/pubmed/28393537 Thabane A, Dennis BB, Gajic-Veljanoski O, Paul J, Thabane L. Reporting quality of stepped wedge design randomized trials: a systematic review protocol. Clin Epidemiol [Internet]. 2016 [cited 2018 Jun 26];8:261–6. Available from:
ur
na
lP
re
-p
ro
of
http://www.ncbi.nlm.nih.gov/pubmed/27468249
Jo
35.
34
Journal Pre-proof Unni, Lee, Thabane, Connolly, Van Spall
Variations in stepped-wedge cluster randomized trial design: Insights from the ‘PatientCentered Care Transitions in Heart Failure’ trial
Highlights
A stepped wedge (SW) design is a type of cluster randomized trial
All clusters in a SW design sequentially cross over from control to intervention
SW design is useful to study the effect of implementing an intervention
Variations in SW can be used to address methodologic or logistical constraints
Variations include SWs with fewer steps, reduced measurements and nested substudies
Jo
ur
na
lP
re
-p
ro
of
35
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8