Local dynamic stability of treadmill walking: Intrasession and week-to-week repeatability

Local dynamic stability of treadmill walking: Intrasession and week-to-week repeatability

Journal of Biomechanics 47 (2014) 74–80 Contents lists available at ScienceDirect Journal of Biomechanics journal homepage: www.elsevier.com/locate/...

414KB Sizes 1 Downloads 40 Views

Journal of Biomechanics 47 (2014) 74–80

Contents lists available at ScienceDirect

Journal of Biomechanics journal homepage: www.elsevier.com/locate/jbiomech www.JBiomech.com

Local dynamic stability of treadmill walking: Intrasession and week-to-week repeatability Fabienne Reynard b, Philippe Terrier a,b,n a b

IRR, Institute for Research in Rehabilitation, Sion, Switzerland Clinique romande de réadaptation SUVACare, Sion, Switzerland

art ic l e i nf o

a b s t r a c t

Article history: Accepted 7 October 2013

Repetitive falls degrade the quality of life of elderly people and of patients suffering of various neurological disorders. In order to prevent falls while walking, one should rely on relevant early indicators of impaired dynamic balance. The local dynamic stability (LDS) represents the sensitivity of gait to small perturbations: divergence exponents (maximal Lyapunov exponents) assess how fast a dynamical system diverges from neighbor points. Although numerous findings attest the validity of LDS as a fall risk index, reliability results are still sparse. The present study explores the intrasession and intersession repeatability of gait LDS using intraclass correlation coefficients (ICC) and standard error of measurement (SEM). Ninety-five healthy individuals performed 5 min treadmill walking in two sessions separated by 9 days. Trunk acceleration was measured with a 3D accelerometer. Three time scales were used to estimate LDS: over 4–10 strides (λ4–10), over one stride (λ1) and over one step (λ0.5). The intrasession repeatability was assessed from three repetitions of either 35 strides or 70 strides taken within the 5 min tests. The intersession repeatability compared the two sessions, which totalized 210 strides. The intrasession ICCs (70-strides estimates/35-strides estimates) were 0.52/0.18 for λ4–10 and 0.84/0.77 for λ1 and λ0.5. The intersession ICCs were around 0.60. The SEM results revealed that λ0.5 measured in medio-lateral direction exhibited the best reliability, sufficient to detect moderate changes at individual level (20%). However, due to the low intersession repeatability, one should average several measurements taken on different days in order to better approximate the true LDS. & 2013 Elsevier Ltd. All rights reserved.

Keywords: Gait variability Intraclass Correlation Coefficient (ICC) Nonlinear dynamics Maximal Lyapunov exponent

1. Introduction Falls are a major health issue in older adults (Stevens et al., 2006). Moreover, repetitive falls degrade the quality of life of patients suffering of various neurological disorders (Finlayson et al., 2006; Kerr et al., 2010; Ramnemark et al., 2000). It has been shown that stride-to-stride variability of gait kinematics is related to fall risk (Brach et al., 2007; Verghese et al., 2009). This increased variability may be the result of a decreased ability to optimally control gait. Several analytical methods have been developed to better take into account the nonlinear features of gait variability (Hamacher et al., 2011; Hausdorff, 2007; Stergiou and Decker, 2011). Originally developed to detect deterministic chaos in nonlinear dynamical systems, the maximal Lyapunov exponent has been advocated as a relevant method to assess the degree of sensitivity of gait to small perturbations, or in other words the local dynamic stability (LDS). Computed from various continuously

n Corresponding author at: Clinique romande de réadaptation, SUVACare, Av. Gd-Champsec 90, 1951 Sion, Switzerland. Tel.: þ41 27 603 23 91. E-mail address: [email protected] (P. Terrier).

0021-9290/$ - see front matter & 2013 Elsevier Ltd. All rights reserved. http://dx.doi.org/10.1016/j.jbiomech.2013.10.011

measured kinematic parameters (speed, acceleration, and joint angles), the LDS represents the rate of divergence between neighbor trajectories in a reconstructed state space that describes the dynamics of the system (Dingwell, 2006; Dingwell and Cusumano, 2000; Terrier and Dériaz, 2013). Some recent experimental and clinical findings support the hypothesis that LDS could be used to predict fall risk (Bruijn et al., 2011; Lockhart and Liu, 2008; McAndrew et al., 2011; Roos and Dingwell, 2010; Toebes et al., 2012). The validity of LDS compared to other indicators has been recently discussed and LDS was found to be one of the best stability indices (Bruijn et al., 2013). However, several issues must be solved before LDS can be routinely used as an early fall-risk predictor in clinical settings. In the first place, information regarding the reliability of LDS measurements is still sparse (Bruijn et al., 2009; Kang and Dingwell, 2006). Recently, a good intrassession repeatability (ICC between 0.79 and 0.92) has been described by analyzing long duration outdoor walking (200 strides) in young healthy subjects (N ¼20) (van Schooten et al., 2013). The same study reports lower intersession (between nonconsecutive days) repeatability lying between 0.38 and 0.63. However, more results are needed about the LDS reliability in short indoor walking tests, which are more adapted to patients

F. Reynard, P. Terrier / Journal of Biomechanics 47 (2014) 74–80

with diminished walking capabilities (Hilfiker et al., 2013; Terrier et al., 2013). Other research questions need to be addressed in order to increase the usability of LDS in clinical settings. Two different methods are used to characterize short-term LDS, which is assumed to be the more relevant time scale for assessing gait stability (Bruijn et al., 2011; Roos and Dingwell, 2010): one method computes divergence over the duration of one stride (Manor et al., 2009; McAndrew Young and Dingwell, 2012), and the other one over the duration of one step (Bruijn et al., 2010a; Toebes et al., 2012). It is still not known whether one method offers more precision than the other one. In addition, it is known that a minimal length of continuous signal is necessary to assess the maximal Lyapunov exponents (Kang and Dingwell, 2006; Rosenstein et al., 1993). However, the averaging of several estimates of LDS obtained from distinct short-duration signals produces reliable results, at least at group level (Sloot et al., 2011). Thereby two possibilities should be distinguished: (1) Inherent to the calculation method, a too short-duration signal induces very large error on LDS estimates: long-duration walking tests are therefore mandatory; (2) LDS can be precisely assessed from a short-duration signal, but it substantially varies from strides to strides: as a result, the precision of estimate can be increased by averaging the results obtained from several short-duration walking tests. To address the aforementioned issues, the LDS of 95 healthy adults walking on a treadmill was assessed from trunk acceleration signals. We aimed to answer the following research questions: (1) what is the intrasession (within a 5 min continuous measurement) and intersession (week-to-week) repeatability (i.e. absolute agreement among repetitions) of LDS? (2) How reliable is an estimate of LDS obtained from short-duration walking tests? (3) Does the repeatability of LDS increase with measurement length as expected? (4) Do the two methods that assess short-term LDS exhibit the same repeatability?

2. Methods 2.1. Subjects One hundred healthy subjects (50 males, 50 females) were recruited to participate in the study. They were selected according to their age and sex. Ten males and 10 females for each decade, between 20 and 69 years old, were included. The data of five subjects had to be discarded due to technical issues (age of the discarded participants (yr): 58, 41, 22, 23, 30; 2 males, 3 females). Therefore, the final sample contained 95 participants (48 males, 47 females) whose characteristics were (mean (SD)): age 44 yr (14), body weight 70 kg (14), body height 1.72 m (0.07)). None had a history of neurological or orthopedic conditions likely to affect their mobility. Each subject gave his/her written informed consent. The study was approved by the regional medical ethics committee (Commission Cantonale Valaisanne d'Ethique Médicale, Sion, Switzerland).

2.2. Experimental procedure The subjects were equipped with a tri-axial accelerometer (Physilog system, BioAGM, Switzerland; sampling rate 200 Hz, 16-bit resolution) fixed to a belt at the anterior upper trunk level under the sternal notch. The accelerometer was connected to a lightweight data logger and it measured the body accelerations along three axes: medio-lateral (ML), vertical (V) and antero-posterior (AP). For safety reasons, subjects walked on a level treadmill (HP Cosmos, Venus model, Traunstein, Germany) wearing a harness, which did not perturb the arms and legs movements. To remove any influence of the footwear on gait characteristics, the participants walked barefoot. The subjects walked during 5 min at preferred walking speed (PWS), assessed as described in Dingwell and Marin (2006). The same procedure was repeated on average 9.17 3.6 days later. The same treadmill speed as in the first session was imposed to the participants. The data analysis was then performed with Matlab (Mathworks, MA, USA).

75

2.3. Data analysis Because the switching on of the accelerometer might have caused transient gait perturbations, the first five seconds of the raw acceleration signals were discarded. In order to lower the effect of sensor placement variability among subjects, the 3Dacceleration signals were first re-oriented according to the procedure introduced by Moe-Nilssen (Moe-Nilssen, 1998), which uses the accelerometer's capacity as an inclinometer. The raw 200 Hz signals were filtered down to 50 Hz in order to accelerate the subsequent steps of data analysis; (Chebyshev Type I, zero-phase filtering). The resulting 50 Hz signals (295 s duration) were divided into 3 parts of 95 s duration, beginning respectively at 5 s, 105 s and 205 s. In each of those parts, Step Frequency (SF) was assessed by using Fast Fourier Transform of the vertical acceleration signal. Then the duration corresponding to either 35 strides (1/SF  35) or 70 strides (1/SF  70) were taken from the beginning of each part. The resulting segments, whose length depended upon SF of each participant, were time-normalized to a uniform length of 2,500 samples (35 strides) or 5,000 samples (70 strides), using a polyphase filter implementation (matlab command resample). The length (2500 or 5000) corresponded approximately to the length of the longest original sample. Fig. 1 presents an overall scheme, which summarizes the data pre-processing. The method for quantifying the local dynamic stability (LDS) of gait was based on the computation of the largest Lyapunov exponent using the Rosenstein's algorithm (Dingwell, 2006; Rosenstein et al., 1993; Terrier and Dériaz, 2011). We recently published a thorough description of the method (Terrier and Dériaz, 2013). Therefore, we focus on the particularity of the methodology. A uniform time delay (AP: 5, V: 5, ML: 6) was used for all tests according to the average results of an average mutual information (AMI) analysis. A constant dimension of six was selected, according to the average results of a global false nearest neighbors (GFNN) analysis. Three different time scales were used to compute the slope from the logarithmic divergence curves: (1) Long-term LDS, over 4–10 strides (λ4–10), (2) short term LDS over one stride (λ1), (3) short-term LDS over one step (λ0.5). A constant number of 71 samples per stride was used (i.e. 5000 (2500) samples for 70 (35) strides). 2.4. Statistics Descriptive statistics of the basic gait parameters (speed, cadence, step length) are presented in the supplemental (online) material. In order to illustrate the difference in LDS induced by the length of the measurement, the descriptive statistics (mean, SD, median, and quartiles) for 35 strides and 70 strides are presented separately in Fig. 2, based on the average of the six values obtained from two sessions. Both Intraclass Correlation Coeffecient (ICC) and Standard Error of Measurement (SEM) were used to characterize intra- and intersession repeatability. The ICC method and nomenclature were those proposed in McGraw and Wong (1996). The ICC(A,1) model was used, which assesses the degree of absolute agreement among measurements. The SEM is the group-level estimation of the intra-subject pffiffiffiffiffiffiffiffiffiffiffiaverage variability (Weir, 2005), defined by the following equation: SEM ¼ ST 1  R, where ST is the global standard deviation and R is the corresponding ICC. From pffiffiffiSEM, we also computed the Smallest Detectable Difference (SDD¼ SEM  1.96  2), which is the smallest change that could be considered as significant (Weir, 2005). Normalized by the mean and expressed as percentage, it allows to evaluate whether an observed relative change in one individual could be true and not due to measurement error or to intra-individual variability. The reliability of the average of two repetitions of 35 strides was assessed by using the Spearman–Brown prophecy formula, which predicts the reliability composed by replicating a given test: Rn ¼

NR 1 þ ðN  1ÞR

where Rn is the predicted reliability, N is the number of tests (in this case N ¼ 2) and R is the measured reliability for one test. The purpose was to test whether the predicted reliability for the average of 2  35 strides corresponded to the measured reliability obtained from 70 strides. A substantial difference between predicted and measured reliability would indicate that the error strongly depends upon the length of the measurement. Practically, the reliability within the two sessions (intrasession repeatability, based on three repetitions, ICC(A,1), SEM and SDD) was assessed separately for each session, and then the results were averaged (Table 1). Two complementary approaches were used to characterize the week-to-week reliability: first, we assessed the 70-strides intersession repeatability (two repetitions) by computing pairwisely three ICCs obtained from each 70-strides measurements taken from the two sessions, and then averaging them (Table 2). Second, the LDS results of the three 70-strides measurements were averaged (3  70 strides ¼ 210), and the repeatability between the sessions (two repetitions) was computed (Table 3). The first approach provided information on the reliability of short duration walking test (70 strides), while the second approach mimics the reliability of discontinuous walking tests, from which three repetitions are used to compute an average result (210 strides). The confidence intervals on the estimates (95% CI) were computed by bootstrapping (5000 resamples, bias corrected and accelerated percentile method). A graphical representation of the reliability results is provided in the online supplemental material.

76

F. Reynard, P. Terrier / Journal of Biomechanics 47 (2014) 74–80

95 participants

Session 1 5 s.

105 s.

205 s.

9 days

Session 2 5 s.

105 s.

205 s.

70 strides 35 strides 5 min. treadmill walking at PWS

5 min. treadmill walking at PWS

12 3D acceleration signals per participant

Dependent variables: 1) Long-term LDS ( 4-10) 2) Short term LDS ( 1) 3) Short term LDS ( 0.5)

In AP, V and ML directions

Fig. 1. Overall view of the experimental procedure and of the data pre-processing: LDS: local dynamic stability. PWS: preferred walking speed. AP: antero-posterior. V: vertical. ML: medio-lateral.

3. Results Regarding the descriptive statistics of LDS (Fig. 2), the results are normally distributed among individuals (Lilliefors test, p 40.05), with some outliers. A substantial difference exists between the estimates obtained from 35 strides and from 70 strides, with a more marked effect in long-term LDS; the relative differences (70 strides 35 strides/35 strides  100) were on average þ40% for λ4–10, þ6% for λ1, and þ 8% for λ0.5. The results of the intrasession repeatability (ICC and SEM) are shown in the Table 1. Regarding the 35-strides estimates, a substantial difference exists between long-term LDS (ICC 0.17– 0.20) and short-term LDS (ICC 0.71–0.82). Namely, the 70-strides predicted repeatability (column #7) is comparable to the measured repeatability for short-term LDS, but not for long-term LDS. The 70-strides LDS estimates exhibit higher reliability, with the best relative SDDs for λ0.5 (11%-13%). The results of the intersession repeatability are presented in Tables 2 and 3. Based on 70-strides estimates, the reliability is very low for long term-LDS (SDD 102–142%) and is maximal for λ0.5 (SDD 19–25%). Based on 210 strides estimates, the ICCs are homogenous (λ4–10: 0.58, λ1: 0.59, λ0.5: 0.60). In contrast, the SEM and SDD estimates are higher for long-term LDS, and minimal for λ0.5 (SDD 16–23%). The medio-lateral direction tends to exhibit lower SEM and SDD.

4. Discussion Referring to the research questions presented in the introduction, the results can be summarized as follows: (1) the intrasession repeatability of gait LDS (ICC, 70-strides estimates) was around 0.50 for long-term LDS and 0.85 for short-term LDS, the intersession repeatability was around 0.6 for both short- and long-term LDS estimated from 210 strides (3  70). (2) Long-term LDS

estimated from short-duration measurements (35 strides) exhibited particularly low repeatability (ICC: 0.20), while that of shortterm LDS was around 0.75. (3) Long-term LDS exhibited a substantial discrepancy between the predicted repeatability and the actual repeatability, while it was not the case for short-term LDS. (4) Taking into account both ICC and SEM, short-term LDS measured over one step (λ0.5) tended to have better intra- and inter-session reliability than short-term LDS measured over one stride (λ1). LDS has been computed from various kinematic signals (speed, acceleration, and joint angles) measured at different level (lower limbs, trunk) with different measurement methods (video analysis, and accelerometers). The present study used trunk acceleration measured at the anterior part of the thorax (sternum), a method that has been originally proposed in the first studies that introduced Lyapunov exponents for gait stability assessment (Dingwell and Cusumano, 2000; Dingwell et al., 2001). Few studies have been conducted to compare the validity and reliability among measurement methods (Bruijn et al., 2010b; Kang and Dingwell, 2009). Based on theoretical considerations and empirical evidence, it is thought that different methods can be alternatively used to record the gait dynamics with similar results (Bruijn et al., 2013). The results of the present study are likely generalizable to other measurement methods, but further reliability studies are needed to confirm that hypothesis. The results of the present study clearly show that λ0.5 is more reliable than λ1. Indeed, regarding the intrasession reliability, the λ0.5 repeatability (ICC) is on average 6% higher than the λ1 repeatability. Furthermore, the relative SDD was on average 12% for λ0.5 and 20% for λ1. Regarding intersession reliability computed on 210 strides (Table 3), although both λ0.5 and λ1 exhibit similar ICC (0.60 vs. 0.59), the relative SDD is substantially lower for λ0.5 (average SDD: 20% vs. 28%). The same conclusion can be drawn from the analysis of the 70-strides results (Table 2). Although further investigations are needed to compare the responsiveness

F. Reynard, P. Terrier / Journal of Biomechanics 47 (2014) 74–80

Long−term LDS AP 0.010 (0.006)

0.06

0.014 (0.007)

0

1

0.6

λ

0.4

0.4

35 str.

70 str.

0.2

0.59 (0.09)

0.63 (0.10)

35 str.

70 str.

0.2

Short−term LDS V

Short−term LDS AP

1.4

1.4

1.4

0.6

λ

λ

1

1 0.8

0.5

1.6

0.5

1.6

1.2

1.07 (0.12)

1.15 (0.12)

35 str.

70 str.

0.8 0.6

0.41 (0.06)

0.44 (0.06)

35 str.

70 str.

Short−term LDS ML

1.6

1.2

Short−term LDS ML

λ

1

0.6

1

0.6

0.49 (0.07)

70 str.

4−10

Short−term LDS V

0.8

λ

1

1

70 str.

0.8

0.47 (0.07)

35 str.

0.02

0.8

0.2

0.018 (0.007)

0 35 str.

Short−term LDS AP

0.013 (0.007)

0.04

0.02

70 str.

0.4

0.5

0.06

0 35 str.

λ

0.024 (0.009)

4−10

λ

0.02

1

0.017 (0.009)

Long−term LDS ML

0.04

0.04 λ 4−10

Long−term LDS V

λ

0.06

77

1.2

0.82 (0.07)

0.89 (0.07)

35 str.

70 str.

1 1.18 (0.15)

1.27 (0.16)

35 str.

70 str.

0.8 0.6

Fig. 2. Descriptive statistics of the local dynamic stability (LDS): Ninety-five participants walked during 5 min on a treadmill at two occasions separated by a week. LDS was computed using the maximal finite-time Lyapunov exponents method from trunk acceleration signal measured in antero-posterior (AP), vertical (V), and medio-lateral (ML) directions. Three time scales were used: long-term LDS between 4 and 10 strides (λ4–10); short-term LDS between 0 and 1 stride (λ1); short-term LDS between 0 and 0.5 stride (λ0.5). The LDS estimates were computed either from 35 strides (black) or from 70 strides (grey). The results of both sessions were averaged (see Fig. 1). The spread of the data among participants is presented with boxplots (median and quartiles) and with 95% confidence intervals around the mean (vertical lines).þ indicates outliers. Printed values are mean and SD (N ¼95).

and validity of both λ0.5 and λ1, it is recommended to use λ0.5 in order to increase the statistical power (Perkins et al., 2000). We confirm that the estimates of divergence exponents (λ) increases with the measurement length (Bruijn et al., 2009; Kang and Dingwell, 2006). The relative effect for doubling the length of measurement (35 strides to 70 strides) was on average þ40% for long-term LDS and þ7% for short-term LDS. An explanation could be that a long-duration signal increases the probability of occurrence of very close neighbors that have a larger potential of divergence away from each other (Bruijn et al., 2009). As a result, a careful normalization of sample lengths is required when comparing LDS among groups and conditions, especially if longterm LDS is analyzed. Since the classical works of Spearman (Spearman, 1910) and Brown (Brown, 1910), it is well known that the mean of several replicate measurements on a subject is more reliable than a single measurement. Accordingly, the short-term LDS estimated from 70strides exhibited repeatability, which was similar to repeatability of the average of two 35-strides measurements such as predicted by the Spearman–Brown formula (Table 1). Therefore, there is no fundamental flaw in the Rosenstein's algorithm that would hinder

the computation of the short-term divergence over a small number of consecutive strides. Although short-term LDS may substantially vary from one stride to another, the reliability can be enhanced by averaging several repetitions of short overground walking trials, as already observed in another study (van Schooten et al., 2011). That opens the perspective of LDS assessment in patients with diminished walking capabilities, with sufficient time between successive repetitions to offer recuperation. However, that relies upon the assumption that short repeated overground walking tests will exhibit the same reliability as the continuous treadmill walking, which we used in the present study. We recently obtained results in overground walking that confirm a good intrassession reliability of two consecutive repetitions of 40 strides (ICC about 0.85) (Terrier and Reynard, in press). However, more studies are needed to assess the intra-day reliability of LDS. In contrast to short-term LDS, long-term LDS exhibited very low repeatability when estimated with 35 strides only (Table 1). The increase in reliability using 70-strides estimates was substantially larger than predicted. Thus, the Rosenstein algorithm needs a continuous signal of substantial length to precisely assess divergence over longer time scale.

78

F. Reynard, P. Terrier / Journal of Biomechanics 47 (2014) 74–80

Table 1 Intrasession reliability of local dynamic stability: 95 participants walked during 5 min on a treadmill at two occasions separated by a week. The local dynamic stability (LDS) was computed using the maximal finite-time Lyapunov exponents method from trunk acceleration signal measured in antero-posterior (AP), vertical (V), and medio-lateral (ML) directions. Three time scales were used: long-term LDS between 4 and 10 strides (λ4–10); short-term LDS between 0 and 1 stride (λ1); short-term LDS between 0 and 0.5 stride (λ0.5). The LDS estimates were computed either from 35 strides (left) or from 70 strides (right). The agreement among three within-session repetitions (intraclass correlation coefficient (ICC(A,1)), the standard error of measurement (SEM), and the relative smallest detectable difference (SDD) are shown. The repeatability for a 70 strides test predicted from a 35 strides test (Spearmann–Brown) is also shown. The ICCs, SEMs and SDDs obtained during both sessions were averaged. The 95% confidence intervals (CI) were computed by bootstrapping (5000 resamples). 35 Strides

Long-term LDS (λ4–10)

Short-term LDS (λ1)

Short-term LDS (λ0.5)

AP V ML AP V ML AP V ML

70 str.

70 Strides

Relative SDD

ICC

95% CI

SEM

95% CI

Predicted

ICC

95% CI

SEM

95% CI

35 strides (%)

70 strides (%)

0.20 0.17 0.18 0.78 0.74 0.71 0.82 0.82 0.74

0.11–0.30 0.07–0.30 0.08–0.30 0.73–0.84 0.67–0.80 0.65–0.76 0.79–0.86 0.77–0.87 0.69–0.79

0.010 0.014 0.011 0.041 0.058 0.036 0.060 0.073 0.044

0.009–0.011 0.013–0.015 0.010–0.012 0.037–0.045 0.053–0.065 0.034–0.040 0.055–0.065 0.067–0.081 0.040–0.048

0.33 0.29 0.31 0.87 0.85 0.83 0.90 0.90 0.85

0.54 0.54 0.48 0.85 0.83 0.79 0.88 0.88 0.81

0.46–0.63 0.45–0.64 0.39–0.59 0.81–0.89 0.79–0.88 0.75–0.84 0.86–0.91 0.85–0.91 0.78–0.85

0.006 0.008 0.007 0.034 0.047 0.030 0.049 0.061 0.037

0.006–0.007 0.008–0.009 0.006–0.008 0.031–0.038 0.043–0.053 0.028–0.033 0.045–0.055 0.055–0.069 0.034–0.039

275 231 233 24 27 24 15 17 15

118 95 106 19 21 19 12 13 11

Table 2 Intersession (week-to-week) reliability of local dynamic stability based on 70 strides: 95 participants walked during 5 min on a treadmill at two occasions separated by a week. The local dynamic stability (LDS) was computed using the maximal finite-time Lyapunov exponents method from trunk acceleration signal measured in antero-posterior (AP), vertical (V), and medio-lateral (ML) directions. Three time scales were used: long-term LDS between 4 and 10 strides (λ4–10); short-term LDS between 0 and 1 stride (λ1); short-term LDS between 0 and 0.5 stride (λ0.5). The agreement among two between-session repetitions of 70-strides length (intraclass correlation coefficient (ICC(A,1)), the standard error of measurement (SEM), and the relative smallest detectable difference (SDD) are shown. The 95% confidence intervals (CI) were computed by bootstrapping (5000 resamples).

Long-term LDS (λ4–10)

Short-term LDS (λ1)

Short-term LDS (λ0.5)

AP V ML AP V ML AP V ML

ICC

95% CI

SEM

95% CI

Relative SDD (%)

0.35 0.47 0.37 0.46 0.60 0.53 0.51 0.63 0.50

0.24–0.47 0.36–0.59 0.26–0.51 0.32–0.58 0.47–0.71 0.42–0.62 0.37–0.63 0.53–0.75 0.40–0.61

0.007 0.009 0.008 0.064 0.073 0.046 0.102 0.106 0.060

0.007–0.008 0.008–0.010 0.007–0.009 0.057–0.073 0.065–0.084 0.042–0.054 0.091–0.117 0.095–0.122 0.054–0.067

142 102 117 36 32 29 25 23 19

Table 3 Intersession (week-to-week) reliability of local dynamic stability based on 210 strides: 95 participants walked during 5 min on a treadmill at two occasions separated by a week. The local dynamic stability (LDS) was computed using the maximal finite-time Lyapunov exponents method from trunk acceleration signal measured in antero-posterior (AP), vertical (V), and medio-lateral (ML) directions. Three time scales were used: long-term LDS between 4 and 10 strides (λ4–10); short-term LDS between 0 and 1 stride (λ1); short-term LDS between 0 and 0.5 stride (λ0.5) The LDS estimates were computed from 210 strides (average of the three 70 strides estimates). The agreement among two repetitions (intraclass correlation coefficient (ICC(A,1)), the standard error of measurement (SEM), and the relative smallest detectable difference (SDD) are shown. The 95% confidence intervals (CI) were computed by bootstrapping (5000 resamples).

Long-term LDS (λ4–10)

Short-term LDS (λ1)

Short-term LDS (λ0.5)

AP V ML AP V ML AP V ML

ICC

95% CI

SEM

95% CI

Relative SDD (%)

0.47 0.67 0.60 0.51 0.66 0.60 0.55 0.68 0.58

0.31–0.61 0.57–0.77 0.47–0.73 0.36–0.63 0.52–0.78 0.48–0.70 0.40–0.67 0.57–0.80 0.47–0.68

0.0055 0.0058 0.0050 0.059 0.064 0.040 0.094 0.096 0.051

0.005–0.007 0.005–0.006 0.004–0.006 0.051–0.068 0.055–0.076 0.035–0.047 0.083–0.110 0.084–0.113 0.045–0.060

107 67 76 33 28 25 23 21 16

While there is likely a consensus to judge an ICC value of 0.3 as “low” and an ICC value of 0.95 as “high”, there is no absolute threshold to delimit whether a reliability index is “acceptable” (Lance et al., 2006). On the contrary, the satisfactory reliability level depends upon the use of the investigated variable. In particular, the responsiveness (effect size) of the variable should guide the interpretation of reliability results. Various studies have demonstrated a large spread in LDS sensitivity to different conditions, as for example: when submitting individuals to mechanical

perturbations during treadmill walking, the short-term LDS is lowered by about 70% (McAndrew et al., 2011); fall-prone older adults exhibits a lower LDS as compared to healthy controls: the difference is  20% (Lockhart and Liu, 2008); when balance is artificially impaired by using galvanic vestibular stimulation (GVS), the mean relative effect on short-term LDS is 11% (van Schooten et al., 2011); the use of orthopedic shoes tends to improve shortterm LDS in patients with chronic foot and ankle injuries ( þ10% in ML direction (Terrier et al., 2013)). In experimental studies, which

F. Reynard, P. Terrier / Journal of Biomechanics 47 (2014) 74–80

submit individuals to varying conditions during a single testing session, the short-term LDS reliability (table 1) is likely sufficient to highlight gait stability modification even when using 35-strides estimates, at least at group level. However, such a study design is of limited relevance for clinical studies, which most often aim to assess long term therapy efficiency in longitudinal studies (Yakhdani et al., 2010). In particular, the use of LDS as a diagnostic tool for fall risk assessment at individual level needs a sufficiently low intersession SEM and SDD (Tables 2 and 3). In this context, only λ0.5 measured in ML direction averaged from 210 strides seems reliable enough (SDD: 16%) to highlight a change that corresponds to the difference between healthy and fall-prone older adults (20% (Lockhart and Liu, 2008)). Therefore, there is still a need to further analyze the origin of the substantial weekto-week variability of LDS. As an explanation, one could postulate a modification of sensor placement. Other causes could be change of individual's mood and fatigue. Interestingly, it seems that a difference exists between the reliability of kinematic parameters and the reliability of variability indices. Indeed, high ICC (4 0.90) have been reported for both temporal and spatial kinematic parameters measured in short walking tests either for intersession reliability (van Uden and Besser, 2004) or intrasession reliability (Brach et al., 2008). In contrast, the classical measures of variability exhibits lower repeatability: ICCs between 0.40 and 0.63 have been reported for intrasession repeatability (SD of step length and time) (Brach et al., 2008). In the same manner, it has been reported that a substantial number of consecutive strides (4 60) are needed to precisely assess the variability of gait velocity (Hollman et al., 2010). Therefore, LDS estimates seem to be sufficiently precise as compared to other (classical) gait variability indices.

5. Conclusions To conclude, the following advices should be given for future gait LDS studies. (1) It is recommended to normalize sample length before computing LDS to thwart the trend to higher LDS estimates with longer measurements. (2) Regarding long-term LDS, its low reliability when few strides are analyzed makes the use of a treadmill highly recommended in order to record long duration walking tests. Furthermore, given the limited reliability, its use should be restricted to group-level assessment with a sufficient sample size to lower type II error risk. (3) Regarding short-term LDS, it is recommended to use λ0.5, provided that future studies further document its validity. (4) The ML acceleration signal should be used in priority because it exhibits the lowest SEM. (5) As far as treadmill results could be applied to overground situations, the repetition of several short-duration indoor walking tests would be a solution to assess gait stability in clinical settings. (6) The reliability of short-term LDS may be sufficient to detect moderate changes (around 20%) at individual level. However, due to the lower intersession repeatability, one should aggregate several measurements taken on different days, in order to better approximate the true LDS.

6. Conflict of interest statement There are no known conflicts of interest.

Acknowledgments The authors would like to thank Olivier Dériaz for his valuable support and thoughtful advice. The study was supported by the

79

Swiss accident insurance company SUVA, which is an independent, non-profit company under public law, and by the clinique romande de réadaptation. The IRR (Institute for Research in Rehabilitation) is supported by the State of Valais and the City of Sion.

Appendix A. Supplementary material Supplementary data associated with this article can be found in the online version at http://dx.doi.org/10.1016/j.jbiomech.2013.10.011. References Brach, J.S., Perera, S., Studenski, S., Newman, A.B., 2008. The reliability and validity of measures of gait variability in community-dwelling older adults. Arch. Phys. Med. Rehabil. 89, 2293–2296. Brach, J.S., Studenski, S.A., Perera, S., VanSwearingen, J.M., Newman, A.B., 2007. Gait variability and the risk of incident mobility disability in community-dwelling older adults. J. Gerontol. Ser. A Biol. Sci. Med. Sci. 62, 983–988. Brown, W., 1910. Some experimental results in the correlation of mental abilities. Br. J. Psychol. 1904-1920 (3), 296–322. Bruijn, S.M., Bregman, D.J., Meijer, O.G., Beek, P.J., van Dieen, J.H., 2011. Maximum Lyapunov exponents as predictors of global gait stability: a modelling approach. Med. Eng. Phys. 34, 428–436. Bruijn, S.M., Meijer, O.G., Beek, P.J., van Dieën, J.H., 2010a. The effects of arm swing on human gait stability. J. Exp. Biol. 213, 3945–3952. Bruijn, S.M., Meijer, O.G., Beek, P.J., van Dieën, J.H., 2013. Assessing the stability of human locomotion: a review of current measures. J. R. Soc. Interface 10, 20120999. Bruijn, S.M., Ten Kate, W.R., Faber, G.S., Meijer, O.G., Beek, P.J., van Dieen, J.H., 2010b. Estimating dynamic gait stability using data from non-aligned inertial sensors. Ann. Biomed. Eng. 38, 2588–2593. Bruijn, S.M., van Dieen, J.H., Meijer, O.G., Beek, P.J., 2009. Statistical precision and sensitivity of measures of dynamic gait stability. J. Neurosci. Methods 178, 327–333. Dingwell, J.B., 2006. Lyapunov Exponents. Wiley Encyclopedia of Biomedical Engineering. Dingwell, J.B., Cusumano, J.P., 2000. Nonlinear time series analysis of normal and pathological human walking. Chaos 10, 848–863. Dingwell, J.B., Cusumano, J.P., Cavanagh, P.R., Sternad, D., 2001. Local dynamic stability versus kinematic variability of continuous overground and treadmill walking. J. Biomech. Eng. 123, 27–32. Dingwell, J.B., Marin, L.C., 2006. Kinematic variability and local dynamic stability of upper body motions when walking at different speeds. J. Biomech. 39, 444–452. Finlayson, M.L., Peterson, E.W., Cho, C.C., 2006. Risk factors for falling among people aged 45–90 years with multiple sclerosis. Arch. Phys. Med. Rehabil. 87, 1274–1279. Hamacher, D., Singh, N.B., Van Dieen, J.H., Heller, M.O., Taylor, W.R., 2011. Kinematic measures for assessing gait stability in elderly individuals: a systematic review. J. R. Soc. Interface 8, 1682–1698. Hausdorff, J.M., 2007. Gait dynamics, fractals and falls: finding meaning in the stride-to-stride fluctuations of human walking. Hum. Mov. Sci. 26, 555–589. Hilfiker, R., Vaney, C., Gattlen, B., Meichtry, A., Deriaz, O., Lugon-Moulin, V., AnchisiBellwald, A.-M., Palaci, C., Foinant, D., Terrier, P., 2013. Local dynamic stability as a responsive index for the evaluation of rehabilitation effect on fall risk in patients with multiple sclerosis: a longitudinal study. BMC Res. Notes 6, 260. Hollman, J.H., Childs, K.B., McNeil, M.L., Mueller, A.C., Quilter, C.M., Youdas, J.W., 2010. Number of strides required for reliable measurements of pace, rhythm and variability parameters of gait during normal and dual task walking in older individuals. Gait Posture 32, 23–28. Kang, H.G., Dingwell, J.B., 2006. Intra-session reliability of local dynamic stability of walking. Gait Posture 24, 386–390. Kang, H.G., Dingwell, J.B., 2009. Dynamic stability of superior vs. inferior segments during walking in young and older adults. Gait Posture 30, 260–263. Kerr, G., Worringham, C., Cole, M., Lacherez, P., Wood, J., Silburn, P., 2010. Predictors of future falls in Parkinson disease. Neurology 75, 116–124. Lance, C.E., Butts, M.M., Michels, L.C., 2006. The sources of four commonly reported cutoff criteria what did they really say? Organizational Res. Methods 9, 202–220. Lockhart, T.E., Liu, J., 2008. Differentiating fall-prone and healthy adults using local dynamic stability. Ergonomics 51, 1860–1872. Manor, B., Wolenski, P., Guevaro, A., Li, L., 2009. Differential effects of plantar desensitization on locomotion dynamics. J Electromyogr. Kinesiol 19, e320–e328. McAndrew, P.M., Wilken, J.M., Dingwell, J.B., 2011. Dynamic stability of human walking in visually and mechanically destabilizing environments. J. Biomech. 44, 644–649. McAndrew Young, P.M., Dingwell, J.B., 2012. Voluntarily changing step length or step width affects dynamic stability of human walking. Gait Posture 35, 472–477.

80

F. Reynard, P. Terrier / Journal of Biomechanics 47 (2014) 74–80

McGraw, K.O., Wong, S., 1996. Forming inferences about some intraclass correlation coefficients. Psychol. Methods 1, 30. Moe-Nilssen, R., 1998. A new method for evaluating motor control in gait under real-life environmental conditions. Part 1: The instrument. Clin. Biomech. 13, 320–327. Perkins, D.O., Wyatt, R.J., Bartko, J.J., 2000. Penny-wise and pound-foolish: the impact of measurement error on sample size requirements in clinical trials. Biol. Psychiatry 47, 762–766. Ramnemark, A., Nilsson, M., Borssén, B., Gustafson, Y., 2000. Stroke, a major and increasing risk factor for femoral neck fracture. Stroke 31, 1572–1577. Roos, P.E., Dingwell, J.B., 2010. Influence of simulated neuromuscular noise on movement variability and fall risk in a 3D dynamic walking model. J. Biomech. 43, 2929–2935. Rosenstein, M.T., Collins, J.J., De Luca, C.J., 1993. A practical method for calculating largest Lyapunov exponents from small data sets. Phys. D Nonlinear Phenom. 65, 117–134. Sloot, L.H., van Schooten, K.S., Bruijn, S.M., Kingma, H., Pijnappels, M., van Dieen, J. H., 2011. Sensitivity of local dynamic stability of over-ground walking to balance impairment due to galvanic vestibular stimulation. Ann. Biomed. Eng. 39, 1563–1569. Spearman, C., 1910. Correlation calculated from faulty data. British J. Psychol. 19041920 (3), 271–295. Stergiou, N., Decker, L.M., 2011. Human movement variability, nonlinear dynamics, and pathology: is there a connection? Hum. Mov. Sci. 30, 869–888. Stevens, J., Corso, P., Finkelstein, E., Miller, T., 2006. The costs of fatal and non-fatal falls among older adults. Inj. Prev. 12, 290–295. Terrier, P., Dériaz, O., 2011. Kinematic variability, fractal dynamics and local dynamic stability of treadmill walking. J. Neuroeng. Rehabil. 8, 12. Terrier, P., Dériaz, O., 2013. Nonlinear dynamics of human locomotion: effects of rhythmic auditory cueing on local dynamic stability. Front. Physiol., 4.

Terrier, P., Luthi, F., Dériaz, O., 2013. Do orthopaedic shoes improve local dynamic stability of gait? An observational study in patients with chronic foot and ankle injuries. BMC Musculoskelet. Disord. 14, 94. Terrier, P., Reynard, F. To what extent does not wearing shoes affect the local dynamic stability of walking? Effect size and intrasession repeatability. J. Appl. Biomech. (in press), http://journals.humankinetics.com/jab-in-press/jab-in -press/to-what-extent-does-not-wearing-shoes-affect-the-local-dynamic-stability -of-walking-effect-size-and-intrasession-repeatability. Toebes, M.J., Hoozemans, M.J., Furrer, R., Dekker, J., van Dieen, J.H., 2012. Local dynamic stability and variability of gait are associated with fall history in elderly subjects. Gait Posture 36, 527–531. van Schooten, K.S., Rispens, S.M., Pijnappels, M., Daffertshofer, A., van Dieen, J.H., 2013. Assessing gait stability: the influence of state space reconstruction on inter- and intra-day reliability of local dynamic stability during over-ground walking. J. Biomech. 46, 137–141. van Schooten, K.S., Sloot, L.H., Bruijn, S.M., Kingma, H., Meijer, O.G., Pijnappels, M., van Dieen, J.H., 2011. Sensitivity of trunk variability and stability measures to balance impairments induced by galvanic vestibular stimulation during gait. Gait Posture 33, 656–660. van Uden, C.J., Besser, M.P., 2004. Test-retest reliability of temporal and spatial gait characteristics measured with an instrumented walkway system (GAITRite). BMC Musculoskelet. Disord. 5, 13. Verghese, J., Holtzer, R., Lipton, R.B., Wang, C., 2009. Quantitative gait markers and incident fall risk in older adults. J. Gerontol. A Biol. Sci. Med. Sci. 64, 896–901. Weir, J.P., 2005. Quantifying test-retest reliability using the intraclass correlation coefficient and the SEM. J. Strength Cond. Res. 19, 231–240. Yakhdani, H.R., Bafghi, H.A., Meijer, O.G., Bruijn, S.M., van den Dikkenberg, N., Stibbe, A.B., van Royen, B.J., van Dieen, J.H., 2010. Stability and variability of knee kinematics during gait in knee osteoarthritis before and after replacement surgery. Clin. Biomech. (Bristol, Avon) 25, 230–236.