Accepted Manuscript Title: The role of temporal speech cues in facilitating the fluency of adults who stutter Author: Jin Park Kenneth J. Logan PII: DOI: Reference:
S0094-730X(15)00057-1 http://dx.doi.org/doi:10.1016/j.jfludis.2015.07.001 JFD 5590
To appear in:
Journal of Fluency Disorders
Received date: Revised date: Accepted date:
11-7-2013 1-7-2015 22-7-2015
Please cite this article as: Park, J., and Logan, K. J.,The role of temporal speech cues in facilitating the fluency of adults who stutter, Journal of Fluency Disorders (2015), http://dx.doi.org/10.1016/j.jfludis.2015.07.001 This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
The Role of Temporal Cues
1
Ac ce
pt
ed
M
an
us
cr
ip t
Highlights Adults who stutter spoke more fluently in temporally-altered choral conditions than they did when speaking solo. Adults who stutter also spoke slower and exhibited more temporal entrainment with the choral signal than the adults who did not stutter. Adults who stutter seem to make greater use of choral speech signals than adults who do not stutter. Adults who stutter do not require accurate temporal models of phonetic targets in order to enhance fluent speech.
Page 1 of 50
The Role of Temporal Cues
2
The Role of Temporal Speech Cues in Facilitating the Fluency of Adults who Stutter Jin Park Kenneth J. Logan
ip t
University of Florida
Author Note:
cr
Jin Park, Department of Linguistics, Chungnam National University, Daejeon,
us
Korea; Kenneth J. Logan, Department of Speech, Language, and Hearing Sciences, University of Florida, Gainesville, FL, USA.
Ac ce
pt
ed
M
an
Correspondence concerning this manuscript should be addressed to Jin Park, Department of Linguistics, Chungnam National University, Daejeon, Korea, 305-764. Email:
[email protected] telephone: 82-42-821-6391
Page 2 of 50
The Role of Temporal Cues
3
Abstract Purpose: Adults who stutter speak more fluently during choral speech contexts than they do during solo speech contexts. The underlying mechanisms for this effect remain unclear, however. In this study, we examined the extent to which the choral speech effect depended
stutter followed choral signals more closely than typical speakers did.
ip t
on presentation of intact temporal speech cues. We also examined whether speakers who
cr
Method: 8 adults who stuttered and 8 adults who did not stutter read 60 sentences aloud
us
during a solo speaking condition and three choral speaking conditions (240 total sentences), two of which featured either temporally altered or indeterminate word duration patterns.
an
Effects of these manipulations on speech fluency, rate, and temporal entrainment with the choral speech signal were assessed.
M
Results: Adults who stutter spoke more fluently in all choral speaking conditions than they did when speaking solo. They also spoke slower and exhibited closer temporal entrainment
ed
with the choral signal during the mid- to late-stages of sentence production than the adults who did not stutter. Both groups entrained more closely with unaltered choral signals than
pt
they did with altered choral signals.
Ac ce
Conclusions: Findings suggest that adults who stutter make greater use of speech-related information in choral signals when talking than adults with typical fluency do. The presence of fluency facilitation during temporally altered choral speech and conversation babble suggests that temporal/gestural cueing alone cannot account for fluency facilitation in speakers who stutter. Other potential fluency enhancing mechanisms are discussed.
Page 3 of 50
The Role of Temporal Cues
4
The Role of Temporal Speech Cues in Facilitating the Fluency of Adults who Stutter 1.0. Introduction 1.1. Stuttering and speech timing Many researchers have suggested that stuttering is a speech timing/motor sequencing
ip t
disorder (e.g., Alm, 2004; Kent, 1984; MacKay & MacDonald, 1984; Packman, Code, & Onslow, 2007; Perkins, Kent, & Curlee, 1991; Smits-Bandstra & De Nil, 2007; Van Riper,
cr
1982; Zimmerman, 1980). Within this relatively broad theoretical framework, the disorder
us
has been examined in various ways: (a) as a disruption of the temporal coordination of the phonatory, respiratory, and articulatory systems that underlie speech production (e.g.,
an
Conture, Colton, & Gleason, 1988; Max & Gracco, 2005; Van Riper, 1982; Zimmerman, 1980), (b) as a breakdown in the temporal alignment of segmental and prosodic
M
representations prior to speech initiation (Perkins et al., 1991), (c) as a temporal dyssynchrony between language and motor planning and its execution (Au-Yeung, Howell,
ed
& Pilgrim, 1998; Howell, 2004; Howell, Au-Yeung, & Sackin, 1999), (d) as an impairment in the capacity to generate temporal programs that underlie the sequential movements associated
pt
with spoken language (Kent, 1984; Max & Yudman, 2003; Smits-Bandstra & De Nil, 2007)
Ac ce
and (e) as an impairment in the ability to execute speech motor plans once they have been generated (e.g., Kleinow & Smith, 2000; Packman, et al., 2007; Packman, Onslow, Richard, & van Doorn, 1996; Smits-Bandstra & De Nil, 2007). Despite the relatively large amount of empirical research and theoretical conjecture on
matters such as these, the exact role of temporal factors in the speech fluency of speakers who stutter remains unclear. Consequently, one purpose of the present study was to investigate the role of temporal cueing as a means of fluency facilitation with adults who stutter. This was accomplished by examining changes in speech fluency during different types of choral speaking conditions.
Page 4 of 50
The Role of Temporal Cues
5
1.2. Choral speech and fluency enhancement It has been well demonstrated that speakers who stutter exhibit marked improvement in fluency when speaking chorally with others (Andrews, Howie, Dozsa, & Guitar, 1982; Barber, 1939; Bloodstein, 1950; Freeman & Armson, 1998; Guntupalli Kalinowski,
ip t
Saltuklaroglu, & Nanjundeswaran, 2005; Howell, & Powell, 1987; Ingham, Bothe, Jang, Yates, Cotton, & Seybold , 2009; Ingham & Packman, 1979; Ingham, Warner, Byrd, &
cr
Cotton, 2006; Johnson & Rosen, 1937; Kiefte & Armson, 2008; Rami, Kalinowski, Rastatter,
us
Holbert, & Allen, 2005). Early attempts at explaining the fluency enhancing effects associated with choral speech occurred within a psychological framework. For example,
an
Barber (1939) proposed that choral reading is a novel condition that distracts speakers who stutter from their fluency difficulties, thus allowing them to talk more smoothly and with less
M
effort. The distraction hypothesis subsequently was questioned by a number of researchers (e.g., Fransella, 1967; Fransella & Beech, 1965; Stuart, 1999; Wingate, 1969) as being overly
ed
vague and hard to verify empirically, and findings from several studies have not supported a strong form of the distraction hypothesis (see, for example, Arends, Povel, & Kolk, 1988;
pt
Fransella, 1967; Fransella & Beech, 1965; Mallard & Webb, 1980; Stuart, 1999). For
Ac ce
instance, it has been shown that stuttering is not significantly reduced in certain dual task conditions that presumably involve distraction. These include discerning a pattern in an arrhythmic beat while reading aloud (Fransella & Beech, 1965), writing numbers while reading aloud (Fransella, 1967), and turning a light on and off while reading aloud (Mallard & Webb, 1980).
A second possible explanation for improved fluency under choral reading is that choral speaking leads speakers who stutter to focus on the action of speaking instead of focusing on aspects of communication. The attention-based hypothesis is essentially the opposite of the distraction hypothesis that was presented by Barber (1939) and others. That is,
Page 5 of 50
The Role of Temporal Cues
6
it may be that instead of taking one’s mind off speaking, as would be the case with the distraction hypothesis, speakers who stutter, instead, focus their attention more often and/or more fully on speaking. Several neuro-imaging studies (e.g., Boberg, Yeudall, Schopflocher, & Bo-Lassen, 1983; De Nil, Kroll, Lafaille, & Houle, 2003; Kroll, De Nil, Kapur, & Houle,
ip t
1997) have shown that speakers who stutter exhibit post-treatment increases in left hemisphere activation in the auditory cortex relative to pre-treatment baseline levels. In such
cr
studies, the increased left hemisphere activity is associated with the use of newly learned
us
methods of controlled fluency such as regulated speech rate, but not with increased activation of brain regions associated with attention (c.f., De Nil, Kroll, Lafaille, & Houle, 2003;
an
Neumann et al., 2003). In some studies with speakers who stutter (Neumann et al. 2003; Neumann et al., 2005), cortical regions associated with attention and error monitoring (e.g.,
M
the anterior cingulate cortex) seem to be “de-activated” during both pre- and post-treatment speech, while in others (e.g., De Nil et al. 2000), areas that were overly active in a pre-
ed
treatment context show decreased activation following treatment. A third possibility is that fluency-enhancing conditions like choral speech help
pt
speakers who stutter formulate key components of spoken messages and, in doing so, help
Ac ce
speakers synchronize the various neural regions that are necessary for fluent speech (see Neumann et al., 2003 for additional discussion). For example, Kent (1984) proposed a motor modeling explanation for the choral speech effect. Essentially, he suggested that speakers who stutter monitor the choral speech signal and, in doing so, are able to generate the temporal patterns that are necessary for fluent speech. Kent argued that the mechanisms underlying choral speech are similar to those observed in metronome-paced speech (i.e., fluency inducement in response to an external rhythmic signal). In this view, speakers who stutter might use another speaker’s voice as an external model from which they extract temporal information about ongoing speech. It was proposed that speakers then use the
Page 6 of 50
The Role of Temporal Cues
7
extracted information to generate utterances that are more fluent than they would be without such information. In a similar vein, Saltuklaroglu, Kalinowski, and Guntupalli (2004) proposed that choral speech signals activate a speaker’s mirror neuron system. In this view, the
ip t
accompanying choral speech signal provides speakers who stutter with cues about ongoing articulatory gestures that are necessary to produce speech smoothly and with relatively little
cr
effort. Following the Revised Motor Theory of Speech Perception (Liberman & Mattingly,
us
1985; Liberman & Whalen, 2000), Saltuklaroglu et al. argued that articulatory gestures are invariant objects of both speech perception and speech production, and that speech
an
production and speech perception mainly involve the encoding or decoding of shared articulatory gestures. According to Saltuklaroglu et al., another speaker’s voice provides “an
M
external matrix of speech gestures that is rich in redundant speech cues” (p. 342). In this view, choral speech models provide cues that contain information about the spatial and temporal
ed
properties of the gestural patterns that constitute spoken utterances. The speaker then matches these external cues to the developing speech production plan, which facilitates smooth
pt
production of the intended speech utterances. Under a strong form of this hypothesis, one
Ac ce
would expect the facilitative effects of choral speech to diminish when the quality of the choral signal is either altered or degraded. 1.3. Manipulating choral speech signals The effects of choral speech signal manipulation on fluency enhancement have been
examined in a range of studies (e.g., Barber, 1939; Guntupalli et al., 2005; Guntupalli, Nanjundeswaran, Kalinowski, & Dayalu, 2011; Hudock, Dayalu, Saltuklaroglu, Stuart, Zhang, & Kalinowski, 2011; Kiefte & Armson, 2008; Rami et al., 2005). Research findings suggest that fluency facilitation occurs even when the choral speech signal is mismatched with what the speaker is saying (see Barber, 1939; Cherry & Sayers, 1956, for early
Page 7 of 50
The Role of Temporal Cues
8
explorations of this issue). Further, fluency enhancement also occurs when speakers who stutter are provided with only visual images of targeted articulatory movements, and fluency is enhanced further when the visual images lag the speaker’s articulation in a manner analogous to delayed auditory feedback (Hudock et al., 2011).
ip t
Rami et al. (2005) manipulated the acoustic characteristics of choral speech signals to investigate which components of the choral signal are most relevant to the attainment of
cr
fluent speech. Using a low pass filtering technique, Rami et al. created five different versions
us
of a choral speech signal. Speakers who stutter then were asked to read several printed passages while listening to the various choral signals. Results showed a marked reduction of
an
stuttering relative to a solo reading condition during the following four conditions: typical choral speech, choral speech that was filtered at 500Hz, choral speech that was filtered at
M
1000Hz, and whispered choral speech. In contrast, the participants showed no significant reduction in stuttering frequency during choral speech that was filtered at 100 Hz. Based on
ed
the findings, Rami et al. proposed that acoustic cues arising from articulatory events in the vocal tract are necessary for enhancing fluency in speakers who stutter during choral reading.
pt
Guntupalli et al. (2005) compressed and expanded choral speech signals to examine
Ac ce
the effects of temporal cuing on stuttering frequency. Participants who stuttered read passages aloud while listening to choral speech signals in three conditions: real time, temporally compressed speech (-20%, -40%, -60%, -80% of original duration), and temporally expanded speech (+40%, +80% of original duration). Guntupalli et al. hypothesized that choral speech aids in the recovery of gestural (i.e., articulatory) information that pertains to a target utterance and that recovery of gestural information would be affected by the rate at which gestural cues are presented to speaker. Results were generally consistent with this hypothesis, as fluency facilitation was greater when the choral signals were expanded, presented in real time, or compressed by no more than 20% than it was when
Page 8 of 50
The Role of Temporal Cues
9
choral signals were compressed by 40%, 60%, or 80%. Taken together, findings from studies such as these suggest that choral speech is generally a reliable and robust means of enhancing fluency among people who stutter; however, its effectiveness depends to some extent upon the informational properties of choral signal itself.
ip t
1.4. Purpose and rationale The present study extended the existing line of research by examining whether
cr
speakers who stutter depend on an accurate or faithful rendering of temporal information
us
within a choral signal in order to benefit from it. In contrast to the study by Guntupalli et al. (2005), where the timing structure of the choral signal was temporally compressed but still
an
proportionately congruent with typical speech, the durational characteristics of words within some of the choral signals used in the present study were altered such that some words had a
M
longer-than-normal duration relative to surrounding words and other words had a shorterthan-normal duration relative to surrounding words. Consequently, temporal information
ed
within the choral signal was, to some extent, dissociated from the signal’s syntactic structure and, consequently, its expected prosodic structure. It was thought that this manipulation
pt
would reduce moment-to-moment predictability about the temporal structure of an utterance
Ac ce
and might lead to less fluency facilitation for speakers who stutter in comparison to conditions in which speakers were exposed to unaltered, temporally-faithful choral signals (see Barber, 1939, and Hudock et al., 2011 for discussion of this issue). The present study also incorporated multi-speaker conversational babble as another type of temporally unpredictable choral signal. The main rationale for using the conversational babble is that it allows for the concurrent presentation of a speech-based signal that lacks coherent speechrelated information, particularly temporal cues that relate to syllable or word durational patterns. A second purpose of the present study was to examine the extent to which the
Page 9 of 50
The Role of Temporal Cues
10
utterances of speakers who stutter are temporally entrained to choral speech signals and whether the extent of that entrainment differs from what is observed in speakers who do not stutter. This issue has not been examined in past studies of fluency facilitation under choral speech in such a way that allows for determination of how closely the speaker is following
ip t
the choral signal. If speakers who stutter are indeed using choral speech signals to extract critical information about speech timing and/or articulatory configurations (Kent, 1984;
cr
Saltuklaroglu et al., 2004), speakers who stutter might be expected to show more precise
us
entrainment to a choral speech signal when compared to speakers who do not stutter. If speakers who stutter are extracting speech production cues from choral signals, the degree of
an
entrainment that they exhibit should be greatest in conditions where the temporal structure of the auditory signals closely matches the structure of the utterances speakers are attempting to
M
say.
The primary research questions that were addressed in the present study were these:
ed
(a) does the accuracy or availability of temporal cues in a choral speech signal affect the extent to which the signal facilitates the fluency of a speaker who stutters?, and (b) do
pt
participants who stutter show more entrainment to choral speech signals than speakers who
Ac ce
do not stutter? It was predicted that adults who stutter would speak less fluently when talking with temporally altered choral signals than they would when speaking with unaltered choral signals, and that adults who stutter would exhibit more temporal entrainment with the choral speech than adults who do not stutter. 2.0. Method
2.1. Participants Eight adults who stuttered (seven males, one female, M = 22 years, SD = 12.27 years) and eight adults who did not stutter (six males, two females, M = 23 years, SD = 13.75 years) participated in this study. Each of the participants was at least 18 years of age, spoke
Page 10 of 50
The Role of Temporal Cues
11
American English with native competence, and reported having had no exposure to altered auditory feedback (e.g., delayed auditory feedback) during the two years prior to data collection for the present study. In addition, none of the participants reported having participated in fluency interventions that involved altered auditory feedback.
ip t
None of the participants had any known or reported concomitant impairments (other than stuttering for the adults who stutter) that might have influenced their speech fluency
cr
performance during the study. All participants attained a standard score of 85 or higher on the
us
Peabody Picture Vocabulary Test-III (PPVT-III, Dunn & Dunn, 1997) and obtained a standard score of 85 or higher on the reading subtests associated with the Wide Range
an
Achievement Test-3 (WRAT-3; Jastak & Jastak, 1984). In addition, all participants passed a hearing screening. In the stuttering group, all of the participants reported having either no
M
history of formal fluency treatment or no recent history of formal fluency treatment (i.e., within the three years preceding data collection) and, at the time of data collection, each of
ed
the participants in the stuttering group reported feeling dissatisfied with their ability to speak fluently. Stuttering severity was assessed for the participants who stuttered using the
pt
Stuttering Severity Instrument-4 (Riley, 2009). Based on this analysis, it was determined that
Ac ce
two participants exhibited mild stuttering, five exhibited moderate stuttering, and one exhibited severe stuttering. 2.2. Materials
2.2.1. Sentence stimuli. The authors created 60 sentences for use in the experiment.
The 60 sentences featured a variety of syntactic and prosodic patterns but all were of similar length (i.e., 10 to 11 words, 17 to 21 syllables). The sentences were used to elicit speech samples from the participants under various auditory conditions (see the Appendix for examples of sentences). Within the set of 60 sentences was a subset of 20 sentences, each of which contained 10 words, 18 to 20 syllables, and similar syntactic structure. The phrase
Page 11 of 50
The Role of Temporal Cues
12
structures within these 20 sentences were similar as well, with each consisting of one or more phrases that contained a function word (e.g., the, and, by) followed by a multi-syllable content word. Thus, the prosodic patterns within the 20 sentences were highly similar as well. Each of the content words within the 20 sentences began with a consonant and contained at
ip t
least two syllables. Multi-syllable words have been shown to have a higher probability of featuring overt symptoms of stuttering than monosyllabic words (Howell et al., 1999). The 20
cr
structurally matched sentences were designed primarily for use with the entrainment analysis,
us
which is described below.
Stimulus materials were created by having an American-English-speaking adult
an
female with normal speech fluency read each of the 60 sentences aloud into a digital voice recorder (Sony, ICD-P320) while seated in a sound attenuated booth. A computer-based
M
software program (WavePad, version 4.26) was used for capturing and editing the woman’s speech, and all of the audio files were stored in WAV format. These digital recordings were
ed
used to create the choral speech stimuli used in the experiment. 2.2.2. Typical and altered choral speech signals. Two types of choral stimuli were
pt
created from the sentence recordings. The first type, typical choral speech (TCS), consisted
Ac ce
of the recorded sentences that the adult female speaker originally had created. The second type, altered choral speech (ACS) was created using the PRAAT software program (Boersma & Weenink, version 4.3.01), which made it possible to alter the temporal properties of words within the original recordings. With respect to alteration of temporal patterns, each of the words contained in each of the experimental sentences was analyzed individually and its original duration was manipulated. Stress patterns in multi-word utterances are determined largely by the speaker’s communicative goal and the respective grammatical roles that words in the utterance assume in relation to that goal (Wingate, 1984). For instance, pitch-accented words (e.g., content words) tend to have a longer duration than unaccented words (e.g.,
Page 12 of 50
The Role of Temporal Cues
13
function words). In connected speech, word duration also has been shown to depend upon where a word appears in an utterance (Ferreira, 1993). For example, a word generally will have a longer duration when it occurs in a phrase- or clause-final position than when it occurs in a non-final position.
ip t
For each of the experimental sentences, manipulation of word duration was accomplished by using the acoustic signal to identify individual words within the sentences,
cr
and then elongating the duration of the function words (which normally are relatively short)
us
and shortening the duration of the content words (which normally are relatively long). Pause durations between words and phrases were not manipulated. Word duration was manipulated
an
such that function words and content words had similar durations. That is, the mean length of function words was 430 ms and the mean length of content words was 410 ms. The
M
percentage of word expansion (or elongation) with respect to the baseline was 6% and the percentage of compression (or shortening) was 7% with respect to the baseline. The overall
ed
duration of each of the temporally altered sentences, however, remained identical to the duration of the original sentences (i.e., M duration = 4.30 s). Consequently, the speaking rate
pt
(i.e., syllables per second) of the sentences also remained the same. The overall pitch contour
Ac ce
(i.e., intonation) of the temporally altered sentences was not intentionally altered. Thus, it remained relatively consistent with that of the original unaltered versions. The temporal manipulations did not appear to significantly affect sentence
intelligibility. Prior to data collection for the study, the first author presented 20 of the temporally-altered versions of the sentences to five American-English-speaking adults who were unfamiliar with the stimuli and asked them to identify each of the sentences, which each of them did with 100% accuracy. During the course of data collection for the study, participants were regularly reminded to report any difficulties in hearing or understanding the choral sentences, but no such difficulties were reported by any participants.
Page 13 of 50
The Role of Temporal Cues
14
The choral stimuli were finalized by pasting a recorded carrier phrase (The sentence you have to read is…) into the sound files associated with the sentences. The same woman who had read the stimulus sentences also recorded the carrier phrase, and it was positioned within the sound file such that a 500 ms pause was created between end of the carrier phrase
ip t
and the start of the sentence. To minimize the potential for order effects, a unique presentation order for each participant was created in the sentence sets. To minimize the
cr
potential for carryover effects within a sentence set, the 20 sentences that were matched for
us
syntax and prosody were interspersed among the 40 other sentences such that none of the 20 matched sentences was presented consecutively to participants.
an
2.2.3 Multi-speaker babble as a choral signal. Multi-speaker babble noise, which was developed by the International Collegium of Rehabilitative Audiology (ICRA; Dreschler,
M
Vershuure, Ludvigsen, & Westermann, 2001), was used as another type of choral signal. The main rationale for using the multi-speaker babble signal was that it allowed for the concurrent
ed
presentation of a speech-based signal that lacked coherent prosodic (i.e., temporal or durational) and segmental cues. In other words, even though a listener can easily recognize
pt
conversational babble as human speech, a listener is not readily able to extract precise
Ac ce
speech-based information such as temporal cues that relate to word durational patterns from it. The babble signal featured the combination of 6 speakers’ voices. The length of the
original babble recording was one hour and from that, 60 different excerpts, each with a length equal to that of the choral sentence recordings, were selected for presentation as another type of choral signal. To provide a way for speakers to match response initiation with the choral signal, each excerpt of multi-speaker babble noise was preceded by the carrier phrase (The sentence you have to read is…), with the 500 ms pause interposed between the end of the carrier phrase and the start of the babble signal. 2.3. Procedure
Page 14 of 50
The Role of Temporal Cues
15
Each of the participants was seen individually for two sessions. During the first session, participants provided background information by completing a questionnaire and an oral interview, both of which included questions related to basic demographic data, vision and hearing status, medical and educational histories, and communication skills. At this time,
ip t
participants also completed the PPVT-3 and WRAT-3 tests and produced a conversational speech sample. Regarding the latter, participants were asked to talk for at least 3 minutes on
cr
various topics (e.g., a recent movie or television program they had seen, the types of hobbies
us
they have, things that they do on a typical week day, a favorite vacation). The conversational sample was used to verify the diagnosis of stuttering and to estimate stuttering severity.
an
After completing the various preliminary tasks, participants were invited to participate in the second session, one week later, during which the experimental task was presented. The
M
experimental task consisted of reading aloud the set of 60 sentences under the four different auditory conditions. Presentation order was randomized for the four conditions across
ed
participants to control for a possible order effect. Participants were seated in a comfortable chair in front of a laptop computer (Dell Inspiron 2200) in a quiet room. They were asked to
pt
read each of the sentences aloud. The sentences were presented individually on the computer
Ac ce
screen in 14-point Arial Black font.
The sentence reading task was divided into four blocks of 60 sentences (i.e., 240 total
sentences), during which a participant heard either his or her own voice (i.e., Solo condition), or his or her own voice together with one of the following: typical choral speech (i.e., TCS condition), temporally-altered choral speech (i.e., ACS condition), or conversational babble (i.e., BAB condition). Participants heard all of the auditory signals through earphones (Logitech Premium Notebook Headset) at a comfortable listening level (i.e., less than 75 dB SPL, after Rami et al., 2005). All speech responses obtained from the participants were collected with a microphone (Shure, SM 48), which was placed approximately 20 cm below
Page 15 of 50
The Role of Temporal Cues
16
the mouth. The microphone output was sent to a digital audio recorder (Sony DTC-ZA5ES). A digital video camera (Canon, NTSC 2R65MC) was used to record the participants as they completed the sentence production tasks. A backup audio recording of participants’ responses was made using a portable digital recorder (Sony, ICD-P320).
ip t
Prior to commencing the experiment, a research assistant described the tasks to the participants, and then presented them with several practice items. After a participant
cr
successfully responded to the practice items, he or she was asked to commence the main part
us
of the experiment. Participants were asked to wait for completion of the introductory carrier phrase (The sentence you have to read is…) before saying the sentences. Participants were
an
told to say the sentences at whatever pace was most comfortable for them, and they were instructed to speak as naturally as possible and to refrain from using any self-devised
M
techniques (e.g., finger tapping) to reduce or inhibit stuttering-related disfluency. Following completion of each block of experimental sentences, participants were asked to remove the
ed
headphones and read one of several reading passages aloud under normal auditory feedback. The latter activity was performed to minimize carryover effects that might occur across
pt
consecutive experimental blocks. The reading passages dealt with an assortment of topics that
Ac ce
were unrelated to the experimental sentences. They contained an average of 255 words and were written at a middle school level. 2.4. Data analysis
2.4.1. Stuttering frequency. After the speech samples were collected, the first author
replayed the audio/video recordings to analyze each of the syllables within each of the sentences for the presence of stuttering-related behavior such as part-word repetition, audible or inaudible prolongation, and nonverbal behavior such as facial grimacing that appeared to be symptomatic of stuttered speech (see Conture & Kelly, 1991). The number of stuttered syllables per sentence was tallied and used to compute the percent of syllables stuttered
Page 16 of 50
The Role of Temporal Cues
17
(%SS) in the task. 2.4.2. Entrainment with choral signal. The temporal alignment (i.e., entrainment) of selected speech sound segments in the choral model version of a sentence and a participant’s production of the sentence were compared by analyzing 10 randomly selected fluent
ip t
responses from the subset of 20 sentences that featured highly similar syntactic and prosodic structure. The analysis was limited to the TCS and ACS conditions (10 sentences per
cr
condition), because these conditions allowed for a comparison of altered versus unaltered
us
temporal cues in the choral signal. Basic features of the analysis are schematized in Figure 1. The entrainment analysis was conducted by comparing the onset times for five specific
an
reference phones (one sentence-initial phone, three sentence-medial phones, one sentencefinal phone) in a participant’s randomly selected fluent sentences to the onset times for the
M
same five phones in the corresponding choral sentences. Phone onset times were computed by subtracting the time code associated with the end of the standard carrier phrase that
ed
preceded each sentence from the time code associated with the onsets of each of the five reference phones in the sentences. This approach allowed for tracking of a participant’s
pt
entrainment over the course of sentence production. The three sentence-medial phones were
Ac ce
located in standard positions across the syntactically similar sentences (e.g., the onset of the second noun in the subject noun phrase) and each of the sentence-medial words began with a stop consonant to facilitate ease of onset phone identification and accuracy of entrainment value measurement during the acoustic analysis. Temporal measurements for the entrainment analysis were accomplished by recording
the choral speech signals and the participants’ corresponding responses on separate channels in a digital audio recorder through the use of a line-splitter. Differences between the time codes for the choral and participants’ speech signals at each reference point were compared between participant groups to determine whether the speakers who stutter differed from
Page 17 of 50
The Role of Temporal Cues
18
speakers with typical fluency in the extent to which their responses were aligned temporally with the choral models. 2.4.3. Analysis of speech rate. Speech rate was computed for participant responses in each of the four auditory conditions using procedures described by Logan, Byrd, Mazzocchi,
ip t
& Gillam (2011). That is, the number of syllables spoken in the target sentences was summed and then divided by the total amount of time needed to produce each of the target sentences.
cr
This yielded the number of syllables spoken per second. The rate computations were based
us
on the same 20 syntactically and prosodically matched sentences described above in section 2.2.1. The speech rate analysis was conducted to obtain additional information about both the
an
nature of fluency enhancement in the speakers who stutter and the overall entrainment patterns of the nonstuttering and stuttering groups.
M
2.5. Statistical analysis
Because of the relatively small sample size, the distributional pattern of the
ed
participants’ data, and resulting concerns about meeting the statistical assumptions of parametric statistical tests (Conover, 1980), nonparametric statistical methods were used for
pt
analyzing the stuttering frequency data. Two different nonparametric statistics were used.
Ac ce
Within-subject differences across the auditory conditions for speakers who stutter were analyzed using the Friedman test, a nonparametric statistic that utilizes rank-based procedures. To further examine whether there were significant differences in stuttering frequency among the auditory conditions, planned follow-up comparisons were performed using the paired-sample Wilcoxon signed rank test. Mixed-model analysis of variance (ANOVA) was performed to see if there were significant between- or within-subject differences for analyses related to speech entrainment. To control for Type I error rate across these comparisons, the overall alpha level (α = .05) was divided by the number of comparisons in each family of tests (Marasculio &
Page 18 of 50
The Role of Temporal Cues
19
McSweeney, 1977). Effect sizes were reported as follows: partial eta2 (for ANOVA), Cohen’s d (for t-tests), and r (for Wilcoxon signed ranks tests). Guidelines presented by Cohen (1988) and Leech, Barrett, and Morgan (2008) were used to interpret the effect sizes. 2.6. Intra-and Inter-Judge Reliability
ip t
The intra- and inter-judge reliability of stuttering frequency measurement was determined by re-measuring responses from two randomly selected participants who stutter
cr
during two experimental conditions (i.e., Solo, TCS). The intra-judge reliability was assessed
us
by having the first author compute the stuttering frequencies for the participants’ data, and the inter-judge reliability assessed by having the second author re-analyze the same
an
participant data that previously were analyzed by the first author. The average mean difference score across the sentences for intra-judge reliability was -0.02 stuttered syllables
M
per 100 syllables (SD = 0.49). The average mean difference score across the sentences for inter-judge reliability was -0.004 stuttered syllables per 100 syllables (SD = 0.55).
ed
The intra- and inter-judge reliability for the entrainment measurements was determined by re-analyzing data from the TCS and ACS conditions of four randomly selected
pt
participants (two from the stuttering group and two from the nonstuttering group). There were
Ac ce
no significant differences in either the intra-judge measurements, t(199) = 1.051, p = .294, or the inter-judge measurements, t(199) = .600, p = .549. In addition, strong positive Pearson r correlations were found for both the intra-judge measurements, r = .978; p < .001, and interjudge measurements, r = .945; p < .001. 3.0. Results
3.1. Stuttering frequency across the auditory conditions Figure 2 shows the mean frequency of stuttering in the four auditory conditions for the participants who stuttered. The Friedman test showed a significant within-subjects effect for auditory condition (2(3) = 17.625; p = .001), meaning that the frequency of stuttered
Page 19 of 50
The Role of Temporal Cues
20
syllables differed significantly across the four auditory conditions. Results of follow-up testing (adjusted α = .05/3 = .017) for planned comparisons revealed that the participants who stuttered produced less stuttering during the TCS condition than during the Solo condition (Z = -2.524; p = .012, r = -.63). There was no significant difference (Z = -2.060; p = .039, r = -
ip t
.52) in the frequency of stuttering frequency between the TCS and ACS conditions, which suggested that alterations in the accuracy of temporal information did not markedly reduce
cr
the effect of choral auditory signals upon fluency enhancement for adults who stutter.
us
Similarly, no significant differences were found in the frequency of stutter-like disfluencies in the TCS versus BAB comparison (Z = -0.647; p = .518, r = -.16), suggesting that
necessary for fluency enhancement.
M
3.2. Speech rate across auditory conditions
an
presentation of segmental and temporal information to speakers was sufficient, but not
Figure 3 shows the mean speech rate values across the four auditory conditions for the
ed
two participant groups. As indicated in the figure, the sample means for the participants who stuttered appeared to be slower than those for the participants who did not stutter. Results of
pt
mixed-model ANOVA indicated a significant main effect for the between-subjects factor,
Ac ce
fluency group, F(1, 14) = 21.253, p < .001, partial eta2 = .606. Thus, the speakers who stuttered had slower speech rates than speakers who do not stutter. There was, however, no significant effect for the within-subjects factor, auditory condition, F(1.631, 22.837) = 1.870, p =.182, partial eta2 = .118. Further, there was no significant fluency group x auditory condition interaction, F(1.631, 22.837) = 3.153, p = .071, partial eta2 = .184. Thus, participants who stutter did not show marked differences in speech rates across the auditory feedback conditions. 3.3. Entrainment with choral signal Results of mixed-model ANOVA (fluency group x auditory condition, i.e., TCS vs.
Page 20 of 50
The Role of Temporal Cues
21
ACS, x entrainment point) indicated significant main effects for the between-subjects factor, fluency group, F(1,14) = 5.047, p =.041, partial eta2 = .265, and the within-subjects factor, entrainment point, F(2.857, 39.991) = 65.092, p< .001, partial eta2 = .823. However, no significant main effect was found for a second within-subjects factor, auditory condition, F(1,
ip t
14) = 1.456, p = .248, partial eta2 = .094. A significant effect was also found for the fluency group x entrainment point interaction, F(2.857, 39.991) = 4.713, p = .013, partial eta2 = .230,
cr
but no significant effect was found for the fluency group x auditory condition interaction, F(1,
us
14) = .191, p = .579, partial eta2 = .013. A significant effect was found for the entrainment point x fluency group x auditory condition interaction, F(1.290, 18.063) = 7.901, p = .008,
an
partial eta2 = .361.
Because of the presence of the three-way interaction among entrainment point,
M
fluency group, and auditory condition, two-way ANOVA was performed to examine patterns of speaker entrainment across the five reference points. The results for this portion of the
ed
analysis are summarized in Table 1 and Figure 4. At reference points one and two, results of the ANOVA indicated no significant main effects for the between-subjects factor, fluency
pt
group. Thus, there was no significant difference between groups in entrainment during the
Ac ce
early portions of sentence production between groups. There were, however, significant differences between fluency groups for each of the later entrainment points. That is, at reference points three, four, and five, a significant main effect was found for the betweensubject factor, fluency group (see Table 1). Thus, there was a significant difference between fluency groups such that speakers who stuttered exhibited closer entrainment with the choral signals in the mid- to late-stages of choral sentence production than the typically fluent speakers did. Furthermore, significant auditory condition effects were found at reference points two, three, four, and five (see Table 1). In this case, the results indicated that, for both participant groups, more temporal entrainment was found during TCS condition than during
Page 21 of 50
The Role of Temporal Cues
22
the ACS condition. As shown in Table 1, no significant interaction was observed between fluency group and auditory condition across the five reference points; thus, both groups appeared to entrain more closely with typical choral signals than they did with atypical choral signals.
ip t
3.3. Relationships between fluency improvement and entrainment To obtain insight into whether those who showed the most fluent improvement in
cr
response to choral speech models were those who entrained most closely to choral speech
us
models, an additional correlation analysis was conducted. The amount of reduction in stuttering frequency was computed for participants who stuttered by subtracting percentage
an
of syllables stuttered (%SS) during the TCS condition from percentage of syllables stuttered (%SS) during the Solo condition and then dividing the result by the percentage of syllables
M
stuttered (%SS) during the Solo condition and multiplying by 100. The association between this number (amount of reduction in stuttering frequency) and the average of the five
ed
entrainment-point values was examined, with results showing a significant, but modest
4.0. Discussion
pt
positive correlation (rho = .355, p < .05) between the two variables.
Ac ce
4.1 Fluency enhancement: Patterns and possible mechanism As expected, the participants who stuttered spoke less fluently under normal auditory
feedback (i.e., solo speech) than they did when talking along with another speaker’s voice. The fluency enhancing effect of choral speech observed in the present study replicates findings reported in previous research (e.g., Andrews et al., 1982; Guntupalli et al., 2005; Howell, & Powell, 1987; Ingham & Packman, 1979; Kiefte & Armson, 2008; Rami et al., 2005). In the present study, the fluency enhancing effect of choral speech also was observed when temporal information in the choral speech signal was altered substantially to create a mismatch with normal speech prosody (i.e., the TCS condition). The latter finding is broadly
Page 22 of 50
The Role of Temporal Cues
23
consistent with results from studies that have incorporated filtering or compression of choral signals (e.g., Guntupalli et al., 2005; Rami et al., 2005). It also is consistent with findings from Barber (1939), wherein fluency enhancement occurred when speakers who stuttered heard choral signals that were mismatched in content with the passage they read aloud. In the
ip t
present study, evidence of fluency enhancement also was present in the conversational babble condition, a finding which is broadly consistent with results from other studies in which
us
white noise (Andrews et al., 1982; Brayton & Conture, 1978).
cr
enhancement in the speech fluency of speakers who stutter has occurred under exposure to
As discussed in section 1.1, stuttering has been modeled by some as a symptom of a
an
speech-timing deficit (Alm, 2004; Hickock, Houde, & Rong, 2011; Kalinowski, Saltuklaroglu, Guntupalli, & Stuart, 2004; Kent, 1984; Packman, et al., 2007; Van Riper,
M
1982), and it has been proposed that the act of monitoring another person’s speech, as in a choral speaking condition, helps speakers who stutter generate the temporal patterns and/or
ed
articulatory gestures that are necessary for fluent speech. Overall, the findings from the present study do not support a strong view of either the temporal modeling or the articulatory
pt
gesture modeling hypotheses, as fluency enhancement was observed during the temporally
Ac ce
altered choral speech condition, the conversational babble condition, and the typical choral speech condition. The enhancement of fluency during the altered choral speech condition – wherein speech timing and articulatory cues were not precisely matched to lexical information – suggests that if the participants who stuttered did use the choral signal as an informational source for speech production in that context, they did so in a limited or general way (e.g., by utilizing information about the duration of prominent syllables). The enhancement of fluency in the conversational babble condition – wherein speech timing and articulatory cues were unavailable – suggests that fluency facilitation can occur via other mechanisms, as well.
Page 23 of 50
The Role of Temporal Cues
24
The preceding interpretation of the results is generally consistent with that of other researchers (e.g., Howell, 2007; Howell, Powell, & Kahn, 1983; Howell, & Sackin, 2002; Max, Guenther, Gracco, Ghosh, & Wallace, 2004; Stager, Jeffries, & Braun, 2003), who have noted that speakers who stutter can attain improved fluency in conditions such as syllable-
ip t
timed speech, metronomic pacing, and exposure to white noise, wherein externally based segmental and suprasegmental cues are absent. On this basis, these researchers have proposed
cr
that fluency enhancing conditions such as choral speech, syllable-timed speech, and white
us
noise facilitate fluency in speakers who stutters via a general, common mechanism that enhances multimodal sensorimotor integration, particularly in neural systems related to
an
auditory and speech motor system processing. In this view, a speaker’s extraction of information from choral speech signals would be one of possibly several ways to enhance
M
sensorimotor functioning. In choral speech, speech-related information is readily available, flooding the auditory system with external speech input that presumably facilitates a
ed
speaker’s ability to match the auditory targets in a speech motor plan with corresponding somatosensory targets and, in doing so, allows the speaker to compensate for a deficit in the
pt
sensorimotor integration circuit (Hickock et al., 2011).
Ac ce
As shown in a number of brain imaging studies (e.g., Bhatnagar & Buckingham, 2010; Brown, Ingham, Ingham, Laird, & Fox, 2005; De Nil, Kroll, Kapur, & Houle, 2000; Fox, Ingham, Ingham, Hirsch, Down, & Martin, 1996; Fox, Ingham, Ingham, Zamarripa, Xiong, & Lancaster, 2000), speakers who stutter tend to exhibit significant underactivation of the auditory cortex during speech production. The auditory association areas (i.e., the anterior middle temporal gyrus, anterior superior temporal sulcus) are activated by human voice (Belin, Zatorre, Lafaille, Ahad, & Pike, 2000) and intelligible speech signals (Scott, Blank, Rosen, & Wise, 2000), including recordings of one’s own speech (Curio, Neuloh, Numminen, Jousmäki, & Haki, 2000; Houde, Nagarajan, Sekihara, & Merznich, 2002). To the extent that
Page 24 of 50
The Role of Temporal Cues
25
enhanced activation of the auditory cortex facilitates fluency, findings such as these may be at heart of Bloodstein’s (1995) observation that “…virtually any change in stutterers’ accustomed way of hearing themselves speak is likely to alleviate their speech difficulty” (p. 352).
ip t
There is some evidence to suggest that non-speech signals such as white noise lead to activation increases in the secondary auditory cortex, but that they do not directly facilitate
cr
activation of the auditory regions that process speech and voice (Paus, Marrett, Worsley, &
us
Evans, 1996; Paus, Perry, Zatorre, Worsely, & Evans, 1996). In light of this, it is interesting to speculate about whether the fluency enhancement during the conversational babble
an
condition might yield similar cortical activation patterns, and whether any such patterns are accompanied by facilitation of the speech-related regions of the auditory cortex. Additional
M
research is necessary to examine this possibility. Another potential means for altering sensorimotor relationships would be to reduce the extent to which a speaker relies upon
ed
typical, feed-forward processing for motor commands by triggering compensatory activation within the somatosensory mapping system. This approach has been examined by researchers
pt
(e.g., Civier, Tasko, & Guenther, 2010; Max et al., 2004) who have demonstrated that
Ac ce
introduction of novel or unanticipated perturbations of the speech articulators during the course of an ongoing utterance is sufficient to trigger activation of the sensory cortex, and in doing so, effectively alter one’s approach to speech production. Overall, findings such as these support the idea that there may be multiple mechanisms for enhancing sensorimotor integration and, with it, speech fluency. 4.2. Entrainment to choral signals and fluency enhancement Although exposure to speech-based choral signals did not seem to be necessary for fluency facilitation to occur in participants who stuttered, the results from the entrainment analyses provided partial support for idea that the participants who stuttered made use of the
Page 25 of 50
The Role of Temporal Cues
26
choral signals when they were available. For instance, during the mid- to late-stages of sentence production, their speech was more closely aligned to the temporal structure of the choral reference signal than was the speech of the participants who did not stutter. There also was a significant, albeit modest, relationship between the amount of stuttering reduction that
ip t
a speaker who stuttered exhibited under choral speech and the extent to which he or she entrained with a choral signal. In contrast, the participants with typical fluency seemed to stay
cr
with the choral signal for only the first few syllables of a sentence in the TCS and ACS
us
conditions, after which they spoke at a rate that exceeded the one in the choral signal. Both participant groups followed the choral signal more closely over the course of an utterance
an
when it offered a faithful rendering of the target utterance (the TCS condition) than they did when it offered a distorted rendering of the target utterance (the ACS condition). Overall, the
M
differences between participant groups in both entrainment patterns and speech rate suggest that speakers who stutter followed the choral signals during speech production, while
ed
speakers with typical fluency did not.
4.3. Limitations and future directions
pt
Some limitations of the present study should be highlighted. First, this study featured
Ac ce
a small sample size. Also, two of the participants who stutter fell into the mild range of stuttering severity, and thus had less room for fluency enhancement in comparison to participants with moderate and severe stuttering. As such, the main findings should be viewed with caution. Additional research with a larger and more diverse range of severity ratings is necessary for future experiments concerning the role of temporal cues in fluency facilitation. Second, the main experiment consisted of a series of sentence reading tasks. Although many sentences (i.e., 60 sentences in each of the auditory conditions) were used and the sentences were fairly lengthy, the speech production processes used in reading are not exactly the same as those used in spontaneous conversational speech. For example, sentence
Page 26 of 50
The Role of Temporal Cues
27
reading would be expected to place fewer formulation demands on speakers than conversation. Several participants exhibited relatively little stuttering during the sentence task. This may have been related to the use of a sentence-reading task for speech sample elicitation. Use of a reading task was deemed necessary for the present study because of the need
ip t
to systematically manipulate temporal information in the choral signals. Such manipulation would not have been possible had spontaneous conversational speech been used. In future
cr
research, however, it might be feasible to use non-reading speaking tasks such as modeled
us
sentences or production of rehearsed sentences, both of which provide some control over sentence form and content. Both Logan (2001) and Tsiamtsiouris and Cairns (2013) provide
an
examples of such tasks.
Thus far, the discussion has focused primarily on the role of external speech signals as
M
a means of facilitating sensorimotor integration and, with it, speech fluency in speakers who stutter. Based on findings from the present study concerning fluency enhancement in the
ed
context of conversational babble, it is interesting to speculate about how speakers who stutter attain fluency enhancement when cues about the temporal and/or auditory structure of a
pt
target utterance are absent. In addition to the mechanisms mentioned earlier in the discussion,
Ac ce
another possibility is that conditions such as conversational babbling prompt speakers to attend (i.e., self-monitor, self-regulate) more often or more fully to speech articulation, which, in turn, facilitates the sensorimotor processing necessary to support highly fluent speech. As noted earlier in this section and in the introduction, the role of focused attention in facilitating fluency during conditions like conversational babbling is unclear. In speakers who stutter, fluency enhancement under auditory masking (a context with similarities to conversational babble) is accompanied by the perception that speech becomes less effortful than it is when speaking with normal auditory feedback -- though still not equivalent to that of a typical speaker (Ingham et al., 2006; Ingham et al., 2009). The relationships among variables such as
Page 27 of 50
The Role of Temporal Cues
28
speaking effort, implementation of newly learned motor-based fluency management skills, exposure to auditory signals like auditory masking and conversational babble, and attentionrelated mechanisms are not well understood, though. Further research in this area may offer additional insights into both the role of temporal cuing in fluency enhancement and the
ip t
mechanisms of fluency enhancement in situations where cues about the temporal and/or
Ac ce
pt
ed
M
an
us
cr
auditory structure of a target utterance are absent.
Page 28 of 50
The Role of Temporal Cues
29
Acknowledgement
Ac ce
pt
ed
M
an
us
cr
ip t
The authors would like to thank Kristen Rodriguez for her assistance with data collection.
Page 29 of 50
The Role of Temporal Cues
30
References Alm, P.A. (2004). Stuttering and the basal ganglia circuits: A critical review of possible relations. Journal of Communication Disorders, 37, 325-369.
ip t
Andrews, G., Howie, P.M., Dozsa, M., & Guitar, B.E. (1982). Stuttering: Speech pattern characteristics under fluency-inducing conditions. Journal of Speech and Hearing
cr
Research, 25, 208-216.
Journal of Fluency Disorders, 13, 141-151.
us
Arends, N., Povel, D.-J., & Kolk, H. (1988). Stuttering as an attentional phenomenon.
an
Au-Yeung, J., Howell, P., & Pilgrim, L. (1998). Phonological words and stuttering on function words. Journal of Speech, Language, and Hearing Research, 41, 1019-
M
1030.
Barber, V. (1939). Studies in the psychology of stuttering: XV. Chorus reading as a
ed
distraction in stuttering. Journal of Speech Disorders, 4, 371-383. Belin P., Zatorre R.J., Lafaille P., Ahad P. & Pike B. (2000) Voice-selective areas inhuman
pt
auditory cortex. Nature, 403, 309-312.
Ac ce
Bhatnagar, S.C., & Buckingham, H. (2010). Neurogenic stuttering: Its reticular modulation. Current Neurology and Neuroscience Reports, 10, 491-498.
Bloodstein, O. (1950). Conditions under which stuttering is reduced or absent: A review of literature. Journal of Speech Hearing Disorders, 15, 142-153.
Bloodstein, O. (1995). A handbook on stuttering. San Diego, CA: Singular Publishing Group. Boberg, E., Yeudall, L., Schopflocher, D. & Bo-Lassen, P. (1983). The effect of an intensive behavioral program on the distribution of EEG alpha power in stutterers
Page 30 of 50
The Role of Temporal Cues
31
during the processing of verbal and visuospatial information. Journal of Fluency Disorders, 8, 245-273. Boersma, P., & Weenink, D. Praat: Doing phonetics by computer. [Computer Program]. Retrieved from http://www.praat.org.
ip t
Brayton, E.R., & Conture, E.G. (1978). Effects of noise and rhythmic stimulation on the speech of stuttering. Journal of Speech and Hearing Research, 21, 285-294.
cr
Brown, S., Ingham, R., Inghram, J., Laird, A., & Fox, P. (2005). Stuttered and fluent speech
us
production: An ALE meta-analysis of functional neuroimaging studies. Human Brain Mapping, 25, 105-117.
an
Cherry, E., & Sayers, B. (1956).Experiments upon the total inhibition of stammering by external control and some clinical results. Journal of Psychosomatic Research, 1,
M
233-246.
Civier, O., Tasko, S., Guenther, F.H. (2010) Overreliance on auditory feedback may lead to
ed
sound/syllable repetitions: Simulations of stuttering and fluency-inducing
35(3), 246-279
pt
conditions with a neural model of speech production. Journal of Fluency Disorders,
Ac ce
Cohen, J. (1988). Statistical power analysis for the behavioral sciences. (2nd ed.).Hillsdale, NJ: Lawrence Erlbaum Associates, Publishers.
Conover, W.J. (1980). Practical nonparametric statistics (2nd ed.). Wiley: New York. Conture, E. G., Colton, R. H., & Gleason, J. R. (1988). Selected temporal aspects of coordination during fluent speech of young stutterers. Journal of Speech and Hearing Research, 31, 640–653. Conture, E.G., & Kelly, E.M. (1991). Young stutterers' nonspeech behavior during stuttering. Journal of Speech and Hearing Research, 34, 1041–1056.
Page 31 of 50
The Role of Temporal Cues
32
Curio, G., Neuloh, G., Numminen, J., Jousmäki, V., & Hari, R. (2000). Speaking modifies voice-evoked activity in the human auditory cortex. Human Brain Mapping, 9, 183–191. De Nil, L.F., Kroll, R.M., Kapur, S., & Houle, S. (2000). A positron emission tomography
ip t
study of silent and oral single word reading in stuttering and nonstuttering adults. Journal of Speech, Language and Hearing Research, 43, 1038-1054.
cr
De Nil, L.F., Kroll, R.M., Lafaille, S.J., & Houle, S. (2003). A positron emission
us
tomography study short- and long-term treatment effects on functional brain activity in adults who stutter. Journal of Fluency Disorders, 28, 357-379.
an
Dreschler, W. A., Vershuure, H., Ludvigsen, C., & Westermann, S. (2001). ICRA noises: Artificial noise signals with speech-like spectral and temporal properties for
M
hearing instrument assessment. Audiology, 40, 148-157.
Dunn, L.M., & Dunn, L.M. (1997). Peabody Picture Vocabulary Test-Third Edition.
ed
Bloomington, MN: Pearson Assessments. Dworzynski, K., & Howell, P. (2004). Predicting stuttering from phonetic complexity in
pt
German. Journal of Fluency Disorders, 29, 149-173.
Ac ce
Ferreira, F. (1993). The creation of prosody during sentence processing. Psychological Review,100, 233-253.
Fox, P.T., Ingham, R.J., Ingham, J.C. Hirsch, T.B., Downs, J.H., Martin, C. (1996). A PET study of the neural systems of stuttering. Nature, 382, 158-161.
Fox, P.T., Ingham, R.J., Ingham, J.C., Zamarripa, F., Xiong, J.H., & Lancaster, J.L. (2000). Brain correlates of stuttering and syllable production: A PET performancecorrelation analysis, Brain, 123, 1985-2004. Fransella, F. (1967). Rhythm as a distractor in the modification of stuttering. Behavior Research Therapy, 5, 253-255.
Page 32 of 50
The Role of Temporal Cues
33
Fransella, F., & Beech, H.R. (1965). An experimental analysis of the effect of rhythm on the speech of stutterers. Behavior Research Therapy, 3, 195-201. Freeman, K., & Armson, J. (1998). Extent and stability of stuttering reduction during choral reading. Journal of Speech-Language Pathology and Audiology, 22, 188-202.
ip t
Guntupalli, V. K., Kalinowski, G., Saltuklaroglu, T., & Nanjundeswaran, C. (2005). The effects of temporal modification of second speech signals on stuttering inhibition at
cr
two speech rates in adults. Neuroscience Letters, 385, 7–12.
us
Guntupalli, V. K., Nanjundeswaran, C., Kalinowski, J., & Dayalu, V. N. (2011). The effect of static and dynamic visual gestures on stuttering inhibition. Neuroscience Letters,
an
492, 39-42.
Hickok, G., Houde, J., & Rong, F. (2011). Sensorimotor Integration in Speech Processing:
M
Computational Basis and Neural Organization. Neuron, 69, 407-422. Houde, J. F., Nagarajan, S. S., Sekihara, K., & Merzenich, M. M. (2002). Modulation of the
14, 1125–1138.
ed
auditory cortex during speech: An MEG study. Journal of Cognitive Neuroscience,
pt
Howell, P. (2004). Assessment of some contemporary theories of stuttering that apply to
Ac ce
spontaneous speech. Contemporary Issues in Communication Science and Disorders, 31, 123-140.
Howell, P. (2007). The effects of gated speech on the fluency of speakers who stutter. Folia Phoniatrica et Logopaedica, 59, 250-255.
Howell, P., Au-Yeung, J., & Sackin, S. (1999). Exchange of stuttering from function words to content words with age. Journal of Speech, Language, and Hearing Research, 42, 345-354. Howell, P., & Powell, D.J. (1987). Delayed auditory feedback with delayed sounds varying in duration. Perception and Psychophysics, 42, 166–172.
Page 33 of 50
The Role of Temporal Cues
34
Howell, P., Powell, D. J., & Khan, I. (1983). Amplitude contour of the delayed signal and interference in delayed auditory feedback tasks. Journal of Experimental Psychology: Human Perception and Performance, 9, 772-784. Howell, P., & Sackin, S. (2002). Timing interference to speech in altered listening
ip t
conditions. Journal of Acoustical Society of America, 111, 2842-2852. Hudock, D., Dayalu, V. N., Saltuklaroglu, T., Stuart, A., Zhang, J., & Kalinowski, J. (2011).
cr
Stuttering inhibition via visual feedback at normal and fast speech rates.
us
International Journal of Language & Communication Disorders, 46, 169-178. Ingham, R. J., Bothe, A. K., Jang, E., Yates, L., Cotton, J., Seybold, I. (2009). Measurement
an
of speech effort during fluency-inducing conditions in adults who do and do not stutter. Journal of Speech, Language, and Hearing Research, 52, 1286-1301.
M
Ingham, R.J., & Packman, A. (1979). A further evaluation of the speech of stutterers during chorus-and nonchorus reading conditions. Journal of Speech and Hearing
ed
Research, 22, 784-793.
Ingham, R. J., Warner, A., Byrd, A., & Cotton, J. (2006). Speech effort measurement and
pt
stuttering: Investigating the chorus reading effect. Journal of Speech, Language,
Ac ce
and Hearing Research, 49, 660-670. Jastak, J., & Jastak, S. (1984). Wide-Range Achievement Test—Revised. Wilmington, DE: Jastak Associates.
Johnson, W., & Rosen, L. (1937). Studies in the psychology of stuttering VII: effect of certain changes in speech pattern upon frequency of stuttering. Journal of Speech Disorders, 2, 105-110. Kalinowski, J., Saltuklaroglu, T., Guntupalli, V., & Stuart, A. (2004). Gestural recovery and the role of forward and reversed syllabic repetitions as stuttering inhibitors in adults. Neuroscience Letters, 363, 144-49.
Page 34 of 50
The Role of Temporal Cues
35
Kent, R. D. (1984). Stuttering as a temporal programming disorder. In R. Curlee & W. Perkins (Eds.), Nature and treatment of stuttering: New directions (pp. 283-301). San Diego, CA: College Hill Press. Kiefte, M., & Armson, J. (2008). Dissecting choral speech: Properties of the accompanist
ip t
critical to stuttering reduction. Journal of Communication Disorders, 41, 33-48. Kleinow, J., & Smith, A. (2000). Influence of length and syntactic complexity on the speech
cr
motor stability of the fluent speech of adults who stutter. Journal of Speech,
us
Language, and Hearing Research, 43, 548-559.
Kroll, R. M., De Nil, L. F., Kapur, S., & Houle, S. (1997). A positron emission tomography
an
investigation of post-treatment brain activation in stutterers. In H. F. M. Peters & W. Hulstijn (Eds.), Proceedings of the third international conference on speech
M
motor production and fluency disorders (pp. 307-320). Amsterdam, The Netherlands: Elsevier Science Publishers.
ed
Leech, N. L., Barrett, K. C., & Morgan, G. A. (2008). SPSS for intermediate statistics: Use and interpretation (3rd ed.). New York: Lawrence Erlbaum Associates.
pt
Liberman, A. M., & Mattingly, I.G. (1985). The motor theory of speech perception revised.
Ac ce
Cognition, 21, 1-36.
Liberman, A. M., & Whalen, D. H. (2000). On the relation of speech to language. Trends in Cognitive Sciences, 4,187-196.
Logan, K. J. (2001). The effect of syntactic complexity upon the speech fluency of adolescents and adults who stutter. Journal of Fluency Disorders, 26, 85-106.
Logan, K. J., Byrd, C. T., Mazzocchi, E. M., & Gillam, R. B. (2010). Speaking rate characteristics of elementary-school-aged children who do and do not stutter. Journal of Communication Disorders, 44, 130-147.
Page 35 of 50
The Role of Temporal Cues
36
MacKay, D. G., & MacDonald, M. C. (1984). Stuttering as a sequencing and timing disorder. In R.F. Curlee & W.H. Perkins (Eds.), Nature and treatment of stuttering: New directions (pp. 261-82). San Diego: College-Hill Press. Mallard, A. R., & Webb, W.G. (1980). The effects of auditory and visual “distractors” on
ip t
the frequency of stuttering. Journal of Communication Disorders, 13, 207-212.
for the social sciences. Monterey, CA: Brooks/Cole.
cr
Marasculio, L. A., & McSweeney, N. (1977). Nonparametric and distribution-free methods
us
Max, L., & Gracco, V. (2005). Coordination of oral and laryngeal movements in the perceptually fluent speech of adults who stutter. Journal of Speech, Language, and
an
Hearing Research, 48, 524-542.
Max, L., Guenther, F., Gracco, V., Ghosh, S., & Wallace, M. (2004). Unstable or
M
insufficiently activated internal models and feedback-biased motor control as sources of dysfluency: A theoretical model of stuttering. Contemporary Issues in
ed
Communication Sciences and Disorders, 31, 105-122. Max L, & Yudman, E. (2003). Accuracy and variability of isochronous rhythmic timing
pt
access motor systems in stuttering versus nonstuttering individuals. Journal of
Ac ce
Speech, Language and Hearing Research, 46, 146–163. Neumann, K., Euler, H. A., von Gudenberg, A., Giraud, A., Lanfermann, H., Gall, V., & Preibisch, C. (2003). The nature and treatment of stuttering as revealed by fMRI: A within- and between-group comparison. Journal of Fluency Disorders, 28, 381-410. Neumann, K., Preibisch, C., Euler, H. A., von Gudenberg, A., Lanfermann, H., Gall, V., & Giraud, A. (2005). Cortical plasticity associated with stuttering therapy. Journal of Fluency Disorders, 30, 23-39. Packman, A., Code, C., & Onslow, M. (2007). On the cause of stuttering: Integrating theory with brain and behavioral research. Journal of Neurolinguistics, 20, 253-362.
Page 36 of 50
The Role of Temporal Cues
37
Packman, A., Onslow, M., Richard, F., & van Doorn, J. (1996). Syllabic stress and variability: A model of stuttering. Clinical Linguistics and Phonetics, 10, 235-263. Paus, T., Marrett, S., Worsley, K., & Evans, A. (1996). Imaging motor-to-sensory discharges in the human brain: An experimental tool for the assessment of
ip t
functional connectivity. NeuroImage, 4, 78–86. Paus, T., Perry, D. W., Zatorre, R. J., Worsley, K. J., & Evans, A. C. (1996). Modulation of
cr
cerebral blood flow in the human auditory cortex during speech: Role of motor-to-
us
sensory discharges. European Journal of Neuroscience, 8, 2236–2246.
Perkins, W.H., Kent, R.D., & Curlee, R.F. (1991). A theory of neuropsycholinguistic
an
function in stuttering. Journal of Speech and Hearing Research, 34, 734-52. Rami, M.K., Kalinowski, J., Rastatter, M.P., Holbert, D., & Allen, M. (2005). Choral
M
reading with filtered speech: effect on stuttering. Perceptual and Motor Skills, 100, 421-431.
ed
Riley, G.D. & Bakker, K. (2009). Stuttering Severity Instrument-4 (SSI-4). Pro-Ed. Saltuklaroglu, T., Kalinowski, J., & Guntupalli, V.K. (2004). Towards a common neural
pt
substrate in the immediate and effective inhibition of stuttering. International
Ac ce
Journal of Neuroscience, 114, 435-450. Scott, S.K., Blank, C.C., Rosen, S., & Wise, J.R. (2000). Identification of a pathway for intelligible speech in the left temporal lobe, Brain, 123(12), 2400-2406.
Smits-Bandstra, S., & De Nil, L. F. (2007). Sequence skill learning in persons who stutter: implications for cortico-striato-thalamo-cortical dysfunction. Journal of Fluency
Disorders, 32, 251-278. Stager, S.V., Jeffries, K.J., & Braun, A.R. (2003). Common features of fluency-evoking conditions studied in stuttering subjects and controls: An H215OPET study. Journal of Fluency Disorders, 28, 319–336.
Page 37 of 50
The Role of Temporal Cues
38
Stuart, A. (1999). The distraction hypothesis and the practice of pseudoscience: A reply to Bloodstein. Journal of Speech, Language, and Hearing Research, 42, 913-914. Tsiamtsiouris, J., & Cairns, H. S. (2013). Effects of sentence-structure complexity on speech initiation time and disfluency. Journal of Fluency Disorders, 38, 30-44.
ip t
Van Riper, C. (1982). The nature of stuttering. (2nd ed.). Englewood Cliffs, NJ: PrenticeHall.
cr
Wingate, M. E. (1969). Sound and pattern in “artificial” fluency. Journal of Speech and
us
Hearing Research, 12, 677-86.
Wingate, M. E. (1984). Stutter events and linguistic stress. Journal of Fluency Disorders, 9,
an
295-300.
Zimmerman, G. (1980). Articulatory dynamics of fluent utterances of stutterers and
Ac ce
pt
ed
M
nonstutterers. Journal of Speech and Hearing Research, 23, 108-121.
Page 38 of 50
The Role of Temporal Cues
39
Appendix Listed below are examples of the stimulus sentences that were used to elicit speech samples. Sentences 11 – 20 are examples of the subset of stimuli with similar syntax and prosody that were used for speech sample elicitation as well as the speech entrainment analysis.
cr
2. Is the price of the parking decal for motorcycles reasonable?
ip t
1. The athlete who ran for student government selected his running mates.
3. The neighborhood residents will be talking about their primary concerns.
us
4. The journalism majors decided to form a new student newspaper.
5. Philadelphia scored eleven straight points in the fourth quarter and won.
an
6. Many graduate students will receive scholarships to pay for tuition.
M
7. Mr. Peters argued that children deserve quality health insurance coverage. 8. Christopher promised me he would review the problem before Saturday.
ed
9. The police officer ordered the crowd to step back immediately. 10. Jonathan asked, "Was Catherine on the soccer team last semester?"
pt
11. The policeman and detective had ticketed the Canadian by mistake. 12. The composer and teacher had directed the musical from September.
Ac ce
13. The historian from Kentucky will publish the biography on Tuesday. 14. The chancellor from Cambridge could disappoint the people by resigning. 15. The passenger from Baltimore will compliment the captain from Boston. 16. The governor of Tennessee will continue the discussion on Saturday. 17. The technician from Toshiba is purchasing the materials from Gateway. 18. The director of Patriot has donated the valuables to Benjamin. 19. The candidate for Congress was discussing his concerns with Democrats. 20. The governor of Colorado will pardon the lawbreaker by Tuesday.
Page 39 of 50
The Role of Temporal Cues
40
Ac ce
pt
ed
M
an
us
cr
ip t
Figure Captions Fig.1. Sample screen shot showing the extent of entrainment between a choral speech model (top panel) and a participant’s response (bottom panel) at five points within the utterance (dashed lines). The target sentence is The director of Patriot has donated the valuables to Benjamin. The participant’s response is closely aligned in time to the choral signal at entrainment points 1 and 2. At entrainment points 3, 4, and 5, however, the participant’s response is slightly ahead of the choral speech model, indicating that the participant’s articulation rate is faster than that of the choral model. The screen shot was taken from Praat [Boersma & Weenink, 2009]. Fig. 2. Mean frequency of stuttering-like disfluencies (SLDs) for participants who stutter across speaking conditions (Solo = normal auditory feedback; TCS = typical choral signal; ACS = temporally-altered choral signal; BAB = conversational babble feedback; Error bars = standard error.) Fig. 3. Speaking rate (mean number of syllables uttered per second) for participants who stutter and participants who do not stutter across the experimental speaking conditions (Solo = normal auditory feedback, no choral speech; TCS = typical choral signal; ACS = temporally-altered choral signal; BAB = conversational babble feedback; Error bars = standard error.) Fig. 4. Mean differences in phone onset time (in seconds) for speakers who stutter and speakers who do not stutter during the typical (TCS) and altered (ACS) choral speech conditions. Phone onset times for each participant were computed by subtracting the amount of elapsed time from the end of the standard carrier phrase that preceded each sentence from the onset of each of five predetermined reference points within the sentences. Participant phone onset times were then subtracted from the phone onset times in associated choral signal to provide information on the extent to which speaker’s articulation was entrained with the choral signal from the TCS and ACS conditions. Bars in the positive range indicate that the speaker’s articulation lags the choral signal. Alternately, bars in the negative range indicate the speaker’s articulation precedes the choral signal. Bar height indicates the extent of difference between the speaker’s output and the choral signal (RP1 = sentence-initial point; RP2, RP3, RP4 = three sentence-medial points, RP5= sentence-final point; Error bars = standard error)
Page 40 of 50
The Role of Temporal Cues
4
5
pt
ed
M
an
us
cr
ip t
3
Ac ce
12
41
Page 41 of 50
42
Ac ce
pt
ed
M
an
us
cr
ip t
The Role of Temporal Cues
Page 42 of 50
43
Ac ce
pt
ed
M
an
us
cr
ip t
The Role of Temporal Cues
Page 43 of 50
44
Ac ce
pt
ed
M
an
us
cr
ip t
The Role of Temporal Cues
Page 44 of 50
The Role of Temporal Cues
45
Biographical Sketch Jin Park, Ph.D. is a visiting professor in Department of Linguistics, Chungnam National University, Daejeon, Korea. In recent years, Dr. Park has taught courses in psycholinguistics and speech disorders at several universities in Korea. His current research examines the associations between stuttering, prosody, and neuromotor performance.
Ac ce
pt
ed
M
an
us
cr
ip t
Kenneth J. Logan, Ph.D., CCC-SLP, is an Associate Professor and Program Director in the Department of Speech, Language, and Hearing Sciences at the University of Florida, where he teaches courses, conducts research, and provides clinical services in the area of stuttering.
Page 45 of 50
The Role of Temporal Cues
46
ip t
CE Questions 1. What was NOT the main purpose of the present study? a. To investigate the role of temporal cueing in the speech fluency of adults who stutter b. To examine the mechanisms that underlie fluency enhancement during choral speaking c. To investigate whether adults who stutter depend on an accurate rendering of temporal information to benefit from choral speech d. To examine whether adults who stutter fluently under choral reading condition than speaking under delayed auditory feedback e. To examine whether adults who stutter follow choral signals more closely than adults who do not stutter do
Ac ce
pt
ed
M
an
us
cr
2. What was the rationale for using multi-speaker babble noise in the present study? a. It allowed for the concurrent presentation of a speech-like signal that contained coherent segmental cues b. It allowed for the concurrent presentation of a speech-like signal that contained coherent temporal cues c. It allowed for the presentation of a speech-like signal that lacked coherent temporal and segmental cues d. It allowed for examination of the “distraction hypothesis” e. It allowed for presentation of the “auditory masking effect” 3. Results from the present study DOES NOT indicate that: a. Adults who stutter spoke more fluently in temporally-altered choral speaking condition than they did when speaking solo. b. Adults who stutter spoke less fluently while listening to multi-speaker conversational babble than they did when speaking solo. c. Adults who stutter spoke more fluently in temporally-unaltered choral speaking condition than they did when speaking solo. d. Adults who stutter spoke slower in both of temporally-altered and unaltered choral speaking conditions than they did when speaking solo. e. Adults who stutter exhibited more temporal entrainment with the choral speech in both of altered and unaltered choral speaking conditions. 4. Results from the present study support the idea that: a. Adults who stutter make greater use of choral speech signals than adults who do not stutter b. Adults who stutter may not benefit from choral speech signals to attain the enhanced fluency c. Adults who stutter require accurate segmental models of lexical targets in order to enhance speech fluency d. Adults who stutter require accurate temporal models of lexical targets in order to enhance speech fluency e. Adults who stutter require accurate gestural models of phonetic targets in order to enhance speech fluency 5. According to information discussed in the discussion, what are possible explanations for the patterns of fluency facilitation observed during choral speech with speakers who stutter? a. Use of both a “temporal modeling strategy” and a “focused attention” . b. Use of a “relaxation” strategy. c. Use of a “rehearsal” strategy d. Use of both a “relaxation” and “rehearsal” strategy
Page 46 of 50
The Role of Temporal Cues
47
Ac ce
pt
ed
M
an
us
cr
ip t
e. Use of “regulated breathing” strategy.
Page 47 of 50
The Role of Temporal Cues
48
Ac ce
pt
ed
M
an
us
cr
ip t
CE Answers Key: 1 – d; 2 – c; 3 – b; 4 – a; 5 – a.
Page 48 of 50
The Role of Temporal Cues
49
Educational Objectives:
Ac ce
pt
ed
M
an
us
cr
ip t
The reader will be able to (a) summarize competing views on stuttering as a speech timing disorder, (b) describe the extent to which adults who stutter depend on an accurate rendering of temporal information in order to benefit from choral speech, and (c) discuss possible explanations for fluency facilitation in the presence of inaccurate or indeterminate temporal cues.
Page 49 of 50
cr
ip t
Table(s)
2 p .21 <.001 .22
F(1, 14) 8.02 62.96 .29
4 p .01 <.001 .60
2
P.E. .36 .82 .02
F(1, 14) 6.91 9.78 .003
5 p .02 .007 .96
P.E.2 .33 .41 0
M
P.E. .11 .86 .11
an
2
Reference Point 3 F(1, 14) p P.E.2 5.49 .01 .36 62.96 <.001 .83 .64 .44 .04
Ac c
ep te
d
1 Source F(1, 14) p P.E.2 F(1, 14) Group .31 .59 .02 1.78 Task 1.94 .19 .12 84.79 G x Ta 2.05 .17 .13 1.69 2 2 a ( P.E. = partial eta , G = Group; T = Task)
us
Table 1. Results of 2-way Analyses of Variance (Fluency Group x Speaking Task) Across the 5 Reference Points)
Page 50 of 50