Processes of task-set reconfiguration: switching operations and implementation operations

Acta Psychologica 111 (2002) 1–28 www.elsevier.com/locate/actpsy Processes of task-set reconﬁguration: switching operations and implementation operat...

Download PDF

261KB Sizes 2 Downloads 149 Views

Report

PDF Reader
Full Text

Acta Psychologica 111 (2002) 1–28 www.elsevier.com/locate/actpsy

Processes of task-set reconﬁguration: switching operations and implementation operations Thomas Kleinsorge *, Herbert Heuer, Volker Schmidtke Institut f € ur Arbeitsphysiologie, Universit€at Dortmund, Ardeystraße 67, D-44139 Dortmund, Germany Received 29 January 2001; received in revised form 1 October 2001

Abstract When participants are asked to shift between four dimensionally organized tasks which differ in the type of judgment (numerical vs. spatial) and/or the judgment-to-response mapping (compatible vs. incompatible), a characteristic proﬁle of shift costs can be observed. It can be accounted for in terms of two diﬀerent types of operations: generalizing switching operations on a dimensionally organized set of task representations and implementation operations [T. Kleinsorge, H. Heuer, Psycholog. Res. 62 (1999) 300]. In a ﬁrst experiment we corroborated our previous ﬁndings by way of a new procedure that makes it possible to estimate shift costs unconfounded by a number of factors that are likely to aﬀect estimates of shift costs based on more conventional procedures. In a second experiment we investigated the endogenous and exogenous nature of the postulated types of operations. The characteristic proﬁle of shift costs disappeared when long precue intervals (PCIs) were used. Augmented by a formal analysis, this ﬁnding suggests that both switching and implementation operations are endogenously controlled. In addition, there remained some residual shift costs which were essentially insensitive to the nature of the task shift but depended on the diﬃculty of the new task. Most likely they reﬂect a process of consolidation of an already conﬁgured task set. Ó 2002 Elsevier Science B.V. All rights reserved. PsycINFO classiﬁcation: 2340 Keywords: Shift costs; Endogenous control; Exogenous control

*

Corresponding author. Tel.: +49-231-1084-321; fax: +49-231-1084-340. E-mail address: [email protected] (T. Kleinsorge).

0001-6918/02/$ - see front matter Ó 2002 Elsevier Science B.V. All rights reserved. PII: S 0 0 0 1 - 6 9 1 8 ( 0 1 ) 0 0 0 7 6 - 2

2

T. Kleinsorge et al. / Acta Psychologica 111 (2002) 1–28

1. Introduction In contrast to cognitive psychology’s progress in understanding the processes and structures that underlie the performance of a wide range of individual tasks, the processes by which humans conﬁgure themselves to perform a certain task (task-set conﬁguration), and thus to deal with the stimuli they are confronted with in a certain way and not in other ways, have been widely neglected. Consequently, Monsell (1996, p. 93) characterized this area as a ‘‘somewhat embarrassing zone of almost total ignorance’’. However, in recent years research on task-set conﬁguration has been intensiﬁed, and there seems to be a dramatic increase in both empirical and theoretical eﬀorts to overcome the ignorance Monsell complained about. The present paper is one of these eﬀorts. It is concerned with the identiﬁcation and characterization of diﬀerent processes involved in task-set conﬁguration and with the eventual convergence of diﬀerent proposals which have been based on diﬀerent methodological approaches. Task shifting has become the main paradigm for the study of task-set conﬁguration. It involves the speeded and frequent alternation between individual tasks. Inferences are based primarily on reaction times, in particular on the reaction-time diﬀerence between trials which involve a task shift (shift trials) and trials which involve a task repetition (non-shift trials). Typically, this diﬀerence is positive and called the shift cost. Several sources have been suggested to contribute to shift costs. On a rather general level a distinction can be drawn between accounts of shift costs in terms of active (re-)conﬁguration processes (e.g., Rogers & Monsell, 1995) and accounts in terms of episodic carry-over (e.g., Wylie & Allport, 2000). Although proposing diﬀerent kinds of mechanisms, both lines of theorizing are certainly not mutually exclusive, and there seems to be a tendency in the recent literature to end up with an integration of both accounts (e.g., Goschke, 2000; Meiran, 2000). While not denying the role of episodic carry-over as a determinant of shift costs (the mere existence of which indicates at least some unspeciﬁc carry-over eﬀect), the account presented in the present paper is focused on the analysis of shift costs in terms of the duration of active reconﬁguration processes. One reason for this is that the methodological approach we have chosen largely abstracts from speciﬁc transitions between individual tasks and in doing so neglects important information needed to account for shift costs in terms of speciﬁc episodic carry-over eﬀects. Nevertheless, some of the critical ﬁndings we interpret as supporting our account in terms of active reconﬁguration alternatively could be explained as episodic carryover eﬀects, therefore we shall discuss such alternative accounts in Section 4. Beyond this the reader should keep in mind that not mentioning the contribution of processes of episodic carry-over does not imply the claim that these processes do not exist, but is a result of our speciﬁc methodological approach and theoretical emphasis. Our eﬀorts to gain insight into the processes that underlie task-set conﬁguration started with the consideration that, although task-shifting experiments typically involve sets of only two tasks in each experimental condition, the use of larger sets

T. Kleinsorge et al. / Acta Psychologica 111 (2002) 1–28

3

might have the potential of providing additional insights into the processes of taskset reconﬁguration (cf. Mayr & Keele, 2000, for a similar point of view). This led us to explore shift costs with a set of four dimensionally organized tasks (Kleinsorge & Heuer, 1999), with the two task dimensions being the type of judgment (numerical vs. spatial) and the judgment-to-response mapping (compatible vs. incompatible). Our general hypothesis, which was based on some analogies between task-set conﬁguration and motor programming (cf. Kleinsorge, 2000, for details), was that shift costs should reﬂect the dimensional organization of the four tasks, which indeed they did, though in a somewhat complicated manner. Shift costs were not simply a monotonic function of the number of dimensions on which the task changed from one trial to the next, but the eﬀect of a change on one dimension depended on whether or not certain other dimensions were changed as well. The ﬁrst observation of such a phenomenon dates back at least to Rogers and Monsell (1995): they found that the classical response–repetition beneﬁts (e.g., Kirby, 1980), which showed up in non-shift trials in their experiments too, turned into response–repetition costs in terms of reaction time and/or error rate in shift trials. In fact, response–repetition costs have also been observed with more subtle changes of stimulus category (Kleinsorge, 1999). In our previous study (Kleinsorge & Heuer, 1999) we found in addition that the reaction-time beneﬁts of a repeated judgmentto-response mapping over an alternated mapping turned into reaction-time costs when only the mapping was repeated, but the type of judgment was changed. When the type of judgment was repeated, however, there were always reaction-time beneﬁts as compared to trials in which the type of judgment was changed, no matter whether the mapping was repeated or changed. These observations suggest that switches on certain dimensions imply (or prime) switches on certain other dimensions, while the reverse is not necessarily true. More speciﬁcally, we accounted for the observed proﬁle of shift costs as a function of the relation between successive tasks in terms of two types of operations which serve, ﬁrst, to select the appropriate task representation and, second, to transform the selected task representation into the appropriate task-control structure (implementation). Each of these processes is characterized by its duration. The selection of the appropriate task representation we assumed to be composed of a number of switching operations in a dimensionally organized memory representation which we called the ‘‘task space’’, with each single switching operation having a constant duration. The number of switching operations depends on the relation between successive tasks according to the following rules: 1. In addition to the type of judgment and the judgment-to-response mapping, the responses constitute a third task dimension. These three dimensions form a hierarchy, with the type of judgment on top and the responses at the bottom of the hierarchy. Primary switches on a higher-level dimension generalize to all lower-level dimensions, resulting in concurrent switches on these lower-level dimensions. 2. Corrective re-switches are required when generalized switches are inappropriate; they occur serially and do not generalize to lower-level dimensions.

4

T. Kleinsorge et al. / Acta Psychologica 111 (2002) 1–28

The resulting number of switches which, when multiplied with the duration of a single switch, determines the duration of the selection process, is given in the second column of Table 1 for each relation between successive tasks. For example, when only the type of judgment has to be changed from one trial to the next (ﬁfth row in Table 1), this results in three switching operations. First, there is a switch on the judgment level that generalizes to the mapping level and the response level. Second, because the features on these two levels actually are repeated, two corrective reswitches are needed in addition. For the second kind of process involved in task-set reconﬁguration we assumed that its duration is diﬀerent when only a new mapping has to be implemented and when a new type of judgment has to be implemented; in addition we assumed that implementation of a new type of judgment implies implementation of a new mapping of the outcomes of the judgment to the responses. With our previous data it even appeared that implementation of a new type of judgment together with a new mapping required twice the time needed by the implementation of a new mapping only. The assumption of a hierarchical ordering of the three task dimensions is based on the fact that it is logically necessary to know the possible outcomes of a judgment process before they can be mapped on their corresponding responses, with this mapping in turn having logical priority over the activation of a response. Correspondingly, it is plausible that task-set reconﬁguration starts with specifying the required type of judgment before selecting the particular judgment-to-response mapping (cf. Kleinsorge & Heuer, 1999, for details). The assumption that primary but not corrective switches generalize downstream the hierarchy is plausible on the basis of functional considerations. A generalizing switch should be an eﬃcient means of cognitive reconﬁguration under conditions in which a change on a superordinate level of action representation (for example, the level of goals) normally goes along with the requirement to establish another repertoire of subordinate actions. However, when also corrective re-switches would generalize, this would endanger the stability

Table 1 Hypothetical durations of selection and implementation, observed and predicted mean reaction times, and error rates for the diﬀerent relations between successive tasks (previous type of Judgment same or diﬀerent, previous Mapping same or diﬀerent, previous Response same or diﬀerent) Relation

Hypothetical duration

Mean reaction time

J

M

R

Selection

Implementation

Observed

Predicteda

¼ ¼ ¼ ¼ 6 ¼ 6 ¼ 6 ¼ 6 ¼

¼ ¼ ¼ 6¼ 6¼ ¼ 6¼ 6¼

¼ ¼ 6¼ ¼ 6¼ 6¼ ¼ 6¼

0 TS 2TS TS 3TS 2TS 2TS TS

0 0 TM TM TJ TJ TJ TJ

453 478 802 783 923 926 882 875

454 477 804 781 925 902 902 878

Error percentage 1.4 1.67 8.02 4.13 4.22 3.09 2.43 1.09

a With parameter estimates RT0 ¼ 454 ms, duration of switching operation TS ¼ 23:3 ms, duration of mapping implementation TM ¼ 304 ms, duration of judgment plus mapping implementation TJ ¼ 401 ms.

T. Kleinsorge et al. / Acta Psychologica 111 (2002) 1–28

5

of the system because with action representations that cover a wider range of intermediate levels a whole sequence of generalizing re-switches might be required. Our account of the proﬁle of shift costs among a set of dimensionally organized tasks leads to a simple additive model which can be ﬁtted to the observed data. Independent of any goodness-of-ﬁt measure, it suggests that certain qualitative diﬀerences in shift costs should exist between certain sets of relations between successive tasks. In addition, the formalism is derived from more general theoretical assumptions. Perhaps the most important of these assumptions is that primary switches generalize across all lower-level task dimensions. This assumption seems to imply certain organizational characteristics of the task space, the organized set of memory representations of all tasks that are possible in a certain situation. The present experiments are intended to bolster and elaborate our account of shifting within a set of dimensionally organized tasks. In the ﬁrst experiment we address a number of methodological criticisms that can be raised against our previous ﬁndings. In the second experiment we relate our hypothesized processes of task-set reconﬁguration, namely selection – characterized by the number of switching operations – and implementation to the distinction of endogenously and exogenously controlled processes (Rogers & Monsell, 1995).

2. Experiment 1 Our account of shifting among a set of dimensionally organized tasks thus far is largely post hoc and based on a pattern of experimental results that was not really predicted. Thus, of course, the ﬁndings need replication, which also allows data analyses guided by speciﬁc a priori hypotheses. Second, and this is perhaps a more serious concern, our results were obtained with a particular variant of the task-shifting paradigm which, as we know now, is not necessarily appropriate for the kind of account we have suggested. Thus, our ﬁrst step here is to exclude potential artefacts which could have aﬀected the switch-cost proﬁle and thus the basis for our tentative conclusions. Variants of the task-shifting paradigm diﬀer in how the sequences of tasks are arranged. For example, in some experiments performance in pure (only one task) and mixed (alternating tasks) blocks of trials is compared (e.g. Allport, Styles, & Hsieh, 1994; Rubinstein, Meyer, & Evans, 2001). In other experiments shift trials and nonshift trials from same blocks are compared, as in the alternating-runs procedure (Rogers & Monsell, 1995) or with random sequences of tasks (Meiran, 1996). In our previous study tasks varied unpredictably within long sequences of trials. This procedure embodies the risk of at least two types of artefacts when the purpose is to assess the durations of processes of task-set reconﬁguration. The ﬁrst type of artefact is related to the baseline measures, that is, to the assessment of performance in non-shift trials. Although Rogers and Monsell (1995, Exp. 6) found essentially no diﬀerences between non-shift trials which were ﬁrst, second, or third task repetitions, such a ﬁnding seems not to be universal, but lagged eﬀects of task shifts have also been observed (e.g., Kleinsorge, 1997; Meiran, Chorev, & Sapir,

6

T. Kleinsorge et al. / Acta Psychologica 111 (2002) 1–28

2000). In fact, they were also present in our previous study (Kleinsorge & Heuer, 1999). Whenever this is the case, there is some uncertainty with respect to which trials to use as baseline. In particular artefacts can accrue from this rather arbitrary choice as soon as lagged eﬀects of a task-shift depend on the nature of this shift. The second type of artefact is related to the recent observation that shift costs do not only depend on the current task shift, but also on the preceding one (Mayr & Keele, 2000). Speciﬁcally, the costs of shifting from task B to task A have been shown to be higher when B was preceded by task A than when B was preceded by a third task C, presumably because of the requirement to overcome some residual inhibition of task A in trial n that has been built up when it has been abandoned in trial n 1. The more general conclusion one can draw from this ﬁnding is similar to the conclusion that reaction time in non-shift trials is likely to depend on whether earlier trials were also non-shift trials or not: reaction time in shift trials does not only depend on the current shift but also on preceding ones. With dimensionally organized tasks higher-order sequential eﬀects, as they have been identiﬁed by Mayr and Keele (2000), can become quite complicated and opaque. According to such considerations, strict control of context is required when the purpose is to study shift costs unconfounded by diﬀerent kinds of sequential eﬀects, especially when conclusions are to be drawn on quantitative estimates of the durations of certain conﬁguration processes. As mentioned in Section 1, our account of shifting within a set of dimensionally organized tasks is concerned only with processes of task-set reconﬁguration and neglects additional processes which are now known to aﬀect shift costs. Thus, it is crucial that the data are not confounded by at least those additional components of task-shifting like the overcoming of residual backward inhibition that can readily be supposed on a priori grounds. For the present experiments we chose a procedure similar to the one used by Gopher, Armony, and Greenshpan (2000). Basically, the sequence of trials was split into blocks of ﬁve trials each with breaks between blocks. Within each block a single task shift occurred randomly in the third or fourth trial or not at all. This design allows strict control over all sequential dependencies at least on the time scale of a single block and, among other things, has the advantage that shift and non-shift trials occur in strictly identical contexts, which is not the case, for example, when pure and mixed blocks are compared or with the alternating-runs procedure. 2.1. Method 2.1.1. Participants Twelve female and four male participants took part in the experiment. Their mean age was 24.9 years (range: 18–31 years). They were paid for their participation. 2.1.2. Apparatus Stimuli were displayed on a 1400 VGA monitor, placed at about 60 cm distance from the participants’ eyes. The response device consisted of a small box (width: 11.5 cm, length: 15 cm) which was slightly tilted toward the participant with a height of 1.5 cm at the near end and a height of 3 cm at the far end. Two rows of ‘upper’

T. Kleinsorge et al. / Acta Psychologica 111 (2002) 1–28

7

and ‘lower’ keys (1:5 1:5 cm) were let in this box. The lateral distance between the two keys of each row was 6.5 cm, and the distance between the two rows was 6 cm. Only the two lower keys were used as response keys. The experiment was controlled by a 486 microcomputer. 2.1.3. Tasks and stimuli The experimental tasks were created by a factorial combination of the type of judgment (numerical vs. spatial) and the judgment-to-response mapping (compatible vs. incompatible). In each trial, the imperative stimulus consisted of one central and one peripheral digit. Digits were of height 1.2 cm and of width 1.0 cm. The central digit was presented above or below a central ﬁxation star with a vertical separation of about 3 mm. The peripheral digit appeared to the left or to the right of the central digit with a horizontal separation of about 3 mm. Participants were instructed to judge the numerical value of the central digit whenever the digits appeared above the center of the screen. This numerical judgment required a decision whether the central digit was of the set f1; 2; 3; 4g or of the set f6; 7; 8; 9g. Participants were instructed to judge the spatial position of the peripheral digit relative to the central one whenever the digits appeared below the center of the screen. The judgment-to-response mapping was cued by color. When the digits were green, participants were required to apply the compatible mapping. When the digits were red, they were to apply the incompatible mapping. Both digits were always presented in the same color. Based on the fact that smaller numbers tend to elicit a leftward response and larger numbers a rightward response (Dehaene, Bossini, & Giraux, 1993), for the numerical task the compatible mapping was deﬁned according to these tendencies. 1 For the incompatible mapping this assignment was reversed. The compatible mapping was reinforced by displaying the digits 1, 2, 3, 4 at the bottom left side and the digits 6, 7, 8, 9 at the bottom right side of the monitor throughout a block of trials. When the central digit was ‘small’, that is, from the range 1 to 4, the left key of the lower row of keys had to be pressed with a compatible mapping (green digits), whereas the right key of the lower row of keys had to be pressed with an incompatible mapping (red digits). Correspondingly, when the central digit was ‘large’, that is, from the range 6 to 9, the right key had to be pressed with a compatible mapping, whereas the left key had to be pressed with an incompatible mapping. For the spatial task, a compatible mapping required a ‘left’ response to a left peripheral digit and a ‘right’ response to a right peripheral digit. With an incompatible mapping, this assignment was reversed. The keys were pressed with the index ﬁngers of the left and right hands. 2.1.4. Design and procedure Each participant took part in two sessions on two consecutive days. Each session was subdivided into a training phase and a test phase. The training phase of the ﬁrst 1 Meanwhile we have learned that these response tendencies cannot always be observed. However, this issue is not crucial for the present topic.

8

T. Kleinsorge et al. / Acta Psychologica 111 (2002) 1–28

day consisted of 128 blocks of ﬁve trials each, the training phase of the second day served as a warm-up and lasted about 10 min after which it was terminated by the experimenter. Each test phase consisted of 256 blocks of ﬁve trials each. Blocks in which an error occurred were repeated at the end of the session. In each block, either the third or the fourth trial was the potential shift trial. The term ‘potential shift trial’ refers to the fact that the possible shifts resulted from a factorial combination of the factors: previous type of judgment (same vs. diﬀerent, relative to the trial before the shift) and previous mapping (same vs. diﬀerent), leading to 25% of blocks in which no task shift was required. The 256 blocks of the test phase resulted from a factorial combination of the factors: type of judgment in the potential shift trial (numerical vs. spatial), mapping in the potential shift trial (compatible vs. incompatible), serial position of the potential shift trial (3 vs. 4), response in the potential shift trial (left vs. right), previous response (same vs. diﬀerent), next response (same vs. diﬀerent) and the already mentioned factors, previous type of judgment and previous mapping. In the training phases, a random subset of these blocks was presented. When the potential shift occurred in the third trial, responses in trials 1 and 5 of the block were determined randomly; when the potential shift occurred in trial 4, the responses of the ﬁrst two trials were chosen randomly. Of the experimental factors, serial position, response and next response served only to balance conditions and were not included in the analyses. Blocks were separated from each other by 5-second intervals during which the monitor switched 10 times between the colors blue and gray at an irregular pace. A block was initiated by simultaneously pressing both upper keys with the two index ﬁngers. This procedure served to segregate the individual blocks and to make higherorder sequential dependencies inoperative. The response–stimulus interval (RSI) was 500 ms.

2.2. Results 2.2.1. Reaction times Our analyses were restricted to the potential shift trials. Nevertheless, whenever an error occurred in any of the ﬁve trials which constituted a block, this block was discarded from the analysis of reaction times. We furthermore screened reaction times for outliers. First, all reaction times longer than 2.5 s were discarded. In a second step, for each participant and each factorial combination of the ﬁve experimental factors (see below) the means and standard deviations were computed and reaction times longer than three standard deviations above the mean were discarded; this procedure was repeated until no more reaction times fell above this criterion. In total, this resulted in an exclusion of 1.7% of trials. The statistical analyses were based on the means of the remaining trials. Individual mean reaction times were determined for each combination of the factors: judgment (numerical vs. spatial), mapping (compatible vs. incompatible), previous judgment (same vs. diﬀerent), previous mapping (same vs. diﬀerent), and previous response (same vs. diﬀerent).

T. Kleinsorge et al. / Acta Psychologica 111 (2002) 1–28

9

In a ﬁrst analysis we averaged for each participant and each relation between successive tasks the mean reaction times of the four tasks. The means across participants are given in Table 1 as a function of the relation between successive tasks. First, whenever the type of judgment was changed, reaction time was longer than when the type of judgment was repeated. Second, when the mapping was changed, this resulted in an increased reaction time when the type of judgment was repeated, but a reduced reaction time when the type of judgment was changed. Third, a response alternation resulted in reaction-time costs when only the response was changed, but in reaction-time beneﬁts when any other task characteristic was changed as well, although these beneﬁts were not fully consistent across the other changes. In terms of the relational factors, previous type of judgment (same vs. diﬀerent), previous mapping (same vs. diﬀerent), and previous response (same vs. diﬀerent), the characteristic shift-cost proﬁle of Table 1 results in a set of rather complex interactions. However, more straightforward analyses are suggested by theoretical considerations. In Table 1 are also given the theoretical durations of task selection and implementation according to our account of shifting among a set of dimensionally organized tasks. This account results in a simple additive model: RT ¼ RT0 þ DS þ DI , with RT0 as baseline for repetitions, DS as the duration of task selection (which is an integer multiple of the switch duration TS ), and DI as the duration of implementation (either 0; TM or TJ ). The simple additive model can be ﬁtted to the observed mean reaction times, and the results as well as the parameter estimates are shown in Table 1. The ﬁt is reasonably well (RMSE: 11.2 ms, r2 ¼ :996), and deviations occur mainly when either the response or the mapping changes in addition to the type of judgment. Of course, a reasonable ﬁt of a set of eight data points with a four-parameter model is in no way surprising. Of more importance is that the formal representation of our account of shifting among a set of dimensionally organized tasks makes obvious the constraints which the theoretical considerations impose on the data (cf. Roberts & Pashler, 2000). Such constraints exist, ﬁrst, with respect to the model parameters. Speciﬁcally, we assume that implementation of a new kind of judgment implies implementation of a new mapping of the outcomes of the judgments to the responses. Thus, the constraint is TJ > TM (or in a weaker version: TJ P TM ). In addition, the duration of switching operations should be larger than zero. For a statistical examination of the constraints on model parameters we ﬁtted the model to the individual mean reaction times of each participant. (Mean RMSE of the ﬁt was 18.4 ms, range from 8.7 to 34.9 ms for the 16 participants.) The mean of the estimated switching duration, TS , was 23 ms, with the 95-% conﬁdence interval ranging from 14 to 33 ms. The mean of the estimated duration of implementing a new type of judgment, TJ , was longer than the mean of the estimated duration of implementing a new mapping, TM , 401 vs. 304 ms, the diﬀerence being signiﬁcant, tð15Þ ¼ 6:3, p < :01. Given the constraints on the parameters, the simple model also predicts particular reaction-time diﬀerences between sub-sets of the eight relations between successive tasks. We examined these predictions by means of a series of contrasts. First, from Table 1 it is apparent that a change of the mapping with a repeated type of judgment should be associated with a longer reaction time than a repetition of the mapping

10

T. Kleinsorge et al. / Acta Psychologica 111 (2002) 1–28

(theoretically the reaction-time diﬀerence is TM þ TS ). The corresponding contrast was highly signiﬁcant, F ð1; 15Þ ¼ 285:0, p < :01. Second, a change of the type of judgment together with a change of the mapping should be associated with a longer reaction time than a repetition of the type of judgment when the mapping changes (theoretically the reaction-time diﬀerence is TJ TM ). Again the corresponding contrast was highly signiﬁcant, F ð1; 15Þ ¼ 26:6, p < :01. Third, and perhaps most important, a change of the mapping with an alternated type of judgment should be associated with a shorter reaction time than a repetition of the mapping (theoretically the reaction-time diﬀerence is TS ). In this case again the contrast reached statistical signiﬁcance, F ð1; 15Þ ¼ 43:0, p < :01. A ﬁnal set of four contrasts was computed for trials with repeated and alternated responses, separately for the four relations between type of judgment and mapping. (According to Table 1 the theoretical diﬀerences are TS .) The response–alternation costs when both the type of judgment and the mapping were repeated were signiﬁcantly diﬀerent from zero, F ð1; 15Þ ¼ 20:5, p < :01, as were the response–alternation beneﬁts when only the mapping was changed, F ð1; 15Þ ¼ 4:6, p < :05. However, the expected response–alternation beneﬁts when the type of judgment was changed were absent (with repeated mapping) or present, but non-signiﬁcant (with alternated mapping). 2 The present analysis is focused on the shift costs as a function of the relation between successive tasks. Thus it is appropriate to abstract from the speciﬁc tasks by way of averaging. For example, average shift costs for shifts from A to B and from B to A can be attributed to the relation between A and B, while the separate costs of shifting from B to A and from A to B do not only reﬂect relational eﬀects, but also task-speciﬁc ones like diﬀerences in implementation durations for tasks A and B or diﬀerences between switching durations. Nevertheless, we wanted to ascertain that the characteristic shift-cost proﬁle of Table 1 does not only accrue from a subset of the four tasks. Of course, for each individual task it can be somewhat distorted by task-speciﬁc eﬀects; in addition, the proﬁle for each individual task is noisier than the mean proﬁle. The shift-cost proﬁles for the four tasks as a function of the relation to the preceding task are shown in Fig. 1. For each task we ran the same set of contrasts as for the mean shift-cost proﬁle. The results are shown in Table 2. While both the reaction-time costs of a change of the mapping with repeated type of judgment (contrast 1) and of the change of the type of judgment with alternated mapping (contrast 2) were consistently signiﬁcant or almost so across the four tasks, the reaction-time beneﬁts of a change of the mapping with alternated type of judgment (contrast 3) were signiﬁcant in only three of the four tasks. Of the contrasts between repeated and alternated responses only a minority were signiﬁcant. However, all signiﬁcant contrasts were associated with the expected diﬀerences between the mean reaction times. Whenever there were diﬀer2 In a general ANOVA, the crucial interaction of previous judgment and previous mapping was signiﬁcant too, F ð1; 15Þ ¼ 327:27, p < :01, as were the interactions of previous judgment, previous mapping, and previous response, F ð1; 15Þ ¼ 8:81, p < :05, and of previous mapping and previous response, F ð1; 15Þ ¼ 7:07, p < :05.

T. Kleinsorge et al. / Acta Psychologica 111 (2002) 1–28

11

Fig. 1. Experiment 1: Mean reaction times as a function of previous type of judgment (J), previous mapping (M), and previous response (R), shown separately for the four tasks.

ences between reaction times that did not conform to the expectations based on the model, these diﬀerences did not only fail to approach statistical signiﬁcance, but they were numerically small in addition. 2.2.2. Errors Error frequencies in the potential shift trials were referred to 256 blocks of trials in each session, while the blocks of trials which were appended because in the original blocks errors occurred in one of the serial positions were neglected. In addition, those blocks of trials were neglected in which the potential shift trials were preceded by an error.

12

T. Kleinsorge et al. / Acta Psychologica 111 (2002) 1–28

Table 2 Results of reaction-time contrasts computed separately for the four dimensionally organized tasks (type of judgment/mapping) Task

1

2

3

4a

Numerical/compatible Numerical/incompatible Spatial/compatible Spatial/incompatible

xx xx xx xx

(x) xx xx xx

xx

xx xx

xx xx

4b

x

4c

4d

x

(1) Repeated vs. alternated mapping with repeated type of judgment; (2) Repeated vs. alternated type of judgment with alternated mapping; (3) Repeated vs. alternated mapping with alternated type of judgment; (4) Repeated vs. alternated response with repeated type of judgment and mapping (a), repeated type of judgment and alternated mapping (b), alternated type of judgment and repeated mapping (c), and alternated type of judgment and mapping (d). xx p < :01; x p < :05; ðxÞ p < :10.

While the model of Table 1 provides a guidance for the analysis of reaction times, it does not so for the analysis of error rates. However, regarding the costs and beneﬁts of response repetitions and alternations, these seem to be reﬂected in reaction times and/or error rates (Rogers & Monsell, 1995). The same could hold for the costs and beneﬁts of repetitions and alternations of other task characteristics. Thus, we analyzed error percentages by means of the same set of contrasts as reaction times. As is apparent from Table 1, with a repeated type of judgment a change of the mapping was associated with a higher error rate than a repeated mapping, F ð1; 15Þ ¼ 10:8, p < :01. Second, a change of the type of judgment together with a change of mapping was associated with a smaller rather than a higher error percentage than a change of only the mapping, F ð1; 15Þ ¼ 8:9, p < :01. Third, a change of the mapping with an alternated type of judgment was also associated with a smaller error percentage than a repeated mapping, F ð1; 15Þ ¼ 5:8, p < :05. The diﬀerences between repeated and alternated responses closely matched the corresponding reaction-time diﬀerences. However, none of the four contrasts reached statistical significance. Statistical signiﬁcance was approached when both the judgment and the mapping alternated, F ð1; 15Þ ¼ 3:8, p < :10, and when only one of these task characteristics was alternated the probability of wrong acceptance of the hypothesis of mean error rates being diﬀerent was less than .15.

2.3. Discussion The purpose of the ﬁrst experiment was to explore the robustness of the pattern of shift costs which can be observed when shifts are among a set of dimensionally organized tasks. In particular the procedure was designed to exclude a number of potential artefacts that can arise when there is no strict control over the context of shift and non-shift trials and/or the context for these two types of trials is not strictly equated. The advantage of the present procedure is that shift and non-shift trials occur after exactly identical series of preceding trials; shift and non-shift trials are

T. Kleinsorge et al. / Acta Psychologica 111 (2002) 1–28

13

taken from identical serial positions, and there is no reason to suspect that expectations of shifts may diﬀer when shift and non-shift trials are presented (as, for example, is the case with the alternating-runs procedure). The fact that with the improved methodology the shift-cost proﬁle, which we have observed earlier with a procedure subject to potential artefacts (Kleinsorge & Heuer, 1999), showed up again, lends credibility to the hypothesis that it indeed reﬂects the durations of processes of taskset reconﬁguration rather than, for example, lagged eﬀects of preceding task shifts. The shift-cost proﬁle can be accounted for by a simple kind of model which is additive in its component times, although it produces a rather complex interactive pattern of eﬀects of the experimental variables. The model embodies a distinction between two diﬀerent processes. The ﬁrst process serves to select the appropriate task representation. This process is hypothesized to consist of a number of switching operations among the dimensionally organized task representations. These switching operations are likely to be of two kinds. Primary switching operations occur at the highest level of the hierarchy of task dimensions where an alternation occurs, and they generalize to all lower levels. Corrective switching operations follow when the primary switching operations have induced inappropriate generalized switches at lower-level task dimensions. The second process serves to make the selected task representation operative. However, the estimated durations of these implementation operations may reﬂect additional processes. For example, diﬀerences in the times needed to identify task cues may contribute to them. Thus, at present the category of implementation operations covers a range of processes the nature of which has to be determined yet. Importantly, not everything we subsume under the label of implementation operations has to run oﬀ after the switching operations have taken place, although we think that the largest portion of what we call implementation is indeed a consequence of having selected an appropriate task set. Formally it is important to note that the implementation operations account for most of the variability in the shift-cost proﬁle. However, they do not account for the small variations superposed on the large diﬀerences between repeated tasks, alternated mappings, and alternated types of judgment. These small variations reﬂect the switching operations in the dimensionally organized task space. It is apparent that our formal account of shifting among dimensionally organized tasks is a simpliﬁed approximation to reality. For example, there is no reason to claim that the assumption of identical durations of all sorts of switches is correct in any strict sense, but the diﬀerences may be rather small and thus negligible. After all, the purpose of the formal representation is not to reach an excellent quantitative ﬁt, but to capture the major ideas of a distinction between switching operations and implementation operations and of a distinction between generalizing primary switching operations and non-generalizing corrective ones. Thus, the formalism is not a purpose in itself, but it serves to clarify the observable consequences of the theoretical claims and to guide the data analysis accordingly. The error proﬁle turned out to be essentially parallel to the reaction-time proﬁle, so that the latter cannot be attributed to a speed–accuracy tradeoﬀ. However, a major diﬀerence between the reaction-time proﬁle and the error proﬁle was that error

14

T. Kleinsorge et al. / Acta Psychologica 111 (2002) 1–28

rates were not increased when the judgment was changed as compared to a change of the mapping. For this there is a simple methodological reason. When participants respond according to the wrong mapping, all responses will be errors (as long as they are correct according to the wrong mapping). But when they respond according to the wrong type of judgment, only 50% of the responses will be errors, because the irrelevant stimulus attribute varied randomly, so that in about 50% of the trials responses to the irrelevant attribute were identical to the (correct) responses to the relevant attribute.

3. Experiment 2 The purpose of the second experiment is to relate our distinction between switching operations and implementation operations as components of task-set reconﬁguration to other process distinctions. Speciﬁcally we ask whether our distinction is identical to the distinction between endogenous and exogenous processes of taskset reconﬁguration (Rogers & Monsell, 1995). Although the distinction between endogenously and exogenously controlled processes of task-set reconﬁguration is quite clear operationally, namely their independence versus dependence on the presentation of the imperative stimulus, the functional diﬀerence between these types of processes is largely unknown. Rogers and Monsell (1995, p. 229) proposed that the endogenous and exogenous components of shift costs basically reﬂect parts of the total duration of the same ‘‘stagelike process of reconﬁguration’’ whose ‘‘completion can be triggered only exogenously by the arrival of a stimulus suitably associated with the task’’. Contrary to that, Meiran (1996, p. 1439) interpreted the (sometimes very small) residual costs for task shifts he observed ‘‘as reﬂecting retroactive adjustment’’, a process that – at least as we interpret it – should be functionally distinct from advance conﬁguration. Finally, in the model of Rubinstein et al. (2001), the endogenous component of shift costs is attributed to a process – goal shifting – which is clearly distinct from the process of rule activation which should result in the exogenous component of shift costs. On a descriptive level there is an obvious similarity of our distinction between switching operations, which serve to select an appropriate task representation, and implementation operations on the one hand and the distinction of Rubinstein et al. (2001) between goal shifting and rule activation on the other hand. However, the databases for the two distinctions are grossly diﬀerent. While our distinction is based on a characteristic proﬁle of shift costs for a set of dimensionally organized tasks, or on a certain pattern of interactive eﬀects of diﬀerent kinds of relations between successive tasks, the distinction of Rubinstein et al. is based largely on additive effects of task precues and complexity of rules on shift costs. This led them to relate their functionally deﬁned hypothesized processes to the operationally deﬁned classes of endogenously and exogenously controlled processes. The present experiment is designed to answer the question whether our distinction can be mapped on the distinction between endogenously and exogenously controlled processes in the same way. The procedure to answer this question is rather straightforward. In one condition

T. Kleinsorge et al. / Acta Psychologica 111 (2002) 1–28

15

we determined the shift-cost proﬁle when the participants were informed about the required kind of judgment and mapping concurrently with the imperative stimulus, while in the second condition the information about the next task was provided by precues. An aﬃrmative answer to our question requires that the precues serve to modify the shift-cost proﬁle in a particular way. Speciﬁcally, switching operations related to the selection of the type of judgment and the mapping should be absent, while only switching operations related to the selection of responses (which were not precued) and implementation operations should remain. In terms of the simple additive model this amounts to the expectations that, ﬁrst, the duration of task selection is reduced to 0 or TS , and second, that the implementation durations TM and TJ remain unchanged when task precues are presented. Unfortunately the apparently straightforward procedure requires that all participants exploit the opportunity for advance task-set conﬁguration that is provided by the precues in all trials. However, this requirement is unlikely to be satisﬁed. For example, Rogers and Monsell (1995) found a reduction of shift costs with long preparation intervals only when their duration was constant during a block of trials, but not when short and long preparation intervals were mixed. This suggests that advance preparation is strategic, but not mandatory. More recently, De Jong (2000) argued that residual shift costs, which remain even after long preparation intervals, result more or less completely from failures to prepare for a forthcoming task. This argument was based on the observation that the reaction-time distribution in shifttrials with long preparation intervals could be represented as a mixture of reactiontime distributions in shift-trials with short preparation intervals (duration of task-set conﬁguration included in reaction times) and reaction time distributions in non-shift trials (duration of task-set conﬁguration not included in reaction times). Thus, in shift trials with long preparation intervals residual shift costs are clearly present when the longest reaction times only are compared with those in non-shift trials, but more or less absent when only the shortest reaction times are compared; mean reaction times, of course, represent a kind of intermediate. As a consequence of the strategic nature of advance task-set conﬁguration, mean reaction times observed for precued trials should be intermediate between those expected and those observed without precues. However, the analysis of De Jong (2000) does also suggest a procedure which at least ameliorates the problem: in precued trials it is mainly the right tail of the reaction-time distribution which is aﬀected by failures to make use of the precues, while the left part of the distribution should mainly comprise reaction times of trials with advance task-set conﬁguration. Therefore we supplemented the analysis of individual mean reaction times of each condition by an analysis of individual ﬁrst quartiles, which should be less aﬀected by failures of advance conﬁguration. 3.1. Method 3.1.1. Participants Eleven female and 13 male participants took part in the experiment. Their mean age was 23.0 years (range: 19–27 years). They were paid for their participation.

16

T. Kleinsorge et al. / Acta Psychologica 111 (2002) 1–28

3.1.2. Tasks and stimuli Apparatus, tasks, and stimuli were the same as in Experiment 1 (Section 2), except that the RSI was 1500 rather than 500 ms. When precues were presented at the beginning of the precue interval (PCI) of 1200 ms, the ﬁxation star was replaced by a red or green arrow (length: 4 mm) which pointed up or down. The direction of the arrow cued the location of the imperative stimuli and thus the type of judgment required; its color cued the color of the imperative stimuli and thus the judgment-toresponse mapping. 3.1.3. Design and procedure Each participant took part in two sessions on two consecutive days. Each session consisted of a training phase and two test phases of 128 blocks each. In the test phases, blocks in which an error occurred were repeated at the end of the phase. Sequences of blocks were constructed as for Experiment 1 (Section 2), except that the serial position of the potential shift trial (third or fourth) was determined randomly for each block within a sequence of 128 blocks with the constraint of equal frequencies and not fully crossed with the other factors that were used for the construction of blocks of trials. Trials were separated by an RSI of 1500 ms which included a precueing interval (PCI) of 1200 (condition with precues) or 0ms (condition without precues). In the latter condition the colored arrow appeared simultaneously with the imperative stimulus. PCI was varied between sessions, with order counterbalanced across participants. 3.2. Results 3.2.1. Reaction times As in Experiment 1 (Section 2), we restricted our analysis to the potential shift trials and screened the data for errors and outliers (3.8% of trials). For each participant and each of the 32 combinations of four tasks and eight relations to the preceding task we determined the mean reaction time and in addition the ﬁrst quartile of the reaction-time distribution, separately for the trials with 0 ms and 1200 ms PCIs. For the following analyses the means and the ﬁrst quartiles for the four tasks were averaged because only these averages capture the eﬀects of the relation between successive tasks unconfounded by task-speciﬁc eﬀects. The means across participants are shown in Table 3 together with the expected values based on the simple additive model. We shall report the results in detail ﬁrst for the individual means and thereafter for the individual ﬁrst quartiles. 3.2.2. Reaction times: individual means As shown in Fig. 2(a), without precues (PCI of 0 ms) the characteristic proﬁle of shift costs showed up again, although the RSI (1500 ms) was considerably longer than in Experiment 1 (500 ms) (Section 2). The simple additive model again provided a reasonable ﬁt (RMSE: 10.7 ms, r2 ¼ :998). When ﬁtted to the individual data of the 24 participants (mean RMSE: 43.9 ms, range: 15.1–98.5 ms), the mean estimate of the switching duration, TS , was 39 ms, with the 95-% conﬁdence interval ranging

Relation J

Mean RT M

R

First RT quartile

PCI ¼ 0

PCI ¼ 1200 a

PCI ¼ 0 b

Error percentage PCI ¼ 1200

c

PCI ¼ 0

PCI ¼ 1200

d

Obs.

Pred.

Obs.

Pred.

Obs.

Pred.

Obs.

Pred.

¼ ¼ ¼ ¼

¼ ¼ 6¼ 6¼

¼ 6¼ ¼ 6¼

608 622 1110 1060

595 634 1105 1066

547 548 734 713

540 555 731 716

463 487 892 864

459 491 894 862

407 418 511 492

409 417 506 498

2.02 1.32 7.79 2.56

1.12 2.20 5.55 4.15

6¼ 6 ¼ 6 ¼ 6¼

¼ ¼ 6¼ 6¼

¼ 6¼ ¼ 6¼

1287 1224 1219 1201

1272 1233 1233 1193

811 798 759 777

801 786 786 771

1042 1017 1010 971

1042 1010 1010 978

519 520 495 511

519 511 511 503

3.01 2.35 1.87 1.67

2.95 2.18 4.04 1.20

a

Parameters: RT0 ¼ 595 ms, TS ¼ 39:4 ms, TM ¼ 431 ms, TJ ¼ 559 ms. Parameters: RT0 ¼ 540 ms, TS ¼ 14:9 ms, TM ¼ 161 ms, TJ ¼ 216 ms. c Parameters: RT0 ¼ 459 ms, TS ¼ 32:3 ms, TM ¼ 371 ms, TJ ¼ 487 ms. d Parameters: RT0 ¼ 409 ms, TS ¼ 7:8 ms, TM ¼ 81 ms, TJ ¼ 87 ms. b

T. Kleinsorge et al. / Acta Psychologica 111 (2002) 1–28

Table 3 Observed and predicted mean reaction times and ﬁrst quartiles as well as error rates for the diﬀerent relations between successive tasks without (PCI ¼ 0 ms) and with (PCI ¼ 1200 ms) task precues

17

18

T. Kleinsorge et al. / Acta Psychologica 111 (2002) 1–28

Fig. 2. Experiment 2: Mean reaction times and ﬁrst reaction time quartiles as a function of previous type of judgment (J), previous mapping (M), and previous response (R), (a) without precue, (b) with precue.

from 18 to 61 ms (cf. Table 3). The mean estimates of the implementation durations TM and TJ were 431 and 559 ms, respectively, the diﬀerence being highly signiﬁcant, tð23Þ ¼ 5:8, p < :01. With precues (PCI of 1200 ms) the shift-cost proﬁle was ﬂatter (cf. Fig. 2(b)). We ﬁtted the additive model again, though – depending on the particular assumptions about advance task-set conﬁguration – it is not fully adequate. However, the inadequateness should not necessarily reduce the goodness of ﬁt considerably, but should be reﬂected mainly in the parameter estimates. In particular, when with the long PCI

T. Kleinsorge et al. / Acta Psychologica 111 (2002) 1–28

19

the number of switching operations is indeed reduced, this should result in a smaller estimate of the switching duration, and when implementation operations are indeed exogenous in nature, the estimated durations with the long PCI should not diﬀer from the estimated durations with the zero PCI. As shown in Table 3, the ﬁt of the model to the mean reaction times was still reasonable (RMSE: 12.0, r2 ¼ :985), but the parameters did not fully conform to expectations. From the ﬁts to individual data (mean RMSE: 51.9 ms, range: 19.7–116.1 ms), the mean estimate of the switching duration, TS , was 15 ms. This was not signiﬁcantly diﬀerent from zero, the 95% conﬁdence interval ranging from )7 to 37 ms, but also not signiﬁcantly smaller than the 39 ms observed without task precues. The mean estimates of the implementation durations TM and TJ were 161 and 216 ms, respectively, again the diﬀerence being signiﬁcant, tð23Þ ¼ 3:7, p < :01. Contrary to expectations both these estimates were signiﬁcantly smaller than without precues, tð23Þ ¼ 7:8, p < :01, and tð23Þ ¼ 8:7, p < :01, respectively. In addition, the diﬀerence between the two kinds of implementation costs was signiﬁcantly smaller with (55 ms) than without (128 ms) precues, tð23Þ ¼ 2:6, p < :05. Thus, the durations of implementation operations were reduced and became more similar when advance taskset conﬁguration was possible. We computed the same series of contrasts as in Experiment 1 (Section 2) . In addition we compared the contrasts in the no-precue and precue conditions. First, when the type of judgment is repeated, a change of the mapping should be associated with reaction-time costs. This was the case both without, F ð1; 23Þ ¼ 121:4, p < :01, and with precues, F ð1; 23Þ ¼ 103:3, p < :01. In addition, the costs were higher without than with precues, F ð1; 23Þ ¼ 70:1, p < :01. Second, when the mapping is changed, a change of the type of judgment should be associated with a longer reaction time than a repetition of the type of judgment because of the diﬀerence in implementation durations (TJ > TM ). This diﬀerence should be unaﬀected by the presence of precues, provided that only switching operations are endogenously controlled. The respective contrast was highly signiﬁcant without precues, F ð1; 23Þ ¼ 37:2, p < :01, as well as with precues, F ð1; 23Þ ¼ 9:2, p < :01. Nevertheless, as is apparent from Table 3, the reaction-time diﬀerence was larger without than with precues, F ð1; 23Þ ¼ 7:3, p < :05. Third, when the type of judgment is changed, a change of the mapping should be associated with shorter reaction times than a repetition of the mapping. This diﬀerence should be due to a diﬀerence in the number of switching operations and not be observed in precue conditions. The respective contrast was highly signiﬁcant without precues, F ð1; 23Þ ¼ 13:0, p < :01, and just failed to reach statistical signiﬁcance with precues, F ð1; 23Þ ¼ 4:0, p < :10. The interaction was non-signiﬁcant, F < 1. The reaction-time diﬀerences between response repetitions and alternations conformed to expectations without precues, although they did not always reach statistical signiﬁcance. Only the reaction-time costs of response repetitions when the mapping was changed, but not the type of judgment, F ð1; 23Þ ¼ 4:8, p < :05, and when the type of judgment was changed, but not the mapping, F ð1; 23Þ ¼ 10:7, p < :01, were signiﬁcant. With precues the response–repetition beneﬁts and costs did not always conform to expectations and in no case even approached statistical

20

T. Kleinsorge et al. / Acta Psychologica 111 (2002) 1–28

signiﬁcance. The same was true for the four interaction contrasts to compare the response–repetition beneﬁts and costs in conditions without and with precues.

3.2.3. Reaction times: individual ﬁrst quartiles The analysis of the individual reaction-time means revealed a ﬂattening of the shift-cost proﬁle in conditions with task precues that was not only due to a reduced number of switching operations, but also to shorter durations of implementation operations. While individual mean reaction times are aﬀected by trials in which the information provided by the precues is not exploited for advance task-set conﬁguration, for the individual ﬁrst quartiles of the reaction-time distributions in the various experimental conditions this confound is absent or at least weaker. Thus, the tendency of the shift-cost proﬁle to ﬂatten out should be intensiﬁed provided that it is indeed due to advance task-set conﬁguration. Without precues (PCI of 0 ms) the mean ﬁrst quartiles exhibited the characteristic shift-cost proﬁle (cf. Fig. 2(a)), and – as is evident from Table 3 – the simple additive model provided an excellent ﬁt (RMSE: 4.2 ms, r2 ¼ :9996). When ﬁtted to the individual data of the 24 participants (mean RMSE: 29.2 ms, range: 11.5–58.4 ms), the mean of the estimated switching duration, TS , was 32 ms, with the 95-% conﬁdence interval ranging from 19 to 46 ms. The mean estimates of the implementation durations TM and TJ were 371 and 487 ms, respectively, the diﬀerence being highly significant, tð23Þ ¼ 9:0, p < :01. With precues (PCI of 1200 ms) the simple additive model still provided a reasonable ﬁt, as can be seen in Table 3 (RMSE: 7.7 ms, r2 ¼ :983), though the shift-cost proﬁle was rather ﬂat (Fig. 2(b)). From the ﬁts to individual data (mean RMSE: 29.7 ms, range: 10.7–62.3 ms) the mean estimate of switching duration, TS , was 8 ms, with the 95-% conﬁdence interval ranging from )8 to 24 ms. This estimate was thus not signiﬁcantly diﬀerent from zero, and it was signiﬁcantly smaller than the 32 ms observed without precues, tð23Þ ¼ 2:5, p < :05. The mean estimates of implementation durations TM and TJ were 81 and 87 ms. The diﬀerence between implementation durations was not signiﬁcant, tð23Þ ¼ 0:7, and both kinds of implementation durations were signiﬁcantly smaller than in conditions without precues, tð23Þ ¼ 8:2, p < :01, and tð23Þ ¼ 12:3, p < :01, respectively. In addition, the diﬀerence between the two kinds of implementation costs was signiﬁcantly smaller with (5 ms) than without (116 ms) precues, tð23Þ ¼ 8:8, p < :01. We computed the same set of contrasts as for the individual mean reaction times. First, with a repeated type of judgment the reaction-time costs associated with a change of the mapping were signiﬁcant both without, F ð1; 23Þ ¼ 108:0, p < :01, and with precues, F ð1; 23Þ ¼ 53:9, p < :01, but without precues the costs were significantly higher than with precues, F ð1; 23Þ ¼ 75:8, p < :01. Second, with an alternated mapping there were reliable costs of a change of the type of judgment without precues, F ð1; 23Þ ¼ 73:5, p < :01, but not with precues, F ð1; 23Þ < 1; without precues the costs were signiﬁcantly larger than with precues, F ð1; 23Þ ¼ 52:2, p < :01. Third, with an alternated type of judgment there were reliable beneﬁts of a change of the mapping without precues, F ð1; 23Þ ¼ 15:6, p < :01, but not with precues, F ð1;

T. Kleinsorge et al. / Acta Psychologica 111 (2002) 1–28

21

23Þ ¼ 2:4, p > :10; however, the interaction failed to reach statistical signiﬁcance, F ð1; 23Þ ¼ 2:3, p > :10. The reaction-time diﬀerences between response repetitions and alternations conformed to expectations without precues, but did not always reach statistical signiﬁcance. Only the response–repetition beneﬁts with repeated type of judgment and repeated mapping, F ð1; 23Þ ¼ 5:3, p < :05, as well as the response–repetition costs with alternated type of judgment and alternated mapping, F ð1; 23Þ ¼ 8:4, p < :01, were signiﬁcant. With precues the reaction-time diﬀerences between response repetitions and alternations did not consistently conform to expectations, and the corresponding contrasts did in no case even approach statistical signiﬁcance. Comparing conditions without and with precues, there was a single signiﬁcant interaction contrast: with alternated type of judgment and alternated mapping the response– repetition costs observed without precues were signiﬁcantly diﬀerent from the response–repetition beneﬁts observed with precues F ð1; 23Þ ¼ 7:0, p < :05. The mean ﬁrst quartiles of the individual reaction-time distributions, as shown in Fig. 2, together with the analysis of the model parameters and the series of contrasts, do strongly suggest that the characteristic reaction-time proﬁle as a function of the relation between successive tasks is essentially absent when precues are presented and the data are only little confounded by trials without advance conﬁguration of task sets. The only eﬀect of the relation between successive tasks which remained in the presence of task precues was an increase of reaction time of the order of 100 ms whenever the type of judgment and/or the mapping was changed. In fact, post hoc analyses of the data obtained with the PCI of 1200 ms revealed that there were two sets of relations in terms of reaction times, those with repeated type of judgment and mapping, and those with at least one of these task characteristics being changed. Reaction times of each of the ﬁrst set of relations were signiﬁcantly diﬀerent from reaction times of each of the second set of relations between successive tasks, while there were no signiﬁcant diﬀerences between the reaction times for the relations within each of the two sets.

3.2.4. Errors Error percentages are shown in Table 3. As in Experiment 1 (Section 2), we analyzed them by means of the same set of contrasts as the reaction times. It is apparent from Table 3 that, when the kind of judgment was repeated, a change of the mapping was associated with a higher error rate than a mapping repetition both without, F ð1; 23Þ ¼ 11:9, p < :01, and with precues, F ð1; 23Þ ¼ 12:2, p < :01. The interaction failed to reach statistical signiﬁcance, F ð1; 23Þ < 1. When the mapping was alternated, a change of the kind of judgment was associated with a smaller error rate than a repetition, again both without, F ð1; 23Þ ¼ 11:5, p < :01, and with precues, F ð1; 23Þ ¼ 10:6, p < :01. Although numerically this diﬀerence appeared smaller with the long PCI, the interaction was far from approaching signiﬁcance, F ð1; 23Þ < 1. Finally, the diﬀerence between same and diﬀerent mappings with an alternated type of judgment, which was signiﬁcant in Experiment 1 (Section 2), failed to reach signiﬁcance in both conditions with diﬀerent PCIs.

22

T. Kleinsorge et al. / Acta Psychologica 111 (2002) 1–28

As in Experiment 1 (Section 2), the diﬀerences between repeated and alternated responses were somewhat inconsistent as far as their statistical signiﬁcance was concerned. Without precues only one of the four contrasts reached signiﬁcance, namely the response–repetition costs when only the mapping was changed, F ð1; 23Þ ¼ 13:8, p < :01. These response–repetition costs in addition were reliably larger than those observed with precues, F ð1; 23Þ ¼ 5:0, p < :05. With precues again only one of the four contrasts reached signiﬁcance; this time it was the response–repetition costs when both the kind of judgment and the mapping were changed, F ð1; 23Þ ¼ 8:2, p < :01. They were reliably larger than those observed without precues, as indicated by a signiﬁcant interaction, F ð1; 23Þ ¼ 4:5, p < :05. 3.3. Discussion The present experiment was designed to answer the question whether the distinction between switching operations and implementation operations, which is based on a characteristic proﬁle of reaction-time costs for shifts among a set of dimensionally organized tasks, coincides with the distinction between endogenously and exogenously controlled processes of task-set reconﬁguration, which is based on the eﬀects of task precues on shift costs. The hypothesis of such a coincidence has been suggested by a model of Rubinstein et al. (2001) which claims that a process called goal shifting is endogenously controlled, but a process called rule activation exogenously. In terms of the functions subserved by these processes there is an apparent parallel between switching operations and goal selection as well as between implementation operations and rule activation. The data do not conform to this hypothesis. Without precues we again observed the characteristic proﬁle of reaction times as a function of the relation between successive tasks both for individual means and ﬁrst quartiles of reaction-time distributions. With precues the intricate modulations of the reaction times were reduced for the means and essentially absent for the ﬁrst quartiles, which are less aﬀected by trials in which the precues are not exploited for advance conﬁguration of task sets. All what remained was a residual shift cost of the order of 100 ms whenever the type of judgment and/or the mapping were changed, independent of the particular relation between successive tasks. Thus, on the one hand both switching operations and implementation operations turned out to be largely endogenously controlled. On the other hand, the characteristic pattern of response–repetition beneﬁts and costs which should have survived the introduction of task precues because responses were not precued was not clearly present. While implementation operations turned out to be largely endogenously controlled, this was not fully the case, but there remained residual shift costs. There are at least three possible explanations of this ﬁnding, and at present it is impossible to decide between them with a suﬃcient degree of certainty. First, according to De Jong’s claim, these residual shift costs could reﬂect failures of advance preparation (cf. De Jong, 2000). From the results obtained with the individual mean reaction times it is clear that such failures were present in this experiment. Although the use of ﬁrst quartiles reduces the eﬀects of failures of advance preparation on the

T. Kleinsorge et al. / Acta Psychologica 111 (2002) 1–28

23

data, it does not necessarily eliminate them. In fact, the unreliable variations of reaction times across the six relations between successive tasks with type of judgment and/or mapping being changed appear not fully random. For example, across the six relations the means observed with precues are still correlated with the means observed without precues (r ¼ 0:53). Thus, while there is no clear-cut evidence in favor of a role of failures of advance preparation, there is also no clear-cut evidence against it. Although De Jong (2000) found that residual shift costs could be fully accounted for in terms of failures of advance preparation, he also suggested that ‘‘true’’ residual shift costs might arise with other populations than young healthy adults or with more complex tasks than the ones he studied. From the perspective of ‘‘true’’ residual shift costs, our ﬁndings could simply imply that implementation operations are basically a homogeneous set of operations which, however, cannot be completed endogenously (cf. Rogers & Monsell, 1995). Alternatively, and perhaps more likely, what we have called ‘‘implementation operations’’ could be a set of heterogeneous processes, some of which are endogenously controlled, while others are exogenously controlled. If this view is correct, questions regarding the nature of the two kinds of processes arise. Thus far not many facts are known on which answers could be based. The analysis of individual ﬁrst quartiles does strongly suggest that the two kinds of processes do diﬀer in that the durations of the endogenously controlled processes depend on the relation between successive tasks, while the durations of the exogenously controlled processes exhibit no (reliable) dependence of this kind. Although the present experiments were focused on the relations between successive tasks rather than on shift costs associated with shifts between particular tasks, we examined task-speciﬁc eﬀects in an exploratory manner. In particular we determined shift costs as a function of the new task. For each of the four combinations of the two types of judgments and the two judgment-to-response mappings we computed the overall residual shift costs. These were the diﬀerences between the mean ﬁrst quartiles observed for all six relations to the preceding task with the type of judgment and/or the mapping being changed (the six rightmost quartile bars in Fig. 2(b)) on the one hand and the mean ﬁrst quartiles observed for the two relations to the preceding task with the type of judgment and the mapping being repeated (the two leftmost quartile bars in Fig. 2(b)) on the other hand. The overall residual shift costs were 81, 205, 44 and 50 ms for numerical judgments with compatible and incompatible mappings and spatial judgments with compatible and incompatible mappings, respectively. A one-way ANOVA of these residual shift costs, which mainly reﬂect the durations of exogenously controlled processes, revealed that the diﬀerences between tasks were statistically signiﬁcant, F ð3; 69Þ ¼ 33:9, p < :01. We performed the same computations for the conditions without task precues and found overall shift costs of 515, 597, 429, and 423 ms for the four tasks. The diﬀerences between the two conditions without and with precues give estimates of the components of shift costs which reﬂect the durations of (potentially) endogenously controlled operations (434, 392, 385, and 373 ms for the four tasks), and a

24

T. Kleinsorge et al. / Acta Psychologica 111 (2002) 1–28

one-way ANOVA revealed that these were not signiﬁcantly diﬀerent between tasks, F ð3; 69Þ < 1. Thus, we tentatively conclude that while the endogenous but not the exogenous components of the shift costs depend on the relation between successive tasks, the exogenous but not the endogenous components depend on the nature of the new task. Their order across tasks does roughly correspond to task complexity as indicated by the reaction times in repetition trials (cf. Fig. 1), with the numerical/incompatible task being of highest complexity and the spatial-judgment tasks of lowest complexity. Note that this pattern of results contrasts with the one usually interpreted as evidence for task-set inertia, where costs are higher for shifts to an easier task (e.g., Allport et al., 1994). However, this ﬁnding seems to be restricted to a rather narrow class of situations (cf. Monsell, Yeung, & Azuma, 2000), so it is not too surprising not to observe this ﬁnding in the present experiments. We repeated the same kind of analysis with the individual mean reaction times. If indeed the durations of exogenously controlled processes depend on the nature of the current task, while the durations of endogenously controlled processes do not, this should also be evident from the individual mean reaction times; the ﬁnding should be robust against confounds of the estimates of the durations of exogenously controlled processes by (potentially) endogenously controlled processes. In fact, the residual shift costs estimated from the individual means were 257, 319, 166, and 129 ms for numerical judgments with compatible and incompatible mapping and spatial judgments with compatible and incompatible mapping, respectively. The diﬀerences between tasks were highly signiﬁcant, F ð3; 69Þ ¼ 14:2, p < :01. Without task precues the overall shift costs were 621, 685, 480, and 489 ms for the four tasks, and the differences between conditions without and with precues were 365, 366, 314, and 361ms. These estimates of the durations of endogenously controlled processes were not signiﬁcantly diﬀerent, F ð3; 69Þ < 1. Of course, these estimates are too short overall, because in a number of trials (potentially) endogenously controlled processes were not performed in advance of the imperative signal, so that their durations were included in the residual shift costs and added to the estimates of the durations of exogenously controlled processes. The observation that the exogenous components of shift costs were largely dependent on the current task is inconsistent with the assumption that residual shift costs depend mainly on the characteristics of the previous task (e.g., Allport et al., 1994; Wylie & Allport, 2000). Although the reasons for this discrepancy are not entirely clear, perhaps one important factor in this respect is task complexity. Although at present this is a speculation, episodic carry-over eﬀects may be especially pronounced with very simple tasks while active reconﬁguration processes become more important with more complex tasks, with our tasks being relatively complex compared to other tasks usually employed in task-shifting research. The present data do only weakly constrain functional interpretations of endogenously and exogenously controlled implementation operations. However, one of the apparent possibilities is that the endogenously controlled operations serve to implement the basic task-control structure, that is, the neural network which is required for successful performance of the forthcoming task, while the exogenously controlled operations provide some consolidation or retroactive adjustment (Meiran, 1996) of

T. Kleinsorge et al. / Acta Psychologica 111 (2002) 1–28

25

the implemented control structures. The beneﬁts that arise from such a process, that is, the reduction of reaction time by performing a task repeatedly (at least once), should be related to the complexity of the control structure and thus likely to the complexity of the task, but not to the task in preceding trials. According to these considerations implementation operations can be subdivided into conﬁguration of a task-control structure, which is an endogenously controlled process and does not depend on task complexity, and facilitation of the conﬁgured task-control structure, which is necessarily an exogenously controlled process because it requires that the control structure be put to use. While the residual shift costs turned out to be independent of the relation between successive tasks, this was not the case for the error rates in conditions with task precues. In fact, the error proﬁles observed in conditions without and with precues did not diﬀer consistently, and even the few statistically signiﬁcant diﬀerences appeared not particularly systematic. Given that error rates are often somewhat unreliable, it is not really justiﬁed to base any ﬁrm conclusion on these ﬁndings, except, of course, that the reaction-time proﬁle cannot be attributed to a speed–accuracy tradeoﬀ. In addition, the error data are confounded by failures of advance preparation. While for the reaction times it is possible to avoid such a confound at least to some degree by way of focusing the analysis on the faster reaction times, a similar procedure is not possible for the error data. Finally, with task precues the characteristic pattern of response–repetition beneﬁts (when type of judgment and mapping are repeated) and response–repetition costs (when type of judgment and/or mapping are changed) was largely absent. Although this pattern tends to be somewhat unreliable across experiments, it was clearly present without precues, and with task precues there were exceptionally clear deviations. This ﬁnding was unexpected because responses were not precued. It can be taken to suggest that task precues are processed diﬀerently than task cues, perhaps without generalizing switches which account for the response–alternation beneﬁts in taskshift trials. But the ﬁndings on response–repetitions costs and beneﬁts are too unstable to allow any ﬁrm conclusion at present.

4. General discussion The present experiments had been designed to corroborate and extend previous experimental ﬁndings on shifting among a set of dimensionally organized tasks and to substantiate and diﬀerentiate the theoretical considerations which had been suggested by the original results (Kleinsorge & Heuer, 1999). Experiment 1 (Section 2) served to eliminate a number of methodological concerns and to test the expectations based on the theoretical considerations in a more speciﬁc way than had been possible before. Experiment 2 (Section 3) showed that our distinction of switching operations and implementation operations does not coincide with the operational distinction of endogenously and exogenously controlled processes of shifting between tasks. Instead, both kinds of operations turned out to be largely endogenously controlled, although this was not fully the case for implementation operations.

26

T. Kleinsorge et al. / Acta Psychologica 111 (2002) 1–28

Beyond the additional evidence for a hierarchical switching mechanism presented in this article we recently were able to bolster our account against a speciﬁc alternative explanation in terms of episodic carry-over. This alternative account is based on the event-ﬁle concept of Hommel (1998) that was recently extended to an explanation of shift costs by P€ osse and Hommel (1999). According to these authors, the elementary features of stimuli and responses of a certain trial become bound into one or several episodic memory traces. When on a subsequent trial any of the features of the preceding trial is repeated this results in a retrieval of the corresponding episodic memory trace. There will be a mismatch of the retrieved information and the features of the current trial whenever some feature is repeated whereas another feature is changed across trials, with this mismatch leading to shift costs. Applied to our experiments, such a mismatch would occur whenever a change of only one of the task dimensions type of judgment and mapping is required. Thus, for both these relations between successive tasks reaction times should be longer than when a change of both the type of judgment and the mapping is required. In fact, our observation of especially high shift costs for a change of the type of judgment only (with mapping repeated) can well be accounted for in terms of episodic carry-over. However, the same line of reasoning would predict especially high shift costs for a change of the mapping only (with type of judgment being repeated). For this relation between successive tasks, however, shift costs were consistently smaller than for changes on both dimensions rather than longer. To further strengthen our approach against an explanation in terms of episodic bindings we recently conducted an experiment in which we employed three instead of two kinds of judgments, three instead of two mapping rules, and three response alternatives (Kleinsorge, Heuer, & Schmidtke, 2001, Exp. 2). When the shift-cost proﬁle we observed with binary task dimensions is indeed a result of episodic bindings, a similar proﬁle should be observed with three-valued task dimensions because a partial feature mismatch across subsequent trials should occur regardless of whether only one or two alternatives are available on a certain task dimension. A generalizing switching mechanism, in contrast, should break down with three-valued task dimensions because no ‘other’ value is uniquely speciﬁed on hierarchically lower task dimensions to which a generalized switch could be directed to. In line with this reasoning, and supporting our notion of generalizing switches, we observed shift costs monotonously growing as a function of the number of task dimensions on which a change took place across subsequent trials. (For minor modiﬁcations of this statement see Kleinsorge et al. (2001)). The latter observations provide evidence against an alternative explanation of our ﬁndings in terms of a speciﬁc form of episodic carry-over. Of course, this does not preclude that other forms of episodic carry-over eﬀects occurred in our experiments. However, these should not have resulted in serious distortions of our results. First, our main conclusions are based on data that are averaged across particular transitions between speciﬁc tasks. This way, asymmetric carry-over eﬀects like those reported by Allport et al. (1994) in favor of their concept of task-set inertia should have become leveled. Second, as observed in Experiment 2 (Section 3), the characteristic shift-cost proﬁle we interpret as evidence for generalized switches is part of the

T. Kleinsorge et al. / Acta Psychologica 111 (2002) 1–28

27

endogenous component of shift costs and not subject to passive dissipation like episodic carry-over eﬀects are assumed to be (e.g. Meiran et al., 2000). The fact that generalized switching apparently breaks down with three-valued task dimensions might be interpreted as severely limiting the applicability of our model to real-life situations. However, choices between only two alternatives are quite common in everyday life, and it is not too hard to ﬁnd examples for binary decisions on two hierarchically ordered ‘dimensions’. For instance, a choice situation like ‘Will I visit John or Mary, and will I travel by car or by train’ is structurally analogous to the situations realized in our experiments. When one has ﬁrst decided to visit John using the car but then gets a call by Mary asking to visit her, a tendency to do so by taking the train would be in line with a generalized switch. We have no relevant data to substantiate the plausibility of such a tendency with respect to realworld problems, and it should be noted that even in our experiments participants actually do not behave according to the response tendencies released by generalized switches but in most cases perform the required shift with the help of corrective re-switches. Nevertheless, the demonstration that the structure of our experimental situation can easily be transferred to real-life situations argues against the claim that the restriction to binary choices makes the model ecologically irrelevant. To summarize, at present our account of shift costs with hierarchically ordered task dimensions involves three kinds of processes. First, there are generalizing switching operations which serve to select the appropriate task representation and have the key property that switches on a higher level of a task hierarchy generalize upon lower levels and necessitate corrective re-switches whenever the feature represented on a lower level remains unchanged across trials. These switching operations are indexed by the costs of feature repetitions. Second, there is one kind of implementation operation that is endogenously controlled. The duration of this kind of implementation operation depends on the relation between successive tasks but not on the speciﬁc task that has to be performed. The estimates of this component of shift costs cover also those processes that are associated with the encoding of task (pre-)cues. (As mentioned above, we do not assume that all the processes we subsume under the label of implementation operations run oﬀ after the switching operations.) Third, there is a class of implementation operations that is exogenously controlled. Their duration depends on the complexity of the task to be performed but not on the relation between successive trials. This feature makes them similar to the process of rule activation postulated by Rubinstein et al. (2001), though ‘‘activation’’ then has to be understood in the narrow sense of ‘‘using the rule at least once’’ and not in the broader sense of ‘‘making the rule operative’’.

Acknowledgements We thank Dana Lembke for assistance in analyzing the data and Matthias Manka and Petra Wallmeyer for conducting the experiments. The research reported in this paper was supported by grant Kl 1205/1-1 of the Deutsche Forschungsgemeinschaft to the ﬁrst two authors.

28

T. Kleinsorge et al. / Acta Psychologica 111 (2002) 1–28

References Allport, D. A., Styles, E. A., & Hsieh, S. (1994). Shifting intentional set: Exploring the dynamic control of tasks. In C. Umilta, & M. Moscovitch (Eds.), Attention and performance XV (pp. 421–452). Hillsdale, NJ: Erlbaum. Dehaene, S., Bossini, S., & Giraux, P. (1993). The mental representation of parity and number magnitude. Journal of Experimental Psychology: General, 122, 371–396. De Jong, R. (2000). An intention-activation account of residual switch costs. In S. Monsell, & J. Driver (Eds.), Attention and performance XVIII: Cognitive control (pp. 357–376). Cambridge, MA: MIT Press. Gopher, D., Armony, L., & Greenshpan, Y. (2000). Switching tasks and attention policies. Journal of Experimental Psychology: General, 129, 308–339. Goschke, T. (2000). Intentional reconﬁguration and involuntary persistence in task-set switching. In S. Monsell, & J. Driver (Eds.), Attention and performance XVIII: Cognitive control (pp. 331–355). Cambridge, MA: MIT Press. Hommel, B. (1998). Event ﬁles: Evidence for automatic integration of stimulus–response episodes. Visual Cognition, 5, 183–216. Kirby, N. (1980). Sequential eﬀects in choice reaction time. In A. T. Welford (Ed.), Reaction times (pp. 123–172). London: Academic Press. Kleinsorge, T., 1997. Die Kodierungsspeziﬁt€at von Reiz-Reaktions-Kompatibilit€at [Coding speciﬁcity of SR compatibility]. Unpublished doctoral dissertation, Ruhr-Universit€at Bochum, Bochum, Germany. Kleinsorge, T. (1999). Response repetition beneﬁts and costs. Acta Psychologica, 103, 295–310. Kleinsorge, T. (2000). Explorations in the structure of task spaces. Psychologica Belgica, 40, 247–257. Kleinsorge, T., & Heuer, H. (1999). Hierarchical switching in a multi-dimensional task space. Psychological Research, 62, 300–312. Kleinsorge, T., Heuer, H., & Schmidtke, V. (2001). Task-set reconﬁguration with binary and three-valued task dimensions. Psychological Research, 65, 192–201. Mayr, U., & Keele, S. W. (2000). Changing internal constraints on action: The role of backward inhibition. Journal of Experimental Psychology: General, 129, 4–26. Meiran, N. (1996). The reconﬁguration of processing mode prior to task performance. Journal of Experimental Psychology: Learning, Memory and Cognition, 22, 1423–1442. Meiran, N. (2000). Modeling cognitive control in task-switching. Psychological Research/Psychologische Forschung, 63, 234–249. Meiran, N., Chorev, Z., & Sapir, A. (2000). Component processes in task switching. Cognitive Psychology, 41, 211–253. Monsell, S. (1996). Control of mental processes. In V. Bruce (Ed.), Mysteries of the mind: Tutorial essays on cognition (pp. 93–148). Hove, UK: Erlbaum. Monsell, S., Yeung, N., & Azuma, R. (2000). Reconﬁguration of task-set: Is it easier to switch to the weaker task? Psychological Research, 63, 250–264. P€ osse, B., & Hommel, B. (1999). The role of task relevant and irrelevant dimensions in task switching. In A. Vandierendonck, M. Brysbaert, & K. van der Goten (Eds.), Eleventh conference of the European society for cognitive psychology: Proceedings (p. 200). Gent: ESCoP/Academia Press. Roberts, S., & Pashler, H. (2000). How persuasive is a good ﬁt? A comment on theory testing. Psychological Review, 107, 358–367. Rogers, R. D., & Monsell, S. (1995). The cost of a predictable switch between simple cognitive tasks. Journal of Experimental Psychology: General, 124, 207–231. Rubinstein, J., Meyer, D. E., & Evans, J. E. (2001). Executive control of cognitive processes in task switching. Journal of Experimental Psychology: Human Perception and Performance, 27, 763–797. Wylie, G., & Allport, A. (2000). Task switching and the measurement of ‘‘switch costs’’. Psychological Research, 63, 212–233.

Processes of task-set reconfiguration: switching operations and implementation operations

Processes of task-set reconfiguration: switching operations and implementation operations

Recommend Documents