E-Prime User’s Guide
Appendix B: Considerations in Research
Page A-35
Statistical Analysis of RT Data
While this brief review of single-trial RT research cannot include an extensive discussion of data
analysis, a few points deserve comment.
The typical analysis of single-trial RT data employs the analysis of variance (ANOVA) to compare
mean RT’s under various treatment conditions as defined by the levels of the independent
variables. For within-subjects variables, a repeated-measures ANOVA is employed. Sometimes,
both within- and between-subjects factors occur in the same experiment, resulting in a mixed
ANOVA. For the example experiment on letter identification, there are two independent variables
that define types of trials for the analysis. One is the location of the stimulus, which has six levels
(0, 1, 2, 4, 8, and 16°). The other is whether or not it was adjusted in size to correct for poor
acuity, which has two levels (adjusted or not). For this analysis, the mean RT for each of the
twelve conditions would be calculated for each subject, and those values would serve as the
data. The ANOVA then compares the means of those values based on all subjects to determine
statistical significance.
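As an illustrative sketch (not E-Prime code), the per-subject condition means that serve as the ANOVA data could be computed as below. The trial records, condition labels, and RT values are hypothetical; in the letter-identification example each subject would contribute one mean per cell of the 6 (location) x 2 (adjusted) design.

```python
# Sketch: compute each subject's mean RT per condition; these means are
# the data submitted to the (repeated-measures) ANOVA.
# All trial records and values below are hypothetical.
from collections import defaultdict
from statistics import mean

trials = [
    # (subject, location_deg, adjusted, rt_ms, correct)
    ("S1", 0, False, 412, True),
    ("S1", 0, False, 398, True),
    ("S1", 16, True, 455, True),
    ("S1", 16, True, 501, False),  # error trial: excluded from RT means
    ("S2", 0, False, 430, True),
    ("S2", 16, True, 472, True),
]

cell_rts = defaultdict(list)
for subj, loc, adj, rt, correct in trials:
    if correct:  # RT analyses are based on correct trials only
        cell_rts[(subj, loc, adj)].append(rt)

# One mean per subject per condition.
cell_means = {cell: mean(rts) for cell, rts in cell_rts.items()}
```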
In addition to an analysis of RT’s, there should be a parallel analysis of error rates, expressed
either as percent correct or as percent error. (Since percent error is just 100 minus the percent
correct, these analyses yield the same result.) In general, error rates should parallel RT’s: faster
conditions have lower error rates. If faster RT’s are associated with higher error rates, a speed-
accuracy trade-off should be suspected, and RT differences should be interpreted only with
extreme caution.
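A minimal screening for a speed-accuracy trade-off can be sketched as follows, using hypothetical condition-level summaries: if error rates rise with mean RT, the expected pattern holds; if they fall as RT rises, a trade-off should be suspected.

```python
# Sketch: flag a possible speed-accuracy trade-off from condition summaries.
# Condition names and values are hypothetical.
condition_summary = {
    # condition: (mean_rt_ms, percent_error)
    "near/unadjusted": (410, 2.1),
    "far/adjusted": (455, 3.2),
    "far/unadjusted": (480, 4.5),
}

# Order conditions from fastest to slowest mean RT.
by_rt = sorted(condition_summary.values())
errors_in_rt_order = [err for _, err in by_rt]

# Errors rising with RT is the expected pattern; errors falling as RT
# rises (a non-monotone or reversed ordering) suggests a trade-off.
tradeoff_suspected = errors_in_rt_order != sorted(errors_in_rt_order)
```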
In almost all instances, RT analyses are based on correct trials only. It is good practice to
examine the overall error rate for each subject. While what constitutes an acceptable rate will
differ with different experiments, it is common practice to delete the data from any subject whose
error rates are clearly higher than the norm. In this case, it is likely that the subject either
misunderstood the instructions or was simply unable to perform the task. If at all possible, the
maximum error rate should be set in advance, so that there is no danger of deleting a subject’s
data because they do not conform to the expected results. Pilot testing should help in setting a
maximum error rate.
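The subject-screening rule described above can be sketched as follows; the criterion value and per-subject error rates are hypothetical, with the criterion fixed in advance (e.g., from pilot testing) rather than after inspecting the results.

```python
# Sketch: screen subjects against a preset maximum error rate.
# The criterion and error rates below are hypothetical.
MAX_ERROR_RATE = 0.15  # assumed to be set in advance of data collection

subject_error_rates = {"S1": 0.04, "S2": 0.06, "S3": 0.31}

# Subjects exceeding the preset criterion are dropped from the analysis.
excluded = [s for s, rate in subject_error_rates.items()
            if rate > MAX_ERROR_RATE]
```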
Another issue for analysis of RT data involves outliers, or extremely deviant RT’s that occur on
occasional trials. These usually involve extremely slow RT’s. Many researchers assume that
such extreme RT’s reflect momentary inattention or confusion and that they are therefore properly
omitted from the analysis prior to calculating the mean RT by condition for individual subjects. A
common criterion is to omit any trials whose RT is more than three standard deviations from the
mean for that condition. That can be done either based on the mean and standard deviation of
RT for all subjects, or for individual subjects. The latter is clearly indicated if there are large
differences in RT between subjects. More sophisticated schemes for treating outliers have been
suggested (Ratcliff, 1993; Ulrich & Miller, 1994).
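The three-standard-deviation rule, applied per subject per condition, can be sketched as below. The RT values are hypothetical; note that with very few trials per condition no point can lie three SDs from the mean, so the rule is only meaningful with a reasonable number of observations.

```python
# Sketch: omit trials whose RT is more than three standard deviations
# from the mean for that condition (here, for one subject's condition).
# RT values are hypothetical.
from statistics import mean, stdev

def trim_outliers(rts, criterion=3.0):
    """Drop RTs more than `criterion` SDs from the condition mean."""
    if len(rts) < 2:
        return list(rts)
    m, sd = mean(rts), stdev(rts)
    return [rt for rt in rts if abs(rt - m) <= criterion * sd]

rts = [415, 422, 430, 408, 445, 418, 436, 427, 451, 409,
       433, 421, 440, 416, 429, 424, 438, 412, 447,
       2900]  # one extremely slow trial
trimmed = trim_outliers(rts)
```

Applying the criterion within individual subjects, as here, avoids discarding a slow subject's legitimate trials against a fast group mean.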
The repeated-measures ANOVA, which is almost always used for significance testing with RT
data, makes an assumption of “compound symmetry,” or equality of covariances across all pairs
of conditions. That assumption is seldom met in real data. Most statistical packages compute
adjusted values of p based on either the Greenhouse-Geisser statistic or the newer, less
conservative Huynh-Feldt statistic. In general, these corrected values of p should be used in
assessing statistical significance.
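A rough sketch of the Greenhouse-Geisser epsilon, which quantifies the departure from the symmetry assumption (the ANOVA degrees of freedom are multiplied by epsilon before p is computed), is given below in plain Python; this is an illustration of the standard formula, not the code any particular statistical package uses.

```python
# Sketch: Greenhouse-Geisser epsilon from a k x k covariance matrix of
# the condition scores. Epsilon = 1 under sphericity; epsilon has a
# lower bound of 1/(k-1) under maximal violation.

def gg_epsilon(cov):
    """Greenhouse-Geisser epsilon for a k x k covariance matrix."""
    k = len(cov)
    # Double-center the matrix: subtract row and column means, add grand mean.
    row = [sum(r) / k for r in cov]
    col = [sum(cov[i][j] for i in range(k)) / k for j in range(k)]
    grand = sum(row) / k
    dc = [[cov[i][j] - row[i] - col[j] + grand for j in range(k)]
          for i in range(k)]
    # Epsilon = trace(dc)^2 / ((k - 1) * trace(dc @ dc)).
    trace = sum(dc[i][i] for i in range(k))
    trace_sq = sum(dc[i][j] * dc[j][i]
                   for i in range(k) for j in range(k))
    return trace ** 2 / ((k - 1) * trace_sq)
```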
21 A discussion of the controversy surrounding the merits of traditional null hypothesis testing is beyond the
scope of this discussion. See several articles in the January, 1997 issue of Psychological Science and
Chow (1998) for discussions of this topic.