In statistics, the Jonckheere trend test^{[1]} (sometimes called the Jonckheere–Terpstra^{[2]} test) is a test for an ordered alternative hypothesis within an independent samples (between-participants) design. It is similar to the Kruskal–Wallis test in that the null hypothesis is that several independent samples are from the same population. However, with the Kruskal–Wallis test there is no a priori ordering of the populations from which the samples are drawn. When there is an a priori ordering, the Jonckheere test has more statistical power than the Kruskal–Wallis test. The test was developed by Aimable Robert Jonckheere, who was a psychologist and statistician at University College London.
The null and alternative hypotheses can be conveniently expressed in terms of population medians for k populations (where k > 2). Letting θ_{i} be the population median for the ith population, the null hypothesis is:

$$H_0: \theta_1 = \theta_2 = \cdots = \theta_k$$
The alternative hypothesis is that the population medians have an a priori ordering, e.g.:

$$H_A: \theta_1 \le \theta_2 \le \cdots \le \theta_k$$

with at least one strict inequality.
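The test can be seen as a special case of Maurice Kendall’s more general method of rank correlation^{[3]} and makes use of Kendall's S statistic, S = P − Q. This can be computed in one of two ways: by direct counting, where the samples are arranged in the predicted order and, for each score in turn, P counts the scores in later samples that are larger and Q counts the scores in later samples that are smaller; or from an ordered contingency table, with the levels of the independent variable increasing from left to right and the values of the dependent variable increasing from top to bottom, where P counts, for each entry, the entries lying below and to its right and Q counts those lying below and to its left.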
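Kendall's S for ordered groups is the number of between-group pairs that agree with the predicted order (P) minus the number that disagree (Q). The following is a minimal sketch of the direct counting approach, not a reference implementation; the function name `kendall_s` and the example scores are hypothetical and chosen only for illustration.

```python
from itertools import combinations


def kendall_s(groups):
    """Return (P, Q, S) for samples listed in their predicted order."""
    p = q = 0
    # combinations() yields each pair of groups with the earlier group first.
    for earlier, later in combinations(groups, 2):
        for x in earlier:
            for y in later:
                if y > x:
                    p += 1  # pair agrees with the predicted order
                elif y < x:
                    q += 1  # pair disagrees with the predicted order
                # tied pairs add to neither count
    return p, q, p - q


# Hypothetical scores for three groups, listed in the predicted order.
print(kendall_s([[1, 3], [2, 5], [4, 6]]))  # (10, 2, 8)
```

Scores within the same group are never compared, so ties inside a group have no effect on the result.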
Note that there will always be ties in the independent variable (individuals are ‘tied’ in the sense that they are in the same group) but there may or may not be ties in the dependent variable. If there are no ties – or the ties occur within a particular sample (which does not affect the value of the test statistic) – exact tables of S are available; for example, Jonckheere^{[1]} provided selected tables for values of k from 3 to 6 and equal sample sizes (m) from 2 to 5. Leach presented critical values of S for k = 3 with sample sizes ranging from 2,2,1 to 5,5,5.^{[4]}
The standard normal distribution can be used to approximate the distribution of S under the null hypothesis for cases in which exact tables are not available. The mean of the distribution of S is always zero and, assuming that no scores are tied between two (or more) different samples, the variance is given by

$$\operatorname{VAR}(S) = \frac{2\left(n^3 - \sum t_i^3\right) + 3\left(n^2 - \sum t_i^2\right)}{18}$$
where n is the total number of scores, and t_{i} is the number of scores in the ith sample. The approximation to the standard normal distribution can be improved by the use of a continuity correction: S_{c} = |S| − 1. Thus 1 is subtracted from a positive S value and 1 is added to a negative S value. The z-score equivalent is then given by

$$z = \frac{S_c}{\sqrt{\operatorname{VAR}(S)}}$$
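As a rough illustration of the normal approximation without between-sample ties, the sketch below computes the variance from the sample sizes and applies the continuity correction. The function names are hypothetical, and keeping the sign of S in the returned z (and returning 0 when S is 0) is an assumption made here for convenience rather than something the text specifies.

```python
import math


def var_s_no_ties(sizes):
    """Variance of S under the null when no scores are tied between samples."""
    n = sum(sizes)
    sum_sq = sum(t ** 2 for t in sizes)
    sum_cu = sum(t ** 3 for t in sizes)
    return (2 * (n ** 3 - sum_cu) + 3 * (n ** 2 - sum_sq)) / 18


def z_continuity(s, sizes):
    """z-score using S_c = |S| - 1, with the sign of S restored (assumption)."""
    if s == 0:
        return 0.0
    z = (abs(s) - 1) / math.sqrt(var_s_no_ties(sizes))
    return z if s > 0 else -z


# Hypothetical case: three samples of 3 scores each and S = 19.
print(var_s_no_ties([3, 3, 3]))     # 81.0
print(z_continuity(19, [3, 3, 3]))  # 2.0
```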
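If scores are tied between two (or more) different samples, there is no exact table for the S distribution and the normal approximation has to be used. In this case no continuity correction is applied to the value of S and the variance is given by

$$\operatorname{VAR}(S) = \frac{2\left(n^3 - \sum t_i^3 - \sum u_i^3\right) + 3\left(n^2 - \sum t_i^2 - \sum u_i^2\right) + 5n}{18} + \frac{\sum\left(t_i^3 - 3t_i^2 + 2t_i\right)\sum\left(u_i^3 - 3u_i^2 + 2u_i\right)}{9n(n-1)(n-2)} + \frac{\sum\left(t_i^2 - t_i\right)\sum\left(u_i^2 - u_i\right)}{2n(n-1)}$$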
where t_{i} is a row marginal total and u_{i} a column marginal total in the contingency table. The z-score equivalent is then given by

$$z = \frac{S}{\sqrt{\operatorname{VAR}(S)}}$$
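The variance with ties between samples is Kendall's tie-corrected variance of S, which depends only on the two sets of marginal totals. The sketch below is one way to compute it; the function names and the example marginals are hypothetical, and the factored forms t(t − 1)(t − 2) and t(t − 1) are used in place of the expanded polynomials t³ − 3t² + 2t and t² − t.

```python
import math


def var_s_ties(t, u):
    """Tie-corrected variance of S from row totals t and column totals u."""
    n = sum(t)
    if n != sum(u):
        raise ValueError("row and column totals must sum to the same n")
    if n < 3:
        raise ValueError("the formula needs at least three scores")
    base = (2 * (n ** 3 - sum(x ** 3 for x in t) - sum(x ** 3 for x in u))
            + 3 * (n ** 2 - sum(x ** 2 for x in t) - sum(x ** 2 for x in u))
            + 5 * n) / 18
    # x(x-1)(x-2) equals x^3 - 3x^2 + 2x, and x(x-1) equals x^2 - x.
    triples = (sum(x * (x - 1) * (x - 2) for x in t)
               * sum(x * (x - 1) * (x - 2) for x in u)) / (9 * n * (n - 1) * (n - 2))
    pairs = (sum(x * (x - 1) for x in t)
             * sum(x * (x - 1) for x in u)) / (2 * n * (n - 1))
    return base + triples + pairs


def z_ties(s, t, u):
    """z-score with no continuity correction, as used when ties are present."""
    return s / math.sqrt(var_s_ties(t, u))


# Hypothetical marginals: four distinct score values, two groups of 3.
print(round(var_s_ties([2, 1, 2, 1], [3, 3]), 3))
```

The expression is symmetric in t and u, so swapping the row and column totals gives the same variance.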
In a partial replication of a study by Loftus and Palmer, participants were assigned at random to one of three groups, and then shown a film of two cars crashing into each other.^{[5]} After viewing the film, the participants in one group were asked the following question: “About how fast were the cars going when they contacted each other?” Participants in a second group were asked, “About how fast were the cars going when they bumped into each other?” Participants in the third group were asked, “About how fast were the cars going when they smashed into each other?” Loftus and Palmer predicted that the action verb used (contacted, bumped, smashed) would influence the speed estimates in miles per hour (mph) such that action verbs implying greater energy would lead to higher estimated speeds. The following results were obtained (simulated data):
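Speed estimate (mph) | Contacted | Bumped | Smashed |
---|---|---|---|
Results | 10 | 12 | 20 |
Results | 12 | 18 | 25 |
Results | 14 | 20 | 27 |
Results | 16 | 22 | 30 |
Median | 13 | 19 | 26 |

Arranged as an ordered contingency table, with the speed estimates as rows and the groups as columns, the same data are:
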
Mph | Contacted | Bumped | Smashed | Totals t_{i} |
---|---|---|---|---|
10 | 1 | 0 | 0 | 1 |
12 | 1 | 1 | 0 | 2 |
14 | 1 | 0 | 0 | 1 |
16 | 1 | 0 | 0 | 1 |
18 | 0 | 1 | 0 | 1 |
20 | 0 | 1 | 1 | 2 |
22 | 0 | 1 | 0 | 1 |
25 | 0 | 0 | 1 | 1 |
27 | 0 | 0 | 1 | 1 |
30 | 0 | 0 | 1 | 1 |
Totals u_{i} | 4 | 4 | 4 | 12 |
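The counts in the contingency table above can be turned into S by pairing each cell with the cells in lower rows: cells below and to the right hold pairs that agree with the predicted order, and cells below and to the left hold pairs that disagree. The following is a minimal sketch under that reading; the function name `s_from_table` and the variable name `speed_by_group` are hypothetical.

```python
def s_from_table(table):
    """Return (P, Q, S) from an ordered contingency table.

    Rows are score values in increasing order; columns are groups in the
    predicted order.
    """
    p = q = 0
    for r, row in enumerate(table):
        for c, count in enumerate(row):
            for lower_row in table[r + 1:]:
                p += count * sum(lower_row[c + 1:])  # below and to the right
                q += count * sum(lower_row[:c])      # below and to the left
    return p, q, p - q


# Cell counts from the table above (columns: Contacted, Bumped, Smashed).
speed_by_group = [
    [1, 0, 0],  # 10 mph
    [1, 1, 0],  # 12 mph
    [1, 0, 0],  # 14 mph
    [1, 0, 0],  # 16 mph
    [0, 1, 0],  # 18 mph
    [0, 1, 1],  # 20 mph
    [0, 1, 0],  # 22 mph
    [0, 0, 1],  # 25 mph
    [0, 0, 1],  # 27 mph
    [0, 0, 1],  # 30 mph
]
print(s_from_table(speed_by_group))  # (43, 3, 40)
```

Cells in the same row are tied on speed and cells in the same column belong to the same group, so neither contributes to P or Q.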
When the ties between samples are few (as in this example), Leach suggested that ignoring the ties and using exact tables would provide a reasonably accurate result.^{[4]} Jonckheere suggested breaking the ties against the alternative hypothesis and then using exact tables.^{[1]} For these data, counting pairs of scores from different groups gives P = 43 and Q = 3, with two tied pairs, so S = 43 − 3 = 40. In the current example, where tied scores only appear in adjacent groups, breaking the two ties against the alternative hypothesis moves each tied pair into Q and lowers S to 43 − 5 = 38. This may be verified by substituting 11 mph in place of 12 mph in the Bumped sample and 19 mph in place of 20 mph in the Smashed sample, and re-computing the test statistic. From tables with k = 3 and m = 4, the critical S value for α = 0.05 is 36, and thus the result would be declared statistically significant at this level under either treatment of the ties.
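Alternatively, the normal approximation with the variance corrected for ties can be used. As $n = 12$, $\sum t_i^2 = 16$, $\sum t_i^3 = 24$, $\sum u_i^2 = 48$ and $\sum u_i^3 = 192$, and

$$\sum\left(t_i^3 - 3t_i^2 + 2t_i\right) = 0, \quad \sum\left(t_i^2 - t_i\right) = 4, \quad \sum\left(u_i^2 - u_i\right) = 36$$

the variance of S is then

$$\operatorname{VAR}(S) = \frac{2(1728 - 24 - 192) + 3(144 - 16 - 48) + 60}{18} + 0 + \frac{4 \times 36}{2 \times 12 \times 11} = \frac{3324}{18} + \frac{144}{264} \approx 185.21$$

And, with S = 40, z is given by

$$z = \frac{40}{\sqrt{185.21}} \approx 2.94$$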
For α = 0.05 (one-sided) the critical z value is 1.645, so again the result would be declared significant at this level. A similar test for trend within the context of repeated measures (within-participants) designs and based on Spearman's rank correlation coefficient was developed by Page.^{[6]}