Jonckheere's trend test

In statistics, the Jonckheere trend test^[1] (sometimes called the Jonckheere–Terpstra^[2] test) is a test for an ordered alternative hypothesis within an independent samples (between-participants) design. It is similar to the Kruskal–Wallis test in that the null hypothesis is that several independent samples are from the same population. However, with the Kruskal–Wallis test there is no a priori ordering of the populations from which the samples are drawn. When there is an a priori ordering, the Jonckheere test has more statistical power than the Kruskal–Wallis test. The test was developed by A. R. Jonckheere, who was a psychologist and statistician at University College London.

The null and alternative hypotheses can be conveniently expressed in terms of population medians for k populations (where k > 2). Letting θ_i be the population median for the ith population, the null hypothesis is:

H_0: \theta_1 = \theta_2 = \cdots = \theta_k

The alternative hypothesis is that the population medians have an a priori ordering e.g.:

H_A: \theta_1

≤

\theta _{2}

≤

\cdots

≤

\theta_k

with at least one strict inequality.

Procedure

The test can be seen as a special case of Maurice Kendall’s more general method of rank correlation^[3] and makes use of the Kendall’s S statistic. This can be computed in one of two ways:

The ‘direct counting’ method

Arrange the samples in the predicted order
For each score in turn, count how many scores in the samples to the right are larger than the score in question. This is P.
For each score in turn, count how many scores in the samples to the right are smaller than the score in question. This is Q.
S = P – Q

The ‘nautical’ method

Cast the data into an ordered contingency table, with the levels of the independent variable increasing from left to right, and values of the dependent variable increasing from top to bottom.
For each entry in the table, count all other entries that lie to the ‘South East’ of the particular entry. This is P.
For each entry in the table, count all other entries that lie to the ‘South West’ of the particular entry. This is Q.
S = P – Q

Note that there will always be ties in the independent variable (individuals are ‘tied’ in the sense that they are in the same group) but there may or may not be ties in the dependent variable. If there are no ties – or the ties occur within a particular sample (which does not affect the value of the test statistic) – exact tables of S are available; for example, Jonckheere^[1] provided selected tables for values of k from 3 to 6 and equal samples sizes (m) from 2 to 5. Leach presented critical values of S for k = 3 with sample sizes ranging from 2,2,1 to 5,5,5.^[4]

Normal approximation to S

The standard normal distribution can be used to approximate the distribution of S under the null hypothesis for cases in which exact tables are not available. The mean of the distribution of S will always be zero, and assuming that there are no ties scores between the values in two (or more) different samples the variance is given by

\operatorname{VAR}(S)=\frac{2(n^3-\sum t^3_i)+3(n^2-\sum t^2_i)}{18}

Where n is the total number of scores, and t_i is the number of scores in the ith sample. The approximation to the standard normal distribution can be improved by the use of a continuity correction: S_c = |S| – 1. Thus 1 is subtracted from a positive S value and 1 is added to a negative S value. The z-score equivalent is then given by

z =\frac{S_c}{\sqrt{\operatorname{VAR}(S)}}

Ties

If scores are tied between the values in two (or more) different samples there are no exact table for the S distribution and an approximation to the normal distribution has to be used. In this case no continuity correction is applied to the value of S and the variance is given by

\begin{align}\operatorname{VAR}(S)=&\frac{2\left(n^3-\sum t^3_i -\sum u^3_i\right)+3\left(n^2-\sum t^2_i -\sum u^2_i\right)+5n}{18} \\ &{}+\frac{\left(\sum t^3_i-3\sum t^2_i+2n\right)\left(\sum u^3_i-3\sum u^2_i+2n\right)}{9n(n-1)(n-2)} \\ &{}+\frac{\left(\sum t^2_i-n\right)\left(\sum u^2_i-n\right)}{2n(n-1)}\end{align}

where t_i is a row marginal total and u_i a column marginal total in the contingency table. The z-score equivalent is then given by

z =\frac{S}{\sqrt{\operatorname{VAR}(S)}}

A numerical example

In a partial replication of a study by Loftus and Palmer participants were assigned at random to one of three groups, and then shown a film of two cars crashing into each other.^[5] After viewing the film, the participants in one group were asked the following question: “About how fast were the cars going when they contacted each other?” Participants in a second group were asked, “About how fast were the cars going when they bumped into each other?” Participants in the third group were asked, “About how fast were the cars going when they smashed into each other?” Loftus and Palmer predicted that the action verb used (contacted, bumped, smashed) would influence the speed estimates in miles per hour (mph) such that action verbs implying greater energy would lead to higher estimated speeds. The following results were obtained (simulated data):

Contacted	Bumped	Smashed
10	12	20
12	18	25
14	20	27
16	22	30
mdn = 13	mdn = 19	mdn = 26

The ‘direct counting’ method

The samples are already in the predicted order
For each score in turn, count how many scores in the samples to the right are larger than the score in question to obtain P:

P = 8 + 7 + 7 + 7 + 4 + 4 + 3 + 3 = 43

For each score in turn, count how many scores in the samples to the right are smaller than the score in question to obtain Q:

Q = 0 + 0 + 1 + 1 + 0 + 0 + 0 + 1 = 3

S = P - Q = 43 - 3
S = 40

The 'nautical' method

Cast the data into an ordered contingency table

mph	Contacted	Bumped	Smashed	Totals (t_i)
10	1	0	0	1
12	1	1	0	2
14	1	0	0	1
16	1	0	0	1
18	0	1	0	1
20	0	1	1	2
22	0	1	0	1
25	0	0	1	1
27	0	0	1	1
30	0	0	1	1
Totals (u_i)	4	4	4	12

For each entry in the table, count all other entries that lie to the 'South East' of the particular entry. This is P:

P = (1 × 8) + (1 × 7) + (1 × 7) + (1 × 7) + (1 × 4) + (1 × 4) + (1 × 3) + ( 1 × 3) = 43

For each entry in the table, count all other entries that lie to the 'South West' of the particular entry. This is Q:

Q = (1 × 2) + (1 × 1) = 3

S = P − Q = 43 − 3
S = 40

Using exact tables

When the ties between samples are few (as in this example) Leach suggested that ignoring the ties and using exact tables would provide a reasonably accurate result.^[4] Jonckheere suggested breaking the ties against the alternative hypothesis and then using exact tables.^[1] In the current example where tied scores only appear in adjacent groups, the value of S is unchanged if the ties are broken against the alternative hypothesis. This may be verified by substituting 11 mph in place of 12 mph in the Bumped sample, and 19 mph in place of 20 mph in the Smashed and re-computing the test statistic. From tables with k = 3, and m = 4, the critical S value for α = 0.05 is 36 and thus the result would be declared statistically significant at this level.

Computing a standard normal approximation

\text{As } n = 12\text{, }n^2=144 \text{ and } n^3 = 1728. \text{ Also}

sum (t^2_i) = 16

sum(t^3_i) = 24

sum(u^2_i) = 48

sum(u^3_i) = 192

The variance of S is then

\begin{align}\operatorname{VAR}(S)=&\frac{2(1728 - 24 - 192)+3(144 - 16 - 48)+ 60}{18} \\ &+\frac{(24 - 48 + 24)(192 - 144 + 24)}{9 \times 12 \times 11 \times 10} \\ &+\frac{(16 - 12)(48 - 12)}{2 \times 12 \times 11} \\ &= 185.212\end{align}

And z is given by

z =\frac{S}{\sqrt{\operatorname{VAR}(S)}}=\frac{40}{\sqrt{185.212}} = 2.939

For α = 0.05 (one-sided) the critical z value is 1.645, so again the result would be declared significant at this level. A similar test for trend within the context of repeated measures (within-participants) designs and based on Spearman's rank correlation coefficient was developed by Page.^[6]

References

1 2 3 Jonckheere, A. R. (1954). "A distribution-free k-sample test against ordered alternatives". Biometrika. 41: 133–145. doi:10.2307/2333011.
↑ Terpstra, T. J. (1952). "The asymptotic normality and consistency of Kendall's test against trend, when ties are present in one ranking" (PDF). Indagationes Mathematicae. 14: 327–333.
↑ Kendall, M. G. (1962). Rank correlation methods (3rd ed.). London: Charles Griffin.
1 2 Leach, C. (1979). Introduction to Statistics: A non-parametric approach for the social sciences. Chichester: John Wiley.
↑ Loftus, E. F.; Palmer, J. C. (1974). "Reconstruction of automobile destruction: An example of the interaction between language and memory". Journal of Verbal Learning and Verbal Behavior. 13: 585–589. doi:10.1016/S0022-5371(74)80011-3.
↑ Page, E. B. (1963). "Ordered hypotheses for multiple treatments: A significance test for linear ranks". Journal of the American Statistical Association. 58 (301): 216–30. doi:10.2307/2282965.

Study design	Population Statistic Effect size Statistical power Sample size determination Missing data

Survey methodology	Sampling Standard error stratified cluster Opinion poll Questionnaire

Controlled experiments	Design control optimal Controlled trial Randomized Random assignment Replication Blocking Interaction Factorial experiment

Uncontrolled studies	Observational study Natural experiment Quasi-experiment

Statistical inference

Statistical theory

Frequentist inference

Point estimation	Estimating equations Maximum likelihood Method of moments M-estimator Minimum distance Unbiased estimators Mean-unbiased minimum-variance Rao–Blackwellization Lehmann–Scheffé theorem Median unbiased Plug-in

Interval estimation	Confidence interval Pivot Likelihood interval Prediction interval Tolerance interval Resampling Bootstrap Jackknife

Testing hypotheses	1- & 2-tails Power Uniformly most powerful test Permutation test Randomization test Multiple comparisons

Parametric tests	Likelihood-ratio Wald Score

Specific tests

Z (normal) Student's t-test F

Goodness of fit	Chi-squared Kolmogorov–Smirnov Anderson–Darling Normality (Shapiro–Wilk) Likelihood-ratio test Model selection Cross validation AIC BIC

Rank statistics	Sign Sample median Signed rank (Wilcoxon) Hodges–Lehmann estimator Rank sum (Mann–Whitney) Nonparametric anova 1-way (Kruskal–Wallis) 2-way (Friedman) Ordered alternative (Jonckheere–Terpstra)

Bayesian inference

Correlation	Pearson product–moment Partial correlation Confounding variable Coefficient of determination

Regression analysis	Errors and residuals Regression model validation Mixed effects models Simultaneous equations models Multivariate adaptive regression splines (MARS)

Linear regression	Simple linear regression Ordinary least squares General linear model Bayesian regression

Non-standard predictors	Nonlinear regression Nonparametric Semiparametric Isotonic Robust Heteroscedasticity Homoscedasticity

Generalized linear model	Exponential families Logistic (Bernoulli) / Binomial / Poisson regressions

Partition of variance	Analysis of variance (ANOVA, anova) Analysis of covariance Multivariate ANOVA Degrees of freedom

Categorical / Multivariate / Time-series / Survival analysis

Categorical

Multivariate

Time-series

General	Decomposition Trend Stationarity Seasonal adjustment Exponential smoothing Cointegration Structural break Granger causality

Specific tests	Dickey–Fuller Johansen Q-statistic (Ljung–Box) Durbin–Watson Breusch–Godfrey

Time domain	Autocorrelation (ACF) partial (PACF) Cross-correlation (XCF) ARMA model ARIMA model (Box–Jenkins) Autoregressive conditional heteroskedasticity (ARCH) Vector autoregression (VAR)

Frequency domain	Spectral density estimation Fourier analysis Wavelet

Survival

Survival function	Kaplan–Meier estimator (product limit) Proportional hazards models Accelerated failure time (AFT) model First hitting time

Hazard function	Nelson–Aalen estimator

Test	Log-rank test

Applications

Biostatistics	Bioinformatics Clinical trials / studies Epidemiology Medical statistics

Engineering statistics	Chemometrics Methods engineering Probabilistic design Process / quality control Reliability System identification

Social statistics	Actuarial science Census Crime statistics Demography Econometrics National accounts Official statistics Population statistics Psychometrics

Spatial statistics	Cartography Environmental statistics Geographic information system Geostatistics Kriging

Category
Portal
Commons
WikiProject

This article is issued from Wikipedia - version of the 8/22/2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.

Jonckheere's trend test

Procedure

The ‘direct counting’ method

The ‘nautical’ method

Normal approximation to S

Ties

A numerical example

The ‘direct counting’ method

The 'nautical' method

Using exact tables

Computing a standard normal approximation

References

Further reading