TukeyHSD                package:stats                R Documentation

_C_o_m_p_u_t_e _T_u_k_e_y _H_o_n_e_s_t _S_i_g_n_i_f_i_c_a_n_t _D_i_f_f_e_r_e_n_c_e_s

_D_e_s_c_r_i_p_t_i_o_n:

     Create a set of confidence intervals on the differences between
     the means of the levels of a factor with the specified family-wise
     probability of coverage.  The intervals are based on the
     Studentized range statistic, Tukey's 'Honest Significant
     Difference' method.  There is a 'plot' method.

_U_s_a_g_e:

     TukeyHSD(x, which, ordered = FALSE, conf.level = 0.95, ...)

_A_r_g_u_m_e_n_t_s:

       x: A fitted model object, usually an 'aov' fit.

   which: A character vector listing terms in the fitted model for
          which the intervals should be calculated.  Defaults to all
          the terms.

 ordered: A logical value indicating if the levels of the factor should
          be ordered according to increasing average in the sample
          before taking differences.  If 'ordered' is true then the
          calculated differences in the means will all be positive. 
          The significant differences will be those for which the 'lwr'
          end point is positive.

conf.level: A numeric value between zero and one giving the family-wise
          confidence level to use.

     ...: Optional additional arguments.  None are used at present.

_D_e_t_a_i_l_s:

     When comparing the means for the levels of a factor in an analysis
     of variance, a simple comparison using t-tests will inflate the
     probability of declaring a significant difference when it is not
     in fact present.  This because the intervals are calculated with a
     given coverage probability for each interval but the
     interpretation of the coverage is usually with respect to the
     entire family of intervals.

     John Tukey introduced intervals based on the range of the sample
     means rather than the individual differences.  The intervals
     returned by this function are based on this Studentized range
     statistics.

     Technically the intervals constructed in this way would only apply
     to balanced designs where there are the same number of
     observations made at each level of the factor.  This function
     incorporates an adjustment for sample size that produces sensible
     intervals for mildly unbalanced designs.

     If 'which' specifies non-factor terms these will be dropped with a
     warning: if no terms are left this is a an error.

_V_a_l_u_e:

     A list with one component for each term requested in 'which'. Each
     component is a matrix with columns 'diff' giving the difference in
     the observed means, 'lwr' giving the lower end point of the
     interval, 'upr' giving the upper end point and 'p adj' giving the
     p-value after adjustment for the multiple comparisons.

_A_u_t_h_o_r(_s):

     Douglas Bates

_R_e_f_e_r_e_n_c_e_s:

     Miller, R. G. (1981) _Simultaneous Statistical Inference_.
     Springer.

     Yandell, B. S. (1997) _Practical Data Analysis for Designed
     Experiments_. Chapman & Hall.

_S_e_e _A_l_s_o:

     'aov', 'qtukey', 'model.tables', 'simint'

_E_x_a_m_p_l_e_s:

     require(graphics)

     summary(fm1 <- aov(breaks ~ wool + tension, data = warpbreaks))
     TukeyHSD(fm1, "tension", ordered = TRUE)
     plot(TukeyHSD(fm1, "tension"))

