xtabs                 package:stats                 R Documentation

_C_r_o_s_s _T_a_b_u_l_a_t_i_o_n

_D_e_s_c_r_i_p_t_i_o_n:

     Create a contingency table from cross-classifying factors, usually
     contained in a data frame, using a formula interface.

_U_s_a_g_e:

     xtabs(formula = ~., data = parent.frame(), subset, na.action,
           exclude = c(NA, NaN), drop.unused.levels = FALSE)

_A_r_g_u_m_e_n_t_s:

 formula: a formula object with the cross-classifying variables
          (separated by '+') on the right hand side (or an object which
          can be coerced to a formula).  Interactions are not allowed. 
          On the left hand side, one may optionally give a vector or a
          matrix of counts; in the latter case, the columns are
          interpreted as corresponding to the levels of a variable. 
          This is useful if the data have already been tabulated, see
          the examples below.

    data: an optional matrix or data frame (or similar: see
          'model.frame') containing the variables in the formula
          'formula'.  By default the variables are taken from
          'environment(formula)'.

  subset: an optional vector specifying a subset of observations to be
          used.

na.action: a function which indicates what should happen when the data
          contain 'NA's.

 exclude: a vector of values to be excluded when forming the set of
          levels of the classifying factors.

drop.unused.levels: a logical indicating whether to drop unused levels
          in the classifying factors.  If this is 'FALSE' and there are
          unused levels, the table will contain zero marginals, and a
          subsequent chi-squared test for independence of the factors
          will not work.

_D_e_t_a_i_l_s:

     There is a 'summary' method for contingency table objects created
     by 'table' or 'xtabs', which gives basic information and performs
     a chi-squared test for independence of factors (note that the
     function 'chisq.test' currently only handles 2-d tables).

     If a left hand side is given in 'formula', its entries are simply
     summed over the cells corresponding to the right hand side; this
     also works if the lhs does not give counts.

_V_a_l_u_e:

     A contingency table in array representation of class 'c("xtabs",
     "table")', with a '"call"' attribute storing the matched call.

_S_e_e _A_l_s_o:

     'table' for traditional cross-tabulation, and
     'as.data.frame.table' which is the inverse operation of 'xtabs'
     (see the 'DF' example below).

_E_x_a_m_p_l_e_s:

     ## 'esoph' has the frequencies of cases and controls for all levels of
     ## the variables 'agegp', 'alcgp', and 'tobgp'.
     xtabs(cbind(ncases, ncontrols) ~ ., data = esoph)
     ## Output is not really helpful ... flat tables are better:
     ftable(xtabs(cbind(ncases, ncontrols) ~ ., data = esoph))
     ## In particular if we have fewer factors ...
     ftable(xtabs(cbind(ncases, ncontrols) ~ agegp, data = esoph))

     ## This is already a contingency table in array form.
     DF <- as.data.frame(UCBAdmissions)
     ## Now 'DF' is a data frame with a grid of the factors and the counts
     ## in variable 'Freq'.
     DF
     ## Nice for taking margins ...
     xtabs(Freq ~ Gender + Admit, DF)
     ## And for testing independence ...
     summary(xtabs(Freq ~ ., DF))

     ## Create a nice display for the warp break data.
     warpbreaks$replicate <- rep(1:9, len = 54)
     ftable(xtabs(breaks ~ wool + tension + replicate, data = warpbreaks))

