rowsum                 package:base                 R Documentation

_G_i_v_e _c_o_l_u_m_n _s_u_m_s _o_f _a _m_a_t_r_i_x _o_r _d_a_t_a _f_r_a_m_e, _b_a_s_e_d _o_n _a _g_r_o_u_p_i_n_g _v_a_r_i_a_b_l_e

_D_e_s_c_r_i_p_t_i_o_n:

     Compute column sums across rows of a matrix-like object for each
     level of a grouping variable.  'rowsum' is generic, with a method
     for data frames and a default method for vectors and matrices.

_U_s_a_g_e:

     rowsum(x, group, reorder = TRUE, ...)

     ## S3 method for class 'data.frame':
     rowsum(x, group, reorder = TRUE, na.rm = FALSE, ...)

     ## Default S3 method:
     rowsum(x, group, reorder = TRUE, na.rm = FALSE, ...)

_A_r_g_u_m_e_n_t_s:

       x: a matrix, data frame or vector of numeric data.  Missing
          values are allowed.  A numeric vector will be treated as a
          column vector.

   group: a vector or factor giving the grouping, with one element per
          row of 'x'.  Missing values will be treated as another group
          and a warning will be given.

 reorder: if 'TRUE', then the result will be in order of
          'sort(unique(group))', if 'FALSE', it will be in the order
          that groups were encountered.

   na.rm: logical ('TRUE' or 'FALSE').  Should 'NA' values be
          discarded?

     ...: other arguments to be passed to or from methods

_D_e_t_a_i_l_s:

     The default is to reorder the rows to agree with 'tapply' as in
     the example below.  Reordering should not add noticeably to the
     time except when there are very many distinct values of 'group'
     and 'x' has few columns.

     The original function was written by Terry Therneau, but this is a
     new implementation using hashing that is much faster for large
     matrices.

     To sum over all the rows of a matrix (ie, a single 'group') use
     'colSums', which should be even faster.

_V_a_l_u_e:

     A matrix or data frame containing the sums.  There will be one row
     per unique value of 'group'.

_S_e_e _A_l_s_o:

     'tapply', 'aggregate', 'rowSums'

_E_x_a_m_p_l_e_s:

     require(stats)

     x <- matrix(runif(100), ncol=5)
     group <- sample(1:8, 20, TRUE)
     (xsum <- rowsum(x, group))
     ## Slower versions
     tapply(x, list(group[row(x)], col(x)), sum)
     t(sapply(split(as.data.frame(x), group), colSums))
     aggregate(x, list(group), sum)[-1]

