biopsy                 package:MASS                 R Documentation

_B_i_o_p_s_y _D_a_t_a _o_n _B_r_e_a_s_t _C_a_n_c_e_r _P_a_t_i_e_n_t_s

_D_e_s_c_r_i_p_t_i_o_n:

     This breast cancer database was obtained from the University of
     Wisconsin Hospitals, Madison from Dr. William H. Wolberg. He
     assessed biopsies of breast tumours for 699 patients up to 15 July
     1992; each of nine attributes has been scored on a scale of 1 to
     10, and the outcome is also known. There are 699 rows and 11
     columns.

_U_s_a_g_e:

     biopsy

_F_o_r_m_a_t:

     This data frame contains the following columns:

     '_I_D' Sample code number (not unique)

     '_V_1' Clump thickness

     '_V_2' Uniformity of cell size

     '_V_3' Uniformity of cell shape

     '_V_4' Marginal adhesion

     '_V_5' Single epithelial cell size

     '_V_6' Bare nuclei (16 values are missing)

     '_V_7' Bland chromatin

     '_V_8' Normal nucleoli

     '_V_9' Mitoses

     '_c_l_a_s_s' '"benign"' or '"malignant"'

_S_o_u_r_c_e:

     P. M. Murphy and D. W. Aha  (1992). UCI Repository of machine
     learning databases. [Machine-readable data repository]. Irvine,
     CA: University of California, Department of Information and
     Computer Science.

     O. L. Mangasarian and W. H. Wolberg (1990) Cancer diagnosis via
     linear programming. _SIAM News_ *23*, pp 1 & 18.

     William H. Wolberg and O.L. Mangasarian (1990) Multisurface method
     of pattern separation for medical diagnosis applied to breast
     cytology. _Proceedings of the National Academy of Sciences,
     U.S.A._ *87*, pp. 9193-9196.

     O. L. Mangasarian, R. Setiono and W.H. Wolberg (1990) Pattern
     recognition via linear programming: Theory and application to
     medical diagnosis. In _Large-scale Numerical Optimization_ eds
     Thomas F. Coleman and Yuying Li, SIAM Publications, Philadelphia,
     pp 22-30.

     K. P. Bennett and O. L. Mangasarian (1992) Robust linear
     programming discrimination of two linearly inseparable sets.
     _Optimization Methods and Software_ *1*, pp. 23-34 (Gordon &
     Breach Science Publishers).

_R_e_f_e_r_e_n_c_e_s:

     Venables, W. N. and Ripley, B. D. (1999) _Modern Applied
     Statistics with S-PLUS._ Third Edition. Springer.

