strptime                package:base                R Documentation

_D_a_t_e-_t_i_m_e _C_o_n_v_e_r_s_i_o_n _F_u_n_c_t_i_o_n_s _t_o _a_n_d _f_r_o_m _C_h_a_r_a_c_t_e_r

_D_e_s_c_r_i_p_t_i_o_n:

     Functions to convert between character representations and objects
     of classes '"POSIXlt"' and '"POSIXct"' representing calendar dates
     and times.

_U_s_a_g_e:

     ## S3 method for class 'POSIXct':
     format(x, format = "", tz = "", usetz = FALSE, ...)
     ## S3 method for class 'POSIXlt':
     format(x, format = "", usetz = FALSE, ...)

     ## S3 method for class 'POSIXt':
     as.character(x, ...)

     strftime(x, format="", usetz = FALSE, ...)
     strptime(x, format, tz = "")

     ISOdatetime(year, month, day, hour, min, sec, tz = "")
     ISOdate(year, month, day, hour = 12, min = 0, sec = 0, tz = "GMT")

_A_r_g_u_m_e_n_t_s:

       x: An object to be converted.

      tz: A timezone specification to be used for the conversion.
          System-specific (see 'as.POSIXlt'), but '""' is the current
          time zone, and '"GMT"' is UTC.

  format: A character string.  The default is '"%Y-%m-%d %H:%M:%S"' if
          any component has a time component which is not midnight, and
          '"%Y-%m-%d"' otherwise.  If 'options("digits.secs")' is set,
          up to the specified number of digits will be printed for
          seconds.

     ...: Further arguments to be passed from or to other methods.

   usetz: logical.  Should the timezone be appended to the output? This
          is used in printing times, and as a workaround for problems
          with using '"%Z"' on most Linux systems.

year, month, day: numerical values to specify a day.

hour, min, sec: numerical values for a time within a day. Fractional
          seconds are allowed.

_D_e_t_a_i_l_s:

     The 'format' and 'as.character' methods and 'strftime' convert
     objects from the classes '"POSIXlt"' and '"POSIXct"' (not
     'strftime') to character vectors.

     'strptime' converts character strings to class '"POSIXlt"': its
     input 'x' is first coerced to character if necessary.  Each string
     is processed as far as necessary for the format specified: any
     trailing characters are ignored.

     'strftime' is an alias for 'format.POSIXlt', and 'format.POSIXct'
     first converts to class '"POSIXlt"' by calling 'as.POSIXlt'.  Note
     that only that conversion depends on the time zone.

     The usual vector re-cycling rules are applied to 'x' and 'format'
     so the answer will be of length that of the longer of the vectors.

     Locale-specific conversions to and from character strings are used
     where appropriate and available.  This affects the names of the
     days and months, the AM/PM indicator (if used) and the separators
     in formats such as '%x' and '%X' (via the setting of the 'LC_TIME'
     locale category).

     The details of the formats are system-specific, but the following
     are defined by the ISO C / POSIX standard for 'strftime' and are
     likely to be widely available.  A _conversion specification_ is
     introduced by '%', usually followed by a single letter or 'O' or
     'E' and then a single letter. Any character in the format string
     not part of a conversion specification is interpreted literally
     (and '%%' gives '%').  Widely implemented conversion
     specifications include

     '%_a' Abbreviated weekday name in the current locale. (Also matches
          full name on input.)

     '%_A' Full weekday name in the current locale.  (Also matches
          abbreviated name on input.)

     '%_b' Abbreviated month name in the current locale. (Also matches
          full name on input.)

     '%_B' Full month name in the current locale.  (Also matches
          abbreviated name on input.)

     '%_c' Date and time, locale-specific.

     '%_d' Day of the month as decimal number (01-31).

     '%_H' Hours as decimal number (00-23).

     '%_I' Hours as decimal number (01-12).

     '%_j' Day of year as decimal number (001-366).

     '%_m' Month as decimal number (01-12).

     '%_M' Minute as decimal number (00-59).

     '%_p' AM/PM indicator in the locale.  Used in conjuction with '%I'
          and *not* with '%H'.  An empty string in some locales.

     '%_S' Second as decimal number (00-61), allowing for up to two
          leap-seconds (but POSIX-compliant OSes will ignore leap
          seconds).

     '%_U' Week of the year as decimal number (00-53) using Sunday as
          the first day 1 of the week (and typically with the first
          Sunday of the year as day 1 of week 1).  The US convention.

     '%_w' Weekday as decimal number (0-6, Sunday is 0).

     '%_W' Week of the year as decimal number (00-53) using Monday as
          the first day of week (and typically with the first Monday of
          the year as day 1 of week 1). The UK convention.

     '%_x' Date, locale-specific.

     '%_X' Time, locale-specific.

     '%_y' Year without century (00-99). If you use this on input, which
          century you get is system-specific.  So don't!  Often values
          up to 68 (or 69) are prefixed by 20 and 69 (or 70) to 99 by
          19.

     '%_Y' Year with century.

     '%_z' (output only.) Offset from Greenwich, so '-0800' is 8 hours
          west of Greenwich.

     '%_Z' (output only.) Time zone as a character string (empty if not
          available).

     Where leading zeros are shown they will be used on output but are
     optional on input.

     Also defined in the current standards but less widely implemented
     (e.g. not for output on Windows) are

     '%_C' Century (00-99): the integer part of the year divided by 100.

     '%_D' Locale-specific date format such as '%m/%d/%y': ISO C99 says
          it should be that exact format.

     '%_e' Day of the month as decimal number (1-31), with a leading
          space for a single-digit number.

     '%_F' Equivalent to %Y-%m-%d (the ISO 8601 date format).

     '%_g' The last two digits of the week-based year (see '%V'). 
          (Typically accepted but ignored on input.)

     '%_G' The week-based year (see '%V') as a decimal number. 
          (Typically accepted but ignored on input.)

     '%_h' Equivalent to '%b'.

     '%_k' The 24-hour clock time with single digits preceded by a
          blank.

     '%_l' The 12-hour clock time with single digits preceded by a
          blank.

     '%_n' Newline on output, arbitrary whitespace on input.

     '%_r' The 12-hour clock time (using the locale's AM or PM).

     '%_R' Equivalent to '%H:%M'.

     '%_t' Tab on output, arbitrary whitespace on input.

     '%_T' Equivalent to '%H:%M:%S'.

     '%_u' Weekday as a decimal number (1-7, Monday is 1).

     '%_V' Week of the year as decimal number (00-53) as defined in ISO
          8601. If the week (starting on Monday) containing 1 January
          has four or more days in the new year, then it is considered
          week 1.  Otherwise, it is the last week of the previous year,
          and the next week is week 1.  (Typically accepted but ignored
          on input.)

     For output (and possibly input) there are also '%O[dHImMUVwWy]'
     which may emit numbers in an alternative locale-dependent format
     (e.g. roman numerals), and '%E[cCyYxX]' which can use an
     alternative 'era' (e.g. a different religious calendar).  Which of
     these are supported is OS-dependent.

     Specific to R is '%OSn', which for output gives the seconds to '0
     <= n <= 6' decimal places (and if '%OS' is not followed by a
     digit, it uses the setting of 'getOption("digits.secs")', or if
     that is unset, 'n = 3'). Further, for 'strptime' '%OS' will input
     seconds including fractional seconds.  Note that '%S' ignores (and
     not rounds) fractional parts on output.

     The behaviour of other conversion specifications (and even if
     other character sequences commencing with '%' _are_ conversion
     specifications) is system-specific.

     'ISOdatetime' and 'ISOdate' are convenience wrappers for
     'strptime', that differ only in their defaults and that 'ISOdate'
     sets a timezone.  (For dates without times it would be better to
     use the '"Date"' class.)

_V_a_l_u_e:

     The 'format' methods and 'strftime' return character vectors
     representing the time.  'NA' times are returned as
     'NA_character_'.

     'strptime' turns character representations into an object of class
     '"POSIXlt"'.  The timezone is used to set the 'isdst' component
     and to set the '"tzone"' attribute if 'tz != ""'.

     'ISOdatetime' and 'ISOdate' return an object of class '"POSIXct"'.

_N_o_t_e:

     The default formats follow the rules of the ISO 8601 international
     standard which expresses a day as '"2001-02-28"' and a time as
     '"14:01:02"' using leading zeroes as here.  The ISO form uses no
     space to separate dates and times.

     If the date string does not specify the date completely, the
     returned answer may be system-specific.  The most common behaviour
     is to assume that unspecified seconds, minutes or hours are zero,
     and a missing year, month or day is the current one.  If it
     specifies a date incorrectly, reliable implementations will give
     an error and the date is reported as 'NA'.  Unfortunately some
     common implementations (such as 'glibc') are unreliable and guess
     at the intended meaning.

     If the timezone specified is invalid on your system, what happens
     is system-specific but it will probably be ignored.

     OS facilities will probably not print years before 1CE (aka 1AD)
     correctly.

     Remember that in most timezones some times do not occur and some
     occur twice because of transitions to/from summer time.  What
     happens in those cases is OS-specific.

_R_e_f_e_r_e_n_c_e_s:

     International Organization for Standardization (2004, 1988, 1997,
     ...) _ISO 8601. Data elements and interchange formats -
     Information interchange - Representation of dates and times._ For
     links to versions available on-line see (at the time of writing)
     <URL: http://www.qsl.net/g1smd/isopdf.htm>; for information on the
     current official version, see <URL:
     http://www.iso.org/iso/en/prods-services/popstds/datesandtime.html>.

_S_e_e _A_l_s_o:

     DateTimeClasses for details of the date-time classes; 'locales' to
     query or set a locale.

     Your system's help pages on 'strftime' and 'strptime' to see how
     to specify their formats. (On some systems 'strptime' is replaced
     by corrected code from 'glibc', when all the conversion
     specifications described here are supported, but with no
     alternative number representation nor era available in any
     locale.)

_E_x_a_m_p_l_e_s:

     ## locale-specific version of date()
     format(Sys.time(), "%a %b %d %X %Y %Z")

     ## time to sub-second accuracy (if supported by the OS)
     format(Sys.time(), "%H:%M:%OS3")

     ## read in date info in format 'ddmmmyyyy'
     ## This will give NA(s) in some locales; setting the C locale
     ## as in the commented lines will overcome this on most systems.
     ## lct <- Sys.getlocale("LC_TIME"); Sys.setlocale("LC_TIME", "C")
     x <- c("1jan1960", "2jan1960", "31mar1960", "30jul1960")
     z <- strptime(x, "%d%b%Y")
     ## Sys.setlocale("LC_TIME", lct)
     z

     ## read in date/time info in format 'm/d/y h:m:s'
     dates <- c("02/27/92", "02/27/92", "01/14/92", "02/28/92", "02/01/92")
     times <- c("23:03:20", "22:29:56", "01:03:30", "18:21:03", "16:56:26")
     x <- paste(dates, times)
     strptime(x, "%m/%d/%y %H:%M:%S")

     ## time with fractional seconds
     z <- strptime("20/2/06 11:16:16.683", "%d/%m/%y %H:%M:%OS")
     z # prints without fractional seconds
     op <- options(digits.secs=3)
     z
     options(op)

     ## timezones are not portable, but 'EST5EDT' comes pretty close.
     (x <- strptime(c("2006-01-08 10:07:52", "2006-08-07 19:33:02"),
                    "%Y-%m-%d %H:%M:%S", tz="EST5EDT"))
     attr(x, "tzone")

