/[pcre]/code/trunk/doc/pcretest.1
ViewVC logotype

Diff of /code/trunk/doc/pcretest.1

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 93 by nigel, Sat Feb 24 21:41:42 2007 UTC revision 286 by ph10, Mon Dec 17 14:46:11 2007 UTC
# Line 157  effect as they do in Perl. For example: Line 157  effect as they do in Perl. For example:
157  The following table shows additional modifiers for setting PCRE options that do  The following table shows additional modifiers for setting PCRE options that do
158  not correspond to anything in Perl:  not correspond to anything in Perl:
159  .sp  .sp
160    \fB/A\fP       PCRE_ANCHORED    \fB/A\fP              PCRE_ANCHORED
161    \fB/C\fP       PCRE_AUTO_CALLOUT    \fB/C\fP              PCRE_AUTO_CALLOUT
162    \fB/E\fP       PCRE_DOLLAR_ENDONLY    \fB/E\fP              PCRE_DOLLAR_ENDONLY
163    \fB/f\fP       PCRE_FIRSTLINE    \fB/f\fP              PCRE_FIRSTLINE
164    \fB/J\fP       PCRE_DUPNAMES    \fB/J\fP              PCRE_DUPNAMES
165    \fB/N\fP       PCRE_NO_AUTO_CAPTURE    \fB/N\fP              PCRE_NO_AUTO_CAPTURE
166    \fB/U\fP       PCRE_UNGREEDY    \fB/U\fP              PCRE_UNGREEDY
167    \fB/X\fP       PCRE_EXTRA    \fB/X\fP              PCRE_EXTRA
168    \fB/<cr>\fP    PCRE_NEWLINE_CR    \fB/<cr>\fP           PCRE_NEWLINE_CR
169    \fB/<lf>\fP    PCRE_NEWLINE_LF    \fB/<lf>\fP           PCRE_NEWLINE_LF
170    \fB/<crlf>\fP  PCRE_NEWLINE_CRLF    \fB/<crlf>\fP         PCRE_NEWLINE_CRLF
171    \fB/<any>\fP   PCRE_NEWLINE_ANY    \fB/<anycrlf>\fP      PCRE_NEWLINE_ANYCRLF
172  .sp    \fB/<any>\fP          PCRE_NEWLINE_ANY
173  Those specifying line ending sequencess are literal strings as shown. This    \fB/<bsr_anycrlf>\fP  PCRE_BSR_ANYCRLF
174  example sets multiline matching with CRLF as the line ending sequence:    \fB/<bsr_unicode>\fP  PCRE_BSR_UNICODE
175    .sp
176    Those specifying line ending sequences are literal strings as shown, but the
177    letters can be in either case. This example sets multiline matching with CRLF
178    as the line ending sequence:
179  .sp  .sp
180    /^abc/m<crlf>    /^abc/m<crlf>
181  .sp  .sp
# Line 214  the subject string. This is useful for t Line 218  the subject string. This is useful for t
218  multiple copies of the same substring.  multiple copies of the same substring.
219  .P  .P
220  The \fB/B\fP modifier is a debugging feature. It requests that \fBpcretest\fP  The \fB/B\fP modifier is a debugging feature. It requests that \fBpcretest\fP
221  output a representation of the compiled byte code after compilation.  output a representation of the compiled byte code after compilation. Normally
222    this information contains length and offset values; however, if \fB/Z\fP is
223    also present, this data is replaced by spaces. This is a special feature for
224    use in the automatic test scripts; it ensures that the same output is generated
225    for different internal link sizes.
226  .P  .P
227  The \fB/L\fP modifier must be followed directly by the name of a locale, for  The \fB/L\fP modifier must be followed directly by the name of a locale, for
228  example,  example,
# Line 233  so on). It does this by calling \fBpcre_ Line 241  so on). It does this by calling \fBpcre_
241  pattern. If the pattern is studied, the results of that are also output.  pattern. If the pattern is studied, the results of that are also output.
242  .P  .P
243  The \fB/D\fP modifier is a PCRE debugging feature, and is equivalent to  The \fB/D\fP modifier is a PCRE debugging feature, and is equivalent to
244  \fB/BI\fP, that is, both the \fP/B\fP and the \fB/I\fP modifiers.  \fB/BI\fP, that is, both the \fB/B\fP and the \fB/I\fP modifiers.
245  .P  .P
246  The \fB/F\fP modifier causes \fBpcretest\fP to flip the byte order of the  The \fB/F\fP modifier causes \fBpcretest\fP to flip the byte order of the
247  fields in the compiled pattern that contain 2-byte and 4-byte numbers. This  fields in the compiled pattern that contain 2-byte and 4-byte numbers. This
# Line 368  recognized: Line 376  recognized:
376    \e<crlf>    pass the PCRE_NEWLINE_CRLF option to \fBpcre_exec()\fP    \e<crlf>    pass the PCRE_NEWLINE_CRLF option to \fBpcre_exec()\fP
377                 or \fBpcre_dfa_exec()\fP                 or \fBpcre_dfa_exec()\fP
378  .\" JOIN  .\" JOIN
379      \e<anycrlf> pass the PCRE_NEWLINE_ANYCRLF option to \fBpcre_exec()\fP
380                   or \fBpcre_dfa_exec()\fP
381    .\" JOIN
382    \e<any>     pass the PCRE_NEWLINE_ANY option to \fBpcre_exec()\fP    \e<any>     pass the PCRE_NEWLINE_ANY option to \fBpcre_exec()\fP
383                 or \fBpcre_dfa_exec()\fP                 or \fBpcre_dfa_exec()\fP
384  .sp  .sp
# Line 403  and \eZ, causing REG_NOTBOL and REG_NOTE Line 414  and \eZ, causing REG_NOTBOL and REG_NOTE
414  The use of \ex{hh...} to represent UTF-8 characters is not dependent on the use  The use of \ex{hh...} to represent UTF-8 characters is not dependent on the use
415  of the \fB/8\fP modifier on the pattern. It is recognized always. There may be  of the \fB/8\fP modifier on the pattern. It is recognized always. There may be
416  any number of hexadecimal digits inside the braces. The result is from one to  any number of hexadecimal digits inside the braces. The result is from one to
417  six bytes, encoded according to the UTF-8 rules.  six bytes, encoded according to the original UTF-8 rules of RFC 2279. This
418    allows for values in the range 0 to 0x7FFFFFFF. Note that not all of those are
419    valid Unicode code points, or indeed valid UTF-8 characters according to the
420    later rules in RFC 3629.
421  .  .
422  .  .
423  .SH "THE ALTERNATIVE MATCHING FUNCTION"  .SH "THE ALTERNATIVE MATCHING FUNCTION"
# Line 449  of an interactive \fBpcretest\fP run. Line 463  of an interactive \fBpcretest\fP run.
463    data> xyz    data> xyz
464    No match    No match
465  .sp  .sp
466    Note that unset capturing substrings that are not followed by one that is set
467    are not returned by \fBpcre_exec()\fP, and are not shown by \fBpcretest\fP. In
468    the following example, there are two capturing substrings, but when the first
469    data line is matched, the second, unset substring is not shown. An "internal"
470    unset substring is shown as "<unset>", as for the second data line.
471    .sp
472        re> /(a)|(b)/
473      data> a
474       0: a
475       1: a
476      data> b
477       0: b
478       1: <unset>
479       2: b
480    .sp
481  If the strings contain any non-printing characters, they are output as \e0x  If the strings contain any non-printing characters, they are output as \e0x
482  escapes, or as \ex{...} escapes if the \fB/8\fP modifier was present on the  escapes, or as \ex{...} escapes if the \fB/8\fP modifier was present on the
483  pattern. See below for the definition of non-printing characters. If the  pattern. See below for the definition of non-printing characters. If the
# Line 529  indicating that the subject partially ma Line 558  indicating that the subject partially ma
558  match with additional subject data by means of the \eR escape sequence. For  match with additional subject data by means of the \eR escape sequence. For
559  example:  example:
560  .sp  .sp
561      re> /^\d?\d(jan|feb|mar|apr|may|jun|jul|aug|sep|oct|nov|dec)\d\d$/      re> /^\ed?\ed(jan|feb|mar|apr|may|jun|jul|aug|sep|oct|nov|dec)\ed\ed$/
562    data> 23ja\eP\eD    data> 23ja\eP\eD
563    Partial match: 23ja    Partial match: 23ja
564    data> n05\eR\eD    data> n05\eR\eD
# Line 665  result is undefined. Line 694  result is undefined.
694  .rs  .rs
695  .sp  .sp
696  \fBpcre\fP(3), \fBpcreapi\fP(3), \fBpcrecallout\fP(3), \fBpcrematching\fP(3),  \fBpcre\fP(3), \fBpcreapi\fP(3), \fBpcrecallout\fP(3), \fBpcrematching\fP(3),
697  \fBpcrepartial\fP(d), \fPpcrepattern\fP(3), \fBpcreprecompile\fP(3).  \fBpcrepartial\fP(d), \fBpcrepattern\fP(3), \fBpcreprecompile\fP(3).
698  .  .
699  .  .
700  .SH AUTHOR  .SH AUTHOR
701  .rs  .rs
702  .sp  .sp
703    .nf
704  Philip Hazel  Philip Hazel
705  .br  University Computing Service
 University Computing Service,  
 .br  
706  Cambridge CB2 3QH, England.  Cambridge CB2 3QH, England.
707  .P  .fi
708  .in 0  .
709  Last updated: 30 November 2006  .
710  .br  .SH REVISION
711  Copyright (c) 1997-2006 University of Cambridge.  .rs
712    .sp
713    .nf
714    Last updated: 19 November 2007
715    Copyright (c) 1997-2007 University of Cambridge.
716    .fi

Legend:
Removed from v.93  
changed lines
  Added in v.286

  ViewVC Help
Powered by ViewVC 1.1.5