/[pcre]/code/trunk/doc/html/pcresyntax.html
ViewVC logotype

Diff of /code/trunk/doc/html/pcresyntax.html

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1403 by ph10, Tue May 28 09:13:59 2013 UTC revision 1404 by ph10, Tue Nov 19 15:36:57 2013 UTC
# Line 65  documentation. This document contains a Line 65  documentation. This document contains a
65    \n         newline (hex 0A)    \n         newline (hex 0A)
66    \r         carriage return (hex 0D)    \r         carriage return (hex 0D)
67    \t         tab (hex 09)    \t         tab (hex 09)
68      \0dd       character with octal code 0dd
69    \ddd       character with octal code ddd, or backreference    \ddd       character with octal code ddd, or backreference
70      \o{ddd..}  character with octal code ddd..
71    \xhh       character with hex code hh    \xhh       character with hex code hh
72    \x{hhh..}  character with hex code hhh..    \x{hhh..}  character with hex code hhh..
73  </PRE>  </pre>
74    Note that \0dd is always an octal code, and that \8 and \9 are the literal
75    characters "8" and "9".
76  </P>  </P>
77  <br><a name="SEC4" href="#TOC1">CHARACTER TYPES</a><br>  <br><a name="SEC4" href="#TOC1">CHARACTER TYPES</a><br>
78  <P>  <P>
# Line 92  documentation. This document contains a Line 96  documentation. This document contains a
96    \W         a "non-word" character    \W         a "non-word" character
97    \X         a Unicode extended grapheme cluster    \X         a Unicode extended grapheme cluster
98  </pre>  </pre>
99  In PCRE, by default, \d, \D, \s, \S, \w, and \W recognize only ASCII  By default, \d, \s, and \w match only ASCII characters, even in UTF-8 mode
100  characters, even in a UTF mode. However, this can be changed by setting the  or in the 16- bit and 32-bit libraries. However, if locale-specific matching is
101  PCRE_UCP option.  happening, \s and \w may also match characters with code points in the range
102    128-255. If the PCRE_UCP option is set, the behaviour of these escape sequences
103    is changed to use Unicode properties and they match many more characters.
104  </P>  </P>
105  <br><a name="SEC5" href="#TOC1">GENERAL CATEGORY PROPERTIES FOR \p and \P</a><br>  <br><a name="SEC5" href="#TOC1">GENERAL CATEGORY PROPERTIES FOR \p and \P</a><br>
106  <P>  <P>
# Line 150  PCRE_UCP option. Line 156  PCRE_UCP option.
156  <pre>  <pre>
157    Xan        Alphanumeric: union of properties L and N    Xan        Alphanumeric: union of properties L and N
158    Xps        POSIX space: property Z or tab, NL, VT, FF, CR    Xps        POSIX space: property Z or tab, NL, VT, FF, CR
159    Xsp        Perl space: property Z or tab, NL, FF, CR    Xsp        Perl space: property Z or tab, NL, VT, FF, CR
160    Xuc        Univerally-named character: one that can be    Xuc        Univerally-named character: one that can be
161                 represented by a Universal Character Name                 represented by a Universal Character Name
162    Xwd        Perl word: property Xan or underscore    Xwd        Perl word: property Xan or underscore
163  </PRE>  </pre>
164    Perl and POSIX space are now the same. Perl added VT to its space character set
165    at release 5.18 and PCRE changed at release 8.34.
166  </P>  </P>
167  <br><a name="SEC7" href="#TOC1">SCRIPT NAMES FOR \p AND \P</a><br>  <br><a name="SEC7" href="#TOC1">SCRIPT NAMES FOR \p AND \P</a><br>
168  <P>  <P>
# Line 385  newline-setting options with similar syn Line 393  newline-setting options with similar syn
393    (*UTF32)        set UTF-32 mode: 32-bit library (PCRE_UTF32)    (*UTF32)        set UTF-32 mode: 32-bit library (PCRE_UTF32)
394    (*UTF)          set appropriate UTF mode for the library in use    (*UTF)          set appropriate UTF mode for the library in use
395    (*UCP)          set PCRE_UCP (use Unicode properties for \d etc)    (*UCP)          set PCRE_UCP (use Unicode properties for \d etc)
396  </PRE>  </pre>
397    Note that LIMIT_MATCH and LIMIT_RECURSION can only reduce the value of the
398    limits set by the caller of pcre_exec(), not increase them.
399  </P>  </P>
400  <br><a name="SEC17" href="#TOC1">LOOKAHEAD AND LOOKBEHIND ASSERTIONS</a><br>  <br><a name="SEC17" href="#TOC1">LOOKAHEAD AND LOOKBEHIND ASSERTIONS</a><br>
401  <P>  <P>
# Line 516  Cambridge CB2 3QH, England. Line 526  Cambridge CB2 3QH, England.
526  </P>  </P>
527  <br><a name="SEC27" href="#TOC1">REVISION</a><br>  <br><a name="SEC27" href="#TOC1">REVISION</a><br>
528  <P>  <P>
529  Last updated: 26 April 2013  Last updated: 12 November 2013
530  <br>  <br>
531  Copyright &copy; 1997-2013 University of Cambridge.  Copyright &copy; 1997-2013 University of Cambridge.
532  <br>  <br>

Legend:
Removed from v.1403  
changed lines
  Added in v.1404

  ViewVC Help
Powered by ViewVC 1.1.5