Diff of /code/trunk/doc/pcre.3

revision 345 by ph10, Mon Apr 28 15:10:02 2008 UTC revision 412 by ph10, Sat Apr 11 10:34:37 2009 UTC
# Line 15  The current implementation of PCRE (rele Line 15  The current implementation of PCRE (rele
15  Perl 5.10, including support for UTF-8 encoded strings and Unicode general  Perl 5.10, including support for UTF-8 encoded strings and Unicode general
16  category properties. However, UTF-8 and Unicode support has to be explicitly  category properties. However, UTF-8 and Unicode support has to be explicitly
17  enabled; it is not the default. The Unicode tables correspond to Unicode  enabled; it is not the default. The Unicode tables correspond to Unicode
18  release 5.0.0.  release 5.1.
19  .P  .P
20  In addition to the Perl-compatible matching function, PCRE contains an  In addition to the Perl-compatible matching function, PCRE contains an
21  alternative matching function that matches the same compiled patterns in a  alternative matching function that matches the same compiled patterns in a
# Line 163  the code, and, in addition, you must cal Line 163  the code, and, in addition, you must cal
163  .\" HREF  .\" HREF
164  \fBpcre_compile()\fP  \fBpcre_compile()\fP
165  .\"  .\"
166  with the PCRE_UTF8 option flag. When you do this, both the pattern and any  with the PCRE_UTF8 option flag, or the pattern must start with the sequence
167  subject strings that are matched against it are treated as UTF-8 strings  (*UTF8). When either of these is the case, both the pattern and any subject
168  instead of just strings of bytes.  strings that are matched against it are treated as UTF-8 strings instead of
169    just strings of bytes.
170  .P  .P
171  If you compile PCRE with UTF-8 support, but do not use it at run time, the  If you compile PCRE with UTF-8 support, but do not use it at run time, the
172  library will be a bit bigger, but the additional run time overhead is limited  library will be a bit bigger, but the additional run time overhead is limited
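
Note on the hunk above: the revised wording gives two ways to request UTF-8 handling (the PCRE_UTF8 option flag, or a pattern that starts with (*UTF8)), and says that both pattern and subject are then treated as UTF-8 strings rather than strings of bytes. The Python sketch below illustrates that distinction only. It does not call PCRE, and the names `UTF8_FLAG`, `utf8_requested`, and `units_seen` are hypothetical, introduced for this illustration.

```python
# Stand-in for the PCRE_UTF8 option bit; the value is assumed for illustration.
UTF8_FLAG = 0x0800


def utf8_requested(pattern: str, options: int = 0) -> bool:
    # Either condition named in the revised text enables UTF-8 handling:
    # the option flag is set, or the pattern starts with (*UTF8).
    return bool(options & UTF8_FLAG) or pattern.startswith("(*UTF8)")


def units_seen(subject: bytes, utf8: bool) -> list:
    # Return the items a matcher would step over, one per "character".
    if utf8:
        # UTF-8 mode: each decoded code point counts as one character.
        return [ord(ch) for ch in subject.decode("utf-8")]
    # Byte mode: each byte counts as one character, whatever it encodes.
    return list(subject)


subject = "caf\u00e9".encode("utf-8")  # the last letter is encoded as 0xC3 0xA9

for pattern, options in [("caf.", 0), ("caf.", UTF8_FLAG), ("(*UTF8)caf.", 0)]:
    mode = utf8_requested(pattern, options)
    print(pattern, mode, len(units_seen(subject, mode)))
```

In byte mode the subject has five units, so a single-character wildcard after "caf" would consume only the first byte of the two-byte sequence. In UTF-8 mode the subject has four units and the wildcard consumes the whole character.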
# Line 252  digits, spaces, or word characters remai Line 253  digits, spaces, or word characters remai
253  values less than 256. This remains true even when PCRE includes Unicode  values less than 256. This remains true even when PCRE includes Unicode
254  property support, because to do otherwise would slow down PCRE in many common  property support, because to do otherwise would slow down PCRE in many common
255  cases. If you really want to test for a wider sense of, say, "digit", you  cases. If you really want to test for a wider sense of, say, "digit", you
256  must use Unicode property tests such as \ep{Nd}.  must use Unicode property tests such as \ep{Nd}. Note that this also applies to
257    \eb, because it is defined in terms of \ew and \eW.
258  .P  .P
259  7. Similarly, characters that match the POSIX named character classes are all  7. Similarly, characters that match the POSIX named character classes are all
260  low-valued characters.  low-valued characters.
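
Note on the hunk above: the added sentence says the same restriction applies to \b, because \b is defined in terms of \w and \W. The Python sketch below shows why that follows. It is a model of the rule, not PCRE code, and it assumes the default character tables, in which only ASCII letters, digits, and underscore count as word characters. The names `is_word_char` and `is_word_boundary` are hypothetical.

```python
def is_word_char(cp: int) -> bool:
    # Assumed default \w set: ASCII letters, digits, and underscore.
    # Unicode letters outside that set are not word characters here.
    return cp < 128 and (chr(cp).isalnum() or cp == ord("_"))


def is_word_boundary(text: str, pos: int) -> bool:
    # A boundary lies where exactly one neighbour is a word character.
    # Positions before the start and after the end count as non-word.
    before = pos > 0 and is_word_char(ord(text[pos - 1]))
    after = pos < len(text) and is_word_char(ord(text[pos]))
    return before != after


text = "caf\u00e9"
print([pos for pos in range(len(text) + 1) if is_word_boundary(text, pos)])
```

Under this model a boundary is reported between "f" and the accented letter, and none at the end of the string, because the accented letter falls outside the assumed \w set. A Unicode-aware test would have to be written with property tests such as \p{L} instead.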
# Line 289  two digits 10, at the domain cam.ac.uk. Line 291  two digits 10, at the domain cam.ac.uk.
291  .rs  .rs
292  .sp  .sp
293  .nf  .nf
294  Last updated: 12 April 2008  Last updated: 11 April 2009
295  Copyright (c) 1997-2008 University of Cambridge.  Copyright (c) 1997-2009 University of Cambridge.
296  .fi  .fi