/[pcre]/code/trunk/doc/pcrepattern.3
ViewVC logotype

Diff of /code/trunk/doc/pcrepattern.3

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1411 by ph10, Tue Dec 3 16:27:00 2013 UTC revision 1412 by ph10, Sun Dec 15 17:01:46 2013 UTC
# Line 537  added VT at release 5.18, and PCRE follo Line 537  added VT at release 5.18, and PCRE follo
537  \es characters are now HT (9), LF (10), VT (11), FF (12), CR (13), and space  \es characters are now HT (9), LF (10), VT (11), FF (12), CR (13), and space
538  (32), which are defined as white space in the "C" locale. This list may vary if  (32), which are defined as white space in the "C" locale. This list may vary if
539  locale-specific matching is taking place. For example, in some locales the  locale-specific matching is taking place. For example, in some locales the
540  "non-breaking space" character (\exA0) is recognized as white space, and in  "non-breaking space" character (\exA0) is recognized as white space, and in
541  others the VT character is not.  others the VT character is not.
542  .P  .P
543  A "word" character is an underscore or any character that is a letter or digit.  A "word" character is an underscore or any character that is a letter or digit.
# Line 1406  points less than 128. Line 1406  points less than 128.
1406  .SH "COMPATIBILITY FEATURE FOR WORD BOUNDARIES"  .SH "COMPATIBILITY FEATURE FOR WORD BOUNDARIES"
1407  .rs  .rs
1408  .sp  .sp
1409  In the POSIX.2 compliant library that was included in 4.4BSD Unix, the ugly  In the POSIX.2 compliant library that was included in 4.4BSD Unix, the ugly
1410  syntax [[:<:]] and [[:>:]] is used for matching "start of word" and "end of  syntax [[:<:]] and [[:>:]] is used for matching "start of word" and "end of
1411  word". PCRE treats these items as follows:  word". PCRE treats these items as follows:
1412  .sp  .sp
1413    [[:<:]]  is converted to  \eb(?=\ew)    [[:<:]]  is converted to  \eb(?=\ew)
1414    [[:>:]]  is converted to  \eb(?<=\ew)    [[:>:]]  is converted to  \eb(?<=\ew)
1415  .sp  .sp
1416  Only these exact character sequences are recognized. A sequence such as  Only these exact character sequences are recognized. A sequence such as
1417  [a[:<:]b] provokes error for an unrecognized POSIX class name. This support is  [a[:<:]b] provokes error for an unrecognized POSIX class name. This support is
1418  not compatible with Perl. It is provided to help migrations from other  not compatible with Perl. It is provided to help migrations from other
1419  environments, and is best not used in any new patterns. Note that \eb matches  environments, and is best not used in any new patterns. Note that \eb matches
1420  at the start and the end of a word (see  at the start and the end of a word (see
1421  .\" HTML <a href="#smallassertions">  .\" HTML <a href="#smallassertions">
1422  .\" </a>  .\" </a>
1423  "Simple assertions"  "Simple assertions"
1424  .\"  .\"
1425  above), and in a Perl-style pattern the preceding or following character  above), and in a Perl-style pattern the preceding or following character
1426  normally shows which is wanted, without the need for the assertions that are  normally shows which is wanted, without the need for the assertions that are
1427  used above in order to give exactly the POSIX behaviour.  used above in order to give exactly the POSIX behaviour.
1428  .  .
1429  .  .

Legend:
Removed from v.1411  
changed lines
  Added in v.1412

  ViewVC Help
Powered by ViewVC 1.1.5