376 |
.rs |
.rs |
377 |
.sp |
.sp |
378 |
When PCRE is built with Unicode character property support, three additional |
When PCRE is built with Unicode character property support, three additional |
379 |
escape sequences to match character properties are available when UTF-8 mode |
escape sequences that match characters with specific properties are available. |
380 |
is selected. They are: |
When not in UTF-8 mode, these sequences are of course limited to testing |
381 |
|
characters whose codepoints are less than 256, but they do work in this mode. |
382 |
|
The extra escape sequences are: |
383 |
.sp |
.sp |
384 |
\ep{\fIxx\fP} a character with the \fIxx\fP property |
\ep{\fIxx\fP} a character with the \fIxx\fP property |
385 |
\eP{\fIxx\fP} a character without the \fIxx\fP property |
\eP{\fIxx\fP} a character without the \fIxx\fP property |
555 |
(see below). |
(see below). |
556 |
.\" |
.\" |
557 |
Characters with the "mark" property are typically accents that affect the |
Characters with the "mark" property are typically accents that affect the |
558 |
preceding character. |
preceding character. None of them have codepoints less than 256, so in |
559 |
|
non-UTF-8 mode \eX matches any one character. |
560 |
.P |
.P |
561 |
Matching characters by Unicode property is not fast, because PCRE has to search |
Matching characters by Unicode property is not fast, because PCRE has to search |
562 |
a structure that contains data for over fifteen thousand characters. That is |
a structure that contains data for over fifteen thousand characters. That is |
1976 |
.rs |
.rs |
1977 |
.sp |
.sp |
1978 |
.nf |
.nf |
1979 |
Last updated: 13 June 2007 |
Last updated: 19 June 2007 |
1980 |
Copyright (c) 1997-2007 University of Cambridge. |
Copyright (c) 1997-2007 University of Cambridge. |
1981 |
.fi |
.fi |