/[pcre]/code/trunk/doc/pcreapi.3
ViewVC logotype

Diff of /code/trunk/doc/pcreapi.3

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1022 by ph10, Tue Aug 28 12:28:15 2012 UTC revision 1194 by ph10, Wed Oct 31 17:42:29 2012 UTC
# Line 1  Line 1 
1  .TH PCREAPI 3 "28 August 2012" "PCRE 8.32"  .TH PCREAPI 3 "31 October 2012" "PCRE 8.32"
2  .SH NAME  .SH NAME
3  PCRE - Perl-compatible regular expressions  PCRE - Perl-compatible regular expressions
4  .sp  .sp
# Line 95  PCRE - Perl-compatible regular expressio Line 95  PCRE - Perl-compatible regular expressio
95  .SH "PCRE NATIVE API AUXILIARY FUNCTIONS"  .SH "PCRE NATIVE API AUXILIARY FUNCTIONS"
96  .rs  .rs
97  .sp  .sp
98    .B int pcre_jit_exec(const pcre *\fIcode\fP, "const pcre_extra *\fIextra\fP,"
99    .ti +5n
100    .B "const char *\fIsubject\fP," int \fIlength\fP, int \fIstartoffset\fP,
101    .ti +5n
102    .B int \fIoptions\fP, int *\fIovector\fP, int \fIovecsize\fP,
103    .ti +5n
104    .B pcre_jit_stack *\fIjstack\fP);
105    .PP
106  .B pcre_jit_stack *pcre_jit_stack_alloc(int \fIstartsize\fP, int \fImaxsize\fP);  .B pcre_jit_stack *pcre_jit_stack_alloc(int \fIstartsize\fP, int \fImaxsize\fP);
107  .PP  .PP
108  .B void pcre_jit_stack_free(pcre_jit_stack *\fIstack\fP);  .B void pcre_jit_stack_free(pcre_jit_stack *\fIstack\fP);
# Line 134  PCRE - Perl-compatible regular expressio Line 142  PCRE - Perl-compatible regular expressio
142  .B int (*pcre_callout)(pcre_callout_block *);  .B int (*pcre_callout)(pcre_callout_block *);
143  .  .
144  .  .
145  .SH "PCRE 8-BIT AND 16-BIT LIBRARIES"  .SH "PCRE 8-BIT, 16-BIT AND 32-BIT LIBRARIES"
146  .rs  .rs
147  .sp  .sp
148  From release 8.30, PCRE can be compiled as a library for handling 16-bit  From release 8.30, PCRE can be compiled as a library for handling 16-bit
149  character strings as well as, or instead of, the original library that handles  character strings as well as, or instead of, the original library that handles
150  8-bit character strings. To avoid too much complication, this document  8-bit character strings. From release 8.32, PCRE can also be compiled as a
151  describes the 8-bit versions of the functions, with only occasional references  library for handling 32-bit character strings. To avoid too much complication,
152  to the 16-bit library.  this document describes the 8-bit versions of the functions, with only
153  .P  occasional references to the 16-bit and 32-bit libraries.
154  The 16-bit functions operate in the same way as their 8-bit counterparts; they  .P
155  just use different data types for their arguments and results, and their names  The 16-bit and 32-bit functions operate in the same way as their 8-bit
156  start with \fBpcre16_\fP instead of \fBpcre_\fP. For every option that has UTF8  counterparts; they just use different data types for their arguments and
157  in its name (for example, PCRE_UTF8), there is a corresponding 16-bit name with  results, and their names start with \fBpcre16_\fP or \fBpcre32_\fP instead of
158  UTF8 replaced by UTF16. This facility is in fact just cosmetic; the 16-bit  \fBpcre_\fP. For every option that has UTF8 in its name (for example,
159  option names define the same bit values.  PCRE_UTF8), there are corresponding 16-bit and 32-bit names with UTF8 replaced
160    by UTF16 or UTF32, respectively. This facility is in fact just cosmetic; the
161    16-bit and 32-bit option names define the same bit values.
162  .P  .P
163  References to bytes and UTF-8 in this document should be read as references to  References to bytes and UTF-8 in this document should be read as references to
164  16-bit data quantities and UTF-16 when using the 16-bit library, unless  16-bit data quantities and UTF-16 when using the 16-bit library, or 32-bit data
165  specified otherwise. More details of the specific differences for the 16-bit  quantities and UTF-32 when using the 32-bit library, unless specified
166  library are given in the  otherwise. More details of the specific differences for the 16-bit and 32-bit
167    libraries are given in the
168  .\" HREF  .\" HREF
169  \fBpcre16\fP  \fBpcre16\fP
170  .\"  .\"
171  page.  and
172    .\" HREF
173    \fBpcre32\fP
174    .\"
175    pages.
176  .  .
177  .  .
178  .SH "PCRE API OVERVIEW"  .SH "PCRE API OVERVIEW"
# Line 213  used if available, by setting an option Line 228  used if available, by setting an option
228  relevant. More complicated programs might need to make use of the functions  relevant. More complicated programs might need to make use of the functions
229  \fBpcre_jit_stack_alloc()\fP, \fBpcre_jit_stack_free()\fP, and  \fBpcre_jit_stack_alloc()\fP, \fBpcre_jit_stack_free()\fP, and
230  \fBpcre_assign_jit_stack()\fP in order to control the JIT code's memory usage.  \fBpcre_assign_jit_stack()\fP in order to control the JIT code's memory usage.
231  These functions are discussed in the  .P
232    From release 8.32 there is also a direct interface for JIT execution, which
233    gives improved performance. The JIT-specific functions are discussed in the
234  .\" HREF  .\" HREF
235  \fBpcrejit\fP  \fBpcrejit\fP
236  .\"  .\"
# Line 392  not recognized. The following informatio Line 409  not recognized. The following informatio
409    PCRE_CONFIG_UTF8    PCRE_CONFIG_UTF8
410  .sp  .sp
411  The output is an integer that is set to one if UTF-8 support is available;  The output is an integer that is set to one if UTF-8 support is available;
412  otherwise it is set to zero. If this option is given to the 16-bit version of  otherwise it is set to zero. This value should normally be given to the 8-bit
413  this function, \fBpcre16_config()\fP, the result is PCRE_ERROR_BADOPTION.  version of this function, \fBpcre_config()\fP. If it is given to the 16-bit
414    or 32-bit version of this function, the result is PCRE_ERROR_BADOPTION.
415  .sp  .sp
416    PCRE_CONFIG_UTF16    PCRE_CONFIG_UTF16
417  .sp  .sp
418  The output is an integer that is set to one if UTF-16 support is available;  The output is an integer that is set to one if UTF-16 support is available;
419  otherwise it is set to zero. This value should normally be given to the 16-bit  otherwise it is set to zero. This value should normally be given to the 16-bit
420  version of this function, \fBpcre16_config()\fP. If it is given to the 8-bit  version of this function, \fBpcre16_config()\fP. If it is given to the 8-bit
421  version of this function, the result is PCRE_ERROR_BADOPTION.  or 32-bit version of this function, the result is PCRE_ERROR_BADOPTION.
422    .sp
423      PCRE_CONFIG_UTF32
424    .sp
425    The output is an integer that is set to one if UTF-32 support is available;
426    otherwise it is set to zero. This value should normally be given to the 32-bit
427    version of this function, \fBpcre32_config()\fP. If it is given to the 8-bit
428    or 16-bit version of this function, the result is PCRE_ERROR_BADOPTION.
429  .sp  .sp
430    PCRE_CONFIG_UNICODE_PROPERTIES    PCRE_CONFIG_UNICODE_PROPERTIES
431  .sp  .sp
# Line 422  unaligned)". If JIT support is not avail Line 447  unaligned)". If JIT support is not avail
447    PCRE_CONFIG_NEWLINE    PCRE_CONFIG_NEWLINE
448  .sp  .sp
449  The output is an integer whose value specifies the default character sequence  The output is an integer whose value specifies the default character sequence
450  that is recognized as meaning "newline". The four values that are supported  that is recognized as meaning "newline". The values that are supported in
451  are: 10 for LF, 13 for CR, 3338 for CRLF, -2 for ANYCRLF, and -1 for ANY.  ASCII/Unicode environments are: 10 for LF, 13 for CR, 3338 for CRLF, -2 for
452  Though they are derived from ASCII, the same values are returned in EBCDIC  ANYCRLF, and -1 for ANY. In EBCDIC environments, CR, ANYCRLF, and ANY yield the
453  environments. The default should normally correspond to the standard sequence  same values. However, the value for LF is normally 21, though some EBCDIC
454  for your operating system.  environments use 37. The corresponding values for CRLF are 3349 and 3365. The
455    default should normally correspond to the standard sequence for your operating
456    system.
457  .sp  .sp
458    PCRE_CONFIG_BSR    PCRE_CONFIG_BSR
459  .sp  .sp
# Line 440  or CRLF. The default can be overridden w Line 467  or CRLF. The default can be overridden w
467  The output is an integer that contains the number of bytes used for internal  The output is an integer that contains the number of bytes used for internal
468  linkage in compiled regular expressions. For the 8-bit library, the value can  linkage in compiled regular expressions. For the 8-bit library, the value can
469  be 2, 3, or 4. For the 16-bit library, the value is either 2 or 4 and is still  be 2, 3, or 4. For the 16-bit library, the value is either 2 or 4 and is still
470  a number of bytes. The default value of 2 is sufficient for all but the most  a number of bytes. For the 32-bit library, the value is either 2 or 4 and is
471  massive patterns, since it allows the compiled pattern to be up to 64K in size.  still a number of bytes. The default value of 2 is sufficient for all but the
472  Larger values allow larger regular expressions to be compiled, at the expense  most massive patterns, since it allows the compiled pattern to be up to 64K in
473  of slower matching.  size. Larger values allow larger regular expressions to be compiled, at the
474    expense of slower matching.
475  .sp  .sp
476    PCRE_CONFIG_POSIX_MALLOC_THRESHOLD    PCRE_CONFIG_POSIX_MALLOC_THRESHOLD
477  .sp  .sp
# Line 739  indicated by a single character (CR or L Line 767  indicated by a single character (CR or L
767  PCRE_NEWLINE_CRLF specifies that a newline is indicated by the two-character  PCRE_NEWLINE_CRLF specifies that a newline is indicated by the two-character
768  CRLF sequence. Setting PCRE_NEWLINE_ANYCRLF specifies that any of the three  CRLF sequence. Setting PCRE_NEWLINE_ANYCRLF specifies that any of the three
769  preceding sequences should be recognized. Setting PCRE_NEWLINE_ANY specifies  preceding sequences should be recognized. Setting PCRE_NEWLINE_ANY specifies
770  that any Unicode newline sequence should be recognized. The Unicode newline  that any Unicode newline sequence should be recognized.
771  sequences are the three just mentioned, plus the single characters VT (vertical  .P
772  tab, U+000B), FF (form feed, U+000C), NEL (next line, U+0085), LS (line  In an ASCII/Unicode environment, the Unicode newline sequences are the three
773  separator, U+2028), and PS (paragraph separator, U+2029). For the 8-bit  just mentioned, plus the single characters VT (vertical tab, U+000B), FF (form
774  library, the last two are recognized only in UTF-8 mode.  feed, U+000C), NEL (next line, U+0085), LS (line separator, U+2028), and PS
775    (paragraph separator, U+2029). For the 8-bit library, the last two are
776    recognized only in UTF-8 mode.
777    .P
778    When PCRE is compiled to run in an EBCDIC (mainframe) environment, the code for
779    CR is 0x0d, the same as ASCII. However, the character code for LF is normally
780    0x15, though in some EBCDIC environments 0x25 is used. Whichever of these is
781    not LF is made to correspond to Unicode's NEL character. EBCDIC codes are all
782    less than 256. For more details, see the
783    .\" HREF
784    \fBpcrebuild\fP
785    .\"
786    documentation.
787  .P  .P
788  The newline setting in the options word uses three bits that are treated  The newline setting in the options word uses three bits that are treated
789  as a number, giving eight possibilities. Currently only six are used (default  as a number, giving eight possibilities. Currently only six are used (default
# Line 819  page. Line 859  page.
859  .sp  .sp
860    PCRE_NO_UTF8_CHECK    PCRE_NO_UTF8_CHECK
861  .sp  .sp
862  When PCRE_UTF8 is set, the validity of the pattern as a UTF-8  When PCRE_UTF8 is set, the validity of the pattern as a UTF-8 string is
863  string is automatically checked. There is a discussion about the  automatically checked. There is a discussion about the
864  .\" HTML <a href="pcreunicode.html#utf8strings">  .\" HTML <a href="pcreunicode.html#utf8strings">
865  .\" </a>  .\" </a>
866  validity of UTF-8 strings  validity of UTF-8 strings
# Line 835  this check for performance reasons, you Line 875  this check for performance reasons, you
875  When it is set, the effect of passing an invalid UTF-8 string as a pattern is  When it is set, the effect of passing an invalid UTF-8 string as a pattern is
876  undefined. It may cause your program to crash. Note that this option can also  undefined. It may cause your program to crash. Note that this option can also
877  be passed to \fBpcre_exec()\fP and \fBpcre_dfa_exec()\fP, to suppress the  be passed to \fBpcre_exec()\fP and \fBpcre_dfa_exec()\fP, to suppress the
878  validity checking of subject strings.  validity checking of subject strings only. If the same string is being matched
879    many times, the option can be safely set for the second and subsequent
880    matchings to improve performance.
881  .  .
882  .  .
883  .SH "COMPILATION ERROR CODES"  .SH "COMPILATION ERROR CODES"
# Line 844  validity checking of subject strings. Line 886  validity checking of subject strings.
886  The following table lists the error codes than may be returned by  The following table lists the error codes than may be returned by
887  \fBpcre_compile2()\fP, along with the error messages that may be returned by  \fBpcre_compile2()\fP, along with the error messages that may be returned by
888  both compiling functions. Note that error messages are always 8-bit ASCII  both compiling functions. Note that error messages are always 8-bit ASCII
889  strings, even in 16-bit mode. As PCRE has developed, some error codes have  strings, even in 16-bit or 32-bit mode. As PCRE has developed, some error codes
890  fallen out of use. To avoid confusion, they have not been re-used.  have fallen out of use. To avoid confusion, they have not been re-used.
891  .sp  .sp
892     0  no error     0  no error
893     1  \e at end of pattern     1  \e at end of pattern
# Line 928  fallen out of use. To avoid confusion, t Line 970  fallen out of use. To avoid confusion, t
970    74  invalid UTF-16 string (specifically UTF-16)    74  invalid UTF-16 string (specifically UTF-16)
971    75  name is too long in (*MARK), (*PRUNE), (*SKIP), or (*THEN)    75  name is too long in (*MARK), (*PRUNE), (*SKIP), or (*THEN)
972    76  character value in \eu.... sequence is too large    76  character value in \eu.... sequence is too large
973      77  invalid UTF-32 string (specifically UTF-32)
974  .sp  .sp
975  The numbers 32 and 10000 in errors 48 and 49 are defaults; different values may  The numbers 32 and 10000 in errors 48 and 49 are defaults; different values may
976  be used if the limits were changed when PCRE was built.  be used if the limits were changed when PCRE was built.
# Line 1033  find out the value in a calling program Line 1076  find out the value in a calling program
1076  Studying a pattern is also useful for non-anchored patterns that do not have a  Studying a pattern is also useful for non-anchored patterns that do not have a
1077  single fixed starting character. A bitmap of possible starting bytes is  single fixed starting character. A bitmap of possible starting bytes is
1078  created. This speeds up finding a position in the subject at which to start  created. This speeds up finding a position in the subject at which to start
1079  matching. (In 16-bit mode, the bitmap is used for 16-bit values less than 256.)  matching. (In 16-bit mode, the bitmap is used for 16-bit values less than 256.
1080    In 32-bit mode, the bitmap is used for 32-bit values less than 256.)
1081  .P  .P
1082  These two optimizations apply to both \fBpcre_exec()\fP and  These two optimizations apply to both \fBpcre_exec()\fP and
1083  \fBpcre_dfa_exec()\fP, and the information is also used by the JIT compiler.  \fBpcre_dfa_exec()\fP, and the information is also used by the JIT compiler.
# Line 1178  variable. Line 1222  variable.
1222  .P  .P
1223  If there is a fixed first value, for example, the letter "c" from a pattern  If there is a fixed first value, for example, the letter "c" from a pattern
1224  such as (cat|cow|coyote), its value is returned. In the 8-bit library, the  such as (cat|cow|coyote), its value is returned. In the 8-bit library, the
1225  value is always less than 256; in the 16-bit library the value can be up to  value is always less than 256. In the 16-bit library the value can be up to
1226  0xffff.  0xffff. In the 32-bit library the value can be up to 0x10ffff.
1227  .P  .P
1228  If there is no fixed first value, and if either  If there is no fixed first value, and if either
1229  .sp  .sp
# Line 1192  starts with "^", or Line 1236  starts with "^", or
1236  -1 is returned, indicating that the pattern matches only at the start of a  -1 is returned, indicating that the pattern matches only at the start of a
1237  subject string or after any newline within the string. Otherwise -2 is  subject string or after any newline within the string. Otherwise -2 is
1238  returned. For anchored patterns, -2 is returned.  returned. For anchored patterns, -2 is returned.
1239    .P
1240    Since for the 32-bit library using the non-UTF-32 mode, this function is unable
1241    to return the full 32-bit range of the character, this value is deprecated;
1242    instead the PCRE_INFO_FIRSTCHARACTERFLAGS and PCRE_INFO_FIRSTCHARACTER values
1243    should be used.
1244  .sp  .sp
1245    PCRE_INFO_FIRSTTABLE    PCRE_INFO_FIRSTTABLE
1246  .sp  .sp
# Line 1239  value, -1 is returned. For anchored patt Line 1288  value, -1 is returned. For anchored patt
1288  only if it follows something of variable length. For example, for the pattern  only if it follows something of variable length. For example, for the pattern
1289  /^a\ed+z\ed+/ the returned value is "z", but for /^a\edz\ed/ the returned value  /^a\ed+z\ed+/ the returned value is "z", but for /^a\edz\ed/ the returned value
1290  is -1.  is -1.
1291    .P
1292    Since for the 32-bit library using the non-UTF-32 mode, this function is unable
1293    to return the full 32-bit range of the character, this value is deprecated;
1294    instead the PCRE_INFO_REQUIREDCHARFLAGS and PCRE_INFO_REQUIREDCHAR values should
1295    be used.
1296  .sp  .sp
1297    PCRE_INFO_MAXLOOKBEHIND    PCRE_INFO_MAXLOOKBEHIND
1298  .sp  .sp
# Line 1278  length of the longest name. PCRE_INFO_NA Line 1332  length of the longest name. PCRE_INFO_NA
1332  entry of the table. This is a pointer to \fBchar\fP in the 8-bit library, where  entry of the table. This is a pointer to \fBchar\fP in the 8-bit library, where
1333  the first two bytes of each entry are the number of the capturing parenthesis,  the first two bytes of each entry are the number of the capturing parenthesis,
1334  most significant byte first. In the 16-bit library, the pointer points to  most significant byte first. In the 16-bit library, the pointer points to
1335  16-bit data units, the first of which contains the parenthesis number. The rest  16-bit data units, the first of which contains the parenthesis number.
1336    In the 32-bit library, the pointer points to 32-bit data units, the first of
1337    which contains the parenthesis number. The rest
1338  of the entry is the corresponding name, zero terminated.  of the entry is the corresponding name, zero terminated.
1339  .P  .P
1340  The names are in alphabetical order. Duplicate names may appear if (?| is used  The names are in alphabetical order. Duplicate names may appear if (?| is used
# Line 1380  is made available via this option so tha Line 1436  is made available via this option so tha
1436  \fBpcreprecompile\fP  \fBpcreprecompile\fP
1437  .\"  .\"
1438  documentation for details).  documentation for details).
1439    .sp
1440      PCRE_INFO_FIRSTCHARACTERFLAGS
1441    .sp
1442    Return information about the first data unit of any matched string, for a
1443    non-anchored pattern. The fourth argument should point to an \fBint\fP
1444    variable.
1445    .P
1446    If there is a fixed first value, for example, the letter "c" from a pattern
1447    such as (cat|cow|coyote), 1 is returned, and the character value can be
1448    retrieved using PCRE_INFO_FIRSTCHARACTER.
1449    .P
1450    If there is no fixed first value, and if either
1451    .sp
1452    (a) the pattern was compiled with the PCRE_MULTILINE option, and every branch
1453    starts with "^", or
1454    .sp
1455    (b) every branch of the pattern starts with ".*" and PCRE_DOTALL is not set
1456    (if it were set, the pattern would be anchored),
1457    .sp
1458    2 is returned, indicating that the pattern matches only at the start of a
1459    subject string or after any newline within the string. Otherwise 0 is
1460    returned. For anchored patterns, 0 is returned.
1461    .sp
1462      PCRE_INFO_FIRSTCHARACTER
1463    .sp
1464    Return the fixed first character value, if PCRE_INFO_FIRSTCHARACTERFLAGS
1465    returned 1; otherwise returns 0. The fourth argument should point to an
1466    \fBuint_t\fP variable.
1467    .P
1468    In the 8-bit library, the value is always less than 256. In the 16-bit library
1469    the value can be up to 0xffff. In the 32-bit library in UTF-32 mode the value
1470    can be up to 0x10ffff, and up to 0xffffffff when not using UTF-32 mode.
1471    .P
1472    If there is no fixed first value, and if either
1473    .sp
1474    (a) the pattern was compiled with the PCRE_MULTILINE option, and every branch
1475    starts with "^", or
1476    .sp
1477    (b) every branch of the pattern starts with ".*" and PCRE_DOTALL is not set
1478    (if it were set, the pattern would be anchored),
1479    .sp
1480    -1 is returned, indicating that the pattern matches only at the start of a
1481    subject string or after any newline within the string. Otherwise -2 is
1482    returned. For anchored patterns, -2 is returned.
1483    .sp
1484      PCRE_INFO_REQUIREDCHARFLAGS
1485    .sp
1486    Returns 1 if there is a rightmost literal data unit that must exist in any
1487    matched string, other than at its start. The fourth argument should  point to
1488    an \fBint\fP variable. If there is no such value, 0 is returned. If returning
1489    1, the character value itself can be retrieved using PCRE_INFO_REQUIREDCHAR.
1490    .P
1491    For anchored patterns, a last literal value is recorded only if it follows
1492    something of variable length. For example, for the pattern /^a\ed+z\ed+/ the
1493    returned value 1 (with "z" returned from PCRE_INFO_REQUIREDCHAR), but for
1494    /^a\edz\ed/ the returned value is 0.
1495    .sp
1496      PCRE_INFO_REQUIREDCHAR
1497    .sp
1498    Return the value of the rightmost literal data unit that must exist in any
1499    matched string, other than at its start, if such a value has been recorded. The
1500    fourth argument should point to an \fBuint32_t\fP variable. If there is no such
1501    value, 0 is returned.
1502  .  .
1503  .  .
1504  .SH "REFERENCE COUNTS"  .SH "REFERENCE COUNTS"
# Line 1476  fields (not necessarily in this order): Line 1595  fields (not necessarily in this order):
1595  .sp  .sp
1596  In the 16-bit version of this structure, the \fImark\fP field has type  In the 16-bit version of this structure, the \fImark\fP field has type
1597  "PCRE_UCHAR16 **".  "PCRE_UCHAR16 **".
1598    .sp
1599    In the 32-bit version of this structure, the \fImark\fP field has type
1600    "PCRE_UCHAR32 **".
1601  .P  .P
1602  The \fIflags\fP field is used to specify which of the other fields are set. The  The \fIflags\fP field is used to specify which of the other fields are set. The
1603  flag bits are:  flag bits are:
# Line 2112  documentation for more details. Line 2234  documentation for more details.
2234    PCRE_ERROR_BADMODE        (-28)    PCRE_ERROR_BADMODE        (-28)
2235  .sp  .sp
2236  This error is given if a pattern that was compiled by the 8-bit library is  This error is given if a pattern that was compiled by the 8-bit library is
2237  passed to a 16-bit library function, or vice versa.  passed to a 16-bit or 32-bit library function, or vice versa.
2238  .sp  .sp
2239    PCRE_ERROR_BADENDIANNESS  (-29)    PCRE_ERROR_BADENDIANNESS  (-29)
2240  .sp  .sp
# Line 2120  This error is given if a pattern that wa Line 2242  This error is given if a pattern that wa
2242  host with different endianness. The utility function  host with different endianness. The utility function
2243  \fBpcre_pattern_to_host_byte_order()\fP can be used to convert such a pattern  \fBpcre_pattern_to_host_byte_order()\fP can be used to convert such a pattern
2244  so that it runs on the new host.  so that it runs on the new host.
2245    .sp
2246      PCRE_ERROR_JIT_BADOPTION
2247    .sp
2248    This error is returned when a pattern that was successfully studied using a JIT
2249    compile option is being matched, but the matching mode (partial or complete
2250    match) does not correspond to any JIT compilation mode. When the JIT fast path
2251    function is used, this error may be also given for invalid options. See the
2252    .\" HREF
2253    \fBpcrejit\fP
2254    .\"
2255    documentation for more details.
2256    .sp
2257      PCRE_ERROR_BADLENGTH      (-32)
2258    .sp
2259    This error is given if \fBpcre_exec()\fP is called with a negative value for
2260    the \fIlength\fP argument.
2261  .P  .P
2262  Error numbers -16 to -20, -22, and -30 are not used by \fBpcre_exec()\fP.  Error numbers -16 to -20, -22, and 30 are not used by \fBpcre_exec()\fP.
2263  .  .
2264  .  .
2265  .\" HTML <a name="badutf8reasons"></a>  .\" HTML <a name="badutf8reasons"></a>
# Line 2133  for the 16-bit library is given in the Line 2271  for the 16-bit library is given in the
2271  .\" HREF  .\" HREF
2272  \fBpcre16\fP  \fBpcre16\fP
2273  .\"  .\"
2274    page. The corresponding information for the 32-bit library is given in the
2275    .\" HREF
2276    \fBpcre32\fP
2277    .\"
2278  page.  page.
2279  .P  .P
2280  When \fBpcre_exec()\fP returns either PCRE_ERROR_BADUTF8 or  When \fBpcre_exec()\fP returns either PCRE_ERROR_BADUTF8 or
# Line 2203  character. Line 2345  character.
2345  .sp  .sp
2346  The first byte of a character has the value 0xfe or 0xff. These values can  The first byte of a character has the value 0xfe or 0xff. These values can
2347  never occur in a valid UTF-8 string.  never occur in a valid UTF-8 string.
2348    .sp
2349      PCRE_UTF8_ERR2
2350    .sp
2351    Non-character. These are the last two characters in each plane (0xfffe, 0xffff,
2352    0x1fffe, 0x1ffff .. 0x10fffe, 0x10ffff), and the characters 0xfdd0..0xfdef.
2353  .  .
2354  .  .
2355  .SH "EXTRACTING CAPTURED SUBSTRINGS BY NUMBER"  .SH "EXTRACTING CAPTURED SUBSTRINGS BY NUMBER"
# Line 2651  fail, this error is given. Line 2798  fail, this error is given.
2798  .SH "SEE ALSO"  .SH "SEE ALSO"
2799  .rs  .rs
2800  .sp  .sp
2801  \fBpcre16\fP(3), \fBpcrebuild\fP(3), \fBpcrecallout\fP(3), \fBpcrecpp(3)\fP(3),  \fBpcre16\fP(3), \fBpcre32\fP(3), \fBpcrebuild\fP(3), \fBpcrecallout\fP(3),
2802  \fBpcrematching\fP(3), \fBpcrepartial\fP(3), \fBpcreposix\fP(3),  \fBpcrecpp(3)\fP(3), \fBpcrematching\fP(3), \fBpcrepartial\fP(3),
2803  \fBpcreprecompile\fP(3), \fBpcresample\fP(3), \fBpcrestack\fP(3).  \fBpcreposix\fP(3), \fBpcreprecompile\fP(3), \fBpcresample\fP(3),
2804    \fBpcrestack\fP(3).
2805  .  .
2806  .  .
2807  .SH AUTHOR  .SH AUTHOR
# Line 2670  Cambridge CB2 3QH, England. Line 2818  Cambridge CB2 3QH, England.
2818  .rs  .rs
2819  .sp  .sp
2820  .nf  .nf
2821  Last updated: 28 August 2012  Last updated: 31 October 2012
2822  Copyright (c) 1997-2012 University of Cambridge.  Copyright (c) 1997-2012 University of Cambridge.
2823  .fi  .fi

Legend:
Removed from v.1022  
changed lines
  Added in v.1194

  ViewVC Help
Powered by ViewVC 1.1.5