--- code/trunk/doc/html/pcreapi.html 2007/08/08 14:24:50 210
+++ code/trunk/doc/html/pcreapi.html 2007/08/09 09:52:43 211
@@ -613,14 +613,17 @@
PCRE_NO_UTF8_CHECK
When PCRE_UTF8 is set, the validity of the pattern as a UTF-8 string is
-automatically checked. If an invalid UTF-8 sequence of bytes is found,
-pcre_compile() returns an error. If you already know that your pattern is
-valid, and you want to skip this check for performance reasons, you can set the
-PCRE_NO_UTF8_CHECK option. When it is set, the effect of passing an invalid
-UTF-8 string as a pattern is undefined. It may cause your program to crash.
-Note that this option can also be passed to pcre_exec() and
-pcre_dfa_exec(), to suppress the UTF-8 validity checking of subject
-strings.
+automatically checked. There is a discussion about the
+validity of UTF-8 strings
+in the main
+pcre
+page. If an invalid UTF-8 sequence of bytes is found, pcre_compile()
+returns an error. If you already know that your pattern is valid, and you want
+to skip this check for performance reasons, you can set the PCRE_NO_UTF8_CHECK
+option. When it is set, the effect of passing an invalid UTF-8 string as a
+pattern is undefined. It may cause your program to crash. Note that this option
+can also be passed to pcre_exec() and pcre_dfa_exec(), to suppress
+the UTF-8 validity checking of subject strings.


COMPILATION ERROR CODES

@@ -1247,9 +1250,14 @@
When PCRE_UTF8 is set at compile time, the validity of the subject as a UTF-8
string is automatically checked when pcre_exec() is subsequently called.
The value of startoffset is also checked to ensure that it points to the
-start of a UTF-8 character. If an invalid UTF-8 sequence of bytes is found,
-pcre_exec() returns the error PCRE_ERROR_BADUTF8. If startoffset
-contains an invalid value, PCRE_ERROR_BADUTF8_OFFSET is returned.
+start of a UTF-8 character. There is a discussion about the validity of UTF-8
+strings in the
+section on UTF-8 support
+in the main
+pcre
+page. If an invalid UTF-8 sequence of bytes is found, pcre_exec() returns
+the error PCRE_ERROR_BADUTF8. If startoffset contains an invalid value,
+PCRE_ERROR_BADUTF8_OFFSET is returned.

If you already know that your subject is valid, and you want to skip these
@@ -1875,7 +1883,7 @@


REVISION

-Last updated: 30 July 2007
+Last updated: 09 August 2007
Copyright © 1997-2007 University of Cambridge.
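
For readers of this change, the two checks described in the hunks above (validity of a byte string as UTF-8, and whether startoffset points to the start of a character) can be illustrated with a minimal sketch. This is not PCRE's internal code: the helper names is_valid_utf8() and is_char_start() are hypothetical, and the notion of validity used here is an assumption based on RFC 3629 (no overlong forms, no surrogates, nothing above U+10FFFF). The definition that PCRE actually applies is the one discussed on the main pcre page.

```c
#include <stddef.h>

/* Hypothetical helper, not part of the PCRE API.
   Returns 1 if the first len bytes of s are valid UTF-8 in the
   RFC 3629 sense (an assumption of this sketch), otherwise 0. */
int is_valid_utf8(const unsigned char *s, size_t len)
{
    size_t i = 0;
    size_t k;

    while (i < len) {
        unsigned char c = s[i];
        size_t extra;        /* continuation bytes expected after c */
        unsigned long cp;    /* code point being decoded */
        unsigned long min;   /* smallest code point for this length */

        if (c < 0x80) { i++; continue; }   /* plain ASCII byte */
        else if ((c & 0xE0) == 0xC0) { extra = 1; cp = c & 0x1F; min = 0x80; }
        else if ((c & 0xF0) == 0xE0) { extra = 2; cp = c & 0x0F; min = 0x800; }
        else if ((c & 0xF8) == 0xF0) { extra = 3; cp = c & 0x07; min = 0x10000; }
        else return 0;       /* stray continuation byte or bad lead byte */

        if (extra > len - i - 1) return 0;           /* truncated sequence */
        for (k = 1; k <= extra; k++) {
            unsigned char cc = s[i + k];
            if ((cc & 0xC0) != 0x80) return 0;       /* not 10xxxxxx */
            cp = (cp << 6) | (cc & 0x3F);
        }
        if (cp < min) return 0;                      /* overlong encoding */
        if (cp > 0x10FFFF) return 0;                 /* beyond Unicode range */
        if (cp >= 0xD800 && cp <= 0xDFFF) return 0;  /* surrogate */
        i += extra + 1;
    }
    return 1;
}

/* Hypothetical helper, not part of the PCRE API.
   Returns 1 if offset is at the start of a character (or at the end
   of the string), 0 if it lands on a continuation byte or is out of
   range. Assumes s has already been validated. */
int is_char_start(const unsigned char *s, size_t len, size_t offset)
{
    if (offset > len) return 0;
    if (offset == len) return 1;
    return (s[offset] & 0xC0) != 0x80;
}
```

A caller that validates its input once with checks like these is in the situation the documentation describes: it already knows the string is valid, so passing PCRE_NO_UTF8_CHECK to skip the repeated check would be safe.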