34 |
---------------------- |
---------------------- |
35 |
|
|
36 |
If you install PCRE in the normal way, you will end up with an installed set of |
If you install PCRE in the normal way, you will end up with an installed set of |
37 |
man pages whose names all start with "pcre". The one that is called "pcre" |
man pages whose names all start with "pcre". The one that is just called "pcre" |
38 |
lists all the others. In addition to these man pages, the PCRE documentation is |
lists all the others. In addition to these man pages, the PCRE documentation is |
39 |
supplied in two other forms; however, as there is no standard place to install |
supplied in two other forms; however, as there is no standard place to install |
40 |
them, they are left in the doc directory of the unpacked source distribution. |
them, they are left in the doc directory of the unpacked source distribution. |
114 |
. If, in addition to support for UTF-8 character strings, you want to include |
. If, in addition to support for UTF-8 character strings, you want to include |
115 |
support for the \P, \p, and \X sequences that recognize Unicode character |
support for the \P, \p, and \X sequences that recognize Unicode character |
116 |
properties, you must add --enable-unicode-properties to the "configure" |
properties, you must add --enable-unicode-properties to the "configure" |
117 |
command. This adds about 90K to the size of the library (in the form of a |
command. This adds about 30K to the size of the library (in the form of a |
118 |
property table); only the basic two-letter properties such as Lu are |
property table); only the basic two-letter properties such as Lu are |
119 |
supported. |
supported. |
120 |
|
|
121 |
. You can build PCRE to recognize either CR or LF as the newline character, |
. You can build PCRE to recognize either CR or LF or the sequence CRLF as |
122 |
instead of whatever your compiler uses for "\n", by adding --newline-is-cr or |
indicating the end of a line. Whatever you specify at build time is the |
123 |
--newline-is-lf to the "configure" command, respectively. Only do this if you |
default; the caller of PCRE can change the selection at run time. The default |
124 |
really understand what you are doing. On traditional Unix-like systems, the |
newline indicator is a single LF character (the Unix standard). You can |
125 |
newline character is LF. |
specify the default newline indicator by adding --newline-is-cr or |
126 |
|
--newline-is-lf or --newline-is-crlf to the "configure" command, |
127 |
|
respectively. |
128 |
|
|
129 |
. When called via the POSIX interface, PCRE uses malloc() to get additional |
. When called via the POSIX interface, PCRE uses malloc() to get additional |
130 |
storage for processing capturing parentheses if there are more than 10 of |
storage for processing capturing parentheses if there are more than 10 of |
144 |
pcre_exec() can supply their own value. There is discussion on the pcreapi |
pcre_exec() can supply their own value. There is discussion on the pcreapi |
145 |
man page. |
man page. |
146 |
|
|
147 |
|
. There is a separate counter that limits the depth of recursive function calls |
148 |
|
during a matching process. This also has a default of ten million, which is |
149 |
|
essentially "unlimited". You can change the default by setting, for example, |
150 |
|
|
151 |
|
--with-match-limit-recursion=500000 |
152 |
|
|
153 |
|
Recursive function calls use up the runtime stack; running out of stack can |
154 |
|
cause programs to crash in strange ways. There is a discussion about stack |
155 |
|
sizes in the pcrestack man page. |
156 |
|
|
157 |
. The default maximum compiled pattern size is around 64K. You can increase |
. The default maximum compiled pattern size is around 64K. You can increase |
158 |
this by adding --with-link-size=3 to the "configure" command. You can |
this by adding --with-link-size=3 to the "configure" command. You can |
159 |
increase it even more by setting --with-link-size=4, but this is unlikely |
increase it even more by setting --with-link-size=4, but this is unlikely |
177 |
|
|
178 |
The "configure" script builds eight files for the basic C library: |
The "configure" script builds eight files for the basic C library: |
179 |
|
|
|
. pcre.h is the header file for C programs that call PCRE |
|
180 |
. Makefile is the makefile that builds the library |
. Makefile is the makefile that builds the library |
181 |
. config.h contains build-time configuration options for the library |
. config.h contains build-time configuration options for the library |
182 |
. pcre-config is a script that shows the settings of "configure" options |
. pcre-config is a script that shows the settings of "configure" options |
443 |
pcre_info.c ) |
pcre_info.c ) |
444 |
pcre_maketables.c ) |
pcre_maketables.c ) |
445 |
pcre_ord2utf8.c ) |
pcre_ord2utf8.c ) |
446 |
pcre_printint.c ) |
pcre_refcount.c ) |
447 |
pcre_study.c ) |
pcre_study.c ) |
448 |
pcre_tables.c ) |
pcre_tables.c ) |
449 |
pcre_try_flipped.c ) |
pcre_try_flipped.c ) |
450 |
pcre_ucp_findchar.c ) |
pcre_ucp_searchfuncs.c) |
451 |
pcre_valid_utf8.c ) |
pcre_valid_utf8.c ) |
452 |
pcre_version.c ) |
pcre_version.c ) |
453 |
pcre_xclass.c ) |
pcre_xclass.c ) |
|
|
|
|
ucp_findchar.c ) |
|
|
ucp.h ) source for the code that is used for |
|
|
ucpinternal.h ) Unicode property handling |
|
454 |
ucptable.c ) |
ucptable.c ) |
|
ucptypetable.c ) |
|
455 |
|
|
456 |
pcre.in "source" for the header for the external API; pcre.h |
pcre_printint.src ) debugging function that is #included in pcretest, and |
457 |
is built from this by "configure" |
) can also be #included in pcre_compile() |
458 |
|
|
459 |
|
pcre.h the public PCRE header file |
460 |
pcreposix.h header for the external POSIX wrapper API |
pcreposix.h header for the external POSIX wrapper API |
461 |
pcre_internal.h header for internal use |
pcre_internal.h header for internal use |
462 |
|
ucp.h ) headers concerned with |
463 |
|
ucpinternal.h ) Unicode property handling |
464 |
config.in template for config.h, which is built by configure |
config.in template for config.h, which is built by configure |
465 |
|
|
466 |
pcrecpp.h the header file for the C++ wrapper |
pcrecpp.h the header file for the C++ wrapper |
487 |
RunGrepTest.in template for a Unix shell script for pcregrep tests |
RunGrepTest.in template for a Unix shell script for pcregrep tests |
488 |
config.guess ) files used by libtool, |
config.guess ) files used by libtool, |
489 |
config.sub ) used only when building a shared library |
config.sub ) used only when building a shared library |
490 |
|
config.h.in "source" for the config.h header file |
491 |
configure a configuring shell script (built by autoconf) |
configure a configuring shell script (built by autoconf) |
492 |
configure.in the autoconf input used to build configure |
configure.ac the autoconf input used to build configure |
493 |
doc/Tech.Notes notes on the encoding |
doc/Tech.Notes notes on the encoding |
494 |
doc/*.3 man page sources for the PCRE functions |
doc/*.3 man page sources for the PCRE functions |
495 |
doc/*.1 man page sources for pcregrep and pcretest |
doc/*.1 man page sources for pcregrep and pcretest |
517 |
|
|
518 |
libpcre.def |
libpcre.def |
519 |
libpcreposix.def |
libpcreposix.def |
|
pcre.def |
|
520 |
|
|
521 |
(D) Auxiliary file for VPASCAL |
(D) Auxiliary file for VPASCAL |
522 |
|
|
525 |
Philip Hazel |
Philip Hazel |
526 |
Email local part: ph10 |
Email local part: ph10 |
527 |
Email domain: cam.ac.uk |
Email domain: cam.ac.uk |
528 |
January 2006 |
June 2006 |