16 |
Files in the maint directory |
Files in the maint directory |
17 |
---------------------------- |
---------------------------- |
18 |
|
|
19 |
Builducptable A Perl script that creates the contents of the ucptable.h file |
Builducptable A Perl script that creates the contents of the ucptable.h file |
20 |
from two Unicode data files, which themselves are downloaded |
from two Unicode data files, which themselves are downloaded |
21 |
from the Unicode web site. Run this script in the "maint" |
from the Unicode web site. Run this script in the "maint" |
22 |
directory. |
directory. |
23 |
|
|
24 |
Tech.Notes Some notes about the internals of the PCRE code. |
ManyConfigTests A shell script that runs "configure, make, test" a number of |
25 |
|
times with different configuration settings. |
26 |
Unicode.tables The files in this directory, Scripts.txt and UnicodeData.txt, |
|
27 |
were downloaded from the Unicode web site. They contain |
Unicode.tables The files in this directory, Scripts.txt and UnicodeData.txt, |
28 |
information about Unicode characters and scripts. |
were downloaded from the Unicode web site. They contain |
29 |
|
information about Unicode characters and scripts. |
30 |
ucptest.c A short C program for testing the Unicode property functions in |
|
31 |
pcre_ucp_searchfuncs.c, mainly useful after rebuilding the |
ucptest.c A short C program for testing the Unicode property functions |
32 |
Unicode property table. Compile and run this in the "maint" |
in pcre_ucp_searchfuncs.c, mainly useful after rebuilding the |
33 |
directory. |
Unicode property table. Compile and run this in the "maint" |
34 |
|
directory. |
35 |
ucptestdata A directory containing two files, testinput1 and testoutput1, |
|
36 |
to use in conjunction with the ucptest program. |
ucptestdata A directory containing two files, testinput1 and testoutput1, |
37 |
|
to use in conjunction with the ucptest program. |
38 |
utf8.c A short, freestanding C program for converting a Unicode code |
|
39 |
point into a sequence of bytes in the UTF-8 encoding, and vice |
utf8.c A short, freestanding C program for converting a Unicode code |
40 |
versa. If its argument is a hex number such as 0x1234, it |
point into a sequence of bytes in the UTF-8 encoding, and vice |
41 |
outputs a list of the equivalent UTF-8 bytes. If its argument |
versa. If its argument is a hex number such as 0x1234, it |
42 |
is sequence of concatenated UTF-8 bytes (e.g. e188b4) it treats |
outputs a list of the equivalent UTF-8 bytes. If its argument |
43 |
them as a UTF-8 character and outputs the equivalent code point |
is sequence of concatenated UTF-8 bytes (e.g. e188b4) it |
44 |
in hex. |
treats them as a UTF-8 character and outputs the equivalent |
45 |
|
code point in hex. |
46 |
|
|
47 |
|
|
48 |
Updating to a new Unicode release |
Updating to a new Unicode release |
66 |
. Run ./autogen.sh to ensure everything is up-to-date. |
. Run ./autogen.sh to ensure everything is up-to-date. |
67 |
|
|
68 |
. Compile and test with many different config options, and combinations of |
. Compile and test with many different config options, and combinations of |
69 |
options: |
options. The maint/ManyConfigTests script now encapsulates this testing. |
|
|
|
|
* Totally standard ./configure with no options |
|
|
* --disable-shared |
|
|
* --disable-static |
|
|
* --enable-utf8 |
|
|
* --enable-unicode-properties |
|
|
* --disable-cpp |
|
|
* --with-link-size=3 (occasionally check with 4 as well) |
|
|
* --disable-stack-for-recursion |
|
|
* --enable-newline-is-any |
|
|
|
|
|
I've never automated this, but perhaps I should. The newline testing could be |
|
|
enhanced; at present, some tests fail unless plain LF is a newline. |
|
70 |
|
|
71 |
. Run perltest.pl on the test data for tests 1 and 4. The output should match |
. Run perltest.pl on the test data for tests 1 and 4. The output should match |
72 |
the PCRE test output, apart from the version identification at the top. The |
the PCRE test output, apart from the version identification at the top. The |
252 |
. Someone suggested --disable-callout to save code space when callouts are |
. Someone suggested --disable-callout to save code space when callouts are |
253 |
never wanted. This seems rather marginal. |
never wanted. This seems rather marginal. |
254 |
|
|
|
. Automate some of the testing before release into a script that compiles with |
|
|
different options and runs the tests in each case. |
|
|
|
|
|
. How about distributing a fixed pcre_chartables.c file and abandoning the |
|
|
on-the-fly generation using dftables. This will make cross-compiling easier, |
|
|
and in any case, locales are going out of fashion. |
|
|
|
|
255 |
Philip Hazel |
Philip Hazel |
256 |
Email local part: ph10 |
Email local part: ph10 |
257 |
Email domain: cam.ac.uk |
Email domain: cam.ac.uk |
258 |
Last updated: 12 March 2007 |
Last updated: 20 March 2007 |