16 |
Files in the maint directory |
Files in the maint directory |
17 |
---------------------------- |
---------------------------- |
18 |
|
|
19 |
|
----------------- This file is now OBSOLETE and no longer used ---------------- |
20 |
Builducptable A Perl script that creates the contents of the ucptable.h file |
Builducptable A Perl script that creates the contents of the ucptable.h file |
21 |
from two Unicode data files, which themselves are downloaded |
from two Unicode data files, which themselves are downloaded |
22 |
from the Unicode web site. Run this script in the "maint" |
from the Unicode web site. Run this script in the "maint" |
23 |
directory. |
directory. |
24 |
|
----------------- This file is now OBSOLETE and no longer used ---------------- |
25 |
|
|
26 |
|
GenerateUtt.py A Python script to generate part of the pcre_tables.c file |
27 |
|
that contains Unicode script names in a long string with |
28 |
|
offsets, which is tedious to maintain by hand. |
29 |
|
|
30 |
ManyConfigTests A shell script that runs "configure, make, test" a number of |
ManyConfigTests A shell script that runs "configure, make, test" a number of |
31 |
times with different configuration settings. |
times with different configuration settings. |
32 |
|
|
33 |
Unicode.tables The files in this directory, Scripts.txt and UnicodeData.txt, |
MultiStage2.py A Python script that generates the file pcre_ucd.c from three |
34 |
were downloaded from the Unicode web site. They contain |
Unicode data tables, which are themselves downloaded from the |
35 |
information about Unicode characters and scripts. |
Unicode web site. Run this script in the "maint" directory. |
36 |
|
The generated file contains the tables for a 2-stage lookup |
37 |
ucptest.c A short C program for testing the Unicode property functions |
of Unicode properties. |
38 |
in pcre_ucp_searchfuncs.c, mainly useful after rebuilding the |
|
39 |
Unicode property table. Compile and run this in the "maint" |
Unicode.tables The files in this directory, DerivedGeneralCategory.txt, |
40 |
directory. |
Scripts.txt and UnicodeData.txt, were downloaded from the |
41 |
|
Unicode web site. They contain information about Unicode |
42 |
|
characters and scripts. |
43 |
|
|
44 |
|
ucptest.c A short C program for testing the Unicode property macros |
45 |
|
that do lookups in the pcre_ucd.c data, mainly useful after |
46 |
|
rebuilding the Unicode property table. Compile and run this in |
47 |
|
the "maint" directory (see comments at its head). |
48 |
|
|
49 |
ucptestdata A directory containing two files, testinput1 and testoutput1, |
ucptestdata A directory containing two files, testinput1 and testoutput1, |
50 |
to use in conjunction with the ucptest program. |
to use in conjunction with the ucptest program. |
62 |
--------------------------------- |
--------------------------------- |
63 |
|
|
64 |
When there is a new release of Unicode, the files in Unicode.tables must be |
When there is a new release of Unicode, the files in Unicode.tables must be |
65 |
refreshed from the web site, and the Buildupctable script can then be run to |
refreshed from the web site. If the new version of Unicode adds new character |
66 |
generate a new version of ucptable.h. The ucptest program can be used to check |
scripts, the source file ucp.h and both the MultiStage2.py and the |
67 |
that the resulting table works properly, using the data files in ucptestdata to |
GenerateUtt.py scripts must be edited to add the new names. Then the |
68 |
check a number of test characters. |
MultiStage2.py script can then be run to generate a new version of pcre_ucd.c |
69 |
|
and the GenerateUtt.py can be run to generate the tricky tables for inclusion |
70 |
|
in pcre_tables.c. |
71 |
|
|
72 |
|
The ucptest program can then be compiled and used to check that the new tables |
73 |
|
in pcre_ucd.c work properly, using the data files in ucptestdata to check a |
74 |
|
number of test characters. |
75 |
|
|
76 |
|
|
77 |
Preparing for a PCRE release |
Preparing for a PCRE release |
263 |
Philip Hazel |
Philip Hazel |
264 |
Email local part: ph10 |
Email local part: ph10 |
265 |
Email domain: cam.ac.uk |
Email domain: cam.ac.uk |
266 |
Last updated: 27 December 2007 |
Last updated: 04 July 2008 |