/[pcre]/code/trunk/maint/README
ViewVC logotype

Diff of /code/trunk/maint/README

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 454 by ph10, Tue Sep 22 09:42:11 2009 UTC revision 491 by ph10, Mon Mar 1 17:45:08 2010 UTC
# Line 70  GenerateUtt.py scripts must be edited to Line 70  GenerateUtt.py scripts must be edited to
70  can be run to generate a new version of pcre_ucd.c, and GenerateUtt.py can be  can be run to generate a new version of pcre_ucd.c, and GenerateUtt.py can be
71  run to generate the tricky tables for inclusion in pcre_tables.c.  run to generate the tricky tables for inclusion in pcre_tables.c.
72    
73    If MultiStage2.py gives the error "ValueError: list.index(x): x not in list",
74    the cause is usually a missing (or misspelt) name in the list of scripts. I
75    couldn't find a straightforward list of scripts on the Unicode site, but
76    there's a useful Wikipedia page that list them, and notes the Unicode version
77    in which they were introduced:
78    
79    http://en.wikipedia.org/wiki/Unicode_scripts#Table_of_Unicode_scripts
80    
81  The ucptest program can be compiled and used to check that the new tables in  The ucptest program can be compiled and used to check that the new tables in
82  pcre_ucd.c work properly, using the data files in ucptestdata to check a number  pcre_ucd.c work properly, using the data files in ucptestdata to check a number
83  of test characters.  of test characters. The source file ucptest.c must be updated whenever new
84    Unicode script names are added.
85    
86    Note also that both the pcresyntax.3 and pcrepattern.3 man pages contain lists
87    of Unicode script names.
88    
89    
90  Preparing for a PCRE release  Preparing for a PCRE release
# Line 178  others are relatively new. Line 190  others are relatively new.
190      o A required byte from alternatives - not just the last char, but an      o A required byte from alternatives - not just the last char, but an
191        earlier one if common to all alternatives.        earlier one if common to all alternatives.
192    
     o Minimum length of subject needed (see also next . bullet).  
   
193      o Friedl contains other ideas.      o Friedl contains other ideas.
194    
 . There was a request for a way of finding the minimum subject length that can  
   match a given pattern. (If this were available, it could be usefully added  
   to study() - see above.) This is easy for simple cases, but I haven't figured  
   out how to handle recursion.  
   
195  . If Perl gets to a consistent state over the settings of capturing sub-  . If Perl gets to a consistent state over the settings of capturing sub-
196    patterns inside repeats, see if we can match it. One example of the    patterns inside repeats, see if we can match it. One example of the
197    difference is the matching of /(main(O)?)+/ against mainOmain, where PCRE    difference is the matching of /(main(O)?)+/ against mainOmain, where PCRE
# Line 308  others are relatively new. Line 313  others are relatively new.
313  Philip Hazel  Philip Hazel
314  Email local part: ph10  Email local part: ph10
315  Email domain: cam.ac.uk  Email domain: cam.ac.uk
316  Last updated: 20 September 2009  Last updated: 26 September 2009

Legend:
Removed from v.454  
changed lines
  Added in v.491

  ViewVC Help
Powered by ViewVC 1.1.5