ViewVC logotype

Diff of /code/trunk/NEWS

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 535 by ph10, Thu Jun 3 19:18:24 2010 UTC revision 1217 by ph10, Fri Nov 9 17:07:01 2012 UTC
# Line 1  Line 1 
1  News about PCRE releases  News about PCRE releases
2  ------------------------  ------------------------
4  Release 8.10 03-Jun-2010  Release 8.32 12-November-2012
5    -----------------------------
7    This release fixes a number of bugs, but also has some new features. These are
8    the highlights:
10    .  There is now support for 32-bit character strings and UTF-32. Like the
11       16-bit support, this is done by compiling a separate 32-bit library.
13    .  \X now matches a Unicode extended grapheme cluster.
15    .  Case-independent matching of Unicode characters that have more than one
16       "other case" now makes all three (or more) characters equivalent. This
17       applies, for example, to Greek Sigma, which has two lowercase versions.
19    .  Unicode character properties are updated to Unicode 6.2.0.
21    .  The EBCDIC support, which had decayed, has had a spring clean.
23    .  A number of JIT optimizations have been added, which give faster JIT
24       execution speed. In addition, a new direct interface to JIT execution is
25       available. This bypasses some of the sanity checks of pcre_exec() to give a
26       noticeable speed-up.
28    .  A number of issues in pcregrep have been fixed, making it more compatible
29       with GNU grep. In particular, --exclude and --include (and variants) apply
30       to all files now, not just those obtained from scanning a directory
31       recursively. In Windows environments, the default action for directories is
32       now "skip" instead of "read" (which provokes an error).
34    .  If the --only-matching (-o) option in pcregrep is specified multiple
35       times, each one causes appropriate output. For example, -o1 -o2 outputs the
36       substrings matched by the 1st and 2nd capturing parentheses. A separating
37       string can be specified by --om-separator (default empty).
39    .  When PCRE is built via Autotools using a version of gcc that has the
40       "visibility" feature, it is used to hide internal library functions that are
41       not part of the public API.
44    Release 8.31 06-July-2012
45    -------------------------
47    This is mainly a bug-fixing release, with a small number of developments:
49    . The JIT compiler now supports partial matching and the (*MARK) and
50      (*COMMIT) verbs.
52    . PCRE_INFO_MAXLOOKBEHIND can be used to find the longest lookbehind in a
53      pattern.
55    . There should be a performance improvement when using the heap instead of the
56      stack for recursion.
58    . pcregrep can now be linked with libedit as an alternative to libreadline.
60    . pcregrep now has a --file-list option where the list of files to scan is
61      given as a file.
63    . pcregrep now recognizes binary files and there are related options.
65    . The Unicode tables have been updated to 6.1.0.
67    As always, the full list of changes is in the ChangeLog file.
70    Release 8.30 04-February-2012
71    -----------------------------
73    Release 8.30 introduces a major new feature: support for 16-bit character
74    strings, compiled as a separate library. There are a few changes to the
75    8-bit library, in addition to some bug fixes.
77    . The pcre_info() function, which has been obsolete for over 10 years, has
78      been removed.
80    . When a compiled pattern was saved to a file and later reloaded on a host
81      with different endianness, PCRE used automatically to swap the bytes in some
82      of the data fields. With the advent of the 16-bit library, where more of this
83      swapping is needed, it is no longer done automatically. Instead, the bad
84      endianness is detected and a specific error is given. The user can then call
85      a new function called pcre_pattern_to_host_byte_order() (or an equivalent
86      16-bit function) to do the swap.
88    . In UTF-8 mode, the values 0xd800 to 0xdfff are not legal Unicode
89      code points and are now faulted. (They are the so-called "surrogates"
90      that are reserved for coding high values in UTF-16.)
93    Release 8.21 12-Dec-2011
94    ------------------------
96    This is almost entirely a bug-fix release. The only new feature is the ability
97    to obtain the size of the memory used by the JIT compiler.
100    Release 8.20 21-Oct-2011
101    ------------------------
103    The main change in this release is the inclusion of Zoltan Herczeg's
104    just-in-time compiler support, which can be accessed by building PCRE with
105    --enable-jit. Large performance benefits can be had in many situations. 8.20
106    also fixes an unfortunate bug that was introduced in 8.13 as well as tidying up
107    a number of infelicities and differences from Perl.
110    Release 8.13 16-Aug-2011
111    ------------------------
113    This is mainly a bug-fix release. There has been a lot of internal refactoring.
114    The Unicode tables have been updated. The only new feature in the library is
115    the passing of *MARK information to callouts. Some additions have been made to
116    pcretest to make testing easier and more comprehensive. There is a new option
117    for pcregrep to adjust its internal buffer size.
120    Release 8.12 15-Jan-2011
121    ------------------------
123    This release fixes some bugs in pcregrep, one of which caused the tests to fail
124    on 64-bit big-endian systems. There are no changes to the code of the library.
127    Release 8.11 10-Dec-2010
128    ------------------------
130    A number of bugs in the library and in pcregrep have been fixed. As always, see
131    ChangeLog for details. The following are the non-bug-fix changes:
133    . Added --match-limit and --recursion-limit to pcregrep.
135    . Added an optional parentheses number to the -o and --only-matching options
136      of pcregrep.
138    . Changed the way PCRE_PARTIAL_HARD affects the matching of $, \z, \Z, \b, and
139      \B.
141    . Added PCRE_ERROR_SHORTUTF8 to make it possible to distinguish between a
142      bad UTF-8 sequence and one that is incomplete when using PCRE_PARTIAL_HARD.
144    . Recognize (*NO_START_OPT) at the start of a pattern to set the PCRE_NO_
145      START_OPTIMIZE option, which is now allowed at compile time
148    Release 8.10 25-Jun-2010
149  ------------------------  ------------------------
151  There are two major additions: support for (*MAKR) and friends, and the option  There are two major additions: support for (*MARK) and friends, and the option
152  PCRE_UCP, which changes the behaviour of \b, \d, \s, and \w (and their  PCRE_UCP, which changes the behaviour of \b, \d, \s, and \w (and their
153  opposites) so that they make use of Unicode properties. There are also a number  opposites) so that they make use of Unicode properties. There are also a number
154  of lesser new features, and several bugs have been fixed. A new option,  of lesser new features, and several bugs have been fixed. A new option,

Removed from v.535  
changed lines
  Added in v.1217

  ViewVC Help
Powered by ViewVC 1.1.5