/[pcre]/code/trunk/testdata/testoutput15
ViewVC logotype

Contents of /code/trunk/testdata/testoutput15

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1098 - (show annotations)
Tue Oct 16 15:56:18 2012 UTC (7 years, 3 months ago) by chpe
File size: 27703 byte(s)
Error occurred while calculating annotation data.
pcre32: utf: Reject all non-characters and not just 0xfffe
1 /-- This set of tests is for UTF-8 support, and is relevant only to the 8-bit
2 library. --/
3
4 /X(\C{3})/8
5 X\x{1234}
6 0: X\x{1234}
7 1: \x{1234}
8
9 /X(\C{4})/8
10 X\x{1234}YZ
11 0: X\x{1234}Y
12 1: \x{1234}Y
13
14 /X\C*/8
15 XYZabcdce
16 0: XYZabcdce
17
18 /X\C*?/8
19 XYZabcde
20 0: X
21
22 /X\C{3,5}/8
23 Xabcdefg
24 0: Xabcde
25 X\x{1234}
26 0: X\x{1234}
27 X\x{1234}YZ
28 0: X\x{1234}YZ
29 X\x{1234}\x{512}
30 0: X\x{1234}\x{512}
31 X\x{1234}\x{512}YZ
32 0: X\x{1234}\x{512}
33
34 /X\C{3,5}?/8
35 Xabcdefg
36 0: Xabc
37 X\x{1234}
38 0: X\x{1234}
39 X\x{1234}YZ
40 0: X\x{1234}
41 X\x{1234}\x{512}
42 0: X\x{1234}
43
44 /a\Cb/8
45 aXb
46 0: aXb
47 a\nb
48 0: a\x{0a}b
49
50 /a\C\Cb/8
51 a\x{100}b
52 0: a\x{100}b
53
54 /ab\Cde/8
55 abXde
56 0: abXde
57
58 /a\C\Cb/8
59 a\x{100}b
60 0: a\x{100}b
61 ** Failers
62 No match
63 a\x{12257}b
64 No match
65
66 /[]/8
67 Failed: invalid UTF-8 string at offset 1
68
69 //8
70 Failed: invalid UTF-8 string at offset 0
71
72 /xxx/8
73 Failed: invalid UTF-8 string at offset 0
74
75 /xxx/8?DZSS
76 ------------------------------------------------------------------
77 Bra
78 \X{c0}\X{c0}\X{c0}xxx
79 Ket
80 End
81 ------------------------------------------------------------------
82 Capturing subpattern count = 0
83 Options: utf no_utf_check
84 First char = \x{c3}
85 Need char = 'x'
86
87 /abc/8
88 \xc3]
89 Error -10 (bad UTF-8 string) offset=0 reason=6
90 \xc3
91 Error -10 (bad UTF-8 string) offset=0 reason=1
92 \xc3\xc3\xc3
93 Error -10 (bad UTF-8 string) offset=0 reason=6
94 \xc3\xc3\xc3\?
95 No match
96 \xe1\x88
97 Error -10 (bad UTF-8 string) offset=0 reason=1
98 \P\xe1\x88
99 Error -10 (bad UTF-8 string) offset=0 reason=1
100 \P\P\xe1\x88
101 Error -25 (short UTF-8 string) offset=0 reason=1
102 XX\xea
103 Error -10 (bad UTF-8 string) offset=2 reason=2
104 \O0XX\xea
105 Error -10 (bad UTF-8 string)
106 \O1XX\xea
107 Error -10 (bad UTF-8 string)
108 \O2XX\xea
109 Error -10 (bad UTF-8 string) offset=2 reason=2
110 XX\xf1
111 Error -10 (bad UTF-8 string) offset=2 reason=3
112 XX\xf8
113 Error -10 (bad UTF-8 string) offset=2 reason=4
114 XX\xfc
115 Error -10 (bad UTF-8 string) offset=2 reason=5
116 ZZ\xea\xaf\x20YY
117 Error -10 (bad UTF-8 string) offset=2 reason=7
118 ZZ\xfd\xbf\xbf\x2f\xbf\xbfYY
119 Error -10 (bad UTF-8 string) offset=2 reason=8
120 ZZ\xfd\xbf\xbf\xbf\x2f\xbfYY
121 Error -10 (bad UTF-8 string) offset=2 reason=9
122 ZZ\xfd\xbf\xbf\xbf\xbf\x2fYY
123 Error -10 (bad UTF-8 string) offset=2 reason=10
124 ZZ\xffYY
125 Error -10 (bad UTF-8 string) offset=2 reason=21
126 ZZ\xfeYY
127 Error -10 (bad UTF-8 string) offset=2 reason=21
128
129 /anything/8
130 \xc0\x80
131 Error -10 (bad UTF-8 string) offset=0 reason=15
132 \xc1\x8f
133 Error -10 (bad UTF-8 string) offset=0 reason=15
134 \xe0\x9f\x80
135 Error -10 (bad UTF-8 string) offset=0 reason=16
136 \xf0\x8f\x80\x80
137 Error -10 (bad UTF-8 string) offset=0 reason=17
138 \xf8\x87\x80\x80\x80
139 Error -10 (bad UTF-8 string) offset=0 reason=18
140 \xfc\x83\x80\x80\x80\x80
141 Error -10 (bad UTF-8 string) offset=0 reason=19
142 \xfe\x80\x80\x80\x80\x80
143 Error -10 (bad UTF-8 string) offset=0 reason=21
144 \xff\x80\x80\x80\x80\x80
145 Error -10 (bad UTF-8 string) offset=0 reason=21
146 \xc3\x8f
147 No match
148 \xe0\xaf\x80
149 No match
150 \xe1\x80\x80
151 No match
152 \xf0\x9f\x80\x80
153 No match
154 \xf1\x8f\x80\x80
155 No match
156 \xf8\x88\x80\x80\x80
157 Error -10 (bad UTF-8 string) offset=0 reason=11
158 \xf9\x87\x80\x80\x80
159 Error -10 (bad UTF-8 string) offset=0 reason=11
160 \xfc\x84\x80\x80\x80\x80
161 Error -10 (bad UTF-8 string) offset=0 reason=12
162 \xfd\x83\x80\x80\x80\x80
163 Error -10 (bad UTF-8 string) offset=0 reason=12
164 \?\xf8\x88\x80\x80\x80
165 No match
166 \?\xf9\x87\x80\x80\x80
167 No match
168 \?\xfc\x84\x80\x80\x80\x80
169 No match
170 \?\xfd\x83\x80\x80\x80\x80
171 No match
172
173 /noncharacter/8
174 \x{fffe}
175 Error -10 (bad UTF-8 string) offset=0 reason=22
176 \x{ffff}
177 Error -10 (bad UTF-8 string) offset=0 reason=22
178 \x{1fffe}
179 Error -10 (bad UTF-8 string) offset=0 reason=22
180 \x{1ffff}
181 Error -10 (bad UTF-8 string) offset=0 reason=22
182 \x{2fffe}
183 Error -10 (bad UTF-8 string) offset=0 reason=22
184 \x{2ffff}
185 Error -10 (bad UTF-8 string) offset=0 reason=22
186 \x{3fffe}
187 Error -10 (bad UTF-8 string) offset=0 reason=22
188 \x{3ffff}
189 Error -10 (bad UTF-8 string) offset=0 reason=22
190 \x{4fffe}
191 Error -10 (bad UTF-8 string) offset=0 reason=22
192 \x{4ffff}
193 Error -10 (bad UTF-8 string) offset=0 reason=22
194 \x{5fffe}
195 Error -10 (bad UTF-8 string) offset=0 reason=22
196 \x{5ffff}
197 Error -10 (bad UTF-8 string) offset=0 reason=22
198 \x{6fffe}
199 Error -10 (bad UTF-8 string) offset=0 reason=22
200 \x{6ffff}
201 Error -10 (bad UTF-8 string) offset=0 reason=22
202 \x{7fffe}
203 Error -10 (bad UTF-8 string) offset=0 reason=22
204 \x{7ffff}
205 Error -10 (bad UTF-8 string) offset=0 reason=22
206 \x{8fffe}
207 Error -10 (bad UTF-8 string) offset=0 reason=22
208 \x{8ffff}
209 Error -10 (bad UTF-8 string) offset=0 reason=22
210 \x{9fffe}
211 Error -10 (bad UTF-8 string) offset=0 reason=22
212 \x{9ffff}
213 Error -10 (bad UTF-8 string) offset=0 reason=22
214 \x{afffe}
215 Error -10 (bad UTF-8 string) offset=0 reason=22
216 \x{affff}
217 Error -10 (bad UTF-8 string) offset=0 reason=22
218 \x{bfffe}
219 Error -10 (bad UTF-8 string) offset=0 reason=22
220 \x{bffff}
221 Error -10 (bad UTF-8 string) offset=0 reason=22
222 \x{cfffe}
223 Error -10 (bad UTF-8 string) offset=0 reason=22
224 \x{cffff}
225 Error -10 (bad UTF-8 string) offset=0 reason=22
226 \x{dfffe}
227 Error -10 (bad UTF-8 string) offset=0 reason=22
228 \x{dffff}
229 Error -10 (bad UTF-8 string) offset=0 reason=22
230 \x{efffe}
231 Error -10 (bad UTF-8 string) offset=0 reason=22
232 \x{effff}
233 Error -10 (bad UTF-8 string) offset=0 reason=22
234 \x{ffffe}
235 Error -10 (bad UTF-8 string) offset=0 reason=22
236 \x{fffff}
237 Error -10 (bad UTF-8 string) offset=0 reason=22
238 \x{10fffe}
239 Error -10 (bad UTF-8 string) offset=0 reason=22
240 \x{10ffff}
241 Error -10 (bad UTF-8 string) offset=0 reason=22
242 \x{fdd0}
243 Error -10 (bad UTF-8 string) offset=0 reason=22
244 \x{fdd1}
245 Error -10 (bad UTF-8 string) offset=0 reason=22
246 \x{fdd2}
247 Error -10 (bad UTF-8 string) offset=0 reason=22
248 \x{fdd3}
249 Error -10 (bad UTF-8 string) offset=0 reason=22
250 \x{fdd4}
251 Error -10 (bad UTF-8 string) offset=0 reason=22
252 \x{fdd5}
253 Error -10 (bad UTF-8 string) offset=0 reason=22
254 \x{fdd6}
255 Error -10 (bad UTF-8 string) offset=0 reason=22
256 \x{fdd7}
257 Error -10 (bad UTF-8 string) offset=0 reason=22
258 \x{fdd8}
259 Error -10 (bad UTF-8 string) offset=0 reason=22
260 \x{fdd9}
261 Error -10 (bad UTF-8 string) offset=0 reason=22
262 \x{fdda}
263 Error -10 (bad UTF-8 string) offset=0 reason=22
264 \x{fddb}
265 Error -10 (bad UTF-8 string) offset=0 reason=22
266 \x{fddc}
267 Error -10 (bad UTF-8 string) offset=0 reason=22
268 \x{fddd}
269 Error -10 (bad UTF-8 string) offset=0 reason=22
270 \x{fdde}
271 Error -10 (bad UTF-8 string) offset=0 reason=22
272 \x{fddf}
273 Error -10 (bad UTF-8 string) offset=0 reason=22
274 \x{fde0}
275 Error -10 (bad UTF-8 string) offset=0 reason=22
276 \x{fde1}
277 Error -10 (bad UTF-8 string) offset=0 reason=22
278 \x{fde2}
279 Error -10 (bad UTF-8 string) offset=0 reason=22
280 \x{fde3}
281 Error -10 (bad UTF-8 string) offset=0 reason=22
282 \x{fde4}
283 Error -10 (bad UTF-8 string) offset=0 reason=22
284 \x{fde5}
285 Error -10 (bad UTF-8 string) offset=0 reason=22
286 \x{fde6}
287 Error -10 (bad UTF-8 string) offset=0 reason=22
288 \x{fde7}
289 Error -10 (bad UTF-8 string) offset=0 reason=22
290 \x{fde8}
291 Error -10 (bad UTF-8 string) offset=0 reason=22
292 \x{fde9}
293 Error -10 (bad UTF-8 string) offset=0 reason=22
294 \x{fdea}
295 Error -10 (bad UTF-8 string) offset=0 reason=22
296 \x{fdeb}
297 Error -10 (bad UTF-8 string) offset=0 reason=22
298 \x{fdec}
299 Error -10 (bad UTF-8 string) offset=0 reason=22
300 \x{fded}
301 Error -10 (bad UTF-8 string) offset=0 reason=22
302 \x{fdee}
303 Error -10 (bad UTF-8 string) offset=0 reason=22
304 \x{fdef}
305 Error -10 (bad UTF-8 string) offset=0 reason=22
306
307 /\x{100}/8DZ
308 ------------------------------------------------------------------
309 Bra
310 \x{100}
311 Ket
312 End
313 ------------------------------------------------------------------
314 Capturing subpattern count = 0
315 Options: utf
316 First char = \x{c4}
317 Need char = \x{80}
318
319 /\x{1000}/8DZ
320 ------------------------------------------------------------------
321 Bra
322 \x{1000}
323 Ket
324 End
325 ------------------------------------------------------------------
326 Capturing subpattern count = 0
327 Options: utf
328 First char = \x{e1}
329 Need char = \x{80}
330
331 /\x{10000}/8DZ
332 ------------------------------------------------------------------
333 Bra
334 \x{10000}
335 Ket
336 End
337 ------------------------------------------------------------------
338 Capturing subpattern count = 0
339 Options: utf
340 First char = \x{f0}
341 Need char = \x{80}
342
343 /\x{100000}/8DZ
344 ------------------------------------------------------------------
345 Bra
346 \x{100000}
347 Ket
348 End
349 ------------------------------------------------------------------
350 Capturing subpattern count = 0
351 Options: utf
352 First char = \x{f4}
353 Need char = \x{80}
354
355 /\x{10ffff}/8DZ
356 ------------------------------------------------------------------
357 Bra
358 \x{10ffff}
359 Ket
360 End
361 ------------------------------------------------------------------
362 Capturing subpattern count = 0
363 Options: utf
364 First char = \x{f4}
365 Need char = \x{bf}
366
367 /[\x{ff}]/8DZ
368 ------------------------------------------------------------------
369 Bra
370 \x{ff}
371 Ket
372 End
373 ------------------------------------------------------------------
374 Capturing subpattern count = 0
375 Options: utf
376 First char = \x{c3}
377 Need char = \x{bf}
378
379 /[\x{100}]/8DZ
380 ------------------------------------------------------------------
381 Bra
382 \x{100}
383 Ket
384 End
385 ------------------------------------------------------------------
386 Capturing subpattern count = 0
387 Options: utf
388 First char = \x{c4}
389 Need char = \x{80}
390
391 /\x80/8DZ
392 ------------------------------------------------------------------
393 Bra
394 \x{80}
395 Ket
396 End
397 ------------------------------------------------------------------
398 Capturing subpattern count = 0
399 Options: utf
400 First char = \x{c2}
401 Need char = \x{80}
402
403 /\xff/8DZ
404 ------------------------------------------------------------------
405 Bra
406 \x{ff}
407 Ket
408 End
409 ------------------------------------------------------------------
410 Capturing subpattern count = 0
411 Options: utf
412 First char = \x{c3}
413 Need char = \x{bf}
414
415 /\x{D55c}\x{ad6d}\x{C5B4}/DZ8
416 ------------------------------------------------------------------
417 Bra
418 \x{d55c}\x{ad6d}\x{c5b4}
419 Ket
420 End
421 ------------------------------------------------------------------
422 Capturing subpattern count = 0
423 Options: utf
424 First char = \x{ed}
425 Need char = \x{b4}
426 \x{D55c}\x{ad6d}\x{C5B4}
427 0: \x{d55c}\x{ad6d}\x{c5b4}
428
429 /\x{65e5}\x{672c}\x{8a9e}/DZ8
430 ------------------------------------------------------------------
431 Bra
432 \x{65e5}\x{672c}\x{8a9e}
433 Ket
434 End
435 ------------------------------------------------------------------
436 Capturing subpattern count = 0
437 Options: utf
438 First char = \x{e6}
439 Need char = \x{9e}
440 \x{65e5}\x{672c}\x{8a9e}
441 0: \x{65e5}\x{672c}\x{8a9e}
442
443 /\x{80}/DZ8
444 ------------------------------------------------------------------
445 Bra
446 \x{80}
447 Ket
448 End
449 ------------------------------------------------------------------
450 Capturing subpattern count = 0
451 Options: utf
452 First char = \x{c2}
453 Need char = \x{80}
454
455 /\x{084}/DZ8
456 ------------------------------------------------------------------
457 Bra
458 \x{84}
459 Ket
460 End
461 ------------------------------------------------------------------
462 Capturing subpattern count = 0
463 Options: utf
464 First char = \x{c2}
465 Need char = \x{84}
466
467 /\x{104}/DZ8
468 ------------------------------------------------------------------
469 Bra
470 \x{104}
471 Ket
472 End
473 ------------------------------------------------------------------
474 Capturing subpattern count = 0
475 Options: utf
476 First char = \x{c4}
477 Need char = \x{84}
478
479 /\x{861}/DZ8
480 ------------------------------------------------------------------
481 Bra
482 \x{861}
483 Ket
484 End
485 ------------------------------------------------------------------
486 Capturing subpattern count = 0
487 Options: utf
488 First char = \x{e0}
489 Need char = \x{a1}
490
491 /\x{212ab}/DZ8
492 ------------------------------------------------------------------
493 Bra
494 \x{212ab}
495 Ket
496 End
497 ------------------------------------------------------------------
498 Capturing subpattern count = 0
499 Options: utf
500 First char = \x{f0}
501 Need char = \x{ab}
502
503 /-- This one is here not because it's different to Perl, but because the way
504 the captured single-byte is displayed. (In Perl it becomes a character, and you
505 can't tell the difference.) --/
506
507 /X(\C)(.*)/8
508 X\x{1234}
509 0: X\x{1234}
510 1: \x{e1}
511 2: \x{88}\x{b4}
512 X\nabc
513 0: X\x{0a}abc
514 1: \x{0a}
515 2: abc
516
517 /-- This one is here because Perl gives out a grumbly error message (quite
518 correctly, but that messes up comparisons). --/
519
520 /a\Cb/8
521 *** Failers
522 No match
523 a\x{100}b
524 No match
525
526 /[^ab\xC0-\xF0]/8SDZ
527 ------------------------------------------------------------------
528 Bra
529 [\x00-`c-\xbf\xf1-\xff] (neg)
530 Ket
531 End
532 ------------------------------------------------------------------
533 Capturing subpattern count = 0
534 Options: utf
535 No first char
536 No need char
537 Subject length lower bound = 1
538 Starting byte set: \x00 \x01 \x02 \x03 \x04 \x05 \x06 \x07 \x08 \x09 \x0a
539 \x0b \x0c \x0d \x0e \x0f \x10 \x11 \x12 \x13 \x14 \x15 \x16 \x17 \x18 \x19
540 \x1a \x1b \x1c \x1d \x1e \x1f \x20 ! " # $ % & ' ( ) * + , - . / 0 1 2 3 4
541 5 6 7 8 9 : ; < = > ? @ A B C D E F G H I J K L M N O P Q R S T U V W X Y
542 Z [ \ ] ^ _ ` c d e f g h i j k l m n o p q r s t u v w x y z { | } ~ \x7f
543 \xc2 \xc3 \xc4 \xc5 \xc6 \xc7 \xc8 \xc9 \xca \xcb \xcc \xcd \xce \xcf \xd0
544 \xd1 \xd2 \xd3 \xd4 \xd5 \xd6 \xd7 \xd8 \xd9 \xda \xdb \xdc \xdd \xde \xdf
545 \xe0 \xe1 \xe2 \xe3 \xe4 \xe5 \xe6 \xe7 \xe8 \xe9 \xea \xeb \xec \xed \xee
546 \xef \xf0 \xf1 \xf2 \xf3 \xf4 \xf5 \xf6 \xf7 \xf8 \xf9 \xfa \xfb \xfc \xfd
547 \xfe \xff
548 \x{f1}
549 0: \x{f1}
550 \x{bf}
551 0: \x{bf}
552 \x{100}
553 0: \x{100}
554 \x{1000}
555 0: \x{1000}
556 *** Failers
557 0: *
558 \x{c0}
559 No match
560 \x{f0}
561 No match
562
563 /Ā{3,4}/8SDZ
564 ------------------------------------------------------------------
565 Bra
566 \x{100}{3}
567 \x{100}?
568 Ket
569 End
570 ------------------------------------------------------------------
571 Capturing subpattern count = 0
572 Options: utf
573 First char = \x{c4}
574 Need char = \x{80}
575 Subject length lower bound = 3
576 No set of starting bytes
577 \x{100}\x{100}\x{100}\x{100\x{100}
578 0: \x{100}\x{100}\x{100}
579
580 /(\x{100}+|x)/8SDZ
581 ------------------------------------------------------------------
582 Bra
583 CBra 1
584 \x{100}+
585 Alt
586 x
587 Ket
588 Ket
589 End
590 ------------------------------------------------------------------
591 Capturing subpattern count = 1
592 Options: utf
593 No first char
594 No need char
595 Subject length lower bound = 1
596 Starting byte set: x \xc4
597
598 /(\x{100}*a|x)/8SDZ
599 ------------------------------------------------------------------
600 Bra
601 CBra 1
602 \x{100}*+
603 a
604 Alt
605 x
606 Ket
607 Ket
608 End
609 ------------------------------------------------------------------
610 Capturing subpattern count = 1
611 Options: utf
612 No first char
613 No need char
614 Subject length lower bound = 1
615 Starting byte set: a x \xc4
616
617 /(\x{100}{0,2}a|x)/8SDZ
618 ------------------------------------------------------------------
619 Bra
620 CBra 1
621 \x{100}{0,2}
622 a
623 Alt
624 x
625 Ket
626 Ket
627 End
628 ------------------------------------------------------------------
629 Capturing subpattern count = 1
630 Options: utf
631 No first char
632 No need char
633 Subject length lower bound = 1
634 Starting byte set: a x \xc4
635
636 /(\x{100}{1,2}a|x)/8SDZ
637 ------------------------------------------------------------------
638 Bra
639 CBra 1
640 \x{100}
641 \x{100}{0,1}
642 a
643 Alt
644 x
645 Ket
646 Ket
647 End
648 ------------------------------------------------------------------
649 Capturing subpattern count = 1
650 Options: utf
651 No first char
652 No need char
653 Subject length lower bound = 1
654 Starting byte set: x \xc4
655
656 /\x{100}/8DZ
657 ------------------------------------------------------------------
658 Bra
659 \x{100}
660 Ket
661 End
662 ------------------------------------------------------------------
663 Capturing subpattern count = 0
664 Options: utf
665 First char = \x{c4}
666 Need char = \x{80}
667
668 /a\x{100}\x{101}*/8DZ
669 ------------------------------------------------------------------
670 Bra
671 a\x{100}
672 \x{101}*
673 Ket
674 End
675 ------------------------------------------------------------------
676 Capturing subpattern count = 0
677 Options: utf
678 First char = 'a'
679 Need char = \x{80}
680
681 /a\x{100}\x{101}+/8DZ
682 ------------------------------------------------------------------
683 Bra
684 a\x{100}
685 \x{101}+
686 Ket
687 End
688 ------------------------------------------------------------------
689 Capturing subpattern count = 0
690 Options: utf
691 First char = 'a'
692 Need char = \x{81}
693
694 /[^\x{c4}]/DZ
695 ------------------------------------------------------------------
696 Bra
697 [^\x{c4}]
698 Ket
699 End
700 ------------------------------------------------------------------
701 Capturing subpattern count = 0
702 No options
703 No first char
704 No need char
705
706 /[\x{100}]/8DZ
707 ------------------------------------------------------------------
708 Bra
709 \x{100}
710 Ket
711 End
712 ------------------------------------------------------------------
713 Capturing subpattern count = 0
714 Options: utf
715 First char = \x{c4}
716 Need char = \x{80}
717 \x{100}
718 0: \x{100}
719 Z\x{100}
720 0: \x{100}
721 \x{100}Z
722 0: \x{100}
723 *** Failers
724 No match
725
726 /[\xff]/DZ8
727 ------------------------------------------------------------------
728 Bra
729 \x{ff}
730 Ket
731 End
732 ------------------------------------------------------------------
733 Capturing subpattern count = 0
734 Options: utf
735 First char = \x{c3}
736 Need char = \x{bf}
737 >\x{ff}<
738 0: \x{ff}
739
740 /[^\xff]/8DZ
741 ------------------------------------------------------------------
742 Bra
743 [^\x{ff}]
744 Ket
745 End
746 ------------------------------------------------------------------
747 Capturing subpattern count = 0
748 Options: utf
749 No first char
750 No need char
751
752 /\x{100}abc(xyz(?1))/8DZ
753 ------------------------------------------------------------------
754 Bra
755 \x{100}abc
756 CBra 1
757 xyz
758 Recurse
759 Ket
760 Ket
761 End
762 ------------------------------------------------------------------
763 Capturing subpattern count = 1
764 Options: utf
765 First char = \x{c4}
766 Need char = 'z'
767
768 /a\x{1234}b/P8
769 a\x{1234}b
770 0: a\x{1234}b
771
772 /\777/8I
773 Capturing subpattern count = 0
774 Options: utf
775 First char = \x{c7}
776 Need char = \x{bf}
777 \x{1ff}
778 0: \x{1ff}
779 \777
780 0: \x{1ff}
781
782 /\x{100}+\x{200}/8DZ
783 ------------------------------------------------------------------
784 Bra
785 \x{100}++
786 \x{200}
787 Ket
788 End
789 ------------------------------------------------------------------
790 Capturing subpattern count = 0
791 Options: utf
792 First char = \x{c4}
793 Need char = \x{80}
794
795 /\x{100}+X/8DZ
796 ------------------------------------------------------------------
797 Bra
798 \x{100}++
799 X
800 Ket
801 End
802 ------------------------------------------------------------------
803 Capturing subpattern count = 0
804 Options: utf
805 First char = \x{c4}
806 Need char = 'X'
807
808 /^[\QĀ\E-\QŐ\E/BZ8
809 Failed: missing terminating ] for character class at offset 15
810
811 /-- This tests the stricter UTF-8 check according to RFC 3629. --/
812
813 /X/8
814 \x{0}\x{d7ff}\x{e000}\x{10ffff}
815 Error -10 (bad UTF-8 string) offset=7 reason=22
816 \x{d800}
817 Error -10 (bad UTF-8 string) offset=0 reason=14
818 \x{d800}\?
819 No match
820 \x{da00}
821 Error -10 (bad UTF-8 string) offset=0 reason=14
822 \x{da00}\?
823 No match
824 \x{dfff}
825 Error -10 (bad UTF-8 string) offset=0 reason=14
826 \x{dfff}\?
827 No match
828 \x{110000}
829 Error -10 (bad UTF-8 string) offset=0 reason=13
830 \x{110000}\?
831 No match
832 \x{2000000}
833 Error -10 (bad UTF-8 string) offset=0 reason=11
834 \x{2000000}\?
835 No match
836 \x{7fffffff}
837 Error -10 (bad UTF-8 string) offset=0 reason=12
838 \x{7fffffff}\?
839 No match
840
841 /(*UTF8)\x{1234}/
842 abcd\x{1234}pqr
843 0: \x{1234}
844
845 /(*CRLF)(*UTF8)(*BSR_UNICODE)a\Rb/I
846 Capturing subpattern count = 0
847 Options: bsr_unicode utf
848 Forced newline sequence: CRLF
849 First char = 'a'
850 Need char = 'b'
851
852 /\h/SI8
853 Capturing subpattern count = 0
854 Options: utf
855 No first char
856 No need char
857 Subject length lower bound = 1
858 Starting byte set: \x09 \x20 \xc2 \xe1 \xe2 \xe3
859 ABC\x{09}
860 0: \x{09}
861 ABC\x{20}
862 0:
863 ABC\x{a0}
864 0: \x{a0}
865 ABC\x{1680}
866 0: \x{1680}
867 ABC\x{180e}
868 0: \x{180e}
869 ABC\x{2000}
870 0: \x{2000}
871 ABC\x{202f}
872 0: \x{202f}
873 ABC\x{205f}
874 0: \x{205f}
875 ABC\x{3000}
876 0: \x{3000}
877
878 /\v/SI8
879 Capturing subpattern count = 0
880 Options: utf
881 No first char
882 No need char
883 Subject length lower bound = 1
884 Starting byte set: \x0a \x0b \x0c \x0d \xc2 \xe2
885 ABC\x{0a}
886 0: \x{0a}
887 ABC\x{0b}
888 0: \x{0b}
889 ABC\x{0c}
890 0: \x{0c}
891 ABC\x{0d}
892 0: \x{0d}
893 ABC\x{85}
894 0: \x{85}
895 ABC\x{2028}
896 0: \x{2028}
897
898 /\h*A/SI8
899 Capturing subpattern count = 0
900 Options: utf
901 No first char
902 Need char = 'A'
903 Subject length lower bound = 1
904 Starting byte set: \x09 \x20 A \xc2 \xe1 \xe2 \xe3
905 CDBABC
906 0: A
907
908 /\v+A/SI8
909 Capturing subpattern count = 0
910 Options: utf
911 No first char
912 Need char = 'A'
913 Subject length lower bound = 2
914 Starting byte set: \x0a \x0b \x0c \x0d \xc2 \xe2
915
916 /\s?xxx\s/8SI
917 Capturing subpattern count = 0
918 Options: utf
919 No first char
920 Need char = 'x'
921 Subject length lower bound = 4
922 Starting byte set: \x09 \x0a \x0c \x0d \x20 x
923
924 /\sxxx\s/I8ST1
925 Capturing subpattern count = 0
926 Options: utf
927 No first char
928 Need char = 'x'
929 Subject length lower bound = 5
930 Starting byte set: \x09 \x0a \x0c \x0d \x20 \xc2
931 AB\x{85}xxx\x{a0}XYZ
932 0: \x{85}xxx\x{a0}
933 AB\x{a0}xxx\x{85}XYZ
934 0: \x{a0}xxx\x{85}
935
936 /\S \S/I8ST1
937 Capturing subpattern count = 0
938 Options: utf
939 No first char
940 Need char = ' '
941 Subject length lower bound = 3
942 Starting byte set: \x00 \x01 \x02 \x03 \x04 \x05 \x06 \x07 \x08 \x0b \x0e
943 \x0f \x10 \x11 \x12 \x13 \x14 \x15 \x16 \x17 \x18 \x19 \x1a \x1b \x1c \x1d
944 \x1e \x1f ! " # $ % & ' ( ) * + , - . / 0 1 2 3 4 5 6 7 8 9 : ; < = > ? @
945 A B C D E F G H I J K L M N O P Q R S T U V W X Y Z [ \ ] ^ _ ` a b c d e
946 f g h i j k l m n o p q r s t u v w x y z { | } ~ \x7f \xc0 \xc1 \xc2 \xc3
947 \xc4 \xc5 \xc6 \xc7 \xc8 \xc9 \xca \xcb \xcc \xcd \xce \xcf \xd0 \xd1 \xd2
948 \xd3 \xd4 \xd5 \xd6 \xd7 \xd8 \xd9 \xda \xdb \xdc \xdd \xde \xdf \xe0 \xe1
949 \xe2 \xe3 \xe4 \xe5 \xe6 \xe7 \xe8 \xe9 \xea \xeb \xec \xed \xee \xef \xf0
950 \xf1 \xf2 \xf3 \xf4 \xf5 \xf6 \xf7 \xf8 \xf9 \xfa \xfb \xfc \xfd \xfe \xff
951 \x{a2} \x{84}
952 0: \x{a2} \x{84}
953 A Z
954 0: A Z
955
956 /a+/8
957 a\x{123}aa\>1
958 0: aa
959 a\x{123}aa\>2
960 Error -11 (bad UTF-8 offset)
961 a\x{123}aa\>3
962 0: aa
963 a\x{123}aa\>4
964 0: a
965 a\x{123}aa\>5
966 No match
967 a\x{123}aa\>6
968 Error -24 (bad offset value)
969
970 /\x{1234}+/iS8I
971 Capturing subpattern count = 0
972 Options: caseless utf
973 No first char
974 No need char
975 Subject length lower bound = 1
976 Starting byte set: \xe1
977
978 /\x{1234}+?/iS8I
979 Capturing subpattern count = 0
980 Options: caseless utf
981 No first char
982 No need char
983 Subject length lower bound = 1
984 Starting byte set: \xe1
985
986 /\x{1234}++/iS8I
987 Capturing subpattern count = 0
988 Options: caseless utf
989 No first char
990 No need char
991 Subject length lower bound = 1
992 Starting byte set: \xe1
993
994 /\x{1234}{2}/iS8I
995 Capturing subpattern count = 0
996 Options: caseless utf
997 No first char
998 No need char
999 Subject length lower bound = 2
1000 Starting byte set: \xe1
1001
1002 /[^\x{c4}]/8DZ
1003 ------------------------------------------------------------------
1004 Bra
1005 [^\x{c4}]
1006 Ket
1007 End
1008 ------------------------------------------------------------------
1009 Capturing subpattern count = 0
1010 Options: utf
1011 No first char
1012 No need char
1013
1014 /X+\x{200}/8DZ
1015 ------------------------------------------------------------------
1016 Bra
1017 X++
1018 \x{200}
1019 Ket
1020 End
1021 ------------------------------------------------------------------
1022 Capturing subpattern count = 0
1023 Options: utf
1024 First char = 'X'
1025 Need char = \x{80}
1026
1027 /\R/SI8
1028 Capturing subpattern count = 0
1029 Options: utf
1030 No first char
1031 No need char
1032 Subject length lower bound = 1
1033 Starting byte set: \x0a \x0b \x0c \x0d \xc2 \xe2
1034
1035 /\777/8DZ
1036 ------------------------------------------------------------------
1037 Bra
1038 \x{1ff}
1039 Ket
1040 End
1041 ------------------------------------------------------------------
1042 Capturing subpattern count = 0
1043 Options: utf
1044 First char = \x{c7}
1045 Need char = \x{bf}
1046
1047 /\w+\x{C4}/8BZ
1048 ------------------------------------------------------------------
1049 Bra
1050 \w++
1051 \x{c4}
1052 Ket
1053 End
1054 ------------------------------------------------------------------
1055 a\x{C4}\x{C4}
1056 0: a\x{c4}
1057
1058 /\w+\x{C4}/8BZT1
1059 ------------------------------------------------------------------
1060 Bra
1061 \w+
1062 \x{c4}
1063 Ket
1064 End
1065 ------------------------------------------------------------------
1066 a\x{C4}\x{C4}
1067 0: a\x{c4}\x{c4}
1068
1069 /\W+\x{C4}/8BZ
1070 ------------------------------------------------------------------
1071 Bra
1072 \W+
1073 \x{c4}
1074 Ket
1075 End
1076 ------------------------------------------------------------------
1077 !\x{C4}
1078 0: !\x{c4}
1079
1080 /\W+\x{C4}/8BZT1
1081 ------------------------------------------------------------------
1082 Bra
1083 \W++
1084 \x{c4}
1085 Ket
1086 End
1087 ------------------------------------------------------------------
1088 !\x{C4}
1089 0: !\x{c4}
1090
1091 /\W+\x{A1}/8BZ
1092 ------------------------------------------------------------------
1093 Bra
1094 \W+
1095 \x{a1}
1096 Ket
1097 End
1098 ------------------------------------------------------------------
1099 !\x{A1}
1100 0: !\x{a1}
1101
1102 /\W+\x{A1}/8BZT1
1103 ------------------------------------------------------------------
1104 Bra
1105 \W+
1106 \x{a1}
1107 Ket
1108 End
1109 ------------------------------------------------------------------
1110 !\x{A1}
1111 0: !\x{a1}
1112
1113 /X\s+\x{A0}/8BZ
1114 ------------------------------------------------------------------
1115 Bra
1116 X
1117 \s++
1118 \x{a0}
1119 Ket
1120 End
1121 ------------------------------------------------------------------
1122 X\x20\x{A0}\x{A0}
1123 0: X \x{a0}
1124
1125 /X\s+\x{A0}/8BZT1
1126 ------------------------------------------------------------------
1127 Bra
1128 X
1129 \s+
1130 \x{a0}
1131 Ket
1132 End
1133 ------------------------------------------------------------------
1134 X\x20\x{A0}\x{A0}
1135 0: X \x{a0}\x{a0}
1136
1137 /\S+\x{A0}/8BZ
1138 ------------------------------------------------------------------
1139 Bra
1140 \S+
1141 \x{a0}
1142 Ket
1143 End
1144 ------------------------------------------------------------------
1145 X\x{A0}\x{A0}
1146 0: X\x{a0}\x{a0}
1147
1148 /\S+\x{A0}/8BZT1
1149 ------------------------------------------------------------------
1150 Bra
1151 \S++
1152 \x{a0}
1153 Ket
1154 End
1155 ------------------------------------------------------------------
1156 X\x{A0}\x{A0}
1157 0: X\x{a0}
1158
1159 /\x{a0}+\s!/8BZ
1160 ------------------------------------------------------------------
1161 Bra
1162 \x{a0}++
1163 \s
1164 !
1165 Ket
1166 End
1167 ------------------------------------------------------------------
1168 \x{a0}\x20!
1169 0: \x{a0} !
1170
1171 /\x{a0}+\s!/8BZT1
1172 ------------------------------------------------------------------
1173 Bra
1174 \x{a0}+
1175 \s
1176 !
1177 Ket
1178 End
1179 ------------------------------------------------------------------
1180 \x{a0}\x20!
1181 0: \x{a0} !
1182
1183 /-- End of testinput15 --/

  ViewVC Help
Powered by ViewVC 1.1.5