blob: f0b3b151226553477658a8a63b5b42b54f7b1fb1 [file] [log] [blame]
Elliott Hughes4e19c8e2022-04-15 15:11:02 -07001.TH PCRE2JIT 3 "30 November 2021" "PCRE2 10.40"
Elliott Hughes5b808042021-10-01 10:56:10 -07002.SH NAME
3PCRE2 - Perl-compatible regular expressions (revised API)
4.SH "PCRE2 JUST-IN-TIME COMPILER SUPPORT"
5.rs
6.sp
7Just-in-time compiling is a heavyweight optimization that can greatly speed up
8pattern matching. However, it comes at the cost of extra processing before the
9match is performed, so it is of most benefit when the same pattern is going to
10be matched many times. This does not necessarily mean many calls of a matching
11function; if the pattern is not anchored, matching attempts may take place many
12times at various positions in the subject, even for a single call. Therefore,
13if the subject string is very long, it may still pay to use JIT even for
14one-off matches. JIT support is available for all of the 8-bit, 16-bit and
1532-bit PCRE2 libraries.
16.P
17JIT support applies only to the traditional Perl-compatible matching function.
18It does not apply when the DFA matching function is being used. The code for
19this support was written by Zoltan Herczeg.
20.
21.
22.SH "AVAILABILITY OF JIT SUPPORT"
23.rs
24.sp
25JIT support is an optional feature of PCRE2. The "configure" option
26--enable-jit (or equivalent CMake option) must be set when PCRE2 is built if
27you want to use JIT. The support is limited to the following hardware
28platforms:
29.sp
30 ARM 32-bit (v5, v7, and Thumb2)
31 ARM 64-bit
32 IBM s390x 64 bit
33 Intel x86 32-bit and 64-bit
34 MIPS 32-bit and 64-bit
35 Power PC 32-bit and 64-bit
36 SPARC 32-bit
37.sp
38If --enable-jit is set on an unsupported platform, compilation fails.
39.P
40A program can tell if JIT support is available by calling \fBpcre2_config()\fP
41with the PCRE2_CONFIG_JIT option. The result is 1 when JIT is available, and 0
42otherwise. However, a simple program does not need to check this in order to
43use JIT. The API is implemented in a way that falls back to the interpretive
44code if JIT is not available. For programs that need the best possible
45performance, there is also a "fast path" API that is JIT-specific.
46.
47.
48.SH "SIMPLE USE OF JIT"
49.rs
50.sp
51To make use of the JIT support in the simplest way, all you have to do is to
52call \fBpcre2_jit_compile()\fP after successfully compiling a pattern with
53\fBpcre2_compile()\fP. This function has two arguments: the first is the
54compiled pattern pointer that was returned by \fBpcre2_compile()\fP, and the
55second is zero or more of the following option bits: PCRE2_JIT_COMPLETE,
56PCRE2_JIT_PARTIAL_HARD, or PCRE2_JIT_PARTIAL_SOFT.
57.P
58If JIT support is not available, a call to \fBpcre2_jit_compile()\fP does
59nothing and returns PCRE2_ERROR_JIT_BADOPTION. Otherwise, the compiled pattern
60is passed to the JIT compiler, which turns it into machine code that executes
61much faster than the normal interpretive code, but yields exactly the same
62results. The returned value from \fBpcre2_jit_compile()\fP is zero on success,
63or a negative error code.
64.P
65There is a limit to the size of pattern that JIT supports, imposed by the size
66of machine stack that it uses. The exact rules are not documented because they
67may change at any time, in particular, when new optimizations are introduced.
68If a pattern is too big, a call to \fBpcre2_jit_compile()\fP returns
69PCRE2_ERROR_NOMEMORY.
70.P
71PCRE2_JIT_COMPLETE requests the JIT compiler to generate code for complete
72matches. If you want to run partial matches using the PCRE2_PARTIAL_HARD or
73PCRE2_PARTIAL_SOFT options of \fBpcre2_match()\fP, you should set one or both
74of the other options as well as, or instead of PCRE2_JIT_COMPLETE. The JIT
75compiler generates different optimized code for each of the three modes
76(normal, soft partial, hard partial). When \fBpcre2_match()\fP is called, the
77appropriate code is run if it is available. Otherwise, the pattern is matched
78using interpretive code.
79.P
80You can call \fBpcre2_jit_compile()\fP multiple times for the same compiled
81pattern. It does nothing if it has previously compiled code for any of the
82option bits. For example, you can call it once with PCRE2_JIT_COMPLETE and
83(perhaps later, when you find you need partial matching) again with
84PCRE2_JIT_COMPLETE and PCRE2_JIT_PARTIAL_HARD. This time it will ignore
85PCRE2_JIT_COMPLETE and just compile code for partial matching. If
86\fBpcre2_jit_compile()\fP is called with no option bits set, it immediately
87returns zero. This is an alternative way of testing whether JIT is available.
88.P
89At present, it is not possible to free JIT compiled code except when the entire
90compiled pattern is freed by calling \fBpcre2_code_free()\fP.
91.P
92In some circumstances you may need to call additional functions. These are
93described in the section entitled
94.\" HTML <a href="#stackcontrol">
95.\" </a>
96"Controlling the JIT stack"
97.\"
98below.
99.P
100There are some \fBpcre2_match()\fP options that are not supported by JIT, and
101there are also some pattern items that JIT cannot handle. Details are given
102below. In both cases, matching automatically falls back to the interpretive
103code. If you want to know whether JIT was actually used for a particular match,
104you should arrange for a JIT callback function to be set up as described in the
105section entitled
106.\" HTML <a href="#stackcontrol">
107.\" </a>
108"Controlling the JIT stack"
109.\"
110below, even if you do not need to supply a non-default JIT stack. Such a
111callback function is called whenever JIT code is about to be obeyed. If the
112match-time options are not right for JIT execution, the callback function is
113not obeyed.
114.P
115If the JIT compiler finds an unsupported item, no JIT data is generated. You
116can find out if JIT matching is available after compiling a pattern by calling
117\fBpcre2_pattern_info()\fP with the PCRE2_INFO_JITSIZE option. A non-zero
118result means that JIT compilation was successful. A result of 0 means that JIT
119support is not available, or the pattern was not processed by
120\fBpcre2_jit_compile()\fP, or the JIT compiler was not able to handle the
121pattern.
122.
123.
124.SH "MATCHING SUBJECTS CONTAINING INVALID UTF"
125.rs
126.sp
127When a pattern is compiled with the PCRE2_UTF option, subject strings are
128normally expected to be a valid sequence of UTF code units. By default, this is
129checked at the start of matching and an error is generated if invalid UTF is
130detected. The PCRE2_NO_UTF_CHECK option can be passed to \fBpcre2_match()\fP to
131skip the check (for improved performance) if you are sure that a subject string
132is valid. If this option is used with an invalid string, the result is
133undefined.
134.P
135However, a way of running matches on strings that may contain invalid UTF
136sequences is available. Calling \fBpcre2_compile()\fP with the
137PCRE2_MATCH_INVALID_UTF option has two effects: it tells the interpreter in
138\fBpcre2_match()\fP to support invalid UTF, and, if \fBpcre2_jit_compile()\fP
139is called, the compiled JIT code also supports invalid UTF. Details of how this
140support works, in both the JIT and the interpretive cases, is given in the
141.\" HREF
142\fBpcre2unicode\fP
143.\"
144documentation.
145.P
146There is also an obsolete option for \fBpcre2_jit_compile()\fP called
147PCRE2_JIT_INVALID_UTF, which currently exists only for backward compatibility.
148It is superseded by the \fBpcre2_compile()\fP option PCRE2_MATCH_INVALID_UTF
149and should no longer be used. It may be removed in future.
150.
151.
152.SH "UNSUPPORTED OPTIONS AND PATTERN ITEMS"
153.rs
154.sp
155The \fBpcre2_match()\fP options that are supported for JIT matching are
156PCRE2_COPY_MATCHED_SUBJECT, PCRE2_NOTBOL, PCRE2_NOTEOL, PCRE2_NOTEMPTY,
157PCRE2_NOTEMPTY_ATSTART, PCRE2_NO_UTF_CHECK, PCRE2_PARTIAL_HARD, and
158PCRE2_PARTIAL_SOFT. The PCRE2_ANCHORED and PCRE2_ENDANCHORED options are not
159supported at match time.
160.P
161If the PCRE2_NO_JIT option is passed to \fBpcre2_match()\fP it disables the
162use of JIT, forcing matching by the interpreter code.
163.P
164The only unsupported pattern items are \eC (match a single data unit) when
165running in a UTF mode, and a callout immediately before an assertion condition
166in a conditional group.
167.
168.
169.SH "RETURN VALUES FROM JIT MATCHING"
170.rs
171.sp
172When a pattern is matched using JIT matching, the return values are the same
173as those given by the interpretive \fBpcre2_match()\fP code, with the addition
174of one new error code: PCRE2_ERROR_JIT_STACKLIMIT. This means that the memory
175used for the JIT stack was insufficient. See
176.\" HTML <a href="#stackcontrol">
177.\" </a>
178"Controlling the JIT stack"
179.\"
180below for a discussion of JIT stack usage.
181.P
182The error code PCRE2_ERROR_MATCHLIMIT is returned by the JIT code if searching
183a very large pattern tree goes on for too long, as it is in the same
184circumstance when JIT is not used, but the details of exactly what is counted
185are not the same. The PCRE2_ERROR_DEPTHLIMIT error code is never returned
186when JIT matching is used.
187.
188.
189.\" HTML <a name="stackcontrol"></a>
190.SH "CONTROLLING THE JIT STACK"
191.rs
192.sp
193When the compiled JIT code runs, it needs a block of memory to use as a stack.
194By default, it uses 32KiB on the machine stack. However, some large or
195complicated patterns need more than this. The error PCRE2_ERROR_JIT_STACKLIMIT
196is given when there is not enough stack. Three functions are provided for
197managing blocks of memory for use as JIT stacks. There is further discussion
198about the use of JIT stacks in the section entitled
199.\" HTML <a href="#stackfaq">
200.\" </a>
201"JIT stack FAQ"
202.\"
203below.
204.P
205The \fBpcre2_jit_stack_create()\fP function creates a JIT stack. Its arguments
206are a starting size, a maximum size, and a general context (for memory
207allocation functions, or NULL for standard memory allocation). It returns a
208pointer to an opaque structure of type \fBpcre2_jit_stack\fP, or NULL if there
209is an error. The \fBpcre2_jit_stack_free()\fP function is used to free a stack
210that is no longer needed. If its argument is NULL, this function returns
211immediately, without doing anything. (For the technically minded: the address
212space is allocated by mmap or VirtualAlloc.) A maximum stack size of 512KiB to
2131MiB should be more than enough for any pattern.
214.P
215The \fBpcre2_jit_stack_assign()\fP function specifies which stack JIT code
216should use. Its arguments are as follows:
217.sp
218 pcre2_match_context *mcontext
219 pcre2_jit_callback callback
220 void *data
221.sp
222The first argument is a pointer to a match context. When this is subsequently
223passed to a matching function, its information determines which JIT stack is
224used. If this argument is NULL, the function returns immediately, without doing
225anything. There are three cases for the values of the other two options:
226.sp
227 (1) If \fIcallback\fP is NULL and \fIdata\fP is NULL, an internal 32KiB block
228 on the machine stack is used. This is the default when a match
229 context is created.
230.sp
231 (2) If \fIcallback\fP is NULL and \fIdata\fP is not NULL, \fIdata\fP must be
232 a pointer to a valid JIT stack, the result of calling
233 \fBpcre2_jit_stack_create()\fP.
234.sp
235 (3) If \fIcallback\fP is not NULL, it must point to a function that is
236 called with \fIdata\fP as an argument at the start of matching, in
237 order to set up a JIT stack. If the return from the callback
238 function is NULL, the internal 32KiB stack is used; otherwise the
239 return value must be a valid JIT stack, the result of calling
240 \fBpcre2_jit_stack_create()\fP.
241.sp
242A callback function is obeyed whenever JIT code is about to be run; it is not
243obeyed when \fBpcre2_match()\fP is called with options that are incompatible
244for JIT matching. A callback function can therefore be used to determine
245whether a match operation was executed by JIT or by the interpreter.
246.P
247You may safely use the same JIT stack for more than one pattern (either by
248assigning directly or by callback), as long as the patterns are matched
249sequentially in the same thread. Currently, the only way to set up
250non-sequential matches in one thread is to use callouts: if a callout function
251starts another match, that match must use a different JIT stack to the one used
252for currently suspended match(es).
253.P
Elliott Hughes4e19c8e2022-04-15 15:11:02 -0700254In a multithread application, if you do not specify a JIT stack, or if you
255assign or pass back NULL from a callback, that is thread-safe, because each
256thread has its own machine stack. However, if you assign or pass back a
257non-NULL JIT stack, this must be a different stack for each thread so that the
258application is thread-safe.
Elliott Hughes5b808042021-10-01 10:56:10 -0700259.P
260Strictly speaking, even more is allowed. You can assign the same non-NULL stack
261to a match context that is used by any number of patterns, as long as they are
262not used for matching by multiple threads at the same time. For example, you
263could use the same stack in all compiled patterns, with a global mutex in the
264callback to wait until the stack is available for use. However, this is an
265inefficient solution, and not recommended.
266.P
267This is a suggestion for how a multithreaded program that needs to set up
268non-default JIT stacks might operate:
269.sp
270 During thread initialization
271 thread_local_var = pcre2_jit_stack_create(...)
272.sp
273 During thread exit
274 pcre2_jit_stack_free(thread_local_var)
275.sp
276 Use a one-line callback function
277 return thread_local_var
278.sp
279All the functions described in this section do nothing if JIT is not available.
280.
281.
282.\" HTML <a name="stackfaq"></a>
283.SH "JIT STACK FAQ"
284.rs
285.sp
286(1) Why do we need JIT stacks?
287.sp
288PCRE2 (and JIT) is a recursive, depth-first engine, so it needs a stack where
289the local data of the current node is pushed before checking its child nodes.
290Allocating real machine stack on some platforms is difficult. For example, the
291stack chain needs to be updated every time if we extend the stack on PowerPC.
292Although it is possible, its updating time overhead decreases performance. So
293we do the recursion in memory.
294.P
295(2) Why don't we simply allocate blocks of memory with \fBmalloc()\fP?
296.sp
297Modern operating systems have a nice feature: they can reserve an address space
298instead of allocating memory. We can safely allocate memory pages inside this
299address space, so the stack could grow without moving memory data (this is
300important because of pointers). Thus we can allocate 1MiB address space, and
301use only a single memory page (usually 4KiB) if that is enough. However, we can
302still grow up to 1MiB anytime if needed.
303.P
304(3) Who "owns" a JIT stack?
305.sp
306The owner of the stack is the user program, not the JIT studied pattern or
307anything else. The user program must ensure that if a stack is being used by
308\fBpcre2_match()\fP, (that is, it is assigned to a match context that is passed
309to the pattern currently running), that stack must not be used by any other
310threads (to avoid overwriting the same memory area). The best practice for
311multithreaded programs is to allocate a stack for each thread, and return this
312stack through the JIT callback function.
313.P
314(4) When should a JIT stack be freed?
315.sp
316You can free a JIT stack at any time, as long as it will not be used by
317\fBpcre2_match()\fP again. When you assign the stack to a match context, only a
318pointer is set. There is no reference counting or any other magic. You can free
319compiled patterns, contexts, and stacks in any order, anytime.
320Just \fIdo not\fP call \fBpcre2_match()\fP with a match context pointing to an
321already freed stack, as that will cause SEGFAULT. (Also, do not free a stack
322currently used by \fBpcre2_match()\fP in another thread). You can also replace
323the stack in a context at any time when it is not in use. You should free the
324previous stack before assigning a replacement.
325.P
326(5) Should I allocate/free a stack every time before/after calling
327\fBpcre2_match()\fP?
328.sp
329No, because this is too costly in terms of resources. However, you could
330implement some clever idea which release the stack if it is not used in let's
331say two minutes. The JIT callback can help to achieve this without keeping a
332list of patterns.
333.P
334(6) OK, the stack is for long term memory allocation. But what happens if a
335pattern causes stack overflow with a stack of 1MiB? Is that 1MiB kept until the
336stack is freed?
337.sp
338Especially on embedded sytems, it might be a good idea to release memory
339sometimes without freeing the stack. There is no API for this at the moment.
340Probably a function call which returns with the currently allocated memory for
341any stack and another which allows releasing memory (shrinking the stack) would
342be a good idea if someone needs this.
343.P
344(7) This is too much of a headache. Isn't there any better solution for JIT
345stack handling?
346.sp
347No, thanks to Windows. If POSIX threads were used everywhere, we could throw
348out this complicated API.
349.
350.
351.SH "FREEING JIT SPECULATIVE MEMORY"
352.rs
353.sp
354.nf
355.B void pcre2_jit_free_unused_memory(pcre2_general_context *\fIgcontext\fP);
356.fi
357.P
Elliott Hughes4e19c8e2022-04-15 15:11:02 -0700358The JIT executable allocator does not free all memory when it is possible. It
359expects new allocations, and keeps some free memory around to improve
Elliott Hughes5b808042021-10-01 10:56:10 -0700360allocation speed. However, in low memory conditions, it might be better to free
361all possible memory. You can cause this to happen by calling
362pcre2_jit_free_unused_memory(). Its argument is a general context, for custom
363memory management, or NULL for standard memory management.
364.
365.
366.SH "EXAMPLE CODE"
367.rs
368.sp
369This is a single-threaded example that specifies a JIT stack without using a
370callback. A real program should include error checking after all the function
371calls.
372.sp
373 int rc;
374 pcre2_code *re;
375 pcre2_match_data *match_data;
376 pcre2_match_context *mcontext;
377 pcre2_jit_stack *jit_stack;
378.sp
379 re = pcre2_compile(pattern, PCRE2_ZERO_TERMINATED, 0,
380 &errornumber, &erroffset, NULL);
381 rc = pcre2_jit_compile(re, PCRE2_JIT_COMPLETE);
382 mcontext = pcre2_match_context_create(NULL);
383 jit_stack = pcre2_jit_stack_create(32*1024, 512*1024, NULL);
384 pcre2_jit_stack_assign(mcontext, NULL, jit_stack);
385 match_data = pcre2_match_data_create(re, 10);
386 rc = pcre2_match(re, subject, length, 0, 0, match_data, mcontext);
387 /* Process result */
388.sp
389 pcre2_code_free(re);
390 pcre2_match_data_free(match_data);
391 pcre2_match_context_free(mcontext);
392 pcre2_jit_stack_free(jit_stack);
393.sp
394.
395.
396.SH "JIT FAST PATH API"
397.rs
398.sp
399Because the API described above falls back to interpreted matching when JIT is
400not available, it is convenient for programs that are written for general use
401in many environments. However, calling JIT via \fBpcre2_match()\fP does have a
402performance impact. Programs that are written for use where JIT is known to be
403available, and which need the best possible performance, can instead use a
404"fast path" API to call JIT matching directly instead of calling
405\fBpcre2_match()\fP (obviously only for patterns that have been successfully
406processed by \fBpcre2_jit_compile()\fP).
407.P
408The fast path function is called \fBpcre2_jit_match()\fP, and it takes exactly
409the same arguments as \fBpcre2_match()\fP. However, the subject string must be
410specified with a length; PCRE2_ZERO_TERMINATED is not supported. Unsupported
411option bits (for example, PCRE2_ANCHORED, PCRE2_ENDANCHORED and
412PCRE2_COPY_MATCHED_SUBJECT) are ignored, as is the PCRE2_NO_JIT option. The
413return values are also the same as for \fBpcre2_match()\fP, plus
414PCRE2_ERROR_JIT_BADOPTION if a matching mode (partial or complete) is requested
415that was not compiled.
416.P
417When you call \fBpcre2_match()\fP, as well as testing for invalid options, a
418number of other sanity checks are performed on the arguments. For example, if
Elliott Hughes4e19c8e2022-04-15 15:11:02 -0700419the subject pointer is NULL but the length is non-zero, an immediate error is
420given. Also, unless PCRE2_NO_UTF_CHECK is set, a UTF subject string is tested
421for validity. In the interests of speed, these checks do not happen on the JIT
422fast path, and if invalid data is passed, the result is undefined.
Elliott Hughes5b808042021-10-01 10:56:10 -0700423.P
424Bypassing the sanity checks and the \fBpcre2_match()\fP wrapping can give
425speedups of more than 10%.
426.
427.
428.SH "SEE ALSO"
429.rs
430.sp
431\fBpcre2api\fP(3)
432.
433.
434.SH AUTHOR
435.rs
436.sp
437.nf
438Philip Hazel (FAQ by Zoltan Herczeg)
439University Computing Service
440Cambridge, England.
441.fi
442.
443.
444.SH REVISION
445.rs
446.sp
447.nf
Elliott Hughes4e19c8e2022-04-15 15:11:02 -0700448Last updated: 30 November 2021
449Copyright (c) 1997-2021 University of Cambridge.
Elliott Hughes5b808042021-10-01 10:56:10 -0700450.fi