blob: 1a772fb7f9414be917b59bf9a0e0d461d31c1a71 [file] [log] [blame]
Georg Brandl116aa622007-08-15 14:28:22 +00001.. _profile:
2
3********************
4The Python Profilers
5********************
6
Raymond Hettinger469271d2011-01-27 20:38:46 +00007**Source code:** :source:`Lib/profile.py` and :source:`Lib/pstats.py`
8
9--------------
Georg Brandl116aa622007-08-15 14:28:22 +000010
Georg Brandl116aa622007-08-15 14:28:22 +000011.. _profiler-introduction:
12
13Introduction to the profilers
14=============================
15
16.. index::
17 single: deterministic profiling
18 single: profiling, deterministic
19
Ezio Melotti075d87c2013-04-12 15:42:06 +030020:mod:`cProfile` and :mod:`profile` provide :dfn:`deterministic profiling` of
21Python programs. A :dfn:`profile` is a set of statistics that describes how
22often and for how long various parts of the program executed. These statistics
23can be formatted into reports via the :mod:`pstats` module.
Georg Brandl116aa622007-08-15 14:28:22 +000024
Ezio Melotti075d87c2013-04-12 15:42:06 +030025The Python standard library provides two different implementations of the same
26profiling interface:
Georg Brandl116aa622007-08-15 14:28:22 +000027
Georg Brandl4eb65972010-10-14 06:41:42 +0000281. :mod:`cProfile` is recommended for most users; it's a C extension with
29 reasonable overhead that makes it suitable for profiling long-running
30 programs. Based on :mod:`lsprof`, contributed by Brett Rosen and Ted
31 Czotter.
Georg Brandl116aa622007-08-15 14:28:22 +000032
Georg Brandl4eb65972010-10-14 06:41:42 +0000332. :mod:`profile`, a pure Python module whose interface is imitated by
Ezio Melotti075d87c2013-04-12 15:42:06 +030034 :mod:`cProfile`, but which adds significant overhead to profiled programs.
35 If you're trying to extend the profiler in some way, the task might be easier
Guido van Rossum6eb740b2015-09-10 12:12:01 -070036 with this module. Originally designed and written by Jim Roskind.
Georg Brandl4eb65972010-10-14 06:41:42 +000037
38.. note::
39
40 The profiler modules are designed to provide an execution profile for a given
41 program, not for benchmarking purposes (for that, there is :mod:`timeit` for
Ezio Melottib5ff3e42011-04-03 16:20:21 +030042 reasonably accurate results). This particularly applies to benchmarking
Georg Brandl4eb65972010-10-14 06:41:42 +000043 Python code against C code: the profilers introduce overhead for Python code,
44 but not for C-level functions, and so the C code would seem faster than any
45 Python one.
Georg Brandl116aa622007-08-15 14:28:22 +000046
47
48.. _profile-instant:
49
50Instant User's Manual
51=====================
52
53This section is provided for users that "don't want to read the manual." It
54provides a very brief overview, and allows a user to rapidly perform profiling
55on an existing application.
56
Ezio Melotti075d87c2013-04-12 15:42:06 +030057To profile a function that takes a single argument, you can do::
Georg Brandl116aa622007-08-15 14:28:22 +000058
59 import cProfile
Ezio Melotti075d87c2013-04-12 15:42:06 +030060 import re
61 cProfile.run('re.compile("foo|bar")')
Georg Brandl116aa622007-08-15 14:28:22 +000062
63(Use :mod:`profile` instead of :mod:`cProfile` if the latter is not available on
64your system.)
65
Ezio Melotti075d87c2013-04-12 15:42:06 +030066The above action would run :func:`re.compile` and print profile results like
67the following::
68
69 197 function calls (192 primitive calls) in 0.002 seconds
70
71 Ordered by: standard name
72
73 ncalls tottime percall cumtime percall filename:lineno(function)
74 1 0.000 0.000 0.001 0.001 <string>:1(<module>)
75 1 0.000 0.000 0.001 0.001 re.py:212(compile)
76 1 0.000 0.000 0.001 0.001 re.py:268(_compile)
77 1 0.000 0.000 0.000 0.000 sre_compile.py:172(_compile_charset)
78 1 0.000 0.000 0.000 0.000 sre_compile.py:201(_optimize_charset)
79 4 0.000 0.000 0.000 0.000 sre_compile.py:25(_identityfunction)
80 3/1 0.000 0.000 0.000 0.000 sre_compile.py:33(_compile)
81
82The first line indicates that 197 calls were monitored. Of those calls, 192
83were :dfn:`primitive`, meaning that the call was not induced via recursion. The
84next line: ``Ordered by: standard name``, indicates that the text string in the
85far right column was used to sort the output. The column headings include:
86
87ncalls
INADA Naokibd3d8ba2017-03-22 16:56:36 +090088 for the number of calls.
Ezio Melotti075d87c2013-04-12 15:42:06 +030089
90tottime
INADA Naokibd3d8ba2017-03-22 16:56:36 +090091 for the total time spent in the given function (and excluding time made in
92 calls to sub-functions)
Ezio Melotti075d87c2013-04-12 15:42:06 +030093
94percall
95 is the quotient of ``tottime`` divided by ``ncalls``
96
97cumtime
98 is the cumulative time spent in this and all subfunctions (from invocation
99 till exit). This figure is accurate *even* for recursive functions.
100
101percall
102 is the quotient of ``cumtime`` divided by primitive calls
103
104filename:lineno(function)
105 provides the respective data of each function
106
107When there are two numbers in the first column (for example ``3/1``), it means
108that the function recursed. The second value is the number of primitive calls
109and the former is the total number of calls. Note that when the function does
110not recurse, these two values are the same, and only the single figure is
111printed.
112
113Instead of printing the output at the end of the profile run, you can save the
114results to a file by specifying a filename to the :func:`run` function::
Georg Brandl116aa622007-08-15 14:28:22 +0000115
116 import cProfile
Ezio Melotti075d87c2013-04-12 15:42:06 +0300117 import re
118 cProfile.run('re.compile("foo|bar")', 'restats')
Georg Brandl116aa622007-08-15 14:28:22 +0000119
Ezio Melotti075d87c2013-04-12 15:42:06 +0300120The :class:`pstats.Stats` class reads profile results from a file and formats
121them in various ways.
122
123The file :mod:`cProfile` can also be invoked as a script to profile another
Georg Brandl116aa622007-08-15 14:28:22 +0000124script. For example::
125
Sanyam Khurana7973e272017-11-08 16:20:56 +0530126 python -m cProfile [-o output_file] [-s sort_order] (-m module | myscript.py)
Georg Brandl116aa622007-08-15 14:28:22 +0000127
Ezio Melotti075d87c2013-04-12 15:42:06 +0300128``-o`` writes the profile results to a file instead of to stdout
Georg Brandl116aa622007-08-15 14:28:22 +0000129
Ezio Melotti075d87c2013-04-12 15:42:06 +0300130``-s`` specifies one of the :func:`~pstats.Stats.sort_stats` sort values to sort
131the output by. This only applies when ``-o`` is not supplied.
Georg Brandl116aa622007-08-15 14:28:22 +0000132
Sanyam Khurana7973e272017-11-08 16:20:56 +0530133``-m`` specifies that a module is being profiled instead of a script.
134
135 .. versionadded:: 3.7
136 Added the ``-m`` option.
137
Ezio Melotti075d87c2013-04-12 15:42:06 +0300138The :mod:`pstats` module's :class:`~pstats.Stats` class has a variety of methods
139for manipulating and printing the data saved into a profile results file::
Georg Brandl116aa622007-08-15 14:28:22 +0000140
141 import pstats
mwidjaja863b1e42018-01-25 20:49:56 -0800142 from pstats import SortKey
Ezio Melotti075d87c2013-04-12 15:42:06 +0300143 p = pstats.Stats('restats')
Georg Brandl116aa622007-08-15 14:28:22 +0000144 p.strip_dirs().sort_stats(-1).print_stats()
145
Ezio Melotti075d87c2013-04-12 15:42:06 +0300146The :meth:`~pstats.Stats.strip_dirs` method removed the extraneous path from all
147the module names. The :meth:`~pstats.Stats.sort_stats` method sorted all the
148entries according to the standard module/line/name string that is printed. The
149:meth:`~pstats.Stats.print_stats` method printed out all the statistics. You
150might try the following sort calls::
Georg Brandl116aa622007-08-15 14:28:22 +0000151
mwidjaja863b1e42018-01-25 20:49:56 -0800152 p.sort_stats(SortKey.NAME)
Georg Brandl116aa622007-08-15 14:28:22 +0000153 p.print_stats()
154
155The first call will actually sort the list by function name, and the second call
156will print out the statistics. The following are some interesting calls to
157experiment with::
158
mwidjaja863b1e42018-01-25 20:49:56 -0800159 p.sort_stats(SortKey.CUMULATIVE).print_stats(10)
Georg Brandl116aa622007-08-15 14:28:22 +0000160
161This sorts the profile by cumulative time in a function, and then only prints
162the ten most significant lines. If you want to understand what algorithms are
163taking time, the above line is what you would use.
164
165If you were looking to see what functions were looping a lot, and taking a lot
166of time, you would do::
167
mwidjaja863b1e42018-01-25 20:49:56 -0800168 p.sort_stats(SortKey.TIME).print_stats(10)
Georg Brandl116aa622007-08-15 14:28:22 +0000169
170to sort according to time spent within each function, and then print the
171statistics for the top ten functions.
172
173You might also try::
174
mwidjaja863b1e42018-01-25 20:49:56 -0800175 p.sort_stats(SortKey.FILENAME).print_stats('__init__')
Georg Brandl116aa622007-08-15 14:28:22 +0000176
177This will sort all the statistics by file name, and then print out statistics
178for only the class init methods (since they are spelled with ``__init__`` in
179them). As one final example, you could try::
180
mwidjaja863b1e42018-01-25 20:49:56 -0800181 p.sort_stats(SortKey.TIME, SortKey.CUMULATIVE).print_stats(.5, 'init')
Georg Brandl116aa622007-08-15 14:28:22 +0000182
183This line sorts statistics with a primary key of time, and a secondary key of
184cumulative time, and then prints out some of the statistics. To be specific, the
185list is first culled down to 50% (re: ``.5``) of its original size, then only
186lines containing ``init`` are maintained, and that sub-sub-list is printed.
187
188If you wondered what functions called the above functions, you could now (``p``
189is still sorted according to the last criteria) do::
190
191 p.print_callers(.5, 'init')
192
193and you would get a list of callers for each of the listed functions.
194
195If you want more functionality, you're going to have to read the manual, or
196guess what the following functions do::
197
198 p.print_callees()
Ezio Melotti075d87c2013-04-12 15:42:06 +0300199 p.add('restats')
Georg Brandl116aa622007-08-15 14:28:22 +0000200
201Invoked as a script, the :mod:`pstats` module is a statistics browser for
202reading and examining profile dumps. It has a simple line-oriented interface
203(implemented using :mod:`cmd`) and interactive help.
204
Ezio Melotti075d87c2013-04-12 15:42:06 +0300205:mod:`profile` and :mod:`cProfile` Module Reference
206=======================================================
207
208.. module:: cProfile
209.. module:: profile
210 :synopsis: Python source profiler.
211
212Both the :mod:`profile` and :mod:`cProfile` modules provide the following
213functions:
214
215.. function:: run(command, filename=None, sort=-1)
216
217 This function takes a single argument that can be passed to the :func:`exec`
218 function, and an optional file name. In all cases this routine executes::
219
220 exec(command, __main__.__dict__, __main__.__dict__)
221
222 and gathers profiling statistics from the execution. If no file name is
223 present, then this function automatically creates a :class:`~pstats.Stats`
csabella99776292017-05-14 03:02:38 -0400224 instance and prints a simple profiling report. If the sort value is specified,
Ezio Melotti075d87c2013-04-12 15:42:06 +0300225 it is passed to this :class:`~pstats.Stats` instance to control how the
226 results are sorted.
227
csabella99776292017-05-14 03:02:38 -0400228.. function:: runctx(command, globals, locals, filename=None, sort=-1)
Ezio Melotti075d87c2013-04-12 15:42:06 +0300229
230 This function is similar to :func:`run`, with added arguments to supply the
231 globals and locals dictionaries for the *command* string. This routine
232 executes::
233
234 exec(command, globals, locals)
235
236 and gathers profiling statistics as in the :func:`run` function above.
237
238.. class:: Profile(timer=None, timeunit=0.0, subcalls=True, builtins=True)
239
240 This class is normally only used if more precise control over profiling is
241 needed than what the :func:`cProfile.run` function provides.
242
243 A custom timer can be supplied for measuring how long code takes to run via
244 the *timer* argument. This must be a function that returns a single number
245 representing the current time. If the number is an integer, the *timeunit*
246 specifies a multiplier that specifies the duration of each unit of time. For
247 example, if the timer returns times measured in thousands of seconds, the
248 time unit would be ``.001``.
249
250 Directly using the :class:`Profile` class allows formatting profile results
251 without writing the profile data to a file::
252
253 import cProfile, pstats, io
mwidjaja863b1e42018-01-25 20:49:56 -0800254 from pstats import SortKey
Ezio Melotti075d87c2013-04-12 15:42:06 +0300255 pr = cProfile.Profile()
256 pr.enable()
Senthil Kumaran21101f72013-09-07 17:51:58 -0700257 # ... do something ...
Ezio Melotti075d87c2013-04-12 15:42:06 +0300258 pr.disable()
259 s = io.StringIO()
mwidjaja863b1e42018-01-25 20:49:56 -0800260 sortby = SortKey.CUMULATIVE
Senthil Kumaran21101f72013-09-07 17:51:58 -0700261 ps = pstats.Stats(pr, stream=s).sort_stats(sortby)
262 ps.print_stats()
263 print(s.getvalue())
Ezio Melotti075d87c2013-04-12 15:42:06 +0300264
Scott Sanderson2e01b752018-06-01 16:36:23 -0400265 The :class:`Profile` class can also be used as a context manager (see
266 :ref:`typecontextmanager`)::
267
268 import cProfile
269
270 with cProfile.Profile() as pr:
271 # ... do something ...
272
273 pr.print_stats()
274
Scott Sandersoncebe80b2018-06-07 05:46:42 -0400275 .. versionchanged:: 3.8
276 Added context manager support.
277
Ezio Melotti075d87c2013-04-12 15:42:06 +0300278 .. method:: enable()
279
280 Start collecting profiling data.
281
282 .. method:: disable()
283
284 Stop collecting profiling data.
285
286 .. method:: create_stats()
287
288 Stop collecting profiling data and record the results internally
289 as the current profile.
290
291 .. method:: print_stats(sort=-1)
292
293 Create a :class:`~pstats.Stats` object based on the current
294 profile and print the results to stdout.
295
296 .. method:: dump_stats(filename)
297
298 Write the results of the current profile to *filename*.
299
300 .. method:: run(cmd)
301
302 Profile the cmd via :func:`exec`.
303
304 .. method:: runctx(cmd, globals, locals)
305
306 Profile the cmd via :func:`exec` with the specified global and
307 local environment.
308
309 .. method:: runcall(func, *args, **kwargs)
310
311 Profile ``func(*args, **kwargs)``
312
Andrés Delfino937fb552018-07-28 07:01:24 -0300313Note that profiling will only work if the called command/function actually
314returns. If the interpreter is terminated (e.g. via a :func:`sys.exit` call
315during the called command/function execution) no profiling results will be
316printed.
317
Ezio Melotti075d87c2013-04-12 15:42:06 +0300318.. _profile-stats:
319
320The :class:`Stats` Class
321========================
322
323Analysis of the profiler data is done using the :class:`~pstats.Stats` class.
324
325.. module:: pstats
326 :synopsis: Statistics object for use with the profiler.
327
328.. class:: Stats(*filenames or profile, stream=sys.stdout)
329
330 This class constructor creates an instance of a "statistics object" from a
331 *filename* (or list of filenames) or from a :class:`Profile` instance. Output
332 will be printed to the stream specified by *stream*.
333
334 The file selected by the above constructor must have been created by the
335 corresponding version of :mod:`profile` or :mod:`cProfile`. To be specific,
336 there is *no* file compatibility guaranteed with future versions of this
337 profiler, and there is no compatibility with files produced by other
Tobias Kunzef7745e12018-06-04 12:07:16 +0200338 profilers, or the same profiler run on a different operating system. If
339 several files are provided, all the statistics for identical functions will
340 be coalesced, so that an overall view of several processes can be considered
341 in a single report. If additional files need to be combined with data in an
342 existing :class:`~pstats.Stats` object, the :meth:`~pstats.Stats.add` method
343 can be used.
Ezio Melotti075d87c2013-04-12 15:42:06 +0300344
345 Instead of reading the profile data from a file, a :class:`cProfile.Profile`
346 or :class:`profile.Profile` object can be used as the profile data source.
347
348 :class:`Stats` objects have the following methods:
349
350 .. method:: strip_dirs()
351
352 This method for the :class:`Stats` class removes all leading path
353 information from file names. It is very useful in reducing the size of
354 the printout to fit within (close to) 80 columns. This method modifies
355 the object, and the stripped information is lost. After performing a
356 strip operation, the object is considered to have its entries in a
357 "random" order, as it was just after object initialization and loading.
358 If :meth:`~pstats.Stats.strip_dirs` causes two function names to be
359 indistinguishable (they are on the same line of the same filename, and
360 have the same function name), then the statistics for these two entries
361 are accumulated into a single entry.
362
363
364 .. method:: add(*filenames)
365
366 This method of the :class:`Stats` class accumulates additional profiling
367 information into the current profiling object. Its arguments should refer
368 to filenames created by the corresponding version of :func:`profile.run`
369 or :func:`cProfile.run`. Statistics for identically named (re: file, line,
370 name) functions are automatically accumulated into single function
371 statistics.
372
373
374 .. method:: dump_stats(filename)
375
376 Save the data loaded into the :class:`Stats` object to a file named
377 *filename*. The file is created if it does not exist, and is overwritten
378 if it already exists. This is equivalent to the method of the same name
379 on the :class:`profile.Profile` and :class:`cProfile.Profile` classes.
380
381
382 .. method:: sort_stats(*keys)
383
384 This method modifies the :class:`Stats` object by sorting it according to
mwidjaja863b1e42018-01-25 20:49:56 -0800385 the supplied criteria. The argument can be either a string or a SortKey
386 enum identifying the basis of a sort (example: ``'time'``, ``'name'``,
387 ``SortKey.TIME`` or ``SortKey.NAME``). The SortKey enums argument have
388 advantage over the string argument in that it is more robust and less
389 error prone.
Ezio Melotti075d87c2013-04-12 15:42:06 +0300390
391 When more than one key is provided, then additional keys are used as
392 secondary criteria when there is equality in all keys selected before
mwidjaja863b1e42018-01-25 20:49:56 -0800393 them. For example, ``sort_stats(SortKey.NAME, SortKey.FILE)`` will sort
394 all the entries according to their function name, and resolve all ties
395 (identical function names) by sorting by file name.
Ezio Melotti075d87c2013-04-12 15:42:06 +0300396
mwidjaja863b1e42018-01-25 20:49:56 -0800397 For the string argument, abbreviations can be used for any key names, as
398 long as the abbreviation is unambiguous.
Ezio Melotti075d87c2013-04-12 15:42:06 +0300399
mwidjaja863b1e42018-01-25 20:49:56 -0800400 The following are the valid string and SortKey:
401
402 +------------------+---------------------+----------------------+
403 | Valid String Arg | Valid enum Arg | Meaning |
404 +==================+=====================+======================+
405 | ``'calls'`` | SortKey.CALLS | call count |
406 +------------------+---------------------+----------------------+
407 | ``'cumulative'`` | SortKey.CUMULATIVE | cumulative time |
408 +------------------+---------------------+----------------------+
409 | ``'cumtime'`` | N/A | cumulative time |
410 +------------------+---------------------+----------------------+
411 | ``'file'`` | N/A | file name |
412 +------------------+---------------------+----------------------+
413 | ``'filename'`` | SortKey.FILENAME | file name |
414 +------------------+---------------------+----------------------+
415 | ``'module'`` | N/A | file name |
416 +------------------+---------------------+----------------------+
417 | ``'ncalls'`` | N/A | call count |
418 +------------------+---------------------+----------------------+
419 | ``'pcalls'`` | SortKey.PCALLS | primitive call count |
420 +------------------+---------------------+----------------------+
421 | ``'line'`` | SortKey.LINE | line number |
422 +------------------+---------------------+----------------------+
423 | ``'name'`` | SortKey.NAME | function name |
424 +------------------+---------------------+----------------------+
425 | ``'nfl'`` | SortKey.NFL | name/file/line |
426 +------------------+---------------------+----------------------+
427 | ``'stdname'`` | SortKey.STDNAME | standard name |
428 +------------------+---------------------+----------------------+
429 | ``'time'`` | SortKey.TIME | internal time |
430 +------------------+---------------------+----------------------+
431 | ``'tottime'`` | N/A | internal time |
432 +------------------+---------------------+----------------------+
Ezio Melotti075d87c2013-04-12 15:42:06 +0300433
434 Note that all sorts on statistics are in descending order (placing most
435 time consuming items first), where as name, file, and line number searches
436 are in ascending order (alphabetical). The subtle distinction between
mwidjaja863b1e42018-01-25 20:49:56 -0800437 ``SortKey.NFL`` and ``SortKey.STDNAME`` is that the standard name is a
438 sort of the name as printed, which means that the embedded line numbers
439 get compared in an odd way. For example, lines 3, 20, and 40 would (if
440 the file names were the same) appear in the string order 20, 3 and 40.
441 In contrast, ``SortKey.NFL`` does a numeric compare of the line numbers.
442 In fact, ``sort_stats(SortKey.NFL)`` is the same as
443 ``sort_stats(SortKey.NAME, SortKey.FILENAME, SortKey.LINE)``.
Ezio Melotti075d87c2013-04-12 15:42:06 +0300444
445 For backward-compatibility reasons, the numeric arguments ``-1``, ``0``,
446 ``1``, and ``2`` are permitted. They are interpreted as ``'stdname'``,
447 ``'calls'``, ``'time'``, and ``'cumulative'`` respectively. If this old
448 style format (numeric) is used, only one sort key (the numeric key) will
449 be used, and additional arguments will be silently ignored.
450
451 .. For compatibility with the old profiler.
452
mwidjaja863b1e42018-01-25 20:49:56 -0800453 .. versionadded:: 3.7
454 Added the SortKey enum.
Ezio Melotti075d87c2013-04-12 15:42:06 +0300455
456 .. method:: reverse_order()
457
458 This method for the :class:`Stats` class reverses the ordering of the
459 basic list within the object. Note that by default ascending vs
460 descending order is properly selected based on the sort key of choice.
461
462 .. This method is provided primarily for compatibility with the old
463 profiler.
464
465
466 .. method:: print_stats(*restrictions)
467
468 This method for the :class:`Stats` class prints out a report as described
469 in the :func:`profile.run` definition.
470
471 The order of the printing is based on the last
472 :meth:`~pstats.Stats.sort_stats` operation done on the object (subject to
473 caveats in :meth:`~pstats.Stats.add` and
474 :meth:`~pstats.Stats.strip_dirs`).
475
476 The arguments provided (if any) can be used to limit the list down to the
477 significant entries. Initially, the list is taken to be the complete set
478 of profiled functions. Each restriction is either an integer (to select a
479 count of lines), or a decimal fraction between 0.0 and 1.0 inclusive (to
Matthias Bussonnier8fb1f6e2017-02-20 21:30:00 -0800480 select a percentage of lines), or a string that will interpreted as a
481 regular expression (to pattern match the standard name that is printed).
482 If several restrictions are provided, then they are applied sequentially.
483 For example::
Ezio Melotti075d87c2013-04-12 15:42:06 +0300484
485 print_stats(.1, 'foo:')
486
487 would first limit the printing to first 10% of list, and then only print
488 functions that were part of filename :file:`.\*foo:`. In contrast, the
489 command::
490
491 print_stats('foo:', .1)
492
493 would limit the list to all functions having file names :file:`.\*foo:`,
494 and then proceed to only print the first 10% of them.
495
496
497 .. method:: print_callers(*restrictions)
498
499 This method for the :class:`Stats` class prints a list of all functions
500 that called each function in the profiled database. The ordering is
501 identical to that provided by :meth:`~pstats.Stats.print_stats`, and the
502 definition of the restricting argument is also identical. Each caller is
503 reported on its own line. The format differs slightly depending on the
504 profiler that produced the stats:
505
506 * With :mod:`profile`, a number is shown in parentheses after each caller
507 to show how many times this specific call was made. For convenience, a
508 second non-parenthesized number repeats the cumulative time spent in the
509 function at the right.
510
511 * With :mod:`cProfile`, each caller is preceded by three numbers: the
512 number of times this specific call was made, and the total and
513 cumulative times spent in the current function while it was invoked by
514 this specific caller.
515
516
517 .. method:: print_callees(*restrictions)
518
519 This method for the :class:`Stats` class prints a list of all function
520 that were called by the indicated function. Aside from this reversal of
521 direction of calls (re: called vs was called by), the arguments and
522 ordering are identical to the :meth:`~pstats.Stats.print_callers` method.
523
Georg Brandl116aa622007-08-15 14:28:22 +0000524
525.. _deterministic-profiling:
526
527What Is Deterministic Profiling?
528================================
529
530:dfn:`Deterministic profiling` is meant to reflect the fact that all *function
531call*, *function return*, and *exception* events are monitored, and precise
532timings are made for the intervals between these events (during which time the
533user's code is executing). In contrast, :dfn:`statistical profiling` (which is
534not done by this module) randomly samples the effective instruction pointer, and
535deduces where time is being spent. The latter technique traditionally involves
536less overhead (as the code does not need to be instrumented), but provides only
537relative indications of where time is being spent.
538
539In Python, since there is an interpreter active during execution, the presence
540of instrumented code is not required to do deterministic profiling. Python
541automatically provides a :dfn:`hook` (optional callback) for each event. In
542addition, the interpreted nature of Python tends to add so much overhead to
543execution, that deterministic profiling tends to only add small processing
544overhead in typical applications. The result is that deterministic profiling is
545not that expensive, yet provides extensive run time statistics about the
546execution of a Python program.
547
548Call count statistics can be used to identify bugs in code (surprising counts),
549and to identify possible inline-expansion points (high call counts). Internal
550time statistics can be used to identify "hot loops" that should be carefully
551optimized. Cumulative time statistics should be used to identify high level
552errors in the selection of algorithms. Note that the unusual handling of
553cumulative times in this profiler allows statistics for recursive
554implementations of algorithms to be directly compared to iterative
555implementations.
556
557
Ezio Melotti075d87c2013-04-12 15:42:06 +0300558.. _profile-limitations:
Georg Brandl116aa622007-08-15 14:28:22 +0000559
560Limitations
561===========
562
563One limitation has to do with accuracy of timing information. There is a
564fundamental problem with deterministic profilers involving accuracy. The most
565obvious restriction is that the underlying "clock" is only ticking at a rate
566(typically) of about .001 seconds. Hence no measurements will be more accurate
567than the underlying clock. If enough measurements are taken, then the "error"
568will tend to average out. Unfortunately, removing this first error induces a
569second source of error.
570
571The second problem is that it "takes a while" from when an event is dispatched
572until the profiler's call to get the time actually *gets* the state of the
573clock. Similarly, there is a certain lag when exiting the profiler event
574handler from the time that the clock's value was obtained (and then squirreled
575away), until the user's code is once again executing. As a result, functions
576that are called many times, or call many functions, will typically accumulate
577this error. The error that accumulates in this fashion is typically less than
578the accuracy of the clock (less than one clock tick), but it *can* accumulate
579and become very significant.
580
581The problem is more important with :mod:`profile` than with the lower-overhead
582:mod:`cProfile`. For this reason, :mod:`profile` provides a means of
583calibrating itself for a given platform so that this error can be
584probabilistically (on the average) removed. After the profiler is calibrated, it
585will be more accurate (in a least square sense), but it will sometimes produce
586negative numbers (when call counts are exceptionally low, and the gods of
587probability work against you :-). ) Do *not* be alarmed by negative numbers in
588the profile. They should *only* appear if you have calibrated your profiler,
589and the results are actually better than without calibration.
590
591
592.. _profile-calibration:
593
594Calibration
595===========
596
597The profiler of the :mod:`profile` module subtracts a constant from each event
598handling time to compensate for the overhead of calling the time function, and
599socking away the results. By default, the constant is 0. The following
600procedure can be used to obtain a better constant for a given platform (see
Ezio Melotti075d87c2013-04-12 15:42:06 +0300601:ref:`profile-limitations`). ::
Georg Brandl116aa622007-08-15 14:28:22 +0000602
603 import profile
604 pr = profile.Profile()
605 for i in range(5):
Georg Brandl6911e3c2007-09-04 07:15:32 +0000606 print(pr.calibrate(10000))
Georg Brandl116aa622007-08-15 14:28:22 +0000607
608The method executes the number of Python calls given by the argument, directly
609and again under the profiler, measuring the time for both. It then computes the
610hidden overhead per profiler event, and returns that as a float. For example,
Victor Stinner884d13a2017-10-17 14:46:45 -0700611on a 1.8Ghz Intel Core i5 running Mac OS X, and using Python's time.process_time() as
Ezio Melotti075d87c2013-04-12 15:42:06 +0300612the timer, the magical number is about 4.04e-6.
Georg Brandl116aa622007-08-15 14:28:22 +0000613
614The object of this exercise is to get a fairly consistent result. If your
615computer is *very* fast, or your timer function has poor resolution, you might
616have to pass 100000, or even 1000000, to get consistent results.
617
Georg Brandle6bcc912008-05-12 18:05:20 +0000618When you have a consistent answer, there are three ways you can use it::
Georg Brandl116aa622007-08-15 14:28:22 +0000619
620 import profile
621
622 # 1. Apply computed bias to all Profile instances created hereafter.
623 profile.Profile.bias = your_computed_bias
624
625 # 2. Apply computed bias to a specific Profile instance.
626 pr = profile.Profile()
627 pr.bias = your_computed_bias
628
629 # 3. Specify computed bias in instance constructor.
630 pr = profile.Profile(bias=your_computed_bias)
631
632If you have a choice, you are better off choosing a smaller constant, and then
633your results will "less often" show up as negative in profile statistics.
634
Ezio Melotti075d87c2013-04-12 15:42:06 +0300635.. _profile-timers:
Georg Brandl116aa622007-08-15 14:28:22 +0000636
Georg Brandlc2b17b22013-10-06 09:17:43 +0200637Using a custom timer
638====================
Georg Brandl116aa622007-08-15 14:28:22 +0000639
Ezio Melotti075d87c2013-04-12 15:42:06 +0300640If you want to change how current time is determined (for example, to force use
641of wall-clock time or elapsed process time), pass the timing function you want
642to the :class:`Profile` class constructor::
Georg Brandl116aa622007-08-15 14:28:22 +0000643
Ezio Melotti075d87c2013-04-12 15:42:06 +0300644 pr = profile.Profile(your_time_func)
Georg Brandl116aa622007-08-15 14:28:22 +0000645
Ezio Melotti075d87c2013-04-12 15:42:06 +0300646The resulting profiler will then call ``your_time_func``. Depending on whether
647you are using :class:`profile.Profile` or :class:`cProfile.Profile`,
648``your_time_func``'s return value will be interpreted differently:
Georg Brandl116aa622007-08-15 14:28:22 +0000649
650:class:`profile.Profile`
Ezio Melotti075d87c2013-04-12 15:42:06 +0300651 ``your_time_func`` should return a single number, or a list of numbers whose
652 sum is the current time (like what :func:`os.times` returns). If the
653 function returns a single time number, or the list of returned numbers has
654 length 2, then you will get an especially fast version of the dispatch
655 routine.
Georg Brandl116aa622007-08-15 14:28:22 +0000656
Ezio Melotti075d87c2013-04-12 15:42:06 +0300657 Be warned that you should calibrate the profiler class for the timer function
658 that you choose (see :ref:`profile-calibration`). For most machines, a timer
659 that returns a lone integer value will provide the best results in terms of
660 low overhead during profiling. (:func:`os.times` is *pretty* bad, as it
661 returns a tuple of floating point values). If you want to substitute a
662 better timer in the cleanest fashion, derive a class and hardwire a
663 replacement dispatch method that best handles your timer call, along with the
664 appropriate calibration constant.
Georg Brandl116aa622007-08-15 14:28:22 +0000665
666:class:`cProfile.Profile`
Ezio Melotti075d87c2013-04-12 15:42:06 +0300667 ``your_time_func`` should return a single number. If it returns integers,
668 you can also invoke the class constructor with a second argument specifying
669 the real duration of one unit of time. For example, if
670 ``your_integer_time_func`` returns times measured in thousands of seconds,
671 you would construct the :class:`Profile` instance as follows::
Georg Brandl116aa622007-08-15 14:28:22 +0000672
Ezio Melotti075d87c2013-04-12 15:42:06 +0300673 pr = cProfile.Profile(your_integer_time_func, 0.001)
Georg Brandl116aa622007-08-15 14:28:22 +0000674
Serhiy Storchakab19542d2015-03-14 21:32:57 +0200675 As the :class:`cProfile.Profile` class cannot be calibrated, custom timer
Ezio Melotti075d87c2013-04-12 15:42:06 +0300676 functions should be used with care and should be as fast as possible. For
677 the best results with a custom timer, it might be necessary to hard-code it
678 in the C source of the internal :mod:`_lsprof` module.
679
680Python 3.3 adds several new functions in :mod:`time` that can be used to make
681precise measurements of process or wall-clock time. For example, see
682:func:`time.perf_counter`.