blob: c3632f97cdf0c870a5182bb8b1b31146ae1f403b [file] [log] [blame]
Georg Brandl116aa622007-08-15 14:28:22 +00001.. _profile:
2
3********************
4The Python Profilers
5********************
6
7.. sectionauthor:: James Roskind
8
Benjamin Petersona0dfa822009-11-13 02:25:08 +00009.. module:: profile
10 :synopsis: Python source profiler.
Georg Brandl116aa622007-08-15 14:28:22 +000011
12.. index:: single: InfoSeek Corporation
13
14Copyright © 1994, by InfoSeek Corporation, all rights reserved.
15
16Written by James Roskind. [#]_
17
18Permission to use, copy, modify, and distribute this Python software and its
19associated documentation for any purpose (subject to the restriction in the
20following sentence) without fee is hereby granted, provided that the above
21copyright notice appears in all copies, and that both that copyright notice and
22this permission notice appear in supporting documentation, and that the name of
23InfoSeek not be used in advertising or publicity pertaining to distribution of
24the software without specific, written prior permission. This permission is
25explicitly restricted to the copying and modification of the software to remain
26in Python, compiled Python, or other languages (such as C) wherein the modified
27or derived code is exclusively imported into a Python module.
28
29INFOSEEK CORPORATION DISCLAIMS ALL WARRANTIES WITH REGARD TO THIS SOFTWARE,
30INCLUDING ALL IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS. IN NO EVENT
31SHALL INFOSEEK CORPORATION BE LIABLE FOR ANY SPECIAL, INDIRECT OR CONSEQUENTIAL
32DAMAGES OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS,
33WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION, ARISING
34OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS SOFTWARE.
35
Georg Brandl116aa622007-08-15 14:28:22 +000036.. _profiler-introduction:
37
38Introduction to the profilers
39=============================
40
41.. index::
42 single: deterministic profiling
43 single: profiling, deterministic
44
Christian Heimesdae2a892008-04-19 00:55:37 +000045A :dfn:`profiler` is a program that describes the run time performance
46of a program, providing a variety of statistics. This documentation
47describes the profiler functionality provided in the modules
48:mod:`cProfile`, :mod:`profile` and :mod:`pstats`. This profiler
49provides :dfn:`deterministic profiling` of Python programs. It also
50provides a series of report generation tools to allow users to rapidly
Georg Brandl116aa622007-08-15 14:28:22 +000051examine the results of a profile operation.
52
Fred Drake0e474a82007-10-11 18:01:43 +000053The Python standard library provides two different profilers:
Georg Brandl116aa622007-08-15 14:28:22 +000054
Georg Brandl48310cd2009-01-03 21:18:54 +000055#. :mod:`cProfile` is recommended for most users; it's a C extension
Christian Heimesdae2a892008-04-19 00:55:37 +000056 with reasonable overhead
Georg Brandl48310cd2009-01-03 21:18:54 +000057 that makes it suitable for profiling long-running programs.
Christian Heimesdae2a892008-04-19 00:55:37 +000058 Based on :mod:`lsprof`,
Georg Brandl48310cd2009-01-03 21:18:54 +000059 contributed by Brett Rosen and Ted Czotter.
Georg Brandl116aa622007-08-15 14:28:22 +000060
Christian Heimesdae2a892008-04-19 00:55:37 +000061#. :mod:`profile`, a pure Python module whose interface is imitated by
Georg Brandl48310cd2009-01-03 21:18:54 +000062 :mod:`cProfile`. Adds significant overhead to profiled programs.
63 If you're trying to extend
Christian Heimesdae2a892008-04-19 00:55:37 +000064 the profiler in some way, the task might be easier with this module.
65 Copyright © 1994, by InfoSeek Corporation.
Georg Brandl116aa622007-08-15 14:28:22 +000066
Georg Brandl116aa622007-08-15 14:28:22 +000067The :mod:`profile` and :mod:`cProfile` modules export the same interface, so
Christian Heimesdae2a892008-04-19 00:55:37 +000068they are mostly interchangeable; :mod:`cProfile` has a much lower overhead but
69is newer and might not be available on all systems.
Georg Brandl116aa622007-08-15 14:28:22 +000070:mod:`cProfile` is really a compatibility layer on top of the internal
Fred Drake0e474a82007-10-11 18:01:43 +000071:mod:`_lsprof` module.
Georg Brandl116aa622007-08-15 14:28:22 +000072
73
74.. _profile-instant:
75
76Instant User's Manual
77=====================
78
79This section is provided for users that "don't want to read the manual." It
80provides a very brief overview, and allows a user to rapidly perform profiling
81on an existing application.
82
83To profile an application with a main entry point of :func:`foo`, you would add
84the following to your module::
85
86 import cProfile
87 cProfile.run('foo()')
88
89(Use :mod:`profile` instead of :mod:`cProfile` if the latter is not available on
90your system.)
91
92The above action would cause :func:`foo` to be run, and a series of informative
93lines (the profile) to be printed. The above approach is most useful when
94working with the interpreter. If you would like to save the results of a
95profile into a file for later examination, you can supply a file name as the
96second argument to the :func:`run` function::
97
98 import cProfile
99 cProfile.run('foo()', 'fooprof')
100
101The file :file:`cProfile.py` can also be invoked as a script to profile another
102script. For example::
103
104 python -m cProfile myscript.py
105
106:file:`cProfile.py` accepts two optional arguments on the command line::
107
108 cProfile.py [-o output_file] [-s sort_order]
109
Benjamin Peterson5e55b3e2010-02-03 02:35:45 +0000110``-s`` only applies to standard output (``-o`` is not supplied).
Georg Brandl116aa622007-08-15 14:28:22 +0000111Look in the :class:`Stats` documentation for valid sort values.
112
113When you wish to review the profile, you should use the methods in the
114:mod:`pstats` module. Typically you would load the statistics data as follows::
115
116 import pstats
117 p = pstats.Stats('fooprof')
118
119The class :class:`Stats` (the above code just created an instance of this class)
120has a variety of methods for manipulating and printing the data that was just
121read into ``p``. When you ran :func:`cProfile.run` above, what was printed was
122the result of three method calls::
123
124 p.strip_dirs().sort_stats(-1).print_stats()
125
126The first method removed the extraneous path from all the module names. The
127second method sorted all the entries according to the standard module/line/name
128string that is printed. The third method printed out all the statistics. You
129might try the following sort calls:
130
Christian Heimes5b5e81c2007-12-31 16:14:33 +0000131.. (this is to comply with the semantics of the old profiler).
Georg Brandl116aa622007-08-15 14:28:22 +0000132
133::
134
135 p.sort_stats('name')
136 p.print_stats()
137
138The first call will actually sort the list by function name, and the second call
139will print out the statistics. The following are some interesting calls to
140experiment with::
141
142 p.sort_stats('cumulative').print_stats(10)
143
144This sorts the profile by cumulative time in a function, and then only prints
145the ten most significant lines. If you want to understand what algorithms are
146taking time, the above line is what you would use.
147
148If you were looking to see what functions were looping a lot, and taking a lot
149of time, you would do::
150
151 p.sort_stats('time').print_stats(10)
152
153to sort according to time spent within each function, and then print the
154statistics for the top ten functions.
155
156You might also try::
157
158 p.sort_stats('file').print_stats('__init__')
159
160This will sort all the statistics by file name, and then print out statistics
161for only the class init methods (since they are spelled with ``__init__`` in
162them). As one final example, you could try::
163
164 p.sort_stats('time', 'cum').print_stats(.5, 'init')
165
166This line sorts statistics with a primary key of time, and a secondary key of
167cumulative time, and then prints out some of the statistics. To be specific, the
168list is first culled down to 50% (re: ``.5``) of its original size, then only
169lines containing ``init`` are maintained, and that sub-sub-list is printed.
170
171If you wondered what functions called the above functions, you could now (``p``
172is still sorted according to the last criteria) do::
173
174 p.print_callers(.5, 'init')
175
176and you would get a list of callers for each of the listed functions.
177
178If you want more functionality, you're going to have to read the manual, or
179guess what the following functions do::
180
181 p.print_callees()
182 p.add('fooprof')
183
184Invoked as a script, the :mod:`pstats` module is a statistics browser for
185reading and examining profile dumps. It has a simple line-oriented interface
186(implemented using :mod:`cmd`) and interactive help.
187
188
189.. _deterministic-profiling:
190
191What Is Deterministic Profiling?
192================================
193
194:dfn:`Deterministic profiling` is meant to reflect the fact that all *function
195call*, *function return*, and *exception* events are monitored, and precise
196timings are made for the intervals between these events (during which time the
197user's code is executing). In contrast, :dfn:`statistical profiling` (which is
198not done by this module) randomly samples the effective instruction pointer, and
199deduces where time is being spent. The latter technique traditionally involves
200less overhead (as the code does not need to be instrumented), but provides only
201relative indications of where time is being spent.
202
203In Python, since there is an interpreter active during execution, the presence
204of instrumented code is not required to do deterministic profiling. Python
205automatically provides a :dfn:`hook` (optional callback) for each event. In
206addition, the interpreted nature of Python tends to add so much overhead to
207execution, that deterministic profiling tends to only add small processing
208overhead in typical applications. The result is that deterministic profiling is
209not that expensive, yet provides extensive run time statistics about the
210execution of a Python program.
211
212Call count statistics can be used to identify bugs in code (surprising counts),
213and to identify possible inline-expansion points (high call counts). Internal
214time statistics can be used to identify "hot loops" that should be carefully
215optimized. Cumulative time statistics should be used to identify high level
216errors in the selection of algorithms. Note that the unusual handling of
217cumulative times in this profiler allows statistics for recursive
218implementations of algorithms to be directly compared to iterative
219implementations.
220
221
222Reference Manual -- :mod:`profile` and :mod:`cProfile`
223======================================================
224
225.. module:: cProfile
226 :synopsis: Python profiler
227
228
229The primary entry point for the profiler is the global function
230:func:`profile.run` (resp. :func:`cProfile.run`). It is typically used to create
231any profile information. The reports are formatted and printed using methods of
232the class :class:`pstats.Stats`. The following is a description of all of these
233standard entry points and functions. For a more in-depth view of some of the
234code, consider reading the later section on Profiler Extensions, which includes
235discussion of how to derive "better" profilers from the classes presented, or
236reading the source code for these modules.
237
238
Georg Brandl18244152009-09-02 20:34:52 +0000239.. function:: run(command, filename=None, sort=-1)
Georg Brandl116aa622007-08-15 14:28:22 +0000240
241 This function takes a single argument that can be passed to the :func:`exec`
242 function, and an optional file name. In all cases this routine attempts to
243 :func:`exec` its first argument, and gather profiling statistics from the
244 execution. If no file name is present, then this function automatically
245 prints a simple profiling report, sorted by the standard name string
246 (file/line/function-name) that is presented in each line. The following is a
247 typical output from such a call::
248
249 2706 function calls (2004 primitive calls) in 4.504 CPU seconds
250
251 Ordered by: standard name
252
253 ncalls tottime percall cumtime percall filename:lineno(function)
254 2 0.006 0.003 0.953 0.477 pobject.py:75(save_objects)
255 43/3 0.533 0.012 0.749 0.250 pobject.py:99(evaluate)
256 ...
257
258 The first line indicates that 2706 calls were monitored. Of those calls, 2004
259 were :dfn:`primitive`. We define :dfn:`primitive` to mean that the call was not
260 induced via recursion. The next line: ``Ordered by: standard name``, indicates
261 that the text string in the far right column was used to sort the output. The
262 column headings include:
263
Georg Brandl48310cd2009-01-03 21:18:54 +0000264 ncalls
Georg Brandl116aa622007-08-15 14:28:22 +0000265 for the number of calls,
266
Georg Brandl48310cd2009-01-03 21:18:54 +0000267 tottime
Georg Brandl18244152009-09-02 20:34:52 +0000268 for the total time spent in the given function (and excluding time made in
269 calls to sub-functions),
Georg Brandl116aa622007-08-15 14:28:22 +0000270
Georg Brandl48310cd2009-01-03 21:18:54 +0000271 percall
Georg Brandl116aa622007-08-15 14:28:22 +0000272 is the quotient of ``tottime`` divided by ``ncalls``
273
Georg Brandl48310cd2009-01-03 21:18:54 +0000274 cumtime
Georg Brandl116aa622007-08-15 14:28:22 +0000275 is the total time spent in this and all subfunctions (from invocation till
276 exit). This figure is accurate *even* for recursive functions.
277
Georg Brandl48310cd2009-01-03 21:18:54 +0000278 percall
Georg Brandl116aa622007-08-15 14:28:22 +0000279 is the quotient of ``cumtime`` divided by primitive calls
280
Georg Brandl48310cd2009-01-03 21:18:54 +0000281 filename:lineno(function)
Georg Brandl116aa622007-08-15 14:28:22 +0000282 provides the respective data of each function
283
284 When there are two numbers in the first column (for example, ``43/3``), then the
285 latter is the number of primitive calls, and the former is the actual number of
286 calls. Note that when the function does not recurse, these two values are the
287 same, and only the single figure is printed.
288
Georg Brandl18244152009-09-02 20:34:52 +0000289 If *sort* is given, it can be one of ``'stdname'`` (sort by filename:lineno),
290 ``'calls'`` (sort by number of calls), ``'time'`` (sort by total time) or
291 ``'cumulative'`` (sort by cumulative time). The default is ``'stdname'``.
Georg Brandl116aa622007-08-15 14:28:22 +0000292
Georg Brandl18244152009-09-02 20:34:52 +0000293
294.. function:: runctx(command, globals, locals, filename=None)
Georg Brandl116aa622007-08-15 14:28:22 +0000295
296 This function is similar to :func:`run`, with added arguments to supply the
297 globals and locals dictionaries for the *command* string.
298
Georg Brandl116aa622007-08-15 14:28:22 +0000299
Georg Brandl18244152009-09-02 20:34:52 +0000300Analysis of the profiler data is done using the :class:`pstats.Stats` class.
Georg Brandl116aa622007-08-15 14:28:22 +0000301
302
303.. module:: pstats
304 :synopsis: Statistics object for use with the profiler.
305
306
Georg Brandl18244152009-09-02 20:34:52 +0000307.. class:: Stats(*filenames, stream=sys.stdout)
Georg Brandl116aa622007-08-15 14:28:22 +0000308
309 This class constructor creates an instance of a "statistics object" from a
310 *filename* (or set of filenames). :class:`Stats` objects are manipulated by
311 methods, in order to print useful reports. You may specify an alternate output
312 stream by giving the keyword argument, ``stream``.
313
314 The file selected by the above constructor must have been created by the
315 corresponding version of :mod:`profile` or :mod:`cProfile`. To be specific,
316 there is *no* file compatibility guaranteed with future versions of this
317 profiler, and there is no compatibility with files produced by other profilers.
318 If several files are provided, all the statistics for identical functions will
319 be coalesced, so that an overall view of several processes can be considered in
320 a single report. If additional files need to be combined with data in an
321 existing :class:`Stats` object, the :meth:`add` method can be used.
322
Christian Heimes5b5e81c2007-12-31 16:14:33 +0000323 .. (such as the old system profiler).
324
Georg Brandl116aa622007-08-15 14:28:22 +0000325
326.. _profile-stats:
327
328The :class:`Stats` Class
329------------------------
330
331:class:`Stats` objects have the following methods:
332
333
334.. method:: Stats.strip_dirs()
335
336 This method for the :class:`Stats` class removes all leading path information
337 from file names. It is very useful in reducing the size of the printout to fit
338 within (close to) 80 columns. This method modifies the object, and the stripped
339 information is lost. After performing a strip operation, the object is
340 considered to have its entries in a "random" order, as it was just after object
341 initialization and loading. If :meth:`strip_dirs` causes two function names to
342 be indistinguishable (they are on the same line of the same filename, and have
343 the same function name), then the statistics for these two entries are
344 accumulated into a single entry.
345
346
Georg Brandl18244152009-09-02 20:34:52 +0000347.. method:: Stats.add(*filenames)
Georg Brandl116aa622007-08-15 14:28:22 +0000348
349 This method of the :class:`Stats` class accumulates additional profiling
350 information into the current profiling object. Its arguments should refer to
351 filenames created by the corresponding version of :func:`profile.run` or
352 :func:`cProfile.run`. Statistics for identically named (re: file, line, name)
353 functions are automatically accumulated into single function statistics.
354
355
356.. method:: Stats.dump_stats(filename)
357
358 Save the data loaded into the :class:`Stats` object to a file named *filename*.
359 The file is created if it does not exist, and is overwritten if it already
360 exists. This is equivalent to the method of the same name on the
361 :class:`profile.Profile` and :class:`cProfile.Profile` classes.
362
Georg Brandl116aa622007-08-15 14:28:22 +0000363
Georg Brandl18244152009-09-02 20:34:52 +0000364.. method:: Stats.sort_stats(*keys)
Georg Brandl116aa622007-08-15 14:28:22 +0000365
366 This method modifies the :class:`Stats` object by sorting it according to the
367 supplied criteria. The argument is typically a string identifying the basis of
368 a sort (example: ``'time'`` or ``'name'``).
369
370 When more than one key is provided, then additional keys are used as secondary
371 criteria when there is equality in all keys selected before them. For example,
372 ``sort_stats('name', 'file')`` will sort all the entries according to their
373 function name, and resolve all ties (identical function names) by sorting by
374 file name.
375
376 Abbreviations can be used for any key names, as long as the abbreviation is
377 unambiguous. The following are the keys currently defined:
378
379 +------------------+----------------------+
380 | Valid Arg | Meaning |
381 +==================+======================+
382 | ``'calls'`` | call count |
383 +------------------+----------------------+
384 | ``'cumulative'`` | cumulative time |
385 +------------------+----------------------+
386 | ``'file'`` | file name |
387 +------------------+----------------------+
388 | ``'module'`` | file name |
389 +------------------+----------------------+
390 | ``'pcalls'`` | primitive call count |
391 +------------------+----------------------+
392 | ``'line'`` | line number |
393 +------------------+----------------------+
394 | ``'name'`` | function name |
395 +------------------+----------------------+
396 | ``'nfl'`` | name/file/line |
397 +------------------+----------------------+
398 | ``'stdname'`` | standard name |
399 +------------------+----------------------+
400 | ``'time'`` | internal time |
401 +------------------+----------------------+
402
403 Note that all sorts on statistics are in descending order (placing most time
404 consuming items first), where as name, file, and line number searches are in
405 ascending order (alphabetical). The subtle distinction between ``'nfl'`` and
406 ``'stdname'`` is that the standard name is a sort of the name as printed, which
407 means that the embedded line numbers get compared in an odd way. For example,
408 lines 3, 20, and 40 would (if the file names were the same) appear in the string
409 order 20, 3 and 40. In contrast, ``'nfl'`` does a numeric compare of the line
410 numbers. In fact, ``sort_stats('nfl')`` is the same as ``sort_stats('name',
411 'file', 'line')``.
412
413 For backward-compatibility reasons, the numeric arguments ``-1``, ``0``, ``1``,
414 and ``2`` are permitted. They are interpreted as ``'stdname'``, ``'calls'``,
415 ``'time'``, and ``'cumulative'`` respectively. If this old style format
416 (numeric) is used, only one sort key (the numeric key) will be used, and
417 additional arguments will be silently ignored.
418
Christian Heimes5b5e81c2007-12-31 16:14:33 +0000419 .. For compatibility with the old profiler,
Georg Brandl116aa622007-08-15 14:28:22 +0000420
421
422.. method:: Stats.reverse_order()
423
424 This method for the :class:`Stats` class reverses the ordering of the basic list
425 within the object. Note that by default ascending vs descending order is
426 properly selected based on the sort key of choice.
427
Christian Heimes5b5e81c2007-12-31 16:14:33 +0000428 .. This method is provided primarily for compatibility with the old profiler.
Georg Brandl116aa622007-08-15 14:28:22 +0000429
430
Georg Brandl18244152009-09-02 20:34:52 +0000431.. method:: Stats.print_stats(*restrictions)
Georg Brandl116aa622007-08-15 14:28:22 +0000432
433 This method for the :class:`Stats` class prints out a report as described in the
434 :func:`profile.run` definition.
435
436 The order of the printing is based on the last :meth:`sort_stats` operation done
437 on the object (subject to caveats in :meth:`add` and :meth:`strip_dirs`).
438
439 The arguments provided (if any) can be used to limit the list down to the
440 significant entries. Initially, the list is taken to be the complete set of
441 profiled functions. Each restriction is either an integer (to select a count of
442 lines), or a decimal fraction between 0.0 and 1.0 inclusive (to select a
443 percentage of lines), or a regular expression (to pattern match the standard
444 name that is printed; as of Python 1.5b1, this uses the Perl-style regular
445 expression syntax defined by the :mod:`re` module). If several restrictions are
446 provided, then they are applied sequentially. For example::
447
448 print_stats(.1, 'foo:')
449
450 would first limit the printing to first 10% of list, and then only print
451 functions that were part of filename :file:`.\*foo:`. In contrast, the
452 command::
453
454 print_stats('foo:', .1)
455
456 would limit the list to all functions having file names :file:`.\*foo:`, and
457 then proceed to only print the first 10% of them.
458
459
Georg Brandl18244152009-09-02 20:34:52 +0000460.. method:: Stats.print_callers(*restrictions)
Georg Brandl116aa622007-08-15 14:28:22 +0000461
462 This method for the :class:`Stats` class prints a list of all functions that
463 called each function in the profiled database. The ordering is identical to
464 that provided by :meth:`print_stats`, and the definition of the restricting
465 argument is also identical. Each caller is reported on its own line. The
466 format differs slightly depending on the profiler that produced the stats:
467
468 * With :mod:`profile`, a number is shown in parentheses after each caller to
469 show how many times this specific call was made. For convenience, a second
470 non-parenthesized number repeats the cumulative time spent in the function
471 at the right.
472
Christian Heimesc3f30c42008-02-22 16:37:40 +0000473 * With :mod:`cProfile`, each caller is preceded by three numbers: the number of
Georg Brandl116aa622007-08-15 14:28:22 +0000474 times this specific call was made, and the total and cumulative times spent in
475 the current function while it was invoked by this specific caller.
476
477
Georg Brandl18244152009-09-02 20:34:52 +0000478.. method:: Stats.print_callees(*restrictions)
Georg Brandl116aa622007-08-15 14:28:22 +0000479
480 This method for the :class:`Stats` class prints a list of all function that were
481 called by the indicated function. Aside from this reversal of direction of
482 calls (re: called vs was called by), the arguments and ordering are identical to
483 the :meth:`print_callers` method.
484
485
486.. _profile-limits:
487
488Limitations
489===========
490
491One limitation has to do with accuracy of timing information. There is a
492fundamental problem with deterministic profilers involving accuracy. The most
493obvious restriction is that the underlying "clock" is only ticking at a rate
494(typically) of about .001 seconds. Hence no measurements will be more accurate
495than the underlying clock. If enough measurements are taken, then the "error"
496will tend to average out. Unfortunately, removing this first error induces a
497second source of error.
498
499The second problem is that it "takes a while" from when an event is dispatched
500until the profiler's call to get the time actually *gets* the state of the
501clock. Similarly, there is a certain lag when exiting the profiler event
502handler from the time that the clock's value was obtained (and then squirreled
503away), until the user's code is once again executing. As a result, functions
504that are called many times, or call many functions, will typically accumulate
505this error. The error that accumulates in this fashion is typically less than
506the accuracy of the clock (less than one clock tick), but it *can* accumulate
507and become very significant.
508
509The problem is more important with :mod:`profile` than with the lower-overhead
510:mod:`cProfile`. For this reason, :mod:`profile` provides a means of
511calibrating itself for a given platform so that this error can be
512probabilistically (on the average) removed. After the profiler is calibrated, it
513will be more accurate (in a least square sense), but it will sometimes produce
514negative numbers (when call counts are exceptionally low, and the gods of
515probability work against you :-). ) Do *not* be alarmed by negative numbers in
516the profile. They should *only* appear if you have calibrated your profiler,
517and the results are actually better than without calibration.
518
519
520.. _profile-calibration:
521
522Calibration
523===========
524
525The profiler of the :mod:`profile` module subtracts a constant from each event
526handling time to compensate for the overhead of calling the time function, and
527socking away the results. By default, the constant is 0. The following
528procedure can be used to obtain a better constant for a given platform (see
529discussion in section Limitations above). ::
530
531 import profile
532 pr = profile.Profile()
533 for i in range(5):
Georg Brandl6911e3c2007-09-04 07:15:32 +0000534 print(pr.calibrate(10000))
Georg Brandl116aa622007-08-15 14:28:22 +0000535
536The method executes the number of Python calls given by the argument, directly
537and again under the profiler, measuring the time for both. It then computes the
538hidden overhead per profiler event, and returns that as a float. For example,
539on an 800 MHz Pentium running Windows 2000, and using Python's time.clock() as
540the timer, the magical number is about 12.5e-6.
541
542The object of this exercise is to get a fairly consistent result. If your
543computer is *very* fast, or your timer function has poor resolution, you might
544have to pass 100000, or even 1000000, to get consistent results.
545
Georg Brandle6bcc912008-05-12 18:05:20 +0000546When you have a consistent answer, there are three ways you can use it::
Georg Brandl116aa622007-08-15 14:28:22 +0000547
548 import profile
549
550 # 1. Apply computed bias to all Profile instances created hereafter.
551 profile.Profile.bias = your_computed_bias
552
553 # 2. Apply computed bias to a specific Profile instance.
554 pr = profile.Profile()
555 pr.bias = your_computed_bias
556
557 # 3. Specify computed bias in instance constructor.
558 pr = profile.Profile(bias=your_computed_bias)
559
560If you have a choice, you are better off choosing a smaller constant, and then
561your results will "less often" show up as negative in profile statistics.
562
563
564.. _profiler-extensions:
565
566Extensions --- Deriving Better Profilers
567========================================
568
569The :class:`Profile` class of both modules, :mod:`profile` and :mod:`cProfile`,
570were written so that derived classes could be developed to extend the profiler.
571The details are not described here, as doing this successfully requires an
572expert understanding of how the :class:`Profile` class works internally. Study
573the source code of the module carefully if you want to pursue this.
574
575If all you want to do is change how current time is determined (for example, to
576force use of wall-clock time or elapsed process time), pass the timing function
577you want to the :class:`Profile` class constructor::
578
579 pr = profile.Profile(your_time_func)
580
581The resulting profiler will then call :func:`your_time_func`.
582
583:class:`profile.Profile`
584 :func:`your_time_func` should return a single number, or a list of numbers whose
585 sum is the current time (like what :func:`os.times` returns). If the function
586 returns a single time number, or the list of returned numbers has length 2, then
587 you will get an especially fast version of the dispatch routine.
588
589 Be warned that you should calibrate the profiler class for the timer function
590 that you choose. For most machines, a timer that returns a lone integer value
591 will provide the best results in terms of low overhead during profiling.
592 (:func:`os.times` is *pretty* bad, as it returns a tuple of floating point
593 values). If you want to substitute a better timer in the cleanest fashion,
594 derive a class and hardwire a replacement dispatch method that best handles your
595 timer call, along with the appropriate calibration constant.
596
597:class:`cProfile.Profile`
Georg Brandl95817b32008-05-11 14:30:18 +0000598 :func:`your_time_func` should return a single number. If it returns
Georg Brandl116aa622007-08-15 14:28:22 +0000599 integers, you can also invoke the class constructor with a second argument
600 specifying the real duration of one unit of time. For example, if
601 :func:`your_integer_time_func` returns times measured in thousands of seconds,
602 you would constuct the :class:`Profile` instance as follows::
603
604 pr = profile.Profile(your_integer_time_func, 0.001)
605
606 As the :mod:`cProfile.Profile` class cannot be calibrated, custom timer
607 functions should be used with care and should be as fast as possible. For the
608 best results with a custom timer, it might be necessary to hard-code it in the C
609 source of the internal :mod:`_lsprof` module.
610
611.. rubric:: Footnotes
612
613.. [#] Updated and converted to LaTeX by Guido van Rossum. Further updated by Armin
614 Rigo to integrate the documentation for the new :mod:`cProfile` module of Python
615 2.5.