blob: ca54021630db42cfa7571e0aeac9b3cac72914fc [file] [log] [blame]
Georg Brandl116aa622007-08-15 14:28:22 +00001.. _profile:
2
3********************
4The Python Profilers
5********************
6
7.. sectionauthor:: James Roskind
8
Benjamin Petersona0dfa822009-11-13 02:25:08 +00009.. module:: profile
10 :synopsis: Python source profiler.
Georg Brandl116aa622007-08-15 14:28:22 +000011
Raymond Hettinger469271d2011-01-27 20:38:46 +000012**Source code:** :source:`Lib/profile.py` and :source:`Lib/pstats.py`
13
14--------------
Georg Brandl116aa622007-08-15 14:28:22 +000015
Georg Brandl116aa622007-08-15 14:28:22 +000016.. _profiler-introduction:
17
18Introduction to the profilers
19=============================
20
21.. index::
22 single: deterministic profiling
23 single: profiling, deterministic
24
Georg Brandl4eb65972010-10-14 06:41:42 +000025A :dfn:`profiler` is a program that describes the run time performance of a
26program, providing a variety of statistics. This documentation describes the
27profiler functionality provided in the modules :mod:`cProfile`, :mod:`profile`
28and :mod:`pstats`. This profiler provides :dfn:`deterministic profiling` of
29Python programs. It also provides a series of report generation tools to allow
30users to rapidly examine the results of a profile operation.
Georg Brandl116aa622007-08-15 14:28:22 +000031
Fred Drake0e474a82007-10-11 18:01:43 +000032The Python standard library provides two different profilers:
Georg Brandl116aa622007-08-15 14:28:22 +000033
Georg Brandl4eb65972010-10-14 06:41:42 +0000341. :mod:`cProfile` is recommended for most users; it's a C extension with
35 reasonable overhead that makes it suitable for profiling long-running
36 programs. Based on :mod:`lsprof`, contributed by Brett Rosen and Ted
37 Czotter.
Georg Brandl116aa622007-08-15 14:28:22 +000038
Georg Brandl4eb65972010-10-14 06:41:42 +0000392. :mod:`profile`, a pure Python module whose interface is imitated by
40 :mod:`cProfile`. Adds significant overhead to profiled programs. If you're
41 trying to extend the profiler in some way, the task might be easier with this
Éric Araujoee19c772011-07-28 22:56:24 +020042 module.
Georg Brandl116aa622007-08-15 14:28:22 +000043
Georg Brandl116aa622007-08-15 14:28:22 +000044The :mod:`profile` and :mod:`cProfile` modules export the same interface, so
Christian Heimesdae2a892008-04-19 00:55:37 +000045they are mostly interchangeable; :mod:`cProfile` has a much lower overhead but
Georg Brandl4eb65972010-10-14 06:41:42 +000046is newer and might not be available on all systems. :mod:`cProfile` is really a
47compatibility layer on top of the internal :mod:`_lsprof` module.
48
49.. note::
50
51 The profiler modules are designed to provide an execution profile for a given
52 program, not for benchmarking purposes (for that, there is :mod:`timeit` for
Ezio Melottib5ff3e42011-04-03 16:20:21 +030053 reasonably accurate results). This particularly applies to benchmarking
Georg Brandl4eb65972010-10-14 06:41:42 +000054 Python code against C code: the profilers introduce overhead for Python code,
55 but not for C-level functions, and so the C code would seem faster than any
56 Python one.
Georg Brandl116aa622007-08-15 14:28:22 +000057
58
59.. _profile-instant:
60
61Instant User's Manual
62=====================
63
64This section is provided for users that "don't want to read the manual." It
65provides a very brief overview, and allows a user to rapidly perform profiling
66on an existing application.
67
68To profile an application with a main entry point of :func:`foo`, you would add
69the following to your module::
70
71 import cProfile
72 cProfile.run('foo()')
73
74(Use :mod:`profile` instead of :mod:`cProfile` if the latter is not available on
75your system.)
76
77The above action would cause :func:`foo` to be run, and a series of informative
78lines (the profile) to be printed. The above approach is most useful when
79working with the interpreter. If you would like to save the results of a
80profile into a file for later examination, you can supply a file name as the
81second argument to the :func:`run` function::
82
83 import cProfile
84 cProfile.run('foo()', 'fooprof')
85
86The file :file:`cProfile.py` can also be invoked as a script to profile another
87script. For example::
88
89 python -m cProfile myscript.py
90
91:file:`cProfile.py` accepts two optional arguments on the command line::
92
93 cProfile.py [-o output_file] [-s sort_order]
94
Benjamin Peterson5e55b3e2010-02-03 02:35:45 +000095``-s`` only applies to standard output (``-o`` is not supplied).
Georg Brandl116aa622007-08-15 14:28:22 +000096Look in the :class:`Stats` documentation for valid sort values.
97
98When you wish to review the profile, you should use the methods in the
99:mod:`pstats` module. Typically you would load the statistics data as follows::
100
101 import pstats
102 p = pstats.Stats('fooprof')
103
104The class :class:`Stats` (the above code just created an instance of this class)
105has a variety of methods for manipulating and printing the data that was just
106read into ``p``. When you ran :func:`cProfile.run` above, what was printed was
107the result of three method calls::
108
109 p.strip_dirs().sort_stats(-1).print_stats()
110
111The first method removed the extraneous path from all the module names. The
112second method sorted all the entries according to the standard module/line/name
113string that is printed. The third method printed out all the statistics. You
114might try the following sort calls:
115
Christian Heimes5b5e81c2007-12-31 16:14:33 +0000116.. (this is to comply with the semantics of the old profiler).
Georg Brandl116aa622007-08-15 14:28:22 +0000117
118::
119
120 p.sort_stats('name')
121 p.print_stats()
122
123The first call will actually sort the list by function name, and the second call
124will print out the statistics. The following are some interesting calls to
125experiment with::
126
127 p.sort_stats('cumulative').print_stats(10)
128
129This sorts the profile by cumulative time in a function, and then only prints
130the ten most significant lines. If you want to understand what algorithms are
131taking time, the above line is what you would use.
132
133If you were looking to see what functions were looping a lot, and taking a lot
134of time, you would do::
135
136 p.sort_stats('time').print_stats(10)
137
138to sort according to time spent within each function, and then print the
139statistics for the top ten functions.
140
141You might also try::
142
143 p.sort_stats('file').print_stats('__init__')
144
145This will sort all the statistics by file name, and then print out statistics
146for only the class init methods (since they are spelled with ``__init__`` in
147them). As one final example, you could try::
148
149 p.sort_stats('time', 'cum').print_stats(.5, 'init')
150
151This line sorts statistics with a primary key of time, and a secondary key of
152cumulative time, and then prints out some of the statistics. To be specific, the
153list is first culled down to 50% (re: ``.5``) of its original size, then only
154lines containing ``init`` are maintained, and that sub-sub-list is printed.
155
156If you wondered what functions called the above functions, you could now (``p``
157is still sorted according to the last criteria) do::
158
159 p.print_callers(.5, 'init')
160
161and you would get a list of callers for each of the listed functions.
162
163If you want more functionality, you're going to have to read the manual, or
164guess what the following functions do::
165
166 p.print_callees()
167 p.add('fooprof')
168
169Invoked as a script, the :mod:`pstats` module is a statistics browser for
170reading and examining profile dumps. It has a simple line-oriented interface
171(implemented using :mod:`cmd`) and interactive help.
172
173
174.. _deterministic-profiling:
175
176What Is Deterministic Profiling?
177================================
178
179:dfn:`Deterministic profiling` is meant to reflect the fact that all *function
180call*, *function return*, and *exception* events are monitored, and precise
181timings are made for the intervals between these events (during which time the
182user's code is executing). In contrast, :dfn:`statistical profiling` (which is
183not done by this module) randomly samples the effective instruction pointer, and
184deduces where time is being spent. The latter technique traditionally involves
185less overhead (as the code does not need to be instrumented), but provides only
186relative indications of where time is being spent.
187
188In Python, since there is an interpreter active during execution, the presence
189of instrumented code is not required to do deterministic profiling. Python
190automatically provides a :dfn:`hook` (optional callback) for each event. In
191addition, the interpreted nature of Python tends to add so much overhead to
192execution, that deterministic profiling tends to only add small processing
193overhead in typical applications. The result is that deterministic profiling is
194not that expensive, yet provides extensive run time statistics about the
195execution of a Python program.
196
197Call count statistics can be used to identify bugs in code (surprising counts),
198and to identify possible inline-expansion points (high call counts). Internal
199time statistics can be used to identify "hot loops" that should be carefully
200optimized. Cumulative time statistics should be used to identify high level
201errors in the selection of algorithms. Note that the unusual handling of
202cumulative times in this profiler allows statistics for recursive
203implementations of algorithms to be directly compared to iterative
204implementations.
205
206
207Reference Manual -- :mod:`profile` and :mod:`cProfile`
208======================================================
209
210.. module:: cProfile
211 :synopsis: Python profiler
212
213
214The primary entry point for the profiler is the global function
215:func:`profile.run` (resp. :func:`cProfile.run`). It is typically used to create
216any profile information. The reports are formatted and printed using methods of
217the class :class:`pstats.Stats`. The following is a description of all of these
218standard entry points and functions. For a more in-depth view of some of the
219code, consider reading the later section on Profiler Extensions, which includes
220discussion of how to derive "better" profilers from the classes presented, or
221reading the source code for these modules.
222
223
Georg Brandl18244152009-09-02 20:34:52 +0000224.. function:: run(command, filename=None, sort=-1)
Georg Brandl116aa622007-08-15 14:28:22 +0000225
226 This function takes a single argument that can be passed to the :func:`exec`
227 function, and an optional file name. In all cases this routine attempts to
228 :func:`exec` its first argument, and gather profiling statistics from the
229 execution. If no file name is present, then this function automatically
230 prints a simple profiling report, sorted by the standard name string
231 (file/line/function-name) that is presented in each line. The following is a
232 typical output from such a call::
233
234 2706 function calls (2004 primitive calls) in 4.504 CPU seconds
235
236 Ordered by: standard name
237
238 ncalls tottime percall cumtime percall filename:lineno(function)
239 2 0.006 0.003 0.953 0.477 pobject.py:75(save_objects)
240 43/3 0.533 0.012 0.749 0.250 pobject.py:99(evaluate)
241 ...
242
243 The first line indicates that 2706 calls were monitored. Of those calls, 2004
244 were :dfn:`primitive`. We define :dfn:`primitive` to mean that the call was not
245 induced via recursion. The next line: ``Ordered by: standard name``, indicates
246 that the text string in the far right column was used to sort the output. The
247 column headings include:
248
Georg Brandl48310cd2009-01-03 21:18:54 +0000249 ncalls
Georg Brandl116aa622007-08-15 14:28:22 +0000250 for the number of calls,
251
Georg Brandl48310cd2009-01-03 21:18:54 +0000252 tottime
Georg Brandl18244152009-09-02 20:34:52 +0000253 for the total time spent in the given function (and excluding time made in
254 calls to sub-functions),
Georg Brandl116aa622007-08-15 14:28:22 +0000255
Georg Brandl48310cd2009-01-03 21:18:54 +0000256 percall
Georg Brandl116aa622007-08-15 14:28:22 +0000257 is the quotient of ``tottime`` divided by ``ncalls``
258
Georg Brandl48310cd2009-01-03 21:18:54 +0000259 cumtime
Georg Brandl116aa622007-08-15 14:28:22 +0000260 is the total time spent in this and all subfunctions (from invocation till
261 exit). This figure is accurate *even* for recursive functions.
262
Georg Brandl48310cd2009-01-03 21:18:54 +0000263 percall
Georg Brandl116aa622007-08-15 14:28:22 +0000264 is the quotient of ``cumtime`` divided by primitive calls
265
Georg Brandl48310cd2009-01-03 21:18:54 +0000266 filename:lineno(function)
Georg Brandl116aa622007-08-15 14:28:22 +0000267 provides the respective data of each function
268
269 When there are two numbers in the first column (for example, ``43/3``), then the
270 latter is the number of primitive calls, and the former is the actual number of
271 calls. Note that when the function does not recurse, these two values are the
272 same, and only the single figure is printed.
273
Georg Brandl18244152009-09-02 20:34:52 +0000274 If *sort* is given, it can be one of ``'stdname'`` (sort by filename:lineno),
275 ``'calls'`` (sort by number of calls), ``'time'`` (sort by total time) or
276 ``'cumulative'`` (sort by cumulative time). The default is ``'stdname'``.
Georg Brandl116aa622007-08-15 14:28:22 +0000277
Georg Brandl18244152009-09-02 20:34:52 +0000278
279.. function:: runctx(command, globals, locals, filename=None)
Georg Brandl116aa622007-08-15 14:28:22 +0000280
281 This function is similar to :func:`run`, with added arguments to supply the
282 globals and locals dictionaries for the *command* string.
283
Georg Brandl116aa622007-08-15 14:28:22 +0000284
Georg Brandl18244152009-09-02 20:34:52 +0000285Analysis of the profiler data is done using the :class:`pstats.Stats` class.
Georg Brandl116aa622007-08-15 14:28:22 +0000286
287
288.. module:: pstats
289 :synopsis: Statistics object for use with the profiler.
290
291
Georg Brandl18244152009-09-02 20:34:52 +0000292.. class:: Stats(*filenames, stream=sys.stdout)
Georg Brandl116aa622007-08-15 14:28:22 +0000293
294 This class constructor creates an instance of a "statistics object" from a
295 *filename* (or set of filenames). :class:`Stats` objects are manipulated by
296 methods, in order to print useful reports. You may specify an alternate output
297 stream by giving the keyword argument, ``stream``.
298
299 The file selected by the above constructor must have been created by the
300 corresponding version of :mod:`profile` or :mod:`cProfile`. To be specific,
301 there is *no* file compatibility guaranteed with future versions of this
302 profiler, and there is no compatibility with files produced by other profilers.
303 If several files are provided, all the statistics for identical functions will
304 be coalesced, so that an overall view of several processes can be considered in
305 a single report. If additional files need to be combined with data in an
306 existing :class:`Stats` object, the :meth:`add` method can be used.
307
Christian Heimes5b5e81c2007-12-31 16:14:33 +0000308 .. (such as the old system profiler).
309
Georg Brandl116aa622007-08-15 14:28:22 +0000310
311.. _profile-stats:
312
313The :class:`Stats` Class
314------------------------
315
316:class:`Stats` objects have the following methods:
317
318
319.. method:: Stats.strip_dirs()
320
321 This method for the :class:`Stats` class removes all leading path information
322 from file names. It is very useful in reducing the size of the printout to fit
323 within (close to) 80 columns. This method modifies the object, and the stripped
324 information is lost. After performing a strip operation, the object is
325 considered to have its entries in a "random" order, as it was just after object
326 initialization and loading. If :meth:`strip_dirs` causes two function names to
327 be indistinguishable (they are on the same line of the same filename, and have
328 the same function name), then the statistics for these two entries are
329 accumulated into a single entry.
330
331
Georg Brandl18244152009-09-02 20:34:52 +0000332.. method:: Stats.add(*filenames)
Georg Brandl116aa622007-08-15 14:28:22 +0000333
334 This method of the :class:`Stats` class accumulates additional profiling
335 information into the current profiling object. Its arguments should refer to
336 filenames created by the corresponding version of :func:`profile.run` or
337 :func:`cProfile.run`. Statistics for identically named (re: file, line, name)
338 functions are automatically accumulated into single function statistics.
339
340
341.. method:: Stats.dump_stats(filename)
342
343 Save the data loaded into the :class:`Stats` object to a file named *filename*.
344 The file is created if it does not exist, and is overwritten if it already
345 exists. This is equivalent to the method of the same name on the
346 :class:`profile.Profile` and :class:`cProfile.Profile` classes.
347
Georg Brandl116aa622007-08-15 14:28:22 +0000348
Georg Brandl18244152009-09-02 20:34:52 +0000349.. method:: Stats.sort_stats(*keys)
Georg Brandl116aa622007-08-15 14:28:22 +0000350
351 This method modifies the :class:`Stats` object by sorting it according to the
352 supplied criteria. The argument is typically a string identifying the basis of
353 a sort (example: ``'time'`` or ``'name'``).
354
355 When more than one key is provided, then additional keys are used as secondary
356 criteria when there is equality in all keys selected before them. For example,
357 ``sort_stats('name', 'file')`` will sort all the entries according to their
358 function name, and resolve all ties (identical function names) by sorting by
359 file name.
360
361 Abbreviations can be used for any key names, as long as the abbreviation is
362 unambiguous. The following are the keys currently defined:
363
364 +------------------+----------------------+
365 | Valid Arg | Meaning |
366 +==================+======================+
367 | ``'calls'`` | call count |
368 +------------------+----------------------+
369 | ``'cumulative'`` | cumulative time |
370 +------------------+----------------------+
371 | ``'file'`` | file name |
372 +------------------+----------------------+
373 | ``'module'`` | file name |
374 +------------------+----------------------+
375 | ``'pcalls'`` | primitive call count |
376 +------------------+----------------------+
377 | ``'line'`` | line number |
378 +------------------+----------------------+
379 | ``'name'`` | function name |
380 +------------------+----------------------+
381 | ``'nfl'`` | name/file/line |
382 +------------------+----------------------+
383 | ``'stdname'`` | standard name |
384 +------------------+----------------------+
385 | ``'time'`` | internal time |
386 +------------------+----------------------+
387
388 Note that all sorts on statistics are in descending order (placing most time
389 consuming items first), where as name, file, and line number searches are in
390 ascending order (alphabetical). The subtle distinction between ``'nfl'`` and
391 ``'stdname'`` is that the standard name is a sort of the name as printed, which
392 means that the embedded line numbers get compared in an odd way. For example,
393 lines 3, 20, and 40 would (if the file names were the same) appear in the string
394 order 20, 3 and 40. In contrast, ``'nfl'`` does a numeric compare of the line
395 numbers. In fact, ``sort_stats('nfl')`` is the same as ``sort_stats('name',
396 'file', 'line')``.
397
398 For backward-compatibility reasons, the numeric arguments ``-1``, ``0``, ``1``,
399 and ``2`` are permitted. They are interpreted as ``'stdname'``, ``'calls'``,
400 ``'time'``, and ``'cumulative'`` respectively. If this old style format
401 (numeric) is used, only one sort key (the numeric key) will be used, and
402 additional arguments will be silently ignored.
403
Christian Heimes5b5e81c2007-12-31 16:14:33 +0000404 .. For compatibility with the old profiler,
Georg Brandl116aa622007-08-15 14:28:22 +0000405
406
407.. method:: Stats.reverse_order()
408
409 This method for the :class:`Stats` class reverses the ordering of the basic list
410 within the object. Note that by default ascending vs descending order is
411 properly selected based on the sort key of choice.
412
Christian Heimes5b5e81c2007-12-31 16:14:33 +0000413 .. This method is provided primarily for compatibility with the old profiler.
Georg Brandl116aa622007-08-15 14:28:22 +0000414
415
Georg Brandl18244152009-09-02 20:34:52 +0000416.. method:: Stats.print_stats(*restrictions)
Georg Brandl116aa622007-08-15 14:28:22 +0000417
418 This method for the :class:`Stats` class prints out a report as described in the
419 :func:`profile.run` definition.
420
421 The order of the printing is based on the last :meth:`sort_stats` operation done
422 on the object (subject to caveats in :meth:`add` and :meth:`strip_dirs`).
423
424 The arguments provided (if any) can be used to limit the list down to the
425 significant entries. Initially, the list is taken to be the complete set of
426 profiled functions. Each restriction is either an integer (to select a count of
427 lines), or a decimal fraction between 0.0 and 1.0 inclusive (to select a
428 percentage of lines), or a regular expression (to pattern match the standard
429 name that is printed; as of Python 1.5b1, this uses the Perl-style regular
430 expression syntax defined by the :mod:`re` module). If several restrictions are
431 provided, then they are applied sequentially. For example::
432
433 print_stats(.1, 'foo:')
434
435 would first limit the printing to first 10% of list, and then only print
436 functions that were part of filename :file:`.\*foo:`. In contrast, the
437 command::
438
439 print_stats('foo:', .1)
440
441 would limit the list to all functions having file names :file:`.\*foo:`, and
442 then proceed to only print the first 10% of them.
443
444
Georg Brandl18244152009-09-02 20:34:52 +0000445.. method:: Stats.print_callers(*restrictions)
Georg Brandl116aa622007-08-15 14:28:22 +0000446
447 This method for the :class:`Stats` class prints a list of all functions that
448 called each function in the profiled database. The ordering is identical to
449 that provided by :meth:`print_stats`, and the definition of the restricting
450 argument is also identical. Each caller is reported on its own line. The
451 format differs slightly depending on the profiler that produced the stats:
452
453 * With :mod:`profile`, a number is shown in parentheses after each caller to
454 show how many times this specific call was made. For convenience, a second
455 non-parenthesized number repeats the cumulative time spent in the function
456 at the right.
457
Christian Heimesc3f30c42008-02-22 16:37:40 +0000458 * With :mod:`cProfile`, each caller is preceded by three numbers: the number of
Georg Brandl116aa622007-08-15 14:28:22 +0000459 times this specific call was made, and the total and cumulative times spent in
460 the current function while it was invoked by this specific caller.
461
462
Georg Brandl18244152009-09-02 20:34:52 +0000463.. method:: Stats.print_callees(*restrictions)
Georg Brandl116aa622007-08-15 14:28:22 +0000464
465 This method for the :class:`Stats` class prints a list of all function that were
466 called by the indicated function. Aside from this reversal of direction of
467 calls (re: called vs was called by), the arguments and ordering are identical to
468 the :meth:`print_callers` method.
469
470
471.. _profile-limits:
472
473Limitations
474===========
475
476One limitation has to do with accuracy of timing information. There is a
477fundamental problem with deterministic profilers involving accuracy. The most
478obvious restriction is that the underlying "clock" is only ticking at a rate
479(typically) of about .001 seconds. Hence no measurements will be more accurate
480than the underlying clock. If enough measurements are taken, then the "error"
481will tend to average out. Unfortunately, removing this first error induces a
482second source of error.
483
484The second problem is that it "takes a while" from when an event is dispatched
485until the profiler's call to get the time actually *gets* the state of the
486clock. Similarly, there is a certain lag when exiting the profiler event
487handler from the time that the clock's value was obtained (and then squirreled
488away), until the user's code is once again executing. As a result, functions
489that are called many times, or call many functions, will typically accumulate
490this error. The error that accumulates in this fashion is typically less than
491the accuracy of the clock (less than one clock tick), but it *can* accumulate
492and become very significant.
493
494The problem is more important with :mod:`profile` than with the lower-overhead
495:mod:`cProfile`. For this reason, :mod:`profile` provides a means of
496calibrating itself for a given platform so that this error can be
497probabilistically (on the average) removed. After the profiler is calibrated, it
498will be more accurate (in a least square sense), but it will sometimes produce
499negative numbers (when call counts are exceptionally low, and the gods of
500probability work against you :-). ) Do *not* be alarmed by negative numbers in
501the profile. They should *only* appear if you have calibrated your profiler,
502and the results are actually better than without calibration.
503
504
505.. _profile-calibration:
506
507Calibration
508===========
509
510The profiler of the :mod:`profile` module subtracts a constant from each event
511handling time to compensate for the overhead of calling the time function, and
512socking away the results. By default, the constant is 0. The following
513procedure can be used to obtain a better constant for a given platform (see
514discussion in section Limitations above). ::
515
516 import profile
517 pr = profile.Profile()
518 for i in range(5):
Georg Brandl6911e3c2007-09-04 07:15:32 +0000519 print(pr.calibrate(10000))
Georg Brandl116aa622007-08-15 14:28:22 +0000520
521The method executes the number of Python calls given by the argument, directly
522and again under the profiler, measuring the time for both. It then computes the
523hidden overhead per profiler event, and returns that as a float. For example,
524on an 800 MHz Pentium running Windows 2000, and using Python's time.clock() as
525the timer, the magical number is about 12.5e-6.
526
527The object of this exercise is to get a fairly consistent result. If your
528computer is *very* fast, or your timer function has poor resolution, you might
529have to pass 100000, or even 1000000, to get consistent results.
530
Georg Brandle6bcc912008-05-12 18:05:20 +0000531When you have a consistent answer, there are three ways you can use it::
Georg Brandl116aa622007-08-15 14:28:22 +0000532
533 import profile
534
535 # 1. Apply computed bias to all Profile instances created hereafter.
536 profile.Profile.bias = your_computed_bias
537
538 # 2. Apply computed bias to a specific Profile instance.
539 pr = profile.Profile()
540 pr.bias = your_computed_bias
541
542 # 3. Specify computed bias in instance constructor.
543 pr = profile.Profile(bias=your_computed_bias)
544
545If you have a choice, you are better off choosing a smaller constant, and then
546your results will "less often" show up as negative in profile statistics.
547
548
549.. _profiler-extensions:
550
551Extensions --- Deriving Better Profilers
552========================================
553
554The :class:`Profile` class of both modules, :mod:`profile` and :mod:`cProfile`,
555were written so that derived classes could be developed to extend the profiler.
556The details are not described here, as doing this successfully requires an
557expert understanding of how the :class:`Profile` class works internally. Study
558the source code of the module carefully if you want to pursue this.
559
560If all you want to do is change how current time is determined (for example, to
561force use of wall-clock time or elapsed process time), pass the timing function
562you want to the :class:`Profile` class constructor::
563
564 pr = profile.Profile(your_time_func)
565
566The resulting profiler will then call :func:`your_time_func`.
567
568:class:`profile.Profile`
569 :func:`your_time_func` should return a single number, or a list of numbers whose
570 sum is the current time (like what :func:`os.times` returns). If the function
571 returns a single time number, or the list of returned numbers has length 2, then
572 you will get an especially fast version of the dispatch routine.
573
574 Be warned that you should calibrate the profiler class for the timer function
575 that you choose. For most machines, a timer that returns a lone integer value
576 will provide the best results in terms of low overhead during profiling.
577 (:func:`os.times` is *pretty* bad, as it returns a tuple of floating point
578 values). If you want to substitute a better timer in the cleanest fashion,
579 derive a class and hardwire a replacement dispatch method that best handles your
580 timer call, along with the appropriate calibration constant.
581
582:class:`cProfile.Profile`
Georg Brandl95817b32008-05-11 14:30:18 +0000583 :func:`your_time_func` should return a single number. If it returns
Georg Brandl116aa622007-08-15 14:28:22 +0000584 integers, you can also invoke the class constructor with a second argument
585 specifying the real duration of one unit of time. For example, if
586 :func:`your_integer_time_func` returns times measured in thousands of seconds,
Georg Brandl6faee4e2010-09-21 14:48:28 +0000587 you would construct the :class:`Profile` instance as follows::
Georg Brandl116aa622007-08-15 14:28:22 +0000588
589 pr = profile.Profile(your_integer_time_func, 0.001)
590
591 As the :mod:`cProfile.Profile` class cannot be calibrated, custom timer
592 functions should be used with care and should be as fast as possible. For the
593 best results with a custom timer, it might be necessary to hard-code it in the C
594 source of the internal :mod:`_lsprof` module.