Blame - Doc/library/profile.rst - platform/external/python/cpython2

blob: e2757f6632a6f058555fb50e51893c6b707386d1 [file] [log] [blame]

Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	1	.. _profile:
				2
				3	********************
				4	The Python Profilers
				5	********************
				6
Éric Araujo	29a0b57	2011-08-19 02:14:03 +0200	[diff] [blame]	7	Source code: :source:`Lib/profile.py` and :source:`Lib/pstats.py`
				8
				9	--------------
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	10
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	11	.. _profiler-introduction:
				12
				13	Introduction to the profilers
				14	=============================
				15
				16	.. index::
				17	single: deterministic profiling
				18	single: profiling, deterministic
				19
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	20	:mod:`cProfile` and :mod:`profile` provide :dfn:`deterministic profiling` of
				21	Python programs. A :dfn:`profile` is a set of statistics that describes how
				22	often and for how long various parts of the program executed. These statistics
				23	can be formatted into reports via the :mod:`pstats` module.
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	24
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	25	The Python standard library provides three different implementations of the same
				26	profiling interface:
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	27
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	28	1. :mod:`cProfile` is recommended for most users; it's a C extension with
				29	reasonable overhead that makes it suitable for profiling long-running
				30	programs. Based on :mod:`lsprof`, contributed by Brett Rosen and Ted
				31	Czotter.
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	32
				33	.. versionadded:: 2.5
				34
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	35	2. :mod:`profile`, a pure Python module whose interface is imitated by
				36	:mod:`cProfile`, but which adds significant overhead to profiled programs.
				37	If you're trying to extend the profiler in some way, the task might be easier
				38	with this module.
Andrew M. Kuchling	0892317	2008-04-18 18:39:55 +0000	[diff] [blame]	39
				40	.. versionchanged:: 2.4
Andrew Svetlov	8a9b401	2012-10-31 21:54:45 +0200	[diff] [blame]	41	Now also reports the time spent in calls to built-in functions
				42	and methods.
Andrew M. Kuchling	0892317	2008-04-18 18:39:55 +0000	[diff] [blame]	43
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	44	3. :mod:`hotshot` was an experimental C module that focused on minimizing
Andrew M. Kuchling	0892317	2008-04-18 18:39:55 +0000	[diff] [blame]	45	the overhead of profiling, at the expense of longer data
				46	post-processing times. It is no longer maintained and may be
				47	dropped in a future version of Python.
Georg Brandl	c62ef8b	2009-01-03 20:55:06 +0000	[diff] [blame]	48
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	49
				50	.. versionchanged:: 2.5
Andrew M. Kuchling	0892317	2008-04-18 18:39:55 +0000	[diff] [blame]	51	The results should be more meaningful than in the past: the timing core
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	52	contained a critical bug.
				53
				54	The :mod:`profile` and :mod:`cProfile` modules export the same interface, so
Andrew M. Kuchling	0892317	2008-04-18 18:39:55 +0000	[diff] [blame]	55	they are mostly interchangeable; :mod:`cProfile` has a much lower overhead but
				56	is newer and might not be available on all systems.
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	57	:mod:`cProfile` is really a compatibility layer on top of the internal
Andrew M. Kuchling	0892317	2008-04-18 18:39:55 +0000	[diff] [blame]	58	:mod:`_lsprof` module. The :mod:`hotshot` module is reserved for specialized
				59	usage.
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	60
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	61	.. note::
				62
				63	The profiler modules are designed to provide an execution profile for a given
				64	program, not for benchmarking purposes (for that, there is :mod:`timeit` for
				65	reasonably accurate results). This particularly applies to benchmarking
				66	Python code against C code: the profilers introduce overhead for Python code,
				67	but not for C-level functions, and so the C code would seem faster than any
				68	Python one.
				69
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	70
				71	.. _profile-instant:
				72
				73	Instant User's Manual
				74	=====================
				75
				76	This section is provided for users that "don't want to read the manual." It
				77	provides a very brief overview, and allows a user to rapidly perform profiling
				78	on an existing application.
				79
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	80	To profile a function that takes a single argument, you can do::
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	81
				82	import cProfile
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	83	import re
				84	cProfile.run('re.compile("foo\|bar")')
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	85
				86	(Use :mod:`profile` instead of :mod:`cProfile` if the latter is not available on
				87	your system.)
				88
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	89	The above action would run :func:`re.compile` and print profile results like
				90	the following::
				91
				92	197 function calls (192 primitive calls) in 0.002 seconds
				93
				94	Ordered by: standard name
				95
				96	ncalls tottime percall cumtime percall filename:lineno(function)
				97	1 0.000 0.000 0.001 0.001 <string>:1(<module>)
				98	1 0.000 0.000 0.001 0.001 re.py:212(compile)
				99	1 0.000 0.000 0.001 0.001 re.py:268(_compile)
				100	1 0.000 0.000 0.000 0.000 sre_compile.py:172(_compile_charset)
				101	1 0.000 0.000 0.000 0.000 sre_compile.py:201(_optimize_charset)
				102	4 0.000 0.000 0.000 0.000 sre_compile.py:25(_identityfunction)
				103	3/1 0.000 0.000 0.000 0.000 sre_compile.py:33(_compile)
				104
				105	The first line indicates that 197 calls were monitored. Of those calls, 192
				106	were :dfn:`primitive`, meaning that the call was not induced via recursion. The
				107	next line: ``Ordered by: standard name``, indicates that the text string in the
				108	far right column was used to sort the output. The column headings include:
				109
				110	ncalls
				111	for the number of calls,
				112
				113	tottime
				114	for the total time spent in the given function (and excluding time made in
				115	calls to sub-functions)
				116
				117	percall
				118	is the quotient of ``tottime`` divided by ``ncalls``
				119
				120	cumtime
				121	is the cumulative time spent in this and all subfunctions (from invocation
				122	till exit). This figure is accurate even for recursive functions.
				123
				124	percall
				125	is the quotient of ``cumtime`` divided by primitive calls
				126
				127	filename:lineno(function)
				128	provides the respective data of each function
				129
				130	When there are two numbers in the first column (for example ``3/1``), it means
				131	that the function recursed. The second value is the number of primitive calls
				132	and the former is the total number of calls. Note that when the function does
				133	not recurse, these two values are the same, and only the single figure is
				134	printed.
				135
				136	Instead of printing the output at the end of the profile run, you can save the
				137	results to a file by specifying a filename to the :func:`run` function::
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	138
				139	import cProfile
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	140	import re
				141	cProfile.run('re.compile("foo\|bar")', 'restats')
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	142
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	143	The :class:`pstats.Stats` class reads profile results from a file and formats
				144	them in various ways.
				145
				146	The file :mod:`cProfile` can also be invoked as a script to profile another
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	147	script. For example::
				148
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	149	python -m cProfile [-o output_file] [-s sort_order] myscript.py
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	150
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	151	``-o`` writes the profile results to a file instead of to stdout
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	152
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	153	``-s`` specifies one of the :func:`~pstats.Stats.sort_stats` sort values to sort
				154	the output by. This only applies when ``-o`` is not supplied.
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	155
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	156	The :mod:`pstats` module's :class:`~pstats.Stats` class has a variety of methods
				157	for manipulating and printing the data saved into a profile results file::
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	158
				159	import pstats
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	160	p = pstats.Stats('restats')
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	161	p.strip_dirs().sort_stats(-1).print_stats()
				162
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	163	The :meth:`~pstats.Stats.strip_dirs` method removed the extraneous path from all
				164	the module names. The :meth:`~pstats.Stats.sort_stats` method sorted all the
				165	entries according to the standard module/line/name string that is printed. The
				166	:meth:`~pstats.Stats.print_stats` method printed out all the statistics. You
				167	might try the following sort calls::
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	168
				169	p.sort_stats('name')
				170	p.print_stats()
				171
				172	The first call will actually sort the list by function name, and the second call
				173	will print out the statistics. The following are some interesting calls to
				174	experiment with::
				175
				176	p.sort_stats('cumulative').print_stats(10)
				177
				178	This sorts the profile by cumulative time in a function, and then only prints
				179	the ten most significant lines. If you want to understand what algorithms are
				180	taking time, the above line is what you would use.
				181
				182	If you were looking to see what functions were looping a lot, and taking a lot
				183	of time, you would do::
				184
				185	p.sort_stats('time').print_stats(10)
				186
				187	to sort according to time spent within each function, and then print the
				188	statistics for the top ten functions.
				189
				190	You might also try::
				191
				192	p.sort_stats('file').print_stats('__init__')
				193
				194	This will sort all the statistics by file name, and then print out statistics
				195	for only the class init methods (since they are spelled with ``__init__`` in
				196	them). As one final example, you could try::
				197
				198	p.sort_stats('time', 'cum').print_stats(.5, 'init')
				199
				200	This line sorts statistics with a primary key of time, and a secondary key of
				201	cumulative time, and then prints out some of the statistics. To be specific, the
				202	list is first culled down to 50% (re: ``.5``) of its original size, then only
				203	lines containing ``init`` are maintained, and that sub-sub-list is printed.
				204
				205	If you wondered what functions called the above functions, you could now (``p``
				206	is still sorted according to the last criteria) do::
				207
				208	p.print_callers(.5, 'init')
				209
				210	and you would get a list of callers for each of the listed functions.
				211
				212	If you want more functionality, you're going to have to read the manual, or
				213	guess what the following functions do::
				214
				215	p.print_callees()
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	216	p.add('restats')
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	217
				218	Invoked as a script, the :mod:`pstats` module is a statistics browser for
				219	reading and examining profile dumps. It has a simple line-oriented interface
				220	(implemented using :mod:`cmd`) and interactive help.
				221
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	222	:mod:`profile` and :mod:`cProfile` Module Reference
				223	=======================================================
				224
				225	.. module:: cProfile
				226	.. module:: profile
				227	:synopsis: Python source profiler.
				228
				229	Both the :mod:`profile` and :mod:`cProfile` modules provide the following
				230	functions:
				231
				232	.. function:: run(command, filename=None, sort=-1)
				233
				234	This function takes a single argument that can be passed to the :func:`exec`
				235	function, and an optional file name. In all cases this routine executes::
				236
				237	exec(command, __main__.__dict__, __main__.__dict__)
				238
				239	and gathers profiling statistics from the execution. If no file name is
				240	present, then this function automatically creates a :class:`~pstats.Stats`
				241	instance and prints a simple profiling report. If the sort value is specified
				242	it is passed to this :class:`~pstats.Stats` instance to control how the
				243	results are sorted.
				244
				245	.. function:: runctx(command, globals, locals, filename=None)
				246
				247	This function is similar to :func:`run`, with added arguments to supply the
				248	globals and locals dictionaries for the command string. This routine
				249	executes::
				250
				251	exec(command, globals, locals)
				252
				253	and gathers profiling statistics as in the :func:`run` function above.
				254
				255	.. class:: Profile(timer=None, timeunit=0.0, subcalls=True, builtins=True)
				256
				257	This class is normally only used if more precise control over profiling is
				258	needed than what the :func:`cProfile.run` function provides.
				259
				260	A custom timer can be supplied for measuring how long code takes to run via
				261	the timer argument. This must be a function that returns a single number
				262	representing the current time. If the number is an integer, the timeunit
				263	specifies a multiplier that specifies the duration of each unit of time. For
				264	example, if the timer returns times measured in thousands of seconds, the
				265	time unit would be ``.001``.
				266
				267	Directly using the :class:`Profile` class allows formatting profile results
				268	without writing the profile data to a file::
				269
				270	import cProfile, pstats, io
				271	pr = cProfile.Profile()
				272	pr.enable()
				273	... do something ...
				274	pr.disable()
				275	s = io.StringIO()
				276	ps = pstats.Stats(pr, stream=s)
				277	ps.print_results()
				278
				279	.. method:: enable()
				280
				281	Start collecting profiling data.
				282
				283	.. method:: disable()
				284
				285	Stop collecting profiling data.
				286
				287	.. method:: create_stats()
				288
				289	Stop collecting profiling data and record the results internally
				290	as the current profile.
				291
				292	.. method:: print_stats(sort=-1)
				293
				294	Create a :class:`~pstats.Stats` object based on the current
				295	profile and print the results to stdout.
				296
				297	.. method:: dump_stats(filename)
				298
				299	Write the results of the current profile to filename.
				300
				301	.. method:: run(cmd)
				302
				303	Profile the cmd via :func:`exec`.
				304
				305	.. method:: runctx(cmd, globals, locals)
				306
				307	Profile the cmd via :func:`exec` with the specified global and
				308	local environment.
				309
				310	.. method:: runcall(func, args, *kwargs)
				311
				312	Profile ``func(args, *kwargs)``
				313
				314	.. _profile-stats:
				315
				316	The :class:`Stats` Class
				317	========================
				318
				319	Analysis of the profiler data is done using the :class:`~pstats.Stats` class.
				320
				321	.. module:: pstats
				322	:synopsis: Statistics object for use with the profiler.
				323
				324	.. class:: Stats(*filenames or profile, stream=sys.stdout)
				325
				326	This class constructor creates an instance of a "statistics object" from a
				327	filename (or list of filenames) or from a :class:`Profile` instance. Output
				328	will be printed to the stream specified by stream.
				329
				330	The file selected by the above constructor must have been created by the
				331	corresponding version of :mod:`profile` or :mod:`cProfile`. To be specific,
				332	there is no file compatibility guaranteed with future versions of this
				333	profiler, and there is no compatibility with files produced by other
				334	profilers. If several files are provided, all the statistics for identical
				335	functions will be coalesced, so that an overall view of several processes can
				336	be considered in a single report. If additional files need to be combined
				337	with data in an existing :class:`~pstats.Stats` object, the
				338	:meth:`~pstats.Stats.add` method can be used.
				339
				340	Instead of reading the profile data from a file, a :class:`cProfile.Profile`
				341	or :class:`profile.Profile` object can be used as the profile data source.
				342
				343	:class:`Stats` objects have the following methods:
				344
				345	.. method:: strip_dirs()
				346
				347	This method for the :class:`Stats` class removes all leading path
				348	information from file names. It is very useful in reducing the size of
				349	the printout to fit within (close to) 80 columns. This method modifies
				350	the object, and the stripped information is lost. After performing a
				351	strip operation, the object is considered to have its entries in a
				352	"random" order, as it was just after object initialization and loading.
				353	If :meth:`~pstats.Stats.strip_dirs` causes two function names to be
				354	indistinguishable (they are on the same line of the same filename, and
				355	have the same function name), then the statistics for these two entries
				356	are accumulated into a single entry.
				357
				358
				359	.. method:: add(*filenames)
				360
				361	This method of the :class:`Stats` class accumulates additional profiling
				362	information into the current profiling object. Its arguments should refer
				363	to filenames created by the corresponding version of :func:`profile.run`
				364	or :func:`cProfile.run`. Statistics for identically named (re: file, line,
				365	name) functions are automatically accumulated into single function
				366	statistics.
				367
				368
				369	.. method:: dump_stats(filename)
				370
				371	Save the data loaded into the :class:`Stats` object to a file named
				372	filename. The file is created if it does not exist, and is overwritten
				373	if it already exists. This is equivalent to the method of the same name
				374	on the :class:`profile.Profile` and :class:`cProfile.Profile` classes.
				375
				376	.. versionadded:: 2.3
				377
				378
				379	.. method:: sort_stats(*keys)
				380
				381	This method modifies the :class:`Stats` object by sorting it according to
				382	the supplied criteria. The argument is typically a string identifying the
				383	basis of a sort (example: ``'time'`` or ``'name'``).
				384
				385	When more than one key is provided, then additional keys are used as
				386	secondary criteria when there is equality in all keys selected before
				387	them. For example, ``sort_stats('name', 'file')`` will sort all the
				388	entries according to their function name, and resolve all ties (identical
				389	function names) by sorting by file name.
				390
				391	Abbreviations can be used for any key names, as long as the abbreviation
				392	is unambiguous. The following are the keys currently defined:
				393
				394	+------------------+----------------------+
				395	\| Valid Arg \| Meaning \|
				396	+==================+======================+
				397	\| ``'calls'`` \| call count \|
				398	+------------------+----------------------+
				399	\| ``'cumulative'`` \| cumulative time \|
				400	+------------------+----------------------+
				401	\| ``'cumtime'`` \| cumulative time \|
				402	+------------------+----------------------+
				403	\| ``'file'`` \| file name \|
				404	+------------------+----------------------+
				405	\| ``'filename'`` \| file name \|
				406	+------------------+----------------------+
				407	\| ``'module'`` \| file name \|
				408	+------------------+----------------------+
				409	\| ``'ncalls'`` \| call count \|
				410	+------------------+----------------------+
				411	\| ``'pcalls'`` \| primitive call count \|
				412	+------------------+----------------------+
				413	\| ``'line'`` \| line number \|
				414	+------------------+----------------------+
				415	\| ``'name'`` \| function name \|
				416	+------------------+----------------------+
				417	\| ``'nfl'`` \| name/file/line \|
				418	+------------------+----------------------+
				419	\| ``'stdname'`` \| standard name \|
				420	+------------------+----------------------+
				421	\| ``'time'`` \| internal time \|
				422	+------------------+----------------------+
				423	\| ``'tottime'`` \| internal time \|
				424	+------------------+----------------------+
				425
				426	Note that all sorts on statistics are in descending order (placing most
				427	time consuming items first), where as name, file, and line number searches
				428	are in ascending order (alphabetical). The subtle distinction between
				429	``'nfl'`` and ``'stdname'`` is that the standard name is a sort of the
				430	name as printed, which means that the embedded line numbers get compared
				431	in an odd way. For example, lines 3, 20, and 40 would (if the file names
				432	were the same) appear in the string order 20, 3 and 40. In contrast,
				433	``'nfl'`` does a numeric compare of the line numbers. In fact,
				434	``sort_stats('nfl')`` is the same as ``sort_stats('name', 'file',
				435	'line')``.
				436
				437	For backward-compatibility reasons, the numeric arguments ``-1``, ``0``,
				438	``1``, and ``2`` are permitted. They are interpreted as ``'stdname'``,
				439	``'calls'``, ``'time'``, and ``'cumulative'`` respectively. If this old
				440	style format (numeric) is used, only one sort key (the numeric key) will
				441	be used, and additional arguments will be silently ignored.
				442
				443	.. For compatibility with the old profiler.
				444
				445
				446	.. method:: reverse_order()
				447
				448	This method for the :class:`Stats` class reverses the ordering of the
				449	basic list within the object. Note that by default ascending vs
				450	descending order is properly selected based on the sort key of choice.
				451
				452	.. This method is provided primarily for compatibility with the old
				453	profiler.
				454
				455
				456	.. method:: print_stats(*restrictions)
				457
				458	This method for the :class:`Stats` class prints out a report as described
				459	in the :func:`profile.run` definition.
				460
				461	The order of the printing is based on the last
				462	:meth:`~pstats.Stats.sort_stats` operation done on the object (subject to
				463	caveats in :meth:`~pstats.Stats.add` and
				464	:meth:`~pstats.Stats.strip_dirs`).
				465
				466	The arguments provided (if any) can be used to limit the list down to the
				467	significant entries. Initially, the list is taken to be the complete set
				468	of profiled functions. Each restriction is either an integer (to select a
				469	count of lines), or a decimal fraction between 0.0 and 1.0 inclusive (to
				470	select a percentage of lines), or a regular expression (to pattern match
				471	the standard name that is printed. If several restrictions are provided,
				472	then they are applied sequentially. For example::
				473
				474	print_stats(.1, 'foo:')
				475
				476	would first limit the printing to first 10% of list, and then only print
				477	functions that were part of filename :file:`.\*foo:`. In contrast, the
				478	command::
				479
				480	print_stats('foo:', .1)
				481
				482	would limit the list to all functions having file names :file:`.\*foo:`,
				483	and then proceed to only print the first 10% of them.
				484
				485
				486	.. method:: print_callers(*restrictions)
				487
				488	This method for the :class:`Stats` class prints a list of all functions
				489	that called each function in the profiled database. The ordering is
				490	identical to that provided by :meth:`~pstats.Stats.print_stats`, and the
				491	definition of the restricting argument is also identical. Each caller is
				492	reported on its own line. The format differs slightly depending on the
				493	profiler that produced the stats:
				494
				495	* With :mod:`profile`, a number is shown in parentheses after each caller
				496	to show how many times this specific call was made. For convenience, a
				497	second non-parenthesized number repeats the cumulative time spent in the
				498	function at the right.
				499
				500	* With :mod:`cProfile`, each caller is preceded by three numbers: the
				501	number of times this specific call was made, and the total and
				502	cumulative times spent in the current function while it was invoked by
				503	this specific caller.
				504
				505
				506	.. method:: print_callees(*restrictions)
				507
				508	This method for the :class:`Stats` class prints a list of all function
				509	that were called by the indicated function. Aside from this reversal of
				510	direction of calls (re: called vs was called by), the arguments and
				511	ordering are identical to the :meth:`~pstats.Stats.print_callers` method.
				512
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	513
				514	.. _deterministic-profiling:
				515
				516	What Is Deterministic Profiling?
				517	================================
				518
				519	:dfn:`Deterministic profiling` is meant to reflect the fact that all *function
				520	call, function return, and exception* events are monitored, and precise
				521	timings are made for the intervals between these events (during which time the
				522	user's code is executing). In contrast, :dfn:`statistical profiling` (which is
				523	not done by this module) randomly samples the effective instruction pointer, and
				524	deduces where time is being spent. The latter technique traditionally involves
				525	less overhead (as the code does not need to be instrumented), but provides only
				526	relative indications of where time is being spent.
				527
				528	In Python, since there is an interpreter active during execution, the presence
				529	of instrumented code is not required to do deterministic profiling. Python
				530	automatically provides a :dfn:`hook` (optional callback) for each event. In
				531	addition, the interpreted nature of Python tends to add so much overhead to
				532	execution, that deterministic profiling tends to only add small processing
				533	overhead in typical applications. The result is that deterministic profiling is
				534	not that expensive, yet provides extensive run time statistics about the
				535	execution of a Python program.
				536
				537	Call count statistics can be used to identify bugs in code (surprising counts),
				538	and to identify possible inline-expansion points (high call counts). Internal
				539	time statistics can be used to identify "hot loops" that should be carefully
				540	optimized. Cumulative time statistics should be used to identify high level
				541	errors in the selection of algorithms. Note that the unusual handling of
				542	cumulative times in this profiler allows statistics for recursive
				543	implementations of algorithms to be directly compared to iterative
				544	implementations.
				545
				546
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	547	.. _profile-limitations:
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	548
				549	Limitations
				550	===========
				551
				552	One limitation has to do with accuracy of timing information. There is a
				553	fundamental problem with deterministic profilers involving accuracy. The most
				554	obvious restriction is that the underlying "clock" is only ticking at a rate
				555	(typically) of about .001 seconds. Hence no measurements will be more accurate
				556	than the underlying clock. If enough measurements are taken, then the "error"
				557	will tend to average out. Unfortunately, removing this first error induces a
				558	second source of error.
				559
				560	The second problem is that it "takes a while" from when an event is dispatched
				561	until the profiler's call to get the time actually gets the state of the
				562	clock. Similarly, there is a certain lag when exiting the profiler event
				563	handler from the time that the clock's value was obtained (and then squirreled
				564	away), until the user's code is once again executing. As a result, functions
				565	that are called many times, or call many functions, will typically accumulate
				566	this error. The error that accumulates in this fashion is typically less than
				567	the accuracy of the clock (less than one clock tick), but it can accumulate
				568	and become very significant.
				569
				570	The problem is more important with :mod:`profile` than with the lower-overhead
				571	:mod:`cProfile`. For this reason, :mod:`profile` provides a means of
				572	calibrating itself for a given platform so that this error can be
				573	probabilistically (on the average) removed. After the profiler is calibrated, it
				574	will be more accurate (in a least square sense), but it will sometimes produce
				575	negative numbers (when call counts are exceptionally low, and the gods of
				576	probability work against you :-). ) Do not be alarmed by negative numbers in
				577	the profile. They should only appear if you have calibrated your profiler,
				578	and the results are actually better than without calibration.
				579
				580
				581	.. _profile-calibration:
				582
				583	Calibration
				584	===========
				585
				586	The profiler of the :mod:`profile` module subtracts a constant from each event
				587	handling time to compensate for the overhead of calling the time function, and
				588	socking away the results. By default, the constant is 0. The following
				589	procedure can be used to obtain a better constant for a given platform (see
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	590	:ref:`profile-limitations`). ::
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	591
				592	import profile
				593	pr = profile.Profile()
				594	for i in range(5):
				595	print pr.calibrate(10000)
				596
				597	The method executes the number of Python calls given by the argument, directly
				598	and again under the profiler, measuring the time for both. It then computes the
				599	hidden overhead per profiler event, and returns that as a float. For example,
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	600	on a 1.8Ghz Intel Core i5 running Mac OS X, and using Python's time.clock() as
				601	the timer, the magical number is about 4.04e-6.
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	602
				603	The object of this exercise is to get a fairly consistent result. If your
				604	computer is very fast, or your timer function has poor resolution, you might
				605	have to pass 100000, or even 1000000, to get consistent results.
				606
				607	When you have a consistent answer, there are three ways you can use it: [#]_ ::
				608
				609	import profile
				610
				611	# 1. Apply computed bias to all Profile instances created hereafter.
				612	profile.Profile.bias = your_computed_bias
				613
				614	# 2. Apply computed bias to a specific Profile instance.
				615	pr = profile.Profile()
				616	pr.bias = your_computed_bias
				617
				618	# 3. Specify computed bias in instance constructor.
				619	pr = profile.Profile(bias=your_computed_bias)
				620
				621	If you have a choice, you are better off choosing a smaller constant, and then
				622	your results will "less often" show up as negative in profile statistics.
				623
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	624	.. _profile-timers:
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	625
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	626	Using a customer timer
				627	======================
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	628
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	629	If you want to change how current time is determined (for example, to force use
				630	of wall-clock time or elapsed process time), pass the timing function you want
				631	to the :class:`Profile` class constructor::
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	632
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	633	pr = profile.Profile(your_time_func)
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	634
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	635	The resulting profiler will then call ``your_time_func``. Depending on whether
				636	you are using :class:`profile.Profile` or :class:`cProfile.Profile`,
				637	``your_time_func``'s return value will be interpreted differently:
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	638
				639	:class:`profile.Profile`
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	640	``your_time_func`` should return a single number, or a list of numbers whose
				641	sum is the current time (like what :func:`os.times` returns). If the
				642	function returns a single time number, or the list of returned numbers has
				643	length 2, then you will get an especially fast version of the dispatch
				644	routine.
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	645
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	646	Be warned that you should calibrate the profiler class for the timer function
				647	that you choose (see :ref:`profile-calibration`). For most machines, a timer
				648	that returns a lone integer value will provide the best results in terms of
				649	low overhead during profiling. (:func:`os.times` is pretty bad, as it
				650	returns a tuple of floating point values). If you want to substitute a
				651	better timer in the cleanest fashion, derive a class and hardwire a
				652	replacement dispatch method that best handles your timer call, along with the
				653	appropriate calibration constant.
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	654
				655	:class:`cProfile.Profile`
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	656	``your_time_func`` should return a single number. If it returns integers,
				657	you can also invoke the class constructor with a second argument specifying
				658	the real duration of one unit of time. For example, if
				659	``your_integer_time_func`` returns times measured in thousands of seconds,
				660	you would construct the :class:`Profile` instance as follows::
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	661
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	662	pr = cProfile.Profile(your_integer_time_func, 0.001)
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	663
Ezio Melotti	0ba584c	2013-04-12 16:22:24 +0300	[diff] [blame]	664	As the :mod:`cProfile.Profile` class cannot be calibrated, custom timer
				665	functions should be used with care and should be as fast as possible. For
				666	the best results with a custom timer, it might be necessary to hard-code it
				667	in the C source of the internal :mod:`_lsprof` module.
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	668
Georg Brandl	1014686	2012-07-01 09:40:16 +0200	[diff] [blame]	669
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	670	.. rubric:: Footnotes
				671
Georg Brandl	1014686	2012-07-01 09:40:16 +0200	[diff] [blame]	672	.. [#] Prior to Python 2.2, it was necessary to edit the profiler source code to
				673	embed the bias as a literal number. You still can, but that method is no longer
Georg Brandl	8ec7f65	2007-08-15 14:28:01 +0000	[diff] [blame]	674	described, because no longer needed.