blob: 81e11fa8ed6d705b483ea136e6dba41be9e0907c [file] [log] [blame]
Skip Montanaroe9e5dcd2000-07-19 17:19:49 +00001 Writing Python Regression Tests
2 -------------------------------
Skip Montanaro47c60ec2000-06-30 06:08:35 +00003 Skip Montanaro
Skip Montanaroe9e5dcd2000-07-19 17:19:49 +00004 (skip@mojam.com)
5
6
7Introduction
Skip Montanaro47c60ec2000-06-30 06:08:35 +00008
9If you add a new module to Python or modify the functionality of an existing
Skip Montanaroe9e5dcd2000-07-19 17:19:49 +000010module, you should write one or more test cases to exercise that new
11functionality. The mechanics of how the test system operates are fairly
12straightforward. When a test case is run, the output is compared with the
13expected output that is stored in .../Lib/test/output. If the test runs to
14completion and the actual and expected outputs match, the test succeeds, if
Thomas Woutersb9fa0a82000-08-04 13:34:43 +000015not, it fails. If an ImportError or test_support.TestSkipped error is
16raised, the test is not run.
Skip Montanaro47c60ec2000-06-30 06:08:35 +000017
Skip Montanaroe9e5dcd2000-07-19 17:19:49 +000018You will be writing unit tests (isolated tests of functions and objects
19defined by the module) using white box techniques. Unlike black box
20testing, where you only have the external interfaces to guide your test case
21writing, in white box testing you can see the code being tested and tailor
22your test cases to exercise it more completely. In particular, you will be
23able to refer to the C and Python code in the CVS repository when writing
24your regression test cases.
Skip Montanaro47c60ec2000-06-30 06:08:35 +000025
Skip Montanaroe9e5dcd2000-07-19 17:19:49 +000026
27Executing Test Cases
28
29If you are writing test cases for module spam, you need to create a file
30in .../Lib/test named test_spam.py and an expected output file in
31.../Lib/test/output named test_spam ("..." represents the top-level
32directory in the Python source tree, the directory containing the configure
33script). From the top-level directory, generate the initial version of the
34test output file by executing:
35
36 ./python Lib/test/regrtest.py -g test_spam.py
37
38Any time you modify test_spam.py you need to generate a new expected
Skip Montanaro47c60ec2000-06-30 06:08:35 +000039output file. Don't forget to desk check the generated output to make sure
40it's really what you expected to find! To run a single test after modifying
41a module, simply run regrtest.py without the -g flag:
42
Skip Montanaroe9e5dcd2000-07-19 17:19:49 +000043 ./python Lib/test/regrtest.py test_spam.py
44
45While debugging a regression test, you can of course execute it
46independently of the regression testing framework and see what it prints:
47
48 ./python Lib/test/test_spam.py
Skip Montanaro47c60ec2000-06-30 06:08:35 +000049
50To run the entire test suite, make the "test" target at the top level:
51
Skip Montanaro47c60ec2000-06-30 06:08:35 +000052 make test
53
Skip Montanaroe9e5dcd2000-07-19 17:19:49 +000054On non-Unix platforms where make may not be available, you can simply
55execute the two runs of regrtest (optimized and non-optimized) directly:
56
57 ./python Lib/test/regrtest.py
58 ./python -O Lib/test/regrtest.py
59
60
61Test cases generate output based upon values computed by the test code.
62When executed, regrtest.py compares the actual output generated by executing
63the test case with the expected output and reports success or failure. It
64stands to reason that if the actual and expected outputs are to match, they
65must not contain any machine dependencies. This means your test cases
66should not print out absolute machine addresses (e.g. the return value of
67the id() builtin function) or floating point numbers with large numbers of
68significant digits (unless you understand what you are doing!).
69
70
71Test Case Writing Tips
Skip Montanaro47c60ec2000-06-30 06:08:35 +000072
73Writing good test cases is a skilled task and is too complex to discuss in
74detail in this short document. Many books have been written on the subject.
75I'll show my age by suggesting that Glenford Myers' "The Art of Software
76Testing", published in 1979, is still the best introduction to the subject
77available. It is short (177 pages), easy to read, and discusses the major
78elements of software testing, though its publication predates the
79object-oriented software revolution, so doesn't cover that subject at all.
80Unfortunately, it is very expensive (about $100 new). If you can borrow it
81or find it used (around $20), I strongly urge you to pick up a copy.
82
Skip Montanaro47c60ec2000-06-30 06:08:35 +000083The most important goal when writing test cases is to break things. A test
Skip Montanaroe9e5dcd2000-07-19 17:19:49 +000084case that doesn't uncover a bug is much less valuable than one that does.
85In designing test cases you should pay attention to the following:
Skip Montanaro47c60ec2000-06-30 06:08:35 +000086
Skip Montanaroe9e5dcd2000-07-19 17:19:49 +000087 * Your test cases should exercise all the functions and objects defined
88 in the module, not just the ones meant to be called by users of your
89 module. This may require you to write test code that uses the module
90 in ways you don't expect (explicitly calling internal functions, for
91 example - see test_atexit.py).
Skip Montanaro47c60ec2000-06-30 06:08:35 +000092
Skip Montanaroe9e5dcd2000-07-19 17:19:49 +000093 * You should consider any boundary values that may tickle exceptional
94 conditions (e.g. if you were writing regression tests for division,
95 you might well want to generate tests with numerators and denominators
96 at the limits of floating point and integer numbers on the machine
97 performing the tests as well as a denominator of zero).
Skip Montanaro47c60ec2000-06-30 06:08:35 +000098
Skip Montanaroe9e5dcd2000-07-19 17:19:49 +000099 * You should exercise as many paths through the code as possible. This
100 may not always be possible, but is a goal to strive for. In
101 particular, when considering if statements (or their equivalent), you
102 want to create test cases that exercise both the true and false
103 branches. For loops, you should create test cases that exercise the
104 loop zero, one and multiple times.
105
106 * You should test with obviously invalid input. If you know that a
107 function requires an integer input, try calling it with other types of
108 objects to see how it responds.
109
110 * You should test with obviously out-of-range input. If the domain of a
111 function is only defined for positive integers, try calling it with a
112 negative integer.
113
114 * If you are going to fix a bug that wasn't uncovered by an existing
115 test, try to write a test case that exposes the bug (preferably before
116 fixing it).
117
118
119Regression Test Writing Rules
120
121Each test case is different. There is no "standard" form for a Python
122regression test case, though there are some general rules:
123
124 * If your test case detects a failure, raise TestFailed (found in
125 test_support).
126
127 * Import everything you'll need as early as possible.
128
129 * If you'll be importing objects from a module that is at least
130 partially platform-dependent, only import those objects you need for
131 the current test case to avoid spurious ImportError exceptions that
132 prevent the test from running to completion.
133
134 * Print all your test case results using the print statement. For
135 non-fatal errors, print an error message (or omit a successful
136 completion print) to indicate the failure, but proceed instead of
137 raising TestFailed.
138
Tim Petersa48b5262000-08-23 05:28:45 +0000139 * Use "assert" sparingly, if at all. It's usually better to just print
140 what you got, and rely on regrtest's got-vs-expected comparison to
141 catch deviations from what you expect. assert statements aren't
142 executed at all when regrtest is run in -O mode; and, because they
143 cause the test to stop immediately, can lead to a long & tedious
144 test-fix, test-fix, test-fix, ... cycle when things are badly broken
145 (and note that "badly broken" often includes running the test suite
146 for the first time on new platforms or under new implementations of
147 the language).
148
Skip Montanaroe9e5dcd2000-07-19 17:19:49 +0000149
150Miscellaneous
151
152There is a test_support module you can import from your test case. It
153provides the following useful objects:
154
155 * TestFailed - raise this exception when your regression test detects a
156 failure.
157
Fred Drake62c53dd2000-08-21 16:55:57 +0000158 * TestSkipped - raise this if the test could not be run because the
159 platform doesn't offer all the required facilities (like large
160 file support), even if all the required modules are available.
161
Skip Montanaroe9e5dcd2000-07-19 17:19:49 +0000162 * findfile(file) - you can call this function to locate a file somewhere
163 along sys.path or in the Lib/test tree - see test_linuxaudiodev.py for
164 an example of its use.
165
166 * verbose - you can use this variable to control print output. Many
167 modules use it. Search for "verbose" in the test_*.py files to see
168 lots of examples.
169
Tim Petersa48b5262000-08-23 05:28:45 +0000170 * use_large_resources - true iff tests requiring large time or space
171 should be run.
172
Skip Montanaroe9e5dcd2000-07-19 17:19:49 +0000173 * fcmp(x,y) - you can call this function to compare two floating point
174 numbers when you expect them to only be approximately equal withing a
175 fuzz factor (test_support.FUZZ, which defaults to 1e-6).
176
Tim Petersa48b5262000-08-23 05:28:45 +0000177NOTE: Always import something from test_support like so:
178
179 from test_support import verbose
180
181or like so:
182
183 import test_support
184 ... use test_support.verbose in the code ...
185
186Never import anything from test_support like this:
187
188 from test.test_support import verbose
189
190"test" is a package already, so can refer to modules it contains without
191"test." qualification. If you do an explicit "test.xxx" qualification, that
192can fool Python into believing test.xxx is a module distinct from the xxx
193in the current package, and you can end up importing two distinct copies of
194xxx. This is especially bad if xxx=test_support, as regrtest.py can (and
195routinely does) overwrite its "verbose" and "use_large_resources"
196attributes: if you get a second copy of test_support loaded, it may not
197have the same values for those as regrtest intended.
198
199
Skip Montanaroe9e5dcd2000-07-19 17:19:49 +0000200Python and C statement coverage results are currently available at
201
202 http://www.musi-cal.com/~skip/python/Python/dist/src/
203
204As of this writing (July, 2000) these results are being generated nightly.
205You can refer to the summaries and the test coverage output files to see
206where coverage is adequate or lacking and write test cases to beef up the
207coverage.