blob: 7a52eb9ba434e2dddb6b6ba67d346fe7822f2129 [file] [log] [blame]
Georg Brandl8ec7f652007-08-15 14:28:01 +00001
2:mod:`textwrap` --- Text wrapping and filling
3=============================================
4
5.. module:: textwrap
6 :synopsis: Text wrapping and filling
7.. moduleauthor:: Greg Ward <gward@python.net>
8.. sectionauthor:: Greg Ward <gward@python.net>
9
10
11.. versionadded:: 2.3
12
13The :mod:`textwrap` module provides two convenience functions, :func:`wrap` and
14:func:`fill`, as well as :class:`TextWrapper`, the class that does all the work,
15and a utility function :func:`dedent`. If you're just wrapping or filling one
16or two text strings, the convenience functions should be good enough;
17otherwise, you should use an instance of :class:`TextWrapper` for efficiency.
18
19
20.. function:: wrap(text[, width[, ...]])
21
22 Wraps the single paragraph in *text* (a string) so every line is at most *width*
23 characters long. Returns a list of output lines, without final newlines.
24
25 Optional keyword arguments correspond to the instance attributes of
26 :class:`TextWrapper`, documented below. *width* defaults to ``70``.
27
28
29.. function:: fill(text[, width[, ...]])
30
31 Wraps the single paragraph in *text*, and returns a single string containing the
32 wrapped paragraph. :func:`fill` is shorthand for ::
33
34 "\n".join(wrap(text, ...))
35
36 In particular, :func:`fill` accepts exactly the same keyword arguments as
37 :func:`wrap`.
38
39Both :func:`wrap` and :func:`fill` work by creating a :class:`TextWrapper`
40instance and calling a single method on it. That instance is not reused, so for
41applications that wrap/fill many text strings, it will be more efficient for you
42to create your own :class:`TextWrapper` object.
43
44An additional utility function, :func:`dedent`, is provided to remove
45indentation from strings that have unwanted whitespace to the left of the text.
46
47
48.. function:: dedent(text)
49
50 Remove any common leading whitespace from every line in *text*.
51
52 This can be used to make triple-quoted strings line up with the left edge of the
53 display, while still presenting them in the source code in indented form.
54
55 Note that tabs and spaces are both treated as whitespace, but they are not
56 equal: the lines ``" hello"`` and ``"\thello"`` are considered to have no
57 common leading whitespace. (This behaviour is new in Python 2.5; older versions
58 of this module incorrectly expanded tabs before searching for common leading
59 whitespace.)
60
61 For example::
62
63 def test():
64 # end first line with \ to avoid the empty line!
65 s = '''\
66 hello
67 world
68 '''
69 print repr(s) # prints ' hello\n world\n '
70 print repr(dedent(s)) # prints 'hello\n world\n'
71
72
73.. class:: TextWrapper(...)
74
75 The :class:`TextWrapper` constructor accepts a number of optional keyword
76 arguments. Each argument corresponds to one instance attribute, so for example
77 ::
78
79 wrapper = TextWrapper(initial_indent="* ")
80
81 is the same as ::
82
83 wrapper = TextWrapper()
84 wrapper.initial_indent = "* "
85
86 You can re-use the same :class:`TextWrapper` object many times, and you can
87 change any of its options through direct assignment to instance attributes
88 between uses.
89
Benjamin Petersonc7b05922008-04-25 01:29:10 +000090 The :class:`TextWrapper` instance attributes (and keyword arguments to the
91 constructor) are as follows:
Georg Brandl8ec7f652007-08-15 14:28:01 +000092
93
Benjamin Petersonc7b05922008-04-25 01:29:10 +000094 .. attribute:: width
Georg Brandl8ec7f652007-08-15 14:28:01 +000095
Benjamin Petersonc7b05922008-04-25 01:29:10 +000096 (default: ``70``) The maximum length of wrapped lines. As long as there
97 are no individual words in the input text longer than :attr:`width`,
98 :class:`TextWrapper` guarantees that no output line will be longer than
99 :attr:`width` characters.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000100
101
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000102 .. attribute:: expand_tabs
Georg Brandl8ec7f652007-08-15 14:28:01 +0000103
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000104 (default: ``True``) If true, then all tab characters in *text* will be
105 expanded to spaces using the :meth:`expandtabs` method of *text*.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000106
107
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000108 .. attribute:: replace_whitespace
Georg Brandl8ec7f652007-08-15 14:28:01 +0000109
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000110 (default: ``True``) If true, each whitespace character (as defined by
111 ``string.whitespace``) remaining after tab expansion will be replaced by a
112 single space.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000113
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000114 .. note::
Georg Brandl8ec7f652007-08-15 14:28:01 +0000115
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000116 If :attr:`expand_tabs` is false and :attr:`replace_whitespace` is true,
117 each tab character will be replaced by a single space, which is *not*
118 the same as tab expansion.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000119
120
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000121 .. attribute:: drop_whitespace
Georg Brandl8ec7f652007-08-15 14:28:01 +0000122
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000123 (default: ``True``) If true, whitespace that, after wrapping, happens to
124 end up at the beginning or end of a line is dropped (leading whitespace in
125 the first line is always preserved, though).
Georg Brandl8ec7f652007-08-15 14:28:01 +0000126
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000127 .. versionadded:: 2.6
128 Whitespace was always dropped in earlier versions.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000129
130
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000131 .. attribute:: initial_indent
Georg Brandl8ec7f652007-08-15 14:28:01 +0000132
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000133 (default: ``''``) String that will be prepended to the first line of
134 wrapped output. Counts towards the length of the first line.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000135
136
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000137 .. attribute:: subsequent_indent
Georg Brandl8ec7f652007-08-15 14:28:01 +0000138
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000139 (default: ``''``) String that will be prepended to all lines of wrapped
140 output except the first. Counts towards the length of each line except
141 the first.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000142
143
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000144 .. attribute:: fix_sentence_endings
Georg Brandl8ec7f652007-08-15 14:28:01 +0000145
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000146 (default: ``False``) If true, :class:`TextWrapper` attempts to detect
147 sentence endings and ensure that sentences are always separated by exactly
148 two spaces. This is generally desired for text in a monospaced font.
149 However, the sentence detection algorithm is imperfect: it assumes that a
150 sentence ending consists of a lowercase letter followed by one of ``'.'``,
151 ``'!'``, or ``'?'``, possibly followed by one of ``'"'`` or ``"'"``,
152 followed by a space. One problem with this is algorithm is that it is
153 unable to detect the difference between "Dr." in ::
Georg Brandl8ec7f652007-08-15 14:28:01 +0000154
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000155 [...] Dr. Frankenstein's monster [...]
Georg Brandl8ec7f652007-08-15 14:28:01 +0000156
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000157 and "Spot." in ::
Georg Brandl8ec7f652007-08-15 14:28:01 +0000158
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000159 [...] See Spot. See Spot run [...]
Georg Brandl8ec7f652007-08-15 14:28:01 +0000160
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000161 :attr:`fix_sentence_endings` is false by default.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000162
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000163 Since the sentence detection algorithm relies on ``string.lowercase`` for
164 the definition of "lowercase letter," and a convention of using two spaces
165 after a period to separate sentences on the same line, it is specific to
166 English-language texts.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000167
168
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000169 .. attribute:: break_long_words
Georg Brandl8ec7f652007-08-15 14:28:01 +0000170
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000171 (default: ``True``) If true, then words longer than :attr:`width` will be
172 broken in order to ensure that no lines are longer than :attr:`width`. If
173 it is false, long words will not be broken, and some lines may be longer
174 than :attr:`width`. (Long words will be put on a line by themselves, in
175 order to minimize the amount by which :attr:`width` is exceeded.)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000176
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000177 :class:`TextWrapper` also provides two public methods, analogous to the
178 module-level convenience functions:
Georg Brandl8ec7f652007-08-15 14:28:01 +0000179
180
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000181 .. method:: wrap(text)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000182
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000183 Wraps the single paragraph in *text* (a string) so every line is at most
184 :attr:`width` characters long. All wrapping options are taken from
185 instance attributes of the :class:`TextWrapper` instance. Returns a list
186 of output lines, without final newlines.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000187
188
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000189 .. method:: fill(text)
Georg Brandl8ec7f652007-08-15 14:28:01 +0000190
Benjamin Petersonc7b05922008-04-25 01:29:10 +0000191 Wraps the single paragraph in *text*, and returns a single string
192 containing the wrapped paragraph.
Georg Brandl8ec7f652007-08-15 14:28:01 +0000193