blob: ef36f50235f6d31aee95a473474392bc3e89f076 [file] [log] [blame]
Georg Brandl8509db52008-06-10 07:45:28 +00001.. _ast:
2
3Abstract Syntax Trees
4=====================
5
6.. module:: ast
7 :synopsis: Abstract Syntax Tree classes and manipulation.
8
9.. sectionauthor:: Martin v. Lรถwis <martin@v.loewis.de>
10.. sectionauthor:: Georg Brandl <georg@python.org>
11
12.. versionadded:: 2.5
13 The low-level ``_ast`` module containing only the node classes.
14
15.. versionadded:: 2.6
16 The high-level ``ast`` module containing all helpers.
17
18
19The :mod:`ast` module helps Python applications to process trees of the Python
20abstract syntax grammar. The abstract syntax itself might change with each
21Python release; this module helps to find out programmatically what the current
22grammar looks like.
23
Benjamin Petersond923f982008-11-08 16:55:33 +000024An abstract syntax tree can be generated by passing :data:`ast.PyCF_ONLY_AST` as
Georg Brandld7d4fd72009-07-26 14:37:28 +000025a flag to the :func:`compile` built-in function, or using the :func:`parse`
Georg Brandl8509db52008-06-10 07:45:28 +000026helper provided in this module. The result will be a tree of objects whose
Benjamin Petersond923f982008-11-08 16:55:33 +000027classes all inherit from :class:`ast.AST`. An abstract syntax tree can be
28compiled into a Python code object using the built-in :func:`compile` function.
Georg Brandl8509db52008-06-10 07:45:28 +000029
Georg Brandl8509db52008-06-10 07:45:28 +000030
Raymond Hettingere679a372010-11-05 23:58:42 +000031.. seealso::
32
33 Latest version of the `ast module Python source code
34 <http://svn.python.org/view/python/branches/release27-maint/Lib/ast.py?view=markup>`_
35
Georg Brandl8509db52008-06-10 07:45:28 +000036Node classes
37------------
38
39.. class:: AST
40
41 This is the base of all AST node classes. The actual node classes are
42 derived from the :file:`Parser/Python.asdl` file, which is reproduced
43 :ref:`below <abstract-grammar>`. They are defined in the :mod:`_ast` C
44 module and re-exported in :mod:`ast`.
45
46 There is one class defined for each left-hand side symbol in the abstract
47 grammar (for example, :class:`ast.stmt` or :class:`ast.expr`). In addition,
48 there is one class defined for each constructor on the right-hand side; these
49 classes inherit from the classes for the left-hand side trees. For example,
50 :class:`ast.BinOp` inherits from :class:`ast.expr`. For production rules
51 with alternatives (aka "sums"), the left-hand side class is abstract: only
52 instances of specific constructor nodes are ever created.
53
54 .. attribute:: _fields
55
56 Each concrete class has an attribute :attr:`_fields` which gives the names
57 of all child nodes.
58
59 Each instance of a concrete class has one attribute for each child node,
60 of the type as defined in the grammar. For example, :class:`ast.BinOp`
61 instances have an attribute :attr:`left` of type :class:`ast.expr`.
62
63 If these attributes are marked as optional in the grammar (using a
64 question mark), the value might be ``None``. If the attributes can have
65 zero-or-more values (marked with an asterisk), the values are represented
66 as Python lists. All possible attributes must be present and have valid
67 values when compiling an AST with :func:`compile`.
68
69 .. attribute:: lineno
70 col_offset
71
72 Instances of :class:`ast.expr` and :class:`ast.stmt` subclasses have
73 :attr:`lineno` and :attr:`col_offset` attributes. The :attr:`lineno` is
74 the line number of source text (1-indexed so the first line is line 1) and
75 the :attr:`col_offset` is the UTF-8 byte offset of the first token that
76 generated the node. The UTF-8 offset is recorded because the parser uses
77 UTF-8 internally.
78
79 The constructor of a class :class:`ast.T` parses its arguments as follows:
80
81 * If there are positional arguments, there must be as many as there are items
82 in :attr:`T._fields`; they will be assigned as attributes of these names.
83 * If there are keyword arguments, they will set the attributes of the same
84 names to the given values.
85
86 For example, to create and populate an :class:`ast.UnaryOp` node, you could
87 use ::
88
89 node = ast.UnaryOp()
90 node.op = ast.USub()
91 node.operand = ast.Num()
92 node.operand.n = 5
93 node.operand.lineno = 0
94 node.operand.col_offset = 0
95 node.lineno = 0
96 node.col_offset = 0
97
98 or the more compact ::
99
100 node = ast.UnaryOp(ast.USub(), ast.Num(5, lineno=0, col_offset=0),
101 lineno=0, col_offset=0)
102
Armin Ronacher482f3122008-06-10 20:52:19 +0000103 .. versionadded:: 2.6
104 The constructor as explained above was added. In Python 2.5 nodes had
105 to be created by calling the class constructor without arguments and
106 setting the attributes afterwards.
107
Georg Brandl8509db52008-06-10 07:45:28 +0000108
109.. _abstract-grammar:
110
111Abstract Grammar
112----------------
113
114The module defines a string constant ``__version__`` which is the decimal
115Subversion revision number of the file shown below.
116
117The abstract grammar is currently defined as follows:
118
119.. literalinclude:: ../../Parser/Python.asdl
120
121
122:mod:`ast` Helpers
123------------------
124
125.. versionadded:: 2.6
126
127Apart from the node classes, :mod:`ast` module defines these utility functions
128and classes for traversing abstract syntax trees:
129
Terry Reedy51581de2011-01-24 21:48:11 +0000130.. function:: parse(source, filename='<unknown>', mode='exec')
Georg Brandl8509db52008-06-10 07:45:28 +0000131
Terry Reedy51581de2011-01-24 21:48:11 +0000132 Parse the source into an AST node. Equivalent to ``compile(source,
Benjamin Petersond923f982008-11-08 16:55:33 +0000133 filename, mode, ast.PyCF_ONLY_AST)``.
Georg Brandl8509db52008-06-10 07:45:28 +0000134
Georg Brandlc62ef8b2009-01-03 20:55:06 +0000135
Georg Brandl8509db52008-06-10 07:45:28 +0000136.. function:: literal_eval(node_or_string)
137
138 Safely evaluate an expression node or a string containing a Python
139 expression. The string or node provided may only consist of the following
140 Python literal structures: strings, numbers, tuples, lists, dicts, booleans,
141 and ``None``.
142
143 This can be used for safely evaluating strings containing Python expressions
144 from untrusted sources without the need to parse the values oneself.
145
146
Armin Ronacher3079be52008-06-10 20:37:02 +0000147.. function:: get_docstring(node, clean=True)
Georg Brandl8509db52008-06-10 07:45:28 +0000148
149 Return the docstring of the given *node* (which must be a
150 :class:`FunctionDef`, :class:`ClassDef` or :class:`Module` node), or ``None``
151 if it has no docstring. If *clean* is true, clean up the docstring's
152 indentation with :func:`inspect.cleandoc`.
153
154
155.. function:: fix_missing_locations(node)
156
157 When you compile a node tree with :func:`compile`, the compiler expects
158 :attr:`lineno` and :attr:`col_offset` attributes for every node that supports
159 them. This is rather tedious to fill in for generated nodes, so this helper
160 adds these attributes recursively where not already set, by setting them to
161 the values of the parent node. It works recursively starting at *node*.
162
163
164.. function:: increment_lineno(node, n=1)
165
166 Increment the line number of each node in the tree starting at *node* by *n*.
167 This is useful to "move code" to a different location in a file.
168
169
170.. function:: copy_location(new_node, old_node)
171
172 Copy source location (:attr:`lineno` and :attr:`col_offset`) from *old_node*
173 to *new_node* if possible, and return *new_node*.
174
175
176.. function:: iter_fields(node)
177
178 Yield a tuple of ``(fieldname, value)`` for each field in ``node._fields``
179 that is present on *node*.
180
181
182.. function:: iter_child_nodes(node)
183
184 Yield all direct child nodes of *node*, that is, all fields that are nodes
185 and all items of fields that are lists of nodes.
186
187
188.. function:: walk(node)
189
Georg Brandl6adb9792011-01-09 07:53:14 +0000190 Recursively yield all descendant nodes in the tree starting at *node*
191 (including *node* itself), in no specified order. This is useful if you only
192 want to modify nodes in place and don't care about the context.
Georg Brandl8509db52008-06-10 07:45:28 +0000193
194
195.. class:: NodeVisitor()
196
197 A node visitor base class that walks the abstract syntax tree and calls a
198 visitor function for every node found. This function may return a value
Georg Brandle92818f2009-01-03 20:47:01 +0000199 which is forwarded by the :meth:`visit` method.
Georg Brandl8509db52008-06-10 07:45:28 +0000200
201 This class is meant to be subclassed, with the subclass adding visitor
202 methods.
203
204 .. method:: visit(node)
205
206 Visit a node. The default implementation calls the method called
207 :samp:`self.visit_{classname}` where *classname* is the name of the node
208 class, or :meth:`generic_visit` if that method doesn't exist.
209
210 .. method:: generic_visit(node)
211
212 This visitor calls :meth:`visit` on all children of the node.
Georg Brandlc62ef8b2009-01-03 20:55:06 +0000213
Georg Brandl8509db52008-06-10 07:45:28 +0000214 Note that child nodes of nodes that have a custom visitor method won't be
215 visited unless the visitor calls :meth:`generic_visit` or visits them
216 itself.
217
218 Don't use the :class:`NodeVisitor` if you want to apply changes to nodes
219 during traversal. For this a special visitor exists
220 (:class:`NodeTransformer`) that allows modifications.
221
222
223.. class:: NodeTransformer()
224
225 A :class:`NodeVisitor` subclass that walks the abstract syntax tree and
226 allows modification of nodes.
227
Georg Brandle92818f2009-01-03 20:47:01 +0000228 The :class:`NodeTransformer` will walk the AST and use the return value of
229 the visitor methods to replace or remove the old node. If the return value
230 of the visitor method is ``None``, the node will be removed from its
231 location, otherwise it is replaced with the return value. The return value
232 may be the original node in which case no replacement takes place.
Georg Brandl8509db52008-06-10 07:45:28 +0000233
234 Here is an example transformer that rewrites all occurrences of name lookups
235 (``foo``) to ``data['foo']``::
236
237 class RewriteName(NodeTransformer):
238
239 def visit_Name(self, node):
240 return copy_location(Subscript(
241 value=Name(id='data', ctx=Load()),
242 slice=Index(value=Str(s=node.id)),
243 ctx=node.ctx
244 ), node)
245
246 Keep in mind that if the node you're operating on has child nodes you must
247 either transform the child nodes yourself or call the :meth:`generic_visit`
248 method for the node first.
249
250 For nodes that were part of a collection of statements (that applies to all
251 statement nodes), the visitor may also return a list of nodes rather than
252 just a single node.
253
254 Usually you use the transformer like this::
255
256 node = YourTransformer().visit(node)
257
258
259.. function:: dump(node, annotate_fields=True, include_attributes=False)
260
261 Return a formatted dump of the tree in *node*. This is mainly useful for
262 debugging purposes. The returned string will show the names and the values
263 for fields. This makes the code impossible to evaluate, so if evaluation is
264 wanted *annotate_fields* must be set to False. Attributes such as line
Georg Brandl924b8232008-06-21 14:26:19 +0000265 numbers and column offsets are not dumped by default. If this is wanted,
Georg Brandl8509db52008-06-10 07:45:28 +0000266 *include_attributes* can be set to ``True``.