blob: 2883f3c6739f16903ef31294b6370f411d4d299b [file] [log] [blame]
Alexander Belopolskyf0a0d142010-10-27 03:06:43 +00001:mod:`ast` --- Abstract Syntax Trees
2====================================
Georg Brandl0c77a822008-06-10 16:37:50 +00003
4.. module:: ast
5 :synopsis: Abstract Syntax Tree classes and manipulation.
6
7.. sectionauthor:: Martin v. Lรถwis <martin@v.loewis.de>
8.. sectionauthor:: Georg Brandl <georg@python.org>
9
Raymond Hettinger10480942011-01-10 03:26:08 +000010**Source code:** :source:`Lib/ast.py`
Georg Brandl0c77a822008-06-10 16:37:50 +000011
Raymond Hettinger4f707fd2011-01-10 19:54:11 +000012--------------
13
Georg Brandl0c77a822008-06-10 16:37:50 +000014The :mod:`ast` module helps Python applications to process trees of the Python
15abstract syntax grammar. The abstract syntax itself might change with each
16Python release; this module helps to find out programmatically what the current
17grammar looks like.
18
Benjamin Petersonec9199b2008-11-08 17:05:00 +000019An abstract syntax tree can be generated by passing :data:`ast.PyCF_ONLY_AST` as
Georg Brandl22b34312009-07-26 14:54:51 +000020a flag to the :func:`compile` built-in function, or using the :func:`parse`
Georg Brandl0c77a822008-06-10 16:37:50 +000021helper provided in this module. The result will be a tree of objects whose
Benjamin Petersonec9199b2008-11-08 17:05:00 +000022classes all inherit from :class:`ast.AST`. An abstract syntax tree can be
23compiled into a Python code object using the built-in :func:`compile` function.
Georg Brandl0c77a822008-06-10 16:37:50 +000024
Georg Brandl0c77a822008-06-10 16:37:50 +000025
26Node classes
27------------
28
29.. class:: AST
30
31 This is the base of all AST node classes. The actual node classes are
32 derived from the :file:`Parser/Python.asdl` file, which is reproduced
33 :ref:`below <abstract-grammar>`. They are defined in the :mod:`_ast` C
34 module and re-exported in :mod:`ast`.
35
36 There is one class defined for each left-hand side symbol in the abstract
37 grammar (for example, :class:`ast.stmt` or :class:`ast.expr`). In addition,
38 there is one class defined for each constructor on the right-hand side; these
39 classes inherit from the classes for the left-hand side trees. For example,
40 :class:`ast.BinOp` inherits from :class:`ast.expr`. For production rules
41 with alternatives (aka "sums"), the left-hand side class is abstract: only
42 instances of specific constructor nodes are ever created.
43
Serhiy Storchaka913876d2018-10-28 13:41:26 +020044 .. index:: single: ? (question mark); in AST grammar
45 .. index:: single: * (asterisk); in AST grammar
46
Georg Brandl0c77a822008-06-10 16:37:50 +000047 .. attribute:: _fields
48
49 Each concrete class has an attribute :attr:`_fields` which gives the names
50 of all child nodes.
51
52 Each instance of a concrete class has one attribute for each child node,
53 of the type as defined in the grammar. For example, :class:`ast.BinOp`
54 instances have an attribute :attr:`left` of type :class:`ast.expr`.
55
56 If these attributes are marked as optional in the grammar (using a
57 question mark), the value might be ``None``. If the attributes can have
58 zero-or-more values (marked with an asterisk), the values are represented
59 as Python lists. All possible attributes must be present and have valid
60 values when compiling an AST with :func:`compile`.
61
62 .. attribute:: lineno
63 col_offset
64
65 Instances of :class:`ast.expr` and :class:`ast.stmt` subclasses have
66 :attr:`lineno` and :attr:`col_offset` attributes. The :attr:`lineno` is
67 the line number of source text (1-indexed so the first line is line 1) and
68 the :attr:`col_offset` is the UTF-8 byte offset of the first token that
69 generated the node. The UTF-8 offset is recorded because the parser uses
70 UTF-8 internally.
71
72 The constructor of a class :class:`ast.T` parses its arguments as follows:
73
74 * If there are positional arguments, there must be as many as there are items
75 in :attr:`T._fields`; they will be assigned as attributes of these names.
76 * If there are keyword arguments, they will set the attributes of the same
77 names to the given values.
78
79 For example, to create and populate an :class:`ast.UnaryOp` node, you could
80 use ::
81
82 node = ast.UnaryOp()
83 node.op = ast.USub()
Serhiy Storchaka3f228112018-09-27 17:42:37 +030084 node.operand = ast.Constant()
85 node.operand.value = 5
Georg Brandl0c77a822008-06-10 16:37:50 +000086 node.operand.lineno = 0
87 node.operand.col_offset = 0
88 node.lineno = 0
89 node.col_offset = 0
90
91 or the more compact ::
92
Serhiy Storchaka3f228112018-09-27 17:42:37 +030093 node = ast.UnaryOp(ast.USub(), ast.Constant(5, lineno=0, col_offset=0),
Georg Brandl0c77a822008-06-10 16:37:50 +000094 lineno=0, col_offset=0)
95
Serhiy Storchaka3f228112018-09-27 17:42:37 +030096.. deprecated:: 3.8
97
98 Class :class:`ast.Constant` is now used for all constants. Old classes
99 :class:`ast.Num`, :class:`ast.Str`, :class:`ast.Bytes`,
100 :class:`ast.NameConstant` and :class:`ast.Ellipsis` are still available,
101 but they will be removed in future Python releases.
102
Georg Brandl0c77a822008-06-10 16:37:50 +0000103
104.. _abstract-grammar:
105
106Abstract Grammar
107----------------
108
Georg Brandl0c77a822008-06-10 16:37:50 +0000109The abstract grammar is currently defined as follows:
110
111.. literalinclude:: ../../Parser/Python.asdl
Martin Panter1050d2d2016-07-26 11:18:21 +0200112 :language: none
Georg Brandl0c77a822008-06-10 16:37:50 +0000113
114
115:mod:`ast` Helpers
116------------------
117
Martin Panter2e4571a2015-11-14 01:07:43 +0000118Apart from the node classes, the :mod:`ast` module defines these utility functions
Georg Brandl0c77a822008-06-10 16:37:50 +0000119and classes for traversing abstract syntax trees:
120
Terry Reedyfeac6242011-01-24 21:36:03 +0000121.. function:: parse(source, filename='<unknown>', mode='exec')
Georg Brandl0c77a822008-06-10 16:37:50 +0000122
Terry Reedyfeac6242011-01-24 21:36:03 +0000123 Parse the source into an AST node. Equivalent to ``compile(source,
Benjamin Petersonec9199b2008-11-08 17:05:00 +0000124 filename, mode, ast.PyCF_ONLY_AST)``.
Georg Brandl0c77a822008-06-10 16:37:50 +0000125
Brett Cannon7a7f1002018-03-09 12:03:22 -0800126 .. warning::
127 It is possible to crash the Python interpreter with a
128 sufficiently large/complex string due to stack depth limitations
129 in Python's AST compiler.
130
Georg Brandl48310cd2009-01-03 21:18:54 +0000131
Georg Brandl0c77a822008-06-10 16:37:50 +0000132.. function:: literal_eval(node_or_string)
133
Georg Brandlb9b389e2014-11-05 20:20:28 +0100134 Safely evaluate an expression node or a string containing a Python literal or
135 container display. The string or node provided may only consist of the
136 following Python literal structures: strings, bytes, numbers, tuples, lists,
137 dicts, sets, booleans, and ``None``.
Georg Brandl0c77a822008-06-10 16:37:50 +0000138
Georg Brandlb9b389e2014-11-05 20:20:28 +0100139 This can be used for safely evaluating strings containing Python values from
140 untrusted sources without the need to parse the values oneself. It is not
141 capable of evaluating arbitrarily complex expressions, for example involving
142 operators or indexing.
Georg Brandl0c77a822008-06-10 16:37:50 +0000143
Brett Cannon7a7f1002018-03-09 12:03:22 -0800144 .. warning::
145 It is possible to crash the Python interpreter with a
146 sufficiently large/complex string due to stack depth limitations
147 in Python's AST compiler.
148
Georg Brandl492f3fc2010-07-11 09:41:21 +0000149 .. versionchanged:: 3.2
Georg Brandl85f21772010-07-13 06:38:10 +0000150 Now allows bytes and set literals.
Georg Brandl492f3fc2010-07-11 09:41:21 +0000151
Georg Brandl0c77a822008-06-10 16:37:50 +0000152
Amaury Forgeot d'Arcfdfe62d2008-06-17 20:36:03 +0000153.. function:: get_docstring(node, clean=True)
Georg Brandl0c77a822008-06-10 16:37:50 +0000154
155 Return the docstring of the given *node* (which must be a
INADA Naokicb41b272017-02-23 00:31:59 +0900156 :class:`FunctionDef`, :class:`AsyncFunctionDef`, :class:`ClassDef`,
157 or :class:`Module` node), or ``None`` if it has no docstring.
158 If *clean* is true, clean up the docstring's indentation with
159 :func:`inspect.cleandoc`.
160
161 .. versionchanged:: 3.5
162 :class:`AsyncFunctionDef` is now supported.
163
Georg Brandl0c77a822008-06-10 16:37:50 +0000164
165.. function:: fix_missing_locations(node)
166
167 When you compile a node tree with :func:`compile`, the compiler expects
168 :attr:`lineno` and :attr:`col_offset` attributes for every node that supports
169 them. This is rather tedious to fill in for generated nodes, so this helper
170 adds these attributes recursively where not already set, by setting them to
171 the values of the parent node. It works recursively starting at *node*.
172
173
174.. function:: increment_lineno(node, n=1)
175
176 Increment the line number of each node in the tree starting at *node* by *n*.
177 This is useful to "move code" to a different location in a file.
178
179
180.. function:: copy_location(new_node, old_node)
181
182 Copy source location (:attr:`lineno` and :attr:`col_offset`) from *old_node*
183 to *new_node* if possible, and return *new_node*.
184
185
186.. function:: iter_fields(node)
187
188 Yield a tuple of ``(fieldname, value)`` for each field in ``node._fields``
189 that is present on *node*.
190
191
192.. function:: iter_child_nodes(node)
193
194 Yield all direct child nodes of *node*, that is, all fields that are nodes
195 and all items of fields that are lists of nodes.
196
197
198.. function:: walk(node)
199
Georg Brandl619e7ba2011-01-09 07:38:51 +0000200 Recursively yield all descendant nodes in the tree starting at *node*
201 (including *node* itself), in no specified order. This is useful if you only
202 want to modify nodes in place and don't care about the context.
Georg Brandl0c77a822008-06-10 16:37:50 +0000203
204
205.. class:: NodeVisitor()
206
207 A node visitor base class that walks the abstract syntax tree and calls a
208 visitor function for every node found. This function may return a value
Georg Brandl36ab1ef2009-01-03 21:17:04 +0000209 which is forwarded by the :meth:`visit` method.
Georg Brandl0c77a822008-06-10 16:37:50 +0000210
211 This class is meant to be subclassed, with the subclass adding visitor
212 methods.
213
214 .. method:: visit(node)
215
216 Visit a node. The default implementation calls the method called
217 :samp:`self.visit_{classname}` where *classname* is the name of the node
218 class, or :meth:`generic_visit` if that method doesn't exist.
219
220 .. method:: generic_visit(node)
221
222 This visitor calls :meth:`visit` on all children of the node.
Georg Brandl48310cd2009-01-03 21:18:54 +0000223
Georg Brandl0c77a822008-06-10 16:37:50 +0000224 Note that child nodes of nodes that have a custom visitor method won't be
225 visited unless the visitor calls :meth:`generic_visit` or visits them
226 itself.
227
228 Don't use the :class:`NodeVisitor` if you want to apply changes to nodes
229 during traversal. For this a special visitor exists
230 (:class:`NodeTransformer`) that allows modifications.
231
232
233.. class:: NodeTransformer()
234
235 A :class:`NodeVisitor` subclass that walks the abstract syntax tree and
236 allows modification of nodes.
237
Georg Brandl36ab1ef2009-01-03 21:17:04 +0000238 The :class:`NodeTransformer` will walk the AST and use the return value of
239 the visitor methods to replace or remove the old node. If the return value
240 of the visitor method is ``None``, the node will be removed from its
241 location, otherwise it is replaced with the return value. The return value
242 may be the original node in which case no replacement takes place.
Georg Brandl0c77a822008-06-10 16:37:50 +0000243
244 Here is an example transformer that rewrites all occurrences of name lookups
245 (``foo``) to ``data['foo']``::
246
247 class RewriteName(NodeTransformer):
248
249 def visit_Name(self, node):
250 return copy_location(Subscript(
251 value=Name(id='data', ctx=Load()),
Serhiy Storchaka3f228112018-09-27 17:42:37 +0300252 slice=Index(value=Constant(value=node.id)),
Georg Brandl0c77a822008-06-10 16:37:50 +0000253 ctx=node.ctx
254 ), node)
255
256 Keep in mind that if the node you're operating on has child nodes you must
257 either transform the child nodes yourself or call the :meth:`generic_visit`
258 method for the node first.
259
260 For nodes that were part of a collection of statements (that applies to all
261 statement nodes), the visitor may also return a list of nodes rather than
262 just a single node.
263
264 Usually you use the transformer like this::
265
266 node = YourTransformer().visit(node)
267
268
269.. function:: dump(node, annotate_fields=True, include_attributes=False)
270
271 Return a formatted dump of the tree in *node*. This is mainly useful for
272 debugging purposes. The returned string will show the names and the values
273 for fields. This makes the code impossible to evaluate, so if evaluation is
Serhiy Storchakafbc1c262013-11-29 12:17:13 +0200274 wanted *annotate_fields* must be set to ``False``. Attributes such as line
Benjamin Petersondcf97b92008-07-02 17:30:14 +0000275 numbers and column offsets are not dumped by default. If this is wanted,
Georg Brandl0c77a822008-06-10 16:37:50 +0000276 *include_attributes* can be set to ``True``.
Senthil Kumaranf3695bf2016-01-06 21:26:53 -0800277
278.. seealso::
279
Sanyam Khurana338cd832018-01-20 05:55:37 +0530280 `Green Tree Snakes <https://greentreesnakes.readthedocs.io/>`_, an external documentation resource, has good
Senthil Kumaranf3695bf2016-01-06 21:26:53 -0800281 details on working with Python ASTs.