blob: e0b0fe095a57100dc4c4c3d1fc9d060843be07ad [file] [log] [blame]
Dean Moldovan67b52d82016-10-16 19:12:43 +02001Functions
2#########
3
4Before proceeding with this section, make sure that you are already familiar
5with the basics of binding functions and classes, as explained in :doc:`/basics`
6and :doc:`/classes`. The following guide is applicable to both free and member
7functions, i.e. *methods* in Python.
8
Jason Rhinelander17d02832017-01-16 20:35:14 -05009.. _return_value_policies:
10
Dean Moldovan67b52d82016-10-16 19:12:43 +020011Return value policies
12=====================
13
Wenzel Jakob6ba98652016-10-24 23:48:20 +020014Python and C++ use fundamentally different ways of managing the memory and
15lifetime of objects managed by them. This can lead to issues when creating
16bindings for functions that return a non-trivial type. Just by looking at the
17type information, it is not clear whether Python should take charge of the
18returned value and eventually free its resources, or if this is handled on the
Dean Moldovan57a9bbc2017-01-31 16:54:08 +010019C++ side. For this reason, pybind11 provides a several *return value policy*
Wenzel Jakob6ba98652016-10-24 23:48:20 +020020annotations that can be passed to the :func:`module::def` and
21:func:`class_::def` functions. The default policy is
22:enum:`return_value_policy::automatic`.
Dean Moldovan67b52d82016-10-16 19:12:43 +020023
Wenzel Jakob6ba98652016-10-24 23:48:20 +020024Return value policies are tricky, and it's very important to get them right.
25Just to illustrate what can go wrong, consider the following simple example:
Dean Moldovan67b52d82016-10-16 19:12:43 +020026
27.. code-block:: cpp
28
Dean Moldovan57a9bbc2017-01-31 16:54:08 +010029 /* Function declaration */
Wenzel Jakob6ba98652016-10-24 23:48:20 +020030 Data *get_data() { return _data; /* (pointer to a static data structure) */ }
31 ...
Dean Moldovan67b52d82016-10-16 19:12:43 +020032
Dean Moldovan57a9bbc2017-01-31 16:54:08 +010033 /* Binding code */
Wenzel Jakob6ba98652016-10-24 23:48:20 +020034 m.def("get_data", &get_data); // <-- KABOOM, will cause crash when called from Python
35
36What's going on here? When ``get_data()`` is called from Python, the return
37value (a native C++ type) must be wrapped to turn it into a usable Python type.
38In this case, the default return value policy (:enum:`return_value_policy::automatic`)
39causes pybind11 to assume ownership of the static ``_data`` instance.
40
41When Python's garbage collector eventually deletes the Python
42wrapper, pybind11 will also attempt to delete the C++ instance (via ``operator
43delete()``) due to the implied ownership. At this point, the entire application
44will come crashing down, though errors could also be more subtle and involve
45silent data corruption.
46
47In the above example, the policy :enum:`return_value_policy::reference` should have
48been specified so that the global data instance is only *referenced* without any
Dean Moldovan57a9bbc2017-01-31 16:54:08 +010049implied transfer of ownership, i.e.:
Wenzel Jakob6ba98652016-10-24 23:48:20 +020050
51.. code-block:: cpp
52
53 m.def("get_data", &get_data, return_value_policy::reference);
54
55On the other hand, this is not the right policy for many other situations,
56where ignoring ownership could lead to resource leaks.
57As a developer using pybind11, it's important to be familiar with the different
58return value policies, including which situation calls for which one of them.
59The following table provides an overview of available policies:
Dean Moldovan67b52d82016-10-16 19:12:43 +020060
61.. tabularcolumns:: |p{0.5\textwidth}|p{0.45\textwidth}|
62
63+--------------------------------------------------+----------------------------------------------------------------------------+
64| Return value policy | Description |
65+==================================================+============================================================================+
Dean Moldovan67b52d82016-10-16 19:12:43 +020066| :enum:`return_value_policy::take_ownership` | Reference an existing object (i.e. do not create a new copy) and take |
67| | ownership. Python will call the destructor and delete operator when the |
68| | object's reference count reaches zero. Undefined behavior ensues when the |
Wenzel Jakob6ba98652016-10-24 23:48:20 +020069| | C++ side does the same, or when the data was not dynamically allocated. |
Dean Moldovan67b52d82016-10-16 19:12:43 +020070+--------------------------------------------------+----------------------------------------------------------------------------+
71| :enum:`return_value_policy::copy` | Create a new copy of the returned object, which will be owned by Python. |
72| | This policy is comparably safe because the lifetimes of the two instances |
73| | are decoupled. |
74+--------------------------------------------------+----------------------------------------------------------------------------+
75| :enum:`return_value_policy::move` | Use ``std::move`` to move the return value contents into a new instance |
76| | that will be owned by Python. This policy is comparably safe because the |
77| | lifetimes of the two instances (move source and destination) are decoupled.|
78+--------------------------------------------------+----------------------------------------------------------------------------+
79| :enum:`return_value_policy::reference` | Reference an existing object, but do not take ownership. The C++ side is |
80| | responsible for managing the object's lifetime and deallocating it when |
81| | it is no longer used. Warning: undefined behavior will ensue when the C++ |
82| | side deletes an object that is still referenced and used by Python. |
83+--------------------------------------------------+----------------------------------------------------------------------------+
84| :enum:`return_value_policy::reference_internal` | Indicates that the lifetime of the return value is tied to the lifetime |
85| | of a parent object, namely the implicit ``this``, or ``self`` argument of |
86| | the called method or property. Internally, this policy works just like |
87| | :enum:`return_value_policy::reference` but additionally applies a |
88| | ``keep_alive<0, 1>`` *call policy* (described in the next section) that |
89| | prevents the parent object from being garbage collected as long as the |
90| | return value is referenced by Python. This is the default policy for |
91| | property getters created via ``def_property``, ``def_readwrite``, etc. |
92+--------------------------------------------------+----------------------------------------------------------------------------+
jbarlow837830e852017-01-13 02:17:29 -080093| :enum:`return_value_policy::automatic` | **Default policy.** This policy falls back to the policy |
Wenzel Jakob6ba98652016-10-24 23:48:20 +020094| | :enum:`return_value_policy::take_ownership` when the return value is a |
thorinke72eaa42017-02-17 12:57:39 +010095| | pointer. Otherwise, it uses :enum:`return_value_policy::move` or |
96| | :enum:`return_value_policy::copy` for rvalue and lvalue references, |
97| | respectively. See above for a description of what all of these different |
98| | policies do. |
Wenzel Jakob6ba98652016-10-24 23:48:20 +020099+--------------------------------------------------+----------------------------------------------------------------------------+
100| :enum:`return_value_policy::automatic_reference` | As above, but use policy :enum:`return_value_policy::reference` when the |
101| | return value is a pointer. This is the default conversion policy for |
102| | function arguments when calling Python functions manually from C++ code |
103| | (i.e. via handle::operator()). You probably won't need to use this. |
104+--------------------------------------------------+----------------------------------------------------------------------------+
105
Dean Moldovan03f627e2016-11-01 11:44:57 +0100106Return value policies can also be applied to properties:
Wenzel Jakob6ba98652016-10-24 23:48:20 +0200107
108.. code-block:: cpp
109
110 class_<MyClass>(m, "MyClass")
Dean Moldovan03f627e2016-11-01 11:44:57 +0100111 .def_property("data", &MyClass::getData, &MyClass::setData,
112 py::return_value_policy::copy);
113
114Technically, the code above applies the policy to both the getter and the
115setter function, however, the setter doesn't really care about *return*
116value policies which makes this a convenient terse syntax. Alternatively,
117targeted arguments can be passed through the :class:`cpp_function` constructor:
118
119.. code-block:: cpp
120
121 class_<MyClass>(m, "MyClass")
122 .def_property("data"
Wenzel Jakob6ba98652016-10-24 23:48:20 +0200123 py::cpp_function(&MyClass::getData, py::return_value_policy::copy),
124 py::cpp_function(&MyClass::setData)
125 );
Dean Moldovan67b52d82016-10-16 19:12:43 +0200126
127.. warning::
128
129 Code with invalid return value policies might access unitialized memory or
130 free data structures multiple times, which can lead to hard-to-debug
131 non-determinism and segmentation faults, hence it is worth spending the
132 time to understand all the different options in the table above.
133
Wenzel Jakob6ba98652016-10-24 23:48:20 +0200134.. note::
135
136 One important aspect of the above policies is that they only apply to
137 instances which pybind11 has *not* seen before, in which case the policy
138 clarifies essential questions about the return value's lifetime and
139 ownership. When pybind11 knows the instance already (as identified by its
140 type and address in memory), it will return the existing Python object
141 wrapper rather than creating a new copy.
Dean Moldovan67b52d82016-10-16 19:12:43 +0200142
143.. note::
144
145 The next section on :ref:`call_policies` discusses *call policies* that can be
146 specified *in addition* to a return value policy from the list above. Call
147 policies indicate reference relationships that can involve both return values
148 and parameters of functions.
149
150.. note::
151
152 As an alternative to elaborate call policies and lifetime management logic,
153 consider using smart pointers (see the section on :ref:`smart_pointers` for
154 details). Smart pointers can tell whether an object is still referenced from
155 C++ or Python, which generally eliminates the kinds of inconsistencies that
156 can lead to crashes or undefined behavior. For functions returning smart
157 pointers, it is not necessary to specify a return value policy.
158
159.. _call_policies:
160
161Additional call policies
162========================
163
Dean Moldovan57a9bbc2017-01-31 16:54:08 +0100164In addition to the above return value policies, further *call policies* can be
165specified to indicate dependencies between parameters. In general, call policies
166are required when the C++ object is any kind of container and another object is being
jbarlow837830e852017-01-13 02:17:29 -0800167added to the container.
168
169There is currently just
Dean Moldovan67b52d82016-10-16 19:12:43 +0200170one policy named ``keep_alive<Nurse, Patient>``, which indicates that the
171argument with index ``Patient`` should be kept alive at least until the
172argument with index ``Nurse`` is freed by the garbage collector. Argument
173indices start at one, while zero refers to the return value. For methods, index
174``1`` refers to the implicit ``this`` pointer, while regular arguments begin at
175index ``2``. Arbitrarily many call policies can be specified. When a ``Nurse``
176with value ``None`` is detected at runtime, the call policy does nothing.
177
178This feature internally relies on the ability to create a *weak reference* to
179the nurse object, which is permitted by all classes exposed via pybind11. When
180the nurse object does not support weak references, an exception will be thrown.
181
182Consider the following example: here, the binding code for a list append
183operation ties the lifetime of the newly added element to the underlying
184container:
185
186.. code-block:: cpp
187
188 py::class_<List>(m, "List")
189 .def("append", &List::append, py::keep_alive<1, 2>());
190
191.. note::
192
193 ``keep_alive`` is analogous to the ``with_custodian_and_ward`` (if Nurse,
194 Patient != 0) and ``with_custodian_and_ward_postcall`` (if Nurse/Patient ==
195 0) policies from Boost.Python.
196
197.. seealso::
198
199 The file :file:`tests/test_keep_alive.cpp` contains a complete example
200 that demonstrates using :class:`keep_alive` in more detail.
201
202.. _python_objects_as_args:
203
204Python objects as arguments
205===========================
206
207pybind11 exposes all major Python types using thin C++ wrapper classes. These
208wrapper classes can also be used as parameters of functions in bindings, which
209makes it possible to directly work with native Python types on the C++ side.
210For instance, the following statement iterates over a Python ``dict``:
211
212.. code-block:: cpp
213
214 void print_dict(py::dict dict) {
215 /* Easily interact with Python types */
216 for (auto item : dict)
myd73499b815ad2017-01-13 18:15:52 +0800217 std::cout << "key=" << std::string(py::str(item.first)) << ", "
218 << "value=" << std::string(py::str(item.second)) << std::endl;
Dean Moldovan67b52d82016-10-16 19:12:43 +0200219 }
220
221It can be exported:
222
223.. code-block:: cpp
224
225 m.def("print_dict", &print_dict);
226
227And used in Python as usual:
228
229.. code-block:: pycon
230
231 >>> print_dict({'foo': 123, 'bar': 'hello'})
232 key=foo, value=123
233 key=bar, value=hello
234
235For more information on using Python objects in C++, see :doc:`/advanced/pycpp/index`.
236
237Accepting \*args and \*\*kwargs
238===============================
239
240Python provides a useful mechanism to define functions that accept arbitrary
241numbers of arguments and keyword arguments:
242
243.. code-block:: python
244
245 def generic(*args, **kwargs):
246 ... # do something with args and kwargs
247
248Such functions can also be created using pybind11:
249
250.. code-block:: cpp
251
252 void generic(py::args args, py::kwargs kwargs) {
253 /// .. do something with args
254 if (kwargs)
255 /// .. do something with kwargs
256 }
257
258 /// Binding code
259 m.def("generic", &generic);
260
261The class ``py::args`` derives from ``py::tuple`` and ``py::kwargs`` derives
Jason Rhinelander2686da82017-01-21 23:42:14 -0500262from ``py::dict``.
Dean Moldovan67b52d82016-10-16 19:12:43 +0200263
Jason Rhinelander2686da82017-01-21 23:42:14 -0500264You may also use just one or the other, and may combine these with other
265arguments as long as the ``py::args`` and ``py::kwargs`` arguments are the last
266arguments accepted by the function.
Dean Moldovan67b52d82016-10-16 19:12:43 +0200267
Jason Rhinelander2686da82017-01-21 23:42:14 -0500268Please refer to the other examples for details on how to iterate over these,
269and on how to cast their entries into C++ objects. A demonstration is also
270available in ``tests/test_kwargs_and_defaults.cpp``.
271
272.. note::
273
274 When combining \*args or \*\*kwargs with :ref:`keyword_args` you should
275 *not* include ``py::arg`` tags for the ``py::args`` and ``py::kwargs``
276 arguments.
Dean Moldovan67b52d82016-10-16 19:12:43 +0200277
278Default arguments revisited
279===========================
280
281The section on :ref:`default_args` previously discussed basic usage of default
282arguments using pybind11. One noteworthy aspect of their implementation is that
283default arguments are converted to Python objects right at declaration time.
284Consider the following example:
285
286.. code-block:: cpp
287
288 py::class_<MyClass>("MyClass")
289 .def("myFunction", py::arg("arg") = SomeType(123));
290
291In this case, pybind11 must already be set up to deal with values of the type
292``SomeType`` (via a prior instantiation of ``py::class_<SomeType>``), or an
293exception will be thrown.
294
295Another aspect worth highlighting is that the "preview" of the default argument
296in the function signature is generated using the object's ``__repr__`` method.
297If not available, the signature may not be very helpful, e.g.:
298
299.. code-block:: pycon
300
301 FUNCTIONS
302 ...
303 | myFunction(...)
304 | Signature : (MyClass, arg : SomeType = <SomeType object at 0x101b7b080>) -> NoneType
305 ...
306
307The first way of addressing this is by defining ``SomeType.__repr__``.
308Alternatively, it is possible to specify the human-readable preview of the
309default argument manually using the ``arg_v`` notation:
310
311.. code-block:: cpp
312
313 py::class_<MyClass>("MyClass")
314 .def("myFunction", py::arg_v("arg", SomeType(123), "SomeType(123)"));
315
316Sometimes it may be necessary to pass a null pointer value as a default
317argument. In this case, remember to cast it to the underlying type in question,
318like so:
319
320.. code-block:: cpp
321
322 py::class_<MyClass>("MyClass")
323 .def("myFunction", py::arg("arg") = (SomeType *) nullptr);
Jason Rhinelanderabc29ca2017-01-23 03:50:00 -0500324
Jason Rhinelander17d02832017-01-16 20:35:14 -0500325.. _nonconverting_arguments:
326
Jason Rhinelanderabc29ca2017-01-23 03:50:00 -0500327Non-converting arguments
328========================
329
330Certain argument types may support conversion from one type to another. Some
331examples of conversions are:
332
333* :ref:`implicit_conversions` declared using ``py::implicitly_convertible<A,B>()``
334* Calling a method accepting a double with an integer argument
335* Calling a ``std::complex<float>`` argument with a non-complex python type
336 (for example, with a float). (Requires the optional ``pybind11/complex.h``
337 header).
338* Calling a function taking an Eigen matrix reference with a numpy array of the
339 wrong type or of an incompatible data layout. (Requires the optional
340 ``pybind11/eigen.h`` header).
341
342This behaviour is sometimes undesirable: the binding code may prefer to raise
343an error rather than convert the argument. This behaviour can be obtained
344through ``py::arg`` by calling the ``.noconvert()`` method of the ``py::arg``
345object, such as:
346
347.. code-block:: cpp
348
349 m.def("floats_only", [](double f) { return 0.5 * f; }, py::arg("f").noconvert());
350 m.def("floats_preferred", [](double f) { return 0.5 * f; }, py::arg("f"));
351
352Attempting the call the second function (the one without ``.noconvert()``) with
353an integer will succeed, but attempting to call the ``.noconvert()`` version
354will fail with a ``TypeError``:
355
356.. code-block:: pycon
357
358 >>> floats_preferred(4)
359 2.0
360 >>> floats_only(4)
361 Traceback (most recent call last):
362 File "<stdin>", line 1, in <module>
363 TypeError: floats_only(): incompatible function arguments. The following argument types are supported:
364 1. (f: float) -> float
365
366 Invoked with: 4
367
368You may, of course, combine this with the :var:`_a` shorthand notation (see
369:ref:`keyword_args`) and/or :ref:`default_args`. It is also permitted to omit
370the argument name by using the ``py::arg()`` constructor without an argument
371name, i.e. by specifying ``py::arg().noconvert()``.
372
373.. note::
374
375 When specifying ``py::arg`` options it is necessary to provide the same
376 number of options as the bound function has arguments. Thus if you want to
377 enable no-convert behaviour for just one of several arguments, you will
378 need to specify a ``py::arg()`` annotation for each argument with the
379 no-convert argument modified to ``py::arg().noconvert()``.
Jason Rhinelandere5505892017-02-03 18:25:34 -0500380
381Overload resolution order
382=========================
383
384When a function or method with multiple overloads is called from Python,
385pybind11 determines which overload to call in two passes. The first pass
386attempts to call each overload without allowing argument conversion (as if
387every argument had been specified as ``py::arg().noconvert()`` as decribed
388above).
389
390If no overload succeeds in the no-conversion first pass, a second pass is
391attempted in which argument conversion is allowed (except where prohibited via
392an explicit ``py::arg().noconvert()`` attribute in the function definition).
393
394If the second pass also fails a ``TypeError`` is raised.
395
396Within each pass, overloads are tried in the order they were registered with
397pybind11.
398
399What this means in practice is that pybind11 will prefer any overload that does
400not require conversion of arguments to an overload that does, but otherwise prefers
401earlier-defined overloads to later-defined ones.
402
403.. note::
404
405 pybind11 does *not* further prioritize based on the number/pattern of
406 overloaded arguments. That is, pybind11 does not prioritize a function
407 requiring one conversion over one requiring three, but only prioritizes
408 overloads requiring no conversion at all to overloads that require
409 conversion of at least one argument.