Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 1 | .. _advanced: |
| 2 | |
| 3 | Advanced topics |
| 4 | ############### |
| 5 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 6 | For brevity, the rest of this chapter assumes that the following two lines are |
| 7 | present: |
| 8 | |
| 9 | .. code-block:: cpp |
| 10 | |
Wenzel Jakob | 8f4eb00 | 2015-10-15 18:13:33 +0200 | [diff] [blame] | 11 | #include <pybind11/pybind11.h> |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 12 | |
Wenzel Jakob | 10e62e1 | 2015-10-15 22:46:07 +0200 | [diff] [blame] | 13 | namespace py = pybind11; |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 14 | |
Wenzel Jakob | de3ad07 | 2016-02-02 11:38:21 +0100 | [diff] [blame] | 15 | Exporting constants and mutable objects |
| 16 | ======================================= |
| 17 | |
| 18 | To expose a C++ constant, use the ``attr`` function to register it in a module |
| 19 | as shown below. The ``int_`` class is one of many small wrapper objects defined |
| 20 | in ``pybind11/pytypes.h``. General objects (including integers) can also be |
| 21 | converted using the function ``cast``. |
| 22 | |
| 23 | .. code-block:: cpp |
| 24 | |
| 25 | PYBIND11_PLUGIN(example) { |
| 26 | py::module m("example", "pybind11 example plugin"); |
| 27 | m.attr("MY_CONSTANT") = py::int_(123); |
| 28 | m.attr("MY_CONSTANT_2") = py::cast(new MyObject()); |
| 29 | } |
| 30 | |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 31 | Operator overloading |
| 32 | ==================== |
| 33 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 34 | Suppose that we're given the following ``Vector2`` class with a vector addition |
| 35 | and scalar multiplication operation, all implemented using overloaded operators |
| 36 | in C++. |
| 37 | |
| 38 | .. code-block:: cpp |
| 39 | |
| 40 | class Vector2 { |
| 41 | public: |
| 42 | Vector2(float x, float y) : x(x), y(y) { } |
| 43 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 44 | Vector2 operator+(const Vector2 &v) const { return Vector2(x + v.x, y + v.y); } |
| 45 | Vector2 operator*(float value) const { return Vector2(x * value, y * value); } |
| 46 | Vector2& operator+=(const Vector2 &v) { x += v.x; y += v.y; return *this; } |
| 47 | Vector2& operator*=(float v) { x *= v; y *= v; return *this; } |
| 48 | |
Wenzel Jakob | f64feaf | 2016-04-28 14:33:45 +0200 | [diff] [blame] | 49 | friend Vector2 operator*(float f, const Vector2 &v) { |
| 50 | return Vector2(f * v.x, f * v.y); |
| 51 | } |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 52 | |
Wenzel Jakob | f64feaf | 2016-04-28 14:33:45 +0200 | [diff] [blame] | 53 | std::string toString() const { |
| 54 | return "[" + std::to_string(x) + ", " + std::to_string(y) + "]"; |
| 55 | } |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 56 | private: |
| 57 | float x, y; |
| 58 | }; |
| 59 | |
| 60 | The following snippet shows how the above operators can be conveniently exposed |
| 61 | to Python. |
| 62 | |
| 63 | .. code-block:: cpp |
| 64 | |
Wenzel Jakob | 8f4eb00 | 2015-10-15 18:13:33 +0200 | [diff] [blame] | 65 | #include <pybind11/operators.h> |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 66 | |
Wenzel Jakob | b1b7140 | 2015-10-18 16:48:30 +0200 | [diff] [blame] | 67 | PYBIND11_PLUGIN(example) { |
Wenzel Jakob | 8f4eb00 | 2015-10-15 18:13:33 +0200 | [diff] [blame] | 68 | py::module m("example", "pybind11 example plugin"); |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 69 | |
| 70 | py::class_<Vector2>(m, "Vector2") |
| 71 | .def(py::init<float, float>()) |
| 72 | .def(py::self + py::self) |
| 73 | .def(py::self += py::self) |
| 74 | .def(py::self *= float()) |
| 75 | .def(float() * py::self) |
| 76 | .def("__repr__", &Vector2::toString); |
| 77 | |
| 78 | return m.ptr(); |
| 79 | } |
| 80 | |
| 81 | Note that a line like |
| 82 | |
| 83 | .. code-block:: cpp |
| 84 | |
| 85 | .def(py::self * float()) |
| 86 | |
| 87 | is really just short hand notation for |
| 88 | |
| 89 | .. code-block:: cpp |
| 90 | |
| 91 | .def("__mul__", [](const Vector2 &a, float b) { |
| 92 | return a * b; |
| 93 | }) |
| 94 | |
| 95 | This can be useful for exposing additional operators that don't exist on the |
| 96 | C++ side, or to perform other types of customization. |
| 97 | |
| 98 | .. note:: |
| 99 | |
| 100 | To use the more convenient ``py::self`` notation, the additional |
Wenzel Jakob | 8f4eb00 | 2015-10-15 18:13:33 +0200 | [diff] [blame] | 101 | header file :file:`pybind11/operators.h` must be included. |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 102 | |
| 103 | .. seealso:: |
| 104 | |
| 105 | The file :file:`example/example3.cpp` contains a complete example that |
| 106 | demonstrates how to work with overloaded operators in more detail. |
| 107 | |
| 108 | Callbacks and passing anonymous functions |
| 109 | ========================================= |
| 110 | |
| 111 | The C++11 standard brought lambda functions and the generic polymorphic |
| 112 | function wrapper ``std::function<>`` to the C++ programming language, which |
| 113 | enable powerful new ways of working with functions. Lambda functions come in |
| 114 | two flavors: stateless lambda function resemble classic function pointers that |
| 115 | link to an anonymous piece of code, while stateful lambda functions |
| 116 | additionally depend on captured variables that are stored in an anonymous |
| 117 | *lambda closure object*. |
| 118 | |
| 119 | Here is a simple example of a C++ function that takes an arbitrary function |
| 120 | (stateful or stateless) with signature ``int -> int`` as an argument and runs |
| 121 | it with the value 10. |
| 122 | |
| 123 | .. code-block:: cpp |
| 124 | |
| 125 | int func_arg(const std::function<int(int)> &f) { |
| 126 | return f(10); |
| 127 | } |
| 128 | |
| 129 | The example below is more involved: it takes a function of signature ``int -> int`` |
| 130 | and returns another function of the same kind. The return value is a stateful |
| 131 | lambda function, which stores the value ``f`` in the capture object and adds 1 to |
| 132 | its return value upon execution. |
| 133 | |
| 134 | .. code-block:: cpp |
| 135 | |
| 136 | std::function<int(int)> func_ret(const std::function<int(int)> &f) { |
| 137 | return [f](int i) { |
| 138 | return f(i) + 1; |
| 139 | }; |
| 140 | } |
| 141 | |
Wenzel Jakob | 8f4eb00 | 2015-10-15 18:13:33 +0200 | [diff] [blame] | 142 | After including the extra header file :file:`pybind11/functional.h`, it is almost |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 143 | trivial to generate binding code for both of these functions. |
| 144 | |
| 145 | .. code-block:: cpp |
| 146 | |
Wenzel Jakob | 8f4eb00 | 2015-10-15 18:13:33 +0200 | [diff] [blame] | 147 | #include <pybind11/functional.h> |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 148 | |
Wenzel Jakob | b1b7140 | 2015-10-18 16:48:30 +0200 | [diff] [blame] | 149 | PYBIND11_PLUGIN(example) { |
Wenzel Jakob | 8f4eb00 | 2015-10-15 18:13:33 +0200 | [diff] [blame] | 150 | py::module m("example", "pybind11 example plugin"); |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 151 | |
| 152 | m.def("func_arg", &func_arg); |
| 153 | m.def("func_ret", &func_ret); |
| 154 | |
| 155 | return m.ptr(); |
| 156 | } |
| 157 | |
| 158 | The following interactive session shows how to call them from Python. |
| 159 | |
| 160 | .. code-block:: python |
| 161 | |
| 162 | $ python |
| 163 | >>> import example |
| 164 | >>> def square(i): |
| 165 | ... return i * i |
| 166 | ... |
| 167 | >>> example.func_arg(square) |
| 168 | 100L |
| 169 | >>> square_plus_1 = example.func_ret(square) |
| 170 | >>> square_plus_1(4) |
| 171 | 17L |
| 172 | >>> |
| 173 | |
| 174 | .. note:: |
| 175 | |
| 176 | This functionality is very useful when generating bindings for callbacks in |
| 177 | C++ libraries (e.g. a graphical user interface library). |
| 178 | |
| 179 | The file :file:`example/example5.cpp` contains a complete example that |
| 180 | demonstrates how to work with callbacks and anonymous functions in more detail. |
| 181 | |
Wenzel Jakob | a4175d6 | 2015-11-17 08:30:34 +0100 | [diff] [blame] | 182 | .. warning:: |
| 183 | |
| 184 | Keep in mind that passing a function from C++ to Python (or vice versa) |
| 185 | will instantiate a piece of wrapper code that translates function |
| 186 | invocations between the two languages. Copying the same function back and |
| 187 | forth between Python and C++ many times in a row will cause these wrappers |
| 188 | to accumulate, which can decrease performance. |
| 189 | |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 190 | Overriding virtual functions in Python |
| 191 | ====================================== |
| 192 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 193 | Suppose that a C++ class or interface has a virtual function that we'd like to |
| 194 | to override from within Python (we'll focus on the class ``Animal``; ``Dog`` is |
| 195 | given as a specific example of how one would do this with traditional C++ |
| 196 | code). |
| 197 | |
| 198 | .. code-block:: cpp |
| 199 | |
| 200 | class Animal { |
| 201 | public: |
| 202 | virtual ~Animal() { } |
| 203 | virtual std::string go(int n_times) = 0; |
| 204 | }; |
| 205 | |
| 206 | class Dog : public Animal { |
| 207 | public: |
| 208 | std::string go(int n_times) { |
| 209 | std::string result; |
| 210 | for (int i=0; i<n_times; ++i) |
| 211 | result += "woof! "; |
| 212 | return result; |
| 213 | } |
| 214 | }; |
| 215 | |
| 216 | Let's also suppose that we are given a plain function which calls the |
| 217 | function ``go()`` on an arbitrary ``Animal`` instance. |
| 218 | |
| 219 | .. code-block:: cpp |
| 220 | |
| 221 | std::string call_go(Animal *animal) { |
| 222 | return animal->go(3); |
| 223 | } |
| 224 | |
| 225 | Normally, the binding code for these classes would look as follows: |
| 226 | |
| 227 | .. code-block:: cpp |
| 228 | |
Wenzel Jakob | b1b7140 | 2015-10-18 16:48:30 +0200 | [diff] [blame] | 229 | PYBIND11_PLUGIN(example) { |
Wenzel Jakob | 8f4eb00 | 2015-10-15 18:13:33 +0200 | [diff] [blame] | 230 | py::module m("example", "pybind11 example plugin"); |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 231 | |
| 232 | py::class_<Animal> animal(m, "Animal"); |
| 233 | animal |
| 234 | .def("go", &Animal::go); |
| 235 | |
| 236 | py::class_<Dog>(m, "Dog", animal) |
| 237 | .def(py::init<>()); |
| 238 | |
| 239 | m.def("call_go", &call_go); |
| 240 | |
| 241 | return m.ptr(); |
| 242 | } |
| 243 | |
| 244 | However, these bindings are impossible to extend: ``Animal`` is not |
| 245 | constructible, and we clearly require some kind of "trampoline" that |
| 246 | redirects virtual calls back to Python. |
| 247 | |
| 248 | Defining a new type of ``Animal`` from within Python is possible but requires a |
| 249 | helper class that is defined as follows: |
| 250 | |
| 251 | .. code-block:: cpp |
| 252 | |
| 253 | class PyAnimal : public Animal { |
| 254 | public: |
| 255 | /* Inherit the constructors */ |
| 256 | using Animal::Animal; |
| 257 | |
| 258 | /* Trampoline (need one for each virtual function) */ |
| 259 | std::string go(int n_times) { |
Wenzel Jakob | b1b7140 | 2015-10-18 16:48:30 +0200 | [diff] [blame] | 260 | PYBIND11_OVERLOAD_PURE( |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 261 | std::string, /* Return type */ |
| 262 | Animal, /* Parent class */ |
| 263 | go, /* Name of function */ |
| 264 | n_times /* Argument(s) */ |
| 265 | ); |
| 266 | } |
| 267 | }; |
| 268 | |
Wenzel Jakob | b1b7140 | 2015-10-18 16:48:30 +0200 | [diff] [blame] | 269 | The macro :func:`PYBIND11_OVERLOAD_PURE` should be used for pure virtual |
| 270 | functions, and :func:`PYBIND11_OVERLOAD` should be used for functions which have |
Wenzel Jakob | 1e3be73 | 2016-05-24 23:42:05 +0200 | [diff] [blame^] | 271 | a default implementation. |
| 272 | |
| 273 | There are also two alternate macros :func:`PYBIND11_OVERLOAD_PURE_NAME` and |
| 274 | :func:`PYBIND11_OVERLOAD_NAME` which take a string-valued name argument |
| 275 | after the *Name of the function* slot. This is useful when the C++ and Python |
| 276 | versions of the function have different names, e.g. ``operator()`` vs ``__call__``. |
| 277 | |
| 278 | The binding code also needs a few minor adaptations (highlighted): |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 279 | |
| 280 | .. code-block:: cpp |
| 281 | :emphasize-lines: 4,6,7 |
| 282 | |
Wenzel Jakob | b1b7140 | 2015-10-18 16:48:30 +0200 | [diff] [blame] | 283 | PYBIND11_PLUGIN(example) { |
Wenzel Jakob | 8f4eb00 | 2015-10-15 18:13:33 +0200 | [diff] [blame] | 284 | py::module m("example", "pybind11 example plugin"); |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 285 | |
| 286 | py::class_<PyAnimal> animal(m, "Animal"); |
| 287 | animal |
| 288 | .alias<Animal>() |
| 289 | .def(py::init<>()) |
| 290 | .def("go", &Animal::go); |
| 291 | |
| 292 | py::class_<Dog>(m, "Dog", animal) |
| 293 | .def(py::init<>()); |
| 294 | |
| 295 | m.def("call_go", &call_go); |
| 296 | |
| 297 | return m.ptr(); |
| 298 | } |
| 299 | |
| 300 | Importantly, the trampoline helper class is used as the template argument to |
| 301 | :class:`class_`, and a call to :func:`class_::alias` informs the binding |
| 302 | generator that this is merely an alias for the underlying type ``Animal``. |
| 303 | Following this, we are able to define a constructor as usual. |
| 304 | |
| 305 | The Python session below shows how to override ``Animal::go`` and invoke it via |
| 306 | a virtual method call. |
| 307 | |
Wenzel Jakob | de3ad07 | 2016-02-02 11:38:21 +0100 | [diff] [blame] | 308 | .. code-block:: python |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 309 | |
| 310 | >>> from example import * |
| 311 | >>> d = Dog() |
| 312 | >>> call_go(d) |
| 313 | u'woof! woof! woof! ' |
| 314 | >>> class Cat(Animal): |
| 315 | ... def go(self, n_times): |
| 316 | ... return "meow! " * n_times |
| 317 | ... |
| 318 | >>> c = Cat() |
| 319 | >>> call_go(c) |
| 320 | u'meow! meow! meow! ' |
| 321 | |
Wenzel Jakob | bd986fe | 2016-05-21 10:48:30 +0200 | [diff] [blame] | 322 | .. warning:: |
| 323 | |
| 324 | Both :func:`PYBIND11_OVERLOAD` and :func:`PYBIND11_OVERLOAD_PURE` are |
| 325 | macros, which means that they can get confused by commas in a template |
| 326 | argument such as ``PYBIND11_OVERLOAD(MyReturnValue<T1, T2>, myFunc)``. In |
| 327 | this case, the preprocessor assumes that the comma indicates the beginnning |
| 328 | of the next parameter. Use a ``typedef`` to bind the template to another |
| 329 | name and use it in the macro to avoid this problem. |
| 330 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 331 | .. seealso:: |
| 332 | |
| 333 | The file :file:`example/example12.cpp` contains a complete example that |
| 334 | demonstrates how to override virtual functions using pybind11 in more |
| 335 | detail. |
| 336 | |
Wenzel Jakob | ecdd868 | 2015-12-07 18:17:58 +0100 | [diff] [blame] | 337 | |
| 338 | Global Interpreter Lock (GIL) |
| 339 | ============================= |
| 340 | |
| 341 | The classes :class:`gil_scoped_release` and :class:`gil_scoped_acquire` can be |
| 342 | used to acquire and release the global interpreter lock in the body of a C++ |
| 343 | function call. In this way, long-running C++ code can be parallelized using |
| 344 | multiple Python threads. Taking the previous section as an example, this could |
| 345 | be realized as follows (important changes highlighted): |
| 346 | |
| 347 | .. code-block:: cpp |
| 348 | :emphasize-lines: 8,9,33,34 |
| 349 | |
| 350 | class PyAnimal : public Animal { |
| 351 | public: |
| 352 | /* Inherit the constructors */ |
| 353 | using Animal::Animal; |
| 354 | |
| 355 | /* Trampoline (need one for each virtual function) */ |
| 356 | std::string go(int n_times) { |
| 357 | /* Acquire GIL before calling Python code */ |
Wenzel Jakob | a4caa85 | 2015-12-14 12:39:02 +0100 | [diff] [blame] | 358 | py::gil_scoped_acquire acquire; |
Wenzel Jakob | ecdd868 | 2015-12-07 18:17:58 +0100 | [diff] [blame] | 359 | |
| 360 | PYBIND11_OVERLOAD_PURE( |
| 361 | std::string, /* Return type */ |
| 362 | Animal, /* Parent class */ |
| 363 | go, /* Name of function */ |
| 364 | n_times /* Argument(s) */ |
| 365 | ); |
| 366 | } |
| 367 | }; |
| 368 | |
| 369 | PYBIND11_PLUGIN(example) { |
| 370 | py::module m("example", "pybind11 example plugin"); |
| 371 | |
| 372 | py::class_<PyAnimal> animal(m, "Animal"); |
| 373 | animal |
| 374 | .alias<Animal>() |
| 375 | .def(py::init<>()) |
| 376 | .def("go", &Animal::go); |
| 377 | |
| 378 | py::class_<Dog>(m, "Dog", animal) |
| 379 | .def(py::init<>()); |
| 380 | |
| 381 | m.def("call_go", [](Animal *animal) -> std::string { |
| 382 | /* Release GIL before calling into (potentially long-running) C++ code */ |
Wenzel Jakob | a4caa85 | 2015-12-14 12:39:02 +0100 | [diff] [blame] | 383 | py::gil_scoped_release release; |
Wenzel Jakob | ecdd868 | 2015-12-07 18:17:58 +0100 | [diff] [blame] | 384 | return call_go(animal); |
| 385 | }); |
| 386 | |
| 387 | return m.ptr(); |
| 388 | } |
| 389 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 390 | Passing STL data structures |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 391 | =========================== |
| 392 | |
Wenzel Jakob | 8f4eb00 | 2015-10-15 18:13:33 +0200 | [diff] [blame] | 393 | When including the additional header file :file:`pybind11/stl.h`, conversions |
Wenzel Jakob | 978e376 | 2016-04-07 18:00:41 +0200 | [diff] [blame] | 394 | between ``std::vector<>``, ``std::list<>``, ``std::set<>``, and ``std::map<>`` |
| 395 | and the Python ``list``, ``set`` and ``dict`` data structures are automatically |
| 396 | enabled. The types ``std::pair<>`` and ``std::tuple<>`` are already supported |
| 397 | out of the box with just the core :file:`pybind11/pybind11.h` header. |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 398 | |
| 399 | .. note:: |
| 400 | |
Wenzel Jakob | 44db04f | 2015-12-14 12:40:45 +0100 | [diff] [blame] | 401 | Arbitrary nesting of any of these types is supported. |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 402 | |
| 403 | .. seealso:: |
| 404 | |
| 405 | The file :file:`example/example2.cpp` contains a complete example that |
| 406 | demonstrates how to pass STL data types in more detail. |
| 407 | |
Wenzel Jakob | b282595 | 2016-04-13 23:33:00 +0200 | [diff] [blame] | 408 | Binding sequence data types, iterators, the slicing protocol, etc. |
| 409 | ================================================================== |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 410 | |
| 411 | Please refer to the supplemental example for details. |
| 412 | |
| 413 | .. seealso:: |
| 414 | |
| 415 | The file :file:`example/example6.cpp` contains a complete example that |
| 416 | shows how to bind a sequence data type, including length queries |
| 417 | (``__len__``), iterators (``__iter__``), the slicing protocol and other |
| 418 | kinds of useful operations. |
| 419 | |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 420 | Return value policies |
| 421 | ===================== |
| 422 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 423 | Python and C++ use wildly different ways of managing the memory and lifetime of |
| 424 | objects managed by them. This can lead to issues when creating bindings for |
| 425 | functions that return a non-trivial type. Just by looking at the type |
| 426 | information, it is not clear whether Python should take charge of the returned |
| 427 | value and eventually free its resources, or if this is handled on the C++ side. |
| 428 | For this reason, pybind11 provides a several `return value policy` annotations |
| 429 | that can be passed to the :func:`module::def` and :func:`class_::def` |
Wenzel Jakob | 61d67f0 | 2015-12-14 12:53:06 +0100 | [diff] [blame] | 430 | functions. The default policy is :enum:`return_value_policy::automatic`. |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 431 | |
Wenzel Jakob | f64feaf | 2016-04-28 14:33:45 +0200 | [diff] [blame] | 432 | .. tabularcolumns:: |p{0.5\textwidth}|p{0.45\textwidth}| |
| 433 | |
Wenzel Jakob | f7b5874 | 2016-04-25 23:04:27 +0200 | [diff] [blame] | 434 | +--------------------------------------------------+----------------------------------------------------------------------------+ |
| 435 | | Return value policy | Description | |
| 436 | +==================================================+============================================================================+ |
| 437 | | :enum:`return_value_policy::automatic` | This is the default return value policy, which falls back to the policy | |
| 438 | | | :enum:`return_value_policy::take_ownership` when the return value is a | |
Wenzel Jakob | e84f557 | 2016-04-26 23:19:19 +0200 | [diff] [blame] | 439 | | | pointer. Otherwise, it uses :enum:`return_value::move` or | |
| 440 | | | :enum:`return_value::copy` for rvalue and lvalue references, respectively. | |
Wenzel Jakob | f7b5874 | 2016-04-25 23:04:27 +0200 | [diff] [blame] | 441 | | | See below for a description of what all of these different policies do. | |
| 442 | +--------------------------------------------------+----------------------------------------------------------------------------+ |
| 443 | | :enum:`return_value_policy::automatic_reference` | As above, but use policy :enum:`return_value_policy::reference` when the | |
Wenzel Jakob | e84f557 | 2016-04-26 23:19:19 +0200 | [diff] [blame] | 444 | | | return value is a pointer. You probably won't need to use this. | |
Wenzel Jakob | f7b5874 | 2016-04-25 23:04:27 +0200 | [diff] [blame] | 445 | +--------------------------------------------------+----------------------------------------------------------------------------+ |
| 446 | | :enum:`return_value_policy::take_ownership` | Reference an existing object (i.e. do not create a new copy) and take | |
| 447 | | | ownership. Python will call the destructor and delete operator when the | |
| 448 | | | object's reference count reaches zero. Undefined behavior ensues when the | |
| 449 | | | C++ side does the same.. | |
| 450 | +--------------------------------------------------+----------------------------------------------------------------------------+ |
| 451 | | :enum:`return_value_policy::copy` | Create a new copy of the returned object, which will be owned by Python. | |
| 452 | | | This policy is comparably safe because the lifetimes of the two instances | |
| 453 | | | are decoupled. | |
| 454 | +--------------------------------------------------+----------------------------------------------------------------------------+ |
| 455 | | :enum:`return_value_policy::move` | Use ``std::move`` to move the return value contents into a new instance | |
| 456 | | | that will be owned by Python. This policy is comparably safe because the | |
| 457 | | | lifetimes of the two instances (move source and destination) are decoupled.| |
| 458 | +--------------------------------------------------+----------------------------------------------------------------------------+ |
| 459 | | :enum:`return_value_policy::reference` | Reference an existing object, but do not take ownership. The C++ side is | |
| 460 | | | responsible for managing the object's lifetime and deallocating it when | |
| 461 | | | it is no longer used. Warning: undefined behavior will ensue when the C++ | |
Wenzel Jakob | e84f557 | 2016-04-26 23:19:19 +0200 | [diff] [blame] | 462 | | | side deletes an object that is still referenced and used by Python. | |
Wenzel Jakob | f7b5874 | 2016-04-25 23:04:27 +0200 | [diff] [blame] | 463 | +--------------------------------------------------+----------------------------------------------------------------------------+ |
Wenzel Jakob | e84f557 | 2016-04-26 23:19:19 +0200 | [diff] [blame] | 464 | | :enum:`return_value_policy::reference_internal` | This policy only applies to methods and properties. It references the | |
| 465 | | | object without taking ownership similar to the above | |
| 466 | | | :enum:`return_value_policy::reference` policy. In contrast to that policy, | |
| 467 | | | the function or property's implicit ``this`` argument (called the *parent*)| |
| 468 | | | is considered to be the the owner of the return value (the *child*). | |
| 469 | | | pybind11 then couples the lifetime of the parent to the child via a | |
| 470 | | | reference relationship that ensures that the parent cannot be garbage | |
| 471 | | | collected while Python is still using the child. More advanced variations | |
| 472 | | | of this scheme are also possible using combinations of | |
| 473 | | | :enum:`return_value_policy::reference` and the :class:`keep_alive` call | |
| 474 | | | policy described next. | |
Wenzel Jakob | f7b5874 | 2016-04-25 23:04:27 +0200 | [diff] [blame] | 475 | +--------------------------------------------------+----------------------------------------------------------------------------+ |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 476 | |
Wenzel Jakob | e84f557 | 2016-04-26 23:19:19 +0200 | [diff] [blame] | 477 | The following example snippet shows a use case of the |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 478 | :enum:`return_value_policy::reference_internal` policy. |
| 479 | |
| 480 | .. code-block:: cpp |
| 481 | |
| 482 | class Example { |
| 483 | public: |
| 484 | Internal &get_internal() { return internal; } |
| 485 | private: |
| 486 | Internal internal; |
| 487 | }; |
| 488 | |
Wenzel Jakob | b1b7140 | 2015-10-18 16:48:30 +0200 | [diff] [blame] | 489 | PYBIND11_PLUGIN(example) { |
Wenzel Jakob | 8f4eb00 | 2015-10-15 18:13:33 +0200 | [diff] [blame] | 490 | py::module m("example", "pybind11 example plugin"); |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 491 | |
| 492 | py::class_<Example>(m, "Example") |
| 493 | .def(py::init<>()) |
Wenzel Jakob | e84f557 | 2016-04-26 23:19:19 +0200 | [diff] [blame] | 494 | .def("get_internal", &Example::get_internal, "Return the internal data", |
| 495 | py::return_value_policy::reference_internal); |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 496 | |
| 497 | return m.ptr(); |
| 498 | } |
| 499 | |
Wenzel Jakob | e84f557 | 2016-04-26 23:19:19 +0200 | [diff] [blame] | 500 | .. warning:: |
| 501 | |
| 502 | Code with invalid call policies might access unitialized memory or free |
| 503 | data structures multiple times, which can lead to hard-to-debug |
| 504 | non-determinism and segmentation faults, hence it is worth spending the |
| 505 | time to understand all the different options in the table above. |
| 506 | |
| 507 | .. note:: |
| 508 | |
| 509 | The next section on :ref:`call_policies` discusses *call policies* that can be |
| 510 | specified *in addition* to a return value policy from the list above. Call |
| 511 | policies indicate reference relationships that can involve both return values |
| 512 | and parameters of functions. |
| 513 | |
| 514 | .. note:: |
| 515 | |
| 516 | As an alternative to elaborate call policies and lifetime management logic, |
| 517 | consider using smart pointers (see the section on :ref:`smart_pointers` for |
| 518 | details). Smart pointers can tell whether an object is still referenced from |
| 519 | C++ or Python, which generally eliminates the kinds of inconsistencies that |
| 520 | can lead to crashes or undefined behavior. For functions returning smart |
| 521 | pointers, it is not necessary to specify a return value policy. |
Wenzel Jakob | 5f218b3 | 2016-01-17 22:36:39 +0100 | [diff] [blame] | 522 | |
Wenzel Jakob | f7b5874 | 2016-04-25 23:04:27 +0200 | [diff] [blame] | 523 | .. _call_policies: |
| 524 | |
Wenzel Jakob | 5f218b3 | 2016-01-17 22:36:39 +0100 | [diff] [blame] | 525 | Additional call policies |
| 526 | ======================== |
| 527 | |
| 528 | In addition to the above return value policies, further `call policies` can be |
| 529 | specified to indicate dependencies between parameters. There is currently just |
| 530 | one policy named ``keep_alive<Nurse, Patient>``, which indicates that the |
| 531 | argument with index ``Patient`` should be kept alive at least until the |
| 532 | argument with index ``Nurse`` is freed by the garbage collector; argument |
Wenzel Jakob | 8e93df8 | 2016-05-01 02:36:58 +0200 | [diff] [blame] | 533 | indices start at one, while zero refers to the return value. For methods, index |
| 534 | one refers to the implicit ``this`` pointer, while regular arguments begin at |
| 535 | index two. Arbitrarily many call policies can be specified. |
Wenzel Jakob | 5f218b3 | 2016-01-17 22:36:39 +0100 | [diff] [blame] | 536 | |
Wenzel Jakob | 8e93df8 | 2016-05-01 02:36:58 +0200 | [diff] [blame] | 537 | Consider the following example: the binding code for a list append operation |
| 538 | that ties the lifetime of the newly added element to the underlying container |
| 539 | might be declared as follows: |
Wenzel Jakob | 5f218b3 | 2016-01-17 22:36:39 +0100 | [diff] [blame] | 540 | |
| 541 | .. code-block:: cpp |
| 542 | |
| 543 | py::class_<List>(m, "List") |
| 544 | .def("append", &List::append, py::keep_alive<1, 2>()); |
| 545 | |
| 546 | .. note:: |
| 547 | |
| 548 | ``keep_alive`` is analogous to the ``with_custodian_and_ward`` (if Nurse, |
| 549 | Patient != 0) and ``with_custodian_and_ward_postcall`` (if Nurse/Patient == |
| 550 | 0) policies from Boost.Python. |
| 551 | |
Wenzel Jakob | 6158716 | 2016-01-18 22:38:52 +0100 | [diff] [blame] | 552 | .. seealso:: |
| 553 | |
| 554 | The file :file:`example/example13.cpp` contains a complete example that |
| 555 | demonstrates using :class:`keep_alive` in more detail. |
| 556 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 557 | Implicit type conversions |
| 558 | ========================= |
| 559 | |
| 560 | Suppose that instances of two types ``A`` and ``B`` are used in a project, and |
Wenzel Jakob | 8e93df8 | 2016-05-01 02:36:58 +0200 | [diff] [blame] | 561 | that an ``A`` can easily be converted into an instance of type ``B`` (examples of this |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 562 | could be a fixed and an arbitrary precision number type). |
| 563 | |
| 564 | .. code-block:: cpp |
| 565 | |
| 566 | py::class_<A>(m, "A") |
| 567 | /// ... members ... |
| 568 | |
| 569 | py::class_<B>(m, "B") |
| 570 | .def(py::init<A>()) |
| 571 | /// ... members ... |
| 572 | |
| 573 | m.def("func", |
| 574 | [](const B &) { /* .... */ } |
| 575 | ); |
| 576 | |
| 577 | To invoke the function ``func`` using a variable ``a`` containing an ``A`` |
| 578 | instance, we'd have to write ``func(B(a))`` in Python. On the other hand, C++ |
| 579 | will automatically apply an implicit type conversion, which makes it possible |
| 580 | to directly write ``func(a)``. |
| 581 | |
| 582 | In this situation (i.e. where ``B`` has a constructor that converts from |
| 583 | ``A``), the following statement enables similar implicit conversions on the |
| 584 | Python side: |
| 585 | |
| 586 | .. code-block:: cpp |
| 587 | |
| 588 | py::implicitly_convertible<A, B>(); |
| 589 | |
Wenzel Jakob | 9f0dfce | 2016-04-06 17:38:18 +0200 | [diff] [blame] | 590 | Unique pointers |
| 591 | =============== |
| 592 | |
| 593 | Given a class ``Example`` with Python bindings, it's possible to return |
| 594 | instances wrapped in C++11 unique pointers, like so |
| 595 | |
| 596 | .. code-block:: cpp |
| 597 | |
| 598 | std::unique_ptr<Example> create_example() { return std::unique_ptr<Example>(new Example()); } |
| 599 | |
| 600 | .. code-block:: cpp |
| 601 | |
| 602 | m.def("create_example", &create_example); |
| 603 | |
| 604 | In other words, there is nothing special that needs to be done. While returning |
| 605 | unique pointers in this way is allowed, it is *illegal* to use them as function |
| 606 | arguments. For instance, the following function signature cannot be processed |
| 607 | by pybind11. |
| 608 | |
| 609 | .. code-block:: cpp |
| 610 | |
| 611 | void do_something_with_example(std::unique_ptr<Example> ex) { ... } |
| 612 | |
| 613 | The above signature would imply that Python needs to give up ownership of an |
| 614 | object that is passed to this function, which is generally not possible (for |
| 615 | instance, the object might be referenced elsewhere). |
| 616 | |
Wenzel Jakob | f7b5874 | 2016-04-25 23:04:27 +0200 | [diff] [blame] | 617 | .. _smart_pointers: |
| 618 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 619 | Smart pointers |
| 620 | ============== |
| 621 | |
Wenzel Jakob | 9f0dfce | 2016-04-06 17:38:18 +0200 | [diff] [blame] | 622 | This section explains how to pass values that are wrapped in "smart" pointer |
Wenzel Jakob | e84f557 | 2016-04-26 23:19:19 +0200 | [diff] [blame] | 623 | types with internal reference counting. For the simpler C++11 unique pointers, |
| 624 | refer to the previous section. |
Wenzel Jakob | 9f0dfce | 2016-04-06 17:38:18 +0200 | [diff] [blame] | 625 | |
Wenzel Jakob | e84f557 | 2016-04-26 23:19:19 +0200 | [diff] [blame] | 626 | The binding generator for classes, :class:`class_`, takes an optional second |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 627 | template type, which denotes a special *holder* type that is used to manage |
| 628 | references to the object. When wrapping a type named ``Type``, the default |
| 629 | value of this template parameter is ``std::unique_ptr<Type>``, which means that |
| 630 | the object is deallocated when Python's reference count goes to zero. |
| 631 | |
Wenzel Jakob | 1853b65 | 2015-10-18 15:38:50 +0200 | [diff] [blame] | 632 | It is possible to switch to other types of reference counting wrappers or smart |
| 633 | pointers, which is useful in codebases that rely on them. For instance, the |
| 634 | following snippet causes ``std::shared_ptr`` to be used instead. |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 635 | |
| 636 | .. code-block:: cpp |
| 637 | |
Wenzel Jakob | b2c2c79 | 2016-01-17 22:36:40 +0100 | [diff] [blame] | 638 | py::class_<Example, std::shared_ptr<Example> /* <- holder type */> obj(m, "Example"); |
Wenzel Jakob | 5ef1219 | 2015-12-15 17:07:35 +0100 | [diff] [blame] | 639 | |
Wenzel Jakob | b2c2c79 | 2016-01-17 22:36:40 +0100 | [diff] [blame] | 640 | Note that any particular class can only be associated with a single holder type. |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 641 | |
Wenzel Jakob | 1853b65 | 2015-10-18 15:38:50 +0200 | [diff] [blame] | 642 | To enable transparent conversions for functions that take shared pointers as an |
Wenzel Jakob | 5ef1219 | 2015-12-15 17:07:35 +0100 | [diff] [blame] | 643 | argument or that return them, a macro invocation similar to the following must |
Wenzel Jakob | 1853b65 | 2015-10-18 15:38:50 +0200 | [diff] [blame] | 644 | be declared at the top level before any binding code: |
| 645 | |
| 646 | .. code-block:: cpp |
| 647 | |
Wenzel Jakob | b1b7140 | 2015-10-18 16:48:30 +0200 | [diff] [blame] | 648 | PYBIND11_DECLARE_HOLDER_TYPE(T, std::shared_ptr<T>); |
Wenzel Jakob | 1853b65 | 2015-10-18 15:38:50 +0200 | [diff] [blame] | 649 | |
Wenzel Jakob | b2c2c79 | 2016-01-17 22:36:40 +0100 | [diff] [blame] | 650 | .. note:: |
Wenzel Jakob | 61d67f0 | 2015-12-14 12:53:06 +0100 | [diff] [blame] | 651 | |
| 652 | The first argument of :func:`PYBIND11_DECLARE_HOLDER_TYPE` should be a |
| 653 | placeholder name that is used as a template parameter of the second |
| 654 | argument. Thus, feel free to use any identifier, but use it consistently on |
| 655 | both sides; also, don't use the name of a type that already exists in your |
| 656 | codebase. |
| 657 | |
Wenzel Jakob | b2c2c79 | 2016-01-17 22:36:40 +0100 | [diff] [blame] | 658 | One potential stumbling block when using holder types is that they need to be |
| 659 | applied consistently. Can you guess what's broken about the following binding |
| 660 | code? |
Wenzel Jakob | 6e213c9 | 2015-11-24 23:05:58 +0100 | [diff] [blame] | 661 | |
Wenzel Jakob | b2c2c79 | 2016-01-17 22:36:40 +0100 | [diff] [blame] | 662 | .. code-block:: cpp |
Wenzel Jakob | 6e213c9 | 2015-11-24 23:05:58 +0100 | [diff] [blame] | 663 | |
Wenzel Jakob | b2c2c79 | 2016-01-17 22:36:40 +0100 | [diff] [blame] | 664 | class Child { }; |
Wenzel Jakob | 5ef1219 | 2015-12-15 17:07:35 +0100 | [diff] [blame] | 665 | |
Wenzel Jakob | b2c2c79 | 2016-01-17 22:36:40 +0100 | [diff] [blame] | 666 | class Parent { |
| 667 | public: |
| 668 | Parent() : child(std::make_shared<Child>()) { } |
| 669 | Child *get_child() { return child.get(); } /* Hint: ** DON'T DO THIS ** */ |
| 670 | private: |
| 671 | std::shared_ptr<Child> child; |
| 672 | }; |
Wenzel Jakob | 5ef1219 | 2015-12-15 17:07:35 +0100 | [diff] [blame] | 673 | |
Wenzel Jakob | b2c2c79 | 2016-01-17 22:36:40 +0100 | [diff] [blame] | 674 | PYBIND11_PLUGIN(example) { |
| 675 | py::module m("example"); |
Wenzel Jakob | 5ef1219 | 2015-12-15 17:07:35 +0100 | [diff] [blame] | 676 | |
Wenzel Jakob | b2c2c79 | 2016-01-17 22:36:40 +0100 | [diff] [blame] | 677 | py::class_<Child, std::shared_ptr<Child>>(m, "Child"); |
| 678 | |
| 679 | py::class_<Parent, std::shared_ptr<Parent>>(m, "Parent") |
| 680 | .def(py::init<>()) |
| 681 | .def("get_child", &Parent::get_child); |
| 682 | |
| 683 | return m.ptr(); |
| 684 | } |
| 685 | |
| 686 | The following Python code will cause undefined behavior (and likely a |
| 687 | segmentation fault). |
| 688 | |
| 689 | .. code-block:: python |
| 690 | |
| 691 | from example import Parent |
| 692 | print(Parent().get_child()) |
| 693 | |
| 694 | The problem is that ``Parent::get_child()`` returns a pointer to an instance of |
| 695 | ``Child``, but the fact that this instance is already managed by |
| 696 | ``std::shared_ptr<...>`` is lost when passing raw pointers. In this case, |
| 697 | pybind11 will create a second independent ``std::shared_ptr<...>`` that also |
| 698 | claims ownership of the pointer. In the end, the object will be freed **twice** |
| 699 | since these shared pointers have no way of knowing about each other. |
| 700 | |
| 701 | There are two ways to resolve this issue: |
| 702 | |
| 703 | 1. For types that are managed by a smart pointer class, never use raw pointers |
| 704 | in function arguments or return values. In other words: always consistently |
| 705 | wrap pointers into their designated holder types (such as |
| 706 | ``std::shared_ptr<...>``). In this case, the signature of ``get_child()`` |
| 707 | should be modified as follows: |
| 708 | |
| 709 | .. code-block:: cpp |
| 710 | |
| 711 | std::shared_ptr<Child> get_child() { return child; } |
| 712 | |
| 713 | 2. Adjust the definition of ``Child`` by specifying |
| 714 | ``std::enable_shared_from_this<T>`` (see cppreference_ for details) as a |
| 715 | base class. This adds a small bit of information to ``Child`` that allows |
| 716 | pybind11 to realize that there is already an existing |
| 717 | ``std::shared_ptr<...>`` and communicate with it. In this case, the |
| 718 | declaration of ``Child`` should look as follows: |
Wenzel Jakob | 5ef1219 | 2015-12-15 17:07:35 +0100 | [diff] [blame] | 719 | |
Wenzel Jakob | 6e213c9 | 2015-11-24 23:05:58 +0100 | [diff] [blame] | 720 | .. _cppreference: http://en.cppreference.com/w/cpp/memory/enable_shared_from_this |
| 721 | |
Wenzel Jakob | b2c2c79 | 2016-01-17 22:36:40 +0100 | [diff] [blame] | 722 | .. code-block:: cpp |
| 723 | |
| 724 | class Child : public std::enable_shared_from_this<Child> { }; |
| 725 | |
Wenzel Jakob | 5ef1219 | 2015-12-15 17:07:35 +0100 | [diff] [blame] | 726 | .. seealso:: |
| 727 | |
| 728 | The file :file:`example/example8.cpp` contains a complete example that |
| 729 | demonstrates how to work with custom reference-counting holder types in |
| 730 | more detail. |
| 731 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 732 | .. _custom_constructors: |
| 733 | |
| 734 | Custom constructors |
| 735 | =================== |
| 736 | |
| 737 | The syntax for binding constructors was previously introduced, but it only |
| 738 | works when a constructor with the given parameters actually exists on the C++ |
| 739 | side. To extend this to more general cases, let's take a look at what actually |
| 740 | happens under the hood: the following statement |
| 741 | |
| 742 | .. code-block:: cpp |
| 743 | |
| 744 | py::class_<Example>(m, "Example") |
| 745 | .def(py::init<int>()); |
| 746 | |
| 747 | is short hand notation for |
| 748 | |
| 749 | .. code-block:: cpp |
| 750 | |
| 751 | py::class_<Example>(m, "Example") |
| 752 | .def("__init__", |
| 753 | [](Example &instance, int arg) { |
| 754 | new (&instance) Example(arg); |
| 755 | } |
| 756 | ); |
| 757 | |
| 758 | In other words, :func:`init` creates an anonymous function that invokes an |
| 759 | in-place constructor. Memory allocation etc. is already take care of beforehand |
| 760 | within pybind11. |
| 761 | |
| 762 | Catching and throwing exceptions |
| 763 | ================================ |
| 764 | |
| 765 | When C++ code invoked from Python throws an ``std::exception``, it is |
| 766 | automatically converted into a Python ``Exception``. pybind11 defines multiple |
| 767 | special exception classes that will map to different types of Python |
| 768 | exceptions: |
| 769 | |
Wenzel Jakob | f64feaf | 2016-04-28 14:33:45 +0200 | [diff] [blame] | 770 | .. tabularcolumns:: |p{0.5\textwidth}|p{0.45\textwidth}| |
| 771 | |
Wenzel Jakob | 978e376 | 2016-04-07 18:00:41 +0200 | [diff] [blame] | 772 | +--------------------------------------+------------------------------+ |
| 773 | | C++ exception type | Python exception type | |
| 774 | +======================================+==============================+ |
| 775 | | :class:`std::exception` | ``RuntimeError`` | |
| 776 | +--------------------------------------+------------------------------+ |
| 777 | | :class:`std::bad_alloc` | ``MemoryError`` | |
| 778 | +--------------------------------------+------------------------------+ |
| 779 | | :class:`std::domain_error` | ``ValueError`` | |
| 780 | +--------------------------------------+------------------------------+ |
| 781 | | :class:`std::invalid_argument` | ``ValueError`` | |
| 782 | +--------------------------------------+------------------------------+ |
| 783 | | :class:`std::length_error` | ``ValueError`` | |
| 784 | +--------------------------------------+------------------------------+ |
| 785 | | :class:`std::out_of_range` | ``ValueError`` | |
| 786 | +--------------------------------------+------------------------------+ |
| 787 | | :class:`std::range_error` | ``ValueError`` | |
| 788 | +--------------------------------------+------------------------------+ |
| 789 | | :class:`pybind11::stop_iteration` | ``StopIteration`` (used to | |
| 790 | | | implement custom iterators) | |
| 791 | +--------------------------------------+------------------------------+ |
| 792 | | :class:`pybind11::index_error` | ``IndexError`` (used to | |
| 793 | | | indicate out of bounds | |
| 794 | | | accesses in ``__getitem__``, | |
| 795 | | | ``__setitem__``, etc.) | |
| 796 | +--------------------------------------+------------------------------+ |
Sergey Lyskov | a95bde1 | 2016-05-08 19:31:55 -0400 | [diff] [blame] | 797 | | :class:`pybind11::value_error` | ``ValueError`` (used to | |
| 798 | | | indicate wrong value passed | |
| 799 | | | in ``container.remove(...)`` | |
| 800 | +--------------------------------------+------------------------------+ |
Wenzel Jakob | 978e376 | 2016-04-07 18:00:41 +0200 | [diff] [blame] | 801 | | :class:`pybind11::error_already_set` | Indicates that the Python | |
| 802 | | | exception flag has already | |
| 803 | | | been initialized | |
| 804 | +--------------------------------------+------------------------------+ |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 805 | |
| 806 | When a Python function invoked from C++ throws an exception, it is converted |
| 807 | into a C++ exception of type :class:`error_already_set` whose string payload |
| 808 | contains a textual summary. |
| 809 | |
| 810 | There is also a special exception :class:`cast_error` that is thrown by |
| 811 | :func:`handle::call` when the input arguments cannot be converted to Python |
| 812 | objects. |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 813 | |
Wenzel Jakob | 9e0a056 | 2016-05-05 20:33:54 +0200 | [diff] [blame] | 814 | .. _opaque: |
| 815 | |
| 816 | Treating STL data structures as opaque objects |
| 817 | ============================================== |
| 818 | |
| 819 | pybind11 heavily relies on a template matching mechanism to convert parameters |
| 820 | and return values that are constructed from STL data types such as vectors, |
| 821 | linked lists, hash tables, etc. This even works in a recursive manner, for |
| 822 | instance to deal with lists of hash maps of pairs of elementary and custom |
| 823 | types, etc. |
| 824 | |
| 825 | However, a fundamental limitation of this approach is that internal conversions |
| 826 | between Python and C++ types involve a copy operation that prevents |
| 827 | pass-by-reference semantics. What does this mean? |
| 828 | |
| 829 | Suppose we bind the following function |
| 830 | |
| 831 | .. code-block:: cpp |
| 832 | |
| 833 | void append_1(std::vector<int> &v) { |
| 834 | v.push_back(1); |
| 835 | } |
| 836 | |
| 837 | and call it from Python, the following happens: |
| 838 | |
| 839 | .. code-block:: python |
| 840 | |
| 841 | >>> v = [5, 6] |
| 842 | >>> append_1(v) |
| 843 | >>> print(v) |
| 844 | [5, 6] |
| 845 | |
| 846 | As you can see, when passing STL data structures by reference, modifications |
| 847 | are not propagated back the Python side. A similar situation arises when |
| 848 | exposing STL data structures using the ``def_readwrite`` or ``def_readonly`` |
| 849 | functions: |
| 850 | |
| 851 | .. code-block:: cpp |
| 852 | |
| 853 | /* ... definition ... */ |
| 854 | |
| 855 | class MyClass { |
| 856 | std::vector<int> contents; |
| 857 | }; |
| 858 | |
| 859 | /* ... binding code ... */ |
| 860 | |
| 861 | py::class_<MyClass>(m, "MyClass") |
| 862 | .def(py::init<>) |
| 863 | .def_readwrite("contents", &MyClass::contents); |
| 864 | |
| 865 | In this case, properties can be read and written in their entirety. However, an |
| 866 | ``append`` operaton involving such a list type has no effect: |
| 867 | |
| 868 | .. code-block:: python |
| 869 | |
| 870 | >>> m = MyClass() |
| 871 | >>> m.contents = [5, 6] |
| 872 | >>> print(m.contents) |
| 873 | [5, 6] |
| 874 | >>> m.contents.append(7) |
| 875 | >>> print(m.contents) |
| 876 | [5, 6] |
| 877 | |
| 878 | To deal with both of the above situations, pybind11 provides a macro named |
| 879 | ``PYBIND11_MAKE_OPAQUE(T)`` that disables the template-based conversion |
| 880 | machinery of types, thus rendering them *opaque*. The contents of opaque |
| 881 | objects are never inspected or extracted, hence they can be passed by |
| 882 | reference. For instance, to turn ``std::vector<int>`` into an opaque type, add |
| 883 | the declaration |
| 884 | |
| 885 | .. code-block:: cpp |
| 886 | |
| 887 | PYBIND11_MAKE_OPAQUE(std::vector<int>); |
| 888 | |
| 889 | before any binding code (e.g. invocations to ``class_::def()``, etc.). This |
| 890 | macro must be specified at the top level, since instantiates a partial template |
| 891 | overload. If your binding code consists of multiple compilation units, it must |
| 892 | be present in every file preceding any usage of ``std::vector<int>``. Opaque |
| 893 | types must also have a corresponding ``class_`` declaration to associate them |
| 894 | with a name in Python, and to define a set of available operations: |
| 895 | |
| 896 | .. code-block:: cpp |
| 897 | |
| 898 | py::class_<std::vector<int>>(m, "IntVector") |
| 899 | .def(py::init<>()) |
| 900 | .def("clear", &std::vector<int>::clear) |
| 901 | .def("pop_back", &std::vector<int>::pop_back) |
| 902 | .def("__len__", [](const std::vector<int> &v) { return v.size(); }) |
| 903 | .def("__iter__", [](std::vector<int> &v) { |
| 904 | return py::make_iterator(v.begin(), v.end()); |
| 905 | }, py::keep_alive<0, 1>()) /* Keep vector alive while iterator is used */ |
| 906 | // .... |
| 907 | |
| 908 | |
| 909 | .. seealso:: |
| 910 | |
| 911 | The file :file:`example/example14.cpp` contains a complete example that |
| 912 | demonstrates how to create and expose opaque types using pybind11 in more |
| 913 | detail. |
| 914 | |
| 915 | .. _eigen: |
| 916 | |
| 917 | Transparent conversion of dense and sparse Eigen data types |
| 918 | =========================================================== |
| 919 | |
| 920 | Eigen [#f1]_ is C++ header-based library for dense and sparse linear algebra. Due to |
| 921 | its popularity and widespread adoption, pybind11 provides transparent |
| 922 | conversion support between Eigen and Scientific Python linear algebra data types. |
| 923 | |
| 924 | Specifically, when including the optional header file :file:`pybind11/eigen.h`, |
Wenzel Jakob | 178c8a8 | 2016-05-10 15:59:01 +0100 | [diff] [blame] | 925 | pybind11 will automatically and transparently convert |
Wenzel Jakob | 9e0a056 | 2016-05-05 20:33:54 +0200 | [diff] [blame] | 926 | |
| 927 | 1. Static and dynamic Eigen dense vectors and matrices to instances of |
| 928 | ``numpy.ndarray`` (and vice versa). |
| 929 | |
| 930 | 1. Eigen sparse vectors and matrices to instances of |
| 931 | ``scipy.sparse.csr_matrix``/``scipy.sparse.csc_matrix`` (and vice versa). |
| 932 | |
| 933 | This makes it possible to bind most kinds of functions that rely on these types. |
| 934 | One major caveat are functions that take Eigen matrices *by reference* and modify |
| 935 | them somehow, in which case the information won't be propagated to the caller. |
| 936 | |
| 937 | .. code-block:: cpp |
| 938 | |
| 939 | /* The Python bindings of this function won't replicate |
| 940 | the intended effect of modifying the function argument */ |
| 941 | void scale_by_2(Eigen::Vector3f &v) { |
| 942 | v *= 2; |
| 943 | } |
| 944 | |
| 945 | To see why this is, refer to the section on :ref:`opaque` (although that |
| 946 | section specifically covers STL data types, the underlying issue is the same). |
| 947 | The next two sections discuss an efficient alternative for exposing the |
| 948 | underlying native Eigen types as opaque objects in a way that still integrates |
| 949 | with NumPy and SciPy. |
| 950 | |
| 951 | .. [#f1] http://eigen.tuxfamily.org |
| 952 | |
| 953 | .. seealso:: |
| 954 | |
| 955 | The file :file:`example/eigen.cpp` contains a complete example that |
| 956 | shows how to pass Eigen sparse and dense data types in more detail. |
| 957 | |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 958 | Buffer protocol |
| 959 | =============== |
| 960 | |
| 961 | Python supports an extremely general and convenient approach for exchanging |
Wenzel Jakob | 9e0a056 | 2016-05-05 20:33:54 +0200 | [diff] [blame] | 962 | data between plugin libraries. Types can expose a buffer view [#f2]_, which |
| 963 | provides fast direct access to the raw internal data representation. Suppose we |
| 964 | want to bind the following simplistic Matrix class: |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 965 | |
| 966 | .. code-block:: cpp |
| 967 | |
| 968 | class Matrix { |
| 969 | public: |
| 970 | Matrix(size_t rows, size_t cols) : m_rows(rows), m_cols(cols) { |
| 971 | m_data = new float[rows*cols]; |
| 972 | } |
| 973 | float *data() { return m_data; } |
| 974 | size_t rows() const { return m_rows; } |
| 975 | size_t cols() const { return m_cols; } |
| 976 | private: |
| 977 | size_t m_rows, m_cols; |
| 978 | float *m_data; |
| 979 | }; |
| 980 | |
| 981 | The following binding code exposes the ``Matrix`` contents as a buffer object, |
Wenzel Jakob | 8e93df8 | 2016-05-01 02:36:58 +0200 | [diff] [blame] | 982 | making it possible to cast Matrices into NumPy arrays. It is even possible to |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 983 | completely avoid copy operations with Python expressions like |
| 984 | ``np.array(matrix_instance, copy = False)``. |
| 985 | |
| 986 | .. code-block:: cpp |
| 987 | |
| 988 | py::class_<Matrix>(m, "Matrix") |
| 989 | .def_buffer([](Matrix &m) -> py::buffer_info { |
| 990 | return py::buffer_info( |
Wenzel Jakob | 876eeab | 2016-05-04 22:22:48 +0200 | [diff] [blame] | 991 | m.data(), /* Pointer to buffer */ |
| 992 | sizeof(float), /* Size of one scalar */ |
| 993 | py::format_descriptor<float>::value, /* Python struct-style format descriptor */ |
| 994 | 2, /* Number of dimensions */ |
| 995 | { m.rows(), m.cols() }, /* Buffer dimensions */ |
| 996 | { sizeof(float) * m.rows(), /* Strides (in bytes) for each index */ |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 997 | sizeof(float) } |
| 998 | ); |
| 999 | }); |
| 1000 | |
| 1001 | The snippet above binds a lambda function, which can create ``py::buffer_info`` |
| 1002 | description records on demand describing a given matrix. The contents of |
| 1003 | ``py::buffer_info`` mirror the Python buffer protocol specification. |
| 1004 | |
| 1005 | .. code-block:: cpp |
| 1006 | |
| 1007 | struct buffer_info { |
| 1008 | void *ptr; |
| 1009 | size_t itemsize; |
| 1010 | std::string format; |
| 1011 | int ndim; |
| 1012 | std::vector<size_t> shape; |
| 1013 | std::vector<size_t> strides; |
| 1014 | }; |
| 1015 | |
| 1016 | To create a C++ function that can take a Python buffer object as an argument, |
| 1017 | simply use the type ``py::buffer`` as one of its arguments. Buffers can exist |
| 1018 | in a great variety of configurations, hence some safety checks are usually |
| 1019 | necessary in the function body. Below, you can see an basic example on how to |
| 1020 | define a custom constructor for the Eigen double precision matrix |
| 1021 | (``Eigen::MatrixXd``) type, which supports initialization from compatible |
Wenzel Jakob | 9e0a056 | 2016-05-05 20:33:54 +0200 | [diff] [blame] | 1022 | buffer objects (e.g. a NumPy matrix). |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 1023 | |
| 1024 | .. code-block:: cpp |
| 1025 | |
Wenzel Jakob | 9e0a056 | 2016-05-05 20:33:54 +0200 | [diff] [blame] | 1026 | /* Bind MatrixXd (or some other Eigen type) to Python */ |
| 1027 | typedef Eigen::MatrixXd Matrix; |
| 1028 | |
| 1029 | typedef Matrix::Scalar Scalar; |
| 1030 | constexpr bool rowMajor = Matrix::Flags & Eigen::RowMajorBit; |
| 1031 | |
| 1032 | py::class_<Matrix>(m, "Matrix") |
| 1033 | .def("__init__", [](Matrix &m, py::buffer b) { |
Wenzel Jakob | e762853 | 2016-05-05 10:04:44 +0200 | [diff] [blame] | 1034 | typedef Eigen::Stride<Eigen::Dynamic, Eigen::Dynamic> Strides; |
Wenzel Jakob | e762853 | 2016-05-05 10:04:44 +0200 | [diff] [blame] | 1035 | |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 1036 | /* Request a buffer descriptor from Python */ |
| 1037 | py::buffer_info info = b.request(); |
| 1038 | |
| 1039 | /* Some sanity checks ... */ |
Wenzel Jakob | e762853 | 2016-05-05 10:04:44 +0200 | [diff] [blame] | 1040 | if (info.format != py::format_descriptor<Scalar>::value) |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 1041 | throw std::runtime_error("Incompatible format: expected a double array!"); |
| 1042 | |
| 1043 | if (info.ndim != 2) |
| 1044 | throw std::runtime_error("Incompatible buffer dimension!"); |
| 1045 | |
Wenzel Jakob | e762853 | 2016-05-05 10:04:44 +0200 | [diff] [blame] | 1046 | auto strides = Strides( |
Wenzel Jakob | 9e0a056 | 2016-05-05 20:33:54 +0200 | [diff] [blame] | 1047 | info.strides[rowMajor ? 0 : 1] / sizeof(Scalar), |
| 1048 | info.strides[rowMajor ? 1 : 0] / sizeof(Scalar)); |
Wenzel Jakob | e762853 | 2016-05-05 10:04:44 +0200 | [diff] [blame] | 1049 | |
| 1050 | auto map = Eigen::Map<Matrix, 0, Strides>( |
Wenzel Jakob | 9e0a056 | 2016-05-05 20:33:54 +0200 | [diff] [blame] | 1051 | static_cat<Scalar *>(info.ptr), info.shape[0], info.shape[1], strides); |
Wenzel Jakob | e762853 | 2016-05-05 10:04:44 +0200 | [diff] [blame] | 1052 | |
| 1053 | new (&m) Matrix(map); |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 1054 | }); |
| 1055 | |
Wenzel Jakob | 9e0a056 | 2016-05-05 20:33:54 +0200 | [diff] [blame] | 1056 | For reference, the ``def_buffer()`` call for this Eigen data type should look |
| 1057 | as follows: |
| 1058 | |
| 1059 | .. code-block:: cpp |
| 1060 | |
| 1061 | .def_buffer([](Matrix &m) -> py::buffer_info { |
| 1062 | return py::buffer_info( |
| 1063 | m.data(), /* Pointer to buffer */ |
| 1064 | sizeof(Scalar), /* Size of one scalar */ |
| 1065 | /* Python struct-style format descriptor */ |
| 1066 | py::format_descriptor<Scalar>::value, |
| 1067 | /* Number of dimensions */ |
| 1068 | 2, |
| 1069 | /* Buffer dimensions */ |
| 1070 | { (size_t) m.rows(), |
| 1071 | (size_t) m.cols() }, |
| 1072 | /* Strides (in bytes) for each index */ |
| 1073 | { sizeof(Scalar) * (rowMajor ? m.cols() : 1), |
| 1074 | sizeof(Scalar) * (rowMajor ? 1 : m.rows()) } |
| 1075 | ); |
| 1076 | }) |
| 1077 | |
| 1078 | For a much easier approach of binding Eigen types (although with some |
| 1079 | limitations), refer to the section on :ref:`eigen`. |
| 1080 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 1081 | .. seealso:: |
| 1082 | |
| 1083 | The file :file:`example/example7.cpp` contains a complete example that |
| 1084 | demonstrates using the buffer protocol with pybind11 in more detail. |
| 1085 | |
Wenzel Jakob | 9e0a056 | 2016-05-05 20:33:54 +0200 | [diff] [blame] | 1086 | .. [#f2] http://docs.python.org/3/c-api/buffer.html |
Wenzel Jakob | 978e376 | 2016-04-07 18:00:41 +0200 | [diff] [blame] | 1087 | |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 1088 | NumPy support |
| 1089 | ============= |
| 1090 | |
| 1091 | By exchanging ``py::buffer`` with ``py::array`` in the above snippet, we can |
| 1092 | restrict the function so that it only accepts NumPy arrays (rather than any |
Wenzel Jakob | 978e376 | 2016-04-07 18:00:41 +0200 | [diff] [blame] | 1093 | type of Python object satisfying the buffer protocol). |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 1094 | |
| 1095 | In many situations, we want to define a function which only accepts a NumPy |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 1096 | array of a certain data type. This is possible via the ``py::array_t<T>`` |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 1097 | template. For instance, the following function requires the argument to be a |
Wenzel Jakob | f1032df | 2016-05-05 10:00:00 +0200 | [diff] [blame] | 1098 | NumPy array containing double precision values. |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 1099 | |
| 1100 | .. code-block:: cpp |
| 1101 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 1102 | void f(py::array_t<double> array); |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 1103 | |
Wenzel Jakob | f1032df | 2016-05-05 10:00:00 +0200 | [diff] [blame] | 1104 | When it is invoked with a different type (e.g. an integer or a list of |
| 1105 | integers), the binding code will attempt to cast the input into a NumPy array |
| 1106 | of the requested type. Note that this feature requires the |
| 1107 | :file:``pybind11/numpy.h`` header to be included. |
| 1108 | |
| 1109 | Data in NumPy arrays is not guaranteed to packed in a dense manner; |
| 1110 | furthermore, entries can be separated by arbitrary column and row strides. |
| 1111 | Sometimes, it can be useful to require a function to only accept dense arrays |
| 1112 | using either the C (row-major) or Fortran (column-major) ordering. This can be |
| 1113 | accomplished via a second template argument with values ``py::array::c_style`` |
| 1114 | or ``py::array::f_style``. |
| 1115 | |
| 1116 | .. code-block:: cpp |
| 1117 | |
Wenzel Jakob | b47a9de | 2016-05-19 16:02:09 +0200 | [diff] [blame] | 1118 | void f(py::array_t<double, py::array::c_style | py::array::forcecast> array); |
Wenzel Jakob | f1032df | 2016-05-05 10:00:00 +0200 | [diff] [blame] | 1119 | |
Wenzel Jakob | b47a9de | 2016-05-19 16:02:09 +0200 | [diff] [blame] | 1120 | The ``py::array::forcecast`` argument is the default value of the second |
| 1121 | template paramenter, and it ensures that non-conforming arguments are converted |
| 1122 | into an array satisfying the specified requirements instead of trying the next |
| 1123 | function overload. |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 1124 | |
| 1125 | Vectorizing functions |
| 1126 | ===================== |
| 1127 | |
| 1128 | Suppose we want to bind a function with the following signature to Python so |
| 1129 | that it can process arbitrary NumPy array arguments (vectors, matrices, general |
| 1130 | N-D arrays) in addition to its normal arguments: |
| 1131 | |
| 1132 | .. code-block:: cpp |
| 1133 | |
| 1134 | double my_func(int x, float y, double z); |
| 1135 | |
Wenzel Jakob | 8f4eb00 | 2015-10-15 18:13:33 +0200 | [diff] [blame] | 1136 | After including the ``pybind11/numpy.h`` header, this is extremely simple: |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 1137 | |
| 1138 | .. code-block:: cpp |
| 1139 | |
| 1140 | m.def("vectorized_func", py::vectorize(my_func)); |
| 1141 | |
| 1142 | Invoking the function like below causes 4 calls to be made to ``my_func`` with |
Wenzel Jakob | 8e93df8 | 2016-05-01 02:36:58 +0200 | [diff] [blame] | 1143 | each of the array elements. The significant advantage of this compared to |
Wenzel Jakob | 978e376 | 2016-04-07 18:00:41 +0200 | [diff] [blame] | 1144 | solutions like ``numpy.vectorize()`` is that the loop over the elements runs |
| 1145 | entirely on the C++ side and can be crunched down into a tight, optimized loop |
| 1146 | by the compiler. The result is returned as a NumPy array of type |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 1147 | ``numpy.dtype.float64``. |
| 1148 | |
| 1149 | .. code-block:: python |
| 1150 | |
| 1151 | >>> x = np.array([[1, 3],[5, 7]]) |
| 1152 | >>> y = np.array([[2, 4],[6, 8]]) |
| 1153 | >>> z = 3 |
| 1154 | >>> result = vectorized_func(x, y, z) |
| 1155 | |
| 1156 | The scalar argument ``z`` is transparently replicated 4 times. The input |
| 1157 | arrays ``x`` and ``y`` are automatically converted into the right types (they |
| 1158 | are of type ``numpy.dtype.int64`` but need to be ``numpy.dtype.int32`` and |
| 1159 | ``numpy.dtype.float32``, respectively) |
| 1160 | |
Wenzel Jakob | 8e93df8 | 2016-05-01 02:36:58 +0200 | [diff] [blame] | 1161 | Sometimes we might want to explicitly exclude an argument from the vectorization |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 1162 | because it makes little sense to wrap it in a NumPy array. For instance, |
| 1163 | suppose the function signature was |
| 1164 | |
| 1165 | .. code-block:: cpp |
| 1166 | |
| 1167 | double my_func(int x, float y, my_custom_type *z); |
| 1168 | |
| 1169 | This can be done with a stateful Lambda closure: |
| 1170 | |
| 1171 | .. code-block:: cpp |
| 1172 | |
| 1173 | // Vectorize a lambda function with a capture object (e.g. to exclude some arguments from the vectorization) |
| 1174 | m.def("vectorized_func", |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 1175 | [](py::array_t<int> x, py::array_t<float> y, my_custom_type *z) { |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 1176 | auto stateful_closure = [z](int x, float y) { return my_func(x, y, z); }; |
| 1177 | return py::vectorize(stateful_closure)(x, y); |
| 1178 | } |
| 1179 | ); |
| 1180 | |
Wenzel Jakob | 6158716 | 2016-01-18 22:38:52 +0100 | [diff] [blame] | 1181 | In cases where the computation is too complicated to be reduced to |
| 1182 | ``vectorize``, it will be necessary to create and access the buffer contents |
| 1183 | manually. The following snippet contains a complete example that shows how this |
| 1184 | works (the code is somewhat contrived, since it could have been done more |
| 1185 | simply using ``vectorize``). |
| 1186 | |
| 1187 | .. code-block:: cpp |
| 1188 | |
| 1189 | #include <pybind11/pybind11.h> |
| 1190 | #include <pybind11/numpy.h> |
| 1191 | |
| 1192 | namespace py = pybind11; |
| 1193 | |
| 1194 | py::array_t<double> add_arrays(py::array_t<double> input1, py::array_t<double> input2) { |
| 1195 | auto buf1 = input1.request(), buf2 = input2.request(); |
| 1196 | |
| 1197 | if (buf1.ndim != 1 || buf2.ndim != 1) |
| 1198 | throw std::runtime_error("Number of dimensions must be one"); |
| 1199 | |
| 1200 | if (buf1.shape[0] != buf2.shape[0]) |
| 1201 | throw std::runtime_error("Input shapes must match"); |
| 1202 | |
| 1203 | auto result = py::array(py::buffer_info( |
| 1204 | nullptr, /* Pointer to data (nullptr -> ask NumPy to allocate!) */ |
| 1205 | sizeof(double), /* Size of one item */ |
Nils Werner | f7048f2 | 2016-05-19 11:17:17 +0200 | [diff] [blame] | 1206 | py::format_descriptor<double>::value(), /* Buffer format */ |
Wenzel Jakob | 6158716 | 2016-01-18 22:38:52 +0100 | [diff] [blame] | 1207 | buf1.ndim, /* How many dimensions? */ |
| 1208 | { buf1.shape[0] }, /* Number of elements for each dimension */ |
| 1209 | { sizeof(double) } /* Strides for each dimension */ |
| 1210 | )); |
| 1211 | |
| 1212 | auto buf3 = result.request(); |
| 1213 | |
| 1214 | double *ptr1 = (double *) buf1.ptr, |
| 1215 | *ptr2 = (double *) buf2.ptr, |
| 1216 | *ptr3 = (double *) buf3.ptr; |
| 1217 | |
| 1218 | for (size_t idx = 0; idx < buf1.shape[0]; idx++) |
| 1219 | ptr3[idx] = ptr1[idx] + ptr2[idx]; |
| 1220 | |
| 1221 | return result; |
| 1222 | } |
| 1223 | |
| 1224 | PYBIND11_PLUGIN(test) { |
| 1225 | py::module m("test"); |
| 1226 | m.def("add_arrays", &add_arrays, "Add two NumPy arrays"); |
| 1227 | return m.ptr(); |
| 1228 | } |
| 1229 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 1230 | .. seealso:: |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 1231 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 1232 | The file :file:`example/example10.cpp` contains a complete example that |
| 1233 | demonstrates using :func:`vectorize` in more detail. |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 1234 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 1235 | Functions taking Python objects as arguments |
| 1236 | ============================================ |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 1237 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 1238 | pybind11 exposes all major Python types using thin C++ wrapper classes. These |
| 1239 | wrapper classes can also be used as parameters of functions in bindings, which |
| 1240 | makes it possible to directly work with native Python types on the C++ side. |
| 1241 | For instance, the following statement iterates over a Python ``dict``: |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 1242 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 1243 | .. code-block:: cpp |
| 1244 | |
| 1245 | void print_dict(py::dict dict) { |
| 1246 | /* Easily interact with Python types */ |
| 1247 | for (auto item : dict) |
| 1248 | std::cout << "key=" << item.first << ", " |
| 1249 | << "value=" << item.second << std::endl; |
| 1250 | } |
| 1251 | |
| 1252 | Available types include :class:`handle`, :class:`object`, :class:`bool_`, |
Wenzel Jakob | 27e8e10 | 2016-01-17 22:36:37 +0100 | [diff] [blame] | 1253 | :class:`int_`, :class:`float_`, :class:`str`, :class:`bytes`, :class:`tuple`, |
Wenzel Jakob | f64feaf | 2016-04-28 14:33:45 +0200 | [diff] [blame] | 1254 | :class:`list`, :class:`dict`, :class:`slice`, :class:`none`, :class:`capsule`, |
| 1255 | :class:`iterable`, :class:`iterator`, :class:`function`, :class:`buffer`, |
| 1256 | :class:`array`, and :class:`array_t`. |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 1257 | |
Wenzel Jakob | 436b731 | 2015-10-20 01:04:30 +0200 | [diff] [blame] | 1258 | In this kind of mixed code, it is often necessary to convert arbitrary C++ |
| 1259 | types to Python, which can be done using :func:`cast`: |
| 1260 | |
| 1261 | .. code-block:: cpp |
| 1262 | |
| 1263 | MyClass *cls = ..; |
| 1264 | py::object obj = py::cast(cls); |
| 1265 | |
| 1266 | The reverse direction uses the following syntax: |
| 1267 | |
| 1268 | .. code-block:: cpp |
| 1269 | |
| 1270 | py::object obj = ...; |
| 1271 | MyClass *cls = obj.cast<MyClass *>(); |
| 1272 | |
| 1273 | When conversion fails, both directions throw the exception :class:`cast_error`. |
Wenzel Jakob | 178c8a8 | 2016-05-10 15:59:01 +0100 | [diff] [blame] | 1274 | It is also possible to call python functions via ``operator()``. |
| 1275 | |
| 1276 | .. code-block:: cpp |
| 1277 | |
| 1278 | py::function f = <...>; |
| 1279 | py::object result_py = f(1234, "hello", some_instance); |
| 1280 | MyClass &result = result_py.cast<MyClass>(); |
| 1281 | |
| 1282 | The special ``f(*args)`` and ``f(*args, **kwargs)`` syntax is also supported to |
| 1283 | supply arbitrary argument and keyword lists, although these cannot be mixed |
| 1284 | with other parameters. |
| 1285 | |
| 1286 | .. code-block:: cpp |
| 1287 | |
| 1288 | py::function f = <...>; |
| 1289 | py::tuple args = py::make_tuple(1234); |
| 1290 | py::dict kwargs; |
| 1291 | kwargs["y"] = py::cast(5678); |
| 1292 | py::object result = f(*args, **kwargs); |
Wenzel Jakob | 436b731 | 2015-10-20 01:04:30 +0200 | [diff] [blame] | 1293 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 1294 | .. seealso:: |
| 1295 | |
| 1296 | The file :file:`example/example2.cpp` contains a complete example that |
Wenzel Jakob | 178c8a8 | 2016-05-10 15:59:01 +0100 | [diff] [blame] | 1297 | demonstrates passing native Python types in more detail. The file |
| 1298 | :file:`example/example11.cpp` discusses usage of ``args`` and ``kwargs``. |
Wenzel Jakob | 2ac5044 | 2016-01-17 22:36:35 +0100 | [diff] [blame] | 1299 | |
| 1300 | Default arguments revisited |
| 1301 | =========================== |
| 1302 | |
| 1303 | The section on :ref:`default_args` previously discussed basic usage of default |
| 1304 | arguments using pybind11. One noteworthy aspect of their implementation is that |
| 1305 | default arguments are converted to Python objects right at declaration time. |
| 1306 | Consider the following example: |
| 1307 | |
| 1308 | .. code-block:: cpp |
| 1309 | |
| 1310 | py::class_<MyClass>("MyClass") |
| 1311 | .def("myFunction", py::arg("arg") = SomeType(123)); |
| 1312 | |
| 1313 | In this case, pybind11 must already be set up to deal with values of the type |
| 1314 | ``SomeType`` (via a prior instantiation of ``py::class_<SomeType>``), or an |
| 1315 | exception will be thrown. |
| 1316 | |
| 1317 | Another aspect worth highlighting is that the "preview" of the default argument |
| 1318 | in the function signature is generated using the object's ``__repr__`` method. |
| 1319 | If not available, the signature may not be very helpful, e.g.: |
| 1320 | |
| 1321 | .. code-block:: python |
| 1322 | |
| 1323 | FUNCTIONS |
| 1324 | ... |
| 1325 | | myFunction(...) |
Wenzel Jakob | 48548ea | 2016-01-17 22:36:44 +0100 | [diff] [blame] | 1326 | | Signature : (MyClass, arg : SomeType = <SomeType object at 0x101b7b080>) -> NoneType |
Wenzel Jakob | 2ac5044 | 2016-01-17 22:36:35 +0100 | [diff] [blame] | 1327 | ... |
| 1328 | |
| 1329 | The first way of addressing this is by defining ``SomeType.__repr__``. |
| 1330 | Alternatively, it is possible to specify the human-readable preview of the |
| 1331 | default argument manually using the ``arg_t`` notation: |
| 1332 | |
| 1333 | .. code-block:: cpp |
| 1334 | |
| 1335 | py::class_<MyClass>("MyClass") |
| 1336 | .def("myFunction", py::arg_t<SomeType>("arg", SomeType(123), "SomeType(123)")); |
| 1337 | |
Wenzel Jakob | c769fce | 2016-03-03 12:03:30 +0100 | [diff] [blame] | 1338 | Sometimes it may be necessary to pass a null pointer value as a default |
| 1339 | argument. In this case, remember to cast it to the underlying type in question, |
| 1340 | like so: |
| 1341 | |
| 1342 | .. code-block:: cpp |
| 1343 | |
| 1344 | py::class_<MyClass>("MyClass") |
| 1345 | .def("myFunction", py::arg("arg") = (SomeType *) nullptr); |
| 1346 | |
Wenzel Jakob | 178c8a8 | 2016-05-10 15:59:01 +0100 | [diff] [blame] | 1347 | Binding functions that accept arbitrary numbers of arguments and keywords arguments |
| 1348 | =================================================================================== |
| 1349 | |
| 1350 | Python provides a useful mechanism to define functions that accept arbitrary |
| 1351 | numbers of arguments and keyword arguments: |
| 1352 | |
| 1353 | .. code-block:: cpp |
| 1354 | |
| 1355 | def generic(*args, **kwargs): |
| 1356 | # .. do something with args and kwargs |
| 1357 | |
| 1358 | Such functions can also be created using pybind11: |
| 1359 | |
| 1360 | .. code-block:: cpp |
| 1361 | |
| 1362 | void generic(py::args args, py::kwargs kwargs) { |
| 1363 | /// .. do something with args |
| 1364 | if (kwargs) |
| 1365 | /// .. do something with kwargs |
| 1366 | } |
| 1367 | |
| 1368 | /// Binding code |
| 1369 | m.def("generic", &generic); |
| 1370 | |
| 1371 | (See ``example/example11.cpp``). The class ``py::args`` derives from |
| 1372 | ``py::list`` and ``py::kwargs`` derives from ``py::dict`` Note that the |
| 1373 | ``kwargs`` argument is invalid if no keyword arguments were actually provided. |
| 1374 | Please refer to the other examples for details on how to iterate over these, |
| 1375 | and on how to cast their entries into C++ objects. |
| 1376 | |
Wenzel Jakob | 2dfbade | 2016-01-17 22:36:37 +0100 | [diff] [blame] | 1377 | Partitioning code over multiple extension modules |
| 1378 | ================================================= |
| 1379 | |
Wenzel Jakob | 90d2f5e | 2016-04-11 14:30:11 +0200 | [diff] [blame] | 1380 | It's straightforward to split binding code over multiple extension modules, |
| 1381 | while referencing types that are declared elsewhere. Everything "just" works |
| 1382 | without any special precautions. One exception to this rule occurs when |
| 1383 | extending a type declared in another extension module. Recall the basic example |
| 1384 | from Section :ref:`inheritance`. |
Wenzel Jakob | 2dfbade | 2016-01-17 22:36:37 +0100 | [diff] [blame] | 1385 | |
| 1386 | .. code-block:: cpp |
| 1387 | |
| 1388 | py::class_<Pet> pet(m, "Pet"); |
| 1389 | pet.def(py::init<const std::string &>()) |
| 1390 | .def_readwrite("name", &Pet::name); |
| 1391 | |
| 1392 | py::class_<Dog>(m, "Dog", pet /* <- specify parent */) |
| 1393 | .def(py::init<const std::string &>()) |
| 1394 | .def("bark", &Dog::bark); |
| 1395 | |
| 1396 | Suppose now that ``Pet`` bindings are defined in a module named ``basic``, |
| 1397 | whereas the ``Dog`` bindings are defined somewhere else. The challenge is of |
| 1398 | course that the variable ``pet`` is not available anymore though it is needed |
| 1399 | to indicate the inheritance relationship to the constructor of ``class_<Dog>``. |
| 1400 | However, it can be acquired as follows: |
| 1401 | |
| 1402 | .. code-block:: cpp |
| 1403 | |
| 1404 | py::object pet = (py::object) py::module::import("basic").attr("Pet"); |
| 1405 | |
| 1406 | py::class_<Dog>(m, "Dog", pet) |
| 1407 | .def(py::init<const std::string &>()) |
| 1408 | .def("bark", &Dog::bark); |
| 1409 | |
Wenzel Jakob | 8d862b3 | 2016-03-06 13:37:22 +0100 | [diff] [blame] | 1410 | Alternatively, we can rely on the ``base`` tag, which performs an automated |
| 1411 | lookup of the corresponding Python type. However, this also requires invoking |
| 1412 | the ``import`` function once to ensure that the pybind11 binding code of the |
| 1413 | module ``basic`` has been executed. |
| 1414 | |
Wenzel Jakob | 8d862b3 | 2016-03-06 13:37:22 +0100 | [diff] [blame] | 1415 | .. code-block:: cpp |
| 1416 | |
| 1417 | py::module::import("basic"); |
| 1418 | |
| 1419 | py::class_<Dog>(m, "Dog", py::base<Pet>()) |
| 1420 | .def(py::init<const std::string &>()) |
| 1421 | .def("bark", &Dog::bark); |
Wenzel Jakob | eda978e | 2016-03-15 15:05:40 +0100 | [diff] [blame] | 1422 | |
Wenzel Jakob | 978e376 | 2016-04-07 18:00:41 +0200 | [diff] [blame] | 1423 | Naturally, both methods will fail when there are cyclic dependencies. |
| 1424 | |
Wenzel Jakob | 90d2f5e | 2016-04-11 14:30:11 +0200 | [diff] [blame] | 1425 | Note that compiling code which has its default symbol visibility set to |
| 1426 | *hidden* (e.g. via the command line flag ``-fvisibility=hidden`` on GCC/Clang) can interfere with the |
| 1427 | ability to access types defined in another extension module. Workarounds |
| 1428 | include changing the global symbol visibility (not recommended, because it will |
| 1429 | lead unnecessarily large binaries) or manually exporting types that are |
| 1430 | accessed by multiple extension modules: |
| 1431 | |
| 1432 | .. code-block:: cpp |
| 1433 | |
| 1434 | #ifdef _WIN32 |
| 1435 | # define EXPORT_TYPE __declspec(dllexport) |
| 1436 | #else |
| 1437 | # define EXPORT_TYPE __attribute__ ((visibility("default"))) |
| 1438 | #endif |
| 1439 | |
| 1440 | class EXPORT_TYPE Dog : public Animal { |
| 1441 | ... |
| 1442 | }; |
| 1443 | |
| 1444 | |
Wenzel Jakob | 1c329aa | 2016-04-13 02:37:36 +0200 | [diff] [blame] | 1445 | Pickling support |
| 1446 | ================ |
| 1447 | |
| 1448 | Python's ``pickle`` module provides a powerful facility to serialize and |
| 1449 | de-serialize a Python object graph into a binary data stream. To pickle and |
Wenzel Jakob | 3d0e6ff | 2016-04-13 11:48:10 +0200 | [diff] [blame] | 1450 | unpickle C++ classes using pybind11, two additional functions must be provided. |
Wenzel Jakob | 1c329aa | 2016-04-13 02:37:36 +0200 | [diff] [blame] | 1451 | Suppose the class in question has the following signature: |
| 1452 | |
| 1453 | .. code-block:: cpp |
| 1454 | |
| 1455 | class Pickleable { |
| 1456 | public: |
| 1457 | Pickleable(const std::string &value) : m_value(value) { } |
| 1458 | const std::string &value() const { return m_value; } |
| 1459 | |
| 1460 | void setExtra(int extra) { m_extra = extra; } |
| 1461 | int extra() const { return m_extra; } |
| 1462 | private: |
| 1463 | std::string m_value; |
| 1464 | int m_extra = 0; |
| 1465 | }; |
| 1466 | |
Wenzel Jakob | 9e0a056 | 2016-05-05 20:33:54 +0200 | [diff] [blame] | 1467 | The binding code including the requisite ``__setstate__`` and ``__getstate__`` methods [#f3]_ |
Wenzel Jakob | 1c329aa | 2016-04-13 02:37:36 +0200 | [diff] [blame] | 1468 | looks as follows: |
| 1469 | |
| 1470 | .. code-block:: cpp |
| 1471 | |
| 1472 | py::class_<Pickleable>(m, "Pickleable") |
| 1473 | .def(py::init<std::string>()) |
| 1474 | .def("value", &Pickleable::value) |
| 1475 | .def("extra", &Pickleable::extra) |
| 1476 | .def("setExtra", &Pickleable::setExtra) |
| 1477 | .def("__getstate__", [](const Pickleable &p) { |
| 1478 | /* Return a tuple that fully encodes the state of the object */ |
| 1479 | return py::make_tuple(p.value(), p.extra()); |
| 1480 | }) |
| 1481 | .def("__setstate__", [](Pickleable &p, py::tuple t) { |
| 1482 | if (t.size() != 2) |
| 1483 | throw std::runtime_error("Invalid state!"); |
| 1484 | |
Wenzel Jakob | d40885a | 2016-04-13 13:30:05 +0200 | [diff] [blame] | 1485 | /* Invoke the in-place constructor. Note that this is needed even |
| 1486 | when the object just has a trivial default constructor */ |
Wenzel Jakob | 1c329aa | 2016-04-13 02:37:36 +0200 | [diff] [blame] | 1487 | new (&p) Pickleable(t[0].cast<std::string>()); |
| 1488 | |
| 1489 | /* Assign any additional state */ |
| 1490 | p.setExtra(t[1].cast<int>()); |
| 1491 | }); |
| 1492 | |
| 1493 | An instance can now be pickled as follows: |
| 1494 | |
| 1495 | .. code-block:: python |
| 1496 | |
| 1497 | try: |
| 1498 | import cPickle as pickle # Use cPickle on Python 2.7 |
| 1499 | except ImportError: |
| 1500 | import pickle |
| 1501 | |
| 1502 | p = Pickleable("test_value") |
| 1503 | p.setExtra(15) |
Wenzel Jakob | 81e0975 | 2016-04-30 23:13:03 +0200 | [diff] [blame] | 1504 | data = pickle.dumps(p, 2) |
Wenzel Jakob | 1c329aa | 2016-04-13 02:37:36 +0200 | [diff] [blame] | 1505 | |
Wenzel Jakob | 81e0975 | 2016-04-30 23:13:03 +0200 | [diff] [blame] | 1506 | Note that only the cPickle module is supported on Python 2.7. The second |
| 1507 | argument to ``dumps`` is also crucial: it selects the pickle protocol version |
| 1508 | 2, since the older version 1 is not supported. Newer versions are also fineāfor |
| 1509 | instance, specify ``-1`` to always use the latest available version. Beware: |
| 1510 | failure to follow these instructions will cause important pybind11 memory |
| 1511 | allocation routines to be skipped during unpickling, which will likely lead to |
| 1512 | memory corruption and/or segmentation faults. |
Wenzel Jakob | 1c329aa | 2016-04-13 02:37:36 +0200 | [diff] [blame] | 1513 | |
| 1514 | .. seealso:: |
| 1515 | |
| 1516 | The file :file:`example/example15.cpp` contains a complete example that |
| 1517 | demonstrates how to pickle and unpickle types using pybind11 in more detail. |
| 1518 | |
Wenzel Jakob | 9e0a056 | 2016-05-05 20:33:54 +0200 | [diff] [blame] | 1519 | .. [#f3] http://docs.python.org/3/library/pickle.html#pickling-class-instances |
Wenzel Jakob | ef7a9b9 | 2016-04-13 18:41:59 +0200 | [diff] [blame] | 1520 | |
| 1521 | Generating documentation using Sphinx |
| 1522 | ===================================== |
| 1523 | |
Wenzel Jakob | 9e0a056 | 2016-05-05 20:33:54 +0200 | [diff] [blame] | 1524 | Sphinx [#f4]_ has the ability to inspect the signatures and documentation |
Wenzel Jakob | ef7a9b9 | 2016-04-13 18:41:59 +0200 | [diff] [blame] | 1525 | strings in pybind11-based extension modules to automatically generate beautiful |
Wenzel Jakob | 9e0a056 | 2016-05-05 20:33:54 +0200 | [diff] [blame] | 1526 | documentation in a variety formats. The pbtest repository [#f5]_ contains a |
Wenzel Jakob | ef7a9b9 | 2016-04-13 18:41:59 +0200 | [diff] [blame] | 1527 | simple example repository which uses this approach. |
| 1528 | |
| 1529 | There are two potential gotchas when using this approach: first, make sure that |
| 1530 | the resulting strings do not contain any :kbd:`TAB` characters, which break the |
| 1531 | docstring parsing routines. You may want to use C++11 raw string literals, |
| 1532 | which are convenient for multi-line comments. Conveniently, any excess |
| 1533 | indentation will be automatically be removed by Sphinx. However, for this to |
| 1534 | work, it is important that all lines are indented consistently, i.e.: |
| 1535 | |
| 1536 | .. code-block:: cpp |
| 1537 | |
| 1538 | // ok |
| 1539 | m.def("foo", &foo, R"mydelimiter( |
| 1540 | The foo function |
| 1541 | |
| 1542 | Parameters |
| 1543 | ---------- |
| 1544 | )mydelimiter"); |
| 1545 | |
| 1546 | // *not ok* |
| 1547 | m.def("foo", &foo, R"mydelimiter(The foo function |
| 1548 | |
| 1549 | Parameters |
| 1550 | ---------- |
| 1551 | )mydelimiter"); |
| 1552 | |
Wenzel Jakob | 9e0a056 | 2016-05-05 20:33:54 +0200 | [diff] [blame] | 1553 | .. [#f4] http://www.sphinx-doc.org |
| 1554 | .. [#f5] http://github.com/pybind/pbtest |