Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 1 | .. _advanced: |
| 2 | |
| 3 | Advanced topics |
| 4 | ############### |
| 5 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 6 | For brevity, the rest of this chapter assumes that the following two lines are |
| 7 | present: |
| 8 | |
| 9 | .. code-block:: cpp |
| 10 | |
Wenzel Jakob | 8f4eb00 | 2015-10-15 18:13:33 +0200 | [diff] [blame] | 11 | #include <pybind11/pybind11.h> |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 12 | |
Wenzel Jakob | 10e62e1 | 2015-10-15 22:46:07 +0200 | [diff] [blame] | 13 | namespace py = pybind11; |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 14 | |
Wenzel Jakob | de3ad07 | 2016-02-02 11:38:21 +0100 | [diff] [blame] | 15 | Exporting constants and mutable objects |
| 16 | ======================================= |
| 17 | |
| 18 | To expose a C++ constant, use the ``attr`` function to register it in a module |
| 19 | as shown below. The ``int_`` class is one of many small wrapper objects defined |
| 20 | in ``pybind11/pytypes.h``. General objects (including integers) can also be |
| 21 | converted using the function ``cast``. |
| 22 | |
| 23 | .. code-block:: cpp |
| 24 | |
| 25 | PYBIND11_PLUGIN(example) { |
| 26 | py::module m("example", "pybind11 example plugin"); |
| 27 | m.attr("MY_CONSTANT") = py::int_(123); |
| 28 | m.attr("MY_CONSTANT_2") = py::cast(new MyObject()); |
| 29 | } |
| 30 | |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 31 | Operator overloading |
| 32 | ==================== |
| 33 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 34 | Suppose that we're given the following ``Vector2`` class with a vector addition |
| 35 | and scalar multiplication operation, all implemented using overloaded operators |
| 36 | in C++. |
| 37 | |
| 38 | .. code-block:: cpp |
| 39 | |
| 40 | class Vector2 { |
| 41 | public: |
| 42 | Vector2(float x, float y) : x(x), y(y) { } |
| 43 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 44 | Vector2 operator+(const Vector2 &v) const { return Vector2(x + v.x, y + v.y); } |
| 45 | Vector2 operator*(float value) const { return Vector2(x * value, y * value); } |
| 46 | Vector2& operator+=(const Vector2 &v) { x += v.x; y += v.y; return *this; } |
| 47 | Vector2& operator*=(float v) { x *= v; y *= v; return *this; } |
| 48 | |
Wenzel Jakob | f64feaf | 2016-04-28 14:33:45 +0200 | [diff] [blame] | 49 | friend Vector2 operator*(float f, const Vector2 &v) { |
| 50 | return Vector2(f * v.x, f * v.y); |
| 51 | } |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 52 | |
Wenzel Jakob | f64feaf | 2016-04-28 14:33:45 +0200 | [diff] [blame] | 53 | std::string toString() const { |
| 54 | return "[" + std::to_string(x) + ", " + std::to_string(y) + "]"; |
| 55 | } |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 56 | private: |
| 57 | float x, y; |
| 58 | }; |
| 59 | |
| 60 | The following snippet shows how the above operators can be conveniently exposed |
| 61 | to Python. |
| 62 | |
| 63 | .. code-block:: cpp |
| 64 | |
Wenzel Jakob | 8f4eb00 | 2015-10-15 18:13:33 +0200 | [diff] [blame] | 65 | #include <pybind11/operators.h> |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 66 | |
Wenzel Jakob | b1b7140 | 2015-10-18 16:48:30 +0200 | [diff] [blame] | 67 | PYBIND11_PLUGIN(example) { |
Wenzel Jakob | 8f4eb00 | 2015-10-15 18:13:33 +0200 | [diff] [blame] | 68 | py::module m("example", "pybind11 example plugin"); |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 69 | |
| 70 | py::class_<Vector2>(m, "Vector2") |
| 71 | .def(py::init<float, float>()) |
| 72 | .def(py::self + py::self) |
| 73 | .def(py::self += py::self) |
| 74 | .def(py::self *= float()) |
| 75 | .def(float() * py::self) |
| 76 | .def("__repr__", &Vector2::toString); |
| 77 | |
| 78 | return m.ptr(); |
| 79 | } |
| 80 | |
| 81 | Note that a line like |
| 82 | |
| 83 | .. code-block:: cpp |
| 84 | |
| 85 | .def(py::self * float()) |
| 86 | |
| 87 | is really just short hand notation for |
| 88 | |
| 89 | .. code-block:: cpp |
| 90 | |
| 91 | .def("__mul__", [](const Vector2 &a, float b) { |
| 92 | return a * b; |
| 93 | }) |
| 94 | |
| 95 | This can be useful for exposing additional operators that don't exist on the |
| 96 | C++ side, or to perform other types of customization. |
| 97 | |
| 98 | .. note:: |
| 99 | |
| 100 | To use the more convenient ``py::self`` notation, the additional |
Wenzel Jakob | 8f4eb00 | 2015-10-15 18:13:33 +0200 | [diff] [blame] | 101 | header file :file:`pybind11/operators.h` must be included. |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 102 | |
| 103 | .. seealso:: |
| 104 | |
| 105 | The file :file:`example/example3.cpp` contains a complete example that |
| 106 | demonstrates how to work with overloaded operators in more detail. |
| 107 | |
| 108 | Callbacks and passing anonymous functions |
| 109 | ========================================= |
| 110 | |
| 111 | The C++11 standard brought lambda functions and the generic polymorphic |
| 112 | function wrapper ``std::function<>`` to the C++ programming language, which |
| 113 | enable powerful new ways of working with functions. Lambda functions come in |
| 114 | two flavors: stateless lambda function resemble classic function pointers that |
| 115 | link to an anonymous piece of code, while stateful lambda functions |
| 116 | additionally depend on captured variables that are stored in an anonymous |
| 117 | *lambda closure object*. |
| 118 | |
| 119 | Here is a simple example of a C++ function that takes an arbitrary function |
| 120 | (stateful or stateless) with signature ``int -> int`` as an argument and runs |
| 121 | it with the value 10. |
| 122 | |
| 123 | .. code-block:: cpp |
| 124 | |
| 125 | int func_arg(const std::function<int(int)> &f) { |
| 126 | return f(10); |
| 127 | } |
| 128 | |
| 129 | The example below is more involved: it takes a function of signature ``int -> int`` |
| 130 | and returns another function of the same kind. The return value is a stateful |
| 131 | lambda function, which stores the value ``f`` in the capture object and adds 1 to |
| 132 | its return value upon execution. |
| 133 | |
| 134 | .. code-block:: cpp |
| 135 | |
| 136 | std::function<int(int)> func_ret(const std::function<int(int)> &f) { |
| 137 | return [f](int i) { |
| 138 | return f(i) + 1; |
| 139 | }; |
| 140 | } |
| 141 | |
Wenzel Jakob | 8f4eb00 | 2015-10-15 18:13:33 +0200 | [diff] [blame] | 142 | After including the extra header file :file:`pybind11/functional.h`, it is almost |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 143 | trivial to generate binding code for both of these functions. |
| 144 | |
| 145 | .. code-block:: cpp |
| 146 | |
Wenzel Jakob | 8f4eb00 | 2015-10-15 18:13:33 +0200 | [diff] [blame] | 147 | #include <pybind11/functional.h> |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 148 | |
Wenzel Jakob | b1b7140 | 2015-10-18 16:48:30 +0200 | [diff] [blame] | 149 | PYBIND11_PLUGIN(example) { |
Wenzel Jakob | 8f4eb00 | 2015-10-15 18:13:33 +0200 | [diff] [blame] | 150 | py::module m("example", "pybind11 example plugin"); |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 151 | |
| 152 | m.def("func_arg", &func_arg); |
| 153 | m.def("func_ret", &func_ret); |
| 154 | |
| 155 | return m.ptr(); |
| 156 | } |
| 157 | |
| 158 | The following interactive session shows how to call them from Python. |
| 159 | |
| 160 | .. code-block:: python |
| 161 | |
| 162 | $ python |
| 163 | >>> import example |
| 164 | >>> def square(i): |
| 165 | ... return i * i |
| 166 | ... |
| 167 | >>> example.func_arg(square) |
| 168 | 100L |
| 169 | >>> square_plus_1 = example.func_ret(square) |
| 170 | >>> square_plus_1(4) |
| 171 | 17L |
| 172 | >>> |
| 173 | |
| 174 | .. note:: |
| 175 | |
| 176 | This functionality is very useful when generating bindings for callbacks in |
| 177 | C++ libraries (e.g. a graphical user interface library). |
| 178 | |
| 179 | The file :file:`example/example5.cpp` contains a complete example that |
| 180 | demonstrates how to work with callbacks and anonymous functions in more detail. |
| 181 | |
Wenzel Jakob | a4175d6 | 2015-11-17 08:30:34 +0100 | [diff] [blame] | 182 | .. warning:: |
| 183 | |
| 184 | Keep in mind that passing a function from C++ to Python (or vice versa) |
| 185 | will instantiate a piece of wrapper code that translates function |
| 186 | invocations between the two languages. Copying the same function back and |
| 187 | forth between Python and C++ many times in a row will cause these wrappers |
| 188 | to accumulate, which can decrease performance. |
| 189 | |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 190 | Overriding virtual functions in Python |
| 191 | ====================================== |
| 192 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 193 | Suppose that a C++ class or interface has a virtual function that we'd like to |
| 194 | to override from within Python (we'll focus on the class ``Animal``; ``Dog`` is |
| 195 | given as a specific example of how one would do this with traditional C++ |
| 196 | code). |
| 197 | |
| 198 | .. code-block:: cpp |
| 199 | |
| 200 | class Animal { |
| 201 | public: |
| 202 | virtual ~Animal() { } |
| 203 | virtual std::string go(int n_times) = 0; |
| 204 | }; |
| 205 | |
| 206 | class Dog : public Animal { |
| 207 | public: |
| 208 | std::string go(int n_times) { |
| 209 | std::string result; |
| 210 | for (int i=0; i<n_times; ++i) |
| 211 | result += "woof! "; |
| 212 | return result; |
| 213 | } |
| 214 | }; |
| 215 | |
| 216 | Let's also suppose that we are given a plain function which calls the |
| 217 | function ``go()`` on an arbitrary ``Animal`` instance. |
| 218 | |
| 219 | .. code-block:: cpp |
| 220 | |
| 221 | std::string call_go(Animal *animal) { |
| 222 | return animal->go(3); |
| 223 | } |
| 224 | |
| 225 | Normally, the binding code for these classes would look as follows: |
| 226 | |
| 227 | .. code-block:: cpp |
| 228 | |
Wenzel Jakob | b1b7140 | 2015-10-18 16:48:30 +0200 | [diff] [blame] | 229 | PYBIND11_PLUGIN(example) { |
Wenzel Jakob | 8f4eb00 | 2015-10-15 18:13:33 +0200 | [diff] [blame] | 230 | py::module m("example", "pybind11 example plugin"); |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 231 | |
| 232 | py::class_<Animal> animal(m, "Animal"); |
| 233 | animal |
| 234 | .def("go", &Animal::go); |
| 235 | |
| 236 | py::class_<Dog>(m, "Dog", animal) |
| 237 | .def(py::init<>()); |
| 238 | |
| 239 | m.def("call_go", &call_go); |
| 240 | |
| 241 | return m.ptr(); |
| 242 | } |
| 243 | |
| 244 | However, these bindings are impossible to extend: ``Animal`` is not |
| 245 | constructible, and we clearly require some kind of "trampoline" that |
| 246 | redirects virtual calls back to Python. |
| 247 | |
| 248 | Defining a new type of ``Animal`` from within Python is possible but requires a |
| 249 | helper class that is defined as follows: |
| 250 | |
| 251 | .. code-block:: cpp |
| 252 | |
| 253 | class PyAnimal : public Animal { |
| 254 | public: |
| 255 | /* Inherit the constructors */ |
| 256 | using Animal::Animal; |
| 257 | |
| 258 | /* Trampoline (need one for each virtual function) */ |
| 259 | std::string go(int n_times) { |
Wenzel Jakob | b1b7140 | 2015-10-18 16:48:30 +0200 | [diff] [blame] | 260 | PYBIND11_OVERLOAD_PURE( |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 261 | std::string, /* Return type */ |
| 262 | Animal, /* Parent class */ |
| 263 | go, /* Name of function */ |
| 264 | n_times /* Argument(s) */ |
| 265 | ); |
| 266 | } |
| 267 | }; |
| 268 | |
Wenzel Jakob | b1b7140 | 2015-10-18 16:48:30 +0200 | [diff] [blame] | 269 | The macro :func:`PYBIND11_OVERLOAD_PURE` should be used for pure virtual |
| 270 | functions, and :func:`PYBIND11_OVERLOAD` should be used for functions which have |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 271 | a default implementation. The binding code also needs a few minor adaptations |
| 272 | (highlighted): |
| 273 | |
| 274 | .. code-block:: cpp |
| 275 | :emphasize-lines: 4,6,7 |
| 276 | |
Wenzel Jakob | b1b7140 | 2015-10-18 16:48:30 +0200 | [diff] [blame] | 277 | PYBIND11_PLUGIN(example) { |
Wenzel Jakob | 8f4eb00 | 2015-10-15 18:13:33 +0200 | [diff] [blame] | 278 | py::module m("example", "pybind11 example plugin"); |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 279 | |
| 280 | py::class_<PyAnimal> animal(m, "Animal"); |
| 281 | animal |
| 282 | .alias<Animal>() |
| 283 | .def(py::init<>()) |
| 284 | .def("go", &Animal::go); |
| 285 | |
| 286 | py::class_<Dog>(m, "Dog", animal) |
| 287 | .def(py::init<>()); |
| 288 | |
| 289 | m.def("call_go", &call_go); |
| 290 | |
| 291 | return m.ptr(); |
| 292 | } |
| 293 | |
| 294 | Importantly, the trampoline helper class is used as the template argument to |
| 295 | :class:`class_`, and a call to :func:`class_::alias` informs the binding |
| 296 | generator that this is merely an alias for the underlying type ``Animal``. |
| 297 | Following this, we are able to define a constructor as usual. |
| 298 | |
| 299 | The Python session below shows how to override ``Animal::go`` and invoke it via |
| 300 | a virtual method call. |
| 301 | |
Wenzel Jakob | de3ad07 | 2016-02-02 11:38:21 +0100 | [diff] [blame] | 302 | .. code-block:: python |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 303 | |
| 304 | >>> from example import * |
| 305 | >>> d = Dog() |
| 306 | >>> call_go(d) |
| 307 | u'woof! woof! woof! ' |
| 308 | >>> class Cat(Animal): |
| 309 | ... def go(self, n_times): |
| 310 | ... return "meow! " * n_times |
| 311 | ... |
| 312 | >>> c = Cat() |
| 313 | >>> call_go(c) |
| 314 | u'meow! meow! meow! ' |
| 315 | |
| 316 | .. seealso:: |
| 317 | |
| 318 | The file :file:`example/example12.cpp` contains a complete example that |
| 319 | demonstrates how to override virtual functions using pybind11 in more |
| 320 | detail. |
| 321 | |
Wenzel Jakob | ecdd868 | 2015-12-07 18:17:58 +0100 | [diff] [blame] | 322 | |
| 323 | Global Interpreter Lock (GIL) |
| 324 | ============================= |
| 325 | |
| 326 | The classes :class:`gil_scoped_release` and :class:`gil_scoped_acquire` can be |
| 327 | used to acquire and release the global interpreter lock in the body of a C++ |
| 328 | function call. In this way, long-running C++ code can be parallelized using |
| 329 | multiple Python threads. Taking the previous section as an example, this could |
| 330 | be realized as follows (important changes highlighted): |
| 331 | |
| 332 | .. code-block:: cpp |
| 333 | :emphasize-lines: 8,9,33,34 |
| 334 | |
| 335 | class PyAnimal : public Animal { |
| 336 | public: |
| 337 | /* Inherit the constructors */ |
| 338 | using Animal::Animal; |
| 339 | |
| 340 | /* Trampoline (need one for each virtual function) */ |
| 341 | std::string go(int n_times) { |
| 342 | /* Acquire GIL before calling Python code */ |
Wenzel Jakob | a4caa85 | 2015-12-14 12:39:02 +0100 | [diff] [blame] | 343 | py::gil_scoped_acquire acquire; |
Wenzel Jakob | ecdd868 | 2015-12-07 18:17:58 +0100 | [diff] [blame] | 344 | |
| 345 | PYBIND11_OVERLOAD_PURE( |
| 346 | std::string, /* Return type */ |
| 347 | Animal, /* Parent class */ |
| 348 | go, /* Name of function */ |
| 349 | n_times /* Argument(s) */ |
| 350 | ); |
| 351 | } |
| 352 | }; |
| 353 | |
| 354 | PYBIND11_PLUGIN(example) { |
| 355 | py::module m("example", "pybind11 example plugin"); |
| 356 | |
| 357 | py::class_<PyAnimal> animal(m, "Animal"); |
| 358 | animal |
| 359 | .alias<Animal>() |
| 360 | .def(py::init<>()) |
| 361 | .def("go", &Animal::go); |
| 362 | |
| 363 | py::class_<Dog>(m, "Dog", animal) |
| 364 | .def(py::init<>()); |
| 365 | |
| 366 | m.def("call_go", [](Animal *animal) -> std::string { |
| 367 | /* Release GIL before calling into (potentially long-running) C++ code */ |
Wenzel Jakob | a4caa85 | 2015-12-14 12:39:02 +0100 | [diff] [blame] | 368 | py::gil_scoped_release release; |
Wenzel Jakob | ecdd868 | 2015-12-07 18:17:58 +0100 | [diff] [blame] | 369 | return call_go(animal); |
| 370 | }); |
| 371 | |
| 372 | return m.ptr(); |
| 373 | } |
| 374 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 375 | Passing STL data structures |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 376 | =========================== |
| 377 | |
Wenzel Jakob | 8f4eb00 | 2015-10-15 18:13:33 +0200 | [diff] [blame] | 378 | When including the additional header file :file:`pybind11/stl.h`, conversions |
Wenzel Jakob | 978e376 | 2016-04-07 18:00:41 +0200 | [diff] [blame] | 379 | between ``std::vector<>``, ``std::list<>``, ``std::set<>``, and ``std::map<>`` |
| 380 | and the Python ``list``, ``set`` and ``dict`` data structures are automatically |
| 381 | enabled. The types ``std::pair<>`` and ``std::tuple<>`` are already supported |
| 382 | out of the box with just the core :file:`pybind11/pybind11.h` header. |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 383 | |
| 384 | .. note:: |
| 385 | |
Wenzel Jakob | 44db04f | 2015-12-14 12:40:45 +0100 | [diff] [blame] | 386 | Arbitrary nesting of any of these types is supported. |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 387 | |
| 388 | .. seealso:: |
| 389 | |
| 390 | The file :file:`example/example2.cpp` contains a complete example that |
| 391 | demonstrates how to pass STL data types in more detail. |
| 392 | |
Wenzel Jakob | b282595 | 2016-04-13 23:33:00 +0200 | [diff] [blame] | 393 | Binding sequence data types, iterators, the slicing protocol, etc. |
| 394 | ================================================================== |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 395 | |
| 396 | Please refer to the supplemental example for details. |
| 397 | |
| 398 | .. seealso:: |
| 399 | |
| 400 | The file :file:`example/example6.cpp` contains a complete example that |
| 401 | shows how to bind a sequence data type, including length queries |
| 402 | (``__len__``), iterators (``__iter__``), the slicing protocol and other |
| 403 | kinds of useful operations. |
| 404 | |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 405 | Return value policies |
| 406 | ===================== |
| 407 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 408 | Python and C++ use wildly different ways of managing the memory and lifetime of |
| 409 | objects managed by them. This can lead to issues when creating bindings for |
| 410 | functions that return a non-trivial type. Just by looking at the type |
| 411 | information, it is not clear whether Python should take charge of the returned |
| 412 | value and eventually free its resources, or if this is handled on the C++ side. |
| 413 | For this reason, pybind11 provides a several `return value policy` annotations |
| 414 | that can be passed to the :func:`module::def` and :func:`class_::def` |
Wenzel Jakob | 61d67f0 | 2015-12-14 12:53:06 +0100 | [diff] [blame] | 415 | functions. The default policy is :enum:`return_value_policy::automatic`. |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 416 | |
Wenzel Jakob | f64feaf | 2016-04-28 14:33:45 +0200 | [diff] [blame] | 417 | .. tabularcolumns:: |p{0.5\textwidth}|p{0.45\textwidth}| |
| 418 | |
Wenzel Jakob | f7b5874 | 2016-04-25 23:04:27 +0200 | [diff] [blame] | 419 | +--------------------------------------------------+----------------------------------------------------------------------------+ |
| 420 | | Return value policy | Description | |
| 421 | +==================================================+============================================================================+ |
| 422 | | :enum:`return_value_policy::automatic` | This is the default return value policy, which falls back to the policy | |
| 423 | | | :enum:`return_value_policy::take_ownership` when the return value is a | |
Wenzel Jakob | e84f557 | 2016-04-26 23:19:19 +0200 | [diff] [blame] | 424 | | | pointer. Otherwise, it uses :enum:`return_value::move` or | |
| 425 | | | :enum:`return_value::copy` for rvalue and lvalue references, respectively. | |
Wenzel Jakob | f7b5874 | 2016-04-25 23:04:27 +0200 | [diff] [blame] | 426 | | | See below for a description of what all of these different policies do. | |
| 427 | +--------------------------------------------------+----------------------------------------------------------------------------+ |
| 428 | | :enum:`return_value_policy::automatic_reference` | As above, but use policy :enum:`return_value_policy::reference` when the | |
Wenzel Jakob | e84f557 | 2016-04-26 23:19:19 +0200 | [diff] [blame] | 429 | | | return value is a pointer. You probably won't need to use this. | |
Wenzel Jakob | f7b5874 | 2016-04-25 23:04:27 +0200 | [diff] [blame] | 430 | +--------------------------------------------------+----------------------------------------------------------------------------+ |
| 431 | | :enum:`return_value_policy::take_ownership` | Reference an existing object (i.e. do not create a new copy) and take | |
| 432 | | | ownership. Python will call the destructor and delete operator when the | |
| 433 | | | object's reference count reaches zero. Undefined behavior ensues when the | |
| 434 | | | C++ side does the same.. | |
| 435 | +--------------------------------------------------+----------------------------------------------------------------------------+ |
| 436 | | :enum:`return_value_policy::copy` | Create a new copy of the returned object, which will be owned by Python. | |
| 437 | | | This policy is comparably safe because the lifetimes of the two instances | |
| 438 | | | are decoupled. | |
| 439 | +--------------------------------------------------+----------------------------------------------------------------------------+ |
| 440 | | :enum:`return_value_policy::move` | Use ``std::move`` to move the return value contents into a new instance | |
| 441 | | | that will be owned by Python. This policy is comparably safe because the | |
| 442 | | | lifetimes of the two instances (move source and destination) are decoupled.| |
| 443 | +--------------------------------------------------+----------------------------------------------------------------------------+ |
| 444 | | :enum:`return_value_policy::reference` | Reference an existing object, but do not take ownership. The C++ side is | |
| 445 | | | responsible for managing the object's lifetime and deallocating it when | |
| 446 | | | it is no longer used. Warning: undefined behavior will ensue when the C++ | |
Wenzel Jakob | e84f557 | 2016-04-26 23:19:19 +0200 | [diff] [blame] | 447 | | | side deletes an object that is still referenced and used by Python. | |
Wenzel Jakob | f7b5874 | 2016-04-25 23:04:27 +0200 | [diff] [blame] | 448 | +--------------------------------------------------+----------------------------------------------------------------------------+ |
Wenzel Jakob | e84f557 | 2016-04-26 23:19:19 +0200 | [diff] [blame] | 449 | | :enum:`return_value_policy::reference_internal` | This policy only applies to methods and properties. It references the | |
| 450 | | | object without taking ownership similar to the above | |
| 451 | | | :enum:`return_value_policy::reference` policy. In contrast to that policy, | |
| 452 | | | the function or property's implicit ``this`` argument (called the *parent*)| |
| 453 | | | is considered to be the the owner of the return value (the *child*). | |
| 454 | | | pybind11 then couples the lifetime of the parent to the child via a | |
| 455 | | | reference relationship that ensures that the parent cannot be garbage | |
| 456 | | | collected while Python is still using the child. More advanced variations | |
| 457 | | | of this scheme are also possible using combinations of | |
| 458 | | | :enum:`return_value_policy::reference` and the :class:`keep_alive` call | |
| 459 | | | policy described next. | |
Wenzel Jakob | f7b5874 | 2016-04-25 23:04:27 +0200 | [diff] [blame] | 460 | +--------------------------------------------------+----------------------------------------------------------------------------+ |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 461 | |
Wenzel Jakob | e84f557 | 2016-04-26 23:19:19 +0200 | [diff] [blame] | 462 | The following example snippet shows a use case of the |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 463 | :enum:`return_value_policy::reference_internal` policy. |
| 464 | |
| 465 | .. code-block:: cpp |
| 466 | |
| 467 | class Example { |
| 468 | public: |
| 469 | Internal &get_internal() { return internal; } |
| 470 | private: |
| 471 | Internal internal; |
| 472 | }; |
| 473 | |
Wenzel Jakob | b1b7140 | 2015-10-18 16:48:30 +0200 | [diff] [blame] | 474 | PYBIND11_PLUGIN(example) { |
Wenzel Jakob | 8f4eb00 | 2015-10-15 18:13:33 +0200 | [diff] [blame] | 475 | py::module m("example", "pybind11 example plugin"); |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 476 | |
| 477 | py::class_<Example>(m, "Example") |
| 478 | .def(py::init<>()) |
Wenzel Jakob | e84f557 | 2016-04-26 23:19:19 +0200 | [diff] [blame] | 479 | .def("get_internal", &Example::get_internal, "Return the internal data", |
| 480 | py::return_value_policy::reference_internal); |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 481 | |
| 482 | return m.ptr(); |
| 483 | } |
| 484 | |
Wenzel Jakob | e84f557 | 2016-04-26 23:19:19 +0200 | [diff] [blame] | 485 | .. warning:: |
| 486 | |
| 487 | Code with invalid call policies might access unitialized memory or free |
| 488 | data structures multiple times, which can lead to hard-to-debug |
| 489 | non-determinism and segmentation faults, hence it is worth spending the |
| 490 | time to understand all the different options in the table above. |
| 491 | |
| 492 | .. note:: |
| 493 | |
| 494 | The next section on :ref:`call_policies` discusses *call policies* that can be |
| 495 | specified *in addition* to a return value policy from the list above. Call |
| 496 | policies indicate reference relationships that can involve both return values |
| 497 | and parameters of functions. |
| 498 | |
| 499 | .. note:: |
| 500 | |
| 501 | As an alternative to elaborate call policies and lifetime management logic, |
| 502 | consider using smart pointers (see the section on :ref:`smart_pointers` for |
| 503 | details). Smart pointers can tell whether an object is still referenced from |
| 504 | C++ or Python, which generally eliminates the kinds of inconsistencies that |
| 505 | can lead to crashes or undefined behavior. For functions returning smart |
| 506 | pointers, it is not necessary to specify a return value policy. |
Wenzel Jakob | 5f218b3 | 2016-01-17 22:36:39 +0100 | [diff] [blame] | 507 | |
Wenzel Jakob | f7b5874 | 2016-04-25 23:04:27 +0200 | [diff] [blame] | 508 | .. _call_policies: |
| 509 | |
Wenzel Jakob | 5f218b3 | 2016-01-17 22:36:39 +0100 | [diff] [blame] | 510 | Additional call policies |
| 511 | ======================== |
| 512 | |
| 513 | In addition to the above return value policies, further `call policies` can be |
| 514 | specified to indicate dependencies between parameters. There is currently just |
| 515 | one policy named ``keep_alive<Nurse, Patient>``, which indicates that the |
| 516 | argument with index ``Patient`` should be kept alive at least until the |
| 517 | argument with index ``Nurse`` is freed by the garbage collector; argument |
Wenzel Jakob | 8e93df8 | 2016-05-01 02:36:58 +0200 | [diff] [blame] | 518 | indices start at one, while zero refers to the return value. For methods, index |
| 519 | one refers to the implicit ``this`` pointer, while regular arguments begin at |
| 520 | index two. Arbitrarily many call policies can be specified. |
Wenzel Jakob | 5f218b3 | 2016-01-17 22:36:39 +0100 | [diff] [blame] | 521 | |
Wenzel Jakob | 8e93df8 | 2016-05-01 02:36:58 +0200 | [diff] [blame] | 522 | Consider the following example: the binding code for a list append operation |
| 523 | that ties the lifetime of the newly added element to the underlying container |
| 524 | might be declared as follows: |
Wenzel Jakob | 5f218b3 | 2016-01-17 22:36:39 +0100 | [diff] [blame] | 525 | |
| 526 | .. code-block:: cpp |
| 527 | |
| 528 | py::class_<List>(m, "List") |
| 529 | .def("append", &List::append, py::keep_alive<1, 2>()); |
| 530 | |
| 531 | .. note:: |
| 532 | |
| 533 | ``keep_alive`` is analogous to the ``with_custodian_and_ward`` (if Nurse, |
| 534 | Patient != 0) and ``with_custodian_and_ward_postcall`` (if Nurse/Patient == |
| 535 | 0) policies from Boost.Python. |
| 536 | |
Wenzel Jakob | 6158716 | 2016-01-18 22:38:52 +0100 | [diff] [blame] | 537 | .. seealso:: |
| 538 | |
| 539 | The file :file:`example/example13.cpp` contains a complete example that |
| 540 | demonstrates using :class:`keep_alive` in more detail. |
| 541 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 542 | Implicit type conversions |
| 543 | ========================= |
| 544 | |
| 545 | Suppose that instances of two types ``A`` and ``B`` are used in a project, and |
Wenzel Jakob | 8e93df8 | 2016-05-01 02:36:58 +0200 | [diff] [blame] | 546 | that an ``A`` can easily be converted into an instance of type ``B`` (examples of this |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 547 | could be a fixed and an arbitrary precision number type). |
| 548 | |
| 549 | .. code-block:: cpp |
| 550 | |
| 551 | py::class_<A>(m, "A") |
| 552 | /// ... members ... |
| 553 | |
| 554 | py::class_<B>(m, "B") |
| 555 | .def(py::init<A>()) |
| 556 | /// ... members ... |
| 557 | |
| 558 | m.def("func", |
| 559 | [](const B &) { /* .... */ } |
| 560 | ); |
| 561 | |
| 562 | To invoke the function ``func`` using a variable ``a`` containing an ``A`` |
| 563 | instance, we'd have to write ``func(B(a))`` in Python. On the other hand, C++ |
| 564 | will automatically apply an implicit type conversion, which makes it possible |
| 565 | to directly write ``func(a)``. |
| 566 | |
| 567 | In this situation (i.e. where ``B`` has a constructor that converts from |
| 568 | ``A``), the following statement enables similar implicit conversions on the |
| 569 | Python side: |
| 570 | |
| 571 | .. code-block:: cpp |
| 572 | |
| 573 | py::implicitly_convertible<A, B>(); |
| 574 | |
Wenzel Jakob | 9f0dfce | 2016-04-06 17:38:18 +0200 | [diff] [blame] | 575 | Unique pointers |
| 576 | =============== |
| 577 | |
| 578 | Given a class ``Example`` with Python bindings, it's possible to return |
| 579 | instances wrapped in C++11 unique pointers, like so |
| 580 | |
| 581 | .. code-block:: cpp |
| 582 | |
| 583 | std::unique_ptr<Example> create_example() { return std::unique_ptr<Example>(new Example()); } |
| 584 | |
| 585 | .. code-block:: cpp |
| 586 | |
| 587 | m.def("create_example", &create_example); |
| 588 | |
| 589 | In other words, there is nothing special that needs to be done. While returning |
| 590 | unique pointers in this way is allowed, it is *illegal* to use them as function |
| 591 | arguments. For instance, the following function signature cannot be processed |
| 592 | by pybind11. |
| 593 | |
| 594 | .. code-block:: cpp |
| 595 | |
| 596 | void do_something_with_example(std::unique_ptr<Example> ex) { ... } |
| 597 | |
| 598 | The above signature would imply that Python needs to give up ownership of an |
| 599 | object that is passed to this function, which is generally not possible (for |
| 600 | instance, the object might be referenced elsewhere). |
| 601 | |
Wenzel Jakob | f7b5874 | 2016-04-25 23:04:27 +0200 | [diff] [blame] | 602 | .. _smart_pointers: |
| 603 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 604 | Smart pointers |
| 605 | ============== |
| 606 | |
Wenzel Jakob | 9f0dfce | 2016-04-06 17:38:18 +0200 | [diff] [blame] | 607 | This section explains how to pass values that are wrapped in "smart" pointer |
Wenzel Jakob | e84f557 | 2016-04-26 23:19:19 +0200 | [diff] [blame] | 608 | types with internal reference counting. For the simpler C++11 unique pointers, |
| 609 | refer to the previous section. |
Wenzel Jakob | 9f0dfce | 2016-04-06 17:38:18 +0200 | [diff] [blame] | 610 | |
Wenzel Jakob | e84f557 | 2016-04-26 23:19:19 +0200 | [diff] [blame] | 611 | The binding generator for classes, :class:`class_`, takes an optional second |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 612 | template type, which denotes a special *holder* type that is used to manage |
| 613 | references to the object. When wrapping a type named ``Type``, the default |
| 614 | value of this template parameter is ``std::unique_ptr<Type>``, which means that |
| 615 | the object is deallocated when Python's reference count goes to zero. |
| 616 | |
Wenzel Jakob | 1853b65 | 2015-10-18 15:38:50 +0200 | [diff] [blame] | 617 | It is possible to switch to other types of reference counting wrappers or smart |
| 618 | pointers, which is useful in codebases that rely on them. For instance, the |
| 619 | following snippet causes ``std::shared_ptr`` to be used instead. |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 620 | |
| 621 | .. code-block:: cpp |
| 622 | |
Wenzel Jakob | b2c2c79 | 2016-01-17 22:36:40 +0100 | [diff] [blame] | 623 | py::class_<Example, std::shared_ptr<Example> /* <- holder type */> obj(m, "Example"); |
Wenzel Jakob | 5ef1219 | 2015-12-15 17:07:35 +0100 | [diff] [blame] | 624 | |
Wenzel Jakob | b2c2c79 | 2016-01-17 22:36:40 +0100 | [diff] [blame] | 625 | Note that any particular class can only be associated with a single holder type. |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 626 | |
Wenzel Jakob | 1853b65 | 2015-10-18 15:38:50 +0200 | [diff] [blame] | 627 | To enable transparent conversions for functions that take shared pointers as an |
Wenzel Jakob | 5ef1219 | 2015-12-15 17:07:35 +0100 | [diff] [blame] | 628 | argument or that return them, a macro invocation similar to the following must |
Wenzel Jakob | 1853b65 | 2015-10-18 15:38:50 +0200 | [diff] [blame] | 629 | be declared at the top level before any binding code: |
| 630 | |
| 631 | .. code-block:: cpp |
| 632 | |
Wenzel Jakob | b1b7140 | 2015-10-18 16:48:30 +0200 | [diff] [blame] | 633 | PYBIND11_DECLARE_HOLDER_TYPE(T, std::shared_ptr<T>); |
Wenzel Jakob | 1853b65 | 2015-10-18 15:38:50 +0200 | [diff] [blame] | 634 | |
Wenzel Jakob | b2c2c79 | 2016-01-17 22:36:40 +0100 | [diff] [blame] | 635 | .. note:: |
Wenzel Jakob | 61d67f0 | 2015-12-14 12:53:06 +0100 | [diff] [blame] | 636 | |
| 637 | The first argument of :func:`PYBIND11_DECLARE_HOLDER_TYPE` should be a |
| 638 | placeholder name that is used as a template parameter of the second |
| 639 | argument. Thus, feel free to use any identifier, but use it consistently on |
| 640 | both sides; also, don't use the name of a type that already exists in your |
| 641 | codebase. |
| 642 | |
Wenzel Jakob | b2c2c79 | 2016-01-17 22:36:40 +0100 | [diff] [blame] | 643 | One potential stumbling block when using holder types is that they need to be |
| 644 | applied consistently. Can you guess what's broken about the following binding |
| 645 | code? |
Wenzel Jakob | 6e213c9 | 2015-11-24 23:05:58 +0100 | [diff] [blame] | 646 | |
Wenzel Jakob | b2c2c79 | 2016-01-17 22:36:40 +0100 | [diff] [blame] | 647 | .. code-block:: cpp |
Wenzel Jakob | 6e213c9 | 2015-11-24 23:05:58 +0100 | [diff] [blame] | 648 | |
Wenzel Jakob | b2c2c79 | 2016-01-17 22:36:40 +0100 | [diff] [blame] | 649 | class Child { }; |
Wenzel Jakob | 5ef1219 | 2015-12-15 17:07:35 +0100 | [diff] [blame] | 650 | |
Wenzel Jakob | b2c2c79 | 2016-01-17 22:36:40 +0100 | [diff] [blame] | 651 | class Parent { |
| 652 | public: |
| 653 | Parent() : child(std::make_shared<Child>()) { } |
| 654 | Child *get_child() { return child.get(); } /* Hint: ** DON'T DO THIS ** */ |
| 655 | private: |
| 656 | std::shared_ptr<Child> child; |
| 657 | }; |
Wenzel Jakob | 5ef1219 | 2015-12-15 17:07:35 +0100 | [diff] [blame] | 658 | |
Wenzel Jakob | b2c2c79 | 2016-01-17 22:36:40 +0100 | [diff] [blame] | 659 | PYBIND11_PLUGIN(example) { |
| 660 | py::module m("example"); |
Wenzel Jakob | 5ef1219 | 2015-12-15 17:07:35 +0100 | [diff] [blame] | 661 | |
Wenzel Jakob | b2c2c79 | 2016-01-17 22:36:40 +0100 | [diff] [blame] | 662 | py::class_<Child, std::shared_ptr<Child>>(m, "Child"); |
| 663 | |
| 664 | py::class_<Parent, std::shared_ptr<Parent>>(m, "Parent") |
| 665 | .def(py::init<>()) |
| 666 | .def("get_child", &Parent::get_child); |
| 667 | |
| 668 | return m.ptr(); |
| 669 | } |
| 670 | |
| 671 | The following Python code will cause undefined behavior (and likely a |
| 672 | segmentation fault). |
| 673 | |
| 674 | .. code-block:: python |
| 675 | |
| 676 | from example import Parent |
| 677 | print(Parent().get_child()) |
| 678 | |
| 679 | The problem is that ``Parent::get_child()`` returns a pointer to an instance of |
| 680 | ``Child``, but the fact that this instance is already managed by |
| 681 | ``std::shared_ptr<...>`` is lost when passing raw pointers. In this case, |
| 682 | pybind11 will create a second independent ``std::shared_ptr<...>`` that also |
| 683 | claims ownership of the pointer. In the end, the object will be freed **twice** |
| 684 | since these shared pointers have no way of knowing about each other. |
| 685 | |
| 686 | There are two ways to resolve this issue: |
| 687 | |
| 688 | 1. For types that are managed by a smart pointer class, never use raw pointers |
| 689 | in function arguments or return values. In other words: always consistently |
| 690 | wrap pointers into their designated holder types (such as |
| 691 | ``std::shared_ptr<...>``). In this case, the signature of ``get_child()`` |
| 692 | should be modified as follows: |
| 693 | |
| 694 | .. code-block:: cpp |
| 695 | |
| 696 | std::shared_ptr<Child> get_child() { return child; } |
| 697 | |
| 698 | 2. Adjust the definition of ``Child`` by specifying |
| 699 | ``std::enable_shared_from_this<T>`` (see cppreference_ for details) as a |
| 700 | base class. This adds a small bit of information to ``Child`` that allows |
| 701 | pybind11 to realize that there is already an existing |
| 702 | ``std::shared_ptr<...>`` and communicate with it. In this case, the |
| 703 | declaration of ``Child`` should look as follows: |
Wenzel Jakob | 5ef1219 | 2015-12-15 17:07:35 +0100 | [diff] [blame] | 704 | |
Wenzel Jakob | 6e213c9 | 2015-11-24 23:05:58 +0100 | [diff] [blame] | 705 | .. _cppreference: http://en.cppreference.com/w/cpp/memory/enable_shared_from_this |
| 706 | |
Wenzel Jakob | b2c2c79 | 2016-01-17 22:36:40 +0100 | [diff] [blame] | 707 | .. code-block:: cpp |
| 708 | |
| 709 | class Child : public std::enable_shared_from_this<Child> { }; |
| 710 | |
Wenzel Jakob | 5ef1219 | 2015-12-15 17:07:35 +0100 | [diff] [blame] | 711 | .. seealso:: |
| 712 | |
| 713 | The file :file:`example/example8.cpp` contains a complete example that |
| 714 | demonstrates how to work with custom reference-counting holder types in |
| 715 | more detail. |
| 716 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 717 | .. _custom_constructors: |
| 718 | |
| 719 | Custom constructors |
| 720 | =================== |
| 721 | |
| 722 | The syntax for binding constructors was previously introduced, but it only |
| 723 | works when a constructor with the given parameters actually exists on the C++ |
| 724 | side. To extend this to more general cases, let's take a look at what actually |
| 725 | happens under the hood: the following statement |
| 726 | |
| 727 | .. code-block:: cpp |
| 728 | |
| 729 | py::class_<Example>(m, "Example") |
| 730 | .def(py::init<int>()); |
| 731 | |
| 732 | is short hand notation for |
| 733 | |
| 734 | .. code-block:: cpp |
| 735 | |
| 736 | py::class_<Example>(m, "Example") |
| 737 | .def("__init__", |
| 738 | [](Example &instance, int arg) { |
| 739 | new (&instance) Example(arg); |
| 740 | } |
| 741 | ); |
| 742 | |
| 743 | In other words, :func:`init` creates an anonymous function that invokes an |
| 744 | in-place constructor. Memory allocation etc. is already take care of beforehand |
| 745 | within pybind11. |
| 746 | |
| 747 | Catching and throwing exceptions |
| 748 | ================================ |
| 749 | |
| 750 | When C++ code invoked from Python throws an ``std::exception``, it is |
| 751 | automatically converted into a Python ``Exception``. pybind11 defines multiple |
| 752 | special exception classes that will map to different types of Python |
| 753 | exceptions: |
| 754 | |
Wenzel Jakob | f64feaf | 2016-04-28 14:33:45 +0200 | [diff] [blame] | 755 | .. tabularcolumns:: |p{0.5\textwidth}|p{0.45\textwidth}| |
| 756 | |
Wenzel Jakob | 978e376 | 2016-04-07 18:00:41 +0200 | [diff] [blame] | 757 | +--------------------------------------+------------------------------+ |
| 758 | | C++ exception type | Python exception type | |
| 759 | +======================================+==============================+ |
| 760 | | :class:`std::exception` | ``RuntimeError`` | |
| 761 | +--------------------------------------+------------------------------+ |
| 762 | | :class:`std::bad_alloc` | ``MemoryError`` | |
| 763 | +--------------------------------------+------------------------------+ |
| 764 | | :class:`std::domain_error` | ``ValueError`` | |
| 765 | +--------------------------------------+------------------------------+ |
| 766 | | :class:`std::invalid_argument` | ``ValueError`` | |
| 767 | +--------------------------------------+------------------------------+ |
| 768 | | :class:`std::length_error` | ``ValueError`` | |
| 769 | +--------------------------------------+------------------------------+ |
| 770 | | :class:`std::out_of_range` | ``ValueError`` | |
| 771 | +--------------------------------------+------------------------------+ |
| 772 | | :class:`std::range_error` | ``ValueError`` | |
| 773 | +--------------------------------------+------------------------------+ |
| 774 | | :class:`pybind11::stop_iteration` | ``StopIteration`` (used to | |
| 775 | | | implement custom iterators) | |
| 776 | +--------------------------------------+------------------------------+ |
| 777 | | :class:`pybind11::index_error` | ``IndexError`` (used to | |
| 778 | | | indicate out of bounds | |
| 779 | | | accesses in ``__getitem__``, | |
| 780 | | | ``__setitem__``, etc.) | |
| 781 | +--------------------------------------+------------------------------+ |
| 782 | | :class:`pybind11::error_already_set` | Indicates that the Python | |
| 783 | | | exception flag has already | |
| 784 | | | been initialized | |
| 785 | +--------------------------------------+------------------------------+ |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 786 | |
| 787 | When a Python function invoked from C++ throws an exception, it is converted |
| 788 | into a C++ exception of type :class:`error_already_set` whose string payload |
| 789 | contains a textual summary. |
| 790 | |
| 791 | There is also a special exception :class:`cast_error` that is thrown by |
| 792 | :func:`handle::call` when the input arguments cannot be converted to Python |
| 793 | objects. |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 794 | |
| 795 | Buffer protocol |
| 796 | =============== |
| 797 | |
| 798 | Python supports an extremely general and convenient approach for exchanging |
Wenzel Jakob | 978e376 | 2016-04-07 18:00:41 +0200 | [diff] [blame] | 799 | data between plugin libraries. Types can expose a buffer view [#f1]_, |
| 800 | which provides fast direct access to the raw internal representation. Suppose |
| 801 | we want to bind the following simplistic Matrix class: |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 802 | |
| 803 | .. code-block:: cpp |
| 804 | |
| 805 | class Matrix { |
| 806 | public: |
| 807 | Matrix(size_t rows, size_t cols) : m_rows(rows), m_cols(cols) { |
| 808 | m_data = new float[rows*cols]; |
| 809 | } |
| 810 | float *data() { return m_data; } |
| 811 | size_t rows() const { return m_rows; } |
| 812 | size_t cols() const { return m_cols; } |
| 813 | private: |
| 814 | size_t m_rows, m_cols; |
| 815 | float *m_data; |
| 816 | }; |
| 817 | |
| 818 | The following binding code exposes the ``Matrix`` contents as a buffer object, |
Wenzel Jakob | 8e93df8 | 2016-05-01 02:36:58 +0200 | [diff] [blame] | 819 | making it possible to cast Matrices into NumPy arrays. It is even possible to |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 820 | completely avoid copy operations with Python expressions like |
| 821 | ``np.array(matrix_instance, copy = False)``. |
| 822 | |
| 823 | .. code-block:: cpp |
| 824 | |
| 825 | py::class_<Matrix>(m, "Matrix") |
| 826 | .def_buffer([](Matrix &m) -> py::buffer_info { |
| 827 | return py::buffer_info( |
Wenzel Jakob | 876eeab | 2016-05-04 22:22:48 +0200 | [diff] [blame^] | 828 | m.data(), /* Pointer to buffer */ |
| 829 | sizeof(float), /* Size of one scalar */ |
| 830 | py::format_descriptor<float>::value, /* Python struct-style format descriptor */ |
| 831 | 2, /* Number of dimensions */ |
| 832 | { m.rows(), m.cols() }, /* Buffer dimensions */ |
| 833 | { sizeof(float) * m.rows(), /* Strides (in bytes) for each index */ |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 834 | sizeof(float) } |
| 835 | ); |
| 836 | }); |
| 837 | |
| 838 | The snippet above binds a lambda function, which can create ``py::buffer_info`` |
| 839 | description records on demand describing a given matrix. The contents of |
| 840 | ``py::buffer_info`` mirror the Python buffer protocol specification. |
| 841 | |
| 842 | .. code-block:: cpp |
| 843 | |
| 844 | struct buffer_info { |
| 845 | void *ptr; |
| 846 | size_t itemsize; |
| 847 | std::string format; |
| 848 | int ndim; |
| 849 | std::vector<size_t> shape; |
| 850 | std::vector<size_t> strides; |
| 851 | }; |
| 852 | |
| 853 | To create a C++ function that can take a Python buffer object as an argument, |
| 854 | simply use the type ``py::buffer`` as one of its arguments. Buffers can exist |
| 855 | in a great variety of configurations, hence some safety checks are usually |
| 856 | necessary in the function body. Below, you can see an basic example on how to |
| 857 | define a custom constructor for the Eigen double precision matrix |
| 858 | (``Eigen::MatrixXd``) type, which supports initialization from compatible |
| 859 | buffer |
| 860 | objects (e.g. a NumPy matrix). |
| 861 | |
| 862 | .. code-block:: cpp |
| 863 | |
| 864 | py::class_<Eigen::MatrixXd>(m, "MatrixXd") |
| 865 | .def("__init__", [](Eigen::MatrixXd &m, py::buffer b) { |
| 866 | /* Request a buffer descriptor from Python */ |
| 867 | py::buffer_info info = b.request(); |
| 868 | |
| 869 | /* Some sanity checks ... */ |
Wenzel Jakob | 876eeab | 2016-05-04 22:22:48 +0200 | [diff] [blame^] | 870 | if (info.format != py::format_descriptor<double>::value) |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 871 | throw std::runtime_error("Incompatible format: expected a double array!"); |
| 872 | |
| 873 | if (info.ndim != 2) |
| 874 | throw std::runtime_error("Incompatible buffer dimension!"); |
| 875 | |
| 876 | if (info.strides[0] == sizeof(double)) { |
| 877 | /* Buffer has the right layout -- directly copy. */ |
| 878 | new (&m) Eigen::MatrixXd(info.shape[0], info.shape[1]); |
| 879 | memcpy(m.data(), info.ptr, sizeof(double) * m.size()); |
| 880 | } else { |
| 881 | /* Oops -- the buffer is transposed */ |
| 882 | new (&m) Eigen::MatrixXd(info.shape[1], info.shape[0]); |
| 883 | memcpy(m.data(), info.ptr, sizeof(double) * m.size()); |
| 884 | m.transposeInPlace(); |
| 885 | } |
| 886 | }); |
| 887 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 888 | .. seealso:: |
| 889 | |
| 890 | The file :file:`example/example7.cpp` contains a complete example that |
| 891 | demonstrates using the buffer protocol with pybind11 in more detail. |
| 892 | |
Wenzel Jakob | 1c329aa | 2016-04-13 02:37:36 +0200 | [diff] [blame] | 893 | .. [#f1] http://docs.python.org/3/c-api/buffer.html |
Wenzel Jakob | 978e376 | 2016-04-07 18:00:41 +0200 | [diff] [blame] | 894 | |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 895 | NumPy support |
| 896 | ============= |
| 897 | |
| 898 | By exchanging ``py::buffer`` with ``py::array`` in the above snippet, we can |
| 899 | restrict the function so that it only accepts NumPy arrays (rather than any |
Wenzel Jakob | 978e376 | 2016-04-07 18:00:41 +0200 | [diff] [blame] | 900 | type of Python object satisfying the buffer protocol). |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 901 | |
| 902 | In many situations, we want to define a function which only accepts a NumPy |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 903 | array of a certain data type. This is possible via the ``py::array_t<T>`` |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 904 | template. For instance, the following function requires the argument to be a |
| 905 | dense array of doubles in C-style ordering. |
| 906 | |
| 907 | .. code-block:: cpp |
| 908 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 909 | void f(py::array_t<double> array); |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 910 | |
| 911 | When it is invoked with a different type (e.g. an integer), the binding code |
Wenzel Jakob | 978e376 | 2016-04-07 18:00:41 +0200 | [diff] [blame] | 912 | will attempt to cast the input into a NumPy array of the requested type. Note |
| 913 | that this feature requires the :file:``pybind11/numpy.h`` header to be |
| 914 | included. |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 915 | |
| 916 | Vectorizing functions |
| 917 | ===================== |
| 918 | |
| 919 | Suppose we want to bind a function with the following signature to Python so |
| 920 | that it can process arbitrary NumPy array arguments (vectors, matrices, general |
| 921 | N-D arrays) in addition to its normal arguments: |
| 922 | |
| 923 | .. code-block:: cpp |
| 924 | |
| 925 | double my_func(int x, float y, double z); |
| 926 | |
Wenzel Jakob | 8f4eb00 | 2015-10-15 18:13:33 +0200 | [diff] [blame] | 927 | After including the ``pybind11/numpy.h`` header, this is extremely simple: |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 928 | |
| 929 | .. code-block:: cpp |
| 930 | |
| 931 | m.def("vectorized_func", py::vectorize(my_func)); |
| 932 | |
| 933 | Invoking the function like below causes 4 calls to be made to ``my_func`` with |
Wenzel Jakob | 8e93df8 | 2016-05-01 02:36:58 +0200 | [diff] [blame] | 934 | each of the array elements. The significant advantage of this compared to |
Wenzel Jakob | 978e376 | 2016-04-07 18:00:41 +0200 | [diff] [blame] | 935 | solutions like ``numpy.vectorize()`` is that the loop over the elements runs |
| 936 | entirely on the C++ side and can be crunched down into a tight, optimized loop |
| 937 | by the compiler. The result is returned as a NumPy array of type |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 938 | ``numpy.dtype.float64``. |
| 939 | |
| 940 | .. code-block:: python |
| 941 | |
| 942 | >>> x = np.array([[1, 3],[5, 7]]) |
| 943 | >>> y = np.array([[2, 4],[6, 8]]) |
| 944 | >>> z = 3 |
| 945 | >>> result = vectorized_func(x, y, z) |
| 946 | |
| 947 | The scalar argument ``z`` is transparently replicated 4 times. The input |
| 948 | arrays ``x`` and ``y`` are automatically converted into the right types (they |
| 949 | are of type ``numpy.dtype.int64`` but need to be ``numpy.dtype.int32`` and |
| 950 | ``numpy.dtype.float32``, respectively) |
| 951 | |
Wenzel Jakob | 8e93df8 | 2016-05-01 02:36:58 +0200 | [diff] [blame] | 952 | Sometimes we might want to explicitly exclude an argument from the vectorization |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 953 | because it makes little sense to wrap it in a NumPy array. For instance, |
| 954 | suppose the function signature was |
| 955 | |
| 956 | .. code-block:: cpp |
| 957 | |
| 958 | double my_func(int x, float y, my_custom_type *z); |
| 959 | |
| 960 | This can be done with a stateful Lambda closure: |
| 961 | |
| 962 | .. code-block:: cpp |
| 963 | |
| 964 | // Vectorize a lambda function with a capture object (e.g. to exclude some arguments from the vectorization) |
| 965 | m.def("vectorized_func", |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 966 | [](py::array_t<int> x, py::array_t<float> y, my_custom_type *z) { |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 967 | auto stateful_closure = [z](int x, float y) { return my_func(x, y, z); }; |
| 968 | return py::vectorize(stateful_closure)(x, y); |
| 969 | } |
| 970 | ); |
| 971 | |
Wenzel Jakob | 6158716 | 2016-01-18 22:38:52 +0100 | [diff] [blame] | 972 | In cases where the computation is too complicated to be reduced to |
| 973 | ``vectorize``, it will be necessary to create and access the buffer contents |
| 974 | manually. The following snippet contains a complete example that shows how this |
| 975 | works (the code is somewhat contrived, since it could have been done more |
| 976 | simply using ``vectorize``). |
| 977 | |
| 978 | .. code-block:: cpp |
| 979 | |
| 980 | #include <pybind11/pybind11.h> |
| 981 | #include <pybind11/numpy.h> |
| 982 | |
| 983 | namespace py = pybind11; |
| 984 | |
| 985 | py::array_t<double> add_arrays(py::array_t<double> input1, py::array_t<double> input2) { |
| 986 | auto buf1 = input1.request(), buf2 = input2.request(); |
| 987 | |
| 988 | if (buf1.ndim != 1 || buf2.ndim != 1) |
| 989 | throw std::runtime_error("Number of dimensions must be one"); |
| 990 | |
| 991 | if (buf1.shape[0] != buf2.shape[0]) |
| 992 | throw std::runtime_error("Input shapes must match"); |
| 993 | |
| 994 | auto result = py::array(py::buffer_info( |
| 995 | nullptr, /* Pointer to data (nullptr -> ask NumPy to allocate!) */ |
| 996 | sizeof(double), /* Size of one item */ |
Wenzel Jakob | 876eeab | 2016-05-04 22:22:48 +0200 | [diff] [blame^] | 997 | py::format_descriptor<double>::value, /* Buffer format */ |
Wenzel Jakob | 6158716 | 2016-01-18 22:38:52 +0100 | [diff] [blame] | 998 | buf1.ndim, /* How many dimensions? */ |
| 999 | { buf1.shape[0] }, /* Number of elements for each dimension */ |
| 1000 | { sizeof(double) } /* Strides for each dimension */ |
| 1001 | )); |
| 1002 | |
| 1003 | auto buf3 = result.request(); |
| 1004 | |
| 1005 | double *ptr1 = (double *) buf1.ptr, |
| 1006 | *ptr2 = (double *) buf2.ptr, |
| 1007 | *ptr3 = (double *) buf3.ptr; |
| 1008 | |
| 1009 | for (size_t idx = 0; idx < buf1.shape[0]; idx++) |
| 1010 | ptr3[idx] = ptr1[idx] + ptr2[idx]; |
| 1011 | |
| 1012 | return result; |
| 1013 | } |
| 1014 | |
| 1015 | PYBIND11_PLUGIN(test) { |
| 1016 | py::module m("test"); |
| 1017 | m.def("add_arrays", &add_arrays, "Add two NumPy arrays"); |
| 1018 | return m.ptr(); |
| 1019 | } |
| 1020 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 1021 | .. seealso:: |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 1022 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 1023 | The file :file:`example/example10.cpp` contains a complete example that |
| 1024 | demonstrates using :func:`vectorize` in more detail. |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 1025 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 1026 | Functions taking Python objects as arguments |
| 1027 | ============================================ |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 1028 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 1029 | pybind11 exposes all major Python types using thin C++ wrapper classes. These |
| 1030 | wrapper classes can also be used as parameters of functions in bindings, which |
| 1031 | makes it possible to directly work with native Python types on the C++ side. |
| 1032 | For instance, the following statement iterates over a Python ``dict``: |
Wenzel Jakob | 28f98aa | 2015-10-13 02:57:16 +0200 | [diff] [blame] | 1033 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 1034 | .. code-block:: cpp |
| 1035 | |
| 1036 | void print_dict(py::dict dict) { |
| 1037 | /* Easily interact with Python types */ |
| 1038 | for (auto item : dict) |
| 1039 | std::cout << "key=" << item.first << ", " |
| 1040 | << "value=" << item.second << std::endl; |
| 1041 | } |
| 1042 | |
| 1043 | Available types include :class:`handle`, :class:`object`, :class:`bool_`, |
Wenzel Jakob | 27e8e10 | 2016-01-17 22:36:37 +0100 | [diff] [blame] | 1044 | :class:`int_`, :class:`float_`, :class:`str`, :class:`bytes`, :class:`tuple`, |
Wenzel Jakob | f64feaf | 2016-04-28 14:33:45 +0200 | [diff] [blame] | 1045 | :class:`list`, :class:`dict`, :class:`slice`, :class:`none`, :class:`capsule`, |
| 1046 | :class:`iterable`, :class:`iterator`, :class:`function`, :class:`buffer`, |
| 1047 | :class:`array`, and :class:`array_t`. |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 1048 | |
Wenzel Jakob | 436b731 | 2015-10-20 01:04:30 +0200 | [diff] [blame] | 1049 | In this kind of mixed code, it is often necessary to convert arbitrary C++ |
| 1050 | types to Python, which can be done using :func:`cast`: |
| 1051 | |
| 1052 | .. code-block:: cpp |
| 1053 | |
| 1054 | MyClass *cls = ..; |
| 1055 | py::object obj = py::cast(cls); |
| 1056 | |
| 1057 | The reverse direction uses the following syntax: |
| 1058 | |
| 1059 | .. code-block:: cpp |
| 1060 | |
| 1061 | py::object obj = ...; |
| 1062 | MyClass *cls = obj.cast<MyClass *>(); |
| 1063 | |
| 1064 | When conversion fails, both directions throw the exception :class:`cast_error`. |
| 1065 | |
Wenzel Jakob | 9329669 | 2015-10-13 23:21:54 +0200 | [diff] [blame] | 1066 | .. seealso:: |
| 1067 | |
| 1068 | The file :file:`example/example2.cpp` contains a complete example that |
| 1069 | demonstrates passing native Python types in more detail. |
Wenzel Jakob | 2ac5044 | 2016-01-17 22:36:35 +0100 | [diff] [blame] | 1070 | |
| 1071 | Default arguments revisited |
| 1072 | =========================== |
| 1073 | |
| 1074 | The section on :ref:`default_args` previously discussed basic usage of default |
| 1075 | arguments using pybind11. One noteworthy aspect of their implementation is that |
| 1076 | default arguments are converted to Python objects right at declaration time. |
| 1077 | Consider the following example: |
| 1078 | |
| 1079 | .. code-block:: cpp |
| 1080 | |
| 1081 | py::class_<MyClass>("MyClass") |
| 1082 | .def("myFunction", py::arg("arg") = SomeType(123)); |
| 1083 | |
| 1084 | In this case, pybind11 must already be set up to deal with values of the type |
| 1085 | ``SomeType`` (via a prior instantiation of ``py::class_<SomeType>``), or an |
| 1086 | exception will be thrown. |
| 1087 | |
| 1088 | Another aspect worth highlighting is that the "preview" of the default argument |
| 1089 | in the function signature is generated using the object's ``__repr__`` method. |
| 1090 | If not available, the signature may not be very helpful, e.g.: |
| 1091 | |
| 1092 | .. code-block:: python |
| 1093 | |
| 1094 | FUNCTIONS |
| 1095 | ... |
| 1096 | | myFunction(...) |
Wenzel Jakob | 48548ea | 2016-01-17 22:36:44 +0100 | [diff] [blame] | 1097 | | Signature : (MyClass, arg : SomeType = <SomeType object at 0x101b7b080>) -> NoneType |
Wenzel Jakob | 2ac5044 | 2016-01-17 22:36:35 +0100 | [diff] [blame] | 1098 | ... |
| 1099 | |
| 1100 | The first way of addressing this is by defining ``SomeType.__repr__``. |
| 1101 | Alternatively, it is possible to specify the human-readable preview of the |
| 1102 | default argument manually using the ``arg_t`` notation: |
| 1103 | |
| 1104 | .. code-block:: cpp |
| 1105 | |
| 1106 | py::class_<MyClass>("MyClass") |
| 1107 | .def("myFunction", py::arg_t<SomeType>("arg", SomeType(123), "SomeType(123)")); |
| 1108 | |
Wenzel Jakob | c769fce | 2016-03-03 12:03:30 +0100 | [diff] [blame] | 1109 | Sometimes it may be necessary to pass a null pointer value as a default |
| 1110 | argument. In this case, remember to cast it to the underlying type in question, |
| 1111 | like so: |
| 1112 | |
| 1113 | .. code-block:: cpp |
| 1114 | |
| 1115 | py::class_<MyClass>("MyClass") |
| 1116 | .def("myFunction", py::arg("arg") = (SomeType *) nullptr); |
| 1117 | |
Wenzel Jakob | 2dfbade | 2016-01-17 22:36:37 +0100 | [diff] [blame] | 1118 | Partitioning code over multiple extension modules |
| 1119 | ================================================= |
| 1120 | |
Wenzel Jakob | 90d2f5e | 2016-04-11 14:30:11 +0200 | [diff] [blame] | 1121 | It's straightforward to split binding code over multiple extension modules, |
| 1122 | while referencing types that are declared elsewhere. Everything "just" works |
| 1123 | without any special precautions. One exception to this rule occurs when |
| 1124 | extending a type declared in another extension module. Recall the basic example |
| 1125 | from Section :ref:`inheritance`. |
Wenzel Jakob | 2dfbade | 2016-01-17 22:36:37 +0100 | [diff] [blame] | 1126 | |
| 1127 | .. code-block:: cpp |
| 1128 | |
| 1129 | py::class_<Pet> pet(m, "Pet"); |
| 1130 | pet.def(py::init<const std::string &>()) |
| 1131 | .def_readwrite("name", &Pet::name); |
| 1132 | |
| 1133 | py::class_<Dog>(m, "Dog", pet /* <- specify parent */) |
| 1134 | .def(py::init<const std::string &>()) |
| 1135 | .def("bark", &Dog::bark); |
| 1136 | |
| 1137 | Suppose now that ``Pet`` bindings are defined in a module named ``basic``, |
| 1138 | whereas the ``Dog`` bindings are defined somewhere else. The challenge is of |
| 1139 | course that the variable ``pet`` is not available anymore though it is needed |
| 1140 | to indicate the inheritance relationship to the constructor of ``class_<Dog>``. |
| 1141 | However, it can be acquired as follows: |
| 1142 | |
| 1143 | .. code-block:: cpp |
| 1144 | |
| 1145 | py::object pet = (py::object) py::module::import("basic").attr("Pet"); |
| 1146 | |
| 1147 | py::class_<Dog>(m, "Dog", pet) |
| 1148 | .def(py::init<const std::string &>()) |
| 1149 | .def("bark", &Dog::bark); |
| 1150 | |
Wenzel Jakob | 8d862b3 | 2016-03-06 13:37:22 +0100 | [diff] [blame] | 1151 | Alternatively, we can rely on the ``base`` tag, which performs an automated |
| 1152 | lookup of the corresponding Python type. However, this also requires invoking |
| 1153 | the ``import`` function once to ensure that the pybind11 binding code of the |
| 1154 | module ``basic`` has been executed. |
| 1155 | |
Wenzel Jakob | 8d862b3 | 2016-03-06 13:37:22 +0100 | [diff] [blame] | 1156 | .. code-block:: cpp |
| 1157 | |
| 1158 | py::module::import("basic"); |
| 1159 | |
| 1160 | py::class_<Dog>(m, "Dog", py::base<Pet>()) |
| 1161 | .def(py::init<const std::string &>()) |
| 1162 | .def("bark", &Dog::bark); |
Wenzel Jakob | eda978e | 2016-03-15 15:05:40 +0100 | [diff] [blame] | 1163 | |
Wenzel Jakob | 978e376 | 2016-04-07 18:00:41 +0200 | [diff] [blame] | 1164 | Naturally, both methods will fail when there are cyclic dependencies. |
| 1165 | |
Wenzel Jakob | 90d2f5e | 2016-04-11 14:30:11 +0200 | [diff] [blame] | 1166 | Note that compiling code which has its default symbol visibility set to |
| 1167 | *hidden* (e.g. via the command line flag ``-fvisibility=hidden`` on GCC/Clang) can interfere with the |
| 1168 | ability to access types defined in another extension module. Workarounds |
| 1169 | include changing the global symbol visibility (not recommended, because it will |
| 1170 | lead unnecessarily large binaries) or manually exporting types that are |
| 1171 | accessed by multiple extension modules: |
| 1172 | |
| 1173 | .. code-block:: cpp |
| 1174 | |
| 1175 | #ifdef _WIN32 |
| 1176 | # define EXPORT_TYPE __declspec(dllexport) |
| 1177 | #else |
| 1178 | # define EXPORT_TYPE __attribute__ ((visibility("default"))) |
| 1179 | #endif |
| 1180 | |
| 1181 | class EXPORT_TYPE Dog : public Animal { |
| 1182 | ... |
| 1183 | }; |
| 1184 | |
| 1185 | |
Wenzel Jakob | eda978e | 2016-03-15 15:05:40 +0100 | [diff] [blame] | 1186 | Treating STL data structures as opaque objects |
| 1187 | ============================================== |
| 1188 | |
| 1189 | pybind11 heavily relies on a template matching mechanism to convert parameters |
| 1190 | and return values that are constructed from STL data types such as vectors, |
| 1191 | linked lists, hash tables, etc. This even works in a recursive manner, for |
| 1192 | instance to deal with lists of hash maps of pairs of elementary and custom |
| 1193 | types, etc. |
| 1194 | |
Wenzel Jakob | 0871228 | 2016-04-22 16:52:15 +0200 | [diff] [blame] | 1195 | However, a fundamental limitation of this approach is that internal conversions |
| 1196 | between Python and C++ types involve a copy operation that prevents |
Wenzel Jakob | 978e376 | 2016-04-07 18:00:41 +0200 | [diff] [blame] | 1197 | pass-by-reference semantics. What does this mean? |
Wenzel Jakob | eda978e | 2016-03-15 15:05:40 +0100 | [diff] [blame] | 1198 | |
| 1199 | Suppose we bind the following function |
| 1200 | |
| 1201 | .. code-block:: cpp |
| 1202 | |
| 1203 | void append_1(std::vector<int> &v) { |
| 1204 | v.push_back(1); |
| 1205 | } |
| 1206 | |
Wenzel Jakob | 06f56ee | 2016-04-28 16:25:24 +0200 | [diff] [blame] | 1207 | and call it from Python, the following happens: |
Wenzel Jakob | eda978e | 2016-03-15 15:05:40 +0100 | [diff] [blame] | 1208 | |
| 1209 | .. code-block:: python |
| 1210 | |
| 1211 | >>> v = [5, 6] |
| 1212 | >>> append_1(v) |
| 1213 | >>> print(v) |
| 1214 | [5, 6] |
| 1215 | |
| 1216 | As you can see, when passing STL data structures by reference, modifications |
Wenzel Jakob | 0871228 | 2016-04-22 16:52:15 +0200 | [diff] [blame] | 1217 | are not propagated back the Python side. A similar situation arises when |
| 1218 | exposing STL data structures using the ``def_readwrite`` or ``def_readonly`` |
| 1219 | functions: |
Wenzel Jakob | eda978e | 2016-03-15 15:05:40 +0100 | [diff] [blame] | 1220 | |
Wenzel Jakob | 0871228 | 2016-04-22 16:52:15 +0200 | [diff] [blame] | 1221 | .. code-block:: cpp |
| 1222 | |
| 1223 | /* ... definition ... */ |
| 1224 | |
| 1225 | class MyClass { |
| 1226 | std::vector<int> contents; |
| 1227 | }; |
| 1228 | |
| 1229 | /* ... binding code ... */ |
| 1230 | |
| 1231 | py::class_<MyClass>(m, "MyClass") |
| 1232 | .def(py::init<>) |
| 1233 | .def_readwrite("contents", &MyClass::contents); |
| 1234 | |
| 1235 | In this case, properties can be read and written in their entirety. However, an |
| 1236 | ``append`` operaton involving such a list type has no effect: |
| 1237 | |
| 1238 | .. code-block:: python |
| 1239 | |
| 1240 | >>> m = MyClass() |
| 1241 | >>> m.contents = [5, 6] |
| 1242 | >>> print(m.contents) |
| 1243 | [5, 6] |
| 1244 | >>> m.contents.append(7) |
| 1245 | >>> print(m.contents) |
| 1246 | [5, 6] |
| 1247 | |
Wenzel Jakob | 06f56ee | 2016-04-28 16:25:24 +0200 | [diff] [blame] | 1248 | To deal with both of the above situations, pybind11 provides a macro named |
| 1249 | ``PYBIND11_MAKE_OPAQUE(T)`` that disables the template-based conversion |
| 1250 | machinery of types, thus rendering them *opaque*. The contents of opaque |
| 1251 | objects are never inspected or extracted, hence they can be passed by |
| 1252 | reference. For instance, to turn ``std::vector<int>`` into an opaque type, add |
| 1253 | the declaration |
Wenzel Jakob | eda978e | 2016-03-15 15:05:40 +0100 | [diff] [blame] | 1254 | |
| 1255 | .. code-block:: cpp |
| 1256 | |
Wenzel Jakob | 06f56ee | 2016-04-28 16:25:24 +0200 | [diff] [blame] | 1257 | PYBIND11_MAKE_OPAQUE(std::vector<int>); |
Wenzel Jakob | eda978e | 2016-03-15 15:05:40 +0100 | [diff] [blame] | 1258 | |
Wenzel Jakob | 8e93df8 | 2016-05-01 02:36:58 +0200 | [diff] [blame] | 1259 | before any binding code (e.g. invocations to ``class_::def()``, etc.). This |
Wenzel Jakob | 06f56ee | 2016-04-28 16:25:24 +0200 | [diff] [blame] | 1260 | macro must be specified at the top level, since instantiates a partial template |
| 1261 | overload. If your binding code consists of multiple compilation units, it must |
| 1262 | be present in every file preceding any usage of ``std::vector<int>``. Opaque |
| 1263 | types must also have a corresponding ``class_`` declaration to associate them |
| 1264 | with a name in Python, and to define a set of available operations: |
| 1265 | |
| 1266 | .. code-block:: cpp |
| 1267 | |
| 1268 | py::class_<std::vector<int>>(m, "IntVector") |
| 1269 | .def(py::init<>()) |
| 1270 | .def("clear", &std::vector<int>::clear) |
| 1271 | .def("pop_back", &std::vector<int>::pop_back) |
| 1272 | .def("__len__", [](const std::vector<int> &v) { return v.size(); }) |
| 1273 | .def("__iter__", [](std::vector<int> &v) { |
| 1274 | return py::make_iterator(v.begin(), v.end()); |
| 1275 | }, py::keep_alive<0, 1>()) /* Keep vector alive while iterator is used */ |
| 1276 | // .... |
| 1277 | |
Wenzel Jakob | eda978e | 2016-03-15 15:05:40 +0100 | [diff] [blame] | 1278 | |
| 1279 | .. seealso:: |
| 1280 | |
| 1281 | The file :file:`example/example14.cpp` contains a complete example that |
Wenzel Jakob | 0871228 | 2016-04-22 16:52:15 +0200 | [diff] [blame] | 1282 | demonstrates how to create and expose opaque types using pybind11 in more |
| 1283 | detail. |
Wenzel Jakob | 1c329aa | 2016-04-13 02:37:36 +0200 | [diff] [blame] | 1284 | |
| 1285 | Pickling support |
| 1286 | ================ |
| 1287 | |
| 1288 | Python's ``pickle`` module provides a powerful facility to serialize and |
| 1289 | de-serialize a Python object graph into a binary data stream. To pickle and |
Wenzel Jakob | 3d0e6ff | 2016-04-13 11:48:10 +0200 | [diff] [blame] | 1290 | unpickle C++ classes using pybind11, two additional functions must be provided. |
Wenzel Jakob | 1c329aa | 2016-04-13 02:37:36 +0200 | [diff] [blame] | 1291 | Suppose the class in question has the following signature: |
| 1292 | |
| 1293 | .. code-block:: cpp |
| 1294 | |
| 1295 | class Pickleable { |
| 1296 | public: |
| 1297 | Pickleable(const std::string &value) : m_value(value) { } |
| 1298 | const std::string &value() const { return m_value; } |
| 1299 | |
| 1300 | void setExtra(int extra) { m_extra = extra; } |
| 1301 | int extra() const { return m_extra; } |
| 1302 | private: |
| 1303 | std::string m_value; |
| 1304 | int m_extra = 0; |
| 1305 | }; |
| 1306 | |
| 1307 | The binding code including the requisite ``__setstate__`` and ``__getstate__`` methods [#f2]_ |
| 1308 | looks as follows: |
| 1309 | |
| 1310 | .. code-block:: cpp |
| 1311 | |
| 1312 | py::class_<Pickleable>(m, "Pickleable") |
| 1313 | .def(py::init<std::string>()) |
| 1314 | .def("value", &Pickleable::value) |
| 1315 | .def("extra", &Pickleable::extra) |
| 1316 | .def("setExtra", &Pickleable::setExtra) |
| 1317 | .def("__getstate__", [](const Pickleable &p) { |
| 1318 | /* Return a tuple that fully encodes the state of the object */ |
| 1319 | return py::make_tuple(p.value(), p.extra()); |
| 1320 | }) |
| 1321 | .def("__setstate__", [](Pickleable &p, py::tuple t) { |
| 1322 | if (t.size() != 2) |
| 1323 | throw std::runtime_error("Invalid state!"); |
| 1324 | |
Wenzel Jakob | d40885a | 2016-04-13 13:30:05 +0200 | [diff] [blame] | 1325 | /* Invoke the in-place constructor. Note that this is needed even |
| 1326 | when the object just has a trivial default constructor */ |
Wenzel Jakob | 1c329aa | 2016-04-13 02:37:36 +0200 | [diff] [blame] | 1327 | new (&p) Pickleable(t[0].cast<std::string>()); |
| 1328 | |
| 1329 | /* Assign any additional state */ |
| 1330 | p.setExtra(t[1].cast<int>()); |
| 1331 | }); |
| 1332 | |
| 1333 | An instance can now be pickled as follows: |
| 1334 | |
| 1335 | .. code-block:: python |
| 1336 | |
| 1337 | try: |
| 1338 | import cPickle as pickle # Use cPickle on Python 2.7 |
| 1339 | except ImportError: |
| 1340 | import pickle |
| 1341 | |
| 1342 | p = Pickleable("test_value") |
| 1343 | p.setExtra(15) |
Wenzel Jakob | 81e0975 | 2016-04-30 23:13:03 +0200 | [diff] [blame] | 1344 | data = pickle.dumps(p, 2) |
Wenzel Jakob | 1c329aa | 2016-04-13 02:37:36 +0200 | [diff] [blame] | 1345 | |
Wenzel Jakob | 81e0975 | 2016-04-30 23:13:03 +0200 | [diff] [blame] | 1346 | Note that only the cPickle module is supported on Python 2.7. The second |
| 1347 | argument to ``dumps`` is also crucial: it selects the pickle protocol version |
| 1348 | 2, since the older version 1 is not supported. Newer versions are also fineāfor |
| 1349 | instance, specify ``-1`` to always use the latest available version. Beware: |
| 1350 | failure to follow these instructions will cause important pybind11 memory |
| 1351 | allocation routines to be skipped during unpickling, which will likely lead to |
| 1352 | memory corruption and/or segmentation faults. |
Wenzel Jakob | 1c329aa | 2016-04-13 02:37:36 +0200 | [diff] [blame] | 1353 | |
| 1354 | .. seealso:: |
| 1355 | |
| 1356 | The file :file:`example/example15.cpp` contains a complete example that |
| 1357 | demonstrates how to pickle and unpickle types using pybind11 in more detail. |
| 1358 | |
| 1359 | .. [#f2] http://docs.python.org/3/library/pickle.html#pickling-class-instances |
Wenzel Jakob | ef7a9b9 | 2016-04-13 18:41:59 +0200 | [diff] [blame] | 1360 | |
| 1361 | Generating documentation using Sphinx |
| 1362 | ===================================== |
| 1363 | |
| 1364 | Sphinx [#f3]_ has the ability to inspect the signatures and documentation |
| 1365 | strings in pybind11-based extension modules to automatically generate beautiful |
| 1366 | documentation in a variety formats. The pbtest repository [#f4]_ contains a |
| 1367 | simple example repository which uses this approach. |
| 1368 | |
| 1369 | There are two potential gotchas when using this approach: first, make sure that |
| 1370 | the resulting strings do not contain any :kbd:`TAB` characters, which break the |
| 1371 | docstring parsing routines. You may want to use C++11 raw string literals, |
| 1372 | which are convenient for multi-line comments. Conveniently, any excess |
| 1373 | indentation will be automatically be removed by Sphinx. However, for this to |
| 1374 | work, it is important that all lines are indented consistently, i.e.: |
| 1375 | |
| 1376 | .. code-block:: cpp |
| 1377 | |
| 1378 | // ok |
| 1379 | m.def("foo", &foo, R"mydelimiter( |
| 1380 | The foo function |
| 1381 | |
| 1382 | Parameters |
| 1383 | ---------- |
| 1384 | )mydelimiter"); |
| 1385 | |
| 1386 | // *not ok* |
| 1387 | m.def("foo", &foo, R"mydelimiter(The foo function |
| 1388 | |
| 1389 | Parameters |
| 1390 | ---------- |
| 1391 | )mydelimiter"); |
| 1392 | |
| 1393 | .. [#f3] http://www.sphinx-doc.org |
| 1394 | .. [#f4] http://github.com/pybind/pbtest |
| 1395 | |