Blame - lldb/docs/use/variable.rst - toolchain/llvm-project

blob: 7d496c37bd4df157a7f6cd8fb9ab061db00a0511 [file] [log] [blame]

Jonas Devlieghere	edb874b	2019-01-30 18:51:40 +0000	[diff] [blame]	1	Variable Formatting
				2	===================
				3
				4	.. contents::
				5	:local:
				6
				7	LLDB has a data formatters subsystem that allows users to define custom display
				8	options for their variables.
				9
				10	Usually, when you type frame variable or run some expression LLDB will
				11	automatically choose the way to display your results on a per-type basis, as in
				12	the following example:
				13
				14	::
				15
				16	(lldb) frame variable
				17	(uint8_t) x = 'a'
				18	(intptr_t) y = 124752287
				19
				20	However, in certain cases, you may want to associate a different style to the display for certain datatypes. To do so, you need to give hints to the debugger
				21	as to how variables should be displayed. The LLDB type command allows you to do
				22	just that.
				23
				24	Using it you can change your visualization to look like this:
				25
				26	::
				27
				28	(lldb) frame variable
				29	(uint8_t) x = chr='a' dec=65 hex=0x41
				30	(intptr_t) y = 0x76f919f
				31
				32	There are several features related to data visualization: formats, summaries,
				33	filters, synthetic children.
				34
				35	To reflect this, the type command has five subcommands:
				36
				37	::
				38
				39	type format
				40	type summary
				41	type filter
				42	type synthetic
				43	type category
				44
				45	These commands are meant to bind printing options to types. When variables are
				46	printed, LLDB will first check if custom printing options have been associated
				47	to a variable's type and, if so, use them instead of picking the default
				48	choices.
				49
				50	Each of the commands (except ``type category``) has four subcommands available:
				51
				52	- ``add``: associates a new printing option to one or more types
				53	- ``delete``: deletes an existing association
				54	- ``list``: provides a listing of all associations
				55	- ``clear``: deletes all associations
				56
				57	Type Format
				58	-----------
				59
				60	Type formats enable you to quickly override the default format for displaying
				61	primitive types (the usual basic C/C++/ObjC types: int, float, char, ...).
				62
				63	If for some reason you want all int variables in your program to print out as
				64	hex, you can add a format to the int type.
				65
				66	This is done by typing
				67
				68	::
				69
				70	(lldb) type format add --format hex int
				71
				72	at the LLDB command line.
				73
				74	The ``--format`` (which you can shorten to -f) option accepts a :doc:`format
				75	name<formatting>`. Then, you provide one or more types to which you want the
				76	new format applied.
				77
				78	A frequent scenario is that your program has a typedef for a numeric type that
				79	you know represents something that must be printed in a certain way. Again, you
				80	can add a format just to that typedef by using type format add with the name
				81	alias.
				82
				83	But things can quickly get hierarchical. Let's say you have a situation like
				84	the following:
				85
				86	::
				87
				88	typedef int A;
				89	typedef A B;
				90	typedef B C;
				91	typedef C D;
				92
				93	and you want to show all A's as hex, all C's as byte arrays and leave the
				94	defaults untouched for other types (albeit its contrived look, the example is
				95	far from unrealistic in large software systems).
				96
				97	If you simply type
				98
				99	::
				100
				101	(lldb) type format add -f hex A
				102	(lldb) type format add -f uint8_t[] C
				103
				104	values of type B will be shown as hex and values of type D as byte arrays, as in:
				105
				106	::
				107
				108	(lldb) frame variable -T
				109	(A) a = 0x00000001
				110	(B) b = 0x00000002
				111	(C) c = {0x03 0x00 0x00 0x00}
				112	(D) d = {0x04 0x00 0x00 0x00}
				113
				114	This is because by default LLDB cascades formats through typedef chains. In
				115	order to avoid that you can use the option -C no to prevent cascading, thus
				116	making the two commands required to achieve your goal:
				117
				118	::
				119
				120	(lldb) type format add -C no -f hex A
				121	(lldb) type format add -C no -f uint8_t[] C
				122
				123
				124	which provides the desired output:
				125
				126	::
				127
				128	(lldb) frame variable -T
				129	(A) a = 0x00000001
				130	(B) b = 2
				131	(C) c = {0x03 0x00 0x00 0x00}
				132	(D) d = 4
				133
				134	Two additional options that you will want to look at are --skip-pointers (-p)
				135	and --skip-references (-r). These two options prevent LLDB from applying a
				136	format for type T to values of type T* and T& respectively.
				137
				138	::
				139
				140	(lldb) type format add -f float32[] int
				141	(lldb) frame variable pointer *pointer -T
				142	(int *) pointer = {1.46991e-39 1.4013e-45}
				143	(int) *pointer = {1.53302e-42}
				144	(lldb) type format add -f float32[] int -p
				145	(lldb) frame variable pointer *pointer -T
				146	(int *) pointer = 0x0000000100100180
				147	(int) *pointer = {1.53302e-42}
				148
				149	While they can be applied to pointers and references, formats will make no
				150	attempt to dereference the pointer and extract the value before applying the
				151	format, which means you are effectively formatting the address stored in the
				152	pointer rather than the pointee value. For this reason, you may want to use the
				153	-p option when defining formats.
				154
				155	If you need to delete a custom format simply type type format delete followed
				156	by the name of the type to which the format applies.Even if you defined the
				157	same format for multiple types on the same command, type format delete will
				158	only remove the format for the type name passed as argument.
				159
				160	To delete ALL formats, use ``type format clear``. To see all the formats
				161	defined, use type format list.
				162
				163	If all you need to do, however, is display one variable in a custom format,
				164	while leaving the others of the same type untouched, you can simply type:
				165
				166	::
				167
				168	(lldb) frame variable counter -f hex
				169
				170	This has the effect of displaying the value of counter as an hexadecimal
				171	number, and will keep showing it this way until you either pick a different
				172	format or till you let your program run again.
				173
				174	Finally, this is a list of formatting options available out of which you can
				175	pick:
				176
				177	+-----------------------------------------------+------------------+--------------------------------------------------------------------------+
				178	\| Format name \| Abbreviation \| Description \|
				179	+-----------------------------------------------+------------------+--------------------------------------------------------------------------+
				180	\| ``default`` \| \| the default LLDB algorithm is used to pick a format \|
				181	+-----------------------------------------------+------------------+--------------------------------------------------------------------------+
				182	\| ``boolean`` \| B \| show this as a true/false boolean, using the customary rule that 0 is \|
				183	\| \| \| false and everything else is true \|
				184	+-----------------------------------------------+------------------+--------------------------------------------------------------------------+
				185	\| ``binary`` \| b \| show this as a sequence of bits \|
				186	+-----------------------------------------------+------------------+--------------------------------------------------------------------------+
				187	\| ``bytes`` \| y \| show the bytes one after the other \|
				188	+-----------------------------------------------+------------------+--------------------------------------------------------------------------+
				189	\| ``bytes with ASCII`` \| Y \| show the bytes, but try to display them as ASCII characters as well \|
				190	+-----------------------------------------------+------------------+--------------------------------------------------------------------------+
				191	\| ``character`` \| c \| show the bytes as ASCII characters \|
				192	+-----------------------------------------------+------------------+--------------------------------------------------------------------------+
				193	\| ``printable character`` \| C \| show the bytes as printable ASCII characters \|
				194	+-----------------------------------------------+------------------+--------------------------------------------------------------------------+
				195	\| ``complex float`` \| F \| interpret this value as the real and imaginary part of a complex \|
				196	\| \| \| floating-point number \|
				197	+-----------------------------------------------+------------------+--------------------------------------------------------------------------+
				198	\| ``c-string`` \| s \| show this as a 0-terminated C string \|
				199	+-----------------------------------------------+------------------+--------------------------------------------------------------------------+
				200	\| ``decimal`` \| i \| show this as a signed integer number (this does not perform a cast, it \|
				201	\| \| \| simply shows the bytes as an integer with sign) \|
				202	+-----------------------------------------------+------------------+--------------------------------------------------------------------------+
				203	\| ``enumeration`` \| E \| show this as an enumeration, printing the \|
				204	\| \| \| value's name if available or the integer value otherwise \|
				205	+-----------------------------------------------+------------------+--------------------------------------------------------------------------+
				206	\| ``hex`` \| x \| show this as in hexadecimal notation (this does \|
				207	\| \| \| not perform a cast, it simply shows the bytes as hex) \|
				208	+-----------------------------------------------+------------------+--------------------------------------------------------------------------+
				209	\| ``float`` \| f \| show this as a floating-point number (this does not perform a cast, it \|
				210	\| \| \| simply interprets the bytes as an IEEE754 floating-point value) \|
				211	+-----------------------------------------------+------------------+--------------------------------------------------------------------------+
				212	\| ``octal`` \| o \| show this in octal notation \|
				213	+-----------------------------------------------+------------------+--------------------------------------------------------------------------+
				214	\| ``OSType`` \| O \| show this as a MacOS OSType \|
				215	+-----------------------------------------------+------------------+--------------------------------------------------------------------------+
				216	\| ``unicode16`` \| U \| show this as UTF-16 characters \|
				217	+-----------------------------------------------+------------------+--------------------------------------------------------------------------+
				218	\| ``unicode32`` \| \| show this as UTF-32 characters \|
				219	+-----------------------------------------------+------------------+--------------------------------------------------------------------------+
				220	\| ``unsigned decimal`` \| u \| show this as an unsigned integer number (this does not perform a cast, \|
				221	\| \| \| it simply shows the bytes as unsigned integer) \|
				222	+-----------------------------------------------+------------------+--------------------------------------------------------------------------+
				223	\| ``pointer`` \| p \| show this as a native pointer (unless this is really a pointer, the \|
				224	\| \| \| resulting address will probably be invalid) \|
				225	+-----------------------------------------------+------------------+--------------------------------------------------------------------------+
				226	\| ``char[]`` \| \| show this as an array of characters \|
				227	+-----------------------------------------------+------------------+--------------------------------------------------------------------------+
				228	\| ``int8_t[], uint8_t[]`` \| \| show this as an array of the corresponding integer type \|
				229	\| ``int16_t[], uint16_t[]`` \| \| \|
				230	\| ``int32_t[], uint32_t[]`` \| \| \|
				231	\| ``int64_t[], uint64_t[]`` \| \| \|
				232	\| ``uint128_t[]`` \| \| \|
				233	+-----------------------------------------------+------------------+--------------------------------------------------------------------------+
				234	\| ``float32[], float64[]`` \| \| show this as an array of the corresponding \|
				235	\| \| \| floating-point type \|
				236	+-----------------------------------------------+------------------+--------------------------------------------------------------------------+
				237	\| ``complex integer`` \| I \| interpret this value as the real and imaginary part of a complex integer \|
				238	\| \| \| number \|
				239	+-----------------------------------------------+------------------+--------------------------------------------------------------------------+
				240	\| ``character array`` \| a \| show this as a character array \|
				241	+-----------------------------------------------+------------------+--------------------------------------------------------------------------+
				242
				243	Type Summary
				244	------------
				245
				246	Type formats work by showing a different kind of display for the value of a
				247	variable. However, they only work for basic types. When you want to display a
				248	class or struct in a custom format, you cannot do that using formats.
				249
				250	A different feature, type summaries, works by extracting information from
				251	classes, structures, ... (aggregate types) and arranging it in a user-defined
				252	format, as in the following example:
				253
				254	before adding a summary...
				255
				256	::
				257
				258	(lldb) frame variable -T one
				259	(i_am_cool) one = {
				260	(int) x = 3
				261	(float) y = 3.14159
				262	(char) z = 'E'
				263	}
				264
				265	after adding a summary...
				266
				267	::
				268
				269	(lldb) frame variable one
				270	(i_am_cool) one = int = 3, float = 3.14159, char = 69
				271
				272	There are two ways to use type summaries: the first one is to bind a summary
				273	string to the type; the second is to write a Python script that returns the
				274	string to be used as summary. Both options are enabled by the type summary add
				275	command.
				276
				277	The command to obtain the output shown in the example is:
				278
				279	::
				280
				281	(lldb) type summary add --summary-string "int = ${var.x}, float = ${var.y}, char = ${var.z%u}" i_am_cool
				282
				283	Initially, we will focus on summary strings, and then describe the Python
				284	binding mechanism.
				285
				286	Summary Strings
				287	---------------
				288
				289	Summary strings are written using a simple control language, exemplified by the
				290	snippet above. A summary string contains a sequence of tokens that are
				291	processed by LLDB to generate the summary.
				292
				293	Summary strings can contain plain text, control characters and special
				294	variables that have access to information about the current object and the
				295	overall program state.
				296
				297	Plain text is any sequence of characters that doesn't contain a ``{``, ``}``, ``$``,
				298	or ``\`` character, which are the syntax control characters.
				299
				300	The special variables are found in between a "${" prefix, and end with a "}"
				301	suffix. Variables can be a simple name or they can refer to complex objects
				302	that have subitems themselves. In other words, a variable looks like
				303	``${object}`` or ``${object.child.otherchild}``. A variable can also be
				304	prefixed or suffixed with other symbols meant to change the way its value is
				305	handled. An example is ``${*var.int_pointer[0-3]}``.
				306
				307	Basically, the syntax is the same one described Frame and Thread Formatting
				308	plus additional symbols specific for summary strings. The main of them is
				309	${var, which is used refer to the variable that a summary is being created for.
				310
				311	The simplest thing you can do is grab a member variable of a class or structure
				312	by typing its expression path. In the previous example, the expression path for
				313	the field float y is simply .y. Thus, to ask the summary string to display y
				314	you would type ${var.y}.
				315
				316	If you have code like the following:
				317
				318	::
				319
				320	struct A {
				321	int x;
				322	int y;
				323	};
				324	struct B {
				325	A x;
				326	A y;
				327	int *z;
				328	};
				329
				330	the expression path for the y member of the x member of an object of type B
				331	would be .x.y and you would type ``${var.x.y}`` to display it in a summary
				332	string for type B.
				333
				334	By default, a summary defined for type T, also works for types T* and T& (you
				335	can disable this behavior if desired). For this reason, expression paths do not
				336	differentiate between . and ->, and the above expression path .x.y would be
				337	just as good if you were displaying a B*, or even if the actual definition of B
				338	were:
				339
				340	::
				341
				342	struct B {
				343	A *x;
				344	A y;
				345	int *z;
				346	};
				347
				348	This is unlike the behavior of frame variable which, on the contrary, will
				349	enforce the distinction. As hinted above, the rationale for this choice is that
				350	waiving this distinction enables you to write a summary string once for type T
				351	and use it for both T and T* instances. As a summary string is mostly about
				352	extracting nested members' information, a pointer to an object is just as good
				353	as the object itself for the purpose.
				354
				355	If you need to access the value of the integer pointed to by B::z, you cannot
				356	simply say ${var.z} because that symbol refers to the pointer z. In order to
				357	dereference it and get the pointed value, you should say ``${*var.z}``. The
				358	``${*var`` tells LLDB to get the object that the expression paths leads to, and
				359	then dereference it. In this example is it equivalent to ``*(bObject.z)`` in
				360	C/C++ syntax. Because . and -> operators can both be used, there is no need to
				361	have dereferences in the middle of an expression path (e.g. you do not need to
				362	type ``${(var.x).x}``) to read A::x as contained in ``(B::x)``. To achieve
				363	that effect you can simply write ``${var.x->x}``, or even ``${var.x.x}``. The
				364	``*`` operator only binds to the result of the whole expression path, rather
				365	than piecewise, and there is no way to use parentheses to change that behavior.
				366
				367	Of course, a summary string can contain more than one ${var specifier, and can
				368	use ``${var`` and ``${*var`` specifiers together.
				369
				370	Formatting Summary Elements
				371	---------------------------
				372
				373	An expression path can include formatting codes. Much like the type formats
				374	discussed previously, you can also customize the way variables are displayed in
				375	summary strings, regardless of the format they have applied to their types. To
				376	do that, you can use %format inside an expression path, as in ${var.x->x%u},
				377	which would display the value of x as an unsigned integer.
				378
				379	You can also use some other special format markers, not available for formats
				380	themselves, but which carry a special meaning when used in this context:
				381
				382	+------------+--------------------------------------------------------------------------+
				383	\| Symbol \| Description \|
				384	+------------+--------------------------------------------------------------------------+
				385	\| ``Symbol`` \| ``Description`` \|
				386	+------------+--------------------------------------------------------------------------+
				387	\| ``%S`` \| Use this object's summary (the default for aggregate types) \|
				388	+------------+--------------------------------------------------------------------------+
				389	\| ``%V`` \| Use this object's value (the default for non-aggregate types) \|
				390	+------------+--------------------------------------------------------------------------+
				391	\| ``%@`` \| Use a language-runtime specific description (for C++ this does nothing, \|
				392	\| \| for Objective-C it calls the NSPrintForDebugger API) \|
				393	+------------+--------------------------------------------------------------------------+
				394	\| ``%L`` \| Use this object's location (memory address, register name, ...) \|
				395	+------------+--------------------------------------------------------------------------+
				396	\| ``%#`` \| Use the count of the children of this object \|
				397	+------------+--------------------------------------------------------------------------+
				398	\| ``%T`` \| Use this object's datatype name \|
				399	+------------+--------------------------------------------------------------------------+
				400	\| ``%N`` \| Print the variable's basename \|
				401	+------------+--------------------------------------------------------------------------+
				402	\| ``%>`` \| Print the expression path for this item \|
				403	+------------+--------------------------------------------------------------------------+
				404
				405	Starting with SVN r228207, you can also specify
				406	``${script.var:pythonFuncName}``. Previously, back to r220821, this was
				407	specified with a different syntax: ``${var.script:pythonFuncName}``.
				408
				409	It is expected that the function name you use specifies a function whose
				410	signature is the same as a Python summary function. The return string from the
				411	function will be placed verbatim in the output.
				412
				413	You cannot use element access, or formatting symbols, in combination with this
				414	syntax. For example the following:
				415
				416	::
				417
				418	${script.var.element[0]:myFunctionName%@}
				419
				420	is not valid and will cause the summary to fail to evaluate.
				421
				422
				423	Element Inlining
				424	----------------
				425
				426	Option --inline-children (-c) to type summary add tells LLDB not to look for a summary string, but instead to just print a listing of all the object's children on one line.
				427
				428	As an example, given a type pair:
				429
				430	::
				431
				432	(lldb) frame variable --show-types a_pair
				433	(pair) a_pair = {
				434	(int) first = 1;
				435	(int) second = 2;
				436	}
				437
				438	If one types the following commands:
				439
				440	::
				441
				442	(lldb) type summary add --inline-children pair
				443
				444	the output becomes:
				445
				446	::
				447
				448	(lldb) frame variable a_pair
				449	(pair) a_pair = (first=1, second=2)
				450
				451
				452	Of course, one can obtain the same effect by typing
				453
				454	::
				455
				456	(lldb) type summary add pair --summary-string "(first=${var.first}, second=${var.second})"
				457
				458	While the final result is the same, using --inline-children can often save
				459	time. If one does not need to see the names of the variables, but just their
				460	values, the option --omit-names (-O, uppercase letter o), can be combined with
				461	--inline-children to obtain:
				462
				463	::
				464
				465	(lldb) frame variable a_pair
				466	(pair) a_pair = (1, 2)
				467
				468	which is of course the same as typing
				469
				470	::
				471
				472	(lldb) type summary add pair --summary-string "(${var.first}, ${var.second})"
				473
				474	Bitfields And Array Syntax
				475	--------------------------
				476
				477	Sometimes, a basic type's value actually represents several different values
				478	packed together in a bitfield.
				479
				480	With the classical view, there is no way to look at them. Hexadecimal display
				481	can help, but if the bits actually span nibble boundaries, the help is limited.
				482
				483	Binary view would show it all without ambiguity, but is often too detailed and
				484	hard to read for real-life scenarios.
				485
				486	To cope with the issue, LLDB supports native bitfield formatting in summary
				487	strings. If your expression paths leads to a so-called scalar type (the usual
				488	int, float, char, double, short, long, long long, double, long double and
				489	unsigned variants), you can ask LLDB to only grab some bits out of the value
				490	and display them in any format you like. If you only need one bit you can use
				491	the [n], just like indexing an array. To extract multiple bits, you can use a
				492	slice-like syntax: [n-m], e.g.
				493
				494	::
				495
				496	(lldb) frame variable float_point
				497	(float) float_point = -3.14159
				498
				499	::
				500
				501	(lldb) type summary add --summary-string "Sign: ${var[31]%B} Exponent: ${var[30-23]%x} Mantissa: ${var[0-22]%u}" float
				502	(lldb) frame variable float_point
				503	(float) float_point = -3.14159 Sign: true Exponent: 0x00000080 Mantissa: 4788184
				504
				505	In this example, LLDB shows the internal representation of a float variable by
				506	extracting bitfields out of a float object.
				507
				508	When typing a range, the extremes n and m are always included, and the order of
				509	the indices is irrelevant.
				510
				511	LLDB also allows to use a similar syntax to display array members inside a summary string. For instance, you may want to display all arrays of a given type using a more compact notation than the default, and then just delve into individual array members that prove interesting to your debugging task. You can tell LLDB to format arrays in special ways, possibly independent of the way the array members' datatype is formatted.
				512	e.g.
				513
				514	::
				515
				516	(lldb) frame variable sarray
				517	(Simple [3]) sarray = {
				518	[0] = {
				519	x = 1
				520	y = 2
				521	z = '\x03'
				522	}
				523	[1] = {
				524	x = 4
				525	y = 5
				526	z = '\x06'
				527	}
				528	[2] = {
				529	x = 7
				530	y = 8
				531	z = '\t'
				532	}
				533	}
				534
				535	(lldb) type summary add --summary-string "${var[].x}" "Simple [3]"
				536
				537	(lldb) frame variable sarray
				538	(Simple [3]) sarray = [1,4,7]
				539
				540	The [] symbol amounts to: if var is an array and I know its size, apply this summary string to every element of the array. Here, we are asking LLDB to display .x for every element of the array, and in fact this is what happens. If you find some of those integers anomalous, you can then inspect that one item in greater detail, without the array format getting in the way:
				541
				542	::
				543
				544	(lldb) frame variable sarray[1]
				545	(Simple) sarray[1] = {
				546	x = 4
				547	y = 5
				548	z = '\x06'
				549	}
				550
				551	You can also ask LLDB to only print a subset of the array range by using the
				552	same syntax used to extract bit for bitfields:
				553
				554	::
				555
				556	(lldb) type summary add --summary-string "${var[1-2].x}" "Simple [3]"
				557
				558	(lldb) frame variable sarray
				559	(Simple [3]) sarray = [4,7]
				560
				561	If you are dealing with a pointer that you know is an array, you can use this
				562	syntax to display the elements contained in the pointed array instead of just
				563	the pointer value. However, because pointers have no notion of their size, the
				564	empty brackets [] operator does not work, and you must explicitly provide
				565	higher and lower bounds.
				566
				567	In general, LLDB needs the square brackets operator [] in order to handle
				568	arrays and pointers correctly, and for pointers it also needs a range. However,
				569	a few special cases are defined to make your life easier:
				570
				571	you can print a 0-terminated string (C-string) using the %s format, omitting
				572	square brackets, as in:
				573
				574	::
				575
				576	(lldb) type summary add --summary-string "${var%s}" "char *"
				577
				578	This syntax works for char* as well as for char[] because LLDB can rely on the
				579	final \0 terminator to know when the string has ended.
				580
				581	LLDB has default summary strings for char* and char[] that use this special
				582	case. On debugger startup, the following are defined automatically:
				583
				584	::
				585
				586	(lldb) type summary add --summary-string "${var%s}" "char *"
				587	(lldb) type summary add --summary-string "${var%s}" -x "char \[[0-9]+]"
				588
				589	any of the array formats (int8_t[], float32{}, ...), and the y, Y and a formats
				590	work to print an array of a non-aggregate type, even if square brackets are
				591	omitted.
				592
				593	::
				594
				595	(lldb) type summary add --summary-string "${var%int32_t[]}" "int [10]"
				596
				597	This feature, however, is not enabled for pointers because there is no way for
				598	LLDB to detect the end of the pointed data.
				599
				600	This also does not work for other formats (e.g. boolean), and you must specify
				601	the square brackets operator to get the expected output.
				602
				603	Python Scripting
				604	----------------
				605
				606	Most of the times, summary strings prove good enough for the job of summarizing
				607	the contents of a variable. However, as soon as you need to do more than
				608	picking some values and rearranging them for display, summary strings stop
				609	being an effective tool. This is because summary strings lack the power to
				610	actually perform any kind of computation on the value of variables.
				611
				612	To solve this issue, you can bind some Python scripting code as a summary for
				613	your datatype, and that script has the ability to both extract children
				614	variables as the summary strings do and to perform active computation on the
				615	extracted values. As a small example, let's say we have a Rectangle class:
				616
				617	::
				618
				619
				620	class Rectangle
				621	{
				622	private:
				623	int height;
				624	int width;
				625	public:
				626	Rectangle() : height(3), width(5) {}
				627	Rectangle(int H) : height(H), width(H*2-1) {}
				628	Rectangle(int H, int W) : height(H), width(W) {}
				629	int GetHeight() { return height; }
				630	int GetWidth() { return width; }
				631	};
				632
				633	Summary strings are effective to reduce the screen real estate used by the
				634	default viewing mode, but are not effective if we want to display the area and
				635	perimeter of Rectangle objects
				636
				637	To obtain this, we can simply attach a small Python script to the Rectangle
				638	class, as shown in this example:
				639
				640	::
				641
				642	(lldb) type summary add -P Rectangle
				643	Enter your Python command(s). Type 'DONE' to end.
				644	def function (valobj,internal_dict):
				645	height_val = valobj.GetChildMemberWithName('height')
				646	width_val = valobj.GetChildMemberWithName('width')
				647	height = height_val.GetValueAsUnsigned(0)
				648	width = width_val.GetValueAsUnsigned(0)
				649	area = height*width
				650	perimeter = 2*(height + width)
				651	return 'Area: ' + str(area) + ', Perimeter: ' + str(perimeter)
				652	DONE
				653	(lldb) frame variable
				654	(Rectangle) r1 = Area: 20, Perimeter: 18
				655	(Rectangle) r2 = Area: 72, Perimeter: 36
				656	(Rectangle) r3 = Area: 16, Perimeter: 16
				657
				658	In order to write effective summary scripts, you need to know the LLDB public
				659	API, which is the way Python code can access the LLDB object model. For further
				660	details on the API you should look at the LLDB API reference documentation.
				661
				662
				663	As a brief introduction, your script is encapsulated into a function that is
				664	passed two parameters: ``valobj`` and ``internal_dict``.
				665
				666	``internal_dict`` is an internal support parameter used by LLDB and you should
				667	not touch it.
				668
				669	``valobj`` is the object encapsulating the actual variable being displayed, and
				670	its type is SBValue. Out of the many possible operations on an SBValue, the
				671	basic one is retrieve the children objects it contains (essentially, the fields
				672	of the object wrapped by it), by calling ``GetChildMemberWithName()``, passing
				673	it the child's name as a string.
				674
				675	If the variable has a value, you can ask for it, and return it as a string
				676	using ``GetValue()``, or as a signed/unsigned number using
				677	``GetValueAsSigned()``, ``GetValueAsUnsigned()``. It is also possible to
				678	retrieve an SBData object by calling ``GetData()`` and then read the object's
				679	contents out of the SBData.
				680
				681	If you need to delve into several levels of hierarchy, as you can do with
				682	summary strings, you can use the method ``GetValueForExpressionPath()``,
				683	passing it an expression path just like those you could use for summary strings
				684	(one of the differences is that dereferencing a pointer does not occur by
				685	prefixing the path with a ``*```, but by calling the ``Dereference()`` method
				686	on the returned SBValue). If you need to access array slices, you cannot do
				687	that (yet) via this method call, and you must use ``GetChildAtIndex()``
				688	querying it for the array items one by one. Also, handling custom formats is
				689	something you have to deal with on your own.
				690
				691	Other than interactively typing a Python script there are two other ways for
				692	you to input a Python script as a summary:
				693
				694	- using the --python-script option to type summary add and typing the script
				695	code as an option argument; as in:
				696
				697	::
				698
				699	(lldb) type summary add --python-script "height = valobj.GetChildMemberWithName('height').GetValueAsUnsigned(0);width = valobj.GetChildMemberWithName('width').GetValueAsUnsigned(0); return 'Area: %d' % (height*width)" Rectangle
				700
				701
				702	- using the --python-function (-F) option to type summary add and giving the
				703	name of a Python function with the correct prototype. Most probably, you will
				704	define (or have already defined) the function in the interactive interpreter,
				705	or somehow loaded it from a file, using the command script import command.
				706	LLDB will emit a warning if it is unable to find the function you passed, but
				707	will still register the binding.
				708
				709	Starting in SVN r222593, Python summary formatters can optionally define a
				710	third argument: options
				711
				712	This is an object of type ``lldb.SBTypeSummaryOptions`` that can be passed into
				713	the formatter, allowing for a few customizations of the result. The decision to
				714	adopt or not this third argument - and the meaning of options thereof - is
				715	within the individual formatters' writer.
				716
				717	Regular Expression Typenames
				718	----------------------------
				719
				720	As you noticed, in order to associate the custom summary string to the array
				721	types, one must give the array size as part of the typename. This can long
				722	become tiresome when using arrays of different sizes, Simple [3], Simple [9],
				723	Simple [12], ...
				724
				725	If you use the -x option, type names are treated as regular expressions instead
				726	of type names. This would let you rephrase the above example for arrays of type
				727	Simple [3] as:
				728
				729	::
				730	(lldb) type summary add --summary-string "${var[].x}" -x "Simple \[[0-9]+\]"
				731	(lldb) frame variable
				732	(Simple [3]) sarray = [1,4,7]
				733	(Simple [2]) sother = [3,6]
				734
				735	The above scenario works for Simple [3] as well as for any other array of
				736	Simple objects.
				737
				738	While this feature is mostly useful for arrays, you could also use regular
				739	expressions to catch other type sets grouped by name. However, as regular
				740	expression matching is slower than normal name matching, LLDB will first try to
				741	match by name in any way it can, and only when this fails, will it resort to
				742	regular expression matching.
				743
				744	One of the ways LLDB uses this feature internally, is to match the names of STL
				745	container classes, regardless of the template arguments provided. The details
				746	for this are found at FormatManager.cpp
				747
				748	The regular expression language used by LLDB is the POSIX extended language, as
				749	defined by the Single UNIX Specification, of which Mac OS X is a compliant
				750	implementation.
				751
				752	Names Summaries
				753	---------------
				754
				755	For a given type, there may be different meaningful summary representations.
				756	However, currently, only one summary can be associated to a type at each
				757	moment. If you need to temporarily override the association for a variable,
				758	without changing the summary string for to its type, you can use named
				759	summaries.
				760
				761	Named summaries work by attaching a name to a summary when creating it. Then,
				762	when there is a need to attach the summary to a variable, the frame variable
				763	command, supports a --summary option that tells LLDB to use the named summary
				764	given instead of the default one.
				765
				766	::
				767	(lldb) type summary add --summary-string "x=${var.integer}" --name NamedSummary
				768	(lldb) frame variable one
				769	(i_am_cool) one = int = 3, float = 3.14159, char = 69
				770	(lldb) frame variable one --summary NamedSummary
				771	(i_am_cool) one = x=3
				772
				773	When defining a named summary, binding it to one or more types becomes
				774	optional. Even if you bind the named summary to a type, and later change the
				775	summary string for that type, the named summary will not be changed by that.
				776	You can delete named summaries by using the type summary delete command, as if
				777	the summary name was the datatype that the summary is applied to
				778
				779	A summary attached to a variable using the --summary option, has the same
				780	semantics that a custom format attached using the -f option has: it stays
				781	attached till you attach a new one, or till you let your program run again.
				782
				783	Synthetic Children
				784	------------------
				785
				786	Summaries work well when one is able to navigate through an expression path. In
				787	order for LLDB to do so, appropriate debugging information must be available.
				788
				789	Some types are opaque, i.e. no knowledge of their internals is provided. When
				790	that's the case, expression paths do not work correctly.
				791
				792	In other cases, the internals are available to use in expression paths, but
				793	they do not provide a user-friendly representation of the object's value.
				794
				795	For instance, consider an STL vector, as implemented by the GNU C++ Library:
				796
				797	::
				798
				799	(lldb) frame variable numbers -T
				800	(std::vector<int>) numbers = {
				801	(std::_Vector_base<int, std::allocator<int> >) std::_Vector_base<int, std::allocator<int> > = {
				802	(std::_Vector_base<int, std::allocator&tl;int> >::_Vector_impl) _M_impl = {
				803	(int *) _M_start = 0x00000001001008a0
				804	(int *) _M_finish = 0x00000001001008a8
				805	(int *) _M_end_of_storage = 0x00000001001008a8
				806	}
				807	}
				808	}
				809
				810	Here, you can see how the type is implemented, and you can write a summary for
				811	that implementation but that is not going to help you infer what items are
				812	actually stored in the vector.
				813
				814	What you would like to see is probably something like:
				815
				816	::
				817
				818	(lldb) frame variable numbers -T
				819	(std::vector<int>) numbers = {
				820	(int) [0] = 1
				821	(int) [1] = 12
				822	(int) [2] = 123
				823	(int) [3] = 1234
				824	}
				825
				826	Synthetic children are a way to get that result.
				827
				828	The feature is based upon the idea of providing a new set of children for a
				829	variable that replaces the ones available by default through the debug
				830	information. In the example, we can use synthetic children to provide the
				831	vector items as children for the std::vector object.
				832
				833	In order to create synthetic children, you need to provide a Python class that
				834	adheres to a given interface (the word is italicized because Python has no
				835	explicit notion of interface, by that word we mean a given set of methods must
				836	be implemented by the Python class):
				837
				838	::
				839
				840	class SyntheticChildrenProvider:
				841	def __init__(self, valobj, internal_dict):
				842	this call should initialize the Python object using valobj as the variable to provide synthetic children for
				843	def num_children(self):
				844	this call should return the number of children that you want your object to have
				845	def get_child_index(self,name):
				846	this call should return the index of the synthetic child whose name is given as argument
				847	def get_child_at_index(self,index):
				848	this call should return a new LLDB SBValue object representing the child at the index given as argument
				849	def update(self):
				850	this call should be used to update the internal state of this Python object whenever the state of the variables in LLDB changes.[1]
				851	def has_children(self):
				852	this call should return True if this object might have children, and False if this object can be guaranteed not to have children.[2]
				853	def get_value(self):
				854	this call can return an SBValue to be presented as the value of the synthetic value under consideration.[3]
				855
				856	[1] This method is optional. Also, it may optionally choose to return a value
				857	(starting with SVN rev153061/LLDB-134). If it returns a value, and that value
				858	is True, LLDB will be allowed to cache the children and the children count it
				859	previously obtained, and will not return to the provider class to ask. If
				860	nothing, None, or anything other than True is returned, LLDB will discard the
				861	cached information and ask. Regardless, whenever necessary LLDB will call
				862	update.
				863
				864	[2] This method is optional (starting with SVN rev166495/LLDB-175). While
				865	implementing it in terms of num_children is acceptable, implementors are
				866	encouraged to look for optimized coding alternatives whenever reasonable.
				867
				868	[3] This method is optional (starting with SVN revision 219330). The SBValue
				869	you return here will most likely be a numeric type (int, float, ...) as its
				870	value bytes will be used as-if they were the value of the root SBValue proper.
				871	As a shortcut for this, you can inherit from lldb.SBSyntheticValueProvider, and
				872	just define get_value as other methods are defaulted in the superclass as
				873	returning default no-children responses.
				874
				875	If a synthetic child provider supplies a special child named $$dereference$$
				876	then it will be used when evaluating opertaor* and operator-> in the frame
				877	variable command and related SB API functions.
				878
				879	For examples of how synthetic children are created, you are encouraged to look
				880	at examples/synthetic in the LLDB trunk. Please, be aware that the code in
				881	those files (except bitfield/) is legacy code and is not maintained. You may
				882	especially want to begin looking at this example to get a feel for this
				883	feature, as it is a very easy and well commented example.
				884
				885	The design pattern consistently used in synthetic providers shipping with LLDB
				886	is to use the __init__ to store the SBValue instance as a part of self. The
				887	update function is then used to perform the actual initialization. Once a
				888	synthetic children provider is written, one must load it into LLDB before it
				889	can be used. Currently, one can use the LLDB script command to type Python code
				890	interactively, or use the command script import fileName command to load Python
				891	code from a Python module (ordinary rules apply to importing modules this way).
				892	A third option is to type the code for the provider class interactively while
				893	adding it.
				894
				895	For example, let's pretend we have a class Foo for which a synthetic children
				896	provider class Foo_Provider is available, in a Python module contained in file
				897	~/Foo_Tools.py. The following interaction sets Foo_Provider as a synthetic
				898	children provider in LLDB:
				899
				900	::
				901
				902	(lldb) command script import ~/Foo_Tools.py
				903	(lldb) type synthetic add Foo --python-class Foo_Tools.Foo_Provider
				904	(lldb) frame variable a_foo
				905	(Foo) a_foo = {
				906	x = 1
				907	y = "Hello world"
				908	}
				909
				910	LLDB has synthetic children providers for a core subset of STL classes, both in
				911	the version provided by libstdcpp and by libcxx, as well as for several
				912	Foundation classes.
				913
				914	Synthetic children extend summary strings by enabling a new special variable:
				915	``${svar``.
				916
				917	This symbol tells LLDB to refer expression paths to the synthetic children
				918	instead of the real ones. For instance,
				919
				920	::
				921
				922	(lldb) type summary add --expand -x "std::vector<" --summary-string "${svar%#} items"
				923	(lldb) frame variable numbers
				924	(std::vector<int>) numbers = 4 items {
				925	(int) [0] = 1
				926	(int) [1] = 12
				927	(int) [2] = 123
				928	(int) [3] = 1234
				929	}
				930
				931	In some cases, if LLDB is unable to use the real object to get a child
				932	specified in an expression path, it will automatically refer to the synthetic
				933	children. While in summaries it is best to always use ${svar to make your
				934	intentions clearer, interactive debugging can benefit from this behavior, as
				935	in:
				936
				937	::
				938
				939	(lldb) frame variable numbers[0] numbers[1]
				940	(int) numbers[0] = 1
				941	(int) numbers[1] = 12
				942
				943	Unlike many other visualization features, however, the access to synthetic
				944	children only works when using frame variable, and is not supported in
				945	expression:
				946
				947	::
				948
				949	(lldb) expression numbers[0]
				950	Error [IRForTarget]: Call to a function '_ZNSt33vector<int, std::allocator<int> >ixEm' that is not present in the target
				951	error: Couldn't convert the expression to DWARF
				952
				953	The reason for this is that classes might have an overloaded operator [], or
				954	other special provisions and the expression command chooses to ignore synthetic
				955	children in the interest of equivalency with code you asked to have compiled
				956	from source.
				957
				958	Filters
				959	-------
				960
				961	Filters are a solution to the display of complex classes. At times, classes
				962	have many member variables but not all of these are actually necessary for the
				963	user to see.
				964
				965	A filter will solve this issue by only letting the user see those member
				966	variables he cares about. Of course, the equivalent of a filter can be
				967	implemented easily using synthetic children, but a filter lets you get the job
				968	done without having to write Python code.
				969
				970	For instance, if your class Foobar has member variables named A thru Z, but you
				971	only need to see the ones named B, H and Q, you can define a filter:
				972
				973	::
				974
				975	(lldb) type filter add Foobar --child B --child H --child Q
				976	(lldb) frame variable a_foobar
				977	(Foobar) a_foobar = {
				978	(int) B = 1
				979	(char) H = 'H'
				980	(std::string) Q = "Hello world"
				981	}
				982
				983	Objective-C Dynamic Type Discovery
				984	----------------------------------
				985
				986	When doing Objective-C development, you may notice that some of your variables
				987	come out as of type id (for instance, items extracted from NSArray). By
				988	default, LLDB will not show you the real type of the object. it can actually
				989	dynamically discover the type of an Objective-C variable, much like the runtime
				990	itself does when invoking a selector. In order to be shown the result of that
				991	discovery that, however, a special option to frame variable or expression is
				992	required: ``--dynamic-type``.
				993
				994
				995	``--dynamic-type`` can have one of three values:
				996
				997	- ``no-dynamic-values``: the default, prevents dynamic type discovery
				998	- ``no-run-target``: enables dynamic type discovery as long as running code on
				999	the target is not required
				1000	- ``run-target``: enables code execution on the target in order to perform
				1001	dynamic type discovery
				1002
				1003	If you specify a value of either no-run-target or run-target, LLDB will detect
				1004	the dynamic type of your variables and show the appropriate formatters for
				1005	them. As an example:
				1006
				1007	::
				1008
				1009	(lldb) expr @"Hello"
				1010	(NSString *) $0 = 0x00000001048000b0 @"Hello"
				1011	(lldb) expr -d no-run @"Hello"
				1012	(__NSCFString *) $1 = 0x00000001048000b0 @"Hello"
				1013
				1014	Because LLDB uses a detection algorithm that does not need to invoke any
				1015	functions on the target process, no-run-target is enough for this to work.
				1016
				1017	As a side note, the summary for NSString shown in the example is built right
				1018	into LLDB. It was initially implemented through Python (the code is still
				1019	available for reference at CFString.py). However, this is out of sync with the
				1020	current implementation of the NSString formatter (which is a C++ function
				1021	compiled into the LLDB core).
				1022
				1023	Categories
				1024	----------
				1025
				1026	Categories are a way to group related formatters. For instance, LLDB itself
				1027	groups the formatters for the libstdc++ types in a category named
				1028	gnu-libstdc++. Basically, categories act like containers in which to store
				1029	formatters for a same library or OS release.
				1030
				1031	By default, several categories are created in LLDB:
				1032
				1033	- default: this is the category where every formatter ends up, unless another category is specified
				1034	- objc: formatters for basic and common Objective-C types that do not specifically depend on Mac OS X
				1035	- gnu-libstdc++: formatters for std::string, std::vector, std::list and std::map as implemented by libstdcpp
				1036	- libcxx: formatters for std::string, std::vector, std::list and std::map as implemented by libcxx
				1037	- system: truly basic types for which a formatter is required
				1038	- AppKit: Cocoa classes
				1039	- CoreFoundation: CF classes
				1040	- CoreGraphics: CG classes
				1041	- CoreServices: CS classes
				1042	- VectorTypes: compact display for several vector types
				1043
				1044	If you want to use a custom category for your formatters, all the type ... add
				1045	provide a --category (-w) option, that names the category to add the formatter
				1046	to. To delete the formatter, you then have to specify the correct category.
				1047
				1048	Categories can be in one of two states: enabled and disabled. A category is
				1049	initially disabled, and can be enabled using the type category enable command.
				1050	To disable an enabled category, the command to use is type category disable.
				1051
				1052	The order in which categories are enabled or disabled is significant, in that
				1053	LLDB uses that order when looking for formatters. Therefore, when you enable a
				1054	category, it becomes the second one to be searched (after default, which always
				1055	stays on top of the list). The default categories are enabled in such a way
				1056	that the search order is:
				1057
				1058	- default
				1059	- objc
				1060	- CoreFoundation
				1061	- AppKit
				1062	- CoreServices
				1063	- CoreGraphics
				1064	- gnu-libstdc++
				1065	- libcxx
				1066	- VectorTypes
				1067	- system
				1068
				1069	As said, gnu-libstdc++ and libcxx contain formatters for C++ STL data types.
				1070	system contains formatters for char* and char[], which reflect the behavior of
				1071	older versions of LLDB which had built-in formatters for these types. Because
				1072	now these are formatters, you can even replace them with your own if so you
				1073	wish.
				1074
				1075	There is no special command to create a category. When you place a formatter in
				1076	a category, if that category does not exist, it is automatically created. For
				1077	instance,
				1078
				1079	::
				1080
				1081	(lldb) type summary add Foobar --summary-string "a foobar" --category newcategory
				1082
				1083	automatically creates a (disabled) category named newcategory.
				1084
				1085	Another way to create a new (empty) category, is to enable it, as in:
				1086
				1087	::
				1088
				1089	(lldb) type category enable newcategory
				1090
				1091	However, in this case LLDB warns you that enabling an empty category has no
				1092	effect. If you add formatters to the category after enabling it, they will be
				1093	honored. But an empty category per se does not change the way any type is
				1094	displayed. The reason the debugger warns you is that enabling an empty category
				1095	might be a typo, and you effectively wanted to enable a similarly-named but
				1096	not-empty category.
				1097
				1098	Finding Formatters 101
				1099	----------------------
				1100
				1101	Searching for a formatter (including formats, starting in SVN rev r192217)
				1102	given a variable goes through a rather intricate set of rules. Namely, what
				1103	happens is that LLDB starts looking in each enabled category, according to the
				1104	order in which they were enabled (latest enabled first). In each category, LLDB
				1105	does the following:
				1106
				1107	- If there is a formatter for the type of the variable, use it
				1108	- If this object is a pointer, and there is a formatter for the pointee type
				1109	that does not skip pointers, use it
				1110	- If this object is a reference, and there is a formatter for the referred type
				1111	that does not skip references, use it
				1112	- If this object is an Objective-C class and dynamic types are enabled, look
				1113	for a formatter for the dynamic type of the object. If dynamic types are
				1114	disabled, or the search failed, look for a formatter for the declared type of
				1115	the object
				1116	- If this object's type is a typedef, go through typedef hierarchy (LLDB might
				1117	not be able to do this if the compiler has not emitted enough information. If
				1118	the required information to traverse typedef hierarchies is missing, type
				1119	cascading will not work. The clang compiler, part of the LLVM project, emits
				1120	the correct debugging information for LLDB to cascade). If at any level of
				1121	the hierarchy there is a valid formatter that can cascade, use it.
				1122	- If everything has failed, repeat the above search, looking for regular
				1123	expressions instead of exact matches
				1124
				1125	If any of those attempts returned a valid formatter to be used, that one is
				1126	used, and the search is terminated (without going to look in other categories).
				1127	If nothing was found in the current category, the next enabled category is
				1128	scanned according to the same algorithm. If there are no more enabled
				1129	categories, the search has failed.
				1130
				1131	Warning: previous versions of LLDB defined cascading to mean not only going
				1132	through typedef chains, but also through inheritance chains. This feature has
				1133	been removed since it significantly degrades performance. You need to set up
				1134	your formatters for every type in inheritance chains to which you want the
				1135	formatter to apply.