=========================================
Uprobe-tracer: Uprobe-based Event Tracing
=========================================

:Author: Srikar Dronamraju


Overview
--------
Uprobe-based trace events are similar to kprobe-based trace events.
To enable this feature, build your kernel with CONFIG_UPROBE_EVENTS=y.

As with the kprobe-event tracer, this does not need to be activated via
current_tracer. Instead, add probe points via
/sys/kernel/debug/tracing/uprobe_events, and enable them via
/sys/kernel/debug/tracing/events/uprobes/<EVENT>/enable.

However, unlike the kprobe-event tracer, the uprobe event interface expects
the user to calculate the offset of the probe point in the object.

Synopsis of uprobe_tracer
-------------------------
::

  p[:[GRP/]EVENT] PATH:OFFSET [FETCHARGS] : Set a uprobe
  r[:[GRP/]EVENT] PATH:OFFSET [FETCHARGS] : Set a return uprobe (uretprobe)
  -:[GRP/]EVENT                           : Clear uprobe or uretprobe event

  GRP           : Group name. If omitted, "uprobes" is the default value.
  EVENT         : Event name. If omitted, the event name is generated based
                  on PATH+OFFSET.
  PATH          : Path to an executable or a library.
  OFFSET        : Offset where the probe is inserted.

  FETCHARGS     : Arguments. Each probe can have up to 128 args.
   %REG         : Fetch register REG
   @ADDR        : Fetch memory at ADDR (ADDR should be in userspace)
   @+OFFSET     : Fetch memory at OFFSET (OFFSET from same file as PATH)
   $stackN      : Fetch Nth entry of stack (N >= 0)
   $stack       : Fetch stack address.
   $retval      : Fetch return value.(*)
   $comm        : Fetch current task comm.
   +|-offs(FETCHARG) : Fetch memory at FETCHARG +|- offs address.(**)
   NAME=FETCHARG     : Set NAME as the argument name of FETCHARG.
   FETCHARG:TYPE     : Set TYPE as the type of FETCHARG. Currently, basic types
                       (u8/u16/u32/u64/s8/s16/s32/s64), hexadecimal types
                       (x8/x16/x32/x64), "string" and bitfield are supported.

  (*) only for return probe.
  (**) this is useful for fetching a field of data structures.

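As a rough illustration of the synopsis above, the following shell sketch
assembles a probe definition from its parts and prints it instead of writing
it to uprobe_events. The helper name ``uprobe_def`` and the argument names
``ip``, ``ax`` and ``comm`` are hypothetical, chosen only for this example.

```shell
# Hypothetical helper: print a uprobe definition line (nothing is written
# to tracefs).
# usage: uprobe_def KIND GROUP EVENT PATH OFFSET [FETCHARGS...]
uprobe_def() {
    kind=$1 group=$2 event=$3 path=$4 offset=$5
    shift 5
    # "${*:+ $*}" appends the remaining fetch-args, space separated,
    # and expands to nothing when there are none.
    printf '%s:%s/%s %s:%s%s\n' "$kind" "$group" "$event" "$path" "$offset" "${*:+ $*}"
}

# Name each fetch-arg (NAME=FETCHARG) and give %ax an explicit type.
uprobe_def p uprobes zfree_entry /bin/zsh 0x46420 'ip=%ip' 'ax=%ax:x64' 'comm=$comm'
```

The printed line could then be appended to uprobe_events with ``>>``.
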
Types
-----
Several types are supported for fetch-args. The uprobe tracer accesses memory
according to the given type. The 's' and 'u' prefixes mean the type is signed
or unsigned, respectively. The 'x' prefix implies that it is unsigned. Traced
arguments are shown in decimal ('s' and 'u') or hexadecimal ('x'). Without an
explicit type, 'x32' or 'x64' is used depending on the architecture (e.g.
x86-32 uses x32, and x86-64 uses x64).
The string type is a special type, which fetches a "null-terminated" string
from user space.
Bitfield is another special type, which takes 3 parameters: bit-width,
bit-offset, and container-size (usually 32). The syntax is::

 b<bit-width>@<bit-offset>/<container-size>

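To make the bitfield parameters concrete, the shell arithmetic below mimics
what ``b4@8/32`` would select from a 32-bit container: 4 bits starting at
bit 8. This is only a sketch. It assumes the bit-offset is counted from the
least significant bit, and both the helper name ``bitfield`` and the sample
value 0x12345678 are made up for illustration.

```shell
# Hypothetical helper: extract WIDTH bits starting at bit OFFSET from VALUE,
# mirroring b<bit-width>@<bit-offset>/<container-size>.
# Assumption: OFFSET counts from the least significant bit.
bitfield() {
    width=$1 offset=$2 value=$3
    echo $(( ($value >> $offset) & ((1 << $width) - 1) ))
}

# b4@8/32 applied to the made-up container value 0x12345678
bitfield 4 8 0x12345678
```
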
For $comm, the default type is "string"; any other type is invalid.


Event Profiling
---------------
You can check the total number of probe hits and probe miss-hits via
/sys/kernel/debug/tracing/uprobe_profile.
The first column is event name, the second is the number of probe hits,
the third is the number of probe miss-hits.

Usage examples
--------------
 * To add a probe as a new uprobe event, write a new definition to
   uprobe_events as below (sets a uprobe at an offset of 0x4245c0 in the
   executable /bin/bash)::

      echo 'p /bin/bash:0x4245c0' > /sys/kernel/debug/tracing/uprobe_events

 * Add a probe as a new uretprobe event::

      echo 'r /bin/bash:0x4245c0' > /sys/kernel/debug/tracing/uprobe_events

 * Unset a registered event::

      echo '-:p_bash_0x4245c0' >> /sys/kernel/debug/tracing/uprobe_events

 * Print out the events that are registered::

      cat /sys/kernel/debug/tracing/uprobe_events

 * Clear all events::

      echo > /sys/kernel/debug/tracing/uprobe_events

The following example shows how to dump the instruction pointer and the %ax
register at the probed text address. Probe the zfree function in /bin/zsh::

    # cd /sys/kernel/debug/tracing/
    # cat /proc/`pgrep zsh`/maps | grep /bin/zsh | grep r-xp
    00400000-0048a000 r-xp 00000000 08:03 130904 /bin/zsh
    # objdump -T /bin/zsh | grep -w zfree
    0000000000446420 g    DF .text  0000000000000012  Base        zfree

0x46420 is the offset of zfree in the object /bin/zsh, which is loaded at
0x00400000 (0x446420 - 0x400000). Hence the command to set the uprobe would
be::

    # echo 'p:zfree_entry /bin/zsh:0x46420 %ip %ax' > uprobe_events

And the same for the uretprobe would be::

    # echo 'r:zfree_exit /bin/zsh:0x46420 %ip %ax' >> uprobe_events

.. note:: The user has to explicitly calculate the offset of the probe point
	in the object.

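The offset calculation for zfree can be sketched in shell. The helper name
``uprobe_offset`` is hypothetical, and the formula (symbol address minus the
mapping's start address, plus the mapping's file offset) is an assumption
that fits a non-relocated executable like the one in this example, where the
text segment is mapped from file offset 0.

```shell
# Hypothetical helper: convert a symbol's virtual address into a file offset.
# usage: uprobe_offset SYMBOL_ADDR MAP_START MAP_FILE_OFFSET
# Assumption: the object is not relocated at load time (non-PIE executable).
uprobe_offset() {
    printf '0x%x\n' $(( $1 - $2 + $3 ))
}

# Values taken from the objdump and /proc/<pid>/maps output of the zsh example.
uprobe_offset 0x446420 0x00400000 0x00000000
```
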
We can see the events that are registered by looking at the uprobe_events file.
::

    # cat uprobe_events
    p:uprobes/zfree_entry /bin/zsh:0x00046420 arg1=%ip arg2=%ax
    r:uprobes/zfree_exit /bin/zsh:0x00046420 arg1=%ip arg2=%ax

Format of events can be seen by viewing the file events/uprobes/zfree_entry/format.
::

    # cat events/uprobes/zfree_entry/format
    name: zfree_entry
    ID: 922
    format:
         field:unsigned short common_type;         offset:0;  size:2; signed:0;
         field:unsigned char common_flags;         offset:2;  size:1; signed:0;
         field:unsigned char common_preempt_count; offset:3;  size:1; signed:0;
         field:int common_pid;                     offset:4;  size:4; signed:1;
         field:int common_padding;                 offset:8;  size:4; signed:1;

         field:unsigned long __probe_ip;           offset:12; size:4; signed:0;
         field:u32 arg1;                           offset:16; size:4; signed:0;
         field:u32 arg2;                           offset:20; size:4; signed:0;

    print fmt: "(%lx) arg1=%lx arg2=%lx", REC->__probe_ip, REC->arg1, REC->arg2

Right after definition, each event is disabled by default. To trace these
events, you need to enable them with::

    # echo 1 > events/uprobes/enable

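The command above enables every event in the uprobes group. To toggle a
single event, the per-event file events/uprobes/<EVENT>/enable can be written
instead. A minimal sketch, assuming tracefs is reachable at the path used
throughout this document; the ``TRACEFS`` variable and the helper name
``uprobe_event_set`` are hypothetical:

```shell
# Hypothetical helper: enable (1) or disable (0) one uprobe event.
# TRACEFS is an assumed variable; it defaults to the path used in this document.
# usage: uprobe_event_set EVENT 0|1
uprobe_event_set() {
    event=$1 state=$2
    echo "$state" > "${TRACEFS:-/sys/kernel/debug/tracing}/events/uprobes/$event/enable"
}
```

For instance, ``uprobe_event_set zfree_entry 1`` would enable only the entry
probe, leaving zfree_exit disabled.
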
Let's disable the events after sleeping for some time.
::

    # sleep 20
    # echo 0 > events/uprobes/enable

And you can see the traced information via /sys/kernel/debug/tracing/trace.
::

    # cat trace
    # tracer: nop
    #
    #           TASK-PID    CPU#    TIMESTAMP  FUNCTION
    #              | |       |          |         |
                 zsh-24842 [006] 258544.995456: zfree_entry: (0x446420) arg1=446420 arg2=79
                 zsh-24842 [007] 258545.000270: zfree_exit: (0x446540 <- 0x446420) arg1=446540 arg2=0
                 zsh-24842 [002] 258545.043929: zfree_entry: (0x446420) arg1=446420 arg2=79
                 zsh-24842 [004] 258547.046129: zfree_exit: (0x446540 <- 0x446420) arg1=446540 arg2=0

The output shows that the uprobe was triggered for pid 24842 with the ip at
0x446420 and the ax register containing 79. The uretprobe was triggered with
the ip at 0x446540 and the counterpart function entry at 0x446420.
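
For longer traces, it can help to summarize the output. The sketch below
counts hits per event name with awk. It assumes the line layout of the zsh
trace shown in this section, where the fourth whitespace-separated field is
the event name followed by a colon, and it feeds the sample lines in through
a here-document rather than reading the live trace file. The function name
``count_events`` is hypothetical.

```shell
# Hypothetical helper: count trace lines per event name (reads stdin).
# Assumption: field 4 is "<event>:" as in the sample trace of this document.
count_events() {
    awk '
        /^#/ || NF < 4 { next }            # skip header comments and short lines
        { sub(/:$/, "", $4); hits[$4]++ }  # strip the trailing colon, then count
        END { for (name in hits) print name, hits[name] }
    ' | sort
}

# Sample lines copied from the zsh trace output.
count_events <<'EOF'
# tracer: nop
             zsh-24842 [006] 258544.995456: zfree_entry: (0x446420) arg1=446420 arg2=79
             zsh-24842 [007] 258545.000270: zfree_exit: (0x446540 <- 0x446420) arg1=446540 arg2=0
             zsh-24842 [002] 258545.043929: zfree_entry: (0x446420) arg1=446420 arg2=79
             zsh-24842 [004] 258547.046129: zfree_exit: (0x446540 <- 0x446420) arg1=446540 arg2=0
EOF
```

With the sample input this prints ``zfree_entry 2`` and ``zfree_exit 2``, one
per line.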