Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 1 | perf-record(1) |
Ingo Molnar | c1c2365 | 2009-05-30 12:38:51 +0200 | [diff] [blame] | 2 | ============== |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 3 | |
| 4 | NAME |
| 5 | ---- |
Ingo Molnar | 23ac9cb | 2009-05-27 09:33:18 +0200 | [diff] [blame] | 6 | perf-record - Run a command and record its profile into perf.data |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 7 | |
| 8 | SYNOPSIS |
| 9 | -------- |
| 10 | [verse] |
| 11 | 'perf record' [-e <EVENT> | --event=EVENT] [-l] [-a] <command> |
Mike Galbraith | 9e096753 | 2009-05-28 16:25:34 +0200 | [diff] [blame] | 12 | 'perf record' [-e <EVENT> | --event=EVENT] [-l] [-a] -- <command> [<options>] |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 13 | |
| 14 | DESCRIPTION |
| 15 | ----------- |
| 16 | This command runs a command and gathers a performance counter profile |
Ingo Molnar | 23ac9cb | 2009-05-27 09:33:18 +0200 | [diff] [blame] | 17 | from it, into perf.data - without displaying anything. |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 18 | |
| 19 | This file can then be inspected later on, using 'perf report'. |
| 20 | |
| 21 | |
| 22 | OPTIONS |
| 23 | ------- |
| 24 | <command>...:: |
| 25 | Any command you can specify in a shell. |
| 26 | |
| 27 | -e:: |
| 28 | --event=:: |
Frederic Weisbecker | 1b290d6 | 2009-11-23 15:42:35 +0100 | [diff] [blame] | 29 | Select the PMU event. Selection can be: |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 30 | |
Frederic Weisbecker | 1b290d6 | 2009-11-23 15:42:35 +0100 | [diff] [blame] | 31 | - a symbolic event name (use 'perf list' to list all events) |
| 32 | |
| 33 | - a raw PMU event (eventsel+umask) in the form of rNNN where NNN is a |
| 34 | hexadecimal event descriptor. |
| 35 | |
Cody P Schafer | f9ab9c1 | 2015-01-07 17:13:53 -0800 | [diff] [blame] | 36 | - a symbolically formed PMU event like 'pmu/param1=0x3,param2/' where |
| 37 | 'param1', 'param2', etc are defined as formats for the PMU in |
Adrian Hunter | a9e5700 | 2016-09-23 17:38:33 +0300 | [diff] [blame] | 38 | /sys/bus/event_source/devices/<pmu>/format/*. |
Cody P Schafer | f9ab9c1 | 2015-01-07 17:13:53 -0800 | [diff] [blame] | 39 | |
| 40 | - a symbolically formed event like 'pmu/config=M,config1=N,config3=K/' |
| 41 | |
| 42 | where M, N, K are numbers (in decimal, hex, octal format). Acceptable |
| 43 | values for each of 'config', 'config1' and 'config2' are defined by |
Adrian Hunter | a9e5700 | 2016-09-23 17:38:33 +0300 | [diff] [blame] | 44 | corresponding entries in /sys/bus/event_source/devices/<pmu>/format/* |
Cody P Schafer | f9ab9c1 | 2015-01-07 17:13:53 -0800 | [diff] [blame] | 45 | param1 and param2 are defined as formats for the PMU in: |
Adrian Hunter | a9e5700 | 2016-09-23 17:38:33 +0300 | [diff] [blame] | 46 | /sys/bus/event_source/devices/<pmu>/format/* |
Cody P Schafer | f9ab9c1 | 2015-01-07 17:13:53 -0800 | [diff] [blame] | 47 | |
Kan Liang | 3d5d68a | 2015-07-08 04:44:54 -0400 | [diff] [blame] | 48 | There are also some params which are not defined in .../<pmu>/format/*. |
Jiri Olsa | ee4c758 | 2015-07-29 05:42:11 -0400 | [diff] [blame] | 49 | These params can be used to overload default config values per event. |
Kan Liang | 3d5d68a | 2015-07-08 04:44:54 -0400 | [diff] [blame] | 50 | Here is a list of the params. |
| 51 | - 'period': Set event sampling period |
Namhyung Kim | 09af2a5 | 2015-08-09 15:45:23 +0900 | [diff] [blame] | 52 | - 'freq': Set event sampling frequency |
Kan Liang | 3206771 | 2015-08-04 04:30:19 -0400 | [diff] [blame] | 53 | - 'time': Disable/enable time stamping. Acceptable values are 1 for |
| 54 | enabling time stamping. 0 for disabling time stamping. |
| 55 | The default is 1. |
Kan Liang | d457c96 | 2015-08-11 06:30:47 -0400 | [diff] [blame] | 56 | - 'call-graph': Disable/enable callgraph. Acceptable str are "fp" for |
Kan Liang | f9db0d0 | 2015-08-11 06:30:48 -0400 | [diff] [blame] | 57 | FP mode, "dwarf" for DWARF mode, "lbr" for LBR mode and |
| 58 | "no" for disable callgraph. |
Kan Liang | d457c96 | 2015-08-11 06:30:47 -0400 | [diff] [blame] | 59 | - 'stack-size': user stack size for dwarf mode |
Kan Liang | 3d5d68a | 2015-07-08 04:44:54 -0400 | [diff] [blame] | 60 | Note: If user explicitly sets options which conflict with the params, |
| 61 | the value set by the params will be overridden. |
| 62 | |
Mathieu Poirier | dd60fba | 2016-09-06 10:37:15 -0600 | [diff] [blame] | 63 | Also not defined in .../<pmu>/format/* are PMU driver specific |
| 64 | configuration parameters. Any configuration parameter preceded by |
| 65 | the letter '@' is not interpreted in user space and sent down directly |
| 66 | to the PMU driver. For example: |
| 67 | |
| 68 | perf record -e some_event/@cfg1,@cfg2=config/ ... |
| 69 | |
| 70 | will see 'cfg1' and 'cfg2=config' pushed to the PMU driver associated |
| 71 | with the event for further processing. There is no restriction on |
| 72 | what the configuration parameters are, as long as their semantic is |
| 73 | understood and supported by the PMU driver. |
| 74 | |
Jacob Shin | 3741eb9 | 2014-05-29 17:26:51 +0200 | [diff] [blame] | 75 | - a hardware breakpoint event in the form of '\mem:addr[/len][:access]' |
Frederic Weisbecker | 1b290d6 | 2009-11-23 15:42:35 +0100 | [diff] [blame] | 76 | where addr is the address in memory you want to break in. |
| 77 | Access is the memory access type (read, write, execute) it can |
Jacob Shin | 3741eb9 | 2014-05-29 17:26:51 +0200 | [diff] [blame] | 78 | be passed as follows: '\mem:addr[:[r][w][x]]'. len is the range, |
| 79 | number of bytes from specified addr, which the breakpoint will cover. |
Frederic Weisbecker | 1b290d6 | 2009-11-23 15:42:35 +0100 | [diff] [blame] | 80 | If you want to profile read-write accesses in 0x1000, just set |
| 81 | 'mem:0x1000:rw'. |
Jacob Shin | 3741eb9 | 2014-05-29 17:26:51 +0200 | [diff] [blame] | 82 | If you want to profile write accesses in [0x1000~1008), just set |
| 83 | 'mem:0x1000/8:w'. |
Shawn Bohrer | 08dbd7e | 2010-11-30 19:57:16 -0600 | [diff] [blame] | 84 | |
Namhyung Kim | 9a75606 | 2015-03-02 12:13:33 +0900 | [diff] [blame] | 85 | - a group of events surrounded by a pair of brace ("{event1,event2,...}"). |
| 86 | Each event is separated by commas and the group should be quoted to |
| 87 | prevent the shell interpretation. You also need to use --group on |
| 88 | "perf report" to view group events together. |
| 89 | |
Shawn Bohrer | 08dbd7e | 2010-11-30 19:57:16 -0600 | [diff] [blame] | 90 | --filter=<filter>:: |
Wang Nan | 4ba1faa | 2015-07-10 07:36:10 +0000 | [diff] [blame] | 91 | Event filter. This option should follow a event selector (-e) which |
Adrian Hunter | 1b36c03 | 2016-09-23 17:38:39 +0300 | [diff] [blame] | 92 | selects either tracepoint event(s) or a hardware trace PMU |
| 93 | (e.g. Intel PT or CoreSight). |
| 94 | |
| 95 | - tracepoint filters |
| 96 | |
| 97 | In the case of tracepoints, multiple '--filter' options are combined |
Wang Nan | 4ba1faa | 2015-07-10 07:36:10 +0000 | [diff] [blame] | 98 | using '&&'. |
| 99 | |
Adrian Hunter | 1b36c03 | 2016-09-23 17:38:39 +0300 | [diff] [blame] | 100 | - address filters |
| 101 | |
| 102 | A hardware trace PMU advertises its ability to accept a number of |
| 103 | address filters by specifying a non-zero value in |
| 104 | /sys/bus/event_source/devices/<pmu>/nr_addr_filters. |
| 105 | |
| 106 | Address filters have the format: |
| 107 | |
| 108 | filter|start|stop|tracestop <start> [/ <size>] [@<file name>] |
| 109 | |
| 110 | Where: |
| 111 | - 'filter': defines a region that will be traced. |
| 112 | - 'start': defines an address at which tracing will begin. |
| 113 | - 'stop': defines an address at which tracing will stop. |
| 114 | - 'tracestop': defines a region in which tracing will stop. |
| 115 | |
| 116 | <file name> is the name of the object file, <start> is the offset to the |
| 117 | code to trace in that file, and <size> is the size of the region to |
| 118 | trace. 'start' and 'stop' filters need not specify a <size>. |
| 119 | |
| 120 | If no object file is specified then the kernel is assumed, in which case |
| 121 | the start address must be a current kernel memory address. |
| 122 | |
| 123 | <start> can also be specified by providing the name of a symbol. If the |
| 124 | symbol name is not unique, it can be disambiguated by inserting #n where |
| 125 | 'n' selects the n'th symbol in address order. Alternately #0, #g or #G |
| 126 | select only a global symbol. <size> can also be specified by providing |
| 127 | the name of a symbol, in which case the size is calculated to the end |
| 128 | of that symbol. For 'filter' and 'tracestop' filters, if <size> is |
| 129 | omitted and <start> is a symbol, then the size is calculated to the end |
| 130 | of that symbol. |
| 131 | |
| 132 | If <size> is omitted and <start> is '*', then the start and size will |
| 133 | be calculated from the first and last symbols, i.e. to trace the whole |
| 134 | file. |
| 135 | |
| 136 | If symbol names (or '*') are provided, they must be surrounded by white |
| 137 | space. |
| 138 | |
| 139 | The filter passed to the kernel is not necessarily the same as entered. |
| 140 | To see the filter that is passed, use the -v option. |
| 141 | |
| 142 | The kernel may not be able to configure a trace region if it is not |
| 143 | within a single mapping. MMAP events (or /proc/<pid>/maps) can be |
| 144 | examined to determine if that is a possibility. |
| 145 | |
| 146 | Multiple filters can be separated with space or comma. |
| 147 | |
Wang Nan | 4ba1faa | 2015-07-10 07:36:10 +0000 | [diff] [blame] | 148 | --exclude-perf:: |
| 149 | Don't record events issued by perf itself. This option should follow |
| 150 | a event selector (-e) which selects tracepoint event(s). It adds a |
| 151 | filter expression 'common_pid != $PERFPID' to filters. If other |
| 152 | '--filter' exists, the new filter expression will be combined with |
| 153 | them by '&&'. |
Shawn Bohrer | 08dbd7e | 2010-11-30 19:57:16 -0600 | [diff] [blame] | 154 | |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 155 | -a:: |
Shawn Bohrer | 08dbd7e | 2010-11-30 19:57:16 -0600 | [diff] [blame] | 156 | --all-cpus:: |
| 157 | System-wide collection from all CPUs. |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 158 | |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 159 | -p:: |
| 160 | --pid=:: |
David Ahern | b52956c | 2012-02-08 09:32:52 -0700 | [diff] [blame] | 161 | Record events on existing process ID (comma separated list). |
Shawn Bohrer | 08dbd7e | 2010-11-30 19:57:16 -0600 | [diff] [blame] | 162 | |
| 163 | -t:: |
| 164 | --tid=:: |
David Ahern | b52956c | 2012-02-08 09:32:52 -0700 | [diff] [blame] | 165 | Record events on existing thread ID (comma separated list). |
Adrian Hunter | 69e7e5b | 2013-11-18 11:55:57 +0200 | [diff] [blame] | 166 | This option also disables inheritance by default. Enable it by adding |
| 167 | --inherit. |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 168 | |
Arnaldo Carvalho de Melo | 0d37aa3 | 2012-01-19 14:08:15 -0200 | [diff] [blame] | 169 | -u:: |
| 170 | --uid=:: |
| 171 | Record events in threads owned by uid. Name or number. |
| 172 | |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 173 | -r:: |
| 174 | --realtime=:: |
| 175 | Collect data with this RT SCHED_FIFO priority. |
Jiri Olsa | 563aecb | 2013-06-05 13:35:06 +0200 | [diff] [blame] | 176 | |
Arnaldo Carvalho de Melo | 509051e | 2014-01-14 17:52:14 -0300 | [diff] [blame] | 177 | --no-buffering:: |
Kirill Smelkov | acac03f | 2011-01-12 17:59:36 +0300 | [diff] [blame] | 178 | Collect data without buffering. |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 179 | |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 180 | -c:: |
| 181 | --count=:: |
| 182 | Event period to sample. |
| 183 | |
| 184 | -o:: |
| 185 | --output=:: |
| 186 | Output file name. |
| 187 | |
| 188 | -i:: |
Stephane Eranian | 2e6cdf9 | 2010-05-12 10:40:01 +0200 | [diff] [blame] | 189 | --no-inherit:: |
| 190 | Child tasks do not inherit counters. |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 191 | -F:: |
| 192 | --freq=:: |
| 193 | Profile at this frequency. |
| 194 | |
| 195 | -m:: |
| 196 | --mmap-pages=:: |
Jiri Olsa | 27050f5 | 2013-09-01 12:36:13 +0200 | [diff] [blame] | 197 | Number of mmap data pages (must be a power of two) or size |
| 198 | specification with appended unit character - B/K/M/G. The |
| 199 | size is rounded up to have nearest pages power of two value. |
Adrian Hunter | e9db131 | 2015-04-09 18:53:46 +0300 | [diff] [blame] | 200 | Also, by adding a comma, the number of mmap pages for AUX |
| 201 | area tracing can be specified. |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 202 | |
Namhyung Kim | 9a75606 | 2015-03-02 12:13:33 +0900 | [diff] [blame] | 203 | --group:: |
| 204 | Put all events in a single event group. This precedes the --event |
| 205 | option and remains only for backward compatibility. See --event. |
| 206 | |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 207 | -g:: |
Jiri Olsa | 09b0fd4 | 2013-10-26 16:25:33 +0200 | [diff] [blame] | 208 | Enables call-graph (stack chain/backtrace) recording. |
| 209 | |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 210 | --call-graph:: |
Jiri Olsa | 09b0fd4 | 2013-10-26 16:25:33 +0200 | [diff] [blame] | 211 | Setup and enable call-graph (stack chain/backtrace) recording, |
Namhyung Kim | 76a2654 | 2015-10-22 23:28:32 +0900 | [diff] [blame] | 212 | implies -g. Default is "fp". |
Jiri Olsa | 09b0fd4 | 2013-10-26 16:25:33 +0200 | [diff] [blame] | 213 | |
| 214 | Allows specifying "fp" (frame pointer) or "dwarf" |
Kan Liang | aad2b21 | 2015-01-05 13:23:04 -0500 | [diff] [blame] | 215 | (DWARF's CFI - Call Frame Information) or "lbr" |
| 216 | (Hardware Last Branch Record facility) as the method to collect |
Jiri Olsa | 09b0fd4 | 2013-10-26 16:25:33 +0200 | [diff] [blame] | 217 | the information used to show the call graphs. |
| 218 | |
| 219 | In some systems, where binaries are build with gcc |
| 220 | --fomit-frame-pointer, using the "fp" method will produce bogus |
| 221 | call graphs, using "dwarf", if available (perf tools linked to |
Namhyung Kim | 76a2654 | 2015-10-22 23:28:32 +0900 | [diff] [blame] | 222 | the libunwind or libdw library) should be used instead. |
Kan Liang | aad2b21 | 2015-01-05 13:23:04 -0500 | [diff] [blame] | 223 | Using the "lbr" method doesn't require any compiler options. It |
| 224 | will produce call graphs from the hardware LBR registers. The |
| 225 | main limition is that it is only available on new Intel |
| 226 | platforms, such as Haswell. It can only get user call chain. It |
| 227 | doesn't work with branch stack sampling at the same time. |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 228 | |
Namhyung Kim | 76a2654 | 2015-10-22 23:28:32 +0900 | [diff] [blame] | 229 | When "dwarf" recording is used, perf also records (user) stack dump |
| 230 | when sampled. Default size of the stack dump is 8192 (bytes). |
| 231 | User can change the size by passing the size after comma like |
| 232 | "--call-graph dwarf,4096". |
| 233 | |
Arnaldo Carvalho de Melo | b44308f | 2010-10-26 15:20:09 -0200 | [diff] [blame] | 234 | -q:: |
| 235 | --quiet:: |
| 236 | Don't print any message, useful for scripting. |
| 237 | |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 238 | -v:: |
| 239 | --verbose:: |
| 240 | Be more verbose (show counter open errors, etc). |
| 241 | |
| 242 | -s:: |
| 243 | --stat:: |
Namhyung Kim | 1f91d5f | 2015-05-10 00:19:42 +0900 | [diff] [blame] | 244 | Record per-thread event counts. Use it with 'perf report -T' to see |
| 245 | the values. |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 246 | |
| 247 | -d:: |
| 248 | --data:: |
Peter Zijlstra | 5610032 | 2015-06-10 16:48:50 +0200 | [diff] [blame] | 249 | Record the sample addresses. |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 250 | |
Arnaldo Carvalho de Melo | 9c90a61 | 2010-12-02 10:25:28 -0200 | [diff] [blame] | 251 | -T:: |
| 252 | --timestamp:: |
Peter Zijlstra | 5610032 | 2015-06-10 16:48:50 +0200 | [diff] [blame] | 253 | Record the sample timestamps. Use it with 'perf report -D' to see the |
| 254 | timestamps, for instance. |
| 255 | |
| 256 | -P:: |
| 257 | --period:: |
| 258 | Record the sample period. |
Arnaldo Carvalho de Melo | 9c90a61 | 2010-12-02 10:25:28 -0200 | [diff] [blame] | 259 | |
Jiri Olsa | b6f35ed | 2016-08-01 20:02:35 +0200 | [diff] [blame] | 260 | --sample-cpu:: |
| 261 | Record the sample cpu. |
| 262 | |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 263 | -n:: |
| 264 | --no-samples:: |
| 265 | Don't sample. |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 266 | |
Frederic Weisbecker | ec7ba4e | 2009-08-31 03:32:03 +0200 | [diff] [blame] | 267 | -R:: |
| 268 | --raw-samples:: |
Frederic Weisbecker | bdef3b0 | 2010-04-14 20:05:17 +0200 | [diff] [blame] | 269 | Collect raw sample records from all opened counters (default for tracepoint counters). |
Frederic Weisbecker | ec7ba4e | 2009-08-31 03:32:03 +0200 | [diff] [blame] | 270 | |
Stephane Eranian | c45c6ea | 2010-05-28 12:00:01 +0200 | [diff] [blame] | 271 | -C:: |
| 272 | --cpu:: |
Shawn Bohrer | 08dbd7e | 2010-11-30 19:57:16 -0600 | [diff] [blame] | 273 | Collect samples only on the list of CPUs provided. Multiple CPUs can be provided as a |
| 274 | comma-separated list with no space: 0,1. Ranges of CPUs are specified with -: 0-2. |
Stephane Eranian | c45c6ea | 2010-05-28 12:00:01 +0200 | [diff] [blame] | 275 | In per-thread mode with inheritance mode on (default), samples are captured only when |
| 276 | the thread executes on the designated CPUs. Default is to monitor all CPUs. |
| 277 | |
Namhyung Kim | 7a29c08 | 2015-12-15 10:49:56 +0900 | [diff] [blame] | 278 | -B:: |
| 279 | --no-buildid:: |
| 280 | Do not save the build ids of binaries in the perf.data files. This skips |
| 281 | post processing after recording, which sometimes makes the final step in |
| 282 | the recording process to take a long time, as it needs to process all |
| 283 | events looking for mmap records. The downside is that it can misresolve |
| 284 | symbols if the workload binaries used when recording get locally rebuilt |
| 285 | or upgraded, because the only key available in this case is the |
| 286 | pathname. You can also set the "record.build-id" config variable to |
| 287 | 'skip to have this behaviour permanently. |
| 288 | |
Stephane Eranian | a1ac1d3 | 2010-06-17 11:39:01 +0200 | [diff] [blame] | 289 | -N:: |
| 290 | --no-buildid-cache:: |
Masanari Iida | 96355f2 | 2014-09-10 00:18:50 +0900 | [diff] [blame] | 291 | Do not update the buildid cache. This saves some overhead in situations |
Stephane Eranian | a1ac1d3 | 2010-06-17 11:39:01 +0200 | [diff] [blame] | 292 | where the information in the perf.data file (which includes buildids) |
Namhyung Kim | 7a29c08 | 2015-12-15 10:49:56 +0900 | [diff] [blame] | 293 | is sufficient. You can also set the "record.build-id" config variable to |
| 294 | 'no-cache' to have the same effect. |
Stephane Eranian | a1ac1d3 | 2010-06-17 11:39:01 +0200 | [diff] [blame] | 295 | |
Stephane Eranian | 023695d | 2011-02-14 11:20:01 +0200 | [diff] [blame] | 296 | -G name,...:: |
| 297 | --cgroup name,...:: |
| 298 | monitor only in the container (cgroup) called "name". This option is available only |
| 299 | in per-cpu mode. The cgroup filesystem must be mounted. All threads belonging to |
| 300 | container "name" are monitored when they run on the monitored CPUs. Multiple cgroups |
| 301 | can be provided. Each cgroup is applied to the corresponding event, i.e., first cgroup |
| 302 | to first event, second cgroup to second event and so on. It is possible to provide |
| 303 | an empty cgroup (monitor all the time) using, e.g., -G foo,,bar. Cgroups must have |
| 304 | corresponding events, i.e., they always refer to events defined earlier on the command |
| 305 | line. |
| 306 | |
Roberto Agostino Vitillo | bdfebd8 | 2012-02-09 23:21:02 +0100 | [diff] [blame] | 307 | -b:: |
Stephane Eranian | a5aabda | 2012-03-08 23:47:45 +0100 | [diff] [blame] | 308 | --branch-any:: |
| 309 | Enable taken branch stack sampling. Any type of taken branch may be sampled. |
| 310 | This is a shortcut for --branch-filter any. See --branch-filter for more infos. |
| 311 | |
| 312 | -j:: |
| 313 | --branch-filter:: |
Roberto Agostino Vitillo | bdfebd8 | 2012-02-09 23:21:02 +0100 | [diff] [blame] | 314 | Enable taken branch stack sampling. Each sample captures a series of consecutive |
| 315 | taken branches. The number of branches captured with each sample depends on the |
| 316 | underlying hardware, the type of branches of interest, and the executed code. |
| 317 | It is possible to select the types of branches captured by enabling filters. The |
| 318 | following filters are defined: |
| 319 | |
Stephane Eranian | a5aabda | 2012-03-08 23:47:45 +0100 | [diff] [blame] | 320 | - any: any type of branches |
Roberto Agostino Vitillo | bdfebd8 | 2012-02-09 23:21:02 +0100 | [diff] [blame] | 321 | - any_call: any function call or system call |
| 322 | - any_ret: any function return or system call return |
Anshuman Khandual | 2e49a94 | 2012-05-18 14:16:50 +0530 | [diff] [blame] | 323 | - ind_call: any indirect branch |
Stephane Eranian | 43e41ad | 2015-10-13 09:09:11 +0200 | [diff] [blame] | 324 | - call: direct calls, including far (to/from kernel) calls |
Roberto Agostino Vitillo | bdfebd8 | 2012-02-09 23:21:02 +0100 | [diff] [blame] | 325 | - u: only when the branch target is at the user level |
| 326 | - k: only when the branch target is in the kernel |
| 327 | - hv: only when the target is at the hypervisor level |
Andi Kleen | 0126d49 | 2013-09-20 07:40:42 -0700 | [diff] [blame] | 328 | - in_tx: only when the target is in a hardware transaction |
| 329 | - no_tx: only when the target is not in a hardware transaction |
| 330 | - abort_tx: only when the target is a hardware transaction abort |
Anshuman Khandual | 3e39db4 | 2014-05-22 12:50:10 +0530 | [diff] [blame] | 331 | - cond: conditional branches |
Roberto Agostino Vitillo | bdfebd8 | 2012-02-09 23:21:02 +0100 | [diff] [blame] | 332 | |
| 333 | + |
Anshuman Khandual | 3e39db4 | 2014-05-22 12:50:10 +0530 | [diff] [blame] | 334 | The option requires at least one branch type among any, any_call, any_ret, ind_call, cond. |
Masanari Iida | 9c76820 | 2012-11-30 14:10:25 +0900 | [diff] [blame] | 335 | The privilege levels may be omitted, in which case, the privilege levels of the associated |
Stephane Eranian | a5aabda | 2012-03-08 23:47:45 +0100 | [diff] [blame] | 336 | event are applied to the branch filter. Both kernel (k) and hypervisor (hv) privilege |
| 337 | levels are subject to permissions. When sampling on multiple events, branch stack sampling |
| 338 | is enabled for all the sampling events. The sampled branch type is the same for all events. |
| 339 | The various filters must be specified as a comma separated list: --branch-filter any_ret,u,k |
| 340 | Note that this feature may not be available on all processors. |
Roberto Agostino Vitillo | bdfebd8 | 2012-02-09 23:21:02 +0100 | [diff] [blame] | 341 | |
Andi Kleen | 0548429 | 2013-01-24 16:10:29 +0100 | [diff] [blame] | 342 | --weight:: |
| 343 | Enable weightened sampling. An additional weight is recorded per sample and can be |
| 344 | displayed with the weight and local_weight sort keys. This currently works for TSX |
| 345 | abort events and some memory events in precise mode on modern Intel CPUs. |
| 346 | |
Andi Kleen | 475eeab | 2013-09-20 07:40:43 -0700 | [diff] [blame] | 347 | --transaction:: |
| 348 | Record transaction flags for transaction related events. |
| 349 | |
Adrian Hunter | 3aa5939 | 2013-11-15 15:52:29 +0200 | [diff] [blame] | 350 | --per-thread:: |
| 351 | Use per-thread mmaps. By default per-cpu mmaps are created. This option |
| 352 | overrides that and uses per-thread mmaps. A side-effect of that is that |
| 353 | inheritance is automatically disabled. --per-thread is ignored with a warning |
| 354 | if combined with -a or -C options. |
Adrian Hunter | 539e6bb | 2013-11-01 15:51:34 +0200 | [diff] [blame] | 355 | |
Arnaldo Carvalho de Melo | a6205a3 | 2014-01-14 17:58:12 -0300 | [diff] [blame] | 356 | -D:: |
| 357 | --delay=:: |
Andi Kleen | 6619a53 | 2014-01-11 13:38:27 -0800 | [diff] [blame] | 358 | After starting the program, wait msecs before measuring. This is useful to |
| 359 | filter out the startup phase of the program, which is often very different. |
| 360 | |
Stephane Eranian | 4b6c517 | 2014-09-24 13:48:41 +0200 | [diff] [blame] | 361 | -I:: |
| 362 | --intr-regs:: |
| 363 | Capture machine state (registers) at interrupt, i.e., on counter overflows for |
| 364 | each sample. List of captured registers depends on the architecture. This option |
Stephane Eranian | bcc84ec | 2015-08-31 18:41:12 +0200 | [diff] [blame] | 365 | is off by default. It is possible to select the registers to sample using their |
| 366 | symbolic names, e.g. on x86, ax, si. To list the available registers use |
| 367 | --intr-regs=\?. To name registers, pass a comma separated list such as |
| 368 | --intr-regs=ax,bx. The list of register is architecture dependent. |
| 369 | |
Stephane Eranian | 4b6c517 | 2014-09-24 13:48:41 +0200 | [diff] [blame] | 370 | |
Andi Kleen | 85c273d | 2015-02-24 15:13:40 -0800 | [diff] [blame] | 371 | --running-time:: |
| 372 | Record running and enabled time for read events (:S) |
| 373 | |
Peter Zijlstra | 814c8c3 | 2015-03-31 00:19:31 +0200 | [diff] [blame] | 374 | -k:: |
| 375 | --clockid:: |
| 376 | Sets the clock id to use for the various time fields in the perf_event_type |
| 377 | records. See clock_gettime(). In particular CLOCK_MONOTONIC and |
| 378 | CLOCK_MONOTONIC_RAW are supported, some events might also allow |
| 379 | CLOCK_BOOTTIME, CLOCK_REALTIME and CLOCK_TAI. |
| 380 | |
Adrian Hunter | 2dd6d8a | 2015-04-30 17:37:32 +0300 | [diff] [blame] | 381 | -S:: |
| 382 | --snapshot:: |
| 383 | Select AUX area tracing Snapshot Mode. This option is valid only with an |
| 384 | AUX area tracing event. Optionally the number of bytes to capture per |
| 385 | snapshot can be specified. In Snapshot Mode, trace data is captured only when |
| 386 | signal SIGUSR2 is received. |
| 387 | |
Kan Liang | 9d9cad7 | 2015-06-17 09:51:11 -0400 | [diff] [blame] | 388 | --proc-map-timeout:: |
| 389 | When processing pre-existing threads /proc/XXX/mmap, it may take a long time, |
| 390 | because the file may be huge. A time out is needed in such cases. |
| 391 | This option sets the time out limit. The default value is 500 ms. |
| 392 | |
Adrian Hunter | b757bb0 | 2015-07-21 12:44:04 +0300 | [diff] [blame] | 393 | --switch-events:: |
| 394 | Record context switch events i.e. events of type PERF_RECORD_SWITCH or |
| 395 | PERF_RECORD_SWITCH_CPU_WIDE. |
| 396 | |
He Kuang | 7efe0e0 | 2015-12-14 10:39:23 +0000 | [diff] [blame] | 397 | --clang-path=PATH:: |
Wang Nan | 71dc2326 | 2015-10-14 12:41:19 +0000 | [diff] [blame] | 398 | Path to clang binary to use for compiling BPF scriptlets. |
He Kuang | 7efe0e0 | 2015-12-14 10:39:23 +0000 | [diff] [blame] | 399 | (enabled when BPF support is on) |
Wang Nan | 71dc2326 | 2015-10-14 12:41:19 +0000 | [diff] [blame] | 400 | |
He Kuang | 7efe0e0 | 2015-12-14 10:39:23 +0000 | [diff] [blame] | 401 | --clang-opt=OPTIONS:: |
Wang Nan | 71dc2326 | 2015-10-14 12:41:19 +0000 | [diff] [blame] | 402 | Options passed to clang when compiling BPF scriptlets. |
He Kuang | 7efe0e0 | 2015-12-14 10:39:23 +0000 | [diff] [blame] | 403 | (enabled when BPF support is on) |
| 404 | |
| 405 | --vmlinux=PATH:: |
| 406 | Specify vmlinux path which has debuginfo. |
| 407 | (enabled when BPF prologue is on) |
Wang Nan | 71dc2326 | 2015-10-14 12:41:19 +0000 | [diff] [blame] | 408 | |
Namhyung Kim | 6156681 | 2016-01-11 22:37:09 +0900 | [diff] [blame] | 409 | --buildid-all:: |
| 410 | Record build-id of all DSOs regardless whether it's actually hit or not. |
| 411 | |
Jiri Olsa | 8572388 | 2016-02-15 09:34:31 +0100 | [diff] [blame] | 412 | --all-kernel:: |
| 413 | Configure all used events to run in kernel space. |
| 414 | |
| 415 | --all-user:: |
| 416 | Configure all used events to run in user space. |
| 417 | |
Wang Nan | eca857a | 2016-04-20 18:59:51 +0000 | [diff] [blame] | 418 | --timestamp-filename |
| 419 | Append timestamp to output file name. |
| 420 | |
Wang Nan | 3c1cb7e | 2016-04-20 18:59:50 +0000 | [diff] [blame] | 421 | --switch-output:: |
| 422 | Generate multiple perf.data files, timestamp prefixed, switching to a new one |
| 423 | when receiving a SIGUSR2. |
| 424 | |
| 425 | A possible use case is to, given an external event, slice the perf.data file |
| 426 | that gets then processed, possibly via a perf script, to decide if that |
| 427 | particular perf.data snapshot should be kept or not. |
| 428 | |
Wang Nan | 0c1d46a | 2016-04-20 18:59:52 +0000 | [diff] [blame] | 429 | Implies --timestamp-filename, --no-buildid and --no-buildid-cache. |
Wang Nan | eca857a | 2016-04-20 18:59:51 +0000 | [diff] [blame] | 430 | |
Wang Nan | 0aab213 | 2016-06-16 08:02:41 +0000 | [diff] [blame] | 431 | --dry-run:: |
| 432 | Parse options then exit. --dry-run can be used to detect errors in cmdline |
| 433 | options. |
| 434 | |
| 435 | 'perf record --dry-run -e' can act as a BPF script compiler if llvm.dump-obj |
| 436 | in config file is set to true. |
| 437 | |
Wang Nan | 4ea648a | 2016-07-14 08:34:47 +0000 | [diff] [blame] | 438 | --tail-synthesize:: |
| 439 | Instead of collecting non-sample events (for example, fork, comm, mmap) at |
| 440 | the beginning of record, collect them during finalizing an output file. |
| 441 | The collected non-sample events reflects the status of the system when |
| 442 | record is finished. |
| 443 | |
Wang Nan | 626a6b7 | 2016-07-14 08:34:45 +0000 | [diff] [blame] | 444 | --overwrite:: |
| 445 | Makes all events use an overwritable ring buffer. An overwritable ring |
| 446 | buffer works like a flight recorder: when it gets full, the kernel will |
| 447 | overwrite the oldest records, that thus will never make it to the |
| 448 | perf.data file. |
| 449 | |
| 450 | When '--overwrite' and '--switch-output' are used perf records and drops |
| 451 | events until it receives a signal, meaning that something unusual was |
| 452 | detected that warrants taking a snapshot of the most current events, |
| 453 | those fitting in the ring buffer at that moment. |
| 454 | |
| 455 | 'overwrite' attribute can also be set or canceled for an event using |
| 456 | config terms. For example: 'cycles/overwrite/' and 'instructions/no-overwrite/'. |
| 457 | |
Wang Nan | 4ea648a | 2016-07-14 08:34:47 +0000 | [diff] [blame] | 458 | Implies --tail-synthesize. |
| 459 | |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 460 | SEE ALSO |
| 461 | -------- |
Thomas Gleixner | 386b05e | 2009-06-06 14:56:33 +0200 | [diff] [blame] | 462 | linkperf:perf-stat[1], linkperf:perf-list[1] |