Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 1 | perf-record(1) |
Ingo Molnar | c1c2365 | 2009-05-30 12:38:51 +0200 | [diff] [blame] | 2 | ============== |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 3 | |
| 4 | NAME |
| 5 | ---- |
Ingo Molnar | 23ac9cb | 2009-05-27 09:33:18 +0200 | [diff] [blame] | 6 | perf-record - Run a command and record its profile into perf.data |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 7 | |
| 8 | SYNOPSIS |
| 9 | -------- |
| 10 | [verse] |
| 11 | 'perf record' [-e <EVENT> | --event=EVENT] [-l] [-a] <command> |
Mike Galbraith | 9e096753 | 2009-05-28 16:25:34 +0200 | [diff] [blame] | 12 | 'perf record' [-e <EVENT> | --event=EVENT] [-l] [-a] -- <command> [<options>] |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 13 | |
| 14 | DESCRIPTION |
| 15 | ----------- |
| 16 | This command runs a command and gathers a performance counter profile |
Ingo Molnar | 23ac9cb | 2009-05-27 09:33:18 +0200 | [diff] [blame] | 17 | from it, into perf.data - without displaying anything. |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 18 | |
| 19 | This file can then be inspected later on, using 'perf report'. |
| 20 | |
| 21 | |
| 22 | OPTIONS |
| 23 | ------- |
| 24 | <command>...:: |
| 25 | Any command you can specify in a shell. |
| 26 | |
| 27 | -e:: |
| 28 | --event=:: |
Frederic Weisbecker | 1b290d6 | 2009-11-23 15:42:35 +0100 | [diff] [blame] | 29 | Select the PMU event. Selection can be: |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 30 | |
Frederic Weisbecker | 1b290d6 | 2009-11-23 15:42:35 +0100 | [diff] [blame] | 31 | - a symbolic event name (use 'perf list' to list all events) |
| 32 | |
| 33 | - a raw PMU event (eventsel+umask) in the form of rNNN where NNN is a |
| 34 | hexadecimal event descriptor. |
| 35 | |
Cody P Schafer | f9ab9c1 | 2015-01-07 17:13:53 -0800 | [diff] [blame] | 36 | - a symbolically formed PMU event like 'pmu/param1=0x3,param2/' where |
| 37 | 'param1', 'param2', etc are defined as formats for the PMU in |
| 38 | /sys/bus/event_sources/devices/<pmu>/format/*. |
| 39 | |
| 40 | - a symbolically formed event like 'pmu/config=M,config1=N,config3=K/' |
| 41 | |
| 42 | where M, N, K are numbers (in decimal, hex, octal format). Acceptable |
| 43 | values for each of 'config', 'config1' and 'config2' are defined by |
| 44 | corresponding entries in /sys/bus/event_sources/devices/<pmu>/format/* |
| 45 | param1 and param2 are defined as formats for the PMU in: |
| 46 | /sys/bus/event_sources/devices/<pmu>/format/* |
| 47 | |
Kan Liang | 3d5d68a | 2015-07-08 04:44:54 -0400 | [diff] [blame] | 48 | There are also some params which are not defined in .../<pmu>/format/*. |
Jiri Olsa | ee4c758 | 2015-07-29 05:42:11 -0400 | [diff] [blame] | 49 | These params can be used to overload default config values per event. |
Kan Liang | 3d5d68a | 2015-07-08 04:44:54 -0400 | [diff] [blame] | 50 | Here is a list of the params. |
| 51 | - 'period': Set event sampling period |
Namhyung Kim | 09af2a5 | 2015-08-09 15:45:23 +0900 | [diff] [blame] | 52 | - 'freq': Set event sampling frequency |
Kan Liang | 3206771 | 2015-08-04 04:30:19 -0400 | [diff] [blame] | 53 | - 'time': Disable/enable time stamping. Acceptable values are 1 for |
| 54 | enabling time stamping. 0 for disabling time stamping. |
| 55 | The default is 1. |
Kan Liang | d457c96 | 2015-08-11 06:30:47 -0400 | [diff] [blame] | 56 | - 'call-graph': Disable/enable callgraph. Acceptable str are "fp" for |
Kan Liang | f9db0d0 | 2015-08-11 06:30:48 -0400 | [diff] [blame] | 57 | FP mode, "dwarf" for DWARF mode, "lbr" for LBR mode and |
| 58 | "no" for disable callgraph. |
Kan Liang | d457c96 | 2015-08-11 06:30:47 -0400 | [diff] [blame] | 59 | - 'stack-size': user stack size for dwarf mode |
Kan Liang | 3d5d68a | 2015-07-08 04:44:54 -0400 | [diff] [blame] | 60 | Note: If user explicitly sets options which conflict with the params, |
| 61 | the value set by the params will be overridden. |
| 62 | |
Mathieu Poirier | dd60fba | 2016-09-06 10:37:15 -0600 | [diff] [blame] | 63 | Also not defined in .../<pmu>/format/* are PMU driver specific |
| 64 | configuration parameters. Any configuration parameter preceded by |
| 65 | the letter '@' is not interpreted in user space and sent down directly |
| 66 | to the PMU driver. For example: |
| 67 | |
| 68 | perf record -e some_event/@cfg1,@cfg2=config/ ... |
| 69 | |
| 70 | will see 'cfg1' and 'cfg2=config' pushed to the PMU driver associated |
| 71 | with the event for further processing. There is no restriction on |
| 72 | what the configuration parameters are, as long as their semantic is |
| 73 | understood and supported by the PMU driver. |
| 74 | |
Jacob Shin | 3741eb9 | 2014-05-29 17:26:51 +0200 | [diff] [blame] | 75 | - a hardware breakpoint event in the form of '\mem:addr[/len][:access]' |
Frederic Weisbecker | 1b290d6 | 2009-11-23 15:42:35 +0100 | [diff] [blame] | 76 | where addr is the address in memory you want to break in. |
| 77 | Access is the memory access type (read, write, execute) it can |
Jacob Shin | 3741eb9 | 2014-05-29 17:26:51 +0200 | [diff] [blame] | 78 | be passed as follows: '\mem:addr[:[r][w][x]]'. len is the range, |
| 79 | number of bytes from specified addr, which the breakpoint will cover. |
Frederic Weisbecker | 1b290d6 | 2009-11-23 15:42:35 +0100 | [diff] [blame] | 80 | If you want to profile read-write accesses in 0x1000, just set |
| 81 | 'mem:0x1000:rw'. |
Jacob Shin | 3741eb9 | 2014-05-29 17:26:51 +0200 | [diff] [blame] | 82 | If you want to profile write accesses in [0x1000~1008), just set |
| 83 | 'mem:0x1000/8:w'. |
Shawn Bohrer | 08dbd7e | 2010-11-30 19:57:16 -0600 | [diff] [blame] | 84 | |
Namhyung Kim | 9a75606 | 2015-03-02 12:13:33 +0900 | [diff] [blame] | 85 | - a group of events surrounded by a pair of brace ("{event1,event2,...}"). |
| 86 | Each event is separated by commas and the group should be quoted to |
| 87 | prevent the shell interpretation. You also need to use --group on |
| 88 | "perf report" to view group events together. |
| 89 | |
Shawn Bohrer | 08dbd7e | 2010-11-30 19:57:16 -0600 | [diff] [blame] | 90 | --filter=<filter>:: |
Wang Nan | 4ba1faa | 2015-07-10 07:36:10 +0000 | [diff] [blame] | 91 | Event filter. This option should follow a event selector (-e) which |
| 92 | selects tracepoint event(s). Multiple '--filter' options are combined |
| 93 | using '&&'. |
| 94 | |
| 95 | --exclude-perf:: |
| 96 | Don't record events issued by perf itself. This option should follow |
| 97 | a event selector (-e) which selects tracepoint event(s). It adds a |
| 98 | filter expression 'common_pid != $PERFPID' to filters. If other |
| 99 | '--filter' exists, the new filter expression will be combined with |
| 100 | them by '&&'. |
Shawn Bohrer | 08dbd7e | 2010-11-30 19:57:16 -0600 | [diff] [blame] | 101 | |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 102 | -a:: |
Shawn Bohrer | 08dbd7e | 2010-11-30 19:57:16 -0600 | [diff] [blame] | 103 | --all-cpus:: |
| 104 | System-wide collection from all CPUs. |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 105 | |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 106 | -p:: |
| 107 | --pid=:: |
David Ahern | b52956c | 2012-02-08 09:32:52 -0700 | [diff] [blame] | 108 | Record events on existing process ID (comma separated list). |
Shawn Bohrer | 08dbd7e | 2010-11-30 19:57:16 -0600 | [diff] [blame] | 109 | |
| 110 | -t:: |
| 111 | --tid=:: |
David Ahern | b52956c | 2012-02-08 09:32:52 -0700 | [diff] [blame] | 112 | Record events on existing thread ID (comma separated list). |
Adrian Hunter | 69e7e5b | 2013-11-18 11:55:57 +0200 | [diff] [blame] | 113 | This option also disables inheritance by default. Enable it by adding |
| 114 | --inherit. |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 115 | |
Arnaldo Carvalho de Melo | 0d37aa3 | 2012-01-19 14:08:15 -0200 | [diff] [blame] | 116 | -u:: |
| 117 | --uid=:: |
| 118 | Record events in threads owned by uid. Name or number. |
| 119 | |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 120 | -r:: |
| 121 | --realtime=:: |
| 122 | Collect data with this RT SCHED_FIFO priority. |
Jiri Olsa | 563aecb | 2013-06-05 13:35:06 +0200 | [diff] [blame] | 123 | |
Arnaldo Carvalho de Melo | 509051e | 2014-01-14 17:52:14 -0300 | [diff] [blame] | 124 | --no-buffering:: |
Kirill Smelkov | acac03f | 2011-01-12 17:59:36 +0300 | [diff] [blame] | 125 | Collect data without buffering. |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 126 | |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 127 | -c:: |
| 128 | --count=:: |
| 129 | Event period to sample. |
| 130 | |
| 131 | -o:: |
| 132 | --output=:: |
| 133 | Output file name. |
| 134 | |
| 135 | -i:: |
Stephane Eranian | 2e6cdf9 | 2010-05-12 10:40:01 +0200 | [diff] [blame] | 136 | --no-inherit:: |
| 137 | Child tasks do not inherit counters. |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 138 | -F:: |
| 139 | --freq=:: |
| 140 | Profile at this frequency. |
| 141 | |
| 142 | -m:: |
| 143 | --mmap-pages=:: |
Jiri Olsa | 27050f5 | 2013-09-01 12:36:13 +0200 | [diff] [blame] | 144 | Number of mmap data pages (must be a power of two) or size |
| 145 | specification with appended unit character - B/K/M/G. The |
| 146 | size is rounded up to have nearest pages power of two value. |
Adrian Hunter | e9db131 | 2015-04-09 18:53:46 +0300 | [diff] [blame] | 147 | Also, by adding a comma, the number of mmap pages for AUX |
| 148 | area tracing can be specified. |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 149 | |
Namhyung Kim | 9a75606 | 2015-03-02 12:13:33 +0900 | [diff] [blame] | 150 | --group:: |
| 151 | Put all events in a single event group. This precedes the --event |
| 152 | option and remains only for backward compatibility. See --event. |
| 153 | |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 154 | -g:: |
Jiri Olsa | 09b0fd4 | 2013-10-26 16:25:33 +0200 | [diff] [blame] | 155 | Enables call-graph (stack chain/backtrace) recording. |
| 156 | |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 157 | --call-graph:: |
Jiri Olsa | 09b0fd4 | 2013-10-26 16:25:33 +0200 | [diff] [blame] | 158 | Setup and enable call-graph (stack chain/backtrace) recording, |
Namhyung Kim | 76a2654 | 2015-10-22 23:28:32 +0900 | [diff] [blame] | 159 | implies -g. Default is "fp". |
Jiri Olsa | 09b0fd4 | 2013-10-26 16:25:33 +0200 | [diff] [blame] | 160 | |
| 161 | Allows specifying "fp" (frame pointer) or "dwarf" |
Kan Liang | aad2b21 | 2015-01-05 13:23:04 -0500 | [diff] [blame] | 162 | (DWARF's CFI - Call Frame Information) or "lbr" |
| 163 | (Hardware Last Branch Record facility) as the method to collect |
Jiri Olsa | 09b0fd4 | 2013-10-26 16:25:33 +0200 | [diff] [blame] | 164 | the information used to show the call graphs. |
| 165 | |
| 166 | In some systems, where binaries are build with gcc |
| 167 | --fomit-frame-pointer, using the "fp" method will produce bogus |
| 168 | call graphs, using "dwarf", if available (perf tools linked to |
Namhyung Kim | 76a2654 | 2015-10-22 23:28:32 +0900 | [diff] [blame] | 169 | the libunwind or libdw library) should be used instead. |
Kan Liang | aad2b21 | 2015-01-05 13:23:04 -0500 | [diff] [blame] | 170 | Using the "lbr" method doesn't require any compiler options. It |
| 171 | will produce call graphs from the hardware LBR registers. The |
| 172 | main limition is that it is only available on new Intel |
| 173 | platforms, such as Haswell. It can only get user call chain. It |
| 174 | doesn't work with branch stack sampling at the same time. |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 175 | |
Namhyung Kim | 76a2654 | 2015-10-22 23:28:32 +0900 | [diff] [blame] | 176 | When "dwarf" recording is used, perf also records (user) stack dump |
| 177 | when sampled. Default size of the stack dump is 8192 (bytes). |
| 178 | User can change the size by passing the size after comma like |
| 179 | "--call-graph dwarf,4096". |
| 180 | |
Arnaldo Carvalho de Melo | b44308f | 2010-10-26 15:20:09 -0200 | [diff] [blame] | 181 | -q:: |
| 182 | --quiet:: |
| 183 | Don't print any message, useful for scripting. |
| 184 | |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 185 | -v:: |
| 186 | --verbose:: |
| 187 | Be more verbose (show counter open errors, etc). |
| 188 | |
| 189 | -s:: |
| 190 | --stat:: |
Namhyung Kim | 1f91d5f | 2015-05-10 00:19:42 +0900 | [diff] [blame] | 191 | Record per-thread event counts. Use it with 'perf report -T' to see |
| 192 | the values. |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 193 | |
| 194 | -d:: |
| 195 | --data:: |
Peter Zijlstra | 5610032 | 2015-06-10 16:48:50 +0200 | [diff] [blame] | 196 | Record the sample addresses. |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 197 | |
Arnaldo Carvalho de Melo | 9c90a61 | 2010-12-02 10:25:28 -0200 | [diff] [blame] | 198 | -T:: |
| 199 | --timestamp:: |
Peter Zijlstra | 5610032 | 2015-06-10 16:48:50 +0200 | [diff] [blame] | 200 | Record the sample timestamps. Use it with 'perf report -D' to see the |
| 201 | timestamps, for instance. |
| 202 | |
| 203 | -P:: |
| 204 | --period:: |
| 205 | Record the sample period. |
Arnaldo Carvalho de Melo | 9c90a61 | 2010-12-02 10:25:28 -0200 | [diff] [blame] | 206 | |
Jiri Olsa | b6f35ed | 2016-08-01 20:02:35 +0200 | [diff] [blame] | 207 | --sample-cpu:: |
| 208 | Record the sample cpu. |
| 209 | |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 210 | -n:: |
| 211 | --no-samples:: |
| 212 | Don't sample. |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 213 | |
Frederic Weisbecker | ec7ba4e | 2009-08-31 03:32:03 +0200 | [diff] [blame] | 214 | -R:: |
| 215 | --raw-samples:: |
Frederic Weisbecker | bdef3b0 | 2010-04-14 20:05:17 +0200 | [diff] [blame] | 216 | Collect raw sample records from all opened counters (default for tracepoint counters). |
Frederic Weisbecker | ec7ba4e | 2009-08-31 03:32:03 +0200 | [diff] [blame] | 217 | |
Stephane Eranian | c45c6ea | 2010-05-28 12:00:01 +0200 | [diff] [blame] | 218 | -C:: |
| 219 | --cpu:: |
Shawn Bohrer | 08dbd7e | 2010-11-30 19:57:16 -0600 | [diff] [blame] | 220 | Collect samples only on the list of CPUs provided. Multiple CPUs can be provided as a |
| 221 | comma-separated list with no space: 0,1. Ranges of CPUs are specified with -: 0-2. |
Stephane Eranian | c45c6ea | 2010-05-28 12:00:01 +0200 | [diff] [blame] | 222 | In per-thread mode with inheritance mode on (default), samples are captured only when |
| 223 | the thread executes on the designated CPUs. Default is to monitor all CPUs. |
| 224 | |
Namhyung Kim | 7a29c08 | 2015-12-15 10:49:56 +0900 | [diff] [blame] | 225 | -B:: |
| 226 | --no-buildid:: |
| 227 | Do not save the build ids of binaries in the perf.data files. This skips |
| 228 | post processing after recording, which sometimes makes the final step in |
| 229 | the recording process to take a long time, as it needs to process all |
| 230 | events looking for mmap records. The downside is that it can misresolve |
| 231 | symbols if the workload binaries used when recording get locally rebuilt |
| 232 | or upgraded, because the only key available in this case is the |
| 233 | pathname. You can also set the "record.build-id" config variable to |
| 234 | 'skip to have this behaviour permanently. |
| 235 | |
Stephane Eranian | a1ac1d3 | 2010-06-17 11:39:01 +0200 | [diff] [blame] | 236 | -N:: |
| 237 | --no-buildid-cache:: |
Masanari Iida | 96355f2 | 2014-09-10 00:18:50 +0900 | [diff] [blame] | 238 | Do not update the buildid cache. This saves some overhead in situations |
Stephane Eranian | a1ac1d3 | 2010-06-17 11:39:01 +0200 | [diff] [blame] | 239 | where the information in the perf.data file (which includes buildids) |
Namhyung Kim | 7a29c08 | 2015-12-15 10:49:56 +0900 | [diff] [blame] | 240 | is sufficient. You can also set the "record.build-id" config variable to |
| 241 | 'no-cache' to have the same effect. |
Stephane Eranian | a1ac1d3 | 2010-06-17 11:39:01 +0200 | [diff] [blame] | 242 | |
Stephane Eranian | 023695d | 2011-02-14 11:20:01 +0200 | [diff] [blame] | 243 | -G name,...:: |
| 244 | --cgroup name,...:: |
| 245 | monitor only in the container (cgroup) called "name". This option is available only |
| 246 | in per-cpu mode. The cgroup filesystem must be mounted. All threads belonging to |
| 247 | container "name" are monitored when they run on the monitored CPUs. Multiple cgroups |
| 248 | can be provided. Each cgroup is applied to the corresponding event, i.e., first cgroup |
| 249 | to first event, second cgroup to second event and so on. It is possible to provide |
| 250 | an empty cgroup (monitor all the time) using, e.g., -G foo,,bar. Cgroups must have |
| 251 | corresponding events, i.e., they always refer to events defined earlier on the command |
| 252 | line. |
| 253 | |
Roberto Agostino Vitillo | bdfebd8 | 2012-02-09 23:21:02 +0100 | [diff] [blame] | 254 | -b:: |
Stephane Eranian | a5aabda | 2012-03-08 23:47:45 +0100 | [diff] [blame] | 255 | --branch-any:: |
| 256 | Enable taken branch stack sampling. Any type of taken branch may be sampled. |
| 257 | This is a shortcut for --branch-filter any. See --branch-filter for more infos. |
| 258 | |
| 259 | -j:: |
| 260 | --branch-filter:: |
Roberto Agostino Vitillo | bdfebd8 | 2012-02-09 23:21:02 +0100 | [diff] [blame] | 261 | Enable taken branch stack sampling. Each sample captures a series of consecutive |
| 262 | taken branches. The number of branches captured with each sample depends on the |
| 263 | underlying hardware, the type of branches of interest, and the executed code. |
| 264 | It is possible to select the types of branches captured by enabling filters. The |
| 265 | following filters are defined: |
| 266 | |
Stephane Eranian | a5aabda | 2012-03-08 23:47:45 +0100 | [diff] [blame] | 267 | - any: any type of branches |
Roberto Agostino Vitillo | bdfebd8 | 2012-02-09 23:21:02 +0100 | [diff] [blame] | 268 | - any_call: any function call or system call |
| 269 | - any_ret: any function return or system call return |
Anshuman Khandual | 2e49a94 | 2012-05-18 14:16:50 +0530 | [diff] [blame] | 270 | - ind_call: any indirect branch |
Stephane Eranian | 43e41ad | 2015-10-13 09:09:11 +0200 | [diff] [blame] | 271 | - call: direct calls, including far (to/from kernel) calls |
Roberto Agostino Vitillo | bdfebd8 | 2012-02-09 23:21:02 +0100 | [diff] [blame] | 272 | - u: only when the branch target is at the user level |
| 273 | - k: only when the branch target is in the kernel |
| 274 | - hv: only when the target is at the hypervisor level |
Andi Kleen | 0126d49 | 2013-09-20 07:40:42 -0700 | [diff] [blame] | 275 | - in_tx: only when the target is in a hardware transaction |
| 276 | - no_tx: only when the target is not in a hardware transaction |
| 277 | - abort_tx: only when the target is a hardware transaction abort |
Anshuman Khandual | 3e39db4 | 2014-05-22 12:50:10 +0530 | [diff] [blame] | 278 | - cond: conditional branches |
Roberto Agostino Vitillo | bdfebd8 | 2012-02-09 23:21:02 +0100 | [diff] [blame] | 279 | |
| 280 | + |
Anshuman Khandual | 3e39db4 | 2014-05-22 12:50:10 +0530 | [diff] [blame] | 281 | The option requires at least one branch type among any, any_call, any_ret, ind_call, cond. |
Masanari Iida | 9c76820 | 2012-11-30 14:10:25 +0900 | [diff] [blame] | 282 | The privilege levels may be omitted, in which case, the privilege levels of the associated |
Stephane Eranian | a5aabda | 2012-03-08 23:47:45 +0100 | [diff] [blame] | 283 | event are applied to the branch filter. Both kernel (k) and hypervisor (hv) privilege |
| 284 | levels are subject to permissions. When sampling on multiple events, branch stack sampling |
| 285 | is enabled for all the sampling events. The sampled branch type is the same for all events. |
| 286 | The various filters must be specified as a comma separated list: --branch-filter any_ret,u,k |
| 287 | Note that this feature may not be available on all processors. |
Roberto Agostino Vitillo | bdfebd8 | 2012-02-09 23:21:02 +0100 | [diff] [blame] | 288 | |
Andi Kleen | 0548429 | 2013-01-24 16:10:29 +0100 | [diff] [blame] | 289 | --weight:: |
| 290 | Enable weightened sampling. An additional weight is recorded per sample and can be |
| 291 | displayed with the weight and local_weight sort keys. This currently works for TSX |
| 292 | abort events and some memory events in precise mode on modern Intel CPUs. |
| 293 | |
Andi Kleen | 475eeab | 2013-09-20 07:40:43 -0700 | [diff] [blame] | 294 | --transaction:: |
| 295 | Record transaction flags for transaction related events. |
| 296 | |
Adrian Hunter | 3aa5939 | 2013-11-15 15:52:29 +0200 | [diff] [blame] | 297 | --per-thread:: |
| 298 | Use per-thread mmaps. By default per-cpu mmaps are created. This option |
| 299 | overrides that and uses per-thread mmaps. A side-effect of that is that |
| 300 | inheritance is automatically disabled. --per-thread is ignored with a warning |
| 301 | if combined with -a or -C options. |
Adrian Hunter | 539e6bb | 2013-11-01 15:51:34 +0200 | [diff] [blame] | 302 | |
Arnaldo Carvalho de Melo | a6205a3 | 2014-01-14 17:58:12 -0300 | [diff] [blame] | 303 | -D:: |
| 304 | --delay=:: |
Andi Kleen | 6619a53 | 2014-01-11 13:38:27 -0800 | [diff] [blame] | 305 | After starting the program, wait msecs before measuring. This is useful to |
| 306 | filter out the startup phase of the program, which is often very different. |
| 307 | |
Stephane Eranian | 4b6c517 | 2014-09-24 13:48:41 +0200 | [diff] [blame] | 308 | -I:: |
| 309 | --intr-regs:: |
| 310 | Capture machine state (registers) at interrupt, i.e., on counter overflows for |
| 311 | each sample. List of captured registers depends on the architecture. This option |
Stephane Eranian | bcc84ec | 2015-08-31 18:41:12 +0200 | [diff] [blame] | 312 | is off by default. It is possible to select the registers to sample using their |
| 313 | symbolic names, e.g. on x86, ax, si. To list the available registers use |
| 314 | --intr-regs=\?. To name registers, pass a comma separated list such as |
| 315 | --intr-regs=ax,bx. The list of register is architecture dependent. |
| 316 | |
Stephane Eranian | 4b6c517 | 2014-09-24 13:48:41 +0200 | [diff] [blame] | 317 | |
Andi Kleen | 85c273d | 2015-02-24 15:13:40 -0800 | [diff] [blame] | 318 | --running-time:: |
| 319 | Record running and enabled time for read events (:S) |
| 320 | |
Peter Zijlstra | 814c8c3 | 2015-03-31 00:19:31 +0200 | [diff] [blame] | 321 | -k:: |
| 322 | --clockid:: |
| 323 | Sets the clock id to use for the various time fields in the perf_event_type |
| 324 | records. See clock_gettime(). In particular CLOCK_MONOTONIC and |
| 325 | CLOCK_MONOTONIC_RAW are supported, some events might also allow |
| 326 | CLOCK_BOOTTIME, CLOCK_REALTIME and CLOCK_TAI. |
| 327 | |
Adrian Hunter | 2dd6d8a | 2015-04-30 17:37:32 +0300 | [diff] [blame] | 328 | -S:: |
| 329 | --snapshot:: |
| 330 | Select AUX area tracing Snapshot Mode. This option is valid only with an |
| 331 | AUX area tracing event. Optionally the number of bytes to capture per |
| 332 | snapshot can be specified. In Snapshot Mode, trace data is captured only when |
| 333 | signal SIGUSR2 is received. |
| 334 | |
Kan Liang | 9d9cad7 | 2015-06-17 09:51:11 -0400 | [diff] [blame] | 335 | --proc-map-timeout:: |
| 336 | When processing pre-existing threads /proc/XXX/mmap, it may take a long time, |
| 337 | because the file may be huge. A time out is needed in such cases. |
| 338 | This option sets the time out limit. The default value is 500 ms. |
| 339 | |
Adrian Hunter | b757bb0 | 2015-07-21 12:44:04 +0300 | [diff] [blame] | 340 | --switch-events:: |
| 341 | Record context switch events i.e. events of type PERF_RECORD_SWITCH or |
| 342 | PERF_RECORD_SWITCH_CPU_WIDE. |
| 343 | |
He Kuang | 7efe0e0 | 2015-12-14 10:39:23 +0000 | [diff] [blame] | 344 | --clang-path=PATH:: |
Wang Nan | 71dc2326 | 2015-10-14 12:41:19 +0000 | [diff] [blame] | 345 | Path to clang binary to use for compiling BPF scriptlets. |
He Kuang | 7efe0e0 | 2015-12-14 10:39:23 +0000 | [diff] [blame] | 346 | (enabled when BPF support is on) |
Wang Nan | 71dc2326 | 2015-10-14 12:41:19 +0000 | [diff] [blame] | 347 | |
He Kuang | 7efe0e0 | 2015-12-14 10:39:23 +0000 | [diff] [blame] | 348 | --clang-opt=OPTIONS:: |
Wang Nan | 71dc2326 | 2015-10-14 12:41:19 +0000 | [diff] [blame] | 349 | Options passed to clang when compiling BPF scriptlets. |
He Kuang | 7efe0e0 | 2015-12-14 10:39:23 +0000 | [diff] [blame] | 350 | (enabled when BPF support is on) |
| 351 | |
| 352 | --vmlinux=PATH:: |
| 353 | Specify vmlinux path which has debuginfo. |
| 354 | (enabled when BPF prologue is on) |
Wang Nan | 71dc2326 | 2015-10-14 12:41:19 +0000 | [diff] [blame] | 355 | |
Namhyung Kim | 6156681 | 2016-01-11 22:37:09 +0900 | [diff] [blame] | 356 | --buildid-all:: |
| 357 | Record build-id of all DSOs regardless whether it's actually hit or not. |
| 358 | |
Jiri Olsa | 8572388 | 2016-02-15 09:34:31 +0100 | [diff] [blame] | 359 | --all-kernel:: |
| 360 | Configure all used events to run in kernel space. |
| 361 | |
| 362 | --all-user:: |
| 363 | Configure all used events to run in user space. |
| 364 | |
Wang Nan | eca857a | 2016-04-20 18:59:51 +0000 | [diff] [blame] | 365 | --timestamp-filename |
| 366 | Append timestamp to output file name. |
| 367 | |
Wang Nan | 3c1cb7e | 2016-04-20 18:59:50 +0000 | [diff] [blame] | 368 | --switch-output:: |
| 369 | Generate multiple perf.data files, timestamp prefixed, switching to a new one |
| 370 | when receiving a SIGUSR2. |
| 371 | |
| 372 | A possible use case is to, given an external event, slice the perf.data file |
| 373 | that gets then processed, possibly via a perf script, to decide if that |
| 374 | particular perf.data snapshot should be kept or not. |
| 375 | |
Wang Nan | 0c1d46a | 2016-04-20 18:59:52 +0000 | [diff] [blame] | 376 | Implies --timestamp-filename, --no-buildid and --no-buildid-cache. |
Wang Nan | eca857a | 2016-04-20 18:59:51 +0000 | [diff] [blame] | 377 | |
Wang Nan | 0aab213 | 2016-06-16 08:02:41 +0000 | [diff] [blame] | 378 | --dry-run:: |
| 379 | Parse options then exit. --dry-run can be used to detect errors in cmdline |
| 380 | options. |
| 381 | |
| 382 | 'perf record --dry-run -e' can act as a BPF script compiler if llvm.dump-obj |
| 383 | in config file is set to true. |
| 384 | |
Wang Nan | 4ea648a | 2016-07-14 08:34:47 +0000 | [diff] [blame] | 385 | --tail-synthesize:: |
| 386 | Instead of collecting non-sample events (for example, fork, comm, mmap) at |
| 387 | the beginning of record, collect them during finalizing an output file. |
| 388 | The collected non-sample events reflects the status of the system when |
| 389 | record is finished. |
| 390 | |
Wang Nan | 626a6b7 | 2016-07-14 08:34:45 +0000 | [diff] [blame] | 391 | --overwrite:: |
| 392 | Makes all events use an overwritable ring buffer. An overwritable ring |
| 393 | buffer works like a flight recorder: when it gets full, the kernel will |
| 394 | overwrite the oldest records, that thus will never make it to the |
| 395 | perf.data file. |
| 396 | |
| 397 | When '--overwrite' and '--switch-output' are used perf records and drops |
| 398 | events until it receives a signal, meaning that something unusual was |
| 399 | detected that warrants taking a snapshot of the most current events, |
| 400 | those fitting in the ring buffer at that moment. |
| 401 | |
| 402 | 'overwrite' attribute can also be set or canceled for an event using |
| 403 | config terms. For example: 'cycles/overwrite/' and 'instructions/no-overwrite/'. |
| 404 | |
Wang Nan | 4ea648a | 2016-07-14 08:34:47 +0000 | [diff] [blame] | 405 | Implies --tail-synthesize. |
| 406 | |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 407 | SEE ALSO |
| 408 | -------- |
Thomas Gleixner | 386b05e | 2009-06-06 14:56:33 +0200 | [diff] [blame] | 409 | linkperf:perf-stat[1], linkperf:perf-list[1] |