Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 1 | perf-record(1) |
Ingo Molnar | c1c2365 | 2009-05-30 12:38:51 +0200 | [diff] [blame] | 2 | ============== |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 3 | |
| 4 | NAME |
| 5 | ---- |
Ingo Molnar | 23ac9cb | 2009-05-27 09:33:18 +0200 | [diff] [blame] | 6 | perf-record - Run a command and record its profile into perf.data |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 7 | |
| 8 | SYNOPSIS |
| 9 | -------- |
| 10 | [verse] |
| 11 | 'perf record' [-e <EVENT> | --event=EVENT] [-l] [-a] <command> |
Mike Galbraith | 9e096753 | 2009-05-28 16:25:34 +0200 | [diff] [blame] | 12 | 'perf record' [-e <EVENT> | --event=EVENT] [-l] [-a] -- <command> [<options>] |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 13 | |
| 14 | DESCRIPTION |
| 15 | ----------- |
| 16 | This command runs a command and gathers a performance counter profile |
Ingo Molnar | 23ac9cb | 2009-05-27 09:33:18 +0200 | [diff] [blame] | 17 | from it, into perf.data - without displaying anything. |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 18 | |
| 19 | This file can then be inspected later on, using 'perf report'. |
| 20 | |
| 21 | |
| 22 | OPTIONS |
| 23 | ------- |
| 24 | <command>...:: |
| 25 | Any command you can specify in a shell. |
| 26 | |
| 27 | -e:: |
| 28 | --event=:: |
Frederic Weisbecker | 1b290d6 | 2009-11-23 15:42:35 +0100 | [diff] [blame] | 29 | Select the PMU event. Selection can be: |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 30 | |
Frederic Weisbecker | 1b290d6 | 2009-11-23 15:42:35 +0100 | [diff] [blame] | 31 | - a symbolic event name (use 'perf list' to list all events) |
| 32 | |
| 33 | - a raw PMU event (eventsel+umask) in the form of rNNN where NNN is a |
| 34 | hexadecimal event descriptor. |
| 35 | |
Cody P Schafer | f9ab9c1 | 2015-01-07 17:13:53 -0800 | [diff] [blame] | 36 | - a symbolically formed PMU event like 'pmu/param1=0x3,param2/' where |
| 37 | 'param1', 'param2', etc are defined as formats for the PMU in |
| 38 | /sys/bus/event_source/devices/<pmu>/format/*. |
| 39 | |
| 40 | - a symbolically formed event like 'pmu/config=M,config1=N,config2=K/' |
| 41 | |
| 42 | where M, N, K are numbers (in decimal, hex, octal format). Acceptable |
| 43 | values for each of 'config', 'config1' and 'config2' are defined by |
| 44 | corresponding entries in /sys/bus/event_source/devices/<pmu>/format/*. |
| 45 | param1 and param2 are defined as formats for the PMU in: |
| 46 | /sys/bus/event_source/devices/<pmu>/format/* |
| 47 | |
Kan Liang | 3d5d68a | 2015-07-08 04:44:54 -0400 | [diff] [blame] | 48 | There are also some params which are not defined in .../<pmu>/format/*. |
Jiri Olsa | ee4c758 | 2015-07-29 05:42:11 -0400 | [diff] [blame] | 49 | These params can be used to override default config values per event. |
Kan Liang | 3d5d68a | 2015-07-08 04:44:54 -0400 | [diff] [blame] | 50 | Here is a list of the params. |
| 51 | - 'period': Set event sampling period |
Namhyung Kim | 09af2a5 | 2015-08-09 15:45:23 +0900 | [diff] [blame] | 52 | - 'freq': Set event sampling frequency |
Kan Liang | 3206771 | 2015-08-04 04:30:19 -0400 | [diff] [blame] | 53 | - 'time': Disable/enable time stamping. Acceptable values are 1 for |
| 54 | enabling time stamping and 0 for disabling time stamping. |
| 55 | The default is 1. |
Kan Liang | d457c96 | 2015-08-11 06:30:47 -0400 | [diff] [blame] | 56 | - 'call-graph': Disable/enable callgraph. Acceptable strings are "fp" for |
Kan Liang | f9db0d0 | 2015-08-11 06:30:48 -0400 | [diff] [blame] | 57 | FP mode, "dwarf" for DWARF mode, "lbr" for LBR mode and |
| 58 | "no" to disable callgraph. |
Kan Liang | d457c96 | 2015-08-11 06:30:47 -0400 | [diff] [blame] | 59 | - 'stack-size': user stack size for dwarf mode |
Kan Liang | 3d5d68a | 2015-07-08 04:44:54 -0400 | [diff] [blame] | 60 | Note: If the user explicitly sets options which conflict with the params, |
| 61 | the value set by the params will be overridden. |
| 62 | |
Jacob Shin | 3741eb9 | 2014-05-29 17:26:51 +0200 | [diff] [blame] | 63 | - a hardware breakpoint event in the form of '\mem:addr[/len][:access]' |
Frederic Weisbecker | 1b290d6 | 2009-11-23 15:42:35 +0100 | [diff] [blame] | 64 | where addr is the address in memory you want to break on. |
| 65 | Access is the memory access type (read, write, execute); it can |
Jacob Shin | 3741eb9 | 2014-05-29 17:26:51 +0200 | [diff] [blame] | 66 | be passed as follows: '\mem:addr[:[r][w][x]]'. len is the range, in |
| 67 | bytes starting at the specified addr, that the breakpoint will cover. |
Frederic Weisbecker | 1b290d6 | 2009-11-23 15:42:35 +0100 | [diff] [blame] | 68 | If you want to profile read-write accesses at 0x1000, just set |
| 69 | 'mem:0x1000:rw'. |
Jacob Shin | 3741eb9 | 2014-05-29 17:26:51 +0200 | [diff] [blame] | 70 | If you want to profile write accesses in [0x1000, 0x1008), just set |
| 71 | 'mem:0x1000/8:w'. |
Shawn Bohrer | 08dbd7e | 2010-11-30 19:57:16 -0600 | [diff] [blame] | 72 | |
Namhyung Kim | 9a75606 | 2015-03-02 12:13:33 +0900 | [diff] [blame] | 73 | - a group of events surrounded by a pair of braces ("{event1,event2,...}"). |
| 74 | Each event is separated by commas and the group should be quoted to |
| 75 | prevent shell interpretation. You also need to use --group on |
| 76 | "perf report" to view group events together. |
| 77 | |
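The breakpoint and group forms above can be sketched in shell. This is a minimal illustration, not part of perf: `mem_event` is a hypothetical helper that only formats the selector string, and the final lines print candidate command lines rather than running them. `sleep 1` stands in for the profiled command.

```shell
# Hypothetical helper (not part of perf): format a breakpoint event selector.
# usage: mem_event ADDR [LEN] [ACCESS]
mem_event() {
    addr=$1 len=${2:-} access=${3:-}
    spec="mem:$addr"
    [ -n "$len" ] && spec="$spec/$len"
    [ -n "$access" ] && spec="$spec:$access"
    printf '%s\n' "$spec"
}

mem_event 0x1000 "" rw    # prints mem:0x1000:rw
mem_event 0x1000 8 w      # prints mem:0x1000/8:w

# Keep the group in single quotes so the shell does not expand the braces.
group='{cycles,instructions}'

# Print the command lines for review; drop "echo" to actually run them.
echo perf record -e "$(mem_event 0x1000 8 w)" -- sleep 1
echo perf record -e "$group" -- sleep 1
```

Without the quotes, a shell with brace expansion would turn the group into two separate words before perf ever sees it.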
Shawn Bohrer | 08dbd7e | 2010-11-30 19:57:16 -0600 | [diff] [blame] | 78 | --filter=<filter>:: |
Wang Nan | 4ba1faa | 2015-07-10 07:36:10 +0000 | [diff] [blame] | 79 | Event filter. This option should follow an event selector (-e) which |
| 80 | selects tracepoint event(s). Multiple '--filter' options are combined |
| 81 | using '&&'. |
| 82 | |
| 83 | --exclude-perf:: |
| 84 | Don't record events issued by perf itself. This option should follow |
| 85 | an event selector (-e) which selects tracepoint event(s). It adds a |
| 86 | filter expression 'common_pid != $PERFPID' to filters. If other |
| 87 | '--filter' exists, the new filter expression will be combined with |
| 88 | them by '&&'. |
Shawn Bohrer | 08dbd7e | 2010-11-30 19:57:16 -0600 | [diff] [blame] | 89 | |
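How several filters combine with '&&' can be sketched as follows. `combine_filters` is a hypothetical helper written only for illustration; the parentheses are added here to keep precedence unambiguous, and this text does not specify how perf formats the combined string internally. The PIDs and the `sched:sched_switch` tracepoint are placeholders.

```shell
# Hypothetical helper (not part of perf): join filter expressions with '&&'.
combine_filters() {
    out=""
    for f in "$@"; do
        if [ -z "$out" ]; then
            out="($f)"
        else
            out="$out && ($f)"
        fi
    done
    printf '%s\n' "$out"
}

# Two --filter options on one tracepoint selector behave like one combined filter.
combine_filters 'common_pid != 1234' 'common_pid != 5678'
# prints (common_pid != 1234) && (common_pid != 5678)

# Candidate command line, printed for review only (assumed tracepoint name).
echo perf record -e sched:sched_switch \
    --filter "'common_pid != 1234'" --filter "'common_pid != 5678'" -a
```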
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 90 | -a:: |
Shawn Bohrer | 08dbd7e | 2010-11-30 19:57:16 -0600 | [diff] [blame] | 91 | --all-cpus:: |
| 92 | System-wide collection from all CPUs. |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 93 | |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 94 | -p:: |
| 95 | --pid=:: |
David Ahern | b52956c | 2012-02-08 09:32:52 -0700 | [diff] [blame] | 96 | Record events on existing process ID (comma separated list). |
Shawn Bohrer | 08dbd7e | 2010-11-30 19:57:16 -0600 | [diff] [blame] | 97 | |
| 98 | -t:: |
| 99 | --tid=:: |
David Ahern | b52956c | 2012-02-08 09:32:52 -0700 | [diff] [blame] | 100 | Record events on existing thread ID (comma separated list). |
Adrian Hunter | 69e7e5b | 2013-11-18 11:55:57 +0200 | [diff] [blame] | 101 | This option also disables inheritance by default. Enable it by adding |
| 102 | --inherit. |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 103 | |
Arnaldo Carvalho de Melo | 0d37aa3 | 2012-01-19 14:08:15 -0200 | [diff] [blame] | 104 | -u:: |
| 105 | --uid=:: |
| 106 | Record events in threads owned by uid. Name or number. |
| 107 | |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 108 | -r:: |
| 109 | --realtime=:: |
| 110 | Collect data with this RT SCHED_FIFO priority. |
Jiri Olsa | 563aecb | 2013-06-05 13:35:06 +0200 | [diff] [blame] | 111 | |
Arnaldo Carvalho de Melo | 509051e | 2014-01-14 17:52:14 -0300 | [diff] [blame] | 112 | --no-buffering:: |
Kirill Smelkov | acac03f | 2011-01-12 17:59:36 +0300 | [diff] [blame] | 113 | Collect data without buffering. |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 114 | |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 115 | -c:: |
| 116 | --count=:: |
| 117 | Event period to sample. |
| 118 | |
| 119 | -o:: |
| 120 | --output=:: |
| 121 | Output file name. |
| 122 | |
| 123 | -i:: |
Stephane Eranian | 2e6cdf9 | 2010-05-12 10:40:01 +0200 | [diff] [blame] | 124 | --no-inherit:: |
| 125 | Child tasks do not inherit counters. |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 126 | -F:: |
| 127 | --freq=:: |
| 128 | Profile at this frequency. |
| 129 | |
| 130 | -m:: |
| 131 | --mmap-pages=:: |
Jiri Olsa | 27050f5 | 2013-09-01 12:36:13 +0200 | [diff] [blame] | 132 | Number of mmap data pages (must be a power of two) or size |
| 133 | specification with appended unit character - B/K/M/G. The |
| 134 | size is rounded up to the nearest power-of-two number of pages. |
Adrian Hunter | e9db131 | 2015-04-09 18:53:46 +0300 | [diff] [blame] | 135 | Also, by adding a comma, the number of mmap pages for AUX |
| 136 | area tracing can be specified. |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 137 | |
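The rounding described for --mmap-pages can be approximated with a little shell arithmetic. This is a sketch under stated assumptions: a 4096-byte page (check `getconf PAGESIZE` on the target machine) and a plain round-up to the next power of two. `mmap_pages_for` is a hypothetical helper, not something perf provides.

```shell
# Assumed page size in bytes; not taken from perf itself.
PAGE_SIZE=4096

# Hypothetical helper: bytes -> power-of-two page count (rounded up).
mmap_pages_for() {
    bytes=$1
    pages=$(( (bytes + PAGE_SIZE - 1) / PAGE_SIZE ))
    pow=1
    while [ "$pow" -lt "$pages" ]; do
        pow=$(( pow * 2 ))
    done
    printf '%s\n' "$pow"
}

mmap_pages_for $(( 512 * 1024 ))        # 512K is 128 pages, already a power of two
mmap_pages_for $(( 3 * 1024 * 1024 ))   # 3M is 768 pages, rounded up to 1024
```

Under these assumptions, a size such as 3M would end up occupying the same buffer as 4M.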
Namhyung Kim | 9a75606 | 2015-03-02 12:13:33 +0900 | [diff] [blame] | 138 | --group:: |
| 139 | Put all events in a single event group. This precedes the --event |
| 140 | option and remains only for backward compatibility. See --event. |
| 141 | |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 142 | -g:: |
Jiri Olsa | 09b0fd4 | 2013-10-26 16:25:33 +0200 | [diff] [blame] | 143 | Enables call-graph (stack chain/backtrace) recording. |
| 144 | |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 145 | --call-graph:: |
Jiri Olsa | 09b0fd4 | 2013-10-26 16:25:33 +0200 | [diff] [blame] | 146 | Set up and enable call-graph (stack chain/backtrace) recording, |
Namhyung Kim | 76a2654 | 2015-10-22 23:28:32 +0900 | [diff] [blame] | 147 | implies -g. Default is "fp". |
Jiri Olsa | 09b0fd4 | 2013-10-26 16:25:33 +0200 | [diff] [blame] | 148 | |
| 149 | Allows specifying "fp" (frame pointer) or "dwarf" |
Kan Liang | aad2b21 | 2015-01-05 13:23:04 -0500 | [diff] [blame] | 150 | (DWARF's CFI - Call Frame Information) or "lbr" |
| 151 | (Hardware Last Branch Record facility) as the method to collect |
Jiri Olsa | 09b0fd4 | 2013-10-26 16:25:33 +0200 | [diff] [blame] | 152 | the information used to show the call graphs. |
| 153 | |
| 154 | On some systems, where binaries are built with gcc |
| 155 | -fomit-frame-pointer, using the "fp" method will produce bogus |
| 156 | call graphs; "dwarf", if available (perf tools linked to |
Namhyung Kim | 76a2654 | 2015-10-22 23:28:32 +0900 | [diff] [blame] | 157 | the libunwind or libdw library), should be used instead. |
Kan Liang | aad2b21 | 2015-01-05 13:23:04 -0500 | [diff] [blame] | 158 | Using the "lbr" method doesn't require any compiler options. It |
| 159 | will produce call graphs from the hardware LBR registers. The |
| 160 | main limitation is that it is only available on new Intel |
| 161 | platforms, such as Haswell. It can only get the user call chain. It |
| 162 | doesn't work with branch stack sampling at the same time. |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 163 | |
Namhyung Kim | 76a2654 | 2015-10-22 23:28:32 +0900 | [diff] [blame] | 164 | When "dwarf" recording is used, perf also records a (user) stack dump |
| 165 | when sampled. Default size of the stack dump is 8192 (bytes). |
| 166 | The user can change the size by passing it after a comma, like |
| 167 | "--call-graph dwarf,4096". |
| 168 | |
Arnaldo Carvalho de Melo | b44308f | 2010-10-26 15:20:09 -0200 | [diff] [blame] | 169 | -q:: |
| 170 | --quiet:: |
| 171 | Don't print any message, useful for scripting. |
| 172 | |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 173 | -v:: |
| 174 | --verbose:: |
| 175 | Be more verbose (show counter open errors, etc). |
| 176 | |
| 177 | -s:: |
| 178 | --stat:: |
Namhyung Kim | 1f91d5f | 2015-05-10 00:19:42 +0900 | [diff] [blame] | 179 | Record per-thread event counts. Use it with 'perf report -T' to see |
| 180 | the values. |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 181 | |
| 182 | -d:: |
| 183 | --data:: |
Peter Zijlstra | 5610032 | 2015-06-10 16:48:50 +0200 | [diff] [blame] | 184 | Record the sample addresses. |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 185 | |
Arnaldo Carvalho de Melo | 9c90a61 | 2010-12-02 10:25:28 -0200 | [diff] [blame] | 186 | -T:: |
| 187 | --timestamp:: |
Peter Zijlstra | 5610032 | 2015-06-10 16:48:50 +0200 | [diff] [blame] | 188 | Record the sample timestamps. Use it with 'perf report -D' to see the |
| 189 | timestamps, for instance. |
| 190 | |
| 191 | -P:: |
| 192 | --period:: |
| 193 | Record the sample period. |
Arnaldo Carvalho de Melo | 9c90a61 | 2010-12-02 10:25:28 -0200 | [diff] [blame] | 194 | |
Jiri Olsa | b6f35ed | 2016-08-01 20:02:35 +0200 | [diff] [blame] | 195 | --sample-cpu:: |
| 196 | Record the sample cpu. |
| 197 | |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 198 | -n:: |
| 199 | --no-samples:: |
| 200 | Don't sample. |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 201 | |
Frederic Weisbecker | ec7ba4e | 2009-08-31 03:32:03 +0200 | [diff] [blame] | 202 | -R:: |
| 203 | --raw-samples:: |
Frederic Weisbecker | bdef3b0 | 2010-04-14 20:05:17 +0200 | [diff] [blame] | 204 | Collect raw sample records from all opened counters (default for tracepoint counters). |
Frederic Weisbecker | ec7ba4e | 2009-08-31 03:32:03 +0200 | [diff] [blame] | 205 | |
Stephane Eranian | c45c6ea | 2010-05-28 12:00:01 +0200 | [diff] [blame] | 206 | -C:: |
| 207 | --cpu:: |
Shawn Bohrer | 08dbd7e | 2010-11-30 19:57:16 -0600 | [diff] [blame] | 208 | Collect samples only on the list of CPUs provided. Multiple CPUs can be provided as a |
| 209 | comma-separated list with no space: 0,1. Ranges of CPUs are specified with -: 0-2. |
Stephane Eranian | c45c6ea | 2010-05-28 12:00:01 +0200 | [diff] [blame] | 210 | In per-thread mode with inheritance mode on (default), samples are captured only when |
| 211 | the thread executes on the designated CPUs. Default is to monitor all CPUs. |
| 212 | |
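The CPU list syntax accepted by -C (commas for individual CPUs, a dash for ranges) can be expanded with a short shell function. `expand_cpus` is a hypothetical helper for illustration only; it does not validate that the CPUs exist.

```shell
# Hypothetical helper: expand a list such as "0-2,5" into "0 1 2 5".
expand_cpus() {
    list=$1 out=""
    old_ifs=$IFS
    IFS=,
    for part in $list; do
        case $part in
            *-*)
                lo=${part%-*}
                hi=${part#*-}
                i=$lo
                while [ "$i" -le "$hi" ]; do
                    out="$out $i"
                    i=$(( i + 1 ))
                done
                ;;
            *)
                out="$out $part"
                ;;
        esac
    done
    IFS=$old_ifs
    printf '%s\n' "${out# }"
}

expand_cpus 0,1      # prints 0 1
expand_cpus 0-2      # prints 0 1 2
expand_cpus 0-2,5    # prints 0 1 2 5
```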
Namhyung Kim | 7a29c08 | 2015-12-15 10:49:56 +0900 | [diff] [blame] | 213 | -B:: |
| 214 | --no-buildid:: |
| 215 | Do not save the build ids of binaries in the perf.data files. This skips |
| 216 | post processing after recording, which sometimes makes the final step in |
| 217 | the recording process take a long time, as it needs to process all |
| 218 | events looking for mmap records. The downside is that it can misresolve |
| 219 | symbols if the workload binaries used when recording get locally rebuilt |
| 220 | or upgraded, because the only key available in this case is the |
| 221 | pathname. You can also set the "record.build-id" config variable to |
| 222 | 'skip' to have this behaviour permanently. |
| 223 | |
Stephane Eranian | a1ac1d3 | 2010-06-17 11:39:01 +0200 | [diff] [blame] | 224 | -N:: |
| 225 | --no-buildid-cache:: |
Masanari Iida | 96355f2 | 2014-09-10 00:18:50 +0900 | [diff] [blame] | 226 | Do not update the buildid cache. This saves some overhead in situations |
Stephane Eranian | a1ac1d3 | 2010-06-17 11:39:01 +0200 | [diff] [blame] | 227 | where the information in the perf.data file (which includes buildids) |
Namhyung Kim | 7a29c08 | 2015-12-15 10:49:56 +0900 | [diff] [blame] | 228 | is sufficient. You can also set the "record.build-id" config variable to |
| 229 | 'no-cache' to have the same effect. |
Stephane Eranian | a1ac1d3 | 2010-06-17 11:39:01 +0200 | [diff] [blame] | 230 | |
Stephane Eranian | 023695d | 2011-02-14 11:20:01 +0200 | [diff] [blame] | 231 | -G name,...:: |
| 232 | --cgroup name,...:: |
| 233 | Monitor only in the container (cgroup) called "name". This option is available only |
| 234 | in per-cpu mode. The cgroup filesystem must be mounted. All threads belonging to |
| 235 | container "name" are monitored when they run on the monitored CPUs. Multiple cgroups |
| 236 | can be provided. Each cgroup is applied to the corresponding event, i.e., first cgroup |
| 237 | to first event, second cgroup to second event and so on. It is possible to provide |
| 238 | an empty cgroup (monitor all the time) using, e.g., -G foo,,bar. Cgroups must have |
| 239 | corresponding events, i.e., they always refer to events defined earlier on the command |
| 240 | line. |
| 241 | |
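The positional pairing between events and cgroups can be made concrete with a small sketch. `pair_cgroups` is a hypothetical helper that only prints which cgroup would apply to which event; the event names are placeholders (check 'perf list' for what your system offers), and `foo` and `bar` are the cgroup names from the example above.

```shell
# Hypothetical helper: pair the Nth event with the Nth cgroup.
# An empty cgroup slot means the event is not restricted to a cgroup.
pair_cgroups() {
    events=$1 cgroups=$2
    while [ -n "$events" ]; do
        ev=${events%%,*}
        cg=${cgroups%%,*}
        printf '%s -> %s\n' "$ev" "${cg:-<no cgroup>}"
        case $events in
            *,*) events=${events#*,} ;;
            *)   events="" ;;
        esac
        case $cgroups in
            *,*) cgroups=${cgroups#*,} ;;
            *)   cgroups="" ;;
        esac
    done
}

# Mirrors: -e cycles,instructions,cache-misses -G foo,,bar
pair_cgroups cycles,instructions,cache-misses foo,,bar
```

With the input shown, the first event is paired with `foo`, the second with no cgroup, and the third with `bar`.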
Roberto Agostino Vitillo | bdfebd8 | 2012-02-09 23:21:02 +0100 | [diff] [blame] | 242 | -b:: |
Stephane Eranian | a5aabda | 2012-03-08 23:47:45 +0100 | [diff] [blame] | 243 | --branch-any:: |
| 244 | Enable taken branch stack sampling. Any type of taken branch may be sampled. |
| 245 | This is a shortcut for --branch-filter any. See --branch-filter for more information. |
| 246 | |
| 247 | -j:: |
| 248 | --branch-filter:: |
Roberto Agostino Vitillo | bdfebd8 | 2012-02-09 23:21:02 +0100 | [diff] [blame] | 249 | Enable taken branch stack sampling. Each sample captures a series of consecutive |
| 250 | taken branches. The number of branches captured with each sample depends on the |
| 251 | underlying hardware, the type of branches of interest, and the executed code. |
| 252 | It is possible to select the types of branches captured by enabling filters. The |
| 253 | following filters are defined: |
| 254 | |
Stephane Eranian | a5aabda | 2012-03-08 23:47:45 +0100 | [diff] [blame] | 255 | - any: any type of branches |
Roberto Agostino Vitillo | bdfebd8 | 2012-02-09 23:21:02 +0100 | [diff] [blame] | 256 | - any_call: any function call or system call |
| 257 | - any_ret: any function return or system call return |
Anshuman Khandual | 2e49a94 | 2012-05-18 14:16:50 +0530 | [diff] [blame] | 258 | - ind_call: any indirect branch |
Stephane Eranian | 43e41ad | 2015-10-13 09:09:11 +0200 | [diff] [blame] | 259 | - call: direct calls, including far (to/from kernel) calls |
Roberto Agostino Vitillo | bdfebd8 | 2012-02-09 23:21:02 +0100 | [diff] [blame] | 260 | - u: only when the branch target is at the user level |
| 261 | - k: only when the branch target is in the kernel |
| 262 | - hv: only when the target is at the hypervisor level |
Andi Kleen | 0126d49 | 2013-09-20 07:40:42 -0700 | [diff] [blame] | 263 | - in_tx: only when the target is in a hardware transaction |
| 264 | - no_tx: only when the target is not in a hardware transaction |
| 265 | - abort_tx: only when the target is a hardware transaction abort |
Anshuman Khandual | 3e39db4 | 2014-05-22 12:50:10 +0530 | [diff] [blame] | 266 | - cond: conditional branches |
Roberto Agostino Vitillo | bdfebd8 | 2012-02-09 23:21:02 +0100 | [diff] [blame] | 267 | |
| 268 | + |
Anshuman Khandual | 3e39db4 | 2014-05-22 12:50:10 +0530 | [diff] [blame] | 269 | The option requires at least one branch type among any, any_call, any_ret, ind_call, cond. |
Masanari Iida | 9c76820 | 2012-11-30 14:10:25 +0900 | [diff] [blame] | 270 | The privilege levels may be omitted, in which case, the privilege levels of the associated |
Stephane Eranian | a5aabda | 2012-03-08 23:47:45 +0100 | [diff] [blame] | 271 | event are applied to the branch filter. Both kernel (k) and hypervisor (hv) privilege |
| 272 | levels are subject to permissions. When sampling on multiple events, branch stack sampling |
| 273 | is enabled for all the sampling events. The sampled branch type is the same for all events. |
| 274 | The various filters must be specified as a comma separated list: --branch-filter any_ret,u,k |
| 275 | Note that this feature may not be available on all processors. |
Roberto Agostino Vitillo | bdfebd8 | 2012-02-09 23:21:02 +0100 | [diff] [blame] | 276 | |
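The rule that a filter list needs at least one branch type, with privilege levels being optional, can be checked before invoking perf. `has_branch_type` is a hypothetical pre-flight helper, not a perf feature; it accepts exactly the branch types named in the paragraph above (any, any_call, any_ret, ind_call, cond).

```shell
# Hypothetical helper: succeed if the comma separated list names a branch type.
has_branch_type() {
    old_ifs=$IFS
    IFS=,
    found=1
    for f in $1; do
        case $f in
            any|any_call|any_ret|ind_call|cond) found=0 ;;
        esac
    done
    IFS=$old_ifs
    return $found
}

has_branch_type any_ret,u,k && echo "ok: any_ret,u,k"
has_branch_type u,k || echo "rejected: u,k (privilege levels only)"
```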
Andi Kleen | 0548429 | 2013-01-24 16:10:29 +0100 | [diff] [blame] | 277 | --weight:: |
| 278 | Enable weighted sampling. An additional weight is recorded per sample and can be |
| 279 | displayed with the weight and local_weight sort keys. This currently works for TSX |
| 280 | abort events and some memory events in precise mode on modern Intel CPUs. |
| 281 | |
Andi Kleen | 475eeab | 2013-09-20 07:40:43 -0700 | [diff] [blame] | 282 | --transaction:: |
| 283 | Record transaction flags for transaction related events. |
| 284 | |
Adrian Hunter | 3aa5939 | 2013-11-15 15:52:29 +0200 | [diff] [blame] | 285 | --per-thread:: |
| 286 | Use per-thread mmaps. By default per-cpu mmaps are created. This option |
| 287 | overrides that and uses per-thread mmaps. A side-effect of that is that |
| 288 | inheritance is automatically disabled. --per-thread is ignored with a warning |
| 289 | if combined with -a or -C options. |
Adrian Hunter | 539e6bb | 2013-11-01 15:51:34 +0200 | [diff] [blame] | 290 | |
Arnaldo Carvalho de Melo | a6205a3 | 2014-01-14 17:58:12 -0300 | [diff] [blame] | 291 | -D:: |
| 292 | --delay=:: |
Andi Kleen | 6619a53 | 2014-01-11 13:38:27 -0800 | [diff] [blame] | 293 | After starting the program, wait msecs before measuring. This is useful to |
| 294 | filter out the startup phase of the program, which is often very different. |
| 295 | |
Stephane Eranian | 4b6c517 | 2014-09-24 13:48:41 +0200 | [diff] [blame] | 296 | -I:: |
| 297 | --intr-regs:: |
| 298 | Capture machine state (registers) at interrupt, i.e., on counter overflows for |
| 299 | each sample. List of captured registers depends on the architecture. This option |
Stephane Eranian | bcc84ec | 2015-08-31 18:41:12 +0200 | [diff] [blame] | 300 | is off by default. It is possible to select the registers to sample using their |
| 301 | symbolic names, e.g. on x86, ax, si. To list the available registers use |
| 302 | --intr-regs=\?. To name registers, pass a comma separated list such as |
| 303 | --intr-regs=ax,bx. The list of registers is architecture dependent. |
| 304 | |
Stephane Eranian | 4b6c517 | 2014-09-24 13:48:41 +0200 | [diff] [blame] | 305 | |
Andi Kleen | 85c273d | 2015-02-24 15:13:40 -0800 | [diff] [blame] | 306 | --running-time:: |
| 307 | Record running and enabled time for read events (:S) |
| 308 | |
Peter Zijlstra | 814c8c3 | 2015-03-31 00:19:31 +0200 | [diff] [blame] | 309 | -k:: |
| 310 | --clockid:: |
| 311 | Sets the clock id to use for the various time fields in the perf_event_type |
| 312 | records. See clock_gettime(). In particular CLOCK_MONOTONIC and |
| 313 | CLOCK_MONOTONIC_RAW are supported, some events might also allow |
| 314 | CLOCK_BOOTTIME, CLOCK_REALTIME and CLOCK_TAI. |
| 315 | |
Adrian Hunter | 2dd6d8a | 2015-04-30 17:37:32 +0300 | [diff] [blame] | 316 | -S:: |
| 317 | --snapshot:: |
| 318 | Select AUX area tracing Snapshot Mode. This option is valid only with an |
| 319 | AUX area tracing event. Optionally the number of bytes to capture per |
| 320 | snapshot can be specified. In Snapshot Mode, trace data is captured only when |
| 321 | signal SIGUSR2 is received. |
| 322 | |
Kan Liang | 9d9cad7 | 2015-06-17 09:51:11 -0400 | [diff] [blame] | 323 | --proc-map-timeout:: |
| 324 | When processing pre-existing threads /proc/XXX/maps, it may take a long time, |
| 325 | because the file may be huge. A timeout is needed in such cases. |
| 326 | This option sets the timeout limit. The default value is 500 ms. |
| 327 | |
Adrian Hunter | b757bb0 | 2015-07-21 12:44:04 +0300 | [diff] [blame] | 328 | --switch-events:: |
| 329 | Record context switch events i.e. events of type PERF_RECORD_SWITCH or |
| 330 | PERF_RECORD_SWITCH_CPU_WIDE. |
| 331 | |
He Kuang | 7efe0e0 | 2015-12-14 10:39:23 +0000 | [diff] [blame] | 332 | --clang-path=PATH:: |
Wang Nan | 71dc2326 | 2015-10-14 12:41:19 +0000 | [diff] [blame] | 333 | Path to clang binary to use for compiling BPF scriptlets. |
He Kuang | 7efe0e0 | 2015-12-14 10:39:23 +0000 | [diff] [blame] | 334 | (enabled when BPF support is on) |
Wang Nan | 71dc2326 | 2015-10-14 12:41:19 +0000 | [diff] [blame] | 335 | |
He Kuang | 7efe0e0 | 2015-12-14 10:39:23 +0000 | [diff] [blame] | 336 | --clang-opt=OPTIONS:: |
Wang Nan | 71dc2326 | 2015-10-14 12:41:19 +0000 | [diff] [blame] | 337 | Options passed to clang when compiling BPF scriptlets. |
He Kuang | 7efe0e0 | 2015-12-14 10:39:23 +0000 | [diff] [blame] | 338 | (enabled when BPF support is on) |
| 339 | |
| 340 | --vmlinux=PATH:: |
| 341 | Specify vmlinux path which has debuginfo. |
| 342 | (enabled when BPF prologue is on) |
Wang Nan | 71dc2326 | 2015-10-14 12:41:19 +0000 | [diff] [blame] | 343 | |
Namhyung Kim | 6156681 | 2016-01-11 22:37:09 +0900 | [diff] [blame] | 344 | --buildid-all:: |
| 345 | Record build-id of all DSOs regardless of whether it's actually hit or not. |
| 346 | |
Jiri Olsa | 8572388 | 2016-02-15 09:34:31 +0100 | [diff] [blame] | 347 | --all-kernel:: |
| 348 | Configure all used events to run in kernel space. |
| 349 | |
| 350 | --all-user:: |
| 351 | Configure all used events to run in user space. |
| 352 | |
Wang Nan | eca857a | 2016-04-20 18:59:51 +0000 | [diff] [blame] | 353 | --timestamp-filename:: |
| 354 | Append timestamp to output file name. |
| 355 | |
Wang Nan | 3c1cb7e | 2016-04-20 18:59:50 +0000 | [diff] [blame] | 356 | --switch-output:: |
| 357 | Generate multiple perf.data files, timestamp prefixed, switching to a new one |
| 358 | when receiving a SIGUSR2. |
| 359 | |
| 360 | A possible use case is to, given an external event, slice the perf.data file |
| 361 | that then gets processed, possibly via a perf script, to decide if that |
| 362 | particular perf.data snapshot should be kept or not. |
| 363 | |
Wang Nan | 0c1d46a | 2016-04-20 18:59:52 +0000 | [diff] [blame] | 364 | Implies --timestamp-filename, --no-buildid and --no-buildid-cache. |
Wang Nan | eca857a | 2016-04-20 18:59:51 +0000 | [diff] [blame] | 365 | |
Wang Nan | 0aab213 | 2016-06-16 08:02:41 +0000 | [diff] [blame] | 366 | --dry-run:: |
| 367 | Parse options then exit. --dry-run can be used to detect errors in cmdline |
| 368 | options. |
| 369 | |
| 370 | 'perf record --dry-run -e' can act as a BPF script compiler if llvm.dump-obj |
| 371 | in config file is set to true. |
| 372 | |
Wang Nan | 4ea648a | 2016-07-14 08:34:47 +0000 | [diff] [blame] | 373 | --tail-synthesize:: |
| 374 | Instead of collecting non-sample events (for example, fork, comm, mmap) at |
| 375 | the beginning of record, collect them during finalizing an output file. |
| 376 | The collected non-sample events reflect the status of the system when |
| 377 | record is finished. |
| 378 | |
Wang Nan | 626a6b7 | 2016-07-14 08:34:45 +0000 | [diff] [blame] | 379 | --overwrite:: |
| 380 | Makes all events use an overwritable ring buffer. An overwritable ring |
| 381 | buffer works like a flight recorder: when it gets full, the kernel will |
| 382 | overwrite the oldest records, that thus will never make it to the |
| 383 | perf.data file. |
| 384 | |
| 385 | When '--overwrite' and '--switch-output' are used, perf records and drops |
| 386 | events until it receives a signal, meaning that something unusual was |
| 387 | detected that warrants taking a snapshot of the most current events, |
| 388 | those fitting in the ring buffer at that moment. |
| 389 | |
| 390 | 'overwrite' attribute can also be set or canceled for an event using |
| 391 | config terms. For example: 'cycles/overwrite/' and 'instructions/no-overwrite/'. |
| 392 | |
Wang Nan | 4ea648a | 2016-07-14 08:34:47 +0000 | [diff] [blame] | 393 | Implies --tail-synthesize. |
| 394 | |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 395 | SEE ALSO |
| 396 | -------- |
Thomas Gleixner | 386b05e | 2009-06-06 14:56:33 +0200 | [diff] [blame] | 397 | linkperf:perf-stat[1], linkperf:perf-list[1] |