Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 1 | perf-record(1) |
Ingo Molnar | c1c2365 | 2009-05-30 12:38:51 +0200 | [diff] [blame] | 2 | ============== |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 3 | |
| 4 | NAME |
| 5 | ---- |
Ingo Molnar | 23ac9cb | 2009-05-27 09:33:18 +0200 | [diff] [blame] | 6 | perf-record - Run a command and record its profile into perf.data |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 7 | |
| 8 | SYNOPSIS |
| 9 | -------- |
| 10 | [verse] |
| 11 | 'perf record' [-e <EVENT> | --event=EVENT] [-l] [-a] <command> |
Mike Galbraith | 9e096753 | 2009-05-28 16:25:34 +0200 | [diff] [blame] | 12 | 'perf record' [-e <EVENT> | --event=EVENT] [-l] [-a] -- <command> [<options>] |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 13 | |
| 14 | DESCRIPTION |
| 15 | ----------- |
| 16 | This command runs a command and gathers a performance counter profile |
Ingo Molnar | 23ac9cb | 2009-05-27 09:33:18 +0200 | [diff] [blame] | 17 | from it, into perf.data - without displaying anything. |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 18 | |
| 19 | This file can then be inspected later on, using 'perf report'. |
| 20 | |
| 21 | |
| 22 | OPTIONS |
| 23 | ------- |
| 24 | <command>...:: |
| 25 | Any command you can specify in a shell. |
| 26 | |
| 27 | -e:: |
| 28 | --event=:: |
Frederic Weisbecker | 1b290d6 | 2009-11-23 15:42:35 +0100 | [diff] [blame] | 29 | Select the PMU event. Selection can be: |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 30 | |
Frederic Weisbecker | 1b290d6 | 2009-11-23 15:42:35 +0100 | [diff] [blame] | 31 | - a symbolic event name (use 'perf list' to list all events) |
| 32 | |
| 33 | - a raw PMU event (eventsel+umask) in the form of rNNN where NNN is a |
| 34 | hexadecimal event descriptor. |
| 35 | |
| 36 | - a hardware breakpoint event in the form of '\mem:addr[:access]' |
| 37 | where addr is the address in memory you want to break in. |
| 38 | Access is the memory access type (read, write, execute) it can |
| 39 | be passed as follows: '\mem:addr[:[r][w][x]]'. |
| 40 | If you want to profile read-write accesses in 0x1000, just set |
| 41 | 'mem:0x1000:rw'. |
Shawn Bohrer | 08dbd7e | 2010-11-30 19:57:16 -0600 | [diff] [blame] | 42 | |
| 43 | --filter=<filter>:: |
| 44 | Event filter. |
| 45 | |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 46 | -a:: |
Shawn Bohrer | 08dbd7e | 2010-11-30 19:57:16 -0600 | [diff] [blame] | 47 | --all-cpus:: |
| 48 | System-wide collection from all CPUs. |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 49 | |
| 50 | -l:: |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 51 | Scale counter values. |
| 52 | |
| 53 | -p:: |
| 54 | --pid=:: |
David Ahern | b52956c | 2012-02-08 09:32:52 -0700 | [diff] [blame] | 55 | Record events on existing process ID (comma separated list). |
Shawn Bohrer | 08dbd7e | 2010-11-30 19:57:16 -0600 | [diff] [blame] | 56 | |
| 57 | -t:: |
| 58 | --tid=:: |
David Ahern | b52956c | 2012-02-08 09:32:52 -0700 | [diff] [blame] | 59 | Record events on existing thread ID (comma separated list). |
Adrian Hunter | 69e7e5b | 2013-11-18 11:55:57 +0200 | [diff] [blame^] | 60 | This option also disables inheritance by default. Enable it by adding |
| 61 | --inherit. |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 62 | |
Arnaldo Carvalho de Melo | 0d37aa3 | 2012-01-19 14:08:15 -0200 | [diff] [blame] | 63 | -u:: |
| 64 | --uid=:: |
| 65 | Record events in threads owned by uid. Name or number. |
| 66 | |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 67 | -r:: |
| 68 | --realtime=:: |
| 69 | Collect data with this RT SCHED_FIFO priority. |
Jiri Olsa | 563aecb | 2013-06-05 13:35:06 +0200 | [diff] [blame] | 70 | |
Kirill Smelkov | acac03f | 2011-01-12 17:59:36 +0300 | [diff] [blame] | 71 | -D:: |
| 72 | --no-delay:: |
| 73 | Collect data without buffering. |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 74 | |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 75 | -c:: |
| 76 | --count=:: |
| 77 | Event period to sample. |
| 78 | |
| 79 | -o:: |
| 80 | --output=:: |
| 81 | Output file name. |
| 82 | |
| 83 | -i:: |
Stephane Eranian | 2e6cdf9 | 2010-05-12 10:40:01 +0200 | [diff] [blame] | 84 | --no-inherit:: |
| 85 | Child tasks do not inherit counters. |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 86 | -F:: |
| 87 | --freq=:: |
| 88 | Profile at this frequency. |
| 89 | |
| 90 | -m:: |
| 91 | --mmap-pages=:: |
Jiri Olsa | 27050f5 | 2013-09-01 12:36:13 +0200 | [diff] [blame] | 92 | Number of mmap data pages (must be a power of two) or size |
| 93 | specification with appended unit character - B/K/M/G. The |
| 94 | size is rounded up to have nearest pages power of two value. |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 95 | |
| 96 | -g:: |
Jiri Olsa | 09b0fd4 | 2013-10-26 16:25:33 +0200 | [diff] [blame] | 97 | Enables call-graph (stack chain/backtrace) recording. |
| 98 | |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 99 | --call-graph:: |
Jiri Olsa | 09b0fd4 | 2013-10-26 16:25:33 +0200 | [diff] [blame] | 100 | Setup and enable call-graph (stack chain/backtrace) recording, |
| 101 | implies -g. |
| 102 | |
| 103 | Allows specifying "fp" (frame pointer) or "dwarf" |
| 104 | (DWARF's CFI - Call Frame Information) as the method to collect |
| 105 | the information used to show the call graphs. |
| 106 | |
| 107 | In some systems, where binaries are build with gcc |
| 108 | --fomit-frame-pointer, using the "fp" method will produce bogus |
| 109 | call graphs, using "dwarf", if available (perf tools linked to |
| 110 | the libunwind library) should be used instead. |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 111 | |
Arnaldo Carvalho de Melo | b44308f | 2010-10-26 15:20:09 -0200 | [diff] [blame] | 112 | -q:: |
| 113 | --quiet:: |
| 114 | Don't print any message, useful for scripting. |
| 115 | |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 116 | -v:: |
| 117 | --verbose:: |
| 118 | Be more verbose (show counter open errors, etc). |
| 119 | |
| 120 | -s:: |
| 121 | --stat:: |
| 122 | Per thread counts. |
| 123 | |
| 124 | -d:: |
| 125 | --data:: |
| 126 | Sample addresses. |
| 127 | |
Arnaldo Carvalho de Melo | 9c90a61 | 2010-12-02 10:25:28 -0200 | [diff] [blame] | 128 | -T:: |
| 129 | --timestamp:: |
| 130 | Sample timestamps. Use it with 'perf report -D' to see the timestamps, |
| 131 | for instance. |
| 132 | |
Arnaldo Carvalho de Melo | 386c0b7 | 2009-08-05 10:04:53 -0300 | [diff] [blame] | 133 | -n:: |
| 134 | --no-samples:: |
| 135 | Don't sample. |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 136 | |
Frederic Weisbecker | ec7ba4e | 2009-08-31 03:32:03 +0200 | [diff] [blame] | 137 | -R:: |
| 138 | --raw-samples:: |
Frederic Weisbecker | bdef3b0 | 2010-04-14 20:05:17 +0200 | [diff] [blame] | 139 | Collect raw sample records from all opened counters (default for tracepoint counters). |
Frederic Weisbecker | ec7ba4e | 2009-08-31 03:32:03 +0200 | [diff] [blame] | 140 | |
Stephane Eranian | c45c6ea | 2010-05-28 12:00:01 +0200 | [diff] [blame] | 141 | -C:: |
| 142 | --cpu:: |
Shawn Bohrer | 08dbd7e | 2010-11-30 19:57:16 -0600 | [diff] [blame] | 143 | Collect samples only on the list of CPUs provided. Multiple CPUs can be provided as a |
| 144 | comma-separated list with no space: 0,1. Ranges of CPUs are specified with -: 0-2. |
Stephane Eranian | c45c6ea | 2010-05-28 12:00:01 +0200 | [diff] [blame] | 145 | In per-thread mode with inheritance mode on (default), samples are captured only when |
| 146 | the thread executes on the designated CPUs. Default is to monitor all CPUs. |
| 147 | |
Stephane Eranian | a1ac1d3 | 2010-06-17 11:39:01 +0200 | [diff] [blame] | 148 | -N:: |
| 149 | --no-buildid-cache:: |
| 150 | Do not update the builid cache. This saves some overhead in situations |
| 151 | where the information in the perf.data file (which includes buildids) |
| 152 | is sufficient. |
| 153 | |
Stephane Eranian | 023695d | 2011-02-14 11:20:01 +0200 | [diff] [blame] | 154 | -G name,...:: |
| 155 | --cgroup name,...:: |
| 156 | monitor only in the container (cgroup) called "name". This option is available only |
| 157 | in per-cpu mode. The cgroup filesystem must be mounted. All threads belonging to |
| 158 | container "name" are monitored when they run on the monitored CPUs. Multiple cgroups |
| 159 | can be provided. Each cgroup is applied to the corresponding event, i.e., first cgroup |
| 160 | to first event, second cgroup to second event and so on. It is possible to provide |
| 161 | an empty cgroup (monitor all the time) using, e.g., -G foo,,bar. Cgroups must have |
| 162 | corresponding events, i.e., they always refer to events defined earlier on the command |
| 163 | line. |
| 164 | |
Roberto Agostino Vitillo | bdfebd8 | 2012-02-09 23:21:02 +0100 | [diff] [blame] | 165 | -b:: |
Stephane Eranian | a5aabda | 2012-03-08 23:47:45 +0100 | [diff] [blame] | 166 | --branch-any:: |
| 167 | Enable taken branch stack sampling. Any type of taken branch may be sampled. |
| 168 | This is a shortcut for --branch-filter any. See --branch-filter for more infos. |
| 169 | |
| 170 | -j:: |
| 171 | --branch-filter:: |
Roberto Agostino Vitillo | bdfebd8 | 2012-02-09 23:21:02 +0100 | [diff] [blame] | 172 | Enable taken branch stack sampling. Each sample captures a series of consecutive |
| 173 | taken branches. The number of branches captured with each sample depends on the |
| 174 | underlying hardware, the type of branches of interest, and the executed code. |
| 175 | It is possible to select the types of branches captured by enabling filters. The |
| 176 | following filters are defined: |
| 177 | |
Stephane Eranian | a5aabda | 2012-03-08 23:47:45 +0100 | [diff] [blame] | 178 | - any: any type of branches |
Roberto Agostino Vitillo | bdfebd8 | 2012-02-09 23:21:02 +0100 | [diff] [blame] | 179 | - any_call: any function call or system call |
| 180 | - any_ret: any function return or system call return |
Anshuman Khandual | 2e49a94 | 2012-05-18 14:16:50 +0530 | [diff] [blame] | 181 | - ind_call: any indirect branch |
Roberto Agostino Vitillo | bdfebd8 | 2012-02-09 23:21:02 +0100 | [diff] [blame] | 182 | - u: only when the branch target is at the user level |
| 183 | - k: only when the branch target is in the kernel |
| 184 | - hv: only when the target is at the hypervisor level |
Andi Kleen | 0126d49 | 2013-09-20 07:40:42 -0700 | [diff] [blame] | 185 | - in_tx: only when the target is in a hardware transaction |
| 186 | - no_tx: only when the target is not in a hardware transaction |
| 187 | - abort_tx: only when the target is a hardware transaction abort |
Roberto Agostino Vitillo | bdfebd8 | 2012-02-09 23:21:02 +0100 | [diff] [blame] | 188 | |
| 189 | + |
Stephane Eranian | a5aabda | 2012-03-08 23:47:45 +0100 | [diff] [blame] | 190 | The option requires at least one branch type among any, any_call, any_ret, ind_call. |
Masanari Iida | 9c76820 | 2012-11-30 14:10:25 +0900 | [diff] [blame] | 191 | The privilege levels may be omitted, in which case, the privilege levels of the associated |
Stephane Eranian | a5aabda | 2012-03-08 23:47:45 +0100 | [diff] [blame] | 192 | event are applied to the branch filter. Both kernel (k) and hypervisor (hv) privilege |
| 193 | levels are subject to permissions. When sampling on multiple events, branch stack sampling |
| 194 | is enabled for all the sampling events. The sampled branch type is the same for all events. |
| 195 | The various filters must be specified as a comma separated list: --branch-filter any_ret,u,k |
| 196 | Note that this feature may not be available on all processors. |
Roberto Agostino Vitillo | bdfebd8 | 2012-02-09 23:21:02 +0100 | [diff] [blame] | 197 | |
Andi Kleen | 0548429 | 2013-01-24 16:10:29 +0100 | [diff] [blame] | 198 | --weight:: |
| 199 | Enable weightened sampling. An additional weight is recorded per sample and can be |
| 200 | displayed with the weight and local_weight sort keys. This currently works for TSX |
| 201 | abort events and some memory events in precise mode on modern Intel CPUs. |
| 202 | |
Andi Kleen | 475eeab | 2013-09-20 07:40:43 -0700 | [diff] [blame] | 203 | --transaction:: |
| 204 | Record transaction flags for transaction related events. |
| 205 | |
Adrian Hunter | 3aa5939 | 2013-11-15 15:52:29 +0200 | [diff] [blame] | 206 | --per-thread:: |
| 207 | Use per-thread mmaps. By default per-cpu mmaps are created. This option |
| 208 | overrides that and uses per-thread mmaps. A side-effect of that is that |
| 209 | inheritance is automatically disabled. --per-thread is ignored with a warning |
| 210 | if combined with -a or -C options. |
Adrian Hunter | 539e6bb | 2013-11-01 15:51:34 +0200 | [diff] [blame] | 211 | |
Ingo Molnar | e33e0a4 | 2009-04-20 15:58:01 +0200 | [diff] [blame] | 212 | SEE ALSO |
| 213 | -------- |
Thomas Gleixner | 386b05e | 2009-06-06 14:56:33 +0200 | [diff] [blame] | 214 | linkperf:perf-stat[1], linkperf:perf-list[1] |