HOWTO for the linux packet generator
------------------------------------

Enable CONFIG_NET_PKTGEN to compile and build pktgen either in-kernel
or as a module.  A module is preferred; modprobe pktgen if needed.  Once
running, pktgen creates a thread for each CPU with affinity to that CPU.
Monitoring and control are done via /proc.  It is easiest to select a
suitable sample script and configure that.

On a dual CPU:

ps aux | grep pkt
root       129  0.3  0.0     0    0 ?        SW    2003 523:20 [pktgen/0]
root       130  0.3  0.0     0    0 ?        SW    2003 509:50 [pktgen/1]


For monitoring and control pktgen creates:
        /proc/net/pktgen/pgctrl
        /proc/net/pktgen/kpktgend_X
        /proc/net/pktgen/ethX

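A script can check that the control files listed above exist before it
tries to configure anything.  The following is a minimal sketch; the
helper name pktgen_ready and the PKTGEN_PROC override are assumptions
made for illustration, only the /proc/net/pktgen/pgctrl path comes from
this document.

```shell
# pktgen_ready: succeed if the global pktgen control file exists.
# PKTGEN_PROC is a hypothetical override (handy for dry runs); the
# default is the directory pktgen itself creates.
pktgen_ready() {
    local dir="${PKTGEN_PROC:-/proc/net/pktgen}"
    [ -e "$dir/pgctrl" ]
}
```

Typical use would be "pktgen_ready || modprobe pktgen" at the top of a
test script, run as root.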

Tuning NIC for max performance
==============================

The default NIC settings are (likely) not tuned for pktgen's artificial
overload type of benchmarking, as this could hurt the normal use-case.

Specifically, increase the TX ring buffer in the NIC:
 # ethtool -G ethX tx 1024

A larger TX ring can improve pktgen's performance, while it can hurt
in the general case, 1) because the TX ring buffer might get larger
than the CPU's L1/L2 cache, 2) because it allows more queueing in the
NIC HW layer (which is bad for bufferbloat).

One should hesitate to conclude that packets/descriptors in the HW
TX ring cause delay.  Drivers usually delay cleaning up the
ring-buffers for various performance reasons, and packets stalling
the TX ring might just be waiting for cleanup.

This cleanup issue is specifically the case for the driver ixgbe
(Intel 82599 chip).  This driver (ixgbe) combines TX+RX ring cleanups,
and the cleanup interval is affected by the ethtool --coalesce setting
of parameter "rx-usecs".

For ixgbe use e.g. "30" resulting in approx 33K interrupts/sec (1/30*10^6):
 # ethtool -C ethX rx-usecs 30

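The interrupt rate quoted above is just the reciprocal of the
coalescing interval.  A small sketch of the arithmetic follows; the
helper name usecs_to_irq_rate is hypothetical and not part of ethtool,
and the result is only an approximate upper bound on the real rate.

```shell
# usecs_to_irq_rate: interrupts per second for a given rx-usecs value.
# At most one interrupt per rx-usecs microseconds gives 10^6 / rx-usecs.
usecs_to_irq_rate() {
    echo $(( 1000000 / $1 ))
}

usecs_to_irq_rate 30    # prints 33333
```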

Viewing threads
===============
/proc/net/pktgen/kpktgend_0
Running:
Stopped: eth1
Result: OK: add_device=eth1

Most important are the devices assigned to the thread.  Note that a
device can only belong to one thread.

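To list which devices a thread owns, the Running: and Stopped: lines
of a thread file can be picked apart with awk.  This is a sketch that
assumes the file layout shown in the example above; thread_devices is
a hypothetical helper name.

```shell
# thread_devices FILE: print "<state> <device>" for every device named
# on the Running: and Stopped: lines of a kpktgend_X file.
thread_devices() {
    awk '$1 == "Running:" || $1 == "Stopped:" {
             for (i = 2; i <= NF; i++) print $1, $i
         }' "$1"
}
```

For the example above, "thread_devices /proc/net/pktgen/kpktgend_0"
would print "Stopped: eth1".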

Viewing devices
===============

The Params section holds configured information.  The Current section
holds running statistics.  The Result is printed after a run or after
interruption.  Example:

/proc/net/pktgen/eth1

Params: count 10000000  min_pkt_size: 60  max_pkt_size: 60
     frags: 0  delay: 0  clone_skb: 1000000  ifname: eth1
     flows: 0 flowlen: 0
     dst_min: 10.10.11.2  dst_max:
     src_min:   src_max:
     src_mac: 00:00:00:00:00:00  dst_mac: 00:04:23:AC:FD:82
     udp_src_min: 9  udp_src_max: 9  udp_dst_min: 9  udp_dst_max: 9
     src_mac_count: 0  dst_mac_count: 0
     Flags:
Current:
     pkts-sofar: 10000000  errors: 39664
     started: 1103053986245187us  stopped: 1103053999346329us idle: 880401us
     seq_num: 10000011  cur_dst_mac_offset: 0  cur_src_mac_offset: 0
     cur_saddr: 0x10a0a0a  cur_daddr: 0x20b0a0a
     cur_udp_dst: 9  cur_udp_src: 9
     flows: 0
Result: OK: 13101142(c12220741+d880401) usec, 10000000 (60byte,0frags)
  763292pps 390Mb/sec (390805504bps) errors: 39664

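The figures on the Result line can be cross-checked from the Current
section: the elapsed time is stopped minus started, and the packet
rate is the packet count divided by that time.  The helper names below
are hypothetical; the numbers are the ones from the example output.

```shell
# elapsed_usec START STOP: run time in microseconds.
elapsed_usec() {
    echo $(( $2 - $1 ))
}

# result_pps PACKETS USEC: packets per second, using integer division.
result_pps() {
    echo $(( $1 * 1000000 / $2 ))
}

elapsed_usec 1103053986245187 1103053999346329    # prints 13101142
result_pps 10000000 13101142                      # prints 763292
```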

Configuring threads and devices
===============================
This is done via the /proc interface, and most easily done via pgset
as defined in the sample scripts.

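For readers without the sample scripts at hand, a pgset helper can
look roughly like the sketch below.  It is not a copy of the version in
the samples; it assumes that the target /proc file is selected through
a PGDEV variable and that pktgen reports the outcome of the last
command on a "Result:" line, as in the outputs shown earlier.

```shell
# pgset COMMAND: write one command to the pktgen file named by PGDEV.
# PGDEV must be set by the caller, e.g. PGDEV=/proc/net/pktgen/eth1.
pgset() {
    echo "$1" > "$PGDEV" || return 1
    # Show the Result: line and fail unless pktgen answered OK.
    if ! grep -q "Result: OK:" "$PGDEV"; then
        grep "Result:" "$PGDEV" >&2
        return 1
    fi
}
```

With PGDEV pointing at a device file, a call such as
pgset "count 10000" then configures that device.  Writing to the /proc
files requires root.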

Examples:

 pgset "clone_skb 1"     sets the number of copies of the same packet
 pgset "clone_skb 0"     use single SKB for all transmits
 pgset "burst 8"         uses xmit_more API to queue 8 copies of the same
                         packet and update HW tx queue tail pointer once.
                         "burst 1" is the default
 pgset "pkt_size 9014"   sets packet size to 9014
 pgset "frags 5"         packet will consist of 5 fragments
 pgset "count 200000"    sets number of packets to send, set to zero
                         for continuous sends until explicitly stopped.

 pgset "delay 5000"      adds delay to hard_start_xmit(), in nanoseconds

 pgset "dst 10.0.0.1"    sets IP destination address
                         (BEWARE! This generator is very aggressive!)

 pgset "dst_min 10.0.0.1"            Same as dst
 pgset "dst_max 10.0.0.254"          Set the maximum destination IP.
 pgset "src_min 10.0.0.1"            Set the minimum (or only) source IP.
 pgset "src_max 10.0.0.254"          Set the maximum source IP.
 pgset "dst6 fec0::1"                IPV6 destination address
 pgset "src6 fec0::2"                IPV6 source address
 pgset "dst_mac 00:00:00:00:00:00"   sets MAC destination address
 pgset "src_mac 00:00:00:00:00:00"   sets MAC source address

 pgset "queue_map_min 0" Sets the min value of tx queue interval
 pgset "queue_map_max 7" Sets the max value of tx queue interval, for multiqueue devices
                         To select queue 1 of a given device,
                         use queue_map_min=1 and queue_map_max=1

 pgset "src_mac_count 1" Sets the number of MACs we'll range through.
                         The 'minimum' MAC is what you set with src_mac.

 pgset "dst_mac_count 1" Sets the number of MACs we'll range through.
                         The 'minimum' MAC is what you set with dst_mac.

 pgset "flag [name]"     Set a flag to determine behaviour.  Current flags
                         are: IPSRC_RND # IP source is random (between min/max)
                              IPDST_RND # IP destination is random
                              UDPSRC_RND, UDPDST_RND,
                              MACSRC_RND, MACDST_RND
                              TXSIZE_RND, IPV6,
                              MPLS_RND, VID_RND, SVID_RND
                              FLOW_SEQ,
                              QUEUE_MAP_RND # queue map random
                              QUEUE_MAP_CPU # queue map mirrors smp_processor_id()
                              UDPCSUM,
                              IPSEC # IPsec encapsulation (needs CONFIG_XFRM)
                              NODE_ALLOC # node specific memory allocation
                              NO_TIMESTAMP # disable timestamping

 pgset spi SPI_VALUE     Set specific SA used to transform packet.

 pgset "udp_src_min 9"   set UDP source port min. If < udp_src_max, then
                         cycle through the port range.

 pgset "udp_src_max 9"   set UDP source port max.
 pgset "udp_dst_min 9"   set UDP destination port min. If < udp_dst_max, then
                         cycle through the port range.
 pgset "udp_dst_max 9"   set UDP destination port max.

 pgset "mpls 0001000a,0002000a,0000000a" set MPLS labels (in this example
                                         outer label=16,middle label=32,
                                         inner label=0 (IPv4 NULL)) Note that
                                         there must be no spaces between the
                                         arguments. Leading zeros are required.
                                         Do not set the bottom of stack bit,
                                         that's done automatically. If you do
                                         set the bottom of stack bit, that
                                         indicates that you want to randomly
                                         generate that label and the flag
                                         MPLS_RND will be turned on. You
                                         can have any mix of random and fixed
                                         labels in the label stack.

 pgset "mpls 0"          turn off mpls (or any invalid argument works too!)

 pgset "vlan_id 77"      set VLAN ID 0-4095
 pgset "vlan_p 3"        set priority bit 0-7 (default 0)
 pgset "vlan_cfi 0"      set canonical format identifier 0-1 (default 0)

 pgset "svlan_id 22"     set SVLAN ID 0-4095
 pgset "svlan_p 3"       set priority bit 0-7 (default 0)
 pgset "svlan_cfi 0"     set canonical format identifier 0-1 (default 0)

 pgset "vlan_id 9999"    > 4095 remove vlan and svlan tags
 pgset "svlan_id 9999"   > 4095 remove svlan tag


 pgset "tos XX"           set former IPv4 TOS field (e.g. "tos 28" for AF11 no ECN, default 00)
 pgset "traffic_class XX" set former IPv6 TRAFFIC CLASS (e.g. "traffic_class B8" for EF no ECN, default 00)

 pgset stop               aborts injection. Also, ^C aborts generator.

 pgset "rate 300M"        set rate to 300 Mb/s
 pgset "ratep 1000000"    set rate to 1Mpps

 pgset "xmit_mode netif_receive"  RX inject into stack netif_receive_skb()
                                  Works with "burst" but not with "clone_skb".
                                  Default xmit_mode is "start_xmit".

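The hexadecimal arguments used in the mpls, tos and traffic_class
examples above can be derived rather than typed by hand.  The sketch
below assumes the standard MPLS label stack entry layout (20-bit label,
3-bit traffic class, bottom-of-stack bit, 8-bit TTL) and the usual
placement of the DSCP in the upper six bits of the TOS byte.  Both
helper names are hypothetical.

```shell
# mpls_entry LABEL TTL: one 8-digit hex label stack entry.
# Traffic class and the bottom-of-stack bit are left at zero; as noted
# above, pktgen sets the bottom-of-stack bit itself.
mpls_entry() {
    printf '%08x\n' $(( ($1 << 12) | $2 ))
}

# dscp_to_tos DSCP: two-digit hex TOS / traffic class byte, ECN bits zero.
dscp_to_tos() {
    printf '%02X\n' $(( $1 << 2 ))
}

mpls_entry 16 10    # prints 0001000a
mpls_entry 32 10    # prints 0002000a
dscp_to_tos 10      # AF11, prints 28
dscp_to_tos 46      # EF, prints B8
```

The printed values match the arguments in the examples, for instance
pgset "mpls 0001000a,0002000a,0000000a" and pgset "tos 28".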

Sample scripts
==============

A collection of small tutorial scripts for pktgen is in the
samples/pktgen directory:

pktgen.conf-1-1                  # 1 CPU 1 dev
pktgen.conf-1-2                  # 1 CPU 2 dev
pktgen.conf-2-1                  # 2 CPUs 1 dev
pktgen.conf-2-2                  # 2 CPUs 2 dev
pktgen.conf-1-1-rdos             # 1 CPU 1 dev w. route DoS
pktgen.conf-1-1-ip6              # 1 CPU 1 dev ipv6
pktgen.conf-1-1-ip6-rdos         # 1 CPU 1 dev ipv6 w. route DoS
pktgen.conf-1-1-flows            # 1 CPU 1 dev multiple flows.

Run in shell: ./pktgen.conf-X-Y
This does all the setup including sending.


Interrupt affinity
==================
Note that when adding devices to a specific CPU it is a good idea to
also assign /proc/irq/XX/smp_affinity so that the TX interrupts are bound
to the same CPU.  This reduces cache bouncing when freeing skbs.

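The value written to smp_affinity is a hexadecimal CPU bitmask in
which bit N selects CPU N.  A sketch of computing the mask for a single
CPU follows; cpu_to_affinity_mask is a hypothetical helper name, and the
simple form shown is only meant for low CPU numbers (large systems
split the mask into comma-separated 32-bit groups).

```shell
# cpu_to_affinity_mask CPU: hex bitmask with only that CPU's bit set.
cpu_to_affinity_mask() {
    printf '%x\n' $(( 1 << $1 ))
}

cpu_to_affinity_mask 0    # prints 1
cpu_to_affinity_mask 3    # prints 8
```

As root, the result would then be written to /proc/irq/XX/smp_affinity,
where XX is the interrupt number of the NIC's TX queue.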

Enable IPsec
============
Default IPsec transformation with ESP encapsulation plus transport mode
can be enabled by simply setting:

pgset "flag IPSEC"
pgset "flows 1"

To avoid breaking existing testbed scripts for using AH type and tunnel mode,
you can use "pgset spi SPI_VALUE" to specify which transformation mode
to employ.


Current commands and configuration options
==========================================

** Pgcontrol commands:

start
stop

** Thread commands:

add_device
rem_device_all


** Device commands:

count
clone_skb
debug

frags
delay

src_mac_count
dst_mac_count

pkt_size
min_pkt_size
max_pkt_size

mpls

udp_src_min
udp_src_max

udp_dst_min
udp_dst_max

flag
  IPSRC_RND
  IPDST_RND
  UDPSRC_RND
  UDPDST_RND
  MACSRC_RND
  MACDST_RND
  TXSIZE_RND
  IPV6
  MPLS_RND
  VID_RND
  SVID_RND
  FLOW_SEQ
  QUEUE_MAP_RND
  QUEUE_MAP_CPU
  UDPCSUM
  IPSEC
  NODE_ALLOC
  NO_TIMESTAMP

dst_min
dst_max

src_min
src_max

dst_mac
src_mac

clear_counters

dst6
src6

flows
flowlen

rate
ratep

xmit_mode <start_xmit|netif_receive>


References:
ftp://robur.slu.se/pub/Linux/net-development/pktgen-testing/
ftp://robur.slu.se/pub/Linux/net-development/pktgen-testing/examples/

Paper from Linux-Kongress in Erlangen 2004.
ftp://robur.slu.se/pub/Linux/net-development/pktgen-testing/pktgen_paper.pdf

Thanks to:
Grant Grundler for testing on IA-64 and parisc, Harald Welte, Lennert Buytenhek,
Stephen Hemminger, Andi Kleen, Dave Miller and many others.


Good luck with the linux net-development.