                  HOWTO for the linux packet generator
                  ------------------------------------

Enable CONFIG_NET_PKTGEN to compile and build pktgen either in-kernel
or as a module.  A module is preferred; modprobe pktgen if needed.  Once
running, pktgen creates a thread for each CPU with affinity to that CPU.
Monitoring and controlling is done via /proc.  It is easiest to select a
suitable sample script and configure that.

On a dual CPU:

ps aux | grep pkt
root       129  0.3  0.0     0    0 ?        SW    2003 523:20 [pktgen/0]
root       130  0.3  0.0     0    0 ?        SW    2003 509:50 [pktgen/1]


For monitoring and control pktgen creates:
    /proc/net/pktgen/pgctrl
    /proc/net/pktgen/kpktgend_X
    /proc/net/pktgen/ethX


Tuning NIC for max performance
==============================

The default NIC settings are (likely) not tuned for pktgen's artificial
overload type of benchmarking, as this could hurt the normal use-case.

Specifically, increase the TX ring buffer in the NIC:
 # ethtool -G ethX tx 1024

A larger TX ring can improve pktgen's performance, while it can hurt
in the general case, 1) because the TX ring buffer might get larger
than the CPU's L1/L2 cache, 2) because it allows more queueing in the
NIC HW layer (which is bad for bufferbloat).

One should hesitate to conclude that packets/descriptors in the HW
TX ring cause delay.  Drivers usually delay cleaning up the
ring-buffers for various performance reasons, and packets that seem
stalled in the TX ring might just be waiting for cleanup.

This cleanup issue is specifically the case for the driver ixgbe
(Intel 82599 chip).  This driver (ixgbe) combines TX+RX ring cleanups,
and the cleanup interval is affected by the ethtool --coalesce setting
of parameter "rx-usecs".

For ixgbe use e.g. "30" resulting in approx 33K interrupts/sec (10^6 / 30):
 # ethtool -C ethX rx-usecs 30
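
The interrupt rate quoted above is plain arithmetic on the rx-usecs
value.  A minimal sketch, assuming one interrupt per rx-usecs interval
(irq_rate is a hypothetical helper name, not part of ethtool or pktgen):

```shell
# Approximate interrupts per second for a given rx-usecs value.
# Assumes one interrupt per coalescing interval; irq_rate is a
# hypothetical helper, not an ethtool or pktgen command.
irq_rate() {
    usecs=$1
    # 10^6 microseconds per second divided by the interval length.
    echo $(( 1000000 / usecs ))
}

irq_rate 30     # prints 33333
```

Other candidate rx-usecs values can be compared the same way before
applying one with ethtool -C.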


Viewing threads
===============
/proc/net/pktgen/kpktgend_0
Running:
Stopped: eth1
Result: OK: add_device=eth1

Most important are the devices assigned to the thread.  Note that a
device can only belong to one thread.


Viewing devices
===============

The Params section holds configured information.  The Current section
holds running statistics.  The Result is printed after a run or after
interruption.  Example:

/proc/net/pktgen/eth1

Params: count 10000000  min_pkt_size: 60  max_pkt_size: 60
     frags: 0  delay: 0  clone_skb: 1000000  ifname: eth1
     flows: 0 flowlen: 0
     dst_min: 10.10.11.2  dst_max:
     src_min:   src_max:
     src_mac: 00:00:00:00:00:00  dst_mac: 00:04:23:AC:FD:82
     udp_src_min: 9  udp_src_max: 9  udp_dst_min: 9  udp_dst_max: 9
     src_mac_count: 0  dst_mac_count: 0
     Flags:
Current:
     pkts-sofar: 10000000  errors: 39664
     started: 1103053986245187us  stopped: 1103053999346329us idle: 880401us
     seq_num: 10000011  cur_dst_mac_offset: 0  cur_src_mac_offset: 0
     cur_saddr: 0x10a0a0a  cur_daddr: 0x20b0a0a
     cur_udp_dst: 9  cur_udp_src: 9
     flows: 0
Result: OK: 13101142(c12220741+d880401) usec, 10000000 (60byte,0frags)
  763292pps 390Mb/sec (390805504bps) errors: 39664
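
The Result figures can be cross-checked against the Current counters.
The sketch below redoes the arithmetic with shell integer math.  The
helper names are hypothetical, and the 4 extra bytes per packet are an
assumption (presumably the Ethernet CRC) chosen because it reproduces
the bps figure in the example:

```shell
# Elapsed time in usec from the "started:" and "stopped:" counters.
pg_elapsed() {
    echo $(( $2 - $1 ))
}

# Packets per second from a packet count and an elapsed time in usec.
pg_pps() {
    echo $(( $1 * 1000000 / $2 ))
}

# Bits per second from pps and packet size; the "+ 4" is an assumption
# that matches the example output, not something this document states.
pg_bps() {
    echo $(( $1 * 8 * ($2 + 4) ))
}

elapsed=$(pg_elapsed 1103053986245187 1103053999346329)
pps=$(pg_pps 10000000 "$elapsed")
bps=$(pg_bps "$pps" 60)
echo "elapsed=${elapsed}us pps=${pps} bps=${bps}"
# prints elapsed=13101142us pps=763292 bps=390805504
```

The elapsed time also equals the sum inside the parentheses on the
Result line: 12220741 + 880401 = 13101142 usec, where the "d" part
matches the idle counter.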

Configuring threads and devices
===============================
This is done via the /proc interface, and most easily done via pgset
as defined in the sample scripts.
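
For orientation, a helper of this kind might look roughly like the
sketch below.  This is not copied from the sample scripts; it assumes
that a variable named PGDEV holds the /proc file to write to and that
pktgen reports the outcome on a "Result:" line, as in the outputs shown
earlier.  Check the sample scripts for the actual definition.

```shell
# PGDEV selects the /proc file that receives commands.  The default
# below is only an illustration; set PGDEV before calling pgset.
PGDEV=${PGDEV:-/proc/net/pktgen/eth1}

pgset() {
    # Write one command string to the currently selected pktgen file.
    echo "$1" > "$PGDEV"
    # Stay quiet when the file reports "Result: OK:", otherwise show
    # whatever Result line is there (if any).
    if ! grep -q "Result: OK:" "$PGDEV"; then
        grep "Result:" "$PGDEV" || true
    fi
}
```

The whole command must be passed as one quoted argument, for example
pgset "count 200000", because only the first argument is written.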

Examples:

 pgset "clone_skb 1"     sets the number of copies of the same packet
 pgset "clone_skb 0"     use single SKB for all transmits
 pgset "burst 8"         uses xmit_more API to queue 8 copies of the same
                         packet and update HW tx queue tail pointer once.
                         "burst 1" is the default
 pgset "pkt_size 9014"   sets packet size to 9014
 pgset "frags 5"         packet will consist of 5 fragments
 pgset "count 200000"    sets number of packets to send, set to zero
                         for continuous sends until explicitly stopped.

 pgset "delay 5000"      adds delay to hard_start_xmit(), in nanoseconds

 pgset "dst 10.0.0.1"    sets IP destination address
                         (BEWARE! This generator is very aggressive!)

 pgset "dst_min 10.0.0.1"            Same as dst
 pgset "dst_max 10.0.0.254"          Set the maximum destination IP.
 pgset "src_min 10.0.0.1"            Set the minimum (or only) source IP.
 pgset "src_max 10.0.0.254"          Set the maximum source IP.
 pgset "dst6 fec0::1"                IPV6 destination address
 pgset "src6 fec0::2"                IPV6 source address
 pgset "dst_mac 00:00:00:00:00:00"   sets MAC destination address
 pgset "src_mac 00:00:00:00:00:00"   sets MAC source address

 pgset "queue_map_min 0" Sets the min value of tx queue interval
 pgset "queue_map_max 7" Sets the max value of tx queue interval, for
                         multiqueue devices.
                         To select queue 1 of a given device,
                         use queue_map_min=1 and queue_map_max=1

 pgset "src_mac_count 1" Sets the number of MACs we'll range through.
                         The 'minimum' MAC is what you set with src_mac.

 pgset "dst_mac_count 1" Sets the number of MACs we'll range through.
                         The 'minimum' MAC is what you set with dst_mac.

 pgset "flag [name]"     Set a flag to determine behaviour.  Current flags
                         are: IPSRC_RND # IP source is random (between min/max)
                              IPDST_RND # IP destination is random
                              UDPSRC_RND, UDPDST_RND,
                              MACSRC_RND, MACDST_RND
                              TXSIZE_RND, IPV6,
                              MPLS_RND, VID_RND, SVID_RND
                              FLOW_SEQ,
                              QUEUE_MAP_RND # queue map random
                              QUEUE_MAP_CPU # queue map mirrors smp_processor_id()
                              UDPCSUM,
                              IPSEC # IPsec encapsulation (needs CONFIG_XFRM)
                              NODE_ALLOC # node specific memory allocation
                              NO_TIMESTAMP # disable timestamping

 pgset "spi SPI_VALUE"   Set specific SA used to transform packet.

 pgset "udp_src_min 9"   set UDP source port min.  If < udp_src_max, then
                         cycle through the port range.

 pgset "udp_src_max 9"   set UDP source port max.
 pgset "udp_dst_min 9"   set UDP destination port min.  If < udp_dst_max, then
                         cycle through the port range.
 pgset "udp_dst_max 9"   set UDP destination port max.

 pgset "mpls 0001000a,0002000a,0000000a" set MPLS labels (in this example
                                         outer label=16,middle label=32,
                                         inner label=0 (IPv4 NULL)).  Note that
                                         there must be no spaces between the
                                         arguments.  Leading zeros are required.
                                         Do not set the bottom of stack bit,
                                         that's done automatically.  If you do
                                         set the bottom of stack bit, that
                                         indicates that you want to randomly
                                         generate that label and the flag
                                         MPLS_RND will be turned on.  You
                                         can have any mix of random and fixed
                                         labels in the label stack.

 pgset "mpls 0"          turn off mpls (or any invalid argument works too!)

 pgset "vlan_id 77"      set VLAN ID 0-4095
 pgset "vlan_p 3"        set priority bit 0-7 (default 0)
 pgset "vlan_cfi 0"      set canonical format identifier 0-1 (default 0)

 pgset "svlan_id 22"     set SVLAN ID 0-4095
 pgset "svlan_p 3"       set priority bit 0-7 (default 0)
 pgset "svlan_cfi 0"     set canonical format identifier 0-1 (default 0)

 pgset "vlan_id 9999"    > 4095 remove vlan and svlan tags
 pgset "svlan_id 9999"   > 4095 remove svlan tag


 pgset "tos XX"           set former IPv4 TOS field, in hex
                          (e.g. "tos 28" for AF11 no ECN, default 00)
 pgset "traffic_class XX" set former IPv6 TRAFFIC CLASS, in hex
                          (e.g. "traffic_class B8" for EF no ECN, default 00)

 pgset "stop"            aborts injection.  Also, ^C aborts generator.

 pgset "rate 300M"       set rate to 300 Mb/s
 pgset "ratep 1000000"   set rate to 1Mpps

 pgset "xmit_mode netif_receive"  RX inject into stack netif_receive_skb()
                                  Works with "burst" but not with "clone_skb".
                                  Default xmit_mode is "start_xmit".
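
The hex words in the mpls example can be built from label numbers.  The
sketch below assumes the usual MPLS label stack entry layout (label in
the upper 20 bits, TTL in the low 8 bits) and reads the trailing "0a"
in the example as a TTL of 10; both are assumptions, and mpls_entry is
a hypothetical helper name:

```shell
# Build one 8-digit hex word for the "mpls" command from a label number.
# Assumes label << 12 with the TTL in the low byte; the bottom of stack
# bit (bit 8) is left clear, as the text above requires.
mpls_entry() {
    label=$1
    ttl=${2:-10}
    printf '%08x\n' $(( (label << 12) | ttl ))
}

mpls_entry 16   # prints 0001000a
mpls_entry 32   # prints 0002000a
mpls_entry 0    # prints 0000000a
```

Joining the three words with commas and no spaces gives the argument
used in the mpls example.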

Sample scripts
==============

A collection of small tutorial scripts for pktgen is in the
samples/pktgen directory:

pktgen.conf-1-1                  # 1 CPU 1 dev
pktgen.conf-1-2                  # 1 CPU 2 dev
pktgen.conf-2-1                  # 2 CPUs 1 dev
pktgen.conf-2-2                  # 2 CPUs 2 dev
pktgen.conf-1-1-rdos             # 1 CPU 1 dev w. route DoS
pktgen.conf-1-1-ip6              # 1 CPU 1 dev ipv6
pktgen.conf-1-1-ip6-rdos         # 1 CPU 1 dev ipv6 w. route DoS
pktgen.conf-1-1-flows            # 1 CPU 1 dev multiple flows.

Run in shell: ./pktgen.conf-X-Y
This does all the setup including sending.
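
To show the overall shape of such a setup without touching /proc, the
sketch below only prints the writes a 1 CPU / 1 device run might make,
one "<proc file> <command>" pair per line.  It is not one of the sample
scripts; plan_run, its argument order and the chosen values are
hypothetical, while the commands are taken from the command list in
this document:

```shell
# Print (do not perform) the writes for a 1 CPU / 1 device run.
# plan_run is a hypothetical helper for illustration only.
plan_run() {
    dev=$1; dst=$2; dst_mac=$3
    base=/proc/net/pktgen
    # Thread setup: drop old devices, then bind the device to thread 0.
    echo "$base/kpktgend_0 rem_device_all"
    echo "$base/kpktgend_0 add_device $dev"
    # Device setup.
    echo "$base/$dev count 10000000"
    echo "$base/$dev clone_skb 1000000"
    echo "$base/$dev pkt_size 60"
    echo "$base/$dev delay 0"
    echo "$base/$dev dst $dst"
    echo "$base/$dev dst_mac $dst_mac"
    # Start transmission.
    echo "$base/pgctrl start"
}

plan_run eth1 10.10.11.2 00:04:23:AC:FD:82
```

Printing the plan first makes it easy to review what would be written.
Each line can then be applied by writing the command part to the file
part, as root and with pktgen loaded.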


Interrupt affinity
==================
Note that when adding devices to a specific CPU it is a good idea to
also assign /proc/irq/XX/smp_affinity so that the TX interrupts are bound
to the same CPU.  This reduces cache bouncing when freeing skbs.
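
smp_affinity takes a hexadecimal CPU bitmask.  A small sketch for
deriving the mask of a single CPU (cpu_mask is a hypothetical helper
name; the IRQ number XX still has to be looked up for the NIC in
question, e.g. in /proc/interrupts):

```shell
# Hex bitmask with only the bit for CPU number $1 set.
cpu_mask() {
    printf '%x\n' $(( 1 << $1 ))
}

cpu_mask 0      # prints 1
cpu_mask 1      # prints 2
cpu_mask 5      # prints 20

# Applying it needs root and the real IRQ number in place of XX:
#   echo "$(cpu_mask 1)" > /proc/irq/XX/smp_affinity
```

Using the same CPU number here as in the kpktgend_X thread that owns
the device keeps transmission and TX cleanup on one CPU.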

Enable IPsec
============
Default IPsec transformation with ESP encapsulation plus transport mode
can be enabled by simply setting:

pgset "flag IPSEC"
pgset "flows 1"

To use AH type or tunnel mode without breaking existing testbed scripts,
use pgset "spi SPI_VALUE" to specify which transformation mode to employ.


Current commands and configuration options
==========================================

** Pgcontrol commands:

start
stop

** Thread commands:

add_device
rem_device_all


** Device commands:

count
clone_skb
debug

frags
delay

src_mac_count
dst_mac_count

pkt_size
min_pkt_size
max_pkt_size

mpls

udp_src_min
udp_src_max

udp_dst_min
udp_dst_max

flag
  IPSRC_RND
  IPDST_RND
  UDPSRC_RND
  UDPDST_RND
  MACSRC_RND
  MACDST_RND
  TXSIZE_RND
  IPV6
  MPLS_RND
  VID_RND
  SVID_RND
  FLOW_SEQ
  QUEUE_MAP_RND
  QUEUE_MAP_CPU
  UDPCSUM
  IPSEC
  NODE_ALLOC
  NO_TIMESTAMP

dst_min
dst_max

src_min
src_max

dst_mac
src_mac

clear_counters

dst6
src6

flows
flowlen

rate
ratep

xmit_mode <start_xmit|netif_receive>


References:
ftp://robur.slu.se/pub/Linux/net-development/pktgen-testing/
ftp://robur.slu.se/pub/Linux/net-development/pktgen-testing/examples/

Paper from Linux-Kongress in Erlangen 2004.
ftp://robur.slu.se/pub/Linux/net-development/pktgen-testing/pktgen_paper.pdf

Thanks to:
Grant Grundler for testing on IA-64 and parisc, Harald Welte, Lennert Buytenhek,
Stephen Hemminger, Andi Kleen, Dave Miller and many others.


Good luck with the linux net-development.