blob: 26b7ee491df8912d070b5b9d33252e78f606e2ab [file] [log] [blame]
Linus Torvalds1da177e2005-04-16 15:20:36 -07001Documentation for /proc/sys/kernel/* kernel version 2.2.10
2 (c) 1998, 1999, Rik van Riel <riel@nl.linux.org>
Shen Feng760df932009-04-02 16:57:20 -07003 (c) 2009, Shen Feng<shen@cn.fujitsu.com>
Linus Torvalds1da177e2005-04-16 15:20:36 -07004
5For general info and legal blurb, please look in README.
6
7==============================================================
8
9This file contains documentation for the sysctl files in
10/proc/sys/kernel/ and is valid for Linux kernel version 2.2.
11
12The files in this directory can be used to tune and monitor
13miscellaneous and general things in the operation of the Linux
14kernel. Since some of the files _can_ be used to screw up your
15system, it is advisable to read both documentation and source
16before actually making adjustments.
17
18Currently, these files might (depending on your configuration)
19show up in /proc/sys/kernel:
Borislav Petkov807094c2011-07-23 10:39:29 -070020
Linus Torvalds1da177e2005-04-16 15:20:36 -070021- acct
Borislav Petkov807094c2011-07-23 10:39:29 -070022- acpi_video_flags
23- auto_msgmni
H. Peter Anvind75757a2009-12-11 14:23:44 -080024- bootloader_type [ X86 only ]
25- bootloader_version [ X86 only ]
Hans-Joachim Pichtc114728a2009-09-11 10:28:47 +020026- callhome [ S390 only ]
Dan Ballard73efc032011-10-31 17:11:20 -070027- cap_last_cap
Linus Torvalds1da177e2005-04-16 15:20:36 -070028- core_pattern
Neil Hormana2939802009-09-23 15:56:56 -070029- core_pipe_limit
Linus Torvalds1da177e2005-04-16 15:20:36 -070030- core_uses_pid
31- ctrl-alt-del
Dan Rosenbergeaf06b22010-11-11 14:05:18 -080032- dmesg_restrict
Linus Torvalds1da177e2005-04-16 15:20:36 -070033- domainname
34- hostname
35- hotplug
Dan Rosenberg455cd5a2011-01-12 16:59:41 -080036- kptr_restrict
Chuck Ebbert0741f4d2006-12-07 02:14:11 +010037- kstack_depth_to_print [ X86 only ]
Linus Torvalds1da177e2005-04-16 15:20:36 -070038- l2cr [ PPC only ]
Michael Opdenackerac76cff2008-02-13 15:03:32 -080039- modprobe ==> Documentation/debugging-modules.txt
Kees Cook3d433212009-04-02 15:49:29 -070040- modules_disabled
Stanislav Kinsbursky03f59562013-01-04 15:34:50 -080041- msg_next_id [ sysv ipc ]
Linus Torvalds1da177e2005-04-16 15:20:36 -070042- msgmax
43- msgmnb
44- msgmni
Shen Feng760df932009-04-02 16:57:20 -070045- nmi_watchdog
Linus Torvalds1da177e2005-04-16 15:20:36 -070046- osrelease
47- ostype
48- overflowgid
49- overflowuid
50- panic
Borislav Petkov807094c2011-07-23 10:39:29 -070051- panic_on_oops
52- panic_on_unrecovered_nmi
Mitsuo Hayasaka55af7792011-11-29 15:08:36 +090053- panic_on_stackoverflow
Linus Torvalds1da177e2005-04-16 15:20:36 -070054- pid_max
55- powersave-nap [ PPC only ]
56- printk
Borislav Petkov807094c2011-07-23 10:39:29 -070057- printk_delay
58- printk_ratelimit
59- printk_ratelimit_burst
Jiri Kosina1ec7fd52008-02-09 23:24:08 +010060- randomize_va_space
Linus Torvalds1da177e2005-04-16 15:20:36 -070061- real-root-dev ==> Documentation/initrd.txt
62- reboot-cmd [ SPARC only ]
63- rtsig-max
64- rtsig-nr
65- sem
Stanislav Kinsbursky03f59562013-01-04 15:34:50 -080066- sem_next_id [ sysv ipc ]
Linus Torvalds1da177e2005-04-16 15:20:36 -070067- sg-big-buff [ generic SCSI device (sg) ]
Stanislav Kinsbursky03f59562013-01-04 15:34:50 -080068- shm_next_id [ sysv ipc ]
Vasiliy Kulikovb34a6b12011-07-26 16:08:48 -070069- shm_rmid_forced
Linus Torvalds1da177e2005-04-16 15:20:36 -070070- shmall
71- shmmax [ sysv ipc ]
72- shmmni
73- stop-a [ SPARC only ]
74- sysrq ==> Documentation/sysrq.txt
75- tainted
76- threads-max
Shen Feng760df932009-04-02 16:57:20 -070077- unknown_nmi_panic
Li Zefan08825c92013-05-17 10:31:20 +080078- watchdog_thresh
Linus Torvalds1da177e2005-04-16 15:20:36 -070079- version
80
81==============================================================
82
83acct:
84
85highwater lowwater frequency
86
87If BSD-style process accounting is enabled these values control
88its behaviour. If free space on filesystem where the log lives
89goes below <lowwater>% accounting suspends. If free space gets
90above <highwater>% accounting resumes. <Frequency> determines
91how often do we check the amount of free space (value is in
92seconds). Default:
934 2 30
94That is, suspend accounting if there left <= 2% free; resume it
95if we got >=4%; consider information about amount of free space
96valid for 30 seconds.
97
98==============================================================
99
Borislav Petkov807094c2011-07-23 10:39:29 -0700100acpi_video_flags:
101
102flags
103
104See Doc*/kernel/power/video.txt, it allows mode of video boot to be
105set during run time.
106
107==============================================================
108
109auto_msgmni:
110
111Enables/Disables automatic recomputing of msgmni upon memory add/remove
112or upon ipc namespace creation/removal (see the msgmni description
113above). Echoing "1" into this file enables msgmni automatic recomputing.
114Echoing "0" turns it off. auto_msgmni default value is 1.
115
116
117==============================================================
118
H. Peter Anvind75757a2009-12-11 14:23:44 -0800119bootloader_type:
120
121x86 bootloader identification
122
123This gives the bootloader type number as indicated by the bootloader,
124shifted left by 4, and OR'd with the low four bits of the bootloader
125version. The reason for this encoding is that this used to match the
126type_of_loader field in the kernel header; the encoding is kept for
127backwards compatibility. That is, if the full bootloader type number
128is 0x15 and the full version number is 0x234, this file will contain
129the value 340 = 0x154.
130
131See the type_of_loader and ext_loader_type fields in
132Documentation/x86/boot.txt for additional information.
133
134==============================================================
135
136bootloader_version:
137
138x86 bootloader version
139
140The complete bootloader version number. In the example above, this
141file will contain the value 564 = 0x234.
142
143See the type_of_loader and ext_loader_ver fields in
144Documentation/x86/boot.txt for additional information.
145
146==============================================================
147
Hans-Joachim Pichtc114728a2009-09-11 10:28:47 +0200148callhome:
149
150Controls the kernel's callhome behavior in case of a kernel panic.
151
152The s390 hardware allows an operating system to send a notification
153to a service organization (callhome) in case of an operating system panic.
154
155When the value in this file is 0 (which is the default behavior)
156nothing happens in case of a kernel panic. If this value is set to "1"
157the complete kernel oops message is send to the IBM customer service
158organization in case the mainframe the Linux operating system is running
159on has a service contract with IBM.
160
161==============================================================
162
Dan Ballard73efc032011-10-31 17:11:20 -0700163cap_last_cap
164
165Highest valid capability of the running kernel. Exports
166CAP_LAST_CAP from the kernel.
167
168==============================================================
169
Linus Torvalds1da177e2005-04-16 15:20:36 -0700170core_pattern:
171
172core_pattern is used to specify a core dumpfile pattern name.
Matthias Urlichscd081042006-10-11 01:21:57 -0700173. max length 128 characters; default value is "core"
Linus Torvalds1da177e2005-04-16 15:20:36 -0700174. core_pattern is used as a pattern template for the output filename;
175 certain string patterns (beginning with '%') are substituted with
176 their actual values.
177. backward compatibility with core_uses_pid:
178 If core_pattern does not include "%p" (default does not)
179 and core_uses_pid is set, then .PID will be appended to
180 the filename.
181. corename format specifiers:
182 %<NUL> '%' is dropped
183 %% output one '%'
184 %p pid
Stéphane Graber65aafb12013-09-11 14:24:32 -0700185 %P global pid (init PID namespace)
Linus Torvalds1da177e2005-04-16 15:20:36 -0700186 %u uid
187 %g gid
Oleg Nesterov12a2b4b2012-10-04 17:15:25 -0700188 %d dump mode, matches PR_SET_DUMPABLE and
189 /proc/sys/fs/suid_dumpable
Linus Torvalds1da177e2005-04-16 15:20:36 -0700190 %s signal number
191 %t UNIX time of dump
192 %h hostname
Jiri Slaby57cc0832011-05-26 16:25:46 -0700193 %e executable filename (may be shortened)
194 %E executable path
Linus Torvalds1da177e2005-04-16 15:20:36 -0700195 %<OTHER> both are dropped
Matthias Urlichscd081042006-10-11 01:21:57 -0700196. If the first character of the pattern is a '|', the kernel will treat
197 the rest of the pattern as a command to run. The core dump will be
198 written to the standard input of that program instead of to a file.
Linus Torvalds1da177e2005-04-16 15:20:36 -0700199
200==============================================================
201
Neil Hormana2939802009-09-23 15:56:56 -0700202core_pipe_limit:
203
Borislav Petkov807094c2011-07-23 10:39:29 -0700204This sysctl is only applicable when core_pattern is configured to pipe
205core files to a user space helper (when the first character of
206core_pattern is a '|', see above). When collecting cores via a pipe
207to an application, it is occasionally useful for the collecting
208application to gather data about the crashing process from its
209/proc/pid directory. In order to do this safely, the kernel must wait
210for the collecting process to exit, so as not to remove the crashing
211processes proc files prematurely. This in turn creates the
212possibility that a misbehaving userspace collecting process can block
213the reaping of a crashed process simply by never exiting. This sysctl
214defends against that. It defines how many concurrent crashing
215processes may be piped to user space applications in parallel. If
216this value is exceeded, then those crashing processes above that value
217are noted via the kernel log and their cores are skipped. 0 is a
218special value, indicating that unlimited processes may be captured in
219parallel, but that no waiting will take place (i.e. the collecting
220process is not guaranteed access to /proc/<crashing pid>/). This
221value defaults to 0.
Neil Hormana2939802009-09-23 15:56:56 -0700222
223==============================================================
224
Linus Torvalds1da177e2005-04-16 15:20:36 -0700225core_uses_pid:
226
227The default coredump filename is "core". By setting
228core_uses_pid to 1, the coredump filename becomes core.PID.
229If core_pattern does not include "%p" (default does not)
230and core_uses_pid is set, then .PID will be appended to
231the filename.
232
233==============================================================
234
235ctrl-alt-del:
236
237When the value in this file is 0, ctrl-alt-del is trapped and
238sent to the init(1) program to handle a graceful restart.
239When, however, the value is > 0, Linux's reaction to a Vulcan
240Nerve Pinch (tm) will be an immediate reboot, without even
241syncing its dirty buffers.
242
243Note: when a program (like dosemu) has the keyboard in 'raw'
244mode, the ctrl-alt-del is intercepted by the program before it
245ever reaches the kernel tty layer, and it's up to the program
246to decide what to do with it.
247
248==============================================================
249
Dan Rosenbergeaf06b22010-11-11 14:05:18 -0800250dmesg_restrict:
251
Borislav Petkov807094c2011-07-23 10:39:29 -0700252This toggle indicates whether unprivileged users are prevented
253from using dmesg(8) to view messages from the kernel's log buffer.
254When dmesg_restrict is set to (0) there are no restrictions. When
Serge E. Hallyn38ef4c22010-12-08 15:19:01 +0000255dmesg_restrict is set set to (1), users must have CAP_SYSLOG to use
Dan Rosenbergeaf06b22010-11-11 14:05:18 -0800256dmesg(8).
257
Borislav Petkov807094c2011-07-23 10:39:29 -0700258The kernel config option CONFIG_SECURITY_DMESG_RESTRICT sets the
259default value of dmesg_restrict.
Dan Rosenbergeaf06b22010-11-11 14:05:18 -0800260
261==============================================================
262
Linus Torvalds1da177e2005-04-16 15:20:36 -0700263domainname & hostname:
264
265These files can be used to set the NIS/YP domainname and the
266hostname of your box in exactly the same way as the commands
267domainname and hostname, i.e.:
268# echo "darkstar" > /proc/sys/kernel/hostname
269# echo "mydomain" > /proc/sys/kernel/domainname
270has the same effect as
271# hostname "darkstar"
272# domainname "mydomain"
273
274Note, however, that the classic darkstar.frop.org has the
275hostname "darkstar" and DNS (Internet Domain Name Server)
276domainname "frop.org", not to be confused with the NIS (Network
277Information Service) or YP (Yellow Pages) domainname. These two
278domain names are in general different. For a detailed discussion
279see the hostname(1) man page.
280
281==============================================================
282
283hotplug:
284
285Path for the hotplug policy agent.
286Default value is "/sbin/hotplug".
287
288==============================================================
289
Dan Rosenberg455cd5a2011-01-12 16:59:41 -0800290kptr_restrict:
291
292This toggle indicates whether restrictions are placed on
Ryan Mallon312b4e22013-11-12 15:08:51 -0800293exposing kernel addresses via /proc and other interfaces.
294
295When kptr_restrict is set to (0), the default, there are no restrictions.
296
297When kptr_restrict is set to (1), kernel pointers printed using the %pK
298format specifier will be replaced with 0's unless the user has CAP_SYSLOG
299and effective user and group ids are equal to the real ids. This is
300because %pK checks are done at read() time rather than open() time, so
301if permissions are elevated between the open() and the read() (e.g via
302a setuid binary) then %pK will not leak kernel pointers to unprivileged
303users. Note, this is a temporary solution only. The correct long-term
304solution is to do the permission checks at open() time. Consider removing
305world read permissions from files that use %pK, and using dmesg_restrict
306to protect against uses of %pK in dmesg(8) if leaking kernel pointer
307values to unprivileged users is a concern.
308
309When kptr_restrict is set to (2), kernel pointers printed using
310%pK will be replaced with 0's regardless of privileges.
Dan Rosenberg455cd5a2011-01-12 16:59:41 -0800311
312==============================================================
313
Chuck Ebbert0741f4d2006-12-07 02:14:11 +0100314kstack_depth_to_print: (X86 only)
315
316Controls the number of words to print when dumping the raw
317kernel stack.
318
319==============================================================
320
Borislav Petkov807094c2011-07-23 10:39:29 -0700321l2cr: (PPC only)
322
323This flag controls the L2 cache of G3 processor boards. If
3240, the cache is disabled. Enabled if nonzero.
325
326==============================================================
327
Kees Cook3d433212009-04-02 15:49:29 -0700328modules_disabled:
329
330A toggle value indicating if modules are allowed to be loaded
331in an otherwise modular kernel. This toggle defaults to off
332(0), but can be set true (1). Once true, modules can be
333neither loaded nor unloaded, and the toggle cannot be set back
334to false.
335
336==============================================================
337
Stanislav Kinsbursky03f59562013-01-04 15:34:50 -0800338msg_next_id, sem_next_id, and shm_next_id:
339
340These three toggles allows to specify desired id for next allocated IPC
341object: message, semaphore or shared memory respectively.
342
343By default they are equal to -1, which means generic allocation logic.
344Possible values to set are in range {0..INT_MAX}.
345
346Notes:
3471) kernel doesn't guarantee, that new object will have desired id. So,
348it's up to userspace, how to handle an object with "wrong" id.
3492) Toggle with non-default value will be set back to -1 by kernel after
350successful IPC object allocation.
351
352==============================================================
353
Borislav Petkov807094c2011-07-23 10:39:29 -0700354nmi_watchdog:
355
356Enables/Disables the NMI watchdog on x86 systems. When the value is
357non-zero the NMI watchdog is enabled and will continuously test all
358online cpus to determine whether or not they are still functioning
359properly. Currently, passing "nmi_watchdog=" parameter at boot time is
360required for this function to work.
361
362If LAPIC NMI watchdog method is in use (nmi_watchdog=2 kernel
363parameter), the NMI watchdog shares registers with oprofile. By
364disabling the NMI watchdog, oprofile may have more registers to
365utilize.
366
367==============================================================
368
Mel Gorman10fc05d2013-10-07 11:28:40 +0100369numa_balancing
370
371Enables/disables automatic page fault based NUMA memory
372balancing. Memory is moved automatically to nodes
373that access it often.
374
375Enables/disables automatic NUMA memory balancing. On NUMA machines, there
376is a performance penalty if remote memory is accessed by a CPU. When this
377feature is enabled the kernel samples what task thread is accessing memory
378by periodically unmapping pages and later trapping a page fault. At the
379time of the page fault, it is determined if the data being accessed should
380be migrated to a local memory node.
381
382The unmapping of pages and trapping faults incur additional overhead that
383ideally is offset by improved memory locality but there is no universal
384guarantee. If the target workload is already bound to NUMA nodes then this
385feature should be disabled. Otherwise, if the system overhead from the
386feature is too high then the rate the kernel samples for NUMA hinting
387faults may be controlled by the numa_balancing_scan_period_min_ms,
Mel Gorman930aa172013-10-07 11:29:37 +0100388numa_balancing_scan_delay_ms, numa_balancing_scan_period_max_ms,
Rik van Rielde1c9ce62013-10-07 11:29:39 +0100389numa_balancing_scan_size_mb, numa_balancing_settle_count sysctls and
390numa_balancing_migrate_deferred.
Mel Gorman10fc05d2013-10-07 11:28:40 +0100391
392==============================================================
393
394numa_balancing_scan_period_min_ms, numa_balancing_scan_delay_ms,
Mel Gorman930aa172013-10-07 11:29:37 +0100395numa_balancing_scan_period_max_ms, numa_balancing_scan_size_mb
Mel Gorman10fc05d2013-10-07 11:28:40 +0100396
397Automatic NUMA balancing scans tasks address space and unmaps pages to
398detect if pages are properly placed or if the data should be migrated to a
399memory node local to where the task is running. Every "scan delay" the task
400scans the next "scan size" number of pages in its address space. When the
401end of the address space is reached the scanner restarts from the beginning.
402
403In combination, the "scan delay" and "scan size" determine the scan rate.
404When "scan delay" decreases, the scan rate increases. The scan delay and
405hence the scan rate of every task is adaptive and depends on historical
406behaviour. If pages are properly placed then the scan delay increases,
407otherwise the scan delay decreases. The "scan size" is not adaptive but
408the higher the "scan size", the higher the scan rate.
409
410Higher scan rates incur higher system overhead as page faults must be
411trapped and potentially data must be migrated. However, the higher the scan
412rate, the more quickly a tasks memory is migrated to a local node if the
413workload pattern changes and minimises performance impact due to remote
414memory accesses. These sysctls control the thresholds for scan delays and
415the number of pages scanned.
416
Mel Gorman598f0ec2013-10-07 11:28:55 +0100417numa_balancing_scan_period_min_ms is the minimum time in milliseconds to
418scan a tasks virtual memory. It effectively controls the maximum scanning
419rate for each task.
Mel Gorman10fc05d2013-10-07 11:28:40 +0100420
421numa_balancing_scan_delay_ms is the starting "scan delay" used for a task
422when it initially forks.
423
Mel Gorman598f0ec2013-10-07 11:28:55 +0100424numa_balancing_scan_period_max_ms is the maximum time in milliseconds to
425scan a tasks virtual memory. It effectively controls the minimum scanning
426rate for each task.
Mel Gorman10fc05d2013-10-07 11:28:40 +0100427
428numa_balancing_scan_size_mb is how many megabytes worth of pages are
429scanned for a given scan.
430
Mel Gorman3a7053b2013-10-07 11:29:00 +0100431numa_balancing_settle_count is how many scan periods must complete before
432the schedule balancer stops pushing the task towards a preferred node. This
433gives the scheduler a chance to place the task on an alternative node if the
434preferred node is overloaded.
435
Rik van Rielde1c9ce62013-10-07 11:29:39 +0100436numa_balancing_migrate_deferred is how many page migrations get skipped
437unconditionally, after a page migration is skipped because a page is shared
438with other tasks. This reduces page migration overhead, and determines
439how much stronger the "move task near its memory" policy scheduler becomes,
440versus the "move memory near its task" memory management policy, for workloads
441with shared memory.
442
Mel Gorman10fc05d2013-10-07 11:28:40 +0100443==============================================================
444
Linus Torvalds1da177e2005-04-16 15:20:36 -0700445osrelease, ostype & version:
446
447# cat osrelease
4482.1.88
449# cat ostype
450Linux
451# cat version
452#5 Wed Feb 25 21:49:24 MET 1998
453
454The files osrelease and ostype should be clear enough. Version
455needs a little more clarification however. The '#5' means that
456this is the fifth kernel built from this source base and the
457date behind it indicates the time the kernel was built.
458The only way to tune these values is to rebuild the kernel :-)
459
460==============================================================
461
462overflowgid & overflowuid:
463
Borislav Petkov807094c2011-07-23 10:39:29 -0700464if your architecture did not always support 32-bit UIDs (i.e. arm,
465i386, m68k, sh, and sparc32), a fixed UID and GID will be returned to
466applications that use the old 16-bit UID/GID system calls, if the
467actual UID or GID would exceed 65535.
Linus Torvalds1da177e2005-04-16 15:20:36 -0700468
469These sysctls allow you to change the value of the fixed UID and GID.
470The default is 65534.
471
472==============================================================
473
474panic:
475
Borislav Petkov807094c2011-07-23 10:39:29 -0700476The value in this file represents the number of seconds the kernel
477waits before rebooting on a panic. When you use the software watchdog,
478the recommended setting is 60.
479
480==============================================================
481
482panic_on_unrecovered_nmi:
483
484The default Linux behaviour on an NMI of either memory or unknown is
485to continue operation. For many environments such as scientific
486computing it is preferable that the box is taken out and the error
487dealt with than an uncorrected parity/ECC error get propagated.
488
489A small number of systems do generate NMI's for bizarre random reasons
490such as power management so the default is off. That sysctl works like
491the existing panic controls already in that directory.
Linus Torvalds1da177e2005-04-16 15:20:36 -0700492
493==============================================================
494
495panic_on_oops:
496
497Controls the kernel's behaviour when an oops or BUG is encountered.
498
4990: try to continue operation
500
Matt LaPlantea982ac02007-05-09 07:35:06 +02005011: panic immediately. If the `panic' sysctl is also non-zero then the
Maxime Bizon8b23d04d2006-08-05 12:14:32 -0700502 machine will be rebooted.
Linus Torvalds1da177e2005-04-16 15:20:36 -0700503
504==============================================================
505
Mitsuo Hayasaka55af7792011-11-29 15:08:36 +0900506panic_on_stackoverflow:
507
508Controls the kernel's behavior when detecting the overflows of
509kernel, IRQ and exception stacks except a user stack.
510This file shows up if CONFIG_DEBUG_STACKOVERFLOW is enabled.
511
5120: try to continue operation.
513
5141: panic immediately.
515
516==============================================================
517
Dave Hansen14c63f12013-06-21 08:51:36 -0700518perf_cpu_time_max_percent:
519
520Hints to the kernel how much CPU time it should be allowed to
521use to handle perf sampling events. If the perf subsystem
522is informed that its samples are exceeding this limit, it
523will drop its sampling frequency to attempt to reduce its CPU
524usage.
525
526Some perf sampling happens in NMIs. If these samples
527unexpectedly take too long to execute, the NMIs can become
528stacked up next to each other so much that nothing else is
529allowed to execute.
530
5310: disable the mechanism. Do not monitor or correct perf's
532 sampling rate no matter how CPU time it takes.
533
5341-100: attempt to throttle perf's sample rate to this
535 percentage of CPU. Note: the kernel calculates an
536 "expected" length of each sample event. 100 here means
537 100% of that expected length. Even if this is set to
538 100, you may still see sample throttling if this
539 length is exceeded. Set to 0 if you truly do not care
540 how much CPU is consumed.
541
542==============================================================
543
Mitsuo Hayasaka55af7792011-11-29 15:08:36 +0900544
Linus Torvalds1da177e2005-04-16 15:20:36 -0700545pid_max:
546
Robert P. J. Daybeb7dd82007-05-09 07:14:03 +0200547PID allocation wrap value. When the kernel's next PID value
Linus Torvalds1da177e2005-04-16 15:20:36 -0700548reaches this value, it wraps back to a minimum PID value.
549PIDs of value pid_max or larger are not allocated.
550
551==============================================================
552
Pavel Emelyanovb8f566b2012-01-12 17:20:27 -0800553ns_last_pid:
554
555The last pid allocated in the current (the one task using this sysctl
556lives in) pid namespace. When selecting a pid for a next task on fork
557kernel tries to allocate a number starting from this one.
558
559==============================================================
560
Linus Torvalds1da177e2005-04-16 15:20:36 -0700561powersave-nap: (PPC only)
562
563If set, Linux-PPC will use the 'nap' mode of powersaving,
564otherwise the 'doze' mode will be used.
565
566==============================================================
567
568printk:
569
570The four values in printk denote: console_loglevel,
571default_message_loglevel, minimum_console_loglevel and
572default_console_loglevel respectively.
573
574These values influence printk() behavior when printing or
575logging error messages. See 'man 2 syslog' for more info on
576the different loglevels.
577
578- console_loglevel: messages with a higher priority than
579 this will be printed to the console
Paul Bolle87889e12011-02-06 21:00:41 +0100580- default_message_loglevel: messages without an explicit priority
Linus Torvalds1da177e2005-04-16 15:20:36 -0700581 will be printed with this priority
582- minimum_console_loglevel: minimum (highest) value to which
583 console_loglevel can be set
584- default_console_loglevel: default value for console_loglevel
585
586==============================================================
587
Borislav Petkov807094c2011-07-23 10:39:29 -0700588printk_delay:
589
590Delay each printk message in printk_delay milliseconds
591
592Value from 0 - 10000 is allowed.
593
594==============================================================
595
Linus Torvalds1da177e2005-04-16 15:20:36 -0700596printk_ratelimit:
597
598Some warning messages are rate limited. printk_ratelimit specifies
599the minimum length of time between these messages (in jiffies), by
600default we allow one every 5 seconds.
601
602A value of 0 will disable rate limiting.
603
604==============================================================
605
606printk_ratelimit_burst:
607
608While long term we enforce one message per printk_ratelimit
609seconds, we do allow a burst of messages to pass through.
610printk_ratelimit_burst specifies the number of messages we can
611send before ratelimiting kicks in.
612
613==============================================================
614
Borislav Petkov807094c2011-07-23 10:39:29 -0700615randomize_va_space:
Jiri Kosina1ec7fd52008-02-09 23:24:08 +0100616
617This option can be used to select the type of process address
618space randomization that is used in the system, for architectures
619that support this feature.
620
Horst Schirmeierb7f5ab62009-07-03 14:20:17 +02006210 - Turn the process address space randomization off. This is the
622 default for architectures that do not support this feature anyways,
623 and kernels that are booted with the "norandmaps" parameter.
Jiri Kosina1ec7fd52008-02-09 23:24:08 +0100624
6251 - Make the addresses of mmap base, stack and VDSO page randomized.
626 This, among other things, implies that shared libraries will be
Horst Schirmeierb7f5ab62009-07-03 14:20:17 +0200627 loaded to random addresses. Also for PIE-linked binaries, the
628 location of code start is randomized. This is the default if the
629 CONFIG_COMPAT_BRK option is enabled.
Jiri Kosina1ec7fd52008-02-09 23:24:08 +0100630
Horst Schirmeierb7f5ab62009-07-03 14:20:17 +02006312 - Additionally enable heap randomization. This is the default if
632 CONFIG_COMPAT_BRK is disabled.
633
634 There are a few legacy applications out there (such as some ancient
Jiri Kosina1ec7fd52008-02-09 23:24:08 +0100635 versions of libc.so.5 from 1996) that assume that brk area starts
Horst Schirmeierb7f5ab62009-07-03 14:20:17 +0200636 just after the end of the code+bss. These applications break when
637 start of the brk area is randomized. There are however no known
Jiri Kosina1ec7fd52008-02-09 23:24:08 +0100638 non-legacy applications that would be broken this way, so for most
Horst Schirmeierb7f5ab62009-07-03 14:20:17 +0200639 systems it is safe to choose full randomization.
640
641 Systems with ancient and/or broken binaries should be configured
642 with CONFIG_COMPAT_BRK enabled, which excludes the heap from process
643 address space randomization.
Jiri Kosina1ec7fd52008-02-09 23:24:08 +0100644
645==============================================================
646
Linus Torvalds1da177e2005-04-16 15:20:36 -0700647reboot-cmd: (Sparc only)
648
649??? This seems to be a way to give an argument to the Sparc
650ROM/Flash boot loader. Maybe to tell it what to do after
651rebooting. ???
652
653==============================================================
654
655rtsig-max & rtsig-nr:
656
657The file rtsig-max can be used to tune the maximum number
658of POSIX realtime (queued) signals that can be outstanding
659in the system.
660
661rtsig-nr shows the number of RT signals currently queued.
662
663==============================================================
664
665sg-big-buff:
666
667This file shows the size of the generic SCSI (sg) buffer.
668You can't tune it just yet, but you could change it on
669compile time by editing include/scsi/sg.h and changing
670the value of SG_BIG_BUFF.
671
672There shouldn't be any reason to change this value. If
673you can come up with one, you probably know what you
674are doing anyway :)
675
676==============================================================
677
Carlos Alberto Lopez Perez358e4192013-01-04 15:35:05 -0800678shmall:
679
680This parameter sets the total amount of shared memory pages that
681can be used system wide. Hence, SHMALL should always be at least
682ceil(shmmax/PAGE_SIZE).
683
684If you are not sure what the default PAGE_SIZE is on your Linux
685system, you can run the following command:
686
687# getconf PAGE_SIZE
688
689==============================================================
690
Borislav Petkov807094c2011-07-23 10:39:29 -0700691shmmax:
Linus Torvalds1da177e2005-04-16 15:20:36 -0700692
693This value can be used to query and set the run time limit
694on the maximum shared memory segment size that can be created.
Borislav Petkov807094c2011-07-23 10:39:29 -0700695Shared memory segments up to 1Gb are now supported in the
Linus Torvalds1da177e2005-04-16 15:20:36 -0700696kernel. This value defaults to SHMMAX.
697
698==============================================================
699
Vasiliy Kulikovb34a6b12011-07-26 16:08:48 -0700700shm_rmid_forced:
701
702Linux lets you set resource limits, including how much memory one
703process can consume, via setrlimit(2). Unfortunately, shared memory
704segments are allowed to exist without association with any process, and
705thus might not be counted against any resource limits. If enabled,
706shared memory segments are automatically destroyed when their attach
707count becomes zero after a detach or a process termination. It will
708also destroy segments that were created, but never attached to, on exit
709from the process. The only use left for IPC_RMID is to immediately
710destroy an unattached segment. Of course, this breaks the way things are
711defined, so some applications might stop working. Note that this
712feature will do you no good unless you also configure your resource
713limits (in particular, RLIMIT_AS and RLIMIT_NPROC). Most systems don't
714need this.
715
716Note that if you change this from 0 to 1, already created segments
717without users and with a dead originative process will be destroyed.
718
719==============================================================
720
Borislav Petkov807094c2011-07-23 10:39:29 -0700721tainted:
Linus Torvalds1da177e2005-04-16 15:20:36 -0700722
723Non-zero if the kernel has been tainted. Numeric values, which
724can be ORed together:
725
Greg Kroah-Hartmanbb206982008-10-17 15:01:07 -0700726 1 - A module with a non-GPL license has been loaded, this
727 includes modules with no license.
728 Set by modutils >= 2.4.9 and module-init-tools.
729 2 - A module was force loaded by insmod -f.
730 Set by modutils >= 2.4.9 and module-init-tools.
731 4 - Unsafe SMP processors: SMP with CPUs not designed for SMP.
732 8 - A module was forcibly unloaded from the system by rmmod -f.
733 16 - A hardware machine check error occurred on the system.
734 32 - A bad page was discovered on the system.
735 64 - The user has asked that the system be marked "tainted". This
736 could be because they are running software that directly modifies
737 the hardware, or for other reasons.
738 128 - The system has died.
739 256 - The ACPI DSDT has been overridden with one supplied by the user
740 instead of using the one provided by the hardware.
741 512 - A kernel warning has occurred.
7421024 - A module from drivers/staging was loaded.
Larry Fingerf5fe1842012-02-06 09:49:50 -08007432048 - The system is working around a severe firmware bug.
7444096 - An out-of-tree module has been loaded.
Linus Torvalds1da177e2005-04-16 15:20:36 -0700745
Shen Feng760df932009-04-02 16:57:20 -0700746==============================================================
747
Shen Feng760df932009-04-02 16:57:20 -0700748unknown_nmi_panic:
749
Borislav Petkov807094c2011-07-23 10:39:29 -0700750The value in this file affects behavior of handling NMI. When the
751value is non-zero, unknown NMI is trapped and then panic occurs. At
752that time, kernel debugging information is displayed on console.
Shen Feng760df932009-04-02 16:57:20 -0700753
Borislav Petkov807094c2011-07-23 10:39:29 -0700754NMI switch that most IA32 servers have fires unknown NMI up, for
755example. If a system hangs up, try pressing the NMI switch.
Li Zefan08825c92013-05-17 10:31:20 +0800756
757==============================================================
758
759watchdog_thresh:
760
761This value can be used to control the frequency of hrtimer and NMI
762events and the soft and hard lockup thresholds. The default threshold
763is 10 seconds.
764
765The softlockup threshold is (2 * watchdog_thresh). Setting this
766tunable to zero will disable lockup detection altogether.
767
768==============================================================