blob: cbfaaa674118daaf46e6467cf865d4cd06aae23e [file] [log] [blame]
Linus Torvalds1da177e2005-04-16 15:20:36 -07001The Linux kernel supports the following overcommit handling modes
2
30 - Heuristic overcommit handling. Obvious overcommits of
4 address space are refused. Used for a typical system. It
5 ensures a seriously wild allocation fails while allowing
6 overcommit to reduce swap usage. root is allowed to
Lucas De Marchi25985ed2011-03-30 22:57:33 -03007 allocate slightly more memory in this mode. This is the
Linus Torvalds1da177e2005-04-16 15:20:36 -07008 default.
9
101 - Always overcommit. Appropriate for some scientific
Andrew Shewmakerc9b1d092013-04-29 15:08:10 -070011 applications. Classic example is code using sparse arrays
12 and just relying on the virtual memory consisting almost
13 entirely of zero pages.
Linus Torvalds1da177e2005-04-16 15:20:36 -070014
152 - Don't overcommit. The total address space commit
16 for the system is not permitted to exceed swap + a
Jerome Marchand49f0ce52014-01-21 15:49:14 -080017 configurable amount (default is 50%) of physical RAM.
18 Depending on the amount you use, in most situations
Linus Torvalds1da177e2005-04-16 15:20:36 -070019 this means a process will not be killed while accessing
20 pages but will receive errors on memory allocation as
21 appropriate.
22
Andrew Shewmakerc9b1d092013-04-29 15:08:10 -070023 Useful for applications that want to guarantee their
24 memory allocations will be available in the future
25 without having to initialize every page.
26
Linus Torvalds1da177e2005-04-16 15:20:36 -070027The overcommit policy is set via the sysctl `vm.overcommit_memory'.
28
Jerome Marchand49f0ce52014-01-21 15:49:14 -080029The overcommit amount can be set via `vm.overcommit_ratio' (percentage)
30or `vm.overcommit_kbytes' (absolute value).
Linus Torvalds1da177e2005-04-16 15:20:36 -070031
32The current overcommit limit and amount committed are viewable in
33/proc/meminfo as CommitLimit and Committed_AS respectively.
34
35Gotchas
36-------
37
38The C language stack growth does an implicit mremap. If you want absolute
39guarantees and run close to the edge you MUST mmap your stack for the
40largest size you think you will need. For typical stack usage this does
41not matter much but it's a corner case if you really really care
42
43In mode 2 the MAP_NORESERVE flag is ignored.
44
45
46How It Works
47------------
48
49The overcommit is based on the following rules
50
51For a file backed map
52 SHARED or READ-only - 0 cost (the file is the map not swap)
53 PRIVATE WRITABLE - size of mapping per instance
54
55For an anonymous or /dev/zero map
56 SHARED - size of mapping
57 PRIVATE READ-only - 0 cost (but of little use)
58 PRIVATE WRITABLE - size of mapping per instance
59
60Additional accounting
61 Pages made writable copies by mmap
62 shmfs memory drawn from the same pool
63
64Status
65------
66
67o We account mmap memory mappings
68o We account mprotect changes in commit
69o We account mremap changes in size
70o We account brk
71o We account munmap
72o We report the commit status in /proc
73o Account and check on fork
74o Review stack handling/building on exec
75o SHMfs accounting
76o Implement actual limit enforcement
77
78To Do
79-----
80o Account ptrace pages (this is hard)