  Kernel Memory Leak Detector
  ===========================
  
  Introduction
  ------------
  
  Kmemleak provides a way of detecting possible kernel memory leaks in a
  way similar to a tracing garbage collector
  (https://en.wikipedia.org/wiki/Garbage_collection_%28computer_science%29#Tracing_garbage_collectors),
  with the difference that the orphan objects are not freed but only
  reported via /sys/kernel/debug/kmemleak. A similar method is used by the
  Valgrind tool (memcheck --leak-check) to detect the memory leaks in
  user-space applications.
  Kmemleak is supported on x86, arm, powerpc, sparc, sh, microblaze, mips,
  s390, metag and tile.
  
  Usage
  -----
  
  CONFIG_DEBUG_KMEMLEAK in "Kernel hacking" has to be enabled. A kernel
  thread scans the memory every 10 minutes (by default) and prints the
  number of new unreferenced objects found. To display the details of all
  the possible memory leaks:
  
    # mount -t debugfs nodev /sys/kernel/debug/
    # cat /sys/kernel/debug/kmemleak
  
  To trigger an intermediate memory scan:
  
    # echo scan > /sys/kernel/debug/kmemleak
  
  To clear the list of all current possible memory leaks:
  
    # echo clear > /sys/kernel/debug/kmemleak
  
  New leaks will then come up upon reading /sys/kernel/debug/kmemleak
  again.
  
  Note that the orphan objects are listed in the order they were allocated
  and one object at the beginning of the list may cause other subsequent
  objects to be reported as orphan.
  
  Memory scanning parameters can be modified at run-time by writing to the
  /sys/kernel/debug/kmemleak file. The following parameters are supported:
  
    off         - disable kmemleak (irreversible)
    stack=on    - enable the task stacks scanning (default)
    stack=off   - disable the task stacks scanning
    scan=on     - start the automatic memory scanning thread (default)
    scan=off    - stop the automatic memory scanning thread
    scan=<secs> - set the automatic memory scanning period in seconds
                  (default 600, 0 to stop the automatic scanning)
    scan        - trigger a memory scan
    clear       - clear list of current memory leak suspects, done by
                  marking all current reported unreferenced objects gray,
                  or free all kmemleak objects if kmemleak has been disabled.
    dump=<addr> - dump information about the object found at <addr>
04f70336c   Catalin Marinas   kmemleak: Add doc...
54
55
56
  
  Kmemleak can also be disabled at boot-time by passing "kmemleak=off" on
  the kernel command line.
  
  Memory may be allocated or freed before kmemleak is initialised and
  these actions are stored in an early log buffer. The size of this buffer
  is configured via the CONFIG_DEBUG_KMEMLEAK_EARLY_LOG_SIZE option.
  
  If CONFIG_DEBUG_KMEMLEAK_DEFAULT_OFF is enabled, kmemleak is disabled
  by default. Passing "kmemleak=on" on the kernel command line enables
  it.
  
  Basic Algorithm
  ---------------
  
  The memory allocations via kmalloc, vmalloc, kmem_cache_alloc and
  friends are traced and the pointers, together with additional
  information like size and stack trace, are stored in an rbtree.
  The corresponding freeing function calls are tracked and the pointers
  removed from the kmemleak data structures.
  
  An allocated block of memory is considered orphan if no pointer to its
  start address or to any location inside the block can be found by
  scanning the memory (including saved registers). This means that there
  might be no way for the kernel to pass the address of the allocated
  block to a freeing function and therefore the block is considered a
  memory leak.
  
  The scanning algorithm steps:
  
    1. mark all objects as white (remaining white objects will later be
       considered orphan)
    2. scan the memory starting with the data section and stacks, checking
       the values against the addresses stored in the rbtree. If
       a pointer to a white object is found, the object is added to the
       gray list
    3. scan the gray objects for matching addresses (some white objects
       can become gray and be added at the end of the gray list) until
       the gray list is finished
    4. the remaining white objects are considered orphan and reported via
       /sys/kernel/debug/kmemleak
  
  Some allocated memory blocks have pointers stored in the kernel's
  internal data structures and they cannot be detected as orphans. To
  avoid this, kmemleak can also store the number of values pointing to an
  address inside the block address range that need to be found so that the
  block is not considered a leak. One example is __vmalloc().
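  
  As a rough illustration of the four scanning steps above, the following
  userspace C sketch models tracked blocks as plain address ranges and the
  scanned memory as arrays of pointer-sized values. It is not the kernel
  implementation: struct object, struct gray_list, lookup, scan_words and
  scan are hypothetical names, a linear search stands in for the rbtree,
  and min_count plays the role of the "number of values ... that need to
  be found" mentioned above.

```c
#include <stddef.h>
#include <stdint.h>

enum color { WHITE, GRAY };

/* Hypothetical userspace model of one tracked allocation. */
struct object {
    uintptr_t start;          /* start address of the block */
    size_t size;              /* block size in bytes */
    int min_count;            /* references required to leave the white set */
    const uintptr_t *words;   /* block contents as pointer-sized values */
    size_t nwords;
    int count;                /* references found in the current scan */
    enum color color;
    struct object *next_gray; /* link in the gray list */
};

struct gray_list { struct object *head, *tail; };

/* Find the object whose address range contains value (a linear search
 * here, standing in for the rbtree lookup described in the text). */
static struct object *lookup(struct object *objs, size_t n, uintptr_t value)
{
    for (size_t i = 0; i < n; i++)
        if (value >= objs[i].start && value - objs[i].start < objs[i].size)
            return &objs[i];
    return NULL;
}

/* Check every word; a white object that reaches min_count turns gray
 * and is appended to the end of the gray list. */
static void scan_words(struct object *objs, size_t n, const uintptr_t *words,
                       size_t nwords, struct gray_list *gray)
{
    for (size_t i = 0; i < nwords; i++) {
        struct object *o = lookup(objs, n, words[i]);

        if (!o || o->color != WHITE || ++o->count < o->min_count)
            continue;
        o->color = GRAY;
        o->next_gray = NULL;
        if (gray->tail)
            gray->tail->next_gray = o;
        else
            gray->head = o;
        gray->tail = o;
    }
}

/* Run one scan over the root words; returns the number of objects that
 * are still white afterwards, i.e. the suspected leaks. */
size_t scan(struct object *objs, size_t n, const uintptr_t *roots, size_t nroots)
{
    struct gray_list gray = { NULL, NULL };
    size_t orphans = 0;

    for (size_t i = 0; i < n; i++) {            /* step 1: all white */
        objs[i].count = 0;
        objs[i].color = WHITE;
    }
    scan_words(objs, n, roots, nroots, &gray);  /* step 2: data and stacks */
    for (struct object *o = gray.head; o; o = o->next_gray)
        scan_words(objs, n, o->words, o->nwords, &gray);  /* step 3 */
    for (size_t i = 0; i < n; i++)              /* step 4: count orphans */
        orphans += objs[i].color == WHITE;
    return orphans;
}
```

  In this model a block that is referenced only from another white block
  stays white as well, and a block with min_count 2 stays white when only
  one matching value is found.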
  
  Testing specific sections with kmemleak
  ---------------------------------------
  
  Upon initial bootup your /sys/kernel/debug/kmemleak output page may be
  quite extensive. This can also be the case if you have very buggy code
  when doing development. To work around these situations you can use the
  'clear' command to clear all reported unreferenced objects from the
  /sys/kernel/debug/kmemleak output. By issuing a 'scan' after a 'clear'
  you can find new unreferenced objects; this should help with testing
  specific sections of code.
  
  To test a critical section on demand with a clean kmemleak do:
  
    # echo clear > /sys/kernel/debug/kmemleak
    ... test your kernel or modules ...
    # echo scan > /sys/kernel/debug/kmemleak
  
  Then, as usual, get your report with:
  
    # cat /sys/kernel/debug/kmemleak
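  
  The same clear/test/scan/read sequence could be driven from a small
  userspace C helper, for example inside a test harness. The sketch below
  is only an illustration: kml_command, kml_report and kml_check are
  hypothetical names that are not part of kmemleak, and the path of the
  kmemleak file is passed in by the caller rather than hard-coded.

```c
#include <stdio.h>

/* Write one command such as "clear" or "scan" to the kmemleak file.
 * Returns 0 on success, -1 on error. */
int kml_command(const char *path, const char *cmd)
{
    FILE *f = fopen(path, "w");
    int ok;

    if (!f)
        return -1;
    ok = fputs(cmd, f) >= 0;
    if (fclose(f) != 0)     /* buffered data is written out here */
        ok = 0;
    return ok ? 0 : -1;
}

/* Copy the current report to out. Returns the number of bytes copied,
 * or -1 on error. */
long kml_report(const char *path, FILE *out)
{
    FILE *f = fopen(path, "r");
    char buf[4096];
    size_t n;
    long total = 0;

    if (!f)
        return -1;
    while ((n = fread(buf, 1, sizeof(buf), f)) > 0) {
        if (fwrite(buf, 1, n, out) != n) {
            fclose(f);
            return -1;
        }
        total += (long)n;
    }
    fclose(f);
    return total;
}

/* clear, run the code under test, scan, then print the report.
 * run_test may be NULL when the workload is started elsewhere. */
int kml_check(const char *path, void (*run_test)(void), FILE *out)
{
    if (kml_command(path, "clear") != 0)
        return -1;
    if (run_test)
        run_test();
    if (kml_command(path, "scan") != 0)
        return -1;
    return kml_report(path, out) < 0 ? -1 : 0;
}
```

  A caller would typically pass "/sys/kernel/debug/kmemleak" as path and
  stdout as out, and needs the same privileges as the shell commands above.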
  
  Freeing kmemleak internal objects
  ---------------------------------
  
  To allow access to previously found memory leaks after kmemleak has been
  disabled by the user or due to a fatal error, internal kmemleak objects
  won't be freed when kmemleak is disabled, and those objects may occupy
  a large part of physical memory.
  
  In this situation, you may reclaim memory with:
  
    # echo clear > /sys/kernel/debug/kmemleak
  
  Kmemleak API
  ------------
  
  See the include/linux/kmemleak.h header for the function prototypes.
  
  kmemleak_init            - initialize kmemleak
  kmemleak_alloc           - notify of a memory block allocation
  kmemleak_alloc_percpu    - notify of a percpu memory block allocation
  kmemleak_free            - notify of a memory block freeing
  kmemleak_free_part       - notify of a partial memory block freeing
  kmemleak_free_percpu     - notify of a percpu memory block freeing
  kmemleak_update_trace    - update object allocation stack trace
  kmemleak_not_leak        - mark an object as not a leak
  kmemleak_ignore          - do not scan or report an object as leak
  kmemleak_scan_area       - add scan areas inside a memory block
  kmemleak_no_scan         - do not scan a memory block
  kmemleak_erase           - erase an old value in a pointer variable
  kmemleak_alloc_recursive - as kmemleak_alloc but checks the recursiveness
  kmemleak_free_recursive  - as kmemleak_free but checks the recursiveness
  
  Dealing with false positives/negatives
  --------------------------------------
  
  The false negatives are real memory leaks (orphan objects) but not
  reported by kmemleak because values found during the memory scanning
  point to such objects. To reduce the number of false negatives, kmemleak
  provides the kmemleak_ignore, kmemleak_scan_area, kmemleak_no_scan and
  kmemleak_erase functions (see above). The task stacks also increase the
  number of false negatives; their scanning is enabled by default and can
  be turned off with stack=off.
  
  The false positives are objects wrongly reported as being memory leaks
  (orphan). For objects known not to be leaks, kmemleak provides the
  kmemleak_not_leak function. The kmemleak_ignore could also be used if
  the memory block is known not to contain other pointers and it will no
  longer be scanned.
  
  Some of the reported leaks are only transient, especially on SMP
  systems, because of pointers temporarily stored in CPU registers or
  stacks. Kmemleak defines MSECS_MIN_AGE (defaulting to 1000) representing
  the minimum age of an object to be reported as a memory leak.
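  
  The effect of the minimum age can be sketched as a simple comparison.
  The helper below is a userspace illustration only: old_enough_to_report
  is a hypothetical name, and it assumes that the allocation time and the
  scan time are both available in milliseconds, which need not match how
  the kernel keeps time internally.

```c
#include <stdbool.h>
#include <stdint.h>

#define MSECS_MIN_AGE 1000 /* default quoted in the text above */

/* Hypothetical helper: true when an unreferenced object allocated at
 * alloc_ms has reached the minimum age by the time of the scan at
 * scan_ms (both in milliseconds). Younger objects are not reported
 * yet because the reference may only live in a register or stack. */
bool old_enough_to_report(uint64_t alloc_ms, uint64_t scan_ms)
{
    return scan_ms >= alloc_ms && scan_ms - alloc_ms >= MSECS_MIN_AGE;
}
```

  An object that is skipped for being too young is not lost: if it is still
  unreferenced at a later scan, it will have aged past the threshold.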
  
  Limitations and Drawbacks
  -------------------------
  
  The main drawback is the reduced performance of memory allocation and
  freeing. To avoid other penalties, the memory scanning is only performed
  periodically by the kernel thread (every 10 minutes by default) or when
  "scan" is written to the /sys/kernel/debug/kmemleak file. In any case,
  this tool is intended for debugging purposes where the performance might
  not be the most important requirement.
  
  To keep the algorithm simple, kmemleak scans for values pointing to any
  address inside a block's address range. This may lead to an increased
  number of false negatives. However, it is likely that a real memory leak
  will eventually become visible.
  
  Another source of false negatives is the data stored in non-pointer
  values. In a future version, kmemleak could scan only the pointer
  members in the allocated structures. This feature would solve many of
  the false negative cases described above.
  
  The tool can report false positives. These are cases where an allocated
  block doesn't need to be freed (some cases in the init_call functions),
  the pointer is calculated by other methods than the usual container_of
  macro or the pointer is stored in a location not scanned by kmemleak.
  
  Page allocations and ioremap are not tracked.