Blame - Documentation/scheduler/completion.txt - kernel/msm-5.4

blob: e5b9df4d8078bf258da650ee239808648149a6a0 [file] [log] [blame]

Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	1	Completions - "wait for completion" barrier APIs
				2	================================================
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	3
				4	Introduction:
				5	-------------
				6
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	7	If you have one or more threads that must wait for some kernel activity
Jonathan Corbet	7085f6c	2015-03-27 10:16:35 -0600	[diff] [blame]	8	to have reached a point or a specific state, completions can provide a
				9	race-free solution to this problem. Semantically they are somewhat like a
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	10	pthread_barrier() and have similar use-cases.
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	11
Jonathan Corbet	7085f6c	2015-03-27 10:16:35 -0600	[diff] [blame]	12	Completions are a code synchronization mechanism which is preferable to any
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	13	misuse of locks/semaphores and busy-loops. Any time you think of using
				14	yield() or some quirky msleep(1) loop to allow something else to proceed,
				15	you probably want to look into using one of the wait_for_completion*()
				16	calls and complete() instead.
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	17
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	18	The advantage of using completions is that they have a well defined, focused
				19	purpose which makes it very easy to see the intent of the code, but they
				20	also result in more efficient code as all threads can continue execution
				21	until the result is actually needed, and both the waiting and the signalling
				22	is highly efficient using low level scheduler sleep/wakeup facilities.
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	23
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	24	Completions are built on top of the waitqueue and wakeup infrastructure of
				25	the Linux scheduler. The event the threads on the waitqueue are waiting for
				26	is reduced to a simple flag in 'struct completion', appropriately called "done".
				27
				28	As completions are scheduling related, the code can be found in
Brian Norris	dc92726	2016-11-15 14:42:14 -0800	[diff] [blame]	29	kernel/sched/completion.c.
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	30
				31
				32	Usage:
				33	------
				34
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	35	There are three main parts to using completions:
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	36
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	37	- the initialization of the 'struct completion' synchronization object
				38	- the waiting part through a call to one of the variants of wait_for_completion(),
				39	- the signaling side through a call to complete() or complete_all().
				40
				41	There are also some helper functions for checking the state of completions.
				42	Note that while initialization must happen first, the waiting and signaling
				43	part can happen in any order. I.e. it's entirely normal for a thread
				44	to have marked a completion as 'done' before another thread checks whether
				45	it has to wait for it.
				46
				47	To use completions you need to #include <linux/completion.h> and
				48	create a static or dynamic variable of type 'struct completion',
				49	which has only two fields:
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	50
				51	struct completion {
				52	unsigned int done;
				53	wait_queue_head_t wait;
				54	};
				55
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	56	This provides the ->wait waitqueue to place tasks on for waiting (if any), and
				57	the ->done completion flag for indicating whether it's completed or not.
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	58
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	59	Completions should be named to refer to the event that is being synchronized on.
				60	A good example is:
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	61
				62	wait_for_completion(&early_console_added);
				63
				64	complete(&early_console_added);
				65
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	66	Good, intuitive naming (as always) helps code readability. Naming a completion
				67	'complete' is not helpful unless the purpose is super obvious...
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	68
				69
				70	Initializing completions:
				71	-------------------------
				72
Nicholas Mc Guire	11e1369	2018-10-16 15:45:39 +0200	[diff] [blame]	73	Dynamically allocated completion objects should preferably be embedded in data
				74	structures that are assured to be alive for the life-time of the function/driver,
				75	to prevent races with asynchronous complete() calls from occurring.
				76
				77	Particular care should be taken when using the _timeout() or _killable()/_interruptible()
				78	variants of wait_for_completion(), as it must be assured that memory de-allocation
				79	does not happen until all related activities (complete() or reinit_completion())
				80	have taken place, even if these wait functions return prematurely due to a timeout
				81	or a signal triggering.
				82
				83	Initializing of dynamically allocated completion objects is done via a call to
				84	init_completion():
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	85
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	86	init_completion(&dynamic_object->done);
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	87
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	88	In this call we initialize the waitqueue and set ->done to 0, i.e. "not completed"
				89	or "not done".
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	90
				91	The re-initialization function, reinit_completion(), simply resets the
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	92	->done field to 0 ("not done"), without touching the waitqueue.
				93	Callers of this function must make sure that there are no racy
				94	wait_for_completion() calls going on in parallel.
				95
				96	Calling init_completion() on the same completion object twice is
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	97	most likely a bug as it re-initializes the queue to an empty queue and
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	98	enqueued tasks could get "lost" - use reinit_completion() in that case,
				99	but be aware of other races.
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	100
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	101	For static declaration and initialization, macros are available.
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	102
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	103	For static (or global) declarations in file scope you can use DECLARE_COMPLETION():
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	104
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	105	static DECLARE_COMPLETION(setup_done);
				106	DECLARE_COMPLETION(setup_done);
				107
				108	Note that in this case the completion is boot time (or module load time)
				109	initialized to 'not done' and doesn't require an init_completion() call.
				110
				111	When a completion is declared as a local variable within a function,
Nicholas Mc Guire	11e1369	2018-10-16 15:45:39 +0200	[diff] [blame]	112	then the initialization should always use DECLARE_COMPLETION_ONSTACK()
				113	explicitly, not just to make lockdep happy, but also to make it clear
				114	that limited scope had been considered and is intentional:
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	115
				116	DECLARE_COMPLETION_ONSTACK(setup_done)
				117
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	118	Note that when using completion objects as local variables you must be
Nicholas Mc Guire	11e1369	2018-10-16 15:45:39 +0200	[diff] [blame]	119	acutely aware of the short life time of the function stack: the function
				120	must not return to a calling context until all activities (such as waiting
				121	threads) have ceased and the completion object is completely unused.
				122
				123	To emphasise this again: in particular when using some of the waiting API variants
				124	with more complex outcomes, such as the timeout or signalling (_timeout(),
				125	_killable() and _interruptible()) variants, the wait might complete
				126	prematurely while the object might still be in use by another thread - and a return
				127	from the wait_on_completion*() caller function will deallocate the function
				128	stack and cause subtle data corruption if a complete() is done in some
				129	other thread. Simple testing might not trigger these kinds of races.
				130
				131	If unsure, use dynamically allocated completion objects, preferably embedded
				132	in some other long lived object that has a boringly long life time which
				133	exceeds the life time of any helper threads using the completion object,
				134	or has a lock or other synchronization mechanism to make sure complete()
				135	is not called on a freed object.
				136
				137	A naive DECLARE_COMPLETION() on the stack triggers a lockdep warning.
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	138
				139	Waiting for completions:
				140	------------------------
				141
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	142	For a thread to wait for some concurrent activity to finish, it
				143	calls wait_for_completion() on the initialized completion structure:
				144
				145	void wait_for_completion(struct completion *done)
				146
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	147	A typical usage scenario is:
				148
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	149	CPU#1 CPU#2
				150
Jonathan Corbet	7085f6c	2015-03-27 10:16:35 -0600	[diff] [blame]	151	struct completion setup_done;
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	152
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	153	init_completion(&setup_done);
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	154	initialize_work(...,&setup_done,...);
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	155
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	156	/* run non-dependent code / / do setup */
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	157
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	158	wait_for_completion(&setup_done); complete(setup_done);
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	159
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	160	This is not implying any particular order between wait_for_completion() and
				161	the call to complete() - if the call to complete() happened before the call
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	162	to wait_for_completion() then the waiting side simply will continue
John Garry	7b6abce	2018-10-10 22:56:32 +0800	[diff] [blame]	163	immediately as all dependencies are satisfied; if not, it will block until
Nicholas Mc Guire	4988aaa	2015-02-20 12:28:48 -0500	[diff] [blame]	164	completion is signaled by complete().
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	165
Jonathan Corbet	7085f6c	2015-03-27 10:16:35 -0600	[diff] [blame]	166	Note that wait_for_completion() is calling spin_lock_irq()/spin_unlock_irq(),
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	167	so it can only be called safely when you know that interrupts are enabled.
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	168	Calling it from IRQs-off atomic contexts will result in hard-to-detect
				169	spurious enabling of interrupts.
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	170
Jonathan Corbet	7085f6c	2015-03-27 10:16:35 -0600	[diff] [blame]	171	The default behavior is to wait without a timeout and to mark the task as
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	172	uninterruptible. wait_for_completion() and its variants are only safe
Nicholas Mc Guire	4988aaa	2015-02-20 12:28:48 -0500	[diff] [blame]	173	in process context (as they can sleep) but not in atomic context,
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	174	interrupt context, with disabled IRQs, or preemption is disabled - see also
Nicholas Mc Guire	4988aaa	2015-02-20 12:28:48 -0500	[diff] [blame]	175	try_wait_for_completion() below for handling completion in atomic/interrupt
				176	context.
				177
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	178	As all variants of wait_for_completion() can (obviously) block for a long
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	179	time depending on the nature of the activity they are waiting for, so in
				180	most cases you probably don't want to call this with held mutexes.
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	181
				182
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	183	wait_for_completion*() variants available:
				184	------------------------------------------
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	185
				186	The below variants all return status and this status should be checked in
				187	most(/all) cases - in cases where the status is deliberately not checked you
				188	probably want to make a note explaining this (e.g. see
				189	arch/arm/kernel/smp.c:__cpu_up()).
				190
				191	A common problem that occurs is to have unclean assignment of return types,
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	192	so take care to assign return-values to variables of the proper type.
				193
				194	Checking for the specific meaning of return values also has been found
				195	to be quite inaccurate, e.g. constructs like:
				196
				197	if (!wait_for_completion_interruptible_timeout(...))
				198
				199	... would execute the same code path for successful completion and for the
				200	interrupted case - which is probably not what you want.
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	201
				202	int wait_for_completion_interruptible(struct completion *done)
				203
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	204	This function marks the task TASK_INTERRUPTIBLE while it is waiting.
				205	If a signal was received while waiting it will return -ERESTARTSYS; 0 otherwise.
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	206
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	207	unsigned long wait_for_completion_timeout(struct completion *done, unsigned long timeout)
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	208
Nicholas Mc Guire	4988aaa	2015-02-20 12:28:48 -0500	[diff] [blame]	209	The task is marked as TASK_UNINTERRUPTIBLE and will wait at most 'timeout'
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	210	jiffies. If a timeout occurs it returns 0, else the remaining time in
				211	jiffies (but at least 1).
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	212
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	213	Timeouts are preferably calculated with msecs_to_jiffies() or usecs_to_jiffies(),
				214	to make the code largely HZ-invariant.
				215
				216	If the returned timeout value is deliberately ignored a comment should probably explain
				217	why (e.g. see drivers/mfd/wm8350-core.c wm8350_read_auxadc()).
				218
				219	long wait_for_completion_interruptible_timeout(struct completion *done, unsigned long timeout)
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	220
Jonathan Corbet	7085f6c	2015-03-27 10:16:35 -0600	[diff] [blame]	221	This function passes a timeout in jiffies and marks the task as
				222	TASK_INTERRUPTIBLE. If a signal was received it will return -ERESTARTSYS;
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	223	otherwise it returns 0 if the completion timed out, or the remaining time in
Jonathan Corbet	7085f6c	2015-03-27 10:16:35 -0600	[diff] [blame]	224	jiffies if completion occurred.
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	225
Jonathan Corbet	7085f6c	2015-03-27 10:16:35 -0600	[diff] [blame]	226	Further variants include _killable which uses TASK_KILLABLE as the
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	227	designated tasks state and will return -ERESTARTSYS if it is interrupted,
				228	or 0 if completion was achieved. There is a _timeout variant as well:
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	229
				230	long wait_for_completion_killable(struct completion *done)
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	231	long wait_for_completion_killable_timeout(struct completion *done, unsigned long timeout)
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	232
Nicholas Mc Guire	4988aaa	2015-02-20 12:28:48 -0500	[diff] [blame]	233	The _io variants wait_for_completion_io() behave the same as the non-_io
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	234	variants, except for accounting waiting time as 'waiting on IO', which has
				235	an impact on how the task is accounted in scheduling/IO stats:
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	236
				237	void wait_for_completion_io(struct completion *done)
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	238	unsigned long wait_for_completion_io_timeout(struct completion *done, unsigned long timeout)
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	239
				240
				241	Signaling completions:
				242	----------------------
				243
Nicholas Mc Guire	4988aaa	2015-02-20 12:28:48 -0500	[diff] [blame]	244	A thread that wants to signal that the conditions for continuation have been
				245	achieved calls complete() to signal exactly one of the waiters that it can
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	246	continue:
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	247
				248	void complete(struct completion *done)
				249
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	250	... or calls complete_all() to signal all current and future waiters:
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	251
				252	void complete_all(struct completion *done)
				253
				254	The signaling will work as expected even if completions are signaled before
				255	a thread starts waiting. This is achieved by the waiter "consuming"
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	256	(decrementing) the done field of 'struct completion'. Waiting threads
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	257	wakeup order is the same in which they were enqueued (FIFO order).
				258
				259	If complete() is called multiple times then this will allow for that number
				260	of waiters to continue - each call to complete() will simply increment the
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	261	done field. Calling complete_all() multiple times is a bug though. Both
				262	complete() and complete_all() can be called in IRQ/atomic context safely.
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	263
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	264	There can only be one thread calling complete() or complete_all() on a
				265	particular 'struct completion' at any time - serialized through the wait
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	266	queue spinlock. Any such concurrent calls to complete() or complete_all()
				267	probably are a design bug.
				268
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	269	Signaling completion from IRQ context is fine as it will appropriately
Linus Torvalds	01aa9d5	2018-10-24 18:01:11 +0100	[diff] [blame]	270	lock with spin_lock_irqsave()/spin_unlock_irqrestore() and it will never
				271	sleep.
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	272
				273
				274	try_wait_for_completion()/completion_done():
				275	--------------------------------------------
				276
Nicholas Mc Guire	4988aaa	2015-02-20 12:28:48 -0500	[diff] [blame]	277	The try_wait_for_completion() function will not put the thread on the wait
				278	queue but rather returns false if it would need to enqueue (block) the thread,
Jonathan Corbet	7085f6c	2015-03-27 10:16:35 -0600	[diff] [blame]	279	else it consumes one posted completion and returns true.
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	280
Nicholas Mc Guire	4988aaa	2015-02-20 12:28:48 -0500	[diff] [blame]	281	bool try_wait_for_completion(struct completion *done)
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	282
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	283	Finally, to check the state of a completion without changing it in any way,
Jonathan Corbet	7085f6c	2015-03-27 10:16:35 -0600	[diff] [blame]	284	call completion_done(), which returns false if there are no posted
				285	completions that were not yet consumed by waiters (implying that there are
				286	waiters) and true otherwise;
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	287
Nicholas Mc Guire	4988aaa	2015-02-20 12:28:48 -0500	[diff] [blame]	288	bool completion_done(struct completion *done)
Nicholas Mc Guire	202799b	2015-01-30 08:01:52 +0100	[diff] [blame]	289
				290	Both try_wait_for_completion() and completion_done() are safe to be called in
Ingo Molnar	0c37334	2018-10-11 10:36:23 +0200	[diff] [blame]	291	IRQ or atomic context.