Commit 040a0a37100563754bb1fee6ff6427420bcfa609

Authored by Maarten Lankhorst
Committed by Ingo Molnar
1 parent a41b56efa7

mutex: Add support for wound/wait style locks

Wound/wait mutexes are used when multiple lock acquisitions of a
similar type can be done in an arbitrary order. The deadlock
handling used here is called wait/wound in the RDBMS literature:
the older task waits until it can acquire the contended lock. The
younger task needs to back off and drop all the locks it is
currently holding, i.e. the younger task is wounded.

For full documentation please read Documentation/ww-mutex-design.txt.
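
A minimal sketch of the resulting acquire/backoff/retry pattern
(illustration only; example_ww_class, lock_pair and the two-lock
scenario are hypothetical, the full patterns are spelled out in
Documentation/ww-mutex-design.txt):

#include <linux/kernel.h>       /* swap() */
#include <linux/mutex.h>        /* ww_mutex API added by this patch */

static DEFINE_WW_CLASS(example_ww_class);

/*
 * Acquire two ww_mutexes of the same class in arbitrary order.
 * The caller releases both with ww_mutex_unlock() and then calls
 * ww_acquire_fini(ctx) once it is done with the protected data.
 */
static void lock_pair(struct ww_mutex *a, struct ww_mutex *b,
                      struct ww_acquire_ctx *ctx)
{
        struct ww_mutex *first = a, *second = b;
        int ret;

        ww_acquire_init(ctx, &example_ww_class);

        ret = ww_mutex_lock(first, ctx); /* cannot fail: no lock held yet */
retry:
        ret = ww_mutex_lock(second, ctx);
        if (ret == -EDEADLK) {
                /* wounded: back off, then sleep on the contended lock */
                ww_mutex_unlock(first);
                ww_mutex_lock_slow(second, ctx);
                swap(first, second);    /* retry the lock we just dropped */
                goto retry;
        }
        ww_acquire_done(ctx);
}

The _interruptible variants follow the same pattern but must also
propagate -EINTR back to the caller.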

References: https://lwn.net/Articles/548909/
Signed-off-by: Maarten Lankhorst <maarten.lankhorst@canonical.com>
Acked-by: Daniel Vetter <daniel.vetter@ffwll.ch>
Acked-by: Rob Clark <robdclark@gmail.com>
Acked-by: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: dri-devel@lists.freedesktop.org
Cc: linaro-mm-sig@lists.linaro.org
Cc: rostedt@goodmis.org
Cc: daniel@ffwll.ch
Cc: Linus Torvalds <torvalds@linux-foundation.org>
Cc: Andrew Morton <akpm@linux-foundation.org>
Cc: Thomas Gleixner <tglx@linutronix.de>
Link: http://lkml.kernel.org/r/51C8038C.9000106@canonical.com
Signed-off-by: Ingo Molnar <mingo@kernel.org>

Showing 5 changed files with 1003 additions and 17 deletions

Documentation/ww-mutex-design.txt
  1 +Wait/Wound Deadlock-Proof Mutex Design
  2 +======================================
  3 +
  4 +Please read mutex-design.txt first, as it applies to wait/wound mutexes too.
  5 +
  6 +Motivation for WW-Mutexes
  7 +-------------------------
  8 +
  9 +GPUs do operations that commonly involve many buffers. Those buffers
  10 +can be shared across contexts/processes, exist in different memory
  11 +domains (for example VRAM vs system memory), and so on. And with
  12 +PRIME / dmabuf, they can even be shared across devices. So there are
  13 +a handful of situations where the driver needs to wait for buffers to
  14 +become ready. If you think about this in terms of waiting on a buffer
  15 +mutex for it to become available, this presents a problem because
  16 +there is no way to guarantee that buffers appear in an execbuf/batch in
  17 +the same order in all contexts. That is directly under control of
  18 +userspace, and a result of the sequence of GL calls that an application
  19 +makes, which results in the potential for deadlock. The problem gets
  20 +more complex when you consider that the kernel may need to migrate the
  21 +buffer(s) into VRAM before the GPU operates on the buffer(s), which
  22 +may in turn require evicting some other buffers (and you don't want to
  23 +evict other buffers which are already queued up to the GPU), but for a
  24 +simplified understanding of the problem you can ignore this.
  25 +
  26 +The algorithm that the TTM graphics subsystem came up with for dealing with
  27 +this problem is quite simple. For each group of buffers (execbuf) that need
  28 +to be locked, the caller would be assigned a unique reservation id/ticket,
  29 +from a global counter. In case of deadlock while locking all the buffers
  30 +associated with a execbuf, the one with the lowest reservation ticket (i.e.
  31 +the oldest task) wins, and the one with the higher reservation id (i.e. the
  32 +younger task) unlocks all of the buffers that it has already locked, and then
  33 +tries again.
  34 +
  35 +In the RDBMS literature this deadlock handling approach is called wait/wound:
  36 +The older task waits until it can acquire the contended lock. The younger task
  37 +needs to back off and drop all the locks it is currently holding, i.e. the
  38 +younger task is wounded.
  39 +
  40 +Concepts
  41 +--------
  42 +
  43 +Compared to normal mutexes two additional concepts/objects show up in the lock
  44 +interface for w/w mutexes:
  45 +
  46 +Acquire context: To ensure eventual forward progress it is important that a task
  47 +trying to acquire locks doesn't grab a new reservation id, but keeps the one it
  48 +acquired when starting the lock acquisition. This ticket is stored in the
  49 +acquire context. Furthermore the acquire context keeps track of debugging state
  50 +to catch w/w mutex interface abuse.
  51 +
  52 +W/w class: In contrast to normal mutexes the lock class needs to be explicit for
  53 +w/w mutexes, since it is required to initialize the acquire context.
  54 +
  55 +Furthermore there are three different classes of w/w lock acquire functions:
  56 +
  57 +* Normal lock acquisition with a context, using ww_mutex_lock.
  58 +
  59 +* Slowpath lock acquisition on the contending lock, used by the wounded task
  60 + after having dropped all already acquired locks. These functions have the
  61 + _slow postfix.
  62 +
  63 + From a simple semantics point-of-view the _slow functions are not strictly
  64 + required, since simply calling the normal ww_mutex_lock functions on the
  65 + contending lock (after having dropped all other already acquired locks) will
  66 + work correctly. After all if no other ww mutex has been acquired yet there's
  67 + no deadlock potential and hence the ww_mutex_lock call will block and not
  68 + prematurely return -EDEADLK. The advantage of the _slow functions is in
  69 + interface safety:
  70 + - ww_mutex_lock has a __must_check int return type, whereas ww_mutex_lock_slow
  71 + has a void return type. Note that since ww mutex code needs loops/retries
  72 + anyway the __must_check doesn't result in spurious warnings, even though the
  73 + very first lock operation can never fail.
  74 + - When full debugging is enabled ww_mutex_lock_slow checks that all acquired
  75 + ww mutexes have been released (preventing deadlocks) and makes sure that we
  76 + block on the contending lock (preventing spinning through the -EDEADLK
  77 + slowpath until the contended lock can be acquired).
  78 +
  79 +* Functions to only acquire a single w/w mutex, which results in the exact same
  80 + semantics as a normal mutex. This is done by calling ww_mutex_lock with a NULL
  81 + context.
  82 +
  83 + Again this is not strictly required. But often you only want to acquire a
  84 + single lock, in which case it's pointless to set up an acquire context (and so
  85 + better to avoid grabbing a deadlock avoidance ticket).
  86 +
  87 +Of course, all the usual variants for handling wake-ups due to signals are also
  88 +provided.
  89 +
  90 +Usage
  91 +-----
  92 +
  93 +Three different ways to acquire locks within the same w/w class. Common
  94 +definitions for methods #1 and #2:
  95 +
  96 +static DEFINE_WW_CLASS(ww_class);
  97 +
  98 +struct obj {
  99 + struct ww_mutex lock;
  100 + /* obj data */
  101 +};
  102 +
  103 +struct obj_entry {
  104 + struct list_head head;
  105 + struct obj *obj;
  106 +};
  107 +
  108 +Method 1, using a list in execbuf->buffers that's not allowed to be reordered.
  109 +This is useful if a list of required objects is already tracked somewhere.
  110 +Furthermore the lock helper can propagate the -EALREADY return code back to
  111 +the caller as a signal that an object appears twice on the list. This is useful if
  112 +the list is constructed from userspace input and the ABI requires userspace to
  113 +not have duplicate entries (e.g. for a gpu commandbuffer submission ioctl).
  114 +
  115 +int lock_objs(struct list_head *list, struct ww_acquire_ctx *ctx)
  116 +{
  117 + struct obj *res_obj = NULL;
  118 + struct obj_entry *contended_entry = NULL;
  119 + struct obj_entry *entry; int ret;
  120 +
  121 + ww_acquire_init(ctx, &ww_class);
  122 +
  123 +retry:
  124 + list_for_each_entry (entry, list, head) {
  125 + if (entry->obj == res_obj) {
  126 + res_obj = NULL;
  127 + continue;
  128 + }
  129 + ret = ww_mutex_lock(&entry->obj->lock, ctx);
  130 + if (ret < 0) {
  131 + contended_entry = entry;
  132 + goto err;
  133 + }
  134 + }
  135 +
  136 + ww_acquire_done(ctx);
  137 + return 0;
  138 +
  139 +err:
  140 + list_for_each_entry_continue_reverse (entry, list, head)
  141 + ww_mutex_unlock(&entry->obj->lock);
  142 +
  143 + if (res_obj)
  144 + ww_mutex_unlock(&res_obj->lock);
  145 +
  146 + if (ret == -EDEADLK) {
  147 + /* we lost out in a seqno race, lock and retry.. */
  148 + ww_mutex_lock_slow(&contended_entry->obj->lock, ctx);
  149 + res_obj = contended_entry->obj;
  150 + goto retry;
  151 + }
  152 + ww_acquire_fini(ctx);
  153 +
  154 + return ret;
  155 +}
  156 +
  157 +Method 2, using a list in execbuf->buffers that can be reordered. Same semantics
  158 +of duplicate entry detection using -EALREADY as method 1 above. But the
  159 +list-reordering allows for a bit more idiomatic code.
  160 +
  161 +int lock_objs(struct list_head *list, struct ww_acquire_ctx *ctx)
  162 +{
  163 + struct obj_entry *entry, *entry2; int ret;
  164 +
  165 + ww_acquire_init(ctx, &ww_class);
  166 +
  167 + list_for_each_entry (entry, list, head) {
  168 + ret = ww_mutex_lock(&entry->obj->lock, ctx);
  169 + if (ret < 0) {
  170 + entry2 = entry;
  171 +
  172 + list_for_each_entry_continue_reverse (entry2, list, head)
  173 + ww_mutex_unlock(&entry2->obj->lock);
  174 +
  175 + if (ret != -EDEADLK) {
  176 + ww_acquire_fini(ctx);
  177 + return ret;
  178 + }
  179 +
  180 + /* we lost out in a seqno race, lock and retry.. */
  181 + ww_mutex_lock_slow(&entry->obj->lock, ctx);
  182 +
  183 + /*
  184 + * Move this entry to the head of the list so that
  185 + * entry->next points to the first unlocked entry,
  186 + * restarting the for loop.
  187 + */
  188 + list_del(&entry->head);
  189 + list_add(&entry->head, list);
  190 + }
  191 + }
  192 +
  193 + ww_acquire_done(ctx);
  194 + return 0;
  195 +}
  196 +
  197 +Unlocking works the same way for both methods #1 and #2:
  198 +
  199 +void unlock_objs(struct list_head *list, struct ww_acquire_ctx *ctx)
  200 +{
  201 + struct obj_entry *entry;
  202 +
  203 + list_for_each_entry (entry, list, head)
  204 + ww_mutex_unlock(&entry->obj->lock);
  205 +
  206 + ww_acquire_fini(ctx);
  207 +}
  208 +
  209 +Method 3 is useful if the list of objects is constructed ad-hoc and not upfront,
  210 +e.g. when adjusting edges in a graph where each node has its own ww_mutex lock,
  211 +and edges can only be changed when holding the locks of all involved nodes. w/w
  212 +mutexes are a natural fit for such a case for two reasons:
  213 +- They can handle lock-acquisition in any order, which allows us to start walking
  214 + a graph from a starting point and then iteratively discovering new edges and
  215 + locking down the nodes those edges connect to.
  216 +- Due to the -EALREADY return code signalling that a given object is already
  217 + held there's no need for additional book-keeping to break cycles in the graph
  218 + or keep track of which locks are already held (when using more than one node
  219 + as a starting point).
  220 +
  221 +Note that this approach differs in two important ways from the above methods:
  222 +- Since the list of objects is dynamically constructed (and might very well be
  223 + different when retrying due to hitting the -EDEADLK wound condition) there's
  224 + no need to keep any object on a persistent list when it's not locked. We can
  225 + therefore move the list_head into the object itself.
  226 +- On the other hand the dynamic object list construction also means that the
  227 + -EALREADY return code can't be propagated.
  228 +
  229 +Note also that methods #1 and #2 can be combined with method #3, e.g. to first
  230 +lock a list of starting nodes (passed in from userspace) using one of the above
  231 +methods, and then lock any additional objects affected by the operations using
  232 +method #3 below. The backoff/retry procedure will be a bit more involved, since
  233 +when the dynamic locking step hits -EDEADLK we also need to unlock all the
  234 +objects acquired with the fixed list. But the w/w mutex debug checks will catch
  235 +any interface misuse for these cases.
  236 +
  237 +Also, method #3 can't fail the lock acquisition step, since -EALREADY is
  238 +handled internally rather than returned to the caller. This would be different
  239 +when using the _interruptible variants, but that is outside the scope here.
  240 +
  241 +struct obj {
  242 + struct ww_mutex ww_mutex;
  243 + struct list_head locked_list;
  244 +};
  245 +
  246 +static DEFINE_WW_CLASS(ww_class);
  247 +
  248 +void __unlock_objs(struct list_head *list)
  249 +{
  250 + struct obj *entry, *temp;
  251 +
  252 + list_for_each_entry_safe (entry, temp, list, locked_list) {
  253 + /* need to do that before unlocking, since only the current lock holder is
  254 + allowed to use object */
  255 + list_del(&entry->locked_list);
  256 + ww_mutex_unlock(&entry->ww_mutex);
  257 + }
  258 +}
  259 +
  260 +void lock_objs(struct list_head *list, struct ww_acquire_ctx *ctx)
  261 +{
  262 + struct obj *obj;
  263 +
  264 + ww_acquire_init(ctx, &ww_class);
  265 +
  266 +retry:
  267 + /* re-init loop start state */
  268 + loop {
  269 + /* magic code which walks over a graph and decides which objects
  270 + * to lock */
  271 +
  272 + ret = ww_mutex_lock(&obj->ww_mutex, ctx);
  273 + if (ret == -EALREADY) {
  274 + /* we have that one already, get to the next object */
  275 + continue;
  276 + }
  277 + if (ret == -EDEADLK) {
  278 + __unlock_objs(list);
  279 +
  280 + ww_mutex_lock_slow(&obj->ww_mutex, ctx);
  281 + list_add(&obj->locked_list, list);
  282 + goto retry;
  283 + }
  284 +
  285 + /* locked a new object, add it to the list */
  286 + list_add_tail(&obj->locked_list, list);
  287 + }
  288 +
  289 + ww_acquire_done(ctx);
  290 + return;
  291 +}
  292 +
  293 +void unlock_objs(struct list_head *list, struct ww_acquire_ctx *ctx)
  294 +{
  295 + __unlock_objs(list);
  296 + ww_acquire_fini(ctx);
  297 +}
  298 +
  299 +Method 4: Only lock one single object. In that case deadlock detection and
  300 +prevention is obviously overkill, since with grabbing just one lock you can't
  301 +produce a deadlock within just one class. To simplify this case the w/w mutex
  302 +api can be used with a NULL context.
  303 +
  304 +Implementation Details
  305 +----------------------
  306 +
  307 +Design:
  308 + ww_mutex currently encapsulates a struct mutex; this means no extra overhead for
  309 + normal mutex locks, which are far more common. As such there is only a small
  310 + increase in code size if wait/wound mutexes are not used.
  311 +
  312 + In general, not much contention is expected. The locks are typically used to
  313 + serialize access to resources for devices. Making the wakeups
  314 + smarter would come at the cost of adding a field to struct mutex_waiter. This
  315 + would add overhead to all cases where normal mutexes are used, and
  316 + ww_mutexes are generally less performance sensitive.
  317 +
  318 +Lockdep:
  319 + Special care has been taken to warn for as many cases of api abuse
  320 + as possible. Some common api abuses will be caught with
  321 + CONFIG_DEBUG_MUTEXES, but CONFIG_PROVE_LOCKING is recommended.
  322 +
  323 + Some of the errors which will be warned about:
  324 + - Forgetting to call ww_acquire_fini or ww_acquire_init.
  325 + - Attempting to lock more mutexes after ww_acquire_done.
  326 + - Attempting to lock the wrong mutex after -EDEADLK and
  327 + unlocking all mutexes.
  328 + - Attempting to lock the right mutex after -EDEADLK,
  329 + before unlocking all mutexes.
  330 +
  331 + - Calling ww_mutex_lock_slow before -EDEADLK was returned.
  332 +
  333 + - Unlocking mutexes with the wrong unlock function.
  334 + - Calling one of the ww_acquire_* functions twice on the same context.
  335 + - Using a different ww_class for the mutex than for the ww_acquire_ctx.
  336 + - Normal lockdep errors that can result in deadlocks.
  337 +
  338 + Some of the lockdep errors that can result in deadlocks:
  339 + - Calling ww_acquire_init to initialize a second ww_acquire_ctx before
  340 + having called ww_acquire_fini on the first.
  341 + - 'normal' deadlocks that can occur.
  342 +
  343 +FIXME: Update this section once we have the TASK_DEADLOCK task state flag magic
  344 +implemented.
include/linux/mutex-debug.h
... ... @@ -3,6 +3,7 @@
3 3  
4 4 #include <linux/linkage.h>
5 5 #include <linux/lockdep.h>
  6 +#include <linux/debug_locks.h>
6 7  
7 8 /*
8 9 * Mutexes - debugging helpers:
include/linux/mutex.h
... ... @@ -10,6 +10,7 @@
10 10 #ifndef __LINUX_MUTEX_H
11 11 #define __LINUX_MUTEX_H
12 12  
  13 +#include <asm/current.h>
13 14 #include <linux/list.h>
14 15 #include <linux/spinlock_types.h>
15 16 #include <linux/linkage.h>
16 17  
... ... @@ -77,7 +78,37 @@
77 78 #endif
78 79 };
79 80  
  81 +struct ww_class {
  82 + atomic_long_t stamp;
  83 + struct lock_class_key acquire_key;
  84 + struct lock_class_key mutex_key;
  85 + const char *acquire_name;
  86 + const char *mutex_name;
  87 +};
  88 +
  89 +struct ww_acquire_ctx {
  90 + struct task_struct *task;
  91 + unsigned long stamp;
  92 + unsigned acquired;
80 93 #ifdef CONFIG_DEBUG_MUTEXES
  94 + unsigned done_acquire;
  95 + struct ww_class *ww_class;
  96 + struct ww_mutex *contending_lock;
  97 +#endif
  98 +#ifdef CONFIG_DEBUG_LOCK_ALLOC
  99 + struct lockdep_map dep_map;
  100 +#endif
  101 +};
  102 +
  103 +struct ww_mutex {
  104 + struct mutex base;
  105 + struct ww_acquire_ctx *ctx;
  106 +#ifdef CONFIG_DEBUG_MUTEXES
  107 + struct ww_class *ww_class;
  108 +#endif
  109 +};
  110 +
  111 +#ifdef CONFIG_DEBUG_MUTEXES
81 112 # include <linux/mutex-debug.h>
82 113 #else
83 114 # define __DEBUG_MUTEX_INITIALIZER(lockname)
84 115  
... ... @@ -101,8 +132,11 @@
101 132 #ifdef CONFIG_DEBUG_LOCK_ALLOC
102 133 # define __DEP_MAP_MUTEX_INITIALIZER(lockname) \
103 134 , .dep_map = { .name = #lockname }
  135 +# define __WW_CLASS_MUTEX_INITIALIZER(lockname, ww_class) \
  136 + , .ww_class = &ww_class
104 137 #else
105 138 # define __DEP_MAP_MUTEX_INITIALIZER(lockname)
  139 +# define __WW_CLASS_MUTEX_INITIALIZER(lockname, ww_class)
106 140 #endif
107 141  
108 142 #define __MUTEX_INITIALIZER(lockname) \
109 143  
110 144  
... ... @@ -112,13 +146,49 @@
112 146 __DEBUG_MUTEX_INITIALIZER(lockname) \
113 147 __DEP_MAP_MUTEX_INITIALIZER(lockname) }
114 148  
  149 +#define __WW_CLASS_INITIALIZER(ww_class) \
  150 + { .stamp = ATOMIC_LONG_INIT(0) \
  151 + , .acquire_name = #ww_class "_acquire" \
  152 + , .mutex_name = #ww_class "_mutex" }
  153 +
  154 +#define __WW_MUTEX_INITIALIZER(lockname, class) \
  155 + { .base = __MUTEX_INITIALIZER(lockname.base) \
  156 + __WW_CLASS_MUTEX_INITIALIZER(lockname, class) }
  157 +
115 158 #define DEFINE_MUTEX(mutexname) \
116 159 struct mutex mutexname = __MUTEX_INITIALIZER(mutexname)
117 160  
  161 +#define DEFINE_WW_CLASS(classname) \
  162 + struct ww_class classname = __WW_CLASS_INITIALIZER(classname)
  163 +
  164 +#define DEFINE_WW_MUTEX(mutexname, ww_class) \
  165 + struct ww_mutex mutexname = __WW_MUTEX_INITIALIZER(mutexname, ww_class)
  166 +
  167 +
118 168 extern void __mutex_init(struct mutex *lock, const char *name,
119 169 struct lock_class_key *key);
120 170  
121 171 /**
  172 + * ww_mutex_init - initialize the w/w mutex
  173 + * @lock: the mutex to be initialized
  174 + * @ww_class: the w/w class the mutex should belong to
  175 + *
  176 + * Initialize the w/w mutex to unlocked state and associate it with the given
  177 + * class.
  178 + *
  179 + * It is not allowed to initialize an already locked mutex.
  180 + */
  181 +static inline void ww_mutex_init(struct ww_mutex *lock,
  182 + struct ww_class *ww_class)
  183 +{
  184 + __mutex_init(&lock->base, ww_class->mutex_name, &ww_class->mutex_key);
  185 + lock->ctx = NULL;
  186 +#ifdef CONFIG_DEBUG_MUTEXES
  187 + lock->ww_class = ww_class;
  188 +#endif
  189 +}
  190 +
  191 +/**
122 192 * mutex_is_locked - is the mutex locked
123 193 * @lock: the mutex to be queried
124 194 *
... ... @@ -136,6 +206,7 @@
136 206 #ifdef CONFIG_DEBUG_LOCK_ALLOC
137 207 extern void mutex_lock_nested(struct mutex *lock, unsigned int subclass);
138 208 extern void _mutex_lock_nest_lock(struct mutex *lock, struct lockdep_map *nest_lock);
  209 +
139 210 extern int __must_check mutex_lock_interruptible_nested(struct mutex *lock,
140 211 unsigned int subclass);
141 212 extern int __must_check mutex_lock_killable_nested(struct mutex *lock,
... ... @@ -147,7 +218,7 @@
147 218  
148 219 #define mutex_lock_nest_lock(lock, nest_lock) \
149 220 do { \
150   - typecheck(struct lockdep_map *, &(nest_lock)->dep_map); \
  221 + typecheck(struct lockdep_map *, &(nest_lock)->dep_map); \
151 222 _mutex_lock_nest_lock(lock, &(nest_lock)->dep_map); \
152 223 } while (0)
153 224  
... ... @@ -170,6 +241,288 @@
170 241 */
171 242 extern int mutex_trylock(struct mutex *lock);
172 243 extern void mutex_unlock(struct mutex *lock);
  244 +
  245 +/**
  246 + * ww_acquire_init - initialize a w/w acquire context
  247 + * @ctx: w/w acquire context to initialize
  248 + * @ww_class: w/w class of the context
  249 + *
  250 + * Initializes a context to acquire multiple mutexes of the given w/w class.
  251 + *
  252 + * Context-based w/w mutex acquiring can be done in any order whatsoever within
  253 + * a given lock class. Deadlocks will be detected and handled with the
  254 + * wait/wound logic.
  255 + *
  256 + * Mixing of context-based w/w mutex acquiring and single w/w mutex locking can
  257 + * result in undetected deadlocks and is hence forbidden. Mixing different contexts
  258 + * for the same w/w class when acquiring mutexes can also result in undetected
  259 + * deadlocks, and is hence also forbidden. Both types of abuse will be caught by
  260 + * enabling CONFIG_PROVE_LOCKING.
  261 + *
  262 + * Nesting of acquire contexts for _different_ w/w classes is possible, subject
  263 + * to the usual locking rules between different lock classes.
  264 + *
  265 + * An acquire context must be released with ww_acquire_fini by the same task
  266 + * before the memory is freed. It is recommended to allocate the context itself
  267 + * on the stack.
  268 + */
  269 +static inline void ww_acquire_init(struct ww_acquire_ctx *ctx,
  270 + struct ww_class *ww_class)
  271 +{
  272 + ctx->task = current;
  273 + ctx->stamp = atomic_long_inc_return(&ww_class->stamp);
  274 + ctx->acquired = 0;
  275 +#ifdef CONFIG_DEBUG_MUTEXES
  276 + ctx->ww_class = ww_class;
  277 + ctx->done_acquire = 0;
  278 + ctx->contending_lock = NULL;
  279 +#endif
  280 +#ifdef CONFIG_DEBUG_LOCK_ALLOC
  281 + debug_check_no_locks_freed((void *)ctx, sizeof(*ctx));
  282 + lockdep_init_map(&ctx->dep_map, ww_class->acquire_name,
  283 + &ww_class->acquire_key, 0);
  284 + mutex_acquire(&ctx->dep_map, 0, 0, _RET_IP_);
  285 +#endif
  286 +}
  287 +
  288 +/**
  289 + * ww_acquire_done - marks the end of the acquire phase
  290 + * @ctx: the acquire context
  291 + *
  292 + * Marks the end of the acquire phase; any further w/w mutex lock calls using
  293 + * this context are forbidden.
  294 + *
  295 + * Calling this function is optional; it is just useful to document w/w mutex
  296 + * code and clearly separate the acquire phase from actually using the locked
  297 + * data structures.
  298 + */
  299 +static inline void ww_acquire_done(struct ww_acquire_ctx *ctx)
  300 +{
  301 +#ifdef CONFIG_DEBUG_MUTEXES
  302 + lockdep_assert_held(ctx);
  303 +
  304 + DEBUG_LOCKS_WARN_ON(ctx->done_acquire);
  305 + ctx->done_acquire = 1;
  306 +#endif
  307 +}
  308 +
  309 +/**
  310 + * ww_acquire_fini - releases a w/w acquire context
  311 + * @ctx: the acquire context to free
  312 + *
  313 + * Releases a w/w acquire context. This must be called _after_ all acquired w/w
  314 + * mutexes have been released with ww_mutex_unlock.
  315 + */
  316 +static inline void ww_acquire_fini(struct ww_acquire_ctx *ctx)
  317 +{
  318 +#ifdef CONFIG_DEBUG_MUTEXES
  319 + mutex_release(&ctx->dep_map, 0, _THIS_IP_);
  320 +
  321 + DEBUG_LOCKS_WARN_ON(ctx->acquired);
  322 + if (!config_enabled(CONFIG_PROVE_LOCKING))
  323 + /*
  324 + * lockdep will normally handle this,
  325 + * but fail without it anyway
  326 + */
  327 + ctx->done_acquire = 1;
  328 +
  329 + if (!config_enabled(CONFIG_DEBUG_LOCK_ALLOC))
  330 + /* ensure ww_acquire_fini will still fail if called twice */
  331 + ctx->acquired = ~0U;
  332 +#endif
  333 +}
  334 +
  335 +extern int __must_check __ww_mutex_lock(struct ww_mutex *lock,
  336 + struct ww_acquire_ctx *ctx);
  337 +extern int __must_check __ww_mutex_lock_interruptible(struct ww_mutex *lock,
  338 + struct ww_acquire_ctx *ctx);
  339 +
  340 +/**
  341 + * ww_mutex_lock - acquire the w/w mutex
  342 + * @lock: the mutex to be acquired
  343 + * @ctx: w/w acquire context, or NULL to acquire only a single lock.
  344 + *
  345 + * Lock the w/w mutex exclusively for this task.
  346 + *
  347 + * Deadlocks within a given w/w class of locks are detected and handled with the
  348 + * wait/wound algorithm. If the lock isn't immediately available this function
  349 + * will either sleep until it is (wait case) or select the current context
  350 + * for backing off by returning -EDEADLK (wound case). Trying to acquire the
  351 + * same lock with the same context twice is also detected and signalled by
  352 + * returning -EALREADY. Returns 0 if the mutex was successfully acquired.
  353 + *
  354 + * In the wound case the caller must release all currently held w/w mutexes for
  355 + * the given context and then wait for this contending lock to be available by
  356 + * calling ww_mutex_lock_slow. Alternatively callers can opt to not acquire this
  357 + * lock and proceed with trying to acquire further w/w mutexes (e.g. when
  358 + * scanning through lru lists trying to free resources).
  359 + *
  360 + * The mutex must later on be released by the same task that
  361 + * acquired it. The task may not exit without first unlocking the mutex. Also,
  362 + * kernel memory where the mutex resides must not be freed with the mutex still
  363 + * locked. The mutex must first be initialized (or statically defined) before it
  364 + * can be locked. memset()-ing the mutex to 0 is not allowed. The mutex must be
  365 + * of the same w/w lock class as was used to initialize the acquire context.
  366 + *
  367 + * A mutex acquired with this function must be released with ww_mutex_unlock.
  368 + */
  369 +static inline int ww_mutex_lock(struct ww_mutex *lock, struct ww_acquire_ctx *ctx)
  370 +{
  371 + if (ctx)
  372 + return __ww_mutex_lock(lock, ctx);
  373 + else {
  374 + mutex_lock(&lock->base);
  375 + return 0;
  376 + }
  377 +}
  378 +
  379 +/**
  380 + * ww_mutex_lock_interruptible - acquire the w/w mutex, interruptible
  381 + * @lock: the mutex to be acquired
  382 + * @ctx: w/w acquire context
  383 + *
  384 + * Lock the w/w mutex exclusively for this task.
  385 + *
  386 + * Deadlocks within a given w/w class of locks are detected and handled with the
  387 + * wait/wound algorithm. If the lock isn't immediately available this function
  388 + * will either sleep until it is (wait case) or select the current context
  389 + * for backing off by returning -EDEADLK (wound case). Trying to acquire the
  390 + * same lock with the same context twice is also detected and signalled by
  391 + * returning -EALREADY. Returns 0 if the mutex was successfully acquired. If a
  392 + * signal arrives while waiting for the lock then this function returns -EINTR.
  393 + *
  394 + * In the wound case the caller must release all currently held w/w mutexes for
  395 + * the given context and then wait for this contending lock to be available by
  396 + * calling ww_mutex_lock_slow_interruptible. Alternatively callers can opt to
  397 + * not acquire this lock and proceed with trying to acquire further w/w mutexes
  398 + * (e.g. when scanning through lru lists trying to free resources).
  399 + *
  400 + * The mutex must later on be released by the same task that
  401 + * acquired it. The task may not exit without first unlocking the mutex. Also,
  402 + * kernel memory where the mutex resides must not be freed with the mutex still
  403 + * locked. The mutex must first be initialized (or statically defined) before it
  404 + * can be locked. memset()-ing the mutex to 0 is not allowed. The mutex must be
  405 + * of the same w/w lock class as was used to initialize the acquire context.
  406 + *
  407 + * A mutex acquired with this function must be released with ww_mutex_unlock.
  408 + */
  409 +static inline int __must_check ww_mutex_lock_interruptible(struct ww_mutex *lock,
  410 + struct ww_acquire_ctx *ctx)
  411 +{
  412 + if (ctx)
  413 + return __ww_mutex_lock_interruptible(lock, ctx);
  414 + else
  415 + return mutex_lock_interruptible(&lock->base);
  416 +}
  417 +
  418 +/**
  419 + * ww_mutex_lock_slow - slowpath acquiring of the w/w mutex
  420 + * @lock: the mutex to be acquired
  421 + * @ctx: w/w acquire context
  422 + *
  423 + * Acquires a w/w mutex with the given context after a wound case. This function
  424 + * will sleep until the lock becomes available.
  425 + *
  426 + * The caller must have released all w/w mutexes already acquired with the
  427 + * context and then call this function on the contended lock.
  428 + *
  429 + * Afterwards the caller may continue to (re)acquire the other w/w mutexes it
  430 + * needs with ww_mutex_lock. Note that the -EALREADY return code from
  431 + * ww_mutex_lock can be used to avoid locking this contended mutex twice.
  432 + *
  433 + * It is forbidden to call this function with any other w/w mutexes associated
  434 + * with the context held. It is forbidden to call this on anything else than the
  435 + * contending mutex.
  436 + *
  437 + * Note that the slowpath lock acquiring can also be done by calling
  438 + * ww_mutex_lock directly. This function here is simply to help w/w mutex
  439 + * locking code readability by clearly denoting the slowpath.
  440 + */
  441 +static inline void
  442 +ww_mutex_lock_slow(struct ww_mutex *lock, struct ww_acquire_ctx *ctx)
  443 +{
  444 + int ret;
  445 +#ifdef CONFIG_DEBUG_MUTEXES
  446 + DEBUG_LOCKS_WARN_ON(!ctx->contending_lock);
  447 +#endif
  448 + ret = ww_mutex_lock(lock, ctx);
  449 + (void)ret;
  450 +}
  451 +
  452 +/**
  453 + * ww_mutex_lock_slow_interruptible - slowpath acquiring of the w/w mutex,
  454 + * interruptible
  455 + * @lock: the mutex to be acquired
  456 + * @ctx: w/w acquire context
  457 + *
  458 + * Acquires a w/w mutex with the given context after a wound case. This function
  459 + * will sleep until the lock becomes available and returns 0 when the lock has
  460 + * been acquired. If a signal arrives while waiting for the lock then this
  461 + * function returns -EINTR.
  462 + *
  463 + * The caller must have released all w/w mutexes already acquired with the
  464 + * context and then call this function on the contended lock.
  465 + *
  466 + * Afterwards the caller may continue to (re)acquire the other w/w mutexes it
  467 + * needs with ww_mutex_lock. Note that the -EALREADY return code from
  468 + * ww_mutex_lock can be used to avoid locking this contended mutex twice.
  469 + *
  470 + * It is forbidden to call this function with any other w/w mutexes associated
  471 + * with the given context held. It is forbidden to call this on anything else
  472 + * than the contending mutex.
  473 + *
  474 + * Note that the slowpath lock acquiring can also be done by calling
  475 + * ww_mutex_lock_interruptible directly. This function here is simply to help
  476 + * w/w mutex locking code readability by clearly denoting the slowpath.
  477 + */
  478 +static inline int __must_check
  479 +ww_mutex_lock_slow_interruptible(struct ww_mutex *lock,
  480 + struct ww_acquire_ctx *ctx)
  481 +{
  482 +#ifdef CONFIG_DEBUG_MUTEXES
  483 + DEBUG_LOCKS_WARN_ON(!ctx->contending_lock);
  484 +#endif
  485 + return ww_mutex_lock_interruptible(lock, ctx);
  486 +}
  487 +
  488 +extern void ww_mutex_unlock(struct ww_mutex *lock);
  489 +
  490 +/**
  491 + * ww_mutex_trylock - tries to acquire the w/w mutex without acquire context
  492 + * @lock: mutex to lock
  493 + *
  494 + * Trylocks a mutex without acquire context, so no deadlock detection is
  495 + * possible. Returns 1 if the mutex has been acquired successfully, 0 otherwise.
  496 + */
  497 +static inline int __must_check ww_mutex_trylock(struct ww_mutex *lock)
  498 +{
  499 + return mutex_trylock(&lock->base);
  500 +}
  501 +
  502 +/**
  503 + * ww_mutex_destroy - mark a w/w mutex unusable
  504 + * @lock: the mutex to be destroyed
  505 + *
  506 + * This function marks the mutex uninitialized, and any subsequent
  507 + * use of the mutex is forbidden. The mutex must not be locked when
  508 + * this function is called.
  509 + */
  510 +static inline void ww_mutex_destroy(struct ww_mutex *lock)
  511 +{
  512 + mutex_destroy(&lock->base);
  513 +}
  514 +
  515 +/**
  516 + * ww_mutex_is_locked - is the w/w mutex locked
  517 + * @lock: the mutex to be queried
  518 + *
  519 + * Returns 1 if the mutex is locked, 0 if unlocked.
  520 + */
  521 +static inline bool ww_mutex_is_locked(struct ww_mutex *lock)
  522 +{
  523 + return mutex_is_locked(&lock->base);
  524 +}
  525 +
173 526 extern int atomic_dec_and_mutex_lock(atomic_t *cnt, struct mutex *lock);
174 527  
175 528 #ifndef CONFIG_HAVE_ARCH_MUTEX_CPU_RELAX
... ... @@ -254,16 +254,165 @@
254 254  
255 255 EXPORT_SYMBOL(mutex_unlock);
256 256  
  257 +/**
  258 + * ww_mutex_unlock - release the w/w mutex
  259 + * @lock: the mutex to be released
  260 + *
  261 + * Unlock a mutex that has been locked by this task previously with any of the
  262 + * ww_mutex_lock* functions (with or without an acquire context). It is
  263 + * forbidden to release the locks after releasing the acquire context.
  264 + *
  265 + * This function must not be used in interrupt context. Unlocking
  266 + * of an unlocked mutex is not allowed.
  267 + */
  268 +void __sched ww_mutex_unlock(struct ww_mutex *lock)
  269 +{
  270 + /*
  271 + * The unlocking fastpath is the 0->1 transition from 'locked'
  272 + * into 'unlocked' state:
  273 + */
  274 + if (lock->ctx) {
  275 +#ifdef CONFIG_DEBUG_MUTEXES
  276 + DEBUG_LOCKS_WARN_ON(!lock->ctx->acquired);
  277 +#endif
  278 + if (lock->ctx->acquired > 0)
  279 + lock->ctx->acquired--;
  280 + lock->ctx = NULL;
  281 + }
  282 +
  283 +#ifndef CONFIG_DEBUG_MUTEXES
  284 + /*
  285 + * When debugging is enabled we must not clear the owner before time,
  286 + * the slow path will always be taken, and that clears the owner field
  287 + * after verifying that it was indeed current.
  288 + */
  289 + mutex_clear_owner(&lock->base);
  290 +#endif
  291 + __mutex_fastpath_unlock(&lock->base.count, __mutex_unlock_slowpath);
  292 +}
  293 +EXPORT_SYMBOL(ww_mutex_unlock);
  294 +
  295 +static inline int __sched
  296 +__mutex_lock_check_stamp(struct mutex *lock, struct ww_acquire_ctx *ctx)
  297 +{
  298 + struct ww_mutex *ww = container_of(lock, struct ww_mutex, base);
  299 + struct ww_acquire_ctx *hold_ctx = ACCESS_ONCE(ww->ctx);
  300 +
  301 + if (!hold_ctx)
  302 + return 0;
  303 +
  304 + if (unlikely(ctx == hold_ctx))
  305 + return -EALREADY;
  306 +
  307 + if (ctx->stamp - hold_ctx->stamp <= LONG_MAX &&
  308 + (ctx->stamp != hold_ctx->stamp || ctx > hold_ctx)) {
  309 +#ifdef CONFIG_DEBUG_MUTEXES
  310 + DEBUG_LOCKS_WARN_ON(ctx->contending_lock);
  311 + ctx->contending_lock = ww;
  312 +#endif
  313 + return -EDEADLK;
  314 + }
  315 +
  316 + return 0;
  317 +}
  318 +
  319 +static __always_inline void ww_mutex_lock_acquired(struct ww_mutex *ww,
  320 + struct ww_acquire_ctx *ww_ctx)
  321 +{
  322 +#ifdef CONFIG_DEBUG_MUTEXES
  323 + /*
  324 + * If this WARN_ON triggers, you used ww_mutex_lock to acquire,
  325 + * but released with a normal mutex_unlock in this call.
  326 + *
  327 + * This should never happen, always use ww_mutex_unlock.
  328 + */
  329 + DEBUG_LOCKS_WARN_ON(ww->ctx);
  330 +
  331 + /*
  332 + * Not quite done after calling ww_acquire_done() ?
  333 + */
  334 + DEBUG_LOCKS_WARN_ON(ww_ctx->done_acquire);
  335 +
  336 + if (ww_ctx->contending_lock) {
  337 + /*
  338 + * After -EDEADLK you tried to
  339 + * acquire a different ww_mutex? Bad!
  340 + */
  341 + DEBUG_LOCKS_WARN_ON(ww_ctx->contending_lock != ww);
  342 +
  343 + /*
  344 + * You called ww_mutex_lock after receiving -EDEADLK,
  345 + * but 'forgot' to unlock everything else first?
  346 + */
  347 + DEBUG_LOCKS_WARN_ON(ww_ctx->acquired > 0);
  348 + ww_ctx->contending_lock = NULL;
  349 + }
  350 +
  351 + /*
  352 + * Naughty, using a different class will lead to undefined behavior!
  353 + */
  354 + DEBUG_LOCKS_WARN_ON(ww_ctx->ww_class != ww->ww_class);
  355 +#endif
  356 + ww_ctx->acquired++;
  357 +}
  358 +
257 359 /*
  360 + * after acquiring lock with fastpath or when we lost out in contested
  361 + * slowpath, set ctx and wake up any waiters so they can recheck.
  362 + *
  363 + * This function is never called when CONFIG_DEBUG_LOCK_ALLOC is set,
  364 + * as the fastpath and opportunistic spinning are disabled in that case.
  365 + */
  366 +static __always_inline void
  367 +ww_mutex_set_context_fastpath(struct ww_mutex *lock,
  368 + struct ww_acquire_ctx *ctx)
  369 +{
  370 + unsigned long flags;
  371 + struct mutex_waiter *cur;
  372 +
  373 + ww_mutex_lock_acquired(lock, ctx);
  374 +
  375 + lock->ctx = ctx;
  376 +
  377 + /*
  378 + * The lock->ctx update should be visible on all cores before
  379 + * the atomic read is done, otherwise contended waiters might be
  380 + * missed. The contended waiters will either see ww_ctx == NULL
  381 + * and keep spinning, or they will acquire wait_lock, add themselves
  382 + * to the waiter list and sleep.
  383 + */
  384 + smp_mb(); /* ^^^ */
  385 +
  386 + /*
  387 + * Check if lock is contended, if not there is nobody to wake up
  388 + */
  389 + if (likely(atomic_read(&lock->base.count) == 0))
  390 + return;
  391 +
  392 + /*
  393 + * Uh oh, we raced in fastpath, wake up everyone in this case,
  394 + * so they can see the new lock->ctx.
  395 + */
  396 + spin_lock_mutex(&lock->base.wait_lock, flags);
  397 + list_for_each_entry(cur, &lock->base.wait_list, list) {
  398 + debug_mutex_wake_waiter(&lock->base, cur);
  399 + wake_up_process(cur->task);
  400 + }
  401 + spin_unlock_mutex(&lock->base.wait_lock, flags);
  402 +}
  403 +
  404 +/*
258 405 * Lock a mutex (possibly interruptible), slowpath:
259 406 */
260   -static inline int __sched
  407 +static __always_inline int __sched
261 408 __mutex_lock_common(struct mutex *lock, long state, unsigned int subclass,
262   - struct lockdep_map *nest_lock, unsigned long ip)
  409 + struct lockdep_map *nest_lock, unsigned long ip,
  410 + struct ww_acquire_ctx *ww_ctx)
263 411 {
264 412 struct task_struct *task = current;
265 413 struct mutex_waiter waiter;
266 414 unsigned long flags;
  415 + int ret;
267 416  
268 417 preempt_disable();
269 418 mutex_acquire_nest(&lock->dep_map, subclass, 0, nest_lock, ip);
... ... @@ -298,6 +447,22 @@
298 447 struct task_struct *owner;
299 448 struct mspin_node node;
300 449  
  450 + if (!__builtin_constant_p(ww_ctx == NULL) && ww_ctx->acquired > 0) {
  451 + struct ww_mutex *ww;
  452 +
  453 + ww = container_of(lock, struct ww_mutex, base);
  454 + /*
  455 + * If ww->ctx is set the contents are undefined; only
  456 + * by acquiring wait_lock is there a guarantee that
  457 + * they are not invalid when reading.
  458 + *
  459 + * As such, when deadlock detection needs to be
  460 + * performed the optimistic spinning cannot be done.
  461 + */
  462 + if (ACCESS_ONCE(ww->ctx))
  463 + break;
  464 + }
  465 +
301 466 /*
302 467 * If there's an owner, wait for it to either
303 468 * release the lock or go to sleep.
... ... @@ -312,6 +477,13 @@
312 477 if ((atomic_read(&lock->count) == 1) &&
313 478 (atomic_cmpxchg(&lock->count, 1, 0) == 1)) {
314 479 lock_acquired(&lock->dep_map, ip);
  480 + if (!__builtin_constant_p(ww_ctx == NULL)) {
  481 + struct ww_mutex *ww;
  482 + ww = container_of(lock, struct ww_mutex, base);
  483 +
  484 + ww_mutex_set_context_fastpath(ww, ww_ctx);
  485 + }
  486 +
315 487 mutex_set_owner(lock);
316 488 mspin_unlock(MLOCK(lock), &node);
317 489 preempt_enable();
318 490  
319 491  
... ... @@ -371,15 +543,16 @@
371 543 * TASK_UNINTERRUPTIBLE case.)
372 544 */
373 545 if (unlikely(signal_pending_state(state, task))) {
374   - mutex_remove_waiter(lock, &waiter,
375   - task_thread_info(task));
376   - mutex_release(&lock->dep_map, 1, ip);
377   - spin_unlock_mutex(&lock->wait_lock, flags);
  546 + ret = -EINTR;
  547 + goto err;
  548 + }
378 549  
379   - debug_mutex_free_waiter(&waiter);
380   - preempt_enable();
381   - return -EINTR;
  550 + if (!__builtin_constant_p(ww_ctx == NULL) && ww_ctx->acquired > 0) {
  551 + ret = __mutex_lock_check_stamp(lock, ww_ctx);
  552 + if (ret)
  553 + goto err;
382 554 }
  555 +
383 556 __set_task_state(task, state);
384 557  
385 558 /* didn't get the lock, go to sleep: */
... ... @@ -394,6 +567,30 @@
394 567 mutex_remove_waiter(lock, &waiter, current_thread_info());
395 568 mutex_set_owner(lock);
396 569  
  570 + if (!__builtin_constant_p(ww_ctx == NULL)) {
  571 + struct ww_mutex *ww = container_of(lock,
  572 + struct ww_mutex,
  573 + base);
  574 + struct mutex_waiter *cur;
  575 +
  576 + /*
  577 + * This branch gets optimized out for the common case,
  578 + * and is only important for ww_mutex_lock.
  579 + */
  580 +
  581 + ww_mutex_lock_acquired(ww, ww_ctx);
  582 + ww->ctx = ww_ctx;
  583 +
  584 + /*
  585 + * Give any possible sleeping processes the chance to wake up,
  586 + * so they can recheck if they have to back off.
  587 + */
  588 + list_for_each_entry(cur, &lock->wait_list, list) {
  589 + debug_mutex_wake_waiter(lock, cur);
  590 + wake_up_process(cur->task);
  591 + }
  592 + }
  593 +
397 594 /* set it to 0 if there are no waiters left: */
398 595 if (likely(list_empty(&lock->wait_list)))
399 596 atomic_set(&lock->count, 0);
... ... @@ -404,6 +601,14 @@
404 601 preempt_enable();
405 602  
406 603 return 0;
  604 +
  605 +err:
  606 + mutex_remove_waiter(lock, &waiter, task_thread_info(task));
  607 + spin_unlock_mutex(&lock->wait_lock, flags);
  608 + debug_mutex_free_waiter(&waiter);
  609 + mutex_release(&lock->dep_map, 1, ip);
  610 + preempt_enable();
  611 + return ret;
407 612 }
408 613  
409 614 #ifdef CONFIG_DEBUG_LOCK_ALLOC
... ... @@ -411,7 +616,8 @@
411 616 mutex_lock_nested(struct mutex *lock, unsigned int subclass)
412 617 {
413 618 might_sleep();
414   - __mutex_lock_common(lock, TASK_UNINTERRUPTIBLE, subclass, NULL, _RET_IP_);
  619 + __mutex_lock_common(lock, TASK_UNINTERRUPTIBLE,
  620 + subclass, NULL, _RET_IP_, NULL);
415 621 }
416 622  
417 623 EXPORT_SYMBOL_GPL(mutex_lock_nested);
... ... @@ -420,7 +626,8 @@
420 626 _mutex_lock_nest_lock(struct mutex *lock, struct lockdep_map *nest)
421 627 {
422 628 might_sleep();
423   - __mutex_lock_common(lock, TASK_UNINTERRUPTIBLE, 0, nest, _RET_IP_);
  629 + __mutex_lock_common(lock, TASK_UNINTERRUPTIBLE,
  630 + 0, nest, _RET_IP_, NULL);
424 631 }
425 632  
426 633 EXPORT_SYMBOL_GPL(_mutex_lock_nest_lock);
... ... @@ -429,7 +636,8 @@
429 636 mutex_lock_killable_nested(struct mutex *lock, unsigned int subclass)
430 637 {
431 638 might_sleep();
432   - return __mutex_lock_common(lock, TASK_KILLABLE, subclass, NULL, _RET_IP_);
  639 + return __mutex_lock_common(lock, TASK_KILLABLE,
  640 + subclass, NULL, _RET_IP_, NULL);
433 641 }
434 642 EXPORT_SYMBOL_GPL(mutex_lock_killable_nested);
435 643  
436 644  
... ... @@ -438,10 +646,30 @@
438 646 {
439 647 might_sleep();
440 648 return __mutex_lock_common(lock, TASK_INTERRUPTIBLE,
441   - subclass, NULL, _RET_IP_);
  649 + subclass, NULL, _RET_IP_, NULL);
442 650 }
443 651  
444 652 EXPORT_SYMBOL_GPL(mutex_lock_interruptible_nested);
  653 +
  654 +
  655 +int __sched
  656 +__ww_mutex_lock(struct ww_mutex *lock, struct ww_acquire_ctx *ctx)
  657 +{
  658 + might_sleep();
  659 + return __mutex_lock_common(&lock->base, TASK_UNINTERRUPTIBLE,
  660 + 0, &ctx->dep_map, _RET_IP_, ctx);
  661 +}
  662 +EXPORT_SYMBOL_GPL(__ww_mutex_lock);
  663 +
  664 +int __sched
  665 +__ww_mutex_lock_interruptible(struct ww_mutex *lock, struct ww_acquire_ctx *ctx)
  666 +{
  667 + might_sleep();
  668 + return __mutex_lock_common(&lock->base, TASK_INTERRUPTIBLE,
  669 + 0, &ctx->dep_map, _RET_IP_, ctx);
  670 +}
  671 +EXPORT_SYMBOL_GPL(__ww_mutex_lock_interruptible);
  672 +
445 673 #endif
446 674  
447 675 /*
448 676  
449 677  
450 678  
... ... @@ -544,20 +772,39 @@
544 772 {
545 773 struct mutex *lock = container_of(lock_count, struct mutex, count);
546 774  
547   - __mutex_lock_common(lock, TASK_UNINTERRUPTIBLE, 0, NULL, _RET_IP_);
  775 + __mutex_lock_common(lock, TASK_UNINTERRUPTIBLE, 0,
  776 + NULL, _RET_IP_, NULL);
548 777 }
549 778  
550 779 static noinline int __sched
551 780 __mutex_lock_killable_slowpath(struct mutex *lock)
552 781 {
553   - return __mutex_lock_common(lock, TASK_KILLABLE, 0, NULL, _RET_IP_);
  782 + return __mutex_lock_common(lock, TASK_KILLABLE, 0,
  783 + NULL, _RET_IP_, NULL);
554 784 }
555 785  
556 786 static noinline int __sched
557 787 __mutex_lock_interruptible_slowpath(struct mutex *lock)
558 788 {
559   - return __mutex_lock_common(lock, TASK_INTERRUPTIBLE, 0, NULL, _RET_IP_);
  789 + return __mutex_lock_common(lock, TASK_INTERRUPTIBLE, 0,
  790 + NULL, _RET_IP_, NULL);
560 791 }
  792 +
  793 +static noinline int __sched
  794 +__ww_mutex_lock_slowpath(struct ww_mutex *lock, struct ww_acquire_ctx *ctx)
  795 +{
  796 + return __mutex_lock_common(&lock->base, TASK_UNINTERRUPTIBLE, 0,
  797 + NULL, _RET_IP_, ctx);
  798 +}
  799 +
  800 +static noinline int __sched
  801 +__ww_mutex_lock_interruptible_slowpath(struct ww_mutex *lock,
  802 + struct ww_acquire_ctx *ctx)
  803 +{
  804 + return __mutex_lock_common(&lock->base, TASK_INTERRUPTIBLE, 0,
  805 + NULL, _RET_IP_, ctx);
  806 +}
  807 +
561 808 #endif
562 809  
563 810 /*
... ... @@ -612,6 +859,45 @@
612 859 return ret;
613 860 }
614 861 EXPORT_SYMBOL(mutex_trylock);
  862 +
  863 +#ifndef CONFIG_DEBUG_LOCK_ALLOC
  864 +int __sched
  865 +__ww_mutex_lock(struct ww_mutex *lock, struct ww_acquire_ctx *ctx)
  866 +{
  867 + int ret;
  868 +
  869 + might_sleep();
  870 +
  871 + ret = __mutex_fastpath_lock_retval(&lock->base.count);
  872 +
  873 + if (likely(!ret)) {
  874 + ww_mutex_set_context_fastpath(lock, ctx);
  875 + mutex_set_owner(&lock->base);
  876 + } else
  877 + ret = __ww_mutex_lock_slowpath(lock, ctx);
  878 + return ret;
  879 +}
  880 +EXPORT_SYMBOL(__ww_mutex_lock);
  881 +
  882 +int __sched
  883 +__ww_mutex_lock_interruptible(struct ww_mutex *lock, struct ww_acquire_ctx *ctx)
  884 +{
  885 + int ret;
  886 +
  887 + might_sleep();
  888 +
  889 + ret = __mutex_fastpath_lock_retval(&lock->base.count);
  890 +
  891 + if (likely(!ret)) {
  892 + ww_mutex_set_context_fastpath(lock, ctx);
  893 + mutex_set_owner(&lock->base);
  894 + } else
  895 + ret = __ww_mutex_lock_interruptible_slowpath(lock, ctx);
  896 + return ret;
  897 +}
  898 +EXPORT_SYMBOL(__ww_mutex_lock_interruptible);
  899 +
  900 +#endif
615 901  
616 902 /**
617 903 * atomic_dec_and_mutex_lock - return holding mutex if we dec to 0
... ... @@ -30,6 +30,7 @@
30 30 * a locking bug is detected.
31 31 */
32 32 int debug_locks_silent;
  33 +EXPORT_SYMBOL_GPL(debug_locks_silent);
33 34  
34 35 /*
35 36 * Generic 'turn off all lock debugging' function:
... ... @@ -44,4 +45,5 @@
44 45 }
45 46 return 0;
46 47 }
  48 +EXPORT_SYMBOL_GPL(debug_locks_off);