07 Jan, 2011

1 commit

  • The problem that this patch aims to fix is vfsmount refcounting scalability.
    We need to take a reference on the vfsmount for every successful path lookup,
    many of which go to the same mount point.

    The fundamental difficulty is that a "simple" reference count can never be made
    scalable, because any time a reference is dropped, we must check whether that
    was the last reference. To do that requires communication with all other CPUs
    that may have taken a reference count.

    We can make refcounts more scalable in a couple of ways, involving keeping
    distributed counters, and checking for the global-zero condition less
    frequently.

    - check the global sum once every interval (this will delay zero detection
    for some interval, so it's probably a showstopper for vfsmounts).

    - keep a local count and only take the global sum when the local count reaches
    0 (this is difficult for vfsmounts, because we can't hold preempt off for the
    life of a reference, so a counter would need to be per-thread or tied strongly
    to a particular CPU, which requires more locking).

    - keep a local difference of increments and decrements, which allows us to sum
    the total difference and hence find the refcount when summing all CPUs. Then,
    keep a single integer "long" refcount for slow and long lasting references,
    and only take the global sum of local counters when the long refcount is 0.

    This last scheme is what I implemented here. Attached mounts and process root
    and working directory references are "long" references, and everything else is
    a short reference.

    This allows scalable vfsmount references during path walking over mounted
    subtrees and unattached (lazy umounted) mounts with processes still running
    in them.

    This results in one fewer atomic op in the fastpath: mntget is now just a
    per-CPU increment, rather than an atomic increment; and mntput just requires a
    spinlock and a non-atomic decrement in the common case. However, the code is
    otherwise bigger and heavier, so single-threaded performance is basically a wash.
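
    A minimal single-threaded sketch of the counting scheme described above, in
    plain C (the names and NR_CPUS are made up for illustration, and all of the
    locking and preemption handling the real kernel code needs is omitted):

        #include <stdbool.h>

        #define NR_CPUS 4

        struct mnt {
                long percpu_count[NR_CPUS]; /* per-CPU increments minus decrements */
                long longrefs;              /* "long" refs: attached mounts, root/cwd */
        };

        /* Short reference: touch only this CPU's counter (no atomic needed). */
        static void mntget_short(struct mnt *m, int cpu) { m->percpu_count[cpu]++; }
        static void mntput_short(struct mnt *m, int cpu) { m->percpu_count[cpu]--; }

        /* The expensive global sum, only taken when the long refcount is 0,
         * i.e. the mount is detached and no process has its root or cwd on it. */
        static long mnt_total_refs(const struct mnt *m)
        {
                long sum = m->longrefs;
                for (int cpu = 0; cpu < NR_CPUS; cpu++)
                        sum += m->percpu_count[cpu];
                return sum;
        }

        static bool mnt_can_free(const struct mnt *m)
        {
                if (m->longrefs != 0)
                        return false;           /* common case: no cross-CPU traffic */
                return mnt_total_refs(m) == 0;  /* slow path: check for global zero */
        }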

    Signed-off-by: Nick Piggin

    Nick Piggin
     

18 Aug, 2010

1 commit

  • fs: brlock vfsmount_lock

    Use a brlock for the vfsmount lock. It must be taken for write whenever
    modifying the mount hash or associated fields, and may be taken for read when
    performing mount hash lookups.

    A new lock is added for the mnt-id allocator, so it doesn't need to take
    the heavy vfsmount write-lock.

    The number of atomics should remain the same for fastpath rlock cases, though
    code would be slightly slower due to per-cpu access. Scalability is not much
    improved in common cases yet, due to other locks (i.e. dcache_lock) getting
    in the way. However, path lookups crossing mountpoints should be one case where
    scalability is improved (currently requiring the global lock).

    The slowpath is slower due to use of brlock. On a 64 core, 64 socket, 32 node
    Altix system (high latency to remote nodes), a simple umount microbenchmark
    (mount --bind mnt mnt2 ; umount mnt2, looped 1000 times) took 6.8s before this
    patch and 7.1s after, about 5% slower.
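
    A simplified userspace sketch of the brlock idea (pthread mutexes stand in
    for the per-CPU spinlocks; this is not the kernel's actual vfsmount_lock
    interface): readers take only their own CPU's lock, writers take every
    CPU's lock.

        #include <pthread.h>

        #define NR_CPUS 4

        static pthread_mutex_t brlock[NR_CPUS] = {
                PTHREAD_MUTEX_INITIALIZER, PTHREAD_MUTEX_INITIALIZER,
                PTHREAD_MUTEX_INITIALIZER, PTHREAD_MUTEX_INITIALIZER,
        };

        /* Read side (e.g. mount hash lookup): cheap, only the local CPU's lock. */
        static void br_read_lock(int cpu)   { pthread_mutex_lock(&brlock[cpu]); }
        static void br_read_unlock(int cpu) { pthread_mutex_unlock(&brlock[cpu]); }

        /* Write side (e.g. modifying the mount hash): take every CPU's lock,
         * which excludes all readers.  This is the slow, rarely taken path. */
        static void br_write_lock(void)
        {
                for (int i = 0; i < NR_CPUS; i++)
                        pthread_mutex_lock(&brlock[i]);
        }

        static void br_write_unlock(void)
        {
                for (int i = NR_CPUS - 1; i >= 0; i--)
                        pthread_mutex_unlock(&brlock[i]);
        }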

    Cc: Al Viro
    Signed-off-by: Nick Piggin
    Signed-off-by: Al Viro

    Nick Piggin
     

04 Mar, 2010

1 commit

  • First of all, get_source() never results in CL_PROPAGATION
    alone. We either get CL_MAKE_SHARED (for the continuation
    of peer group) or CL_SLAVE (slave that is not shared) or both
    (beginning of peer group among slaves). Massage the code to
    make that explicit, kill CL_PROPAGATION test in clone_mnt()
    (nothing sets CL_MAKE_SHARED without CL_PROPAGATION and in
    clone_mnt() we are checking CL_PROPAGATION after we'd found
    that there's no CL_SLAVE, so the check for CL_MAKE_SHARED
    would do just as well).

    Fix comments, while we are at it...
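
    For illustration only, the three combinations described above could be
    spelled out like this (the CL_* flags are the ones in fs/pnode.h; the
    combination names are made up here, not part of the patch):

        /* What get_source() is said to produce, per the description above: */
        #define PEER_CONTINUATION (CL_MAKE_SHARED)            /* continuation of a peer group       */
        #define PLAIN_SLAVE       (CL_SLAVE)                  /* slave that is not shared           */
        #define SLAVE_PEER_START  (CL_SLAVE | CL_MAKE_SHARED) /* start of a peer group among slaves */
        /* CL_PROPAGATION alone never occurs, so once CL_SLAVE has been ruled out,
         * checking CL_MAKE_SHARED in clone_mnt() is as good as the old
         * CL_PROPAGATION test. */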

    Signed-off-by: Al Viro

    Al Viro
     

23 Apr, 2008

2 commits

  • Show peer group ID of nearest dominating group that has intersection
    with the mount's namespace.

    Signed-off-by: Miklos Szeredi
    Signed-off-by: Al Viro

    Miklos Szeredi
     
  • Add a unique ID to each peer group using the IDR infrastructure. The
    identifiers are reused after the peer group dissolves.

    The IDR structures are protected by holding namespace_sem for write
    while allocating or deallocating IDs.

    IDs are allocated when a previously unshared vfsmount becomes the
    first member of a peer group. When a new member is added to an
    existing group, the ID is copied from one of the old members.

    IDs are freed when the last member of a peer group is unshared.

    Setting the MNT_SHARED flag on members of a subtree is done as a
    separate step, after all the IDs have been allocated. This way an
    allocation failure can be cleaned up easily, without affecting the
    propagation state.

    Based on design sketch by Al Viro.
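
    A rough sketch of the allocation pattern described above, using the current
    kernel IDR interface (idr_alloc()/idr_remove()); the helper names and the
    mnt_group_idr variable are hypothetical, the mnt_group_id field follows
    later kernels, and the original patch may have used an older IDR API:

        #include <linux/idr.h>

        static DEFINE_IDR(mnt_group_idr);       /* hypothetical name */

        /* Called when a previously unshared mount becomes the first member of a
         * peer group.  The caller holds namespace_sem for write. */
        static int mnt_alloc_group_id_sketch(struct mount *mnt)
        {
                int id = idr_alloc(&mnt_group_idr, mnt, 1, 0, GFP_KERNEL);

                if (id < 0)
                        return id;              /* allocation failure is easy to unwind */
                mnt->mnt_group_id = id;
                return 0;
        }

        /* Called when the last member of the peer group is unshared.
         * The caller holds namespace_sem for write. */
        static void mnt_release_group_id_sketch(struct mount *mnt)
        {
                idr_remove(&mnt_group_idr, mnt->mnt_group_id);
                mnt->mnt_group_id = 0;
        }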

    Signed-off-by: Miklos Szeredi
    Signed-off-by: Al Viro

    Miklos Szeredi
     

07 Feb, 2008

1 commit

  • Some time ago ( http://lkml.org/lkml/2007/6/19/128 ) I wrote that it felt
    like a bug that MNT_UNBINDABLE is not reset by "mount --make-private".

    Today I happened to look at mount(8) and Documentation/sharedsubtree.txt, and
    both document the behaviour obtained by applying the little patch given there
    (and again below).

    So the present kernel code does not follow the specification and must be
    regarded as buggy.

    Specification in Documentation/sharedsubtree.txt:
    See state diagram: unbindable should become private upon make-private.

    Specification in mount(8):
    ... It's
    also possible to set up uni-directional propagation (with --make-
    slave), to make a mount point unavailable for --bind/--rbind (with
    --make-unbindable), and to undo any of these (with --make-private).

    Repeat of old fix-shared-subtrees-make-private.patch
    (due to Dirk Gerrits, René Gabriëls, Peter Kooijmans):

    Acked-by: Ram Pai
    Cc: Al Viro
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Andries E. Brouwer
     

09 May, 2007

1 commit

  • There are many places in the kernel where constructions like

    foo = list_entry(head->next, struct foo_struct, list);

    are used.
    The code would look more descriptive and neater using the macro

    #define list_first_entry(head, type, member) \
            list_entry((head)->next, type, member)

    Here is the macro itself and examples of its usage in the generic code.
    If it turns out to be useful, I can prepare a set of patches to introduce it
    into arch-specific code, drivers, networking, etc.
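
    A small usage example of the kind the macro is meant to tidy up (struct
    foo_struct and foo_list are made up for illustration; both forms assume the
    list is non-empty, as with list_entry itself):

        #include <linux/list.h>

        struct foo_struct {
                int value;
                struct list_head list;
        };

        static LIST_HEAD(foo_list);

        static struct foo_struct *peek_first(void)
        {
                /* open-coded, as it is written today */
                struct foo_struct *old_way =
                        list_entry(foo_list.next, struct foo_struct, list);

                /* with the new helper, the intent is explicit */
                struct foo_struct *new_way =
                        list_first_entry(&foo_list, struct foo_struct, list);

                (void)old_way;
                return new_way;
        }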

    Signed-off-by: Pavel Emelianov
    Signed-off-by: Kirill Korotaev
    Cc: Randy Dunlap
    Cc: Andi Kleen
    Cc: Zach Brown
    Cc: Davide Libenzi
    Cc: John McCutchan
    Cc: Thomas Gleixner
    Cc: Ingo Molnar
    Cc: john stultz
    Cc: Ram Pai
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Pavel Emelianov
     

09 Dec, 2006

1 commit

  • Rename 'struct namespace' to 'struct mnt_namespace' to avoid confusion with
    other namespaces being developed for containers: pid, uts, ipc, etc.
    'namespace' variables and attributes are also renamed to 'mnt_ns'.

    Signed-off-by: Kirill Korotaev
    Signed-off-by: Cedric Le Goater
    Cc: Eric W. Biederman
    Cc: Herbert Poetzl
    Cc: Sukadev Bhattiprolu
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Kirill Korotaev