Eric Lee / smarc-ti-linux-kernel | Embedian Git Server

02 Apr, 2014

1 commit

c7999c362 reduce m_start() cost...

Signed-off-by: Al Viro

Al Viro
2014-04-02 11:19:09 +0800

31 Mar, 2014

2 commits

38129a13e switch mnt_hash to hlist

fixes RCU bug - walking through hlist is safe in face of element moves,
since it's self-terminating. Cyclic lists are not - if we end up jumping
to another hash chain, we'll loop infinitely without ever hitting the
original list head.
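
The difference between the two termination rules can be sketched in a small userspace model. This is only an illustration: `struct node`, both walkers, and the `max_steps` guard are hypothetical stand-ins, not the kernel's hlist or list_head code.

```c
#include <stddef.h>

struct node {
    struct node *next;
};

/* hlist-style walk: stops at the NULL that ends whatever chain we are on.
 * Returns the number of nodes visited, or -1 if max_steps is exceeded. */
int walk_null_terminated(const struct node *pos, int max_steps)
{
    int steps = 0;

    while (pos != NULL) {
        if (++steps > max_steps)
            return -1;
        pos = pos->next;
    }
    return steps;
}

/* Cyclic-list walk: stops only when it gets back to the head it started from. */
int walk_cyclic(const struct node *pos, const struct node *head, int max_steps)
{
    int steps = 0;

    while (pos != head) {
        if (++steps > max_steps)
            return -1;
        pos = pos->next;
    }
    return steps;
}

/* The walker is about to visit x when x is moved to another chain: x -> b1 -> NULL. */
int demo_hlist(void)
{
    struct node b1 = { NULL };
    struct node x = { &b1 };

    return walk_null_terminated(&x, 1000);
}

/* Same move with cyclic chains: x now lives on head_b's ring, but the walker
 * still waits for head_a, which it can never reach. */
int demo_cyclic(void)
{
    struct node head_a, head_b, b1, x;

    head_a.next = &head_a;   /* chain A is empty after the move */
    head_b.next = &x;
    x.next = &b1;
    b1.next = &head_b;

    return walk_cyclic(&x, &head_a, 1000);
}
```

In this model demo_hlist() finishes after visiting the two nodes of the new chain, while demo_cyclic() only stops because of the artificial step limit.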

[fix for dumb braino folded]

Spotted by: Max Kellermann
Cc: stable@vger.kernel.org
Signed-off-by: Al Viro

Al Viro
2014-03-31 07:18:51 +0800

0818bf27c resizable namespace.c hashes

* switch allocation to alloc_large_system_hash()
* make sizes overridable by boot parameters (mhash_entries=, mphash_entries=)
* switch mountpoint_hashtable from list_head to hlist_head

Cc: stable@vger.kernel.org
Signed-off-by: Al Viro

Al Viro
2014-03-31 07:18:49 +0800

26 Jan, 2014

1 commit

260a459d2 vfs: Is mounted should be testing mnt_ns for NULL or error.

A bug was introduced with the is_mounted helper function in
commit f7a99c5b7c8bd3d3f533c8b38274e33f3da9096e
Author: Al Viro
Date: Sat Jun 9 00:59:08 2012 -0400

    get rid of ->mnt_longterm

    it's enough to set ->mnt_ns of internal vfsmounts to something
    distinct from all struct mnt_namespace out there; then we can
    just use the check for ->mnt_ns != NULL in the fast path of
    mntput_no_expire()

    Signed-off-by: Al Viro

The intent was to test whether real_mount(vfsmount)->mnt_ns was NULL
or an error pointer, but the code is actually testing
real_mount(vfsmount) itself and so always returns true.

The result is d_absolute_path returning paths it should be hiding.
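
A minimal userspace sketch of the mistake, under stated assumptions: `is_err_or_null()` imitates the kernel's IS_ERR_OR_NULL(), `NS_INTERNAL` is a hypothetical error-pointer marker for internal mounts, and the structs are reduced to the one field that matters here.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define MAX_ERRNO 4095

/* Userspace imitation of IS_ERR_OR_NULL(): NULL, or an address in the top
 * MAX_ERRNO bytes of the address space, counts as "no real object". */
bool is_err_or_null(const void *p)
{
    return p == NULL || (uintptr_t)p >= (uintptr_t)-MAX_ERRNO;
}

struct mnt_namespace { int unused; };
struct mount { struct mnt_namespace *mnt_ns; };

/* Hypothetical marker for internal mounts: an error pointer, not a namespace. */
#define NS_INTERNAL ((struct mnt_namespace *)(uintptr_t)-22)

/* Buggy shape: tests the mount pointer itself, which is always a valid object. */
bool is_mounted_buggy(const struct mount *m)
{
    return !is_err_or_null(m);
}

/* Intended shape: tests the namespace pointer stored in the mount. */
bool is_mounted_fixed(const struct mount *m)
{
    return !is_err_or_null(m->mnt_ns);
}
```

With these definitions the buggy variant reports a detached mount (mnt_ns == NULL) as mounted, while the fixed variant reports both detached and internal mounts as not mounted.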

Cc: stable@vger.kernel.org
Signed-off-by: "Eric W. Biederman"
Signed-off-by: Al Viro

Eric W. Biederman
2014-01-26 21:26:42 +0800

09 Nov, 2013

1 commit

48a066e72 RCU'd vfsmounts

* RCU-delayed freeing of vfsmounts
* vfsmount_lock replaced with a seqlock (mount_lock)
* sequence number from mount_lock is stored in nameidata->m_seq and
  used when we exit RCU mode
* new vfsmount flag - MNT_SYNC_UMOUNT. Set by umount_tree() when its
  caller knows that vfsmount will have no surviving references.
* synchronize_rcu() done between unlocking namespace_sem in namespace_unlock()
  and doing pending mntput().
* new helper: legitimize_mnt(mnt, seq). Checks the mount_lock sequence
  number against seq, then grabs reference to mnt. Then it rechecks mount_lock
  again to close the race and either returns success or drops the reference it
  has acquired. The subtle point is that in case of MNT_SYNC_UMOUNT we can
  simply decrement the refcount and sod off - aforementioned synchronize_rcu()
  makes sure that final mntput() won't come until we leave RCU mode. We need
  that, since we don't want to end up with some lazy pathwalk racing with
  umount() and stealing the final mntput() from it - caller of umount() may
  expect it to return only once the fs is shut down and we don't want to break
  that. In other cases (i.e. with MNT_SYNC_UMOUNT absent) we have to do
  full-blown mntput() in case of mount_lock sequence number mismatch happening
  just as we'd grabbed the reference, but in those cases we won't be stealing
  the final mntput() from anything that would care.
* mntput_no_expire() doesn't lock anything on the fast path now. Incidentally,
  SMP and UP cases are handled the same way - no ifdefs there.
* normal pathname resolution does *not* do any writes to mount_lock. It does,
  of course, bump the refcounts of vfsmount and dentry in the very end, but that's
  it.
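
The check / grab / recheck sequence described for legitimize_mnt() can be modelled in single-threaded C. This is a toy sketch, not the kernel code: there is no real concurrency, RCU or memory ordering here, `mount_seq` is a plain counter standing in for the mount_lock sequence number, and the `race` hook, `run_scenario()` and the two racing_* helpers are hypothetical devices for simulating an umount that lands between the reference grab and the recheck.

```c
#include <stdbool.h>
#include <stddef.h>

#define MNT_SYNC_UMOUNT 0x1u  /* flag value is arbitrary in this model */

struct model_mount {
    int refcount;
    unsigned flags;
    bool shut_down;           /* set when the last reference is put */
};

struct outcome {
    bool ok;
    int refcount;
    bool shut_down;
};

/* Stand-in for the sequence number of mount_lock. */
unsigned mount_seq;

/* Full-blown put: dropping the last reference shuts the mount down. */
void model_mntput(struct model_mount *mnt)
{
    if (--mnt->refcount == 0)
        mnt->shut_down = true;
}

bool model_legitimize_mnt(struct model_mount *mnt, unsigned seq,
                          void (*race)(struct model_mount *))
{
    if (mount_seq != seq)          /* stale before we even started */
        return false;
    mnt->refcount++;               /* grab the reference */
    if (race)
        race(mnt);                 /* simulated concurrent umount */
    if (mount_seq == seq)          /* recheck: nothing changed, reference is good */
        return true;
    if (mnt->flags & MNT_SYNC_UMOUNT) {
        mnt->refcount--;           /* plain decrement; umount keeps the final put */
        return false;
    }
    model_mntput(mnt);             /* otherwise we may be the one doing the last put */
    return false;
}

/* Synchronous umount: marks the mount and bumps the sequence number. */
void racing_sync_umount(struct model_mount *mnt)
{
    mnt->flags |= MNT_SYNC_UMOUNT;
    mount_seq++;
}

/* Lazy umount: drops its own reference and bumps the sequence number. */
void racing_lazy_umount(struct model_mount *mnt)
{
    mnt->refcount--;
    mount_seq++;
}

/* Run one attempt against a fresh mount that starts with one reference. */
struct outcome run_scenario(void (*race)(struct model_mount *))
{
    struct model_mount mnt = { 1, 0, false };
    struct outcome out;

    out.ok = model_legitimize_mnt(&mnt, mount_seq, race);
    out.refcount = mnt.refcount;
    out.shut_down = mnt.shut_down;
    return out;
}
```

In the model, run_scenario(NULL) succeeds and leaves two references; with racing_sync_umount the attempt fails and the count simply returns to one; with racing_lazy_umount the attempt fails and the walker's own put is the one that shuts the mount down.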

Signed-off-by: Al Viro

Al Viro
2013-11-09 13:16:19 +0800

25 Oct, 2013

3 commits

474279dc0 split __lookup_mnt() in two functions

Instead of passing the direction as argument (and checking it on every
step through the hash chain), just have separate __lookup_mnt() and
__lookup_mnt_last(). And use the standard iterators...
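
As a rough illustration of the split, here are two separate lookups over one chain, with no direction flag tested inside the loop. Everything in this sketch is an assumption made for the example: the `struct entry` layout, the integer keys, the singly linked chain and the sample nodes are not the kernel's data structures.

```c
#include <stddef.h>

struct entry {
    int parent;            /* stand-in for the parent mount */
    int dentry;            /* stand-in for the mountpoint dentry */
    int id;                /* lets the caller tell matches apart */
    struct entry *next;
};

/* First match on the chain. */
struct entry *lookup_first(struct entry *chain, int parent, int dentry)
{
    for (struct entry *p = chain; p != NULL; p = p->next)
        if (p->parent == parent && p->dentry == dentry)
            return p;
    return NULL;
}

/* Last match on the chain. */
struct entry *lookup_last(struct entry *chain, int parent, int dentry)
{
    struct entry *found = NULL;

    for (struct entry *p = chain; p != NULL; p = p->next)
        if (p->parent == parent && p->dentry == dentry)
            found = p;
    return found;
}

/* Sample chain: two entries for (1, 7) with an unrelated entry between them. */
struct entry third_node  = { 1, 7, 30, NULL };
struct entry second_node = { 2, 9, 20, &third_node };
struct entry first_node  = { 1, 7, 10, &second_node };
```

On the sample chain, lookup_first(&first_node, 1, 7) yields the entry with id 10 and lookup_last(&first_node, 1, 7) the one with id 30.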

Signed-off-by: Al Viro

Al Viro
2013-10-25 11:35:00 +0800

719ea2fbb new helpers: lock_mount_hash/unlock_mount_hash

aka br_write_{lock,unlock} of vfsmount_lock. Inlines in fs/mount.h,
vfsmount_lock extern moved over there as well.

Signed-off-by: Al Viro

Al Viro
2013-10-25 11:34:59 +0800

aba809cf0 namespace.c: get rid of mnt_ghosts

Signed-off-by: Al Viro

Al Viro
2013-10-25 11:34:58 +0800

10 Apr, 2013

1 commit

84d17192d get rid of full-hash scan on detaching vfsmounts

Signed-off-by: Al Viro

Al Viro
2013-04-10 02:12:52 +0800

20 Nov, 2012

1 commit

98f842e67 proc: Usable inode numbers for the namespace file descriptors.

Assign a unique proc inode to each namespace, and use that
inode number to ensure we only allocate at most one proc
inode for every namespace in proc.

A single proc inode per namespace allows userspace to test
to see if two processes are in the same namespace.
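
From userspace, the test can be sketched with stat(2): two namespace files refer to the same namespace when their device and inode numbers match. The helper name `same_inode()` is made up for this example, and the paths a caller would pass (for instance the /proc/<pid>/ns/<type> entries of two processes) are an assumption about how it would be used.

```c
#include <sys/stat.h>

/* Returns 1 if both paths resolve to the same inode on the same device,
 * 0 if they differ, and -1 if either stat() call fails. */
int same_inode(const char *a, const char *b)
{
    struct stat sa, sb;

    if (stat(a, &sa) != 0 || stat(b, &sb) != 0)
        return -1;
    return sa.st_dev == sb.st_dev && sa.st_ino == sb.st_ino;
}
```

A caller would compare, say, the uts entry of its own process with that of another pid; reading another process's entries may require suitable privileges.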

This has been a long requested feature and only blocked because
a naive implementation would put the id in a global space and
would ultimately require having a namespace for the names of
namespaces, making migration and certain virtualization tricks
impossible.

We still don't have per-superblock inode numbers for proc, which
appears necessary for application-unaware checkpoint/restart and
migrations (if the application is using namespace file descriptors),
but that is now allowed by the design if it becomes important.

I have preallocated the ipc and uts initial proc inode numbers so
their structures can be statically initialized.

Signed-off-by: Eric W. Biederman

Eric W. Biederman
2012-11-20 20:19:49 +0800

19 Nov, 2012

2 commits

771b13716 vfs: Add a user namespace reference from struct mnt_namespace

This will allow for support for unprivileged mounts in a new user namespace.

Acked-by: "Serge E. Hallyn"
Signed-off-by: "Eric W. Biederman"

Eric W. Biederman
2012-11-19 21:59:19 +0800

8823c079b vfs: Add setns support for the mount namespace

setns support for the mount namespace is a little tricky as an
arbitrary decision must be made about what to set fs->root and
fs->pwd to, as there is no expectation of a relationship between
the two mount namespaces. Therefore I arbitrarily find the root
mount point, and follow every mount on top of it to find the top
of the mount stack. Then I set fs->root and fs->pwd to that
location. The topmost root of the mount stack seems like a
reasonable place to be.

Bind mount support for the mount namespace inodes has the
possibility of creating circular dependencies between mount
namespaces. Circular dependencies can result in loops that
prevent mount namespaces from ever being freed. I avoid
creating those circular dependencies by adding a sequence number
to the mount namespace and requiring all bind mounts be of a
younger mount namespace into an older mount namespace.
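
The ordering rule can be sketched as a single comparison. This is a simplified model: `struct model_ns` and `bind_allowed()` are hypothetical names, and a larger `seq` is assumed to mean a namespace that was created later (younger).

```c
#include <stdbool.h>

struct model_ns {
    unsigned long long seq;   /* assigned at creation, strictly increasing */
};

/* A bind mount of `bound`'s namespace inode inside namespace `into` is
 * allowed only when `bound` is strictly younger than `into`. */
bool bind_allowed(const struct model_ns *into, const struct model_ns *bound)
{
    return bound->seq > into->seq;
}
```

Because every permitted reference points from an older namespace to a strictly younger one, the sequence numbers strictly increase along any chain of references, so a chain can never lead back to where it started.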

Add a helper function proc_ns_inode so it is possible to
detect when we are attempting to bind mount a namespace inode.

Acked-by: Serge Hallyn
Signed-off-by: Eric W. Biederman

Eric W. Biederman
2012-11-19 21:59:18 +0800

14 Jul, 2012

2 commits

6ce6e24e7 get rid of magic in proc_namespace.c

don't rely on proc_mounts->m being the first field; container_of()
is there for a purpose. No need to bother with ->private, while
we are at it - the same container_of will do nicely.
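
A small sketch of why container_of() is preferable to relying on field order. The macro below is a simplified userspace version without the kernel's type checking, and both structs are hypothetical stand-ins in which the embedded member is deliberately not the first field.

```c
#include <stddef.h>

/* Simplified container_of(): step back from a member to its enclosing struct. */
#define container_of(ptr, type, member) \
    ((type *)((char *)(ptr) - offsetof(type, member)))

struct seq_file_model {
    int pos;
};

struct proc_mounts_model {
    int ns_id;                 /* placed first on purpose */
    struct seq_file_model m;   /* embedded member, at a non-zero offset */
};

/* Works wherever m sits inside the struct; a plain cast of the member
 * pointer would only be right if m happened to be the first field. */
struct proc_mounts_model *to_proc_mounts(struct seq_file_model *m)
{
    return container_of(m, struct proc_mounts_model, m);
}

struct proc_mounts_model sample = { 42, { 0 } };
```

Here to_proc_mounts(&sample.m) gives back &sample even though the address of sample.m differs from the address of sample.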

Signed-off-by: Al Viro

Al Viro
2012-07-14 20:32:48 +0800

f7a99c5b7 get rid of ->mnt_longterm

it's enough to set ->mnt_ns of internal vfsmounts to something
distinct from all struct mnt_namespace out there; then we can
just use the check for ->mnt_ns != NULL in the fast path of
mntput_no_expire()

Signed-off-by: Al Viro

Al Viro
2012-07-14 20:32:47 +0800

07 Jan, 2012

1 commit

39f7c4db1 vfs: keep list of mounts for each superblock

Keep track of vfsmounts belonging to a superblock. List is protected
by vfsmount_lock.

Signed-off-by: Miklos Szeredi
Tested-by: Toshiyuki Okajima
Signed-off-by: Al Viro

Miklos Szeredi
2012-01-07 12:20:12 +0800

04 Jan, 2012

21 commits

be08d6d26 switch mnt_namespace ->root to struct mount

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:57:13 +0800

0226f4923 vfs: take /proc/*/mounts and friends to fs/proc_namespace.c

rationale: that stuff is far tighter bound to fs/namespace.c than to
the guts of procfs proper.

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:57:13 +0800

c63181e6b vfs: move fsnotify junk to struct mount

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:57:12 +0800

52ba1621d vfs: move mnt_devname

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:57:11 +0800

1a4eeaf2a vfs: move mnt_list to struct mount

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:57:11 +0800

863d684f9 vfs: move the rest of int fields to struct mount

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:57:10 +0800

15169fe78 vfs: mnt_id/mnt_group_id moved

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:57:10 +0800

143c8c91c vfs: mnt_ns moved to struct mount

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:57:09 +0800

6776db3d3 vfs: take mnt_share/mnt_slave/mnt_slave_list and mnt_expire to struct mount

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:57:08 +0800

32301920f vfs: and now we can make ->mnt_master point to struct mount

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:57:08 +0800

d10e8def0 vfs: take mnt_master to struct mount

make IS_MNT_SLAVE take struct mount * at the same time

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:57:08 +0800

6b41d536f vfs: take mnt_child/mnt_mounts to struct mount

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:57:06 +0800

68e8a9fea vfs: all counters taken to struct mount

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:57:06 +0800

a73324da7 vfs: move mnt_mountpoint to struct mount

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:57:05 +0800

0714a5338 vfs: now it can be done - make mnt_parent point to struct mount

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:57:05 +0800

3376f34ff vfs: mnt_parent moved to struct mount

the second victim...

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:57:04 +0800

676da58df vfs: spread struct mount - mnt_has_parent

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:57:04 +0800

1b8e5564b vfs: the first spoils - mnt_hash moved

taken out of struct vfsmount into struct mount

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:57:02 +0800

c71053659 vfs: spread struct mount - __lookup_mnt() result

switch __lookup_mnt() to returning struct mount *; callers adjusted.

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:56:58 +0800

7d6fec45a vfs: start hiding vfsmount guts series

Almost all fields of struct vfsmount are used only by core VFS (and
a fairly small part of it, at that). The plan: embed struct vfsmount
into struct mount, making the latter visible only to core parts of VFS.
Then move fields from vfsmount to mount, eventually leaving only
mnt_root/mnt_sb/mnt_flags in struct vfsmount. Filesystem code still
gets pointers to struct vfsmount and remains unchanged; all such
pointers go to struct vfsmount embedded into the instances of struct
mount allocated by fs/namespace.c. When fs/namespace.c et al. get
a pointer to vfsmount, they turn it into pointer to mount (using
container_of) and work with that.
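
The embedding plan can be sketched in userspace as follows. The `*_model` names, the chosen fields and the `publish()` helper are illustrative assumptions, and the container_of() macro is a simplified version without the kernel's type checking.

```c
#include <stddef.h>

#define container_of(ptr, type, member) \
    ((type *)((char *)(ptr) - offsetof(type, member)))

/* Public part: the only thing filesystem code gets to see. */
struct vfsmount_model {
    int mnt_flags;
};

/* Private part: visible only to the core code that allocates it. */
struct mount_model {
    int mnt_id;
    struct mount_model *mnt_parent;
    struct vfsmount_model mnt;      /* embedded public part */
};

/* Core code hands out a pointer to the embedded public struct... */
struct vfsmount_model *publish(struct mount_model *m)
{
    return &m->mnt;
}

/* ...and converts it back when it needs the private fields. */
struct mount_model *real_mount_model(struct vfsmount_model *v)
{
    return container_of(v, struct mount_model, mnt);
}

struct mount_model sample_mount = { 7, NULL, { 0 } };
```

Code that only ever holds a struct vfsmount_model pointer keeps compiling unchanged as fields migrate into struct mount_model, which is the point of doing the move in small steps.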

This is the first part of series; struct mount is introduced,
allocation switched to using it.

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:56:57 +0800

b2dba1af3 vfs: new internal helper: mnt_has_parent(mnt)

vfsmounts have ->mnt_parent pointing either to a different vfsmount
or to itself; it's never NULL and termination condition in loops
traversing the tree towards root is mnt == mnt->mnt_parent. At least
one place (see the next patch) is confused about what's going on;
let's add an explicit helper checking it the right way and use it in
all places where we need it. Not that there had been too many,
but...
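
A minimal sketch of the helper and of a loop that walks towards the root with it. The struct is cut down to the single pointer that matters, and `depth()` and the three sample mounts are made up for the example.

```c
#include <stdbool.h>

struct mount_model {
    struct mount_model *mnt_parent;   /* never NULL; the root points to itself */
};

/* The termination test spelled out once, instead of open-coding it in each loop. */
bool has_parent(const struct mount_model *mnt)
{
    return mnt != mnt->mnt_parent;
}

/* Number of steps from mnt up to the root of its tree. */
int depth(const struct mount_model *mnt)
{
    int d = 0;

    while (has_parent(mnt)) {
        mnt = mnt->mnt_parent;
        d++;
    }
    return d;
}

/* Sample tree: root <- child <- grandchild. */
struct mount_model root = { &root };
struct mount_model child = { &root };
struct mount_model grandchild = { &child };
```

A loop written as "while (mnt->mnt_parent != NULL)" would never terminate on this tree, which is the kind of confusion the helper is meant to prevent.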

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:52:36 +0800