Eric Lee / smarc-fsl-linux-kernel

26 Oct, 2010

1 commit

be1a16a0a vfs: fix infinite loop caused by clone_mnt race ... Browse Code »

If clone_mnt() happens while mnt_make_readonly() is running, the
cloned mount might have MNT_WRITE_HOLD flag set, which results in
mnt_want_write() spinning forever on this mount.

Needs CAP_SYS_ADMIN to trigger deliberately and unlikely to happen
accidentally. But if it does happen it can hang the machine.

Signed-off-by: Miklos Szeredi
Signed-off-by: Al Viro

Miklos Szeredi
2010-10-26 09:24:16 +0800

05 Oct, 2010

1 commit

6841c0502 BKL: Remove BKL from do_new_mount() ... Browse Code »

After pushing down the BKL to the get_sb/fill_super operations of the
filesystems that still make usage of the BKL it is safe to remove it from
do_new_mount().

I've read through all the code formerly covered by the BKL inside
do_kern_mount() and have satisfied myself that it doesn't need the BKL
any more.

Signed-off-by: Jan Blunck
Cc: Matthew Wilcox
Signed-off-by: Arnd Bergmann

Jan Blunck
2010-10-05 03:10:43 +0800

08 Sep, 2010

1 commit

7a2e8a8fa VFS: Sanity check mount flags passed to change_mnt_propagation() ... Browse Code »
45

Sanity check the flags passed to change_mnt_propagation(). Exactly
one flag should be set. Return EINVAL otherwise.

Userspace can pass in arbitrary combinations of MS_* flags to mount().
do_change_type() is called if any of MS_SHARED, MS_PRIVATE, MS_SLAVE,
or MS_UNBINDABLE is set. do_change_type() clears MS_REC and then
calls change_mnt_propagation() with the rest of the user-supplied
flags. change_mnt_propagation() clearly assumes only one flag is set
but do_change_type() does not check that this is true. For example,
mount() with flags MS_SHARED | MS_RDONLY does not actually make the
mount shared or read-only but does clear MNT_UNBINDABLE.

Signed-off-by: Valerie Aurora
Signed-off-by: Linus Torvalds

Valerie Aurora
2010-09-08 04:46:20 +0800

18 Aug, 2010

1 commit

99b7db7b8 fs: brlock vfsmount_lock ... Browse Code »

fs: brlock vfsmount_lock

Use a brlock for the vfsmount lock. It must be taken for write whenever
modifying the mount hash or associated fields, and may be taken for read when
performing mount hash lookups.

A new lock is added for the mnt-id allocator, so it doesn't need to take
the heavy vfsmount write-lock.

The number of atomics should remain the same for fastpath rlock cases, though
code would be slightly slower due to per-cpu access. Scalability is not not be
much improved in common cases yet, due to other locks (ie. dcache_lock) getting
in the way. However path lookups crossing mountpoints should be one case where
scalability is improved (currently requiring the global lock).

The slowpath is slower due to use of brlock. On a 64 core, 64 socket, 32 node
Altix system (high latency to remote nodes), a simple umount microbenchmark
(mount --bind mnt mnt2 ; umount mnt2 loop 1000 times), before this patch it
took 6.8s, afterwards took 7.1s, about 5% slower.

Cc: Al Viro
Signed-off-by: Nick Piggin
Signed-off-by: Al Viro

Nick Piggin
2010-08-18 20:35:48 +0800

11 Aug, 2010

3 commits

532490f0a vfs: remove unused MNT_STRICTATIME ... Browse Code »

Commit d0adde574b8487ef30f69e2d08bba769e4be513f added MNT_STRICTATIME
but it isn't actually used (MS_STRICTATIME clears MNT_RELATIME and
MNT_NOATIME rather than setting any mount flag).

Signed-off-by: Miklos Szeredi
Signed-off-by: Al Viro

Miklos Szeredi
2010-08-11 12:29:47 +0800
f7ad3c6be vfs: add helpers to get root and pwd ... Browse Code »

Add three helpers that retrieve a refcounted copy of the root and cwd
from the supplied fs_struct.

get_fs_root()
get_fs_pwd()
get_fs_root_and_pwd()

Signed-off-by: Miklos Szeredi
Signed-off-by: Al Viro

Miklos Szeredi
2010-08-11 12:28:20 +0800
8c8946f50 Merge branch 'for-linus' of git://git.infradead.org/users/eparis/notify ... Browse Code »

* 'for-linus' of git://git.infradead.org/users/eparis/notify: (132 commits)
fanotify: use both marks when possible
fsnotify: pass both the vfsmount mark and inode mark
fsnotify: walk the inode and vfsmount lists simultaneously
fsnotify: rework ignored mark flushing
fsnotify: remove global fsnotify groups lists
fsnotify: remove group->mask
fsnotify: remove the global masks
fsnotify: cleanup should_send_event
fanotify: use the mark in handler functions
audit: use the mark in handler functions
dnotify: use the mark in handler functions
inotify: use the mark in handler functions
fsnotify: send fsnotify_mark to groups in event handling functions
fsnotify: Exchange list heads instead of moving elements
fsnotify: srcu to protect read side of inode and vfsmount locks
fsnotify: use an explicit flag to indicate fsnotify_destroy_mark has been called
fsnotify: use _rcu functions for mark list traversal
fsnotify: place marks on object in order of group memory address
vfs/fsnotify: fsnotify_close can delay the final work in fput
fsnotify: store struct file not struct path
...

Fix up trivial delete/modify conflict in fs/notify/inotify/inotify.c.

Linus Torvalds
2010-08-11 02:39:13 +0800

10 Aug, 2010

1 commit

7a4dec538 Fix sget() race with failing mount ... Browse Code »

If sget() finds a matching superblock being set up, it'll
grab an active reference to it and grab s_umount. That's
fine - we'll wait for completion of foofs_get_sb() that way.
However, if said foofs_get_sb() fails we'll end up holding
the halfway-created superblock. deactivate_locked_super()
called by foofs_get_sb() will just unlock the sucker since
we are holding another active reference to it.

What we need is a way to tell if superblock has been successfully
set up. Unfortunately, neither ->s_root nor the check for
MS_ACTIVE quite fit. Cheap and easy way, suitable for backport:
new flag set by the (only) caller of ->get_sb(). If that flag
isn't present by the time sget() grabbed s_umount on preexisting
superblock it has found, it's seeing a stillborn and should
just bury it with deactivate_locked_super() (and repeat the search).

Longer term we want to set that flag in ->get_sb() instances (and
check for it to distinguish between "sget() found us a live sb"
and "sget() has allocated an sb, we need to set it up" in there,
instead of checking ->s_root as we do now).

Signed-off-by: Al Viro
Cc: stable@kernel.org

Al Viro
2010-08-10 04:49:01 +0800

28 Jul, 2010

2 commits

ca9c726ee fsnotify: Infrastructure for per-mount watches ... Browse Code »

Per-mount watches allow groups to listen to fsnotify events on an entire
mount. This patch simply adds and initializes the fields needed in the
vfsmount struct to make this happen.

Signed-off-by: Andreas Gruenbacher
Signed-off-by: Eric Paris

Andreas Gruenbacher
2010-07-28 21:58:57 +0800
2504c5d63 fsnotify/vfsmount: add fsnotify fields to struct vfsmount ... Browse Code »

This patch adds the list and mask fields needed to support vfsmount marks.
These are the same fields fsnotify needs on an inode. They are not used,
just declared and we note where the cleanup hook should be (the function is
not yet defined)

Signed-off-by: Andreas Gruenbacher
Signed-off-by: Eric Paris

Andreas Gruenbacher
2010-07-28 21:58:57 +0800

18 May, 2010

1 commit

539c99fd7 Merge branch 'next' into for-linus Browse Code »

James Morris
2010-05-18 06:57:00 +0800

15 May, 2010

1 commit

d83c49f3e Fix the regression created by "set S_DEAD on unlink()..." commit ... Browse Code »

1) i_flags simply doesn't work for mount/unlink race prevention;
we may have many links to file and rm on one of those obviously
shouldn't prevent bind on top of another later on. To fix it
right way we need to mark _dentry_ as unsuitable for mounting
upon; new flag (DCACHE_CANT_MOUNT) is protected by d_flags and
i_mutex on the inode in question. Set it (with dont_mount(dentry))
in unlink/rmdir/etc., check (with cant_mount(dentry)) in places
in namespace.c that used to check for S_DEAD. Setting S_DEAD
is still needed in places where we used to set it (for directories
getting killed), since we rely on it for readdir/rmdir race
prevention.

2) rename()/mount() protection has another bogosity - we unhash
the target before we'd checked that it's not a mountpoint. Fixed.

3) ancient bogosity in pivot_root() - we locked i_mutex on the
right directory, but checked S_DEAD on the different (and wrong)
one. Noticed and fixed.

Signed-off-by: Al Viro

Al Viro
2010-05-15 19:16:33 +0800

12 Apr, 2010

6 commits

91a9420f5 security: remove dead hook sb_post_pivotroot ... Browse Code »

Unused hook. Remove.

Signed-off-by: Eric Paris
Signed-off-by: James Morris

Eric Paris
2010-04-12 10:18:32 +0800
3db291017 security: remove dead hook sb_post_addmount ... Browse Code »

Unused hook. Remove.

Signed-off-by: Eric Paris
Signed-off-by: James Morris

Eric Paris
2010-04-12 10:18:31 +0800
82dab1045 security: remove dead hook sb_post_remount ... Browse Code »

Unused hook. Remove.

Signed-off-by: Eric Paris
Signed-off-by: James Morris

Eric Paris
2010-04-12 10:18:30 +0800
4b61d12c8 security: remove dead hook sb_umount_busy ... Browse Code »

Unused hook. Remove.

Signed-off-by: Eric Paris
Signed-off-by: James Morris

Eric Paris
2010-04-12 10:18:30 +0800
231923bd0 security: remove dead hook sb_umount_close ... Browse Code »

Unused hook. Remove.

Signed-off-by: Eric Paris
Signed-off-by: James Morris

Eric Paris
2010-04-12 10:18:29 +0800
353633100 security: remove sb_check_sb hooks ... Browse Code »

Unused hook. Remove it.

Signed-off-by: Eric Paris
Signed-off-by: James Morris

Eric Paris
2010-04-12 10:18:28 +0800

04 Mar, 2010

7 commits

db1f05bb8 vfs: add NOFOLLOW flag to umount(2) ... Browse Code »

Add a new UMOUNT_NOFOLLOW flag to umount(2). This is needed to prevent
symlink attacks in unprivileged unmounts (fuse, samba, ncpfs).

Additionally, return -EINVAL if an unknown flag is used (and specify
an explicitly unused flag: UMOUNT_UNUSED). This makes it possible for
the caller to determine if a flag is supported or not.

CC: Eugene Teo
CC: Michael Kerrisk
Signed-off-by: Miklos Szeredi
Signed-off-by: Al Viro

Miklos Szeredi
2010-03-04 03:08:00 +0800
8089352a1 Mirror MS_KERNMOUNT in ->mnt_flags ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2010-03-04 03:08:00 +0800
d498b25a4 get rid of useless vfsmount_lock use in put_mnt_ns() ... Browse Code »

It hadn't been needed since we'd sanitized the logics in
mark_mounts_for_expiry() (which, in turn, used to be a
rudiment of bad old times when namespace_sem was per-ns).

Signed-off-by: Al Viro

Al Viro
2010-03-04 03:07:59 +0800
9f5596af4 take check for new events in namespace (guts of mounts_poll()) to namespace.c ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2010-03-04 03:07:59 +0800
1f707137b new helper: iterate_mounts() ... Browse Code »

apply function to vfsmounts in set returned by collect_mounts(),
stop if it returns non-zero.

Signed-off-by: Al Viro

Al Viro
2010-03-04 03:07:57 +0800
495d6c9c6 VFS: Clean up shared mount flag propagation ... Browse Code »

The handling of mount flags in set_mnt_shared() got a little tangled
up during previous cleanups, with the following problems:

* MNT_PNODE_MASK is defined as a literal constant when it should be a
bitwise xor of other MNT_* flags
* set_mnt_shared() clears and then sets MNT_SHARED (part of MNT_PNODE_MASK)
* MNT_PNODE_MASK could use a comment in mount.h
* MNT_PNODE_MASK is a terrible name, change to MNT_SHARED_MASK

This patch fixes these problems.

Signed-off-by: Al Viro

Valerie Aurora
2010-03-04 03:07:55 +0800
796a6b521 Kill CL_PROPAGATION, sanitize fs/pnode.c:get_source() ... Browse Code »

First of all, get_source() never results in CL_PROPAGATION
alone. We either get CL_MAKE_SHARED (for the continuation
of peer group) or CL_SLAVE (slave that is not shared) or both
(beginning of peer group among slaves). Massage the code to
make that explicit, kill CL_PROPAGATION test in clone_mnt()
(nothing sets CL_MAKE_SHARED without CL_PROPAGATION and in
clone_mnt() we are checking CL_PROPAGATION after we'd found
that there's no CL_SLAVE, so the check for CL_MAKE_SHARED
would do just as well).

Fix comments, while we are at it...

Signed-off-by: Al Viro

Al Viro
2010-03-04 02:00:22 +0800

17 Jan, 2010

4 commits

27d55f1f4 do_add_mount() should sanitize mnt_flags ... Browse Code »

MNT_WRITE_HOLD shouldn't leak into new vfsmount and neither
should MNT_SHARED (the latter will be set properly, along with
the rest of shared-subtree data structures)

Signed-off-by: Al Viro

Al Viro
2010-01-17 02:07:36 +0800
7b43a79f3 mnt_flags fixes in do_remount() ... Browse Code »

* need vfsmount_lock over modifying it
* need to preserve MNT_SHARED/MNT_UNBINDABLE

Signed-off-by: Al Viro

Al Viro
2010-01-17 02:01:26 +0800
df1a1ad29 attach_recursive_mnt() needs to hold vfsmount_lock over set_mnt_shared() ... Browse Code »

race in mnt_flags update

Signed-off-by: Al Viro

Al Viro
2010-01-17 01:57:40 +0800
8ad08d8a0 may_umount() needs namespace_sem ... Browse Code »

otherwise it races with clone_mnt() changing mnt_share/mnt_slaves

Signed-off-by: Al Viro

Al Viro
2010-01-17 01:56:08 +0800

18 Dec, 2009

1 commit

a2770d86b Revert "fix mismerge with Trond's stuff (create_mnt_ns() export is gone now)" ... Browse Code »

This reverts commit e9496ff46a20a8592fdc7bdaaf41b45eb808d310. Quoth Al:

"it's dependent on a lot of other stuff not currently in mainline
and badly broken with current fs/namespace.c. Sorry, badly
out-of-order cherry-pick from old queue.

PS: there's a large pending series reworking the refcounting and
lifetime rules for vfsmounts that will, among other things, allow to
rip a subtree away _without_ dissolving connections in it, to be
garbage-collected when all active references are gone. It's
considerably saner wrt "is the subtree busy" logics, but it's nowhere
near being ready for merge at the moment; this changeset is one of the
things becoming possible with that sucker, but it certainly shouldn't
have been picked during this cycle. My apologies..."

Noticed-by: Eric Paris
Requested-by: Al Viro
Signed-off-by: Linus Torvalds

Linus Torvalds
2009-12-18 04:51:05 +0800

17 Dec, 2009

1 commit

e9496ff46 fix mismerge with Trond's stuff (create_mnt_ns() export is gone now) ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2009-12-17 01:16:44 +0800

12 Oct, 2009

1 commit

a27ab9f26 LSM: Pass original mount flags to security_sb_mount(). ... Browse Code »

This patch allows LSM modules to determine based on original mount flags
passed to mount(). A LSM module can get masked mount flags (if needed) by

flags &= ~(MS_NOSUID | MS_NOEXEC | MS_NODEV | MS_ACTIVE |
MS_NOATIME | MS_NODIRATIME | MS_RELATIME| MS_KERNMOUNT |
MS_STRICTATIME);

Signed-off-by: Tetsuo Handa
Signed-off-by: James Morris

Tetsuo Handa
2009-10-12 07:56:03 +0800

24 Sep, 2009

1 commit

eca6f534e fs: fix overflow in sys_mount() for in-kernel calls ... Browse Code »

sys_mount() reads/copies a whole page for its "type" parameter. When
do_mount_root() passes a kernel address that points to an object which is
smaller than a whole page, copy_mount_options() will happily go past this
memory object, possibly dereferencing "wild" pointers that could be in any
state (hence the kmemcheck warning, which shows that parts of the next
page are not even allocated).

(The likelihood of something going wrong here is pretty low -- first of
all this only applies to kernel calls to sys_mount(), which are mostly
found in the boot code. Secondly, I guess if the page was not mapped,
exact_copy_from_user() _would_ in fact handle it correctly because of its
access_ok(), etc. checks.)

But it is much nicer to avoid the dubious reads altogether, by stopping as
soon as we find a NUL byte. Is there a good reason why we can't do
something like this, using the already existing strndup_from_user()?

[akpm@linux-foundation.org: make copy_mount_string() static]
[AV: fix compat mount breakage, which involves undoing akpm's change above]

Reported-by: Ingo Molnar
Signed-off-by: Vegard Nossum
Cc: Al Viro
Cc: Pekka Enberg
Signed-off-by: Andrew Morton
Signed-off-by: al

Vegard Nossum
2009-09-24 20:40:15 +0800

08 Aug, 2009

1 commit

2d8dd38a5 vfs: mnt_want_write_file(): fix special file handling ... Browse Code »

I suspect that mnt_want_write_file() may have wrong assumption. I think
mnt_want_write_file() is assuming it increments ->mnt_writers if
(file->f_mode & FMODE_WRITE). But, if it's special_file(), it is false?

Signed-off-by: OGAWA Hirofumi
Acked-by: Dave Hansen
Cc: Al Viro
Cc: Nick Piggin
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

OGAWA Hirofumi
2009-08-08 01:39:56 +0800

09 Jul, 2009

1 commit

b43f3cbd2 headers: mnt_namespace.h redux ... Browse Code »

Fix various silly problems wrt mnt_namespace.h:

- exit_mnt_ns() isn't used, remove it
- done that, sched.h and nsproxy.h inclusions aren't needed
- mount.h inclusion was need for vfsmount_lock, but no longer
- remove mnt_namespace.h inclusion from files which don't use anything
from mnt_namespace.h

Signed-off-by: Alexey Dobriyan
Signed-off-by: Linus Torvalds

Alexey Dobriyan
2009-07-09 00:31:56 +0800

24 Jun, 2009

2 commits

f21f62208 ... and the same for vfsmount id/mount group id ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2009-06-24 20:15:26 +0800
3b22edc57 VFS: Switch init_mount_tree() to use the new create_mnt_ns() helper ... Browse Code »

Eliminates some duplicated code...

Signed-off-by: Trond Myklebust
Signed-off-by: Al Viro

Trond Myklebust
2009-06-24 20:15:24 +0800

23 Jun, 2009

2 commits

cf8d2c11c VFS: Add VFS helper functions for setting up private namespaces ... Browse Code »

The purpose of this patch is to improve the remote mount path lookup
support for distributed filesystems such as the NFSv4 client.

When given a mount command of the form "mount server:/foo/bar /mnt", the
NFSv4 client is required to look up the filehandle for "server:/", and
then look up each component of the remote mount path "foo/bar" in order
to find the directory that is actually going to be mounted on /mnt.
Following that remote mount path may involve following symlinks,
crossing server-side mount points and even following referrals to
filesystem volumes on other servers.

Since the standard VFS path lookup code already supports walking paths
that contain all these features (using in-kernel automounts for
following referrals) we would like to be able to reuse that rather than
duplicate the full path traversal functionality in the NFSv4 client code.

This patch therefore defines a VFS helper function create_mnt_ns(), that
sets up a temporary filesystem namespace and attaches a root filesystem to
it. It exports the create_mnt_ns() and put_mnt_ns() function for use by
filesystem modules.

Signed-off-by: Trond Myklebust
Signed-off-by: Linus Torvalds

Trond Myklebust
2009-06-23 12:28:25 +0800
616511d03 VFS: Uninline the function put_mnt_ns() ... Browse Code »

In order to allow modules to use it without having to export vfsmount_lock.

Signed-off-by: Trond Myklebust
Signed-off-by: Linus Torvalds

Trond Myklebust
2009-06-23 12:28:25 +0800

12 Jun, 2009

1 commit

4aa98cf76 Push BKL down into do_remount_sb() ... Browse Code »

[folded fix from Jiri Slaby]

Signed-off-by: Al Viro

Al Viro
2009-06-12 09:36:08 +0800