Eric Lee / smarc-fsl-linux-kernel

20 Nov, 2008

1 commit

f9454548e don't unlink an active swapfile ... Browse Code »

Peter Cordes is sorry that he rm'ed his swapfiles while they were in use,
he then had no pathname to swapoff. It's a curious little oversight, but
not one worth a lot of hackery. Kudos to Willy Tarreau for turning this
around from a discussion of synthetic pathnames to how to prevent unlink.
Mimic immutable: prohibit unlinking an active swapfile in may_delete()
(and don't worry my little head over the tiny race window).

Signed-off-by: Hugh Dickins
Cc: Willy Tarreau
Acked-by: Christoph Hellwig
Cc: Peter Cordes
Cc: Bodo Eggert
Cc: David Newall
Cc: Peter Zijlstra
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Hugh Dickins
2008-11-20 10:49:59 +0800

23 Oct, 2008

8 commits

f696a3659 [PATCH] move executable checking into ->permission() ... Browse Code »

For execute permission on a regular files we need to check if file has
any execute bits at all, regardless of capabilites.

This check is normally performed by generic_permission() but was also
added to the case when the filesystem defines its own ->permission()
method. In the latter case the filesystem should be responsible for
performing this check.

Move the check from inode_permission() inside filesystems which are
not calling generic_permission().

Create a helper function execute_ok() that returns true if the inode
is a directory or if any execute bits are present in i_mode.

Also fix up the following code:

- coda control file is never executable
- sysctl files are never executable
- hfs_permission seems broken on MAY_EXEC, remove
- hfsplus_permission is eqivalent to generic_permission(), remove

Signed-off-by: Miklos Szeredi

Miklos Szeredi
2008-10-23 17:13:25 +0800
4e9ed2f85 [PATCH vfs-2.6 6/6] vfs: add LOOKUP_RENAME_TARGET intent ... Browse Code »

This adds LOOKUP_RENAME_TARGET intent for lookup of rename destination.

LOOKUP_RENAME_TARGET is going to be used like LOOKUP_CREATE. But since
the destination of rename() can be existing directory entry, so it has a
difference. Although that difference doesn't matter in my usage, this
tells it to user of this intent.

Signed-off-by: OGAWA Hirofumi

OGAWA Hirofumi
2008-10-23 17:13:20 +0800
0612d9fb2 [PATCH vfs-2.6 5/6] vfs: remove LOOKUP_PARENT from non LOOKUP_PARENT lookup ... Browse Code »

lookup_hash() with LOOKUP_PARENT is bogus. And this prepares to add
new intent on those path.

The user of LOOKUP_PARENT intent is nfs only, and it checks whether
nd->flags has LOOKUP_CREATE or LOOKUP_OPEN, so the result is same.

Signed-off-by: OGAWA Hirofumi

OGAWA Hirofumi
2008-10-23 17:13:19 +0800
e2761a116 [PATCH vfs-2.6 2/6] vfs: add d_ancestor() ... Browse Code »

This adds d_ancestor() instead of d_isparent(), then use it.

If new_dentry == old_dentry, is_subdir() returns 1, looks strange.
"new_dentry == old_dentry" is not subdir obviously. But I'm not
checking callers for now, so this keeps current behavior.

Signed-off-by: OGAWA Hirofumi

OGAWA Hirofumi
2008-10-23 17:13:16 +0800
871c0067d [PATCH vfs-2.6 1/6] vfs: replace parent == dentry->d_parent by IS_ROOT() ... Browse Code »

Signed-off-by: OGAWA Hirofumi

OGAWA Hirofumi
2008-10-23 17:13:16 +0800
3516586a4 [PATCH] make O_EXCL in nd->intent.flags visible in nd->flags ... Browse Code »

New flag: LOOKUP_EXCL. Set before doing the final step of pathname
resolution on the paths that have LOOKUP_CREATE and O_EXCL.

Signed-off-by: Al Viro

Al Viro
2008-10-23 17:12:56 +0800
8737f3a1b [PATCH] get rid of path_lookup_create() ... Browse Code »

... and don't pass bogus flags when we are just looking for parent.
Fold __path_lookup_intent_open() into path_lookup_open() while we
are at it; that's the only remaining caller.

Signed-off-by: Al Viro

Al Viro
2008-10-23 17:12:54 +0800
d18114657 [PATCH] new helper - kern_path() ... Browse Code »

Analog of lookup_path(), takes struct path *.

Signed-off-by: Al Viro

Al Viro
2008-10-23 15:34:19 +0800

01 Aug, 2008

2 commits

a95164d97 [patch 3/4] vfs: remove unused nameidata argument of may_create() ... Browse Code »

Signed-off-by: Miklos Szeredi
Signed-off-by: Al Viro

Miklos Szeredi
2008-08-01 23:25:30 +0800
f418b0060 Re: BUG at security/selinux/avc.c:883 (was: Re: linux-next: Tree ... Browse Code »

for July 17: early crash on x86-64)

SELinux needs MAY_APPEND to be passed down to the security hook.
Otherwise, we get permission denials when only append permission is
granted by policy even if the opening process specified O_APPEND.
Shows up as a regression in the ltp selinux testsuite, fixed by
this patch.

Signed-off-by: Stephen Smalley
Signed-off-by: Al Viro

Stephen Smalley
2008-08-01 23:25:21 +0800

27 Jul, 2008

14 commits

964bd1836 [PATCH] get rid of __user_path_lookup_open ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2008-07-27 08:53:41 +0800
2ad94ae65 [PATCH] new (local) helper: user_path_parent() ... Browse Code »

Preparation to untangling intents mess: reduce the number of do_path_lookup()
callers.

Signed-off-by: Al Viro

Al Viro
2008-07-27 08:53:35 +0800
2d8f30380 [PATCH] sanitize __user_walk_fd() et.al. ... Browse Code »

* do not pass nameidata; struct path is all the callers want.
* switch to new helpers:
user_path_at(dfd, pathname, flags, &path)
user_path(pathname, &path)
user_lpath(pathname, &path)
user_path_dir(pathname, &path) (fail if not a directory)
The last 3 are trivial macro wrappers for the first one.
* remove nameidata in callers.

Signed-off-by: Al Viro

Al Viro
2008-07-27 08:53:34 +0800
f419a2e3b [PATCH] kill nameidata passing to permission(), rename to inode_permission() ... Browse Code »

Incidentally, the name that gives hundreds of false positives on grep
is not a good idea...

Signed-off-by: Al Viro

Al Viro
2008-07-27 08:53:31 +0800
30524472c [PATCH] take noexec checks to very few callers that care ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2008-07-27 08:53:30 +0800
672b16b2f [PATCH] more nameidata removal: exec_permission_lite() doesn't need it ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2008-07-27 08:53:23 +0800
b77b0646e [PATCH] pass MAY_OPEN to vfs_permission() explicitly ... Browse Code »

... and get rid of the last "let's deduce mask from nameidata->flags"
bit.

Signed-off-by: Al Viro

Al Viro
2008-07-27 08:53:22 +0800
a110343f0 [PATCH] fix MAY_CHDIR/MAY_ACCESS/LOOKUP_ACCESS mess ... Browse Code »

* MAY_CHDIR is redundant - it's an equivalent of MAY_ACCESS
* MAY_ACCESS on fuse should affect only the last step of pathname resolution
* fchdir() and chroot() should pass MAY_ACCESS, for the same reason why
chdir() needs that.
* now that we pass MAY_ACCESS explicitly in all cases, LOOKUP_ACCESS can be
removed; it has no business being in nameidata.

Signed-off-by: Al Viro

Al Viro
2008-07-27 08:53:21 +0800
7f2da1e7d [PATCH] kill altroot ... Browse Code »

long overdue...

Signed-off-by: Al Viro

Al Viro
2008-07-27 08:53:20 +0800
8bb79224b [PATCH] permission checks for chdir need special treatment only on the last step ... Browse Code »

... so we ought to pass MAY_CHDIR to vfs_permission() instead of having
it triggered on every step of preceding pathname resolution. LOOKUP_CHDIR
is killed by that.

Signed-off-by: Al Viro

Al Viro
2008-07-27 08:53:19 +0800
db2e747b1 [patch 5/5] vfs: remove mode parameter from vfs_symlink() ... Browse Code »

Remove the unused mode parameter from vfs_symlink and callers.

Thanks to Tetsuo Handa for noticing.

CC: Tetsuo Handa
Signed-off-by: Miklos Szeredi

Miklos Szeredi
2008-07-27 08:53:18 +0800
7e79eedb3 [patch 4/5] vfs: reuse local variable in vfs_link() ... Browse Code »

Why not reuse "inode" which is assigned as

struct inode *inode = old_dentry->d_inode;

in the beginning of vfs_link() ?

Signed-off-by: Tetsuo Handa
Signed-off-by: Miklos Szeredi

Tetsuo Handa
2008-07-27 08:53:17 +0800
e6305c43e [PATCH] sanitize ->permission() prototype ... Browse Code »

* kill nameidata * argument; map the 3 bits in ->flags anybody cares
about to new MAY_... ones and pass with the mask.
* kill redundant gfs2_iop_permission()
* sanitize ecryptfs_permission()
* fix remaining places where ->permission() instances might barf on new
MAY_... found in mask.

The obvious next target in that direction is permission(9)

folded fix for nfs_permission() breakage from Miklos Szeredi

Signed-off-by: Al Viro

Al Viro
2008-07-27 08:53:14 +0800
d70b67c8b [patch] vfs: fix lookup on deleted directory ... Browse Code »

Lookup can install a child dentry for a deleted directory. This keeps
the directory dentry alive, and the inode pinned in the cache and on
disk, even after all external references have gone away.

This isn't a big problem normally, since memory pressure or umount
will clear out the directory dentry and its children, releasing the
inode. But for UBIFS this causes problems because its orphan area can
overflow.

Fix this by returning ENOENT for all lookups on a S_DEAD directory
before creating a child dentry.

Thanks to Zoltan Sogor for noticing this while testing UBIFS, and
Artem for the excellent analysis of the problem and testing.

Reported-by: Artem Bityutskiy
Tested-by: Artem Bityutskiy
Signed-off-by: Miklos Szeredi
Signed-off-by: Al Viro

Miklos Szeredi
2008-07-27 08:53:05 +0800

23 Jun, 2008

2 commits

694a1764d [patch 3/4] vfs: fix ERR_PTR abuse in generic_readlink ... Browse Code »

generic_readlink calls ERR_PTR for negative and positive values
(vfs_readlink returns length of "link"), but it should not
(not an errno) and does not need to.

Signed-off-by: Marcin Slusarz
Cc: Al Viro
Cc: Christoph Hellwig
Acked-by: Miklos Szeredi
Signed-off-by: Andrew Morton
Signed-off-by: Al Viro

Marcin Slusarz
2008-06-23 23:52:30 +0800
c8e7f449b [patch 1/4] vfs: path_{get,put}() cleanups ... Browse Code »

Here are some more places where path_{get,put}() can be used instead of
dput()/mntput() pair.

Signed-off-by: Jan Blunck
Cc: Al Viro
Cc: Jens Axboe
Signed-off-by: Andrew Morton
Signed-off-by: Al Viro

Jan Blunck
2008-06-23 23:52:29 +0800

17 May, 2008

1 commit

e9baf6e59 [PATCH] return to old errno choice in mkdir() et.al. ... Browse Code »

In case when both EEXIST and EROFS would apply we used to
return the former in mkdir(2) and friends. Lest anyone suspects
us of being consistent, in the same situation knfsd gave clients
nfs_erofs...

ro-bind series had switched the syscall side of things to
returning -EROFS and immediately broke an application - namely,
mkdir -p. Patch restores the original behaviour...

Signed-off-by: Al Viro

Al Viro
2008-05-17 05:23:18 +0800

29 Apr, 2008

1 commit

08ce5f16e cgroups: implement device whitelist ... Browse Code »

Implement a cgroup to track and enforce open and mknod restrictions on device
files. A device cgroup associates a device access whitelist with each cgroup.
A whitelist entry has 4 fields. 'type' is a (all), c (char), or b (block).
'all' means it applies to all types and all major and minor numbers. Major
and minor are either an integer or * for all. Access is a composition of r
(read), w (write), and m (mknod).

The root device cgroup starts with rwm to 'all'. A child devcg gets a copy of
the parent. Admins can then remove devices from the whitelist or add new
entries. A child cgroup can never receive a device access which is denied its
parent. However when a device access is removed from a parent it will not
also be removed from the child(ren).

An entry is added using devices.allow, and removed using
devices.deny. For instance

echo 'c 1:3 mr' > /cgroups/1/devices.allow

allows cgroup 1 to read and mknod the device usually known as
/dev/null. Doing

echo a > /cgroups/1/devices.deny

will remove the default 'a *:* mrw' entry.

CAP_SYS_ADMIN is needed to change permissions or move another task to a new
cgroup. A cgroup may not be granted more permissions than the cgroup's parent
has. Any task can move itself between cgroups. This won't be sufficient, but
we can decide the best way to adequately restrict movement later.

[akpm@linux-foundation.org: coding-style fixes]
[akpm@linux-foundation.org: fix may-be-used-uninitialized warning]
Signed-off-by: Serge E. Hallyn
Acked-by: James Morris
Looks-good-to: Pavel Emelyanov
Cc: Daniel Hokka Zakrisson
Cc: Li Zefan
Cc: Paul Menage
Cc: Balbir Singh
Cc: KAMEZAWA Hiroyuki
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Serge E. Hallyn
2008-04-29 23:06:09 +0800

19 Apr, 2008

7 commits

4a3fd211c [PATCH] r/o bind mounts: elevate write count for open()s ... Browse Code »

This is the first really tricky patch in the series. It elevates the writer
count on a mount each time a non-special file is opened for write.

We used to do this in may_open(), but Miklos pointed out that __dentry_open()
is used as well to create filps. This will cover even those cases, while a
call in may_open() would not have.

There is also an elevated count around the vfs_create() call in open_namei().
See the comments for more details, but we need this to fix a 'create, remount,
fail r/w open()' race.

Some filesystems forego the use of normal vfs calls to create
struct files. Make sure that these users elevate the mnt
writer count because they will get __fput(), and we need
to make sure they're balanced.

Acked-by: Al Viro
Signed-off-by: Christoph Hellwig
Signed-off-by: Dave Hansen
Signed-off-by: Andrew Morton
Signed-off-by: Al Viro

Dave Hansen
2008-04-19 12:29:25 +0800
9079b1eb1 [PATCH] r/o bind mounts: get write access for vfs_rename() callers ... Browse Code »

This also uses the little helper in the NFS code to make an if() a little bit
less ugly. We introduced the helper at the beginning of the series.

Acked-by: Al Viro
Signed-off-by: Christoph Hellwig
Signed-off-by: Dave Hansen
Signed-off-by: Andrew Morton
Signed-off-by: Al Viro

Dave Hansen
2008-04-19 12:25:34 +0800
75c3f29de [PATCH] r/o bind mounts: write counts for link/symlink ... Browse Code »

[AV: add missing nfsd pieces]

Acked-by: Al Viro
Signed-off-by: Christoph Hellwig
Signed-off-by: Dave Hansen
Signed-off-by: Al Viro

Dave Hansen
2008-04-19 12:25:34 +0800
463c31972 [PATCH] r/o bind mounts: get callers of vfs_mknod/create/mkdir() ... Browse Code »

This takes care of all of the direct callers of vfs_mknod().
Since a few of these cases also handle normal file creation
as well, this also covers some calls to vfs_create().

So that we don't have to make three mnt_want/drop_write()
calls inside of the switch statement, we move some of its
logic outside of the switch and into a helper function
suggested by Christoph.

This also encapsulates a fix for mknod(S_IFREG) that Miklos
found.

[AV: merged mkdir handling, added missing nfsd pieces]

Acked-by: Al Viro
Signed-off-by: Christoph Hellwig
Signed-off-by: Dave Hansen
Signed-off-by: Andrew Morton
Signed-off-by: Al Viro

Dave Hansen
2008-04-19 12:25:34 +0800
0622753b8 [PATCH] r/o bind mounts: elevate write count for rmdir and unlink. ... Browse Code »

Elevate the write count during the vfs_rmdir() and vfs_unlink().

[AV: merged rmdir and unlink parts, added missing pieces in nfsd]

Acked-by: Serge Hallyn
Acked-by: Al Viro
Signed-off-by: Christoph Hellwig
Signed-off-by: Dave Hansen
Signed-off-by: Andrew Morton
Signed-off-by: Al Viro

Dave Hansen
2008-04-19 12:25:33 +0800
a70e65df8 [PATCH] merge open_namei() and do_filp_open() ... Browse Code »

open_namei() will, in the future, need to take mount write counts
over its creation and truncation (via may_open()) operations. It
needs to keep these write counts until any potential filp that is
created gets __fput()'d.

This gets complicated in the error handling and becomes very murky
as to how far open_namei() actually got, and whether or not that
mount write count was taken. That makes it a bad interface.

All that the current do_filp_open() really does is allocate the
nameidata on the stack, then call open_namei().

So, this merges those two functions and moves filp_open() over
to namei.c so it can be close to its buddy: do_filp_open(). It
also gets a kerneldoc comment in the process.

Acked-by: Al Viro
Signed-off-by: Christoph Hellwig
Signed-off-by: Andrew Morton
Signed-off-by: Dave Hansen
Signed-off-by: Al Viro

Christoph Hellwig
2008-04-19 12:25:32 +0800
d57999e15 [PATCH] do namei_flags calculation inside open_namei() ... Browse Code »

My end goal here is to make sure all users of may_open()
return filps. This will ensure that we properly release
mount write counts which were taken for the filp in
may_open().

This patch moves the sys_open flags to namei flags
calculation into fs/namei.c. We'll shortly be moving
the nameidata_to_filp() calls into namei.c, and this
gets the sys_open flags to a place where we can get
at them when we need them.

Acked-by: Al Viro
Signed-off-by: Christoph Hellwig
Signed-off-by: Dave Hansen
Signed-off-by: Al Viro

Dave Hansen
2008-04-19 12:25:31 +0800

25 Mar, 2008

1 commit

7ed7fe5e8 Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs-2.6 ... Browse Code »

* 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs-2.6:
[PATCH] get stack footprint of pathname resolution back to relative sanity
[PATCH] double iput() on failure exit in hugetlb
[PATCH] double dput() on failure exit in tiny-shmem
[PATCH] fix up new filp allocators
[PATCH] check for null vfsmount in dentry_open()
[PATCH] reiserfs: eliminate private use of struct file in xattr
[PATCH] sanitize hppfs
hppfs pass vfsmount to dentry_open()
[PATCH] restore export of do_kern_mount()

Linus Torvalds
2008-03-25 23:57:47 +0800

20 Mar, 2008

1 commit

a6b91919e fs: fix kernel-doc notation warnings ... Browse Code »

Fix kernel-doc notation warnings in fs/.

Warning(mmotm-2008-0314-1449//fs/super.c:560): missing initial short description on line:
* mark_files_ro
Warning(mmotm-2008-0314-1449//fs/locks.c:1277): missing initial short description on line:
* lease_get_mtime
Warning(mmotm-2008-0314-1449//fs/locks.c:1277): missing initial short description on line:
* lease_get_mtime
Warning(mmotm-2008-0314-1449//fs/namei.c:1368): missing initial short description on line:
* lookup_one_len: filesystem helper to lookup single pathname component
Warning(mmotm-2008-0314-1449//fs/buffer.c:3221): missing initial short description on line:
* bh_uptodate_or_lock: Test whether the buffer is uptodate
Warning(mmotm-2008-0314-1449//fs/buffer.c:3240): missing initial short description on line:
* bh_submit_read: Submit a locked buffer for reading
Warning(mmotm-2008-0314-1449//fs/fs-writeback.c:30): missing initial short description on line:
* writeback_acquire: attempt to get exclusive writeback access to a device
Warning(mmotm-2008-0314-1449//fs/fs-writeback.c:47): missing initial short description on line:
* writeback_in_progress: determine whether there is writeback in progress
Warning(mmotm-2008-0314-1449//fs/fs-writeback.c:58): missing initial short description on line:
* writeback_release: relinquish exclusive writeback access against a device.
Warning(mmotm-2008-0314-1449//include/linux/jbd.h:351): contents before sections
Warning(mmotm-2008-0314-1449//include/linux/jbd.h:561): contents before sections
Warning(mmotm-2008-0314-1449//fs/jbd/transaction.c:1935): missing initial short description on line:
* void journal_invalidatepage()

Signed-off-by: Randy Dunlap
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Randy Dunlap
2008-03-20 09:53:36 +0800

19 Mar, 2008

1 commit

a02f76c34 [PATCH] get stack footprint of pathname resolution back to relative sanity ... Browse Code »

Somebody had put struct nameidata in stack frame of link_path_walk().
Unfortunately, there are certain realities to deal with:
* It's in the middle of recursion. Depth is equal to the nesting
depth of symlinks, i.e. up to 8.
* struct namiedata is, even if one discards the intent junk,
at least 12 pointers + 5 ints.
* moreover, adding a stack frame is not free in that situation.
* there are fs methods called on top of that, and they also have
stack footprint.
* kernel stack is not infinite.

The thing is, even if one chooses to deal with -ESTALE that way (and it's
one hell of an overkill), the only thing that needs to be preserved is
vfsmount + dentry, not the entire struct nameidata.

Signed-off-by: Al Viro

Al Viro
2008-03-19 18:55:46 +0800

15 Feb, 2008

1 commit

6ac08c39a Use struct path in fs_struct ... Browse Code »

* Use struct path in fs_struct.

Signed-off-by: Andreas Gruenbacher
Signed-off-by: Jan Blunck
Acked-by: Christoph Hellwig
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Jan Blunck
2008-02-15 13:13:33 +0800