Eric Lee / smarc-ti-linux-kernel | Embedian Git Server

22 Nov, 2013

1 commit

76ae281f6 configfs: fix race between dentry put and lookup ... Browse Code »
2

A race window in configfs, it starts from one dentry is UNHASHED and end
before configfs_d_iput is called. In this window, if a lookup happen,
since the original dentry was UNHASHED, so a new dentry will be
allocated, and then in configfs_attach_attr(), sd->s_dentry will be
updated to the new dentry. Then in configfs_d_iput(),
BUG_ON(sd->s_dentry != dentry) will be triggered and system panic.

sys_open: sys_close:
... fput
dput
dentry_kill
__d_drop dentry still point
to this dentry.

lookup_real
configfs_lookup
configfs_attach_attr---> update sd->s_dentry
to new allocated dentry here.

d_kill
configfs_d_iput s_dentry != dentry)
triggered here.

To fix it, change configfs_d_iput to not update sd->s_dentry if
sd->s_count > 2, that means there are another dentry is using the sd
beside the one that is going to be put. Use configfs_dirent_lock in
configfs_attach_attr to sync with configfs_d_iput.

With the following steps, you can reproduce the bug.

1. enable ocfs2, this will mount configfs at /sys/kernel/config and
fill configure in it.

2. run the following script.
while [ 1 ]; do cat /sys/kernel/config/cluster/$your_cluster_name/idle_timeout_ms > /dev/null; done &
while [ 1 ]; do cat /sys/kernel/config/cluster/$your_cluster_name/idle_timeout_ms > /dev/null; done &

Signed-off-by: Junxiao Bi
Cc: Joel Becker
Cc: Al Viro
Cc:
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Junxiao Bi
2013-11-22 08:42:27 +0800

16 Nov, 2013

1 commit

b26d4cd38 consolidate simple ->d_delete() instances ... Browse Code »

Rename simple_delete_dentry() to always_delete_dentry() and export it.
Export simple_dentry_operations, while we are at it, and get rid of
their duplicates

Signed-off-by: Al Viro

Al Viro
2013-11-16 11:04:17 +0800

15 Jul, 2013

1 commit

41d9884c4 Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs ... Browse Code »

Pull more vfs stuff from Al Viro:
"O_TMPFILE ABI changes, Oleg's fput() series, misc cleanups, including
making simple_lookup() usable for filesystems with non-NULL s_d_op,
which allows us to get rid of quite a bit of ugliness"

* 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs:
sunrpc: now we can just set ->s_d_op
cgroup: we can use simple_lookup() now
efivarfs: we can use simple_lookup() now
make simple_lookup() usable for filesystems that set ->s_d_op
configfs: don't open-code d_alloc_name()
__rpc_lookup_create_exclusive: pass string instead of qstr
rpc_create_*_dir: don't bother with qstr
llist: llist_add() can use llist_add_batch()
llist: fix/simplify llist_add() and llist_add_batch()
fput: turn "list_head delayed_fput_list" into llist_head
fs/file_table.c:fput(): add comment
Safer ABI for O_TMPFILE

Linus Torvalds
2013-07-15 02:42:26 +0800

14 Jul, 2013

1 commit

ec193cf5a configfs: don't open-code d_alloc_name() ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2013-07-14 21:16:52 +0800

10 Jul, 2013

1 commit

c75e24752 Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs ... Browse Code »

Pull third set of VFS updates from Al Viro:
"Misc stuff all over the place. There will be one more pile in a
couple of days"

This is an "evil merge" that also uses the new d_count helper in
fs/configfs/dir.c, missed by commit 84d08fa888e7 ("helper for reading
->d_count")

* 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs:
ncpfs: fix error return code in ncp_parse_options()
locks: move file_lock_list to a set of percpu hlist_heads and convert file_lock_lock to an lglock
seq_file: add seq_list_*_percpu helpers
f2fs: fix readdir incorrectness
mode_t whack-a-mole...
lustre: kill the pointless wrapper
helper for reading ->d_count

Linus Torvalds
2013-07-10 02:26:44 +0800

04 Jul, 2013

1 commit

7121064b2 configfs: use capped length for ->store_attribute() ... Browse Code »

The difference between "count" and "len" is that "len" is capped at
4095. Changing it like this makes it match how sysfs_write_file() is
implemented.

This is a static analysis patch. I haven't found any store_attribute()
functions where this change makes a difference.

Signed-off-by: Dan Carpenter
Acked-by: Joel Becker
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Dan Carpenter
2013-07-04 07:07:23 +0800

29 Jun, 2013

1 commit

52018855e [readdir] convert configfs ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2013-06-29 16:56:30 +0800

04 Mar, 2013

1 commit

7f78e0351 fs: Limit sys_mount to only request filesystem modules. ... Browse Code »

Modify the request_module to prefix the file system type with "fs-"
and add aliases to all of the filesystems that can be built as modules
to match.

A common practice is to build all of the kernel code and leave code
that is not commonly needed as modules, with the result that many
users are exposed to any bug anywhere in the kernel.

Looking for filesystems with a fs- prefix limits the pool of possible
modules that can be loaded by mount to just filesystems trivially
making things safer with no real cost.

Using aliases means user space can control the policy of which
filesystem modules are auto-loaded by editing /etc/modprobe.d/*.conf
with blacklist and alias directives. Allowing simple, safe,
well understood work-arounds to known problematic software.

This also addresses a rare but unfortunate problem where the filesystem
name is not the same as it's module name and module auto-loading
would not work. While writing this patch I saw a handful of such
cases. The most significant being autofs that lives in the module
autofs4.

This is relevant to user namespaces because we can reach the request
module in get_fs_type() without having any special permissions, and
people get uncomfortable when a user specified string (in this case
the filesystem type) goes all of the way to request_module.

After having looked at this issue I don't think there is any
particular reason to perform any filtering or permission checks beyond
making it clear in the module request that we want a filesystem
module. The common pattern in the kernel is to call request_module()
without regards to the users permissions. In general all a filesystem
module does once loaded is call register_filesystem() and go to sleep.
Which means there is not much attack surface exposed by loading a
filesytem module unless the filesystem is mounted. In a user
namespace filesystems are not mounted unless .fs_flags = FS_USERNS_MOUNT,
which most filesystems do not set today.

Acked-by: Serge Hallyn
Acked-by: Kees Cook
Reported-by: Kees Cook
Signed-off-by: "Eric W. Biederman"

Eric W. Biederman
2013-03-04 11:36:31 +0800

27 Feb, 2013

1 commit

d895cb1af Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs ... Browse Code »

Pull vfs pile (part one) from Al Viro:
"Assorted stuff - cleaning namei.c up a bit, fixing ->d_name/->d_parent
locking violations, etc.

The most visible changes here are death of FS_REVAL_DOT (replaced with
"has ->d_weak_revalidate()") and a new helper getting from struct file
to inode. Some bits of preparation to xattr method interface changes.

Misc patches by various people sent this cycle *and* ocfs2 fixes from
several cycles ago that should've been upstream right then.

PS: the next vfs pile will be xattr stuff."

* 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs: (46 commits)
saner proc_get_inode() calling conventions
proc: avoid extra pde_put() in proc_fill_super()
fs: change return values from -EACCES to -EPERM
fs/exec.c: make bprm_mm_init() static
ocfs2/dlm: use GFP_ATOMIC inside a spin_lock
ocfs2: fix possible use-after-free with AIO
ocfs2: Fix oops in ocfs2_fast_symlink_readpage() code path
get_empty_filp()/alloc_file() leave both ->f_pos and ->f_version zero
target: writev() on single-element vector is pointless
export kernel_write(), convert open-coded instances
fs: encode_fh: return FILEID_INVALID if invalid fid_type
kill f_vfsmnt
vfs: kill FS_REVAL_DOT by adding a d_weak_revalidate dentry op
nfsd: handle vfs_getattr errors in acl protocol
switch vfs_getattr() to struct path
default SET_PERSONALITY() in linux/elf.h
ceph: prepopulate inodes only when request is aborted
d_hash_and_lookup(): export, switch open-coded instances
9p: switch v9fs_set_create_acl() to inode+fid, do it before d_instantiate()
9p: split dropping the acls from v9fs_set_create_acl()
...

Linus Torvalds
2013-02-27 12:16:07 +0800

23 Feb, 2013

1 commit

496ad9aa8 new helper: file_inode(file) ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2013-02-23 12:31:31 +0800

22 Feb, 2013

1 commit

49deb4bc2 configfs: move the dereference below the NULL test ... Browse Code »

The dereference should be moved below the NULL test.

spatch with a semantic match is used to found this.
(http://coccinelle.lip6.fr/)

Signed-off-by: Wei Yongjun
Cc: Joel Becker
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Wei Yongjun
2013-02-22 09:22:19 +0800

18 Dec, 2012

1 commit

965c8e59c lseek: the "whence" argument is called "whence" ... Browse Code »
13

But the kernel decided to call it "origin" instead. Fix most of the
sites.

Acked-by: Hugh Dickins
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Andrew Morton
2012-12-18 09:15:12 +0800

18 Sep, 2012

1 commit

69552c0c5 userns: Convert configfs to use kuid and kgid where appropriate ... Browse Code »

Cc: Joel Becker
Acked-by: Serge Hallyn
Signed-off-by: Eric W. Biederman

Eric W. Biederman
2012-09-18 16:01:37 +0800

14 Jul, 2012

1 commit

00cd8dd3b stop passing nameidata to ->lookup() ... Browse Code »

Just the flags; only NFS cares even about that, but there are
legitimate uses for such argument. And getting rid of that
completely would require splitting ->lookup() into a couple
of methods (at least), so let's leave that alone for now...

Signed-off-by: Al Viro

Al Viro
2012-07-14 20:34:32 +0800

21 Mar, 2012

6 commits

2a152ad3a make configfs_pin_fs() return root dentry on success ... Browse Code »

... and make configfs_mnt static

Signed-off-by: Al Viro

Al Viro
2012-03-21 09:29:48 +0800
0dd6c08a0 configfs: configfs_create_dir() has parent dentry in dentry->d_parent ... Browse Code »

no need to play sick games with parent item, internal mount, etc.

Signed-off-by: Al Viro

Al Viro
2012-03-21 09:29:47 +0800
16d13b59b configfs: sanitize configfs_create() ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2012-03-21 09:29:47 +0800
b7c177fcd configfs: kill configfs_sb ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2012-03-21 09:29:47 +0800
81d44ed15 configfs: don't bother with checks for mkdir/rmdir/unlink/symlink in root ... Browse Code »

just give root directory separate inode_operations without all those
methods...

Signed-off-by: Al Viro

Al Viro
2012-03-21 09:29:46 +0800
48fde701a switch open-coded instances of d_make_root() to new helper ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2012-03-21 09:29:35 +0800

04 Jan, 2012

4 commits

439475140 configfs: convert to umode_t ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:54:57 +0800
18bb1db3e switch vfs_mkdir() and ->mkdir() to umode_t ... Browse Code »

vfs_mkdir() gets int, but immediately drops everything that might not
fit into umode_t and that's the only caller of ->mkdir()...

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:54:53 +0800
c972b4bc8 vfs: live vfsmounts never have NULL ->mnt_sb ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:52:42 +0800
4c1d5a64f vfs: for usbfs, etc. internal vfsmounts ->mnt_sb->s_root == ->mnt_root ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:52:41 +0800

14 Dec, 2011

1 commit

7c6455e36 configfs: register_filesystem() called too early ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2011-12-14 01:35:15 +0800

28 Sep, 2011

1 commit

395cf9691 doc: fix broken references ... Browse Code »

There are numerous broken references to Documentation files (in other
Documentation files, in comments, etc.). These broken references are
caused by typo's in the references, and by renames or removals of the
Documentation files. Some broken references are simply odd.

Fix these broken references, sometimes by dropping the irrelevant text
they were part of.

Signed-off-by: Paul Bolle
Signed-off-by: Jiri Kosina

Paul Bolle
2011-09-28 00:08:04 +0800

28 May, 2011

1 commit

98702467f configfs: remove unnecessary dentry_unhash on rmdir, dir rename ... Browse Code »

configfs does not have problems with references to unlinked directories.

CC: Joel Becker
Signed-off-by: Sage Weil
Signed-off-by: Al Viro

Sage Weil
2011-05-28 13:02:54 +0800

27 May, 2011

1 commit

32e51f141 Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs-2.6 ... Browse Code »

* 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs-2.6: (25 commits)
cifs: remove unnecessary dentry_unhash on rmdir/rename_dir
ocfs2: remove unnecessary dentry_unhash on rmdir/rename_dir
exofs: remove unnecessary dentry_unhash on rmdir/rename_dir
nfs: remove unnecessary dentry_unhash on rmdir/rename_dir
ext2: remove unnecessary dentry_unhash on rmdir/rename_dir
ext3: remove unnecessary dentry_unhash on rmdir/rename_dir
ext4: remove unnecessary dentry_unhash on rmdir/rename_dir
btrfs: remove unnecessary dentry_unhash in rmdir/rename_dir
ceph: remove unnecessary dentry_unhash calls
vfs: clean up vfs_rename_other
vfs: clean up vfs_rename_dir
vfs: clean up vfs_rmdir
vfs: fix vfs_rename_dir for FS_RENAME_DOES_D_MOVE filesystems
libfs: drop unneeded dentry_unhash
vfs: update dentry_unhash() comment
vfs: push dentry_unhash on rename_dir into file systems
vfs: push dentry_unhash on rmdir into file systems
vfs: remove dget() from dentry_unhash()
vfs: dentry_unhash immediately prior to rmdir
vfs: Block mmapped writes while the fs is frozen
...

Linus Torvalds
2011-05-27 00:52:14 +0800

26 May, 2011

1 commit

79bf7c732 vfs: push dentry_unhash on rmdir into file systems ... Browse Code »

Only a few file systems need this. Start by pushing it down into each
fs rmdir method (except gfs2 and xfs) so it can be dealt with on a per-fs
basis.

This does not change behavior for any in-tree file systems.

Acked-by: Christoph Hellwig
Signed-off-by: Sage Weil
Signed-off-by: Al Viro

Sage Weil
2011-05-26 19:26:47 +0800

18 May, 2011

2 commits

24307aa1e configfs: Fix race between configfs_readdir() and configfs_d_iput() ... Browse Code »

configfs_readdir() will use the existing inode numbers of inodes in the
dcache, but it makes them up for attribute files that aren't currently
instantiated. There is a race where a closing attribute file can be
tearing down at the same time as configfs_readdir() is trying to get its
inode number.

We want to get the inode number of open attribute files, because they
should match while instantiated. We can't lock down the transition
where dentry->d_inode is set to NULL, so we just check for NULL there.
We can, however, ensure that an inode we find isn't iput() in
configfs_d_iput() until after we've accessed it.

Signed-off-by: Joel Becker

Joel Becker
2011-05-18 19:08:16 +0800
df7f99670 configfs: Don't try to d_delete() negative dentries. ... Browse Code »

When configfs is faking mkdir() on its subsystem or default group
objects, it starts by adding a negative dentry. It then tries to
instantiate the group. If that should fail, it must clean up after
itself.

I was using d_delete() here, but configfs_attach_group() promises to
return an empty dentry on error. d_delete() explodes with the entry
dentry. Let's try d_drop() instead. The unhashing is what we want for
our dentry.

Signed-off-by: Joel Becker

Joel Becker
2011-05-18 18:30:58 +0800

31 Mar, 2011

1 commit

25985edce Fix common misspellings ... Browse Code »

Fixes generated by 'codespell' and manually reviewed.

Signed-off-by: Lucas De Marchi

Lucas De Marchi
2011-03-31 22:26:23 +0800

17 Jan, 2011

1 commit

e20511728 configfs: change depends -> select SYSFS ... Browse Code »

This patch changes configfs to select SYSFS to fix the following:

warning: (TARGET_CORE && GFS2_FS) selects CONFIGFS_FS which has unmet direct dependencies (SYSFS)

Reported-by: Randy Dunlap
Signed-off-by: Nicholas A. Bellinger
Acked-by: Joel Becker

Nicholas Bellinger
2011-01-17 05:22:29 +0800

13 Jan, 2011

1 commit

d463a0c4b switch configfs ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2011-01-13 09:03:12 +0800

07 Jan, 2011

6 commits

fb045adb9 fs: dcache reduce branches in lookup path ... Browse Code »

Reduce some branches and memory accesses in dcache lookup by adding dentry
flags to indicate common d_ops are set, rather than having to check them.
This saves a pointer memory access (dentry->d_op) in common path lookup
situations, and saves another pointer load and branch in cases where we
have d_op but not the particular operation.

Patched with:

git grep -E '[.>]([[:space:]])*d_op([[:space:]])*=' | xargs sed -e 's/$[^\t ]*$->d_op = $.*$;/d_set_d_op(\1, \2);/' -e 's/$[^\t ]*$\.d_op = $.*$;/d_set_d_op(\&\1, \2);/' -i

Signed-off-by: Nick Piggin

Nick Piggin
2011-01-07 14:50:28 +0800
dc0474be3 fs: dcache rationalise dget variants ... Browse Code »

dget_locked was a shortcut to avoid the lazy lru manipulation when we already
held dcache_lock (lru manipulation was relatively cheap at that point).
However, how that the lru lock is an innermost one, we never hold it at any
caller, so the lock cost can now be avoided. We already have well working lazy
dcache LRU, so it should be fine to defer LRU manipulations to scan time.

Signed-off-by: Nick Piggin

Nick Piggin
2011-01-07 14:50:24 +0800
b5c84bf6f fs: dcache remove dcache_lock ... Browse Code »

dcache_lock no longer protects anything. remove it.

Signed-off-by: Nick Piggin

Nick Piggin
2011-01-07 14:50:23 +0800
da5029563 fs: dcache scale d_unhashed ... Browse Code »

Protect d_unhashed(dentry) condition with d_lock. This means keeping
DCACHE_UNHASHED bit in synch with hash manipulations.

Signed-off-by: Nick Piggin

Nick Piggin
2011-01-07 14:50:21 +0800
b7ab39f63 fs: dcache scale dentry refcount ... Browse Code »

Make d_count non-atomic and protect it with d_lock. This allows us to ensure a
0 refcount dentry remains 0 without dcache_lock. It is also fairly natural when
we start protecting many other dentry members with d_lock.

Signed-off-by: Nick Piggin

Nick Piggin
2011-01-07 14:50:21 +0800
fe15ce446 fs: change d_delete semantics ... Browse Code »

Change d_delete from a dentry deletion notification to a dentry caching
advise, more like ->drop_inode. Require it to be constant and idempotent,
and not take d_lock. This is how all existing filesystems use the callback
anyway.

This makes fine grained dentry locking of dput and dentry lru scanning
much simpler.

Signed-off-by: Nick Piggin

Nick Piggin
2011-01-07 14:50:18 +0800