Eric Lee / smarc-fsl-linux-kernel

04 Jan, 2012

24 commits

9be96f3fd move fs/partitions to block/ ... Browse Code »
43

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:54:06 +0800
dabe0dc19 vfs: fix the rest of sget() races ... Browse Code »

unfortunately, just checking MS_BORN after having grabbed ->s_umount in
sget() is not enough; places that pick superblock from a list and
grab s_umount shared need the same check in addition to checking for
->s_root; otherwise three-way race between failing mount, sget() and
such list-walker can leave us with list-walker coming *second*, when
temporary active ref grabbed by sget() (to be dropped when sget()
notices that original mount has failed by checking MS_BORN) has
lead to deactivate_locked_super() from failing ->mount() *not* doing
->kill_sb() and just releasing ->s_umount. Once sget() gets through
and notices that MS_BORN had never been set it will drop the active
ref and fs will be shut down and kicked out of all lists, but it's
too late for something like sync_supers().

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:53:10 +0800
cf31e70d6 vfs: new helper - vfs_ustat() ... Browse Code »

... and bury user_get_super()/statfs_by_dentry() - they are
purely internal now.

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:53:07 +0800
c972b4bc8 vfs: live vfsmounts never have NULL ->mnt_sb ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:52:42 +0800
4c1d5a64f vfs: for usbfs, etc. internal vfsmounts ->mnt_sb->s_root == ->mnt_root ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:52:41 +0800
84b92d39f vfs: pipe.c is really non-modular ... Browse Code »

... so no exitcalls there. Not much would work if pipe(2) would stop
working, after all...

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:52:41 +0800
6b520e056 vfs: fix the stupidity with i_dentry in inode destructors ... Browse Code »

Seeing that just about every destructor got that INIT_LIST_HEAD() copied into
it, there is no point whatsoever keeping this INIT_LIST_HEAD in inode_init_once();
the cost of taking it into inode_init_always() will be negligible for pipes
and sockets and negative for everything else. Not to mention the removal of
boilerplate code from ->destroy_inode() instances...

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:52:40 +0800
2a79f17e4 vfs: mnt_drop_write_file() ... Browse Code »

new helper (wrapper around mnt_drop_write()) to be used in pair with
mnt_want_write_file().

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:52:40 +0800
8c9379e97 constify seq_file stuff ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:52:40 +0800
79e801a90 vfs: make do_kern_mount() static ... Browse Code »

the only user outside of fs/namespace.c has died

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:52:39 +0800
a5166169f vfs: convert fs_supers to hlist ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:52:39 +0800
5352d3b65 make nfs_follow_remote_path() handle ERR_PTR() passed as root_mnt ... Browse Code »

... rather than duplicating that in callers

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:52:39 +0800
5ffc2836a vfs: kill ->mnt_devname use in afs printks ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:52:38 +0800
e407699ef btrfs, nfs, apparmor: don't pull mnt_namespace.h for no reason... ... Browse Code »

it's not needed anymore; we used to, back when we had to do
mount_subtree() by hand, complete with put_mnt_ns() in it.
No more... Apparmor didn't need it since the __d_path() fix.

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:52:38 +0800
aa0a4cf0a vfs: dentry_reset_mounted() doesn't use vfsmount argument ... Browse Code »

lose it

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:52:37 +0800
6c449c8df unexport put_mnt_ns(), make create_mnt_ns() static outright ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:52:37 +0800
aafd08dad vfs: add missing parens in pnode.h macros ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:52:37 +0800
afac7cba7 vfs: more mnt_parent cleanups ... Browse Code »

a) mount --move is checking that ->mnt_parent is non-NULL before
looking if that parent happens to be shared; ->mnt_parent is never
NULL and it's not even an misspelled !mnt_has_parent()

b) pivot_root open-codes is_path_reachable(), poorly.

c) so does path_is_under(), while we are at it.

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:52:36 +0800
b2dba1af3 vfs: new internal helper: mnt_has_parent(mnt) ... Browse Code »

vfsmounts have ->mnt_parent pointing either to a different vfsmount
or to itself; it's never NULL and termination condition in loops
traversing the tree towards root is mnt == mnt->mnt_parent. At least
one place (see the next patch) is confused about what's going on;
let's add an explicit helper checking it right way and use it in
all places where we need it. Not that there had been too many,
but...

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:52:36 +0800
aa9c0e07b vfs: kill pointless helpers in namespace.c ... Browse Code »

mnt_{inc,dec}_count() is not cleaner than doing the corresponding
mnt_add_count() directly and mnt_set_count() is not used at all.

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:52:36 +0800
bad0dcffc new helpers: fh_{want,drop}_write() ... Browse Code »

A bunch of places in nfsd does mnt_{want,drop}_write on vfsmount of
export of given fhandle. Switched to obvious inlined helpers...

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:52:35 +0800
a561be710 switch a bunch of places to mnt_want_write_file() ... Browse Code »

it's both faster (in case when file has been opened for write) and cleaner.

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:52:35 +0800
f47ec3f28 trim fs/internal.h ... Browse Code »

some stuff in there can actually become static; some belongs to pnode.h
as it's a private interface between namespace.c and pnode.c...

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:52:35 +0800
5ede7b1cf pull manipulations of rpc_cred inside alloc_nfs_open_context() ... Browse Code »

No need to duplicate them in both callers; make it return
ERR_PTR(-ENOMEM) on allocation failure instead of NULL and
it'll be able to report rpc_lookup_cred() failures just
fine. Callers are much happier that way...

Signed-off-by: Al Viro

Al Viro
2012-01-04 11:52:34 +0800

31 Dec, 2011

1 commit

d65616a92 Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/sage/ceph-client ... Browse Code »

* 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/sage/ceph-client:
ceph: disable use of dcache for readdir etc.

Linus Torvalds
2011-12-31 05:34:22 +0800

30 Dec, 2011

3 commits

d2bac6ab9 Merge branch 'for-linus' of git://oss.sgi.com/xfs/xfs ... Browse Code »

* 'for-linus' of git://oss.sgi.com/xfs/xfs:
xfs: log all dirty inodes in xfs_fs_sync_fs
xfs: log the inode in ->write_inode calls for kupdate

Linus Torvalds
2011-12-30 09:05:45 +0800
34845636a procfs: do not confuse jiffies with cputime64_t ... Browse Code »

Commit 2a95ea6c0d129b4 ("procfs: do not overflow get_{idle,iowait}_time
for nohz") did not take into account that one some architectures jiffies
and cputime use different units.

This causes get_idle_time() to return numbers in the wrong units, making
the idle time fields in /proc/stat wrong.

Instead of converting the usec value returned by
get_cpu_{idle,iowait}_time_us to units of jiffies, use the new function
usecs_to_cputime64 to convert it to the correct unit of cputime64_t.

Signed-off-by: Andreas Schwab
Acked-by: Michal Hocko
Cc: Arnd Bergmann
Cc: "Artem S. Tashkinov"
Cc: Dave Jones
Cc: Alexey Dobriyan
Cc: Thomas Gleixner
Cc: "Luck, Tony"
Cc: Benjamin Herrenschmidt
Cc: Martin Schwidefsky
Cc: Heiko Carstens
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Andreas Schwab
2011-12-30 08:31:57 +0800
a4d46363c ceph: disable use of dcache for readdir etc. ... Browse Code »

Ceph attempts to use the dcache to satisfy negative lookups and readdir
when the entire directory contents are in cache. Disable this behavior
until lingering bugs in this code are shaken out; we'll re-enable these
hooks once things are fully stable.

Signed-off-by: Sage Weil

Sage Weil
2011-12-30 00:05:14 +0800

27 Dec, 2011

1 commit

6d4b9e38d vfs: fix handling of lock allocation failure in lease-break case ... Browse Code »

Bruce Fields notes that commit 778fc546f749 ("locks: fix tracking of
inprogress lease breaks") introduced a possible error pointer
dereference on failure to allocate memory. locks_conflict() will
dereference the passed-in new lease lock structure that may be an error pointer.

This means an open (without O_NONBLOCK set) on a file with a lease
applied (generally only done when Samba or nfsd (with v4) is running)
could crash if a kmalloc() fails.

So instead of playing games with IS_ERROR() all over the place, just
check the allocation failure early. That makes the code more
straightforward, and avoids this possible bad pointer dereference.

Based-on-patch-by: J. Bruce Fields
Cc: Al Viro
Signed-off-by: Linus Torvalds

Linus Torvalds
2011-12-27 02:25:26 +0800

24 Dec, 2011

4 commits

6d451c578 Merge tag 'writeback' of git://git.kernel.org/pub/scm/linux/kernel/git/wfg/linux ... Browse Code »

for linus: writeback reason binary tracing format fix

* tag 'writeback' of git://git.kernel.org/pub/scm/linux/kernel/git/wfg/linux:
writeback: show writeback reason with __print_symbolic

Linus Torvalds
2011-12-24 12:25:36 +0800
827fa4c76 Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/mason/linux-btrfs ... Browse Code »

* 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/mason/linux-btrfs:
Btrfs: call d_instantiate after all ops are setup
Btrfs: fix worker lock misuse in find_worker

Linus Torvalds
2011-12-24 06:58:39 +0800
be4f1ac82 xfs: log all dirty inodes in xfs_fs_sync_fs ... Browse Code »
1

Since Linux 2.6.36 the writeback code has introduces various measures for
live lock prevention during sync(). Unfortunately some of these are
actively harmful for the XFS model, where the inode gets marked dirty for
metadata from the data I/O handler.

The older_than_this checks that are now more strictly enforced since

writeback: avoid livelocking WB_SYNC_ALL writeback

by only calling into __writeback_inodes_sb and thus only sampling the
current cut off time once. But on a slow enough devices the previous
asynchronous sync pass might not have fully completed yet, and thus XFS
might mark metadata dirty only after that sampling of the cut off time for
the blocking pass already happened. I have not myself reproduced this
myself on a real system, but by introducing artificial delay into the
XFS I/O completion workqueues it can be reproduced easily.

Fix this by iterating over all XFS inodes in ->sync_fs and log all that
are dirty. This might log inode that only got redirtied after the
previous pass, but given how cheap delayed logging of inodes is it
isn't a major concern for performance.

Signed-off-by: Christoph Hellwig
Reviewed-by: Dave Chinner
Tested-by: Mark Tinguely
Reviewed-by: Mark Tinguely
Signed-off-by: Ben Myers

Christoph Hellwig
2011-12-24 06:41:47 +0800
0b8fd3033 xfs: log the inode in ->write_inode calls for kupdate ... Browse Code »
1

If the writeback code writes back an inode because it has expired we currently
use the non-blockin ->write_inode path. This means any inode that is pinned
is skipped. With delayed logging and a workload that has very little log
traffic otherwise it is very likely that an inode that gets constantly
written to is always pinned, and thus we keep refusing to write it. The VM
writeback code at that point redirties it and doesn't try to write it again
for another 30 seconds. This means under certain scenarious time based
metadata writeback never happens.

Fix this by calling into xfs_log_inode for kupdate in addition to data
integrity syncs, and thus transfer the inode to the log ASAP.

Signed-off-by: Christoph Hellwig
Reviewed-by: Dave Chinner
Tested-by: Mark Tinguely
Reviewed-by: Mark Tinguely
Signed-off-by: Ben Myers

Christoph Hellwig
2011-12-24 06:41:47 +0800

23 Dec, 2011

2 commits

08c422c27 Btrfs: call d_instantiate after all ops are setup ... Browse Code »

This closes races where btrfs is calling d_instantiate too soon during
inode creation. All of the callers of btrfs_add_nondir are updated to
instantiate after the inode is fully setup in memory.

Signed-off-by: Al Viro
Signed-off-by: Chris Mason

Al Viro
2011-12-23 21:02:26 +0800
8d532b2af Btrfs: fix worker lock misuse in find_worker ... Browse Code »

Dan Carpenter noticed that we were doing a double unlock on the worker
lock, and sometimes picking a worker thread without the lock held.

This fixes both errors.

Signed-off-by: Chris Mason
Reported-by: Dan Carpenter

Chris Mason
2011-12-23 20:53:00 +0800

21 Dec, 2011

3 commits

822a5d313 Merge branch 'bugfixes' of git://git.linux-nfs.org/projects/trondmy/linux-nfs ... Browse Code »

* 'bugfixes' of git://git.linux-nfs.org/projects/trondmy/linux-nfs:
NFS: Fix a regression in nfs_file_llseek()
NFSv4: Do not accept delegated opens when a delegation recall is in effect
NFSv4: Ensure correct locking when accessing the 'lock_states' list
NFSv4.1: Ensure that we handle _all_ SEQUENCE status bits.
NFSv4: Don't error if we handled it in nfs4_recovery_handle_error
SUNRPC: Ensure we always bump the backlog queue in xprt_free_slot
SUNRPC: Fix the execution time statistics in the face of RPC restarts

Linus Torvalds
2011-12-21 03:31:56 +0800
481fe17e9 nilfs2: potential integer overflow in nilfs_ioctl_clean_segments() ... Browse Code »
43

There is a potential integer overflow in nilfs_ioctl_clean_segments().
When a large argv[n].v_nmembs is passed from the userspace, the subsequent
call to vmalloc() will allocate a buffer smaller than expected, which
leads to out-of-bound access in nilfs_ioctl_move_blocks() and
lfs_clean_segments().

The following check does not prevent the overflow because nsegs is also
controlled by the userspace and could be very large.

if (argv[n].v_nmembs > nsegs * nilfs->ns_blocks_per_segment)
goto out_free;

This patch clamps argv[n].v_nmembs to UINT_MAX / argv[n].v_size, and
returns -EINVAL when overflow.

Signed-off-by: Haogang Chen
Signed-off-by: Ryusuke Konishi
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Haogang Chen
2011-12-21 02:25:04 +0800
695c60f21 nilfs2: unbreak compat ioctl ... Browse Code »
1

commit 828b1c50ae ("nilfs2: add compat ioctl") incidentally broke all
other NILFS compat ioctls. Make them work again.

Signed-off-by: Thomas Meyer
Signed-off-by: Ryusuke Konishi
Tested-by: Ryusuke Konishi
Cc: [3.0+]
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Thomas Meyer
2011-12-21 02:25:04 +0800

18 Dec, 2011

1 commit

b3bba872d writeback: show writeback reason with __print_symbolic ... Browse Code »

This makes the binary trace understandable by trace-cmd.

CC: Dave Chinner
CC: Curt Wohlgemuth
CC: Steven Rostedt
Signed-off-by: Wu Fengguang

Wu Fengguang
2011-12-18 14:20:17 +0800

17 Dec, 2011

1 commit

c9a7fe967 Merge branches 'for-linus' and 'for-linus-3.2' of git://git.kernel.org/pub/scm/l… ... Browse Code »

…inux/kernel/git/mason/linux-btrfs

* 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/mason/linux-btrfs:
Btrfs: unplug every once and a while
Btrfs: deal with NULL srv_rsv in the delalloc inode reservation code
Btrfs: only set cache_generation if we setup the block group
Btrfs: don't panic if orphan item already exists
Btrfs: fix leaked space in truncate
Btrfs: fix how we do delalloc reservations and how we free reservations on error
Btrfs: deal with enospc from dirtying inodes properly
Btrfs: fix num_workers_starting bug and other bugs in async thread
BTRFS: Establish i_ops before calling d_instantiate
Btrfs: add a cond_resched() into the worker loop
Btrfs: fix ctime update of on-disk inode
btrfs: keep orphans for subvolume deletion
Btrfs: fix inaccurate available space on raid0 profile
Btrfs: fix wrong disk space information of the files
Btrfs: fix wrong i_size when truncating a file to a larger size
Btrfs: fix btrfs_end_bio to deal with write errors to a single mirror

* 'for-linus-3.2' of git://git.kernel.org/pub/scm/linux/kernel/git/mason/linux-btrfs:
btrfs: lower the dirty balance poll interval

Linus Torvalds
2011-12-17 04:15:50 +0800