Eric Lee / smarc-fsl-linux-kernel

08 Apr, 2016

1 commit

93061f390 Merge tag 'ext4_for_linus_stable' of git://git.kernel.org/pub/scm/linux/kernel/git/tytso/ext4 ... Browse Code »

Pull ext4 bugfixes from Ted Ts'o:
"These changes contains a fix for overlayfs interacting with some
(badly behaved) dentry code in various file systems. These have been
reviewed by Al and the respective file system mtinainers and are going
through the ext4 tree for convenience.

This also has a few ext4 encryption bug fixes that were discovered in
Android testing (yes, we will need to get these sync'ed up with the
fs/crypto code; I'll take care of that). It also has some bug fixes
and a change to ignore the legacy quota options to allow for xfstests
regression testing of ext4's internal quota feature and to be more
consistent with how xfs handles this case"

* tag 'ext4_for_linus_stable' of git://git.kernel.org/pub/scm/linux/kernel/git/tytso/ext4:
ext4: ignore quota mount options if the quota feature is enabled
ext4 crypto: fix some error handling
ext4: avoid calling dquot_get_next_id() if quota is not enabled
ext4: retry block allocation for failed DIO and DAX writes
ext4: add lockdep annotations for i_data_sem
ext4: allow readdir()'s of large empty directories to be interrupted
btrfs: fix crash/invalid memory access on fsync when using overlayfs
ext4 crypto: use dget_parent() in ext4_d_revalidate()
ext4: use file_dentry()
ext4: use dget_parent() in ext4_file_open()
nfs: use file_dentry()
fs: add file_dentry()
ext4 crypto: don't let data integrity writebacks fail with ENOMEM
ext4: check if in-inode xattr is corrupted in ext4_expand_extra_isize_ea()

Linus Torvalds
2016-04-08 08:22:20 +0800

27 Mar, 2016

1 commit

be62a1a8f nfs: use file_dentry() ... Browse Code »

NFS may be used as lower layer of overlayfs and accessing f_path.dentry can
lead to a crash.

Fix by replacing direct access of file->f_path.dentry with the
file_dentry() accessor, which will always return a native object.

Fixes: 4bacc9c9234c ("overlayfs: Make f_path always point to the overlay and f_inode to the underlay")
Signed-off-by: Miklos Szeredi
Tested-by: Goldwyn Rodrigues
Acked-by: Trond Myklebust
Signed-off-by: Theodore Ts'o
Cc: # v4.2
Cc: David Howells
Cc: Al Viro

Miklos Szeredi
2016-03-27 04:14:39 +0800

17 Mar, 2016

1 commit

4ff79bc70 nfs: remove nfs4_file_fsync ... Browse Code »

The only difference to nfs_file_fsync is the call to pnfs_sync_inode. But
pnfs_sync_inode is just an inline that calls a pNFS layout driver method
if CONFIG_PNFS is designed, and thus can be called just fine from the core
NFS module.

Signed-off-by: Christoph Hellwig
Signed-off-by: Trond Myklebust

Christoph Hellwig
2016-03-17 03:42:43 +0800

23 Jan, 2016

1 commit

5955102c9 wrappers for ->i_mutex access ... Browse Code »

parallel to mutex_{lock,unlock,trylock,is_locked,lock_nested},
inode_foo(inode) being mutex_foo(&inode->i_mutex).

Please, use those for access to ->i_mutex; over the coming cycle
->i_mutex will become rwsem, with ->lookup() done with it held
only shared.

Signed-off-by: Al Viro

Al Viro
2016-01-23 07:04:28 +0800

08 Dec, 2015

1 commit

04b38d601 vfs: pull btrfs clone API to vfs layer ... Browse Code »

The btrfs clone ioctls are now adopted by other file systems, with NFS
and CIFS already having support for them, and XFS being under active
development. To avoid growth of various slightly incompatible
implementations, add one to the VFS. Note that clones are different from
file copies in several ways:

- they are atomic vs other writers
- they support whole file clones
- they support 64-bit legth clones
- they do not allow partial success (aka short writes)
- clones are expected to be a fast metadata operation

Because of that it would be rather cumbersome to try to piggyback them on
top of the recent clone_file_range infrastructure. The converse isn't
true and the clone_file_range system call could try clone file range as
a first attempt to copy, something that further patches will enable.

Based on earlier work from Peng Tao.

Signed-off-by: Christoph Hellwig
Signed-off-by: Al Viro

Christoph Hellwig
2015-12-08 12:11:33 +0800

24 Nov, 2015

4 commits

6b7153da2 nfs: reduce the amount of ifdefs for v4.2 in nfs4file.c ... Browse Code »

Signed-off-by: Christoph Hellwig
Signed-off-by: Trond Myklebust

Christoph Hellwig
2015-11-24 10:53:14 +0800
0f42a6a9b nfs: use btrfs ioctl defintions for clone ... Browse Code »

The NFS CLONE_RANGE defintion was wrong and thus never worked. Fix this
by simply using the btrfs ioctl defintion.

Signed-off-by: Christoph Hellwig
Signed-off-by: Trond Myklebust

Christoph Hellwig
2015-11-24 10:53:08 +0800
21fad313d nfs: allow intra-file CLONE ... Browse Code »

Originally CLONE didn't allow for intra-file clones, but we recently
updated the spec to support this feature which is also supported by
local Linux file systems.

Signed-off-by: Christoph Hellwig
Signed-off-by: Trond Myklebust

Christoph Hellwig
2015-11-24 10:52:51 +0800
3a2e17690 nfs: offer native ioctls even if CONFIG_COMPAT is set ... Browse Code »

Without this for example 64-bit binaries on typical amd64 distributions
would not be able to use ioctls on NFS. For now this only affects clones.
Additionally ->compat_ioctl is defined even for non-compat builds, so
get rid of the pointless ifdef.

Signed-off-by: Christoph Hellwig
Signed-off-by: Trond Myklebust

Christoph Hellwig
2015-11-24 10:52:28 +0800

16 Oct, 2015

4 commits

275058a21 NFS: Fix an 'unused variable' complaint when #ifndef CONFIG_NFS_V4_2 ... Browse Code »

Signed-off-by: Trond Myklebust

Trond Myklebust
2015-10-16 04:20:11 +0800
a340abcf4 nfs42: add NFS_IOC_CLONE_RANGE ioctl ... Browse Code »

It follows btrfs BTRFS_IOC_CLONE_RANGE lead on ioctl number and
arguments.

Signed-off-by: Peng Tao
Signed-off-by: Trond Myklebust

Peng Tao
2015-10-16 04:08:33 +0800
811b7b85d nfs42: respect clone_blksize ... Browse Code »

draft-ietf-nfsv4-minorversion2-38.txt says:
Both cl_src_offset and
cl_dst_offset must be aligned to the clone block size Section 12.2.1.
The number of bytes to be cloned must be a multiple of the clone
block size, except in the case in which cl_src_offset plus the number
of bytes to be cloned is equal to the source file size.

Signed-off-by: Peng Tao
Signed-off-by: Trond Myklebust

Peng Tao
2015-10-16 04:08:28 +0800
bea51b30b nfs42: add NFS_IOC_CLONE ioctl ... Browse Code »

It can be called by user space to CLONE two files.
Follow btrfs lead and define NFS_IOC_CLONE same as BTRFS_IOC_CLONE.
Thus we don't mess up userspace with too many ioctls.

Signed-off-by: Peng Tao
Signed-off-by: Trond Myklebust

Peng Tao
2015-10-16 04:08:04 +0800

08 Sep, 2015

1 commit

5445b1fbd NFSv4: Respect the server imposed limit on how many changes we may cache ... Browse Code »

The NFSv4 delegation spec allows the server to tell a client to limit how
much data it cache after the file is closed. In return, the server
guarantees enough free space to avoid ENOSPC situations, etc.
Prior to this patch, we assumed we could always cache aggressively after
close. Unfortunately, this causes problems with servers that set the
limit to 0 and therefore do not offer any ENOSPC guarantees.

Signed-off-by: Trond Myklebust

Trond Myklebust
2015-09-08 00:36:17 +0800

28 Aug, 2015

1 commit

c5c3fb5f9 NFS: Make opened as optional argument in _nfs4_do_open ... Browse Code »

Check opened, only update it when non-NULL.
It's not needs define an unused value for the opened
when calling _nfs4_do_open.

v3, same as v2.

Signed-off-by: Kinglong Mee
Signed-off-by: Trond Myklebust

Kinglong Mee
2015-08-28 07:44:38 +0800

26 Jun, 2015

1 commit

18a600897 nfs: verify open flags before allowing open ... Browse Code »

Commit 9597c13b forbade opens with O_APPEND|O_DIRECT for NFSv4:

nfs: verify open flags before allowing an atomic open

Currently, you can open a NFSv4 file with O_APPEND|O_DIRECT, but cannot
fcntl(F_SETFL,...) with those flags. This flag combination is explicitly
forbidden on NFSv3 opens, and it seems like it should also be on NFSv4.

However, you can still open a file with O_DIRECT|O_APPEND if there exists a
cached dentry for the file because nfs4_file_open() is used instead of
nfs_atomic_open() and the check is bypassed. Add the check in
nfs4_file_open() as well.

Signed-off-by: Benjamin Coddington
Signed-off-by: Trond Myklebust

Benjamin Coddington
2015-06-26 07:38:00 +0800

27 Apr, 2015

1 commit

59953fba8 Merge tag 'nfs-for-4.1-1' of git://git.linux-nfs.org/projects/trondmy/linux-nfs ... Browse Code »

Pull NFS client updates from Trond Myklebust:
"Another set of mainly bugfixes and a couple of cleanups. No new
functionality in this round.

Highlights include:

Stable patches:
- Fix a regression in /proc/self/mountstats
- Fix the pNFS flexfiles O_DIRECT support
- Fix high load average due to callback thread sleeping

Bugfixes:
- Various patches to fix the pNFS layoutcommit support
- Do not cache pNFS deviceids unless server notifications are enabled
- Fix a SUNRPC transport reconnection regression
- make debugfs file creation failure non-fatal in SUNRPC
- Another fix for circular directory warnings on NFSv4 "junctioned"
mountpoints
- Fix locking around NFSv4.2 fallocate() support
- Truncating NFSv4 file opens should also sync O_DIRECT writes
- Prevent infinite loop in rpcrdma_ep_create()

Features:
- Various improvements to the RDMA transport code's handling of
memory registration
- Various code cleanups"

* tag 'nfs-for-4.1-1' of git://git.linux-nfs.org/projects/trondmy/linux-nfs: (55 commits)
fs/nfs: fix new compiler warning about boolean in switch
nfs: Remove unneeded casts in nfs
NFS: Don't attempt to decode missing directory entries
Revert "nfs: replace nfs_add_stats with nfs_inc_stats when add one"
NFS: Rename idmap.c to nfs4idmap.c
NFS: Move nfs_idmap.h into fs/nfs/
NFS: Remove CONFIG_NFS_V4 checks from nfs_idmap.h
NFS: Add a stub for GETDEVICELIST
nfs: remove WARN_ON_ONCE from nfs_direct_good_bytes
nfs: fix DIO good bytes calculation
nfs: Fetch MOUNTED_ON_FILEID when updating an inode
sunrpc: make debugfs file creation failure non-fatal
nfs: fix high load average due to callback thread sleeping
NFS: Reduce time spent holding the i_mutex during fallocate()
NFS: Don't zap caches on fallocate()
xprtrdma: Make rpcrdma_{un}map_one() into inline functions
xprtrdma: Handle non-SEND completions via a callout
xprtrdma: Add "open" memreg op
xprtrdma: Add "destroy MRs" memreg op
xprtrdma: Add "reset MRs" memreg op
...

Linus Torvalds
2015-04-27 08:33:59 +0800

24 Apr, 2015

2 commits

f830f7ddd NFS: Reduce time spent holding the i_mutex during fallocate() ... Browse Code »

At the very least, we should not be taking the i_mutex until after
checking if the server even supports ALLOCATE or DEALLOCATE, allowing
v4.0 or v4.1 to exit without potentially waiting on a lock.

Signed-off-by: Anna Schumaker
Signed-off-by: Trond Myklebust

Anna Schumaker
2015-04-24 02:36:28 +0800
9a51940bf NFS: Don't zap caches on fallocate() ... Browse Code »

This patch adds a GETATTR to the end of ALLOCATE and DEALLOCATE
operations so we can set the updated inode size and change attribute
directly. DEALLOCATE will still need to release pagecache pages, so
nfs42_proc_deallocate() now calls truncate_pagecache_range() before
contacting the server.

Signed-off-by: Anna Schumaker
Signed-off-by: Trond Myklebust

Anna Schumaker
2015-04-24 02:36:28 +0800

16 Apr, 2015

1 commit

2b0143b5c VFS: normal filesystems (and lustre): d_inode() annotations ... Browse Code »

that's the bulk of filesystem drivers dealing with inodes of their own

Signed-off-by: David Howells
Signed-off-by: Al Viro

David Howells
2015-04-16 03:06:57 +0800

12 Apr, 2015

1 commit

5d5d56897 make new_sync_{read,write}() static ... Browse Code »

All places outside of core VFS that checked ->read and ->write for being NULL or
called the methods directly are gone now, so NULL {read,write} with non-NULL
{read,write}_iter will do the right thing in all cases.

Signed-off-by: Al Viro

Al Viro
2015-04-12 10:29:40 +0800

28 Mar, 2015

4 commits

5bb89b470 NFSv4.1/pnfs: Separate out metadata and data consistency for pNFS ... Browse Code »

The LAYOUTCOMMIT operation means different things to different layout types.
For blocks and objects, it is both a data and metadata consistency operation.
For files and flexfiles, it is only a metadata consistency operation.

This patch separates out the 2 cases, allowing the files/flexfiles layout
drivers to optimise away the data consistency calls to layoutcommit.

Signed-off-by: Trond Myklebust

Trond Myklebust
2015-03-28 00:39:38 +0800
a0815d556 NFSv4.1/pnfs: Ensure that writes respect the O_SYNC flag when doing O_DIRECT ... Browse Code »

If the caller does not specify the O_SYNC flag, then it is legitimate
to return from O_DIRECT without doing a pNFS layoutcommit operation.
However if the file is opened O_DIRECT|O_SYNC then we'd better get it
right.

Signed-off-by: Trond Myklebust

Trond Myklebust
2015-03-28 00:39:37 +0800
9e1681c2e NFSv4: Truncating file opens should also sync O_DIRECT writes ... Browse Code »

We don't just want to sync out buffered writes, but also O_DIRECT ones.

Signed-off-by: Trond Myklebust

Trond Myklebust
2015-03-28 00:39:37 +0800
81b79afb5 NFSv4: Allow tracing of NFSv4 fsync calls ... Browse Code »

I appear to have missed this when adding the ftrace probes.

Signed-off-by: Trond Myklebust

Trond Myklebust
2015-03-28 00:39:34 +0800

26 Nov, 2014

2 commits

624bd5b7b nfs: Add DEALLOCATE support ... Browse Code »

This patch adds support for using the NFS v4.2 operation DEALLOCATE to
punch holes in a file.

Signed-off-by: Anna Schumaker
Signed-off-by: Trond Myklebust

Anna Schumaker
2014-11-26 05:38:32 +0800
f4ac1674f nfs: Add ALLOCATE support ... Browse Code »

This patch adds support for using the NFS v4.2 operation ALLOCATE to
preallocate data in a file.

Signed-off-by: Anna Schumaker
Signed-off-by: Trond Myklebust

Anna Schumaker
2014-11-26 05:38:32 +0800

19 Oct, 2014

1 commit

ead13aee2 Merge tag 'nfs-for-3.18-2' of git://git.linux-nfs.org/projects/trondmy/linux-nfs ... Browse Code »

Pull NFS client updates from Trond Myklebust:
"Highlights include:

Stable fixes:
- fix an uninitialised pointer Oops in the writeback error path
- fix a bogus warning (and early exit from the loop) in nfs_generic_pgio()

Features:
- Add NFSv4.2 SEEK feature and client support for lseek(SEEK_HOLE/SEEK_DATA)

Other fixes:
- pnfs: replace broken pnfs_put_lseg_async
- Remove dead prototype for nfs4_insert_deviceid_node"

* tag 'nfs-for-3.18-2' of git://git.linux-nfs.org/projects/trondmy/linux-nfs:
NFS: Fix a bogus warning in nfs_generic_pgio
NFS: Fix an uninitialised pointer Oops in the writeback error path
NFSv4.1/pnfs: replace broken pnfs_put_lseg_async
NFSv4: Remove dead prototype for nfs4_insert_deviceid_node()
NFS: Implement SEEK

Linus Torvalds
2014-10-19 03:52:08 +0800

01 Oct, 2014

1 commit

1c6dcbe5c NFS: Implement SEEK ... Browse Code »

The SEEK operation is used when an application makes an lseek call with
either the SEEK_HOLE or SEEK_DATA flags set. I fall back on
nfs_file_llseek() if the server does not have SEEK support.

Signed-off-by: Anna Schumaker
Signed-off-by: Trond Myklebust

Anna Schumaker
2014-10-01 04:24:56 +0800

10 Sep, 2014

1 commit

1c994a090 locks: consolidate "nolease" routines ... Browse Code »

GFS2 and NFS have setlease routines that always just return -EINVAL.
Turn that into a generic routine that can live in fs/libfs.c.

Cc:
Cc: Steven Whitehouse
Cc:
Signed-off-by: Jeff Layton
Acked-by: Trond Myklebust
Reviewed-by: Christoph Hellwig

Jeff Layton
2014-09-10 04:01:36 +0800

13 Jun, 2014

1 commit

16b905780 Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs ... Browse Code »

Pull vfs updates from Al Viro:
"This the bunch that sat in -next + lock_parent() fix. This is the
minimal set; there's more pending stuff.

In particular, I really hope to get acct.c fixes merged this cycle -
we need that to deal sanely with delayed-mntput stuff. In the next
pile, hopefully - that series is fairly short and localized
(kernel/acct.c, fs/super.c and fs/namespace.c). In this pile: more
iov_iter work. Most of prereqs for ->splice_write with sane locking
order are there and Kent's dio rewrite would also fit nicely on top of
this pile"

* 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs: (70 commits)
lock_parent: don't step on stale ->d_parent of all-but-freed one
kill generic_file_splice_write()
ceph: switch to iter_file_splice_write()
shmem: switch to iter_file_splice_write()
nfs: switch to iter_splice_write_file()
fs/splice.c: remove unneeded exports
ocfs2: switch to iter_file_splice_write()
->splice_write() via ->write_iter()
bio_vec-backed iov_iter
optimize copy_page_{to,from}_iter()
bury generic_file_aio_{read,write}
lustre: get rid of messing with iovecs
ceph: switch to ->write_iter()
ceph_sync_direct_write: stop poking into iov_iter guts
ceph_sync_read: stop poking into iov_iter guts
new helper: copy_page_from_iter()
fuse: switch to ->write_iter()
btrfs: switch to ->write_iter()
ocfs2: switch to ->write_iter()
xfs: switch to ->write_iter()
...

Linus Torvalds
2014-06-13 01:30:18 +0800

12 Jun, 2014

1 commit

4da54c218 nfs: switch to iter_splice_write_file() ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2014-06-12 12:21:11 +0800

29 May, 2014

1 commit

1b33809ea nfs: commit layouts in fdatasync ... Browse Code »

"fdatasync() is similar to fsync(), but does not flush modified metadata
unless that metadata is needed in order to allow a subsequent data
retrieval to be correctly handled."

We absolutely need to commit the layouts to be able to retrieve the data
in case either the client, the server or the storage subsystem go down.

Signed-off-by: Christoph Hellwig
Signed-off-by: Trond Myklebust

Christoph Hellwig
2014-05-29 05:45:30 +0800

07 May, 2014

2 commits

edaf43694 nfs: switch to ->write_iter() ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2014-05-07 05:39:38 +0800
3aa2d199f nfs: switch to ->read_iter() ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2014-05-07 05:37:59 +0800

13 Nov, 2013

1 commit

9bc9ccd7d Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs ... Browse Code »

Pull vfs updates from Al Viro:
"All kinds of stuff this time around; some more notable parts:

- RCU'd vfsmounts handling
- new primitives for coredump handling
- files_lock is gone
- Bruce's delegations handling series
- exportfs fixes

plus misc stuff all over the place"

* 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs: (101 commits)
ecryptfs: ->f_op is never NULL
locks: break delegations on any attribute modification
locks: break delegations on link
locks: break delegations on rename
locks: helper functions for delegation breaking
locks: break delegations on unlink
namei: minor vfs_unlink cleanup
locks: implement delegations
locks: introduce new FL_DELEG lock flag
vfs: take i_mutex on renamed file
vfs: rename I_MUTEX_QUOTA now that it's not used for quotas
vfs: don't use PARENT/CHILD lock classes for non-directories
vfs: pull ext4's double-i_mutex-locking into common code
exportfs: fix quadratic behavior in filehandle lookup
exportfs: better variable name
exportfs: move most of reconnect_path to helper function
exportfs: eliminate unused "noprogress" counter
exportfs: stop retrying once we race with rename/remove
exportfs: clear DISCONNECTED on all parents sooner
exportfs: more detailed comment for path_reconnect
...

Linus Torvalds
2013-11-13 14:34:18 +0800

29 Oct, 2013

1 commit

c698dbf9f Merge branch 'fscache' of git://git.kernel.org/pub/scm/linux/kernel/git/dhowells… ... Browse Code »

…/linux-fs into linux-next

Pull fs-cache fixes from David Howells:

Can you pull these commits to fix an issue with NFS whereby caching can be
enabled on a file that is open for writing by subsequently opening it for
reading. This can be made to crash by opening it for writing again if you're
quick enough.

The gist of the patchset is that the cookie should be acquired at inode
creation only and subsequently enabled and disabled as appropriate (which
dispenses with the backing objects when they're not needed).

The extra synchronisation that NFS does can then be dispensed with as it is
thenceforth managed by FS-Cache.

Could you send these on to Linus?

This likely will need fixing also in CIFS and 9P also once the FS-Cache
changes are upstream. AFS and Ceph are probably safe.

* 'fscache' of git://git.kernel.org/pub/scm/linux/kernel/git/dhowells/linux-fs:
NFS: Use i_writecount to control whether to get an fscache cookie in nfs_open()
FS-Cache: Provide the ability to enable/disable cookies
FS-Cache: Add use/unuse/wake cookie wrappers

Trond Myklebust
2013-10-29 07:36:46 +0800

25 Oct, 2013

1 commit

6de1472f1 nfs: use %p[dD] instead of open-coded (and often racy) equivalents ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2013-10-25 11:34:50 +0800

28 Sep, 2013

1 commit

f1fe29b4a NFS: Use i_writecount to control whether to get an fscache cookie in nfs_open() ... Browse Code »

Use i_writecount to control whether to get an fscache cookie in nfs_open() as
NFS does not do write caching yet. I *think* this is the cause of a problem
encountered by Mark Moseley whereby __fscache_uncache_page() gets a NULL
pointer dereference because cookie->def is NULL:

BUG: unable to handle kernel NULL pointer dereference at 0000000000000010
IP: [] __fscache_uncache_page+0x23/0x160
PGD 0
Thread overran stack, or stack corrupted
Oops: 0000 [#1] SMP
Modules linked in: ...
CPU: 7 PID: 18993 Comm: php Not tainted 3.11.1 #1
Hardware name: Dell Inc. PowerEdge R420/072XWF, BIOS 1.3.5 08/21/2012
task: ffff8804203460c0 ti: ffff880420346640
RIP: 0010:[] __fscache_uncache_page+0x23/0x160
RSP: 0018:ffff8801053af878 EFLAGS: 00210286
RAX: 0000000000000000 RBX: ffff8800be2f8780 RCX: ffff88022ffae5e8
RDX: 0000000000004c66 RSI: ffffea00055ff440 RDI: ffff8800be2f8780
RBP: ffff8801053af898 R08: 0000000000000001 R09: 0000000000000003
R10: 0000000000000000 R11: 0000000000000000 R12: ffffea00055ff440
R13: 0000000000001000 R14: ffff8800c50be538 R15: 0000000000000000
FS: 0000000000000000(0000) GS:ffff88042fc60000(0063) knlGS:00000000e439c700
CS: 0010 DS: 002b ES: 002b CR0: 0000000080050033
CR2: 0000000000000010 CR3: 0000000001d8f000 CR4: 00000000000607f0
Stack:
...
Call Trace:
[] __nfs_fscache_invalidate_page+0x42/0x70
[] nfs_invalidate_page+0x75/0x90
[] truncate_inode_page+0x8e/0x90
[] truncate_inode_pages_range.part.12+0x14d/0x620
[] ? __mutex_lock_slowpath+0x1fd/0x2e0
[] truncate_inode_pages_range+0x53/0x70
[] truncate_inode_pages+0x2d/0x40
[] truncate_pagecache+0x4f/0x70
[] nfs_setattr_update_inode+0xa0/0x120
[] nfs3_proc_setattr+0xc4/0xe0
[] nfs_setattr+0xc8/0x150
[] notify_change+0x1cb/0x390
[] do_truncate+0x7b/0xc0
[] do_last+0xa4c/0xfd0
[] path_openat+0xcc/0x670
[] do_filp_open+0x4e/0xb0
[] do_sys_open+0x13f/0x2b0
[] compat_SyS_open+0x36/0x50
[] sysenter_dispatch+0x7/0x24

The code at the instruction pointer was disassembled:

> (gdb) disas __fscache_uncache_page
> Dump of assembler code for function __fscache_uncache_page:
> ...
> 0xffffffff812a18ff : mov 0x48(%rbx),%rax
> 0xffffffff812a1903 : cmpb $0x0,0x10(%rax)
> 0xffffffff812a1907 : je 0xffffffff812a19cd

These instructions make up:

ASSERTCMP(cookie->def->type, !=, FSCACHE_COOKIE_TYPE_INDEX);

That cmpb is the faulting instruction (%rax is 0). So cookie->def is NULL -
which presumably means that the cookie has already been at least partway
through __fscache_relinquish_cookie().

What I think may be happening is something like a three-way race on the same
file:

PROCESS 1 PROCESS 2 PROCESS 3
=============== =============== ===============
open(O_TRUNC|O_WRONLY)
open(O_RDONLY)
open(O_WRONLY)
-->nfs_open()
-->nfs_fscache_set_inode_cookie()
nfs_fscache_inode_lock()
nfs_fscache_disable_inode_cookie()
__fscache_relinquish_cookie()
nfs_inode->fscache = NULL
nfs_open()
-->nfs_fscache_set_inode_cookie()
nfs_fscache_inode_lock()
nfs_fscache_enable_inode_cookie()
__fscache_acquire_cookie()
nfs_inode->fscache = cookie
nfs_setattr()
...
...
-->nfs_invalidate_page()
-->__nfs_fscache_invalidate_page()
cookie = nfsi->fscache
-->nfs_open()
-->nfs_fscache_set_inode_cookie()
nfs_fscache_inode_lock()
nfs_fscache_disable_inode_cookie()
-->__fscache_relinquish_cookie()
-->__fscache_uncache_page(cookie)

fscache = NULL

Signed-off-by: David Howells

David Howells
2013-09-28 01:40:25 +0800

26 Sep, 2013

1 commit

5bc2afc2b NFSv4: Honour the 'opened' parameter in the atomic_open() filesystem method ... Browse Code »

Determine if we've created a new file by examining the directory change
attribute and/or the O_EXCL flag.

This fixes a regression when doing a non-exclusive create of a new file.
If the FILE_CREATED flag is not set, the atomic_open() command will
perform full file access permissions checks instead of just checking
for MAY_OPEN.

Signed-off-by: Trond Myklebust

Trond Myklebust
2013-09-26 22:20:18 +0800