Eric Lee / smarc-ti-linux-kernel | Embedian Git Server

13 Apr, 2014

1 commit

5166701b3 Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs ... Browse Code »

Pull vfs updates from Al Viro:
"The first vfs pile, with deep apologies for being very late in this
window.

Assorted cleanups and fixes, plus a large preparatory part of iov_iter
work. There's a lot more of that, but it'll probably go into the next
merge window - it *does* shape up nicely, removes a lot of
boilerplate, gets rid of locking inconsistencie between aio_write and
splice_write and I hope to get Kent's direct-io rewrite merged into
the same queue, but some of the stuff after this point is having
(mostly trivial) conflicts with the things already merged into
mainline and with some I want more testing.

This one passes LTP and xfstests without regressions, in addition to
usual beating. BTW, readahead02 in ltp syscalls testsuite has started
giving failures since "mm/readahead.c: fix readahead failure for
memoryless NUMA nodes and limit readahead pages" - might be a false
positive, might be a real regression..."

* 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs: (63 commits)
missing bits of "splice: fix racy pipe->buffers uses"
cifs: fix the race in cifs_writev()
ceph_sync_{,direct_}write: fix an oops on ceph_osdc_new_request() failure
kill generic_file_buffered_write()
ocfs2_file_aio_write(): switch to generic_perform_write()
ceph_aio_write(): switch to generic_perform_write()
xfs_file_buffered_aio_write(): switch to generic_perform_write()
export generic_perform_write(), start getting rid of generic_file_buffer_write()
generic_file_direct_write(): get rid of ppos argument
btrfs_file_aio_write(): get rid of ppos
kill the 5th argument of generic_file_buffered_write()
kill the 4th argument of __generic_file_aio_write()
lustre: don't open-code kernel_recvmsg()
ocfs2: don't open-code kernel_recvmsg()
drbd: don't open-code kernel_recvmsg()
constify blk_rq_map_user_iov() and friends
lustre: switch to kernel_sendmsg()
ocfs2: don't open-code kernel_sendmsg()
take iov_iter stuff to mm/iov_iter.c
process_vm_access: tidy up a bit
...

Linus Torvalds
2014-04-13 05:49:50 +0800

05 Apr, 2014

1 commit

d15e03104 Merge tag 'xfs-for-linus-3.15-rc1' of git://oss.sgi.com/xfs/xfs ... Browse Code »

Pull xfs update from Dave Chinner:
"There are a couple of new fallocate features in this request - it was
decided that it was easiest to push them through the XFS tree using
topic branches and have the ext4 support be based on those branches.
Hence you may see some overlap with the ext4 tree merge depending on
how they including those topic branches into their tree. Other than
that, there is O_TMPFILE support, some cleanups and bug fixes.

The main changes in the XFS tree for 3.15-rc1 are:

- O_TMPFILE support
- allowing AIO+DIO writes beyond EOF
- FALLOC_FL_COLLAPSE_RANGE support for fallocate syscall and XFS
implementation
- FALLOC_FL_ZERO_RANGE support for fallocate syscall and XFS
implementation
- IO verifier cleanup and rework
- stack usage reduction changes
- vm_map_ram NOIO context fixes to remove lockdep warings
- various bug fixes and cleanups"

* tag 'xfs-for-linus-3.15-rc1' of git://oss.sgi.com/xfs/xfs: (34 commits)
xfs: fix directory hash ordering bug
xfs: extra semi-colon breaks a condition
xfs: Add support for FALLOC_FL_ZERO_RANGE
fs: Introduce FALLOC_FL_ZERO_RANGE flag for fallocate
xfs: inode log reservations are still too small
xfs: xfs_check_page_type buffer checks need help
xfs: avoid AGI/AGF deadlock scenario for inode chunk allocation
xfs: use NOIO contexts for vm_map_ram
xfs: don't leak EFSBADCRC to userspace
xfs: fix directory inode iolock lockdep false positive
xfs: allocate xfs_da_args to reduce stack footprint
xfs: always do log forces via the workqueue
xfs: modify verifiers to differentiate CRC from other errors
xfs: print useful caller information in xfs_error_report
xfs: add xfs_verifier_error()
xfs: add helper for updating checksums on xfs_bufs
xfs: add helper for verifying checksums on xfs_bufs
xfs: Use defines for CRC offsets in all cases
xfs: skip pointless CRC updates after verifier failures
xfs: Add support FALLOC_FL_COLLAPSE_RANGE for fallocate
...

Linus Torvalds
2014-04-05 06:50:08 +0800

02 Apr, 2014

5 commits

3f4d5a000 tidy do_dentry_open() up a bit ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2014-04-02 11:19:13 +0800
83f936c75 mark struct file that had write access grabbed by open() ... Browse Code »

new flag in ->f_mode - FMODE_WRITER. Set by do_dentry_open() in case
when it has grabbed write access, checked by __fput() to decide whether
it wants to drop the sucker. Allows to stop bothering with mnt_clone_write()
in alloc_file(), along with fewer special_file() checks.

Signed-off-by: Al Viro

Al Viro
2014-04-02 11:19:12 +0800
0ccb28634 fold __get_file_write_access() into its only caller ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2014-04-02 11:19:12 +0800
4597e695b get rid of DEBUG_WRITECOUNT ... Browse Code »

it only makes control flow in __fput() and friends more convoluted.

Signed-off-by: Al Viro

Al Viro
2014-04-02 11:19:12 +0800
dd20908a8 don't bother with {get,put}_write_access() on non-regular files ... Browse Code »
5

it's pointless and actually leads to wrong behaviour in at least one
moderately convoluted case (pipe(), close one end, try to get to
another via /proc/*/fd and run into ETXTBUSY).

Cc: stable@vger.kernel.org
Signed-off-by: Al Viro

Al Viro
2014-04-02 11:19:11 +0800

13 Mar, 2014

1 commit

409332b65 fs: Introduce FALLOC_FL_ZERO_RANGE flag for fallocate ... Browse Code »

Introduce new FALLOC_FL_ZERO_RANGE flag for fallocate. This has the same
functionality as xfs ioctl XFS_IOC_ZERO_RANGE.

It can be used to convert a range of file to zeros preferably without
issuing data IO. Blocks should be preallocated for the regions that span
holes in the file, and the entire range is preferable converted to
unwritten extents - even though file system may choose to zero out the
extent or do whatever which will result in reading zeros from the range
while the range remains allocated for the file.

This can be also used to preallocate blocks past EOF in the same way as
with fallocate. Flag FALLOC_FL_KEEP_SIZE which should cause the inode
size to remain the same.

Signed-off-by: Lukas Czerner
Reviewed-by: Dave Chinner
Signed-off-by: Dave Chinner

Lukas Czerner
2014-03-13 16:07:42 +0800

10 Mar, 2014

1 commit

9c225f265 vfs: atomic f_pos accesses as per POSIX ... Browse Code »

Our write() system call has always been atomic in the sense that you get
the expected thread-safe contiguous write, but we haven't actually
guaranteed that concurrent writes are serialized wrt f_pos accesses, so
threads (or processes) that share a file descriptor and use "write()"
concurrently would quite likely overwrite each others data.

This violates POSIX.1-2008/SUSv4 Section XSI 2.9.7 that says:

"2.9.7 Thread Interactions with Regular File Operations

All of the following functions shall be atomic with respect to each
other in the effects specified in POSIX.1-2008 when they operate on
regular files or symbolic links: [...]"

and one of the effects is the file position update.

This unprotected file position behavior is not new behavior, and nobody
has ever cared. Until now. Yongzhi Pan reported unexpected behavior to
Michael Kerrisk that was due to this.

This resolves the issue with a f_pos-specific lock that is taken by
read/write/lseek on file descriptors that may be shared across threads
or processes.

Reported-by: Yongzhi Pan
Reported-by: Michael Kerrisk
Cc: Al Viro
Signed-off-by: Linus Torvalds
Signed-off-by: Al Viro

Linus Torvalds
2014-03-10 23:44:41 +0800

24 Feb, 2014

1 commit

00f5e6199 fs: Add new flag(FALLOC_FL_COLLAPSE_RANGE) for fallocate ... Browse Code »

This patch is in response of the following post:
http://lwn.net/Articles/556136/
"ext4: introduce two new ioctls"

Dave chinner suggested that truncate_block_range
(which was one of the ioctls name) should be a fallocate operation
and not any fs specific ioctl, hence we add this functionality to new flags of fallocate.

This new functionality of collapsing range could be used by media editing tools
which does non linear editing to quickly purge and edit parts of a media file.
This will immensely improve the performance of these operations.
The limitation of fs block size aligned offsets can be easily handled
by media codecs which are encapsulated in a conatiner as they have to
just change the offset to next keyframe value to match the proper alignment.

Signed-off-by: Namjae Jeon
Signed-off-by: Ashish Sangwan
Reviewed-by: Dave Chinner
Signed-off-by: Dave Chinner

Namjae Jeon
2014-02-24 07:58:15 +0800

09 Nov, 2013

2 commits

27ac0ffea locks: break delegations on any attribute modification ... Browse Code »
30

NFSv4 uses leases to guarantee that clients can cache metadata as well
as data.

Cc: Mikulas Patocka
Cc: David Howells
Cc: Tyler Hicks
Cc: Dustin Kirkland
Acked-by: Jeff Layton
Signed-off-by: J. Bruce Fields
Signed-off-by: Al Viro

J. Bruce Fields
2013-11-09 13:16:44 +0800
eee5cc270 get rid of s_files and files_lock ... Browse Code »

The only thing we need it for is alt-sysrq-r (emergency remount r/o)
and these days we can do just as well without going through the
list of files.

Signed-off-by: Al Viro

Al Viro
2013-11-09 13:16:20 +0800

25 Oct, 2013

1 commit

72c2d5319 file->f_op is never NULL... ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2013-10-25 11:34:54 +0800

17 Sep, 2013

1 commit

0854d450e vfs: improve i_op->atomic_open() documentation ... Browse Code »

Fix documentation of ->atomic_open() and related functions: finish_open()
and finish_no_open(). Also add details that seem to be unclear and a
source of bugs (some of which are fixed in the following series).

Cc-ing maintainers of all filesystems implementing ->atomic_open().

Signed-off-by: Miklos Szeredi
Cc: Eric Van Hensbergen
Cc: Sage Weil
Cc: Steve French
Cc: Steven Whitehouse
Cc: Trond Myklebust
Signed-off-by: Al Viro

Miklos Szeredi
2013-09-17 07:17:24 +0800

08 Sep, 2013

1 commit

c7c4591db Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/ebiederm/user-namespace ... Browse Code »

Pull namespace changes from Eric Biederman:
"This is an assorted mishmash of small cleanups, enhancements and bug
fixes.

The major theme is user namespace mount restrictions. nsown_capable
is killed as it encourages not thinking about details that need to be
considered. A very hard to hit pid namespace exiting bug was finally
tracked and fixed. A couple of cleanups to the basic namespace
infrastructure.

Finally there is an enhancement that makes per user namespace
capabilities usable as capabilities, and an enhancement that allows
the per userns root to nice other processes in the user namespace"

* 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/ebiederm/user-namespace:
userns: Kill nsown_capable it makes the wrong thing easy
capabilities: allow nice if we are privileged
pidns: Don't have unshare(CLONE_NEWPID) imply CLONE_THREAD
userns: Allow PR_CAPBSET_DROP in a user namespace.
namespaces: Simplify copy_namespaces so it is clear what is going on.
pidns: Fix hang in zap_pid_ns_processes by sending a potentially extra wakeup
sysfs: Restrict mounting sysfs
userns: Better restrictions on when proc and sysfs can be mounted
vfs: Don't copy mount bind mounts of /proc//ns/mnt between namespaces
kernel/nsproxy.c: Improving a snippet of code.
proc: Restrict mounting the proc filesystem
vfs: Lock in place mounts from more privileged users

Linus Torvalds
2013-09-08 05:35:32 +0800

04 Sep, 2013

1 commit

173c84012 switch fchmod() to fdget ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2013-09-04 11:04:45 +0800

31 Aug, 2013

1 commit

c7b96acf1 userns: Kill nsown_capable it makes the wrong thing easy ... Browse Code »

nsown_capable is a special case of ns_capable essentially for just CAP_SETUID and
CAP_SETGID. For the existing users it doesn't noticably simplify things and
from the suggested patches I have seen it encourages people to do the wrong
thing. So remove nsown_capable.

Acked-by: Serge Hallyn
Signed-off-by: "Eric W. Biederman"

Eric W. Biederman
2013-08-31 14:44:11 +0800

05 Aug, 2013

1 commit

e305f48bc fs: Fix file mode for O_TMPFILE ... Browse Code »

O_TMPFILE, like O_CREAT, should respect the requested mode and should
create regular files.

This fixes two bugs: O_TMPFILE required privilege (because the mode
ended up as 000) and it produced bogus inodes with no type.

Signed-off-by: Andy Lutomirski
Signed-off-by: Al Viro

Andy Lutomirski
2013-08-05 22:24:10 +0800

20 Jul, 2013

1 commit

ba57ea64c allow O_TMPFILE to work with O_WRONLY ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2013-07-20 07:11:32 +0800

13 Jul, 2013

1 commit

bb458c644 Safer ABI for O_TMPFILE ... Browse Code »

[suggested by Rasmus Villemoes] make O_DIRECTORY | O_RDWR part of O_TMPFILE;
that will fail on old kernels in a lot more cases than what I came up with.
And make sure O_CREAT doesn't get there...

Signed-off-by: Al Viro

Al Viro
2013-07-13 17:26:37 +0800

29 Jun, 2013

2 commits

60545d0d4 [O_TMPFILE] it's still short a few helpers, but infrastructure should be OK now... ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2013-06-29 16:57:10 +0800
f9652e10c allow build_open_flags() to return an error ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2013-06-29 16:57:09 +0800

04 Mar, 2013

3 commits

2cf096668 make SYSCALL_DEFINE<n>-generated wrappers do asmlinkage_protect ... Browse Code »

... and switch i386 to HAVE_SYSCALL_WRAPPERS, killing open-coded
uses of asmlinkage_protect() in a bunch of syscalls.

Signed-off-by: Al Viro

Al Viro
2013-03-04 11:58:33 +0800
4a0fd5bf0 teach SYSCALL_DEFINE<n> how to deal with long long/unsigned long long ... Browse Code »

... and convert a bunch of SYSCALL_DEFINE ones to SYSCALL_DEFINE,
killing the boilerplate crap around them.

Signed-off-by: Al Viro

Al Viro
2013-03-04 11:46:22 +0800
56a79b7b0 Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs ... Browse Code »

Pull more VFS bits from Al Viro:
"Unfortunately, it looks like xattr series will have to wait until the
next cycle ;-/

This pile contains 9p cleanups and fixes (races in v9fs_fid_add()
etc), fixup for nommu breakage in shmem.c, several cleanups and a bit
more file_inode() work"

* 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs:
constify path_get/path_put and fs_struct.c stuff
fix nommu breakage in shmem.c
cache the value of file_inode() in struct file
9p: if v9fs_fid_lookup() gets to asking server, it'd better have hashed dentry
9p: make sure ->lookup() adds fid to the right dentry
9p: untangle ->lookup() a bit
9p: double iput() in ->lookup() if d_materialise_unique() fails
9p: v9fs_fid_add() can't fail now
v9fs: get rid of v9fs_dentry
9p: turn fid->dlist into hlist
9p: don't bother with private lock in ->d_fsdata; dentry->d_lock will do just fine
more file_inode() open-coded instances
selinux: opened file can't have NULL or negative ->f_path.dentry

(In the meantime, the hlist traversal macros have changed, so this
required a semantic conflict fixup for the newly hlistified fid->dlist)

Linus Torvalds
2013-03-04 05:23:03 +0800

03 Mar, 2013

1 commit

14cc0b55b Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/signal ... Browse Code »

Pull signal/compat fixes from Al Viro:
"Fixes for several regressions introduced in the last signal.git pile,
along with fixing bugs in truncate and ftruncate compat (on just about
anything biarch at least one of those two had been done wrong)."

* 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/signal:
compat: restore timerfd settime and gettime compat syscalls
[regression] braino in "sparc: convert to ksignal"
fix compat truncate/ftruncate
switch lseek to COMPAT_SYSCALL_DEFINE
lseek() and truncate() on sparc really need sign extension

Linus Torvalds
2013-03-03 00:34:06 +0800

02 Mar, 2013

1 commit

dd37978c5 cache the value of file_inode() in struct file ... Browse Code »

Note that this thing does *not* contribute to inode refcount;
it's pinned down by dentry.

Signed-off-by: Al Viro

Al Viro
2013-03-02 08:48:30 +0800

26 Feb, 2013

1 commit

21d206819 get_empty_filp()/alloc_file() leave both ->f_pos and ->f_version zero ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2013-02-26 15:46:11 +0800

25 Feb, 2013

1 commit

3f6d078d4 fix compat truncate/ftruncate ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2013-02-25 22:24:55 +0800

23 Feb, 2013

2 commits

1afc99bea propagate error from get_empty_filp() to its callers ... Browse Code »

Based on parts from Anatol's patch (the rest is the next commit).

Signed-off-by: Al Viro

Al Viro
2013-02-23 12:31:32 +0800
496ad9aa8 new helper: file_inode(file) ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2013-02-23 12:31:31 +0800

21 Dec, 2012

7 commits

99a5df37a vfs: make fchownat retry once on ESTALE errors ... Browse Code »

Signed-off-by: Jeff Layton
Signed-off-by: Al Viro

Jeff Layton
2012-12-21 07:50:07 +0800
14ff690c0 vfs: make fchmodat retry once on ESTALE errors ... Browse Code »

Signed-off-by: Jeff Layton
Signed-off-by: Al Viro

Jeff Layton
2012-12-21 07:50:07 +0800
2771261ec vfs: have chroot retry once on ESTALE error ... Browse Code »

Signed-off-by: Jeff Layton
Signed-off-by: Al Viro

Jeff Layton
2012-12-21 07:50:06 +0800
0291c0a55 vfs: have chdir retry lookup and call once on ESTALE error ... Browse Code »

Signed-off-by: Jeff Layton
Signed-off-by: Al Viro

Jeff Layton
2012-12-21 07:50:06 +0800
87fa55952 vfs: have faccessat retry once on an ESTALE error ... Browse Code »

Signed-off-by: Jeff Layton
Signed-off-by: Al Viro

Jeff Layton
2012-12-21 07:50:05 +0800
48f7530d3 vfs: have do_sys_truncate retry once on an ESTALE error ... Browse Code »

Signed-off-by: Jeff Layton
Signed-off-by: Al Viro

Jeff Layton
2012-12-21 07:50:05 +0800
a02de9608 VFS: Make more complete truncate operation available to CacheFiles ... Browse Code »

Make a more complete truncate operation available to CacheFiles (including
security checks and suchlike) so that it can use this to clear invalidated
cache files.

Signed-off-by: David Howells
Acked-by: Al Viro

David Howells
2012-12-21 06:05:41 +0800

19 Nov, 2012

1 commit

a85fb273c vfs: Allow chroot if you have CAP_SYS_CHROOT in your user namespace ... Browse Code »

Once you are confined to a user namespace applications can not gain
privilege and escape the user namespace so there is no longer a reason
to restrict chroot.

Acked-by: Serge Hallyn
Signed-off-by: "Eric W. Biederman"

Eric W. Biederman
2012-11-19 21:59:17 +0800

13 Oct, 2012

1 commit

669abf4e5 vfs: make path_openat take a struct filename pointer ... Browse Code »

...and fix up the callers. For do_file_open_root, just declare a
struct filename on the stack and fill out the .name field. For
do_filp_open, make it also take a struct filename pointer, and fix up its
callers to call it appropriately.

For filp_open, add a variant that takes a struct filename pointer and turn
filp_open into a wrapper around it.

Signed-off-by: Jeff Layton
Signed-off-by: Al Viro

Jeff Layton
2012-10-13 08:15:09 +0800