26 May, 2014

2 commits

  • Pull AFS fixes and cleanups from David Howells:
    "Here are some patches to the AFS filesystem:

    1) Fix problems in the clean-up parts of the cache manager service
    handler.

    2) Split afs_end_call() introduced in (1) and replace some identical
    code elsewhere with a call to the first half of the split function.

    3) Fix an error introduced in the workqueue PREPARE_WORK() elimination
    commits.

    4) Clean up argument passing to functions called from the workqueue as
    there's now an insulating layer between them and the workqueue.
    This is possible from (3)"

    * 'afs' of git://git.kernel.org/pub/scm/linux/kernel/git/dhowells/linux-fs:
    AFS: Pass an afs_call* to call->async_workfn() instead of a work_struct*
    AFS: Fix kafs module unloading
    AFS: Part of afs_end_call() is identical to code elsewhere, so split it
    AFS: Fix cache manager service handlers

    Linus Torvalds
     
  • Pull two nfsd bugfixes from Bruce Fields:
    "Just two bugfixes, one for a merge-window-introduced ACL regression,
    the other for a longer-standing v4 state bug"

    * 'for-3.15' of git://linux-nfs.org/~bfields/linux:
    nfsd4: warn on finding lockowner without stateid's
    nfsd4: remove lockowner when removing lock stateid
    nfsd4: fix corruption on setting an ACL.

    Linus Torvalds
     

24 May, 2014

1 commit

  • In dlm_init, if create dlm_lockname_cache failed in
    dlm_init_master_caches, it will destroy dlm_lockres_cache which created
    before twice. And this will cause system die when loading modules.

    Signed-off-by: Joseph Qi
    Cc: Mark Fasheh
    Cc: Joel Becker
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Joseph Qi
     

23 May, 2014

3 commits

  • call->async_workfn() can take an afs_call* arg rather than a work_struct* as
    the functions assigned there are now called from afs_async_workfn() which has
    to call container_of() anyway.

    Signed-off-by: David Howells
    Reviewed-by: Nathaniel Wesley Filardo
    Reviewed-by: Tejun Heo

    David Howells
     
  • At present, it is not possible to successfully unload the kafs module if there
    are outstanding async outgoing calls (those made with afs_make_call()). This
    appears to be due to the changes introduced by:

    commit 059499453a9abd1857d442b44da8b4c126dc72a8
    Author: Tejun Heo
    Date: Fri Mar 7 10:24:50 2014 -0500
    Subject: afs: don't use PREPARE_WORK

    which didn't go far enough. The problem is due to:

    (1) The aforementioned commit introduced a separate handler function pointer
    in the call, call->async_workfn, in addition to the original workqueue
    item, call->async_work, for asynchronous operations because workqueues
    subsystem cannot handle the workqueue item pointer being changed whilst
    the item is queued or being processed.

    (2) afs_async_workfn() was introduced in that commit to be the callback for
    call->async_work. Its sole purpose is to run whatever call->async_workfn
    points to.

    (3) call->async_workfn is only used from afs_async_workfn(), which is only
    set on async_work by afs_collect_incoming_call() - ie. for incoming
    calls.

    (4) call->async_workfn is *not* set by afs_make_call() when outgoing calls are
    made, and call->async_work is set afs_process_async_call() - and not
    afs_async_workfn().

    (5) afs_process_async_call() now changes call->async_workfn rather than
    call->async_work to point to afs_delete_async_call() to clean up, but this
    is only effective for incoming calls because call->async_work does not
    point to afs_async_workfn() for outgoing calls.

    (6) Because, for incoming calls, call->async_work remains pointing to
    afs_process_async_call() this results in an infinite loop.

    Instead, make the workqueue uniformly vector through call->async_workfn, via
    afs_async_workfn() and simply initialise call->async_workfn to point to
    afs_process_async_call() in afs_make_call().

    Signed-off-by: Nathaniel Wesley Filardo
    Signed-off-by: David Howells
    Reviewed-by: Tejun Heo

    Nathaniel Wesley Filardo
     
  • Split afs_end_call() into two pieces, one of which is identical to code in
    afs_process_async_call(). Replace the latter with a call to the first part of
    afs_end_call().

    Signed-off-by: Nathaniel Wesley Filardo
    Signed-off-by: David Howells

    Nathaniel Wesley Filardo
     

22 May, 2014

2 commits

  • Pull two btrfs fixes from Chris Mason:
    "This has two fixes that we've been testing for 3.16, but since both
    are safe and fix real bugs, it makes sense to send for 3.15 instead"

    * 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/mason/linux-btrfs:
    Btrfs: send, fix incorrect ref access when using extrefs
    Btrfs: fix EIO on reading file after ioctl clone works on it

    Linus Torvalds
     
  • Pull xfs fixes from Dave Chinner:
    "Code inspection of the XFS error number sign translations found a
    bunch of issues, including returning incorrectly signed errors for
    some data integrity operations.

    These leak to userspace and result in applications not getting the
    errors correctly reported. Hence they need fixing sooner rather than
    later.

    A couple of the bugs are in data integrity operations, a couple more
    are in the new COLLAPSE_RANGE code. One of these came in through a
    recent ext4 merge and so I had to update the base tree to 3.15-rc5
    before fixing the issues"

    * tag 'xfs-for-linus-3.15-rc6' of git://oss.sgi.com/xfs/xfs:
    xfs: list_lru_init returns a negative error
    xfs: negate xfs_icsb_init_counters error value
    xfs: negate mount workqueue init error value
    xfs: fix wrong err sign on xfs_set_acl()
    xfs: fix wrong errno from xfs_initxattrs
    xfs: correct error sign on COLLAPSE_RANGE errors
    xfs: xfs_commit_metadata returns wrong errno
    xfs: fix incorrect error sign in xfs_file_aio_read
    xfs: xfs_dir_fsync() returns positive errno

    Linus Torvalds
     

21 May, 2014

6 commits

  • The current code assumes a one-to-one lockownerlock stateid
    correspondance.

    Cc: stable@vger.kernel.org
    Signed-off-by: J. Bruce Fields

    J. Bruce Fields
     
  • The nfsv4 state code has always assumed a one-to-one correspondance
    between lock stateid's and lockowners even if it appears not to in some
    places.

    We may actually change that, but for now when FREE_STATEID releases a
    lock stateid it also needs to release the parent lockowner.

    Symptoms were a subsequent LOCK crashing in find_lockowner_str when it
    calls same_lockowner_ino on a lockowner that unexpectedly has an empty
    so_stateids list.

    Cc: stable@vger.kernel.org
    Signed-off-by: J. Bruce Fields

    J. Bruce Fields
     
  • Fix the cache manager RPC service handlers. The afs_send_empty_reply() and
    afs_send_simple_reply() functions:

    (a) Kill the call and free up the buffers associated with it if they fail.

    (b) Return with call intact if it they succeed.

    However, none of the callers actually check the result or clean up if
    successful - and may use the now non-existent data if it fails.

    This was detected by Dan Carpenter using a static checker:

    The patch 08e0e7c82eea: "[AF_RXRPC]: Make the in-kernel AFS
    filesystem use AF_RXRPC." from Apr 26, 2007, leads to the following
    static checker warning:
    "fs/afs/cmservice.c:155 SRXAFSCB_CallBack()
    warn: 'call' was already freed."

    Reported-by: Dan Carpenter
    Signed-off-by: David Howells

    David Howells
     
  • Pull driver core fixes from Greg KH:
    "Here are two driver core (well, sysfs) fixes for 3.15-rc6 that resolve
    some reported issues and a regression from 3.13"

    * tag 'driver-core-3.15-rc6' of git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/driver-core:
    sysfs: make sure read buffer is zeroed
    kernfs, sysfs, cgroup: restrict extra perm check on open to sysfs

    Linus Torvalds
     
  • When running send, if an inode only has extended reference items
    associated to it and no regular references, send.c:get_first_ref()
    was incorrectly assuming the reference it found was of type
    BTRFS_INODE_REF_KEY due to use of the wrong key variable.
    This caused weird behaviour when using the found item has a regular
    reference, such as weird path string, and occasionally (when lucky)
    a crash:

    [ 190.600652] general protection fault: 0000 [#1] SMP DEBUG_PAGEALLOC
    [ 190.600994] Modules linked in: btrfs xor raid6_pq binfmt_misc nfsd auth_rpcgss oid_registry nfs_acl nfs lockd fscache sunrpc psmouse serio_raw evbug pcspkr i2c_piix4 e1000 floppy
    [ 190.602565] CPU: 2 PID: 14520 Comm: btrfs Not tainted 3.13.0-fdm-btrfs-next-26+ #1
    [ 190.602728] Hardware name: Bochs Bochs, BIOS Bochs 01/01/2011
    [ 190.602868] task: ffff8800d447c920 ti: ffff8801fa79e000 task.ti: ffff8801fa79e000
    [ 190.603030] RIP: 0010:[] [] memcpy+0x54/0x110
    [ 190.603262] RSP: 0018:ffff8801fa79f880 EFLAGS: 00010202
    [ 190.603395] RAX: ffff8800d4326e3f RBX: 000000000000036a RCX: ffff880000000000
    [ 190.603553] RDX: 000000000000032a RSI: ffe708844042936a RDI: ffff8800d43271a9
    [ 190.603710] RBP: ffff8801fa79f8c8 R08: 00000000003a4ef0 R09: 0000000000000000
    [ 190.603867] R10: 793a4ef09f000000 R11: 9f0000000053726f R12: ffff8800d43271a9
    [ 190.604020] R13: 0000160000000000 R14: ffff8802110134f0 R15: 000000000000036a
    [ 190.604020] FS: 00007fb423d09b80(0000) GS:ffff880216200000(0000) knlGS:0000000000000000
    [ 190.604020] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
    [ 190.604020] CR2: 00007fb4229d4b78 CR3: 00000001f5d76000 CR4: 00000000000006e0
    [ 190.604020] Stack:
    [ 190.604020] ffffffffa01f4d49 ffff8801fa79f8f0 00000000000009f9 ffff8801fa79f8c8
    [ 190.604020] 00000000000009f9 ffff880211013260 000000000000f971 ffff88021147dba8
    [ 190.604020] 00000000000009f9 ffff8801fa79f918 ffffffffa02367f5 ffff8801fa79f928
    [ 190.604020] Call Trace:
    [ 190.604020] [] ? read_extent_buffer+0xb9/0x120 [btrfs]
    [ 190.604020] [] fs_path_add_from_extent_buffer+0x45/0x60 [btrfs]
    [ 190.604020] [] get_first_ref+0x1f6/0x210 [btrfs]
    [ 190.604020] [] __get_cur_name_and_parent+0x174/0x3a0 [btrfs]
    [ 190.604020] [] ? kmem_cache_alloc_trace+0x11d/0x1e0
    [ 190.604020] [] ? fs_path_alloc+0x24/0x60 [btrfs]
    [ 190.604020] [] get_cur_path+0xd1/0x240 [btrfs]
    (...)

    Steps to reproduce (either crash or some weirdness like an odd path string):

    mkfs.btrfs -f -O extref /dev/sdd
    mount /dev/sdd /mnt

    mkdir /mnt/testdir
    touch /mnt/testdir/foobar

    for i in `seq 1 2550`; do
    ln /mnt/testdir/foobar /mnt/testdir/foobar_link_`printf "%04d" $i`
    done

    ln /mnt/testdir/foobar /mnt/testdir/final_foobar_name

    rm -f /mnt/testdir/foobar
    for i in `seq 1 2550`; do
    rm -f /mnt/testdir/foobar_link_`printf "%04d" $i`
    done

    btrfs subvolume snapshot -r /mnt /mnt/mysnap
    btrfs send /mnt/mysnap -f /tmp/mysnap.send

    Signed-off-by: Filipe David Borba Manana
    Signed-off-by: Chris Mason
    Reviewed-by: Liu Bo

    Filipe Manana
     
  • For inline data extent, we need to make its length aligned, otherwise,
    we can get a phantom extent map which confuses readpages() to return -EIO.

    This can be detected by xfstests/btrfs/035.

    Reported-by: David Disseldorp
    Signed-off-by: Liu Bo
    Signed-off-by: Chris Mason

    Liu Bo
     

20 May, 2014

2 commits

  • Pull Metag architecture and related fixes from James Hogan:
    "Mostly fixes for metag and parisc relating to upgrowing stacks.

    - Fix missing compiler barriers in metag memory barriers.
    - Fix BUG_ON on metag when RLIMIT_STACK hard limit is increased
    beyond safe value.
    - Make maximum stack size configurable. This reduces the default
    user stack size back to 80MB (especially on parisc after their
    removal of _STK_LIM_MAX override). This only affects metag and
    parisc.
    - Remove metag _STK_LIM_MAX override to match other arches and follow
    parisc, now that it is safe to do so (due to the BUG_ON fix
    mentioned above).
    - Finally now that both metag and parisc _STK_LIM_MAX overrides have
    been removed, it makes sense to remove _STK_LIM_MAX altogether"

    * tag 'metag-for-v3.15-2' of git://git.kernel.org/pub/scm/linux/kernel/git/jhogan/metag:
    asm-generic: remove _STK_LIM_MAX
    metag: Remove _STK_LIM_MAX override
    parisc,metag: Do not hardcode maximum userspace stack size
    metag: Reduce maximum stack size to 256MB
    metag: fix memory barriers

    Linus Torvalds
     
  • 13c589d5b0ac ("sysfs: use seq_file when reading regular files")
    switched sysfs from custom read implementation to seq_file to enable
    later transition to kernfs. After the change, the buffer passed to
    ->show() is acquired through seq_get_buf(); unfortunately, this
    introduces a subtle behavior change. Before the commit, the buffer
    passed to ->show() was always zero as it was allocated using
    get_zeroed_page(). Because seq_file doesn't clear buffers on
    allocation and neither does seq_get_buf(), after the commit, depending
    on the behavior of ->show(), we may end up exposing uninitialized data
    to userland thus possibly altering userland visible behavior and
    leaking information.

    Fix it by explicitly clearing the buffer.

    Signed-off-by: Tejun Heo
    Reported-by: Ron
    Fixes: 13c589d5b0ac ("sysfs: use seq_file when reading regular files")
    Cc: stable # 3.13+
    Signed-off-by: Greg Kroah-Hartman

    Tejun Heo
     

16 May, 2014

1 commit


15 May, 2014

10 commits


13 May, 2014

3 commits

  • The kernfs open method - kernfs_fop_open() - inherited extra
    permission checks from sysfs. While the vfs layer allows ignoring the
    read/write permissions checks if the issuer has CAP_DAC_OVERRIDE,
    sysfs explicitly denied open regardless of the cap if the file doesn't
    have any of the UGO perms of the requested access or doesn't implement
    the requested operation. It can be debated whether this was a good
    idea or not but the behavior is too subtle and dangerous to change at
    this point.

    After cgroup got converted to kernfs, this extra perm check also got
    applied to cgroup breaking libcgroup which opens write-only files with
    O_RDWR as root. This patch gates the extra open permission check with
    a new flag KERNFS_ROOT_EXTRA_OPEN_PERM_CHECK and enables it for sysfs.
    For sysfs, nothing changes. For cgroup, root now can perform any
    operation regardless of the permissions as it was before kernfs
    conversion. Note that kernfs still fails unimplemented operations
    with -EINVAL.

    While at it, add comments explaining KERNFS_ROOT flags.

    Signed-off-by: Tejun Heo
    Reported-by: Andrey Wagin
    Tested-by: Andrey Wagin
    Cc: Li Zefan
    References: http://lkml.kernel.org/g/CANaxB-xUm3rJ-Cbp72q-rQJO5mZe1qK6qXsQM=vh0U8upJ44+A@mail.gmail.com
    Fixes: 2bd59d48ebfb ("cgroup: convert to kernfs")
    Signed-off-by: Greg Kroah-Hartman

    Tejun Heo
     
  • Pull file locking fix from Jeff Layton:
    "Fix for regression in handling of F_GETLK commands"

    * tag 'locks-v3.15-4' of git://git.samba.org/jlayton/linux:
    locks: only validate the lock vs. f_mode in F_SETLK codepaths

    Linus Torvalds
     
  • Pull cifs fix from Steve French:
    "Small cifs fix for metadata caching"

    * 'for-linus' of git://git.samba.org/sfrench/cifs-2.6:
    cifs: fix actimeo=0 corner case when cifs_i->time == jiffies

    Linus Torvalds
     

11 May, 2014

1 commit


09 May, 2014

3 commits

  • v2: replace missing break in switch statement (as pointed out by Dave
    Jones)

    commit bce7560d4946 (locks: consolidate checks for compatible
    filp->f_mode values in setlk handlers) introduced a regression in the
    F_GETLK handler.

    flock64_to_posix_lock is a shared codepath between F_GETLK and F_SETLK,
    but the f_mode checks should only be applicable to the F_SETLK codepaths
    according to POSIX.

    Instead of just reverting the patch, add a new function to do this
    checking and have the F_SETLK handlers call it.

    Cc: Dave Jones
    Reported-and-Tested-by: Reuben Farrelly
    Signed-off-by: Jeff Layton

    Jeff Layton
     
  • Pull xfs fixes from Dave Chinner:
    "The main fix is adding support for default ACLs on O_TMPFILE opened
    inodes to bring XFS into line with other filesystems. Metadata CRCs
    are now also considered well enough tested to be fully supported, so
    we're removing the shouty warnings issued at mount time for
    filesystems with that format. And there's transaction block
    reservation overrun fix.

    Summary:
    - fix a remote attribute size calculation bug that leads to a
    transaction overrun
    - add default ACLs to O_TMPFILE files
    - Remove the EXPERIMENTAL tag from filesystems with metadata CRC
    support"

    * tag 'xfs-for-linus-3.15-rc5' of git://oss.sgi.com/xfs/xfs:
    xfs: remote attribute overwrite causes transaction overrun
    xfs: initialize default acls for ->tmpfile()
    xfs: fully support v5 format filesystems

    Linus Torvalds
     
  • After setting ACL for directory, I got two problems that caused
    by the cached zero-length default posix acl.

    This patch make sure nfsd4_set_nfs4_acl calls ->set_acl
    with a NULL ACL structure if there are no entries.

    Thanks for Christoph Hellwig's advice.

    First problem:
    ............ hang ...........

    Second problem:
    [ 1610.167668] ------------[ cut here ]------------
    [ 1610.168320] kernel BUG at /root/nfs/linux/fs/nfsd/nfs4acl.c:239!
    [ 1610.168320] invalid opcode: 0000 [#1] SMP DEBUG_PAGEALLOC
    [ 1610.168320] Modules linked in: nfsv4(OE) nfs(OE) nfsd(OE)
    rpcsec_gss_krb5 fscache ip6t_rpfilter ip6t_REJECT cfg80211 xt_conntrack
    rfkill ebtable_nat ebtable_broute bridge stp llc ebtable_filter ebtables
    ip6table_nat nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat_ipv6
    ip6table_mangle ip6table_security ip6table_raw ip6table_filter
    ip6_tables iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4
    nf_nat nf_conntrack iptable_mangle iptable_security iptable_raw
    auth_rpcgss nfs_acl snd_intel8x0 ppdev lockd snd_ac97_codec ac97_bus
    snd_pcm snd_timer e1000 pcspkr parport_pc snd parport serio_raw joydev
    i2c_piix4 sunrpc(OE) microcode soundcore i2c_core ata_generic pata_acpi
    [last unloaded: nfsd]
    [ 1610.168320] CPU: 0 PID: 27397 Comm: nfsd Tainted: G OE
    3.15.0-rc1+ #15
    [ 1610.168320] Hardware name: innotek GmbH VirtualBox/VirtualBox, BIOS
    VirtualBox 12/01/2006
    [ 1610.168320] task: ffff88005ab653d0 ti: ffff88005a944000 task.ti:
    ffff88005a944000
    [ 1610.168320] RIP: 0010:[] []
    _posix_to_nfsv4_one+0x3cd/0x3d0 [nfsd]
    [ 1610.168320] RSP: 0018:ffff88005a945b00 EFLAGS: 00010293
    [ 1610.168320] RAX: 0000000000000001 RBX: ffff88006700bac0 RCX:
    0000000000000000
    [ 1610.168320] RDX: 0000000000000000 RSI: ffff880067c83f00 RDI:
    ffff880068233300
    [ 1610.168320] RBP: ffff88005a945b48 R08: ffffffff81c64830 R09:
    0000000000000000
    [ 1610.168320] R10: ffff88004ea85be0 R11: 000000000000f475 R12:
    ffff880068233300
    [ 1610.168320] R13: 0000000000000003 R14: 0000000000000002 R15:
    ffff880068233300
    [ 1610.168320] FS: 0000000000000000(0000) GS:ffff880077800000(0000)
    knlGS:0000000000000000
    [ 1610.168320] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
    [ 1610.168320] CR2: 00007f5bcbd3b0b9 CR3: 0000000001c0f000 CR4:
    00000000000006f0
    [ 1610.168320] DR0: 0000000000000000 DR1: 0000000000000000 DR2:
    0000000000000000
    [ 1610.168320] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7:
    0000000000000400
    [ 1610.168320] Stack:
    [ 1610.168320] ffffffff00000000 0000000b67c83500 000000076700bac0
    0000000000000000
    [ 1610.168320] ffff88006700bac0 ffff880068233300 ffff88005a945c08
    0000000000000002
    [ 1610.168320] 0000000000000000 ffff88005a945b88 ffffffffa034e2d5
    000000065a945b68
    [ 1610.168320] Call Trace:
    [ 1610.168320] [] nfsd4_get_nfs4_acl+0x95/0x150 [nfsd]
    [ 1610.168320] [] nfsd4_encode_fattr+0x646/0x1e70 [nfsd]
    [ 1610.168320] [] ? kmemleak_alloc+0x4e/0xb0
    [ 1610.168320] [] ?
    nfsd_setuser_and_check_port+0x52/0x80 [nfsd]
    [ 1610.168320] [] ? selinux_cred_prepare+0x1b/0x30
    [ 1610.168320] [] nfsd4_encode_getattr+0x5a/0x60 [nfsd]
    [ 1610.168320] [] nfsd4_encode_operation+0x67/0x110
    [nfsd]
    [ 1610.168320] [] nfsd4_proc_compound+0x21d/0x810 [nfsd]
    [ 1610.168320] [] nfsd_dispatch+0xbb/0x200 [nfsd]
    [ 1610.168320] [] svc_process_common+0x46d/0x6d0 [sunrpc]
    [ 1610.168320] [] svc_process+0x103/0x170 [sunrpc]
    [ 1610.168320] [] nfsd+0xbf/0x130 [nfsd]
    [ 1610.168320] [] ? nfsd_destroy+0x80/0x80 [nfsd]
    [ 1610.168320] [] kthread+0xd2/0xf0
    [ 1610.168320] [] ? insert_kthread_work+0x40/0x40
    [ 1610.168320] [] ret_from_fork+0x7c/0xb0
    [ 1610.168320] [] ? insert_kthread_work+0x40/0x40
    [ 1610.168320] Code: 78 02 e9 e7 fc ff ff 31 c0 31 d2 31 c9 66 89 45 ce
    41 8b 04 24 66 89 55 d0 66 89 4d d2 48 8d 04 80 49 8d 5c 84 04 e9 37 fd
    ff ff 0b 90 0f 1f 44 00 00 55 8b 56 08 c7 07 00 00 00 00 8b 46 0c
    [ 1610.168320] RIP [] _posix_to_nfsv4_one+0x3cd/0x3d0
    [nfsd]
    [ 1610.168320] RSP
    [ 1610.257313] ---[ end trace 838254e3e352285b ]---

    Signed-off-by: Kinglong Mee
    Cc: stable@vger.kernel.org
    Signed-off-by: J. Bruce Fields

    Kinglong Mee
     

07 May, 2014

6 commits

  • Merge misc fixes from Andrew Morton:
    "13 fixes"

    * emailed patches from Andrew Morton :
    agp: info leak in agpioc_info_wrap()
    fs/affs/super.c: bugfix / double free
    fanotify: fix -EOVERFLOW with large files on 64-bit
    slub: use sysfs'es release mechanism for kmem_cache
    revert "mm: vmscan: do not swap anon pages just because free+file is low"
    autofs: fix lockref lookup
    mm: filemap: update find_get_pages_tag() to deal with shadow entries
    mm/compaction: make isolate_freepages start at pageblock boundary
    MAINTAINERS: zswap/zbud: change maintainer email address
    mm/page-writeback.c: fix divide by zero in pos_ratio_polynom
    hugetlb: ensure hugepage access is denied if hugepages are not supported
    slub: fix memcg_propagate_slab_attrs
    drivers/rtc/rtc-pcf8523.c: fix month definition

    Linus Torvalds
     
  • Commit 842a859db26b ("affs: use ->kill_sb() to simplify ->put_super()
    and failure exits of ->mount()") adds .kill_sb which frees sbi but
    doesn't remove sbi free in case of parse_options error causing double
    free+random crash.

    Signed-off-by: Fabian Frederick
    Cc: Alexander Viro
    Cc: [3.14.x]
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Fabian Frederick
     
  • On 64-bit systems, O_LARGEFILE is automatically added to flags inside
    the open() syscall (also openat(), blkdev_open(), etc). Userspace
    therefore defines O_LARGEFILE to be 0 - you can use it, but it's a
    no-op. Everything should be O_LARGEFILE by default.

    But: when fanotify does create_fd() it uses dentry_open(), which skips
    all that. And userspace can't set O_LARGEFILE in fanotify_init()
    because it's defined to 0. So if fanotify gets an event regarding a
    large file, the read() will just fail with -EOVERFLOW.

    This patch adds O_LARGEFILE to fanotify_init()'s event_f_flags on 64-bit
    systems, using the same test as open()/openat()/etc.

    Addresses https://bugzilla.redhat.com/show_bug.cgi?id=696821

    Signed-off-by: Will Woods
    Acked-by: Eric Paris
    Reviewed-by: Jan Kara
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Will Woods
     
  • autofs needs to be able to see private data dentry flags for its dentrys
    that are being created but not yet hashed and for its dentrys that have
    been rmdir()ed but not yet freed. It needs to do this so it can block
    processes in these states until a status has been returned to indicate
    the given operation is complete.

    It does this by keeping two lists, active and expring, of dentrys in
    this state and uses ->d_release() to keep them stable while it checks
    the reference count to determine if they should be used.

    But with the recent lockref changes dentrys being freed sometimes don't
    transition to a reference count of 0 before being freed so autofs can
    occassionally use a dentry that is invalid which can lead to a panic.

    Signed-off-by: Ian Kent
    Cc: Al Viro
    Cc: Linus Torvalds
    Cc:
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Ian Kent
     
  • Currently, I am seeing the following when I `mount -t hugetlbfs /none
    /dev/hugetlbfs`, and then simply do a `ls /dev/hugetlbfs`. I think it's
    related to the fact that hugetlbfs is properly not correctly setting
    itself up in this state?:

    Unable to handle kernel paging request for data at address 0x00000031
    Faulting instruction address: 0xc000000000245710
    Oops: Kernel access of bad area, sig: 11 [#1]
    SMP NR_CPUS=2048 NUMA pSeries
    ....

    In KVM guests on Power, in a guest not backed by hugepages, we see the
    following:

    AnonHugePages: 0 kB
    HugePages_Total: 0
    HugePages_Free: 0
    HugePages_Rsvd: 0
    HugePages_Surp: 0
    Hugepagesize: 64 kB

    HPAGE_SHIFT == 0 in this configuration, which indicates that hugepages
    are not supported at boot-time, but this is only checked in
    hugetlb_init(). Extract the check to a helper function, and use it in a
    few relevant places.

    This does make hugetlbfs not supported (not registered at all) in this
    environment. I believe this is fine, as there are no valid hugepages
    and that won't change at runtime.

    [akpm@linux-foundation.org: use pr_info(), per Mel]
    [akpm@linux-foundation.org: fix build when HPAGE_SHIFT is undefined]
    Signed-off-by: Nishanth Aravamudan
    Reviewed-by: Aneesh Kumar K.V
    Acked-by: Mel Gorman
    Cc: Randy Dunlap
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Nishanth Aravamudan
     
  • Pull vfs fixes from Al Viro:
    "dcache fixes + kvfree() (uninlined, exported by mm/util.c) + posix_acl
    bugfix from hch"

    The dcache fixes are for a subtle LRU list corruption bug reported by
    Miklos Szeredi, where people inside IBM saw list corruptions with the
    LTP/host01 test.

    * 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs:
    nick kvfree() from apparmor
    posix_acl: handle NULL ACL in posix_acl_equiv_mode
    dcache: don't need rcu in shrink_dentry_list()
    more graceful recovery in umount_collect()
    don't remove from shrink list in select_collect()
    dentry_kill(): don't try to remove from shrink list
    expand the call of dentry_lru_del() in dentry_kill()
    new helper: dentry_free()
    fold try_prune_one_dentry()
    fold d_kill() and d_free()
    fix races between __d_instantiate() and checks of dentry flags

    Linus Torvalds