01 Apr, 2012

34 commits

  • Linus Torvalds
     
  • Pull virtio S3 support patches from Amit Shah:
    "Turns out S3 is not different from S4 for virtio devices: the device
    is assumed to be reset, so the host and guest state are to be assumed
    to be out of sync upon resume. We handle the S4 case with exactly the
    same scenario, so just point the suspend/resume routines to the
    freeze/restore ones.

    Once that is done, we also use the PM API's macro to initialise the
    sleep functions.

    A couple of cleanups are included: there's no need for special thaw
    processing in the balloon driver, so that's addressed in patches 1 and
    2.

    Testing: both S3 and S4 support have been tested using these patches
    using a similar method used earlier during S4 patch development: a
    guest is started with virtio-blk as the only disk, a virtio network
    card, a virtio-serial port and a virtio balloon device. Ping from
    guest to host, dd /dev/zero to a file on the disk, and IO from the
    host on the virtio-serial port, all at once, while exercising S4 and
    S3 (separately) were tested. They all continue to work fine after
    resume. virtio balloon values too were tested by inflating and
    deflating the balloon."

    Pulling from Amit, since Rusty is off getting married (and presumably
    shaving people).

    * 's3-for-3.4' of git://git.kernel.org/pub/scm/linux/kernel/git/amit/virtio-console:
    virtio-pci: switch to PM ops macro to initialise PM functions
    virtio-pci: S3 support
    virtio-pci: drop restore_common()
    virtio: drop thaw PM operation
    virtio: balloon: Allow stats update after restore from S4

    Linus Torvalds
     
  • Pull second try at vfs part d#2 from Al Viro:
    "Miklos' first series (with do_lookup() rewrite split into edible
    chunks) + assorted bits and pieces.

    The 'untangling of do_lookup()' series is is a splitup of what used to
    be a monolithic patch from Miklos, so this series is basically "how do
    I convince myself that his patch is correct (or find a hole in it)".
    No holes found and I like the resulting cleanup, so in it went..."

    Changes from try 1: Fix a boot problem with selinux, and commit messages
    prettied up a bit.

    * 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs: (24 commits)
    vfs: fix out-of-date dentry_unhash() comment
    vfs: split __lookup_hash
    untangling do_lookup() - take __lookup_hash()-calling case out of line.
    untangling do_lookup() - switch to calling __lookup_hash()
    untangling do_lookup() - merge d_alloc_and_lookup() callers
    untangling do_lookup() - merge failure exits in !dentry case
    untangling do_lookup() - massage !dentry case towards __lookup_hash()
    untangling do_lookup() - get rid of need_reval in !dentry case
    untangling do_lookup() - eliminate a loop.
    untangling do_lookup() - expand the area under ->i_mutex
    untangling do_lookup() - isolate !dentry stuff from the rest of it.
    vfs: move MAY_EXEC check from __lookup_hash()
    vfs: don't revalidate just looked up dentry
    vfs: fix d_need_lookup/d_revalidate order in do_lookup
    ext3: move headers to fs/ext3/
    migrate ext2_fs.h guts to fs/ext2/ext2.h
    new helper: ext2_image_size()
    get rid of pointless includes of ext2_fs.h
    ext2: No longer export ext2_fs.h to user space
    mtdchar: kill persistently held vfsmount
    ...

    Linus Torvalds
     
  • Pull scheduler fixes from Ingo Molnar.

    * 'sched-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
    sched: Fix incorrect usage of for_each_cpu_mask() in select_fallback_rq()
    sched: Fix __schedule_bug() output when called from an interrupt
    sched/arch: Introduce the finish_arch_post_lock_switch() scheduler callback

    Linus Torvalds
     
  • Pull perf updates and fixes from Ingo Molnar:
    "It's mostly fixes, but there's also two late items:

    - preliminary GTK GUI support for perf report
    - PMU raw event format descriptors in sysfs, to be parsed by tooling

    The raw event format in sysfs is a new ABI. For example for the 'CPU'
    PMU we have:

    aldebaran:~> ll /sys/bus/event_source/devices/cpu/format/*
    -r--r--r--. 1 root root 4096 Mar 31 10:29 /sys/bus/event_source/devices/cpu/format/any
    -r--r--r--. 1 root root 4096 Mar 31 10:29 /sys/bus/event_source/devices/cpu/format/cmask
    -r--r--r--. 1 root root 4096 Mar 31 10:29 /sys/bus/event_source/devices/cpu/format/edge
    -r--r--r--. 1 root root 4096 Mar 31 10:29 /sys/bus/event_source/devices/cpu/format/event
    -r--r--r--. 1 root root 4096 Mar 31 10:29 /sys/bus/event_source/devices/cpu/format/inv
    -r--r--r--. 1 root root 4096 Mar 31 10:29 /sys/bus/event_source/devices/cpu/format/offcore_rsp
    -r--r--r--. 1 root root 4096 Mar 31 10:29 /sys/bus/event_source/devices/cpu/format/pc
    -r--r--r--. 1 root root 4096 Mar 31 10:29 /sys/bus/event_source/devices/cpu/format/umask

    those lists of fields contain a specific format:

    aldebaran:~> cat /sys/bus/event_source/devices/cpu/format/offcore_rsp
    config1:0-63

    So, those who wish to specify raw events can now use the following
    event format:

    -e cpu/cmask=1,event=2,umask=3

    Most people will not want to specify any events (let alone raw
    events), they'll just use whatever default event the tools use.

    But for more obscure PMU events that have no cross-architecture
    generic events the above syntax is more usable and a bit more
    structured than specifying hex numbers."

    * 'perf-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: (41 commits)
    perf tools: Remove auto-generated bison/flex files
    perf annotate: Fix off by one symbol hist size allocation and hit accounting
    perf tools: Add missing ref-cycles event back to event parser
    perf annotate: addr2line wants addresses in same format as objdump
    perf probe: Finder fails to resolve function name to address
    tracing: Fix ent_size in trace output
    perf symbols: Handle NULL dso in dso__name_len
    perf symbols: Do not include libgen.h
    perf tools: Fix bug in raw sample parsing
    perf tools: Fix display of first level of callchains
    perf tools: Switch module.h into export.h
    perf: Move mmap page data_head offset assertion out of header
    perf: Fix mmap_page capabilities and docs
    perf diff: Fix to work with new hists design
    perf tools: Fix modifier to be applied on correct events
    perf tools: Fix various casting issues for 32 bits
    perf tools: Simplify event_read_id exit path
    tracing: Fix ftrace stack trace entries
    tracing: Move the tracing_on/off() declarations into CONFIG_TRACING
    perf report: Add a simple GTK2-based 'perf report' browser
    ...

    Linus Torvalds
     
  • Pull PARISC misc updates from James Bottomley:
    "This is a couple of minor updates (fixing lws futex locking and
    removing some obsolete cpu_*_map calls)."

    * tag 'parisc-misc' of git://git.kernel.org/pub/scm/linux/kernel/git/jejb/parisc-2.6:
    [PARISC] remove references to cpu_*_map.
    [PARISC] futex: Use same lock set as lws calls

    Linus Torvalds
     
  • Pull SCSI updates from James Bottomley:
    "This is primarily another round of driver updates (lpfc, bfa, fcoe,
    ipr) plus a new ufshcd driver. There shouldn't be anything
    controversial in here (The final deletion of scsi proc_ops which
    caused some build breakage has been held over until the next merge
    window to give us more time to stabilise it).

    I'm afraid, with me moving continents at exactly the wrong time,
    anything submitted after the merge window opened has been held over to
    the next merge window."

    * tag 'scsi-misc' of git://git.kernel.org/pub/scm/linux/kernel/git/jejb/scsi-misc-2.6: (63 commits)
    [SCSI] ipr: Driver version 2.5.3
    [SCSI] ipr: Increase alignment boundary of command blocks
    [SCSI] ipr: Increase max concurrent oustanding commands
    [SCSI] ipr: Remove unnecessary memory barriers
    [SCSI] ipr: Remove unnecessary interrupt clearing on new adapters
    [SCSI] ipr: Fix target id allocation re-use problem
    [SCSI] atp870u, mpt2sas, qla4xxx use pci_dev->revision
    [SCSI] fcoe: Drop the rtnl_mutex before calling fcoe_ctlr_link_up
    [SCSI] bfa: Update the driver version to 3.0.23.0
    [SCSI] bfa: BSG and User interface fixes.
    [SCSI] bfa: Fix to avoid vport delete hang on request queue full scenario.
    [SCSI] bfa: Move service parameter programming logic into firmware.
    [SCSI] bfa: Revised Fabric Assigned Address(FAA) feature implementation.
    [SCSI] bfa: Flash controller IOC pll init fixes.
    [SCSI] bfa: Serialize the IOC hw semaphore unlock logic.
    [SCSI] bfa: Modify ISR to process pending completions
    [SCSI] bfa: Add fc host issue lip support
    [SCSI] mpt2sas: remove extraneous sas_log_info messages
    [SCSI] libfc: fcoe_transport_create fails in single-CPU environment
    [SCSI] fcoe: reduce contention for fcoe_rx_list lock [v2]
    ...

    Linus Torvalds
     
  • 64252c75a2196a0cf1e0d3777143ecfe0e3ae650 "vfs: remove dget() from
    dentry_unhash()" changed the implementation but not the comment.

    Cc: Sage Weil
    Signed-off-by: J. Bruce Fields
    Signed-off-by: Al Viro

    J. Bruce Fields
     
  • Split __lookup_hash into two component functions:

    lookup_dcache - tries cached lookup, returns whether real lookup is needed
    lookup_real - calls i_op->lookup

    This eliminates code duplication between d_alloc_and_lookup() and
    d_inode_lookup().

    Signed-off-by: Miklos Szeredi
    Signed-off-by: Al Viro

    Miklos Szeredi
     
  • Signed-off-by: Al Viro

    Al Viro
     
  • now we have __lookup_hash() open-coded if !dentry case;
    just call the damn thing instead...

    Signed-off-by: Al Viro

    Al Viro
     
  • Signed-off-by: Al Viro

    Al Viro
     
  • Signed-off-by: Al Viro

    Al Viro
     
  • Reorder if-else cases for starters...

    Signed-off-by: Al Viro

    Al Viro
     
  • Everything arriving into if (!dentry) will have need_reval = 1.
    Indeed, the only way to get there with need_reval reset to 0 would
    be via
    if (unlikely(d_need_lookup(dentry)))
    goto unlazy;
    if (unlikely(dentry->d_flags & DCACHE_OP_REVALIDATE)) {
    status = d_revalidate(dentry, nd);
    if (unlikely(status

    Al Viro
     
  • d_lookup() *will* fail after successful d_invalidate(), if we are
    holding i_mutex all along. IOW, we don't need to jump back to
    l: - we know what path will be taken there and can do that (i.e.
    d_alloc_and_lookup()) directly.

    Signed-off-by: Al Viro

    Al Viro
     
  • keep holding ->i_mutex over revalidation parts

    Signed-off-by: Al Viro

    Al Viro
     
  • Duplicate the revalidation-related parts into if (!dentry) branch.
    Next step will be to pull them under i_mutex.

    This and the next 8 commits are more or less a splitup of patch
    by Miklos; folks, when you are working with something that convoluted,
    carve your patches up into easily reviewed steps, especially when
    a lot of codepaths involved are rarely hit...

    Signed-off-by: Al Viro

    Al Viro
     
  • The only caller of __lookup_hash() that needs the exec permission check on
    parent is lookup_one_len().

    All lookup_hash() callers already checked permission in LOOKUP_PARENT walk.

    Signed-off-by: Miklos Szeredi
    Signed-off-by: Al Viro

    Miklos Szeredi
     
  • __lookup_hash() calls ->lookup() if the dentry needs lookup and on success
    revalidates the dentry (all under dir->i_mutex).

    While this is harmless it doesn't make a lot of sense.

    Signed-off-by: Miklos Szeredi
    Signed-off-by: Al Viro

    Miklos Szeredi
     
  • Doing revalidate on a dentry which has not yet been looked up makes no sense.

    Move the d_need_lookup() check before d_revalidate().

    Signed-off-by: Miklos Szeredi
    Signed-off-by: Al Viro

    Miklos Szeredi
     
  • Signed-off-by: Al Viro

    Al Viro
     
  • Signed-off-by: Al Viro

    Al Viro
     
  • ... implemented that way since the next commit will leave it
    almost alone in ext2_fs.h - most of the file (including
    struct ext2_super_block) is going to move to fs/ext2/ext2.h.

    Signed-off-by: Al Viro

    Al Viro
     
  • Signed-off-by: Al Viro

    Al Viro
     
  • Since the on-disk format has been stable for quite some time, users
    should either use the headers provided by libext2fs or keep a private
    copy of this header. For the full discussion, see this thread:

    https://lkml.org/lkml/2012/3/21/516

    While at it, this commit removes all __KERNEL__ guards, which are now
    unnecessary.

    Signed-off-by: Thierry Reding
    Cc: Al Viro
    Cc: Jan Kara
    Cc: Ted Ts'o
    Cc: Artem Bityutskiy
    Cc: Andreas Dilger
    Cc: linux-ext4@vger.kernel.org

    Thierry Reding
     
  • ... and mtdchar_notifier along with it; just have ->drop_inode() that
    will unconditionally get evict them instead of dances on mtd device
    removal and use simple_pin_fs() instead of kern_mount()

    Signed-off-by: Al Viro

    Al Viro
     
  • move mode-dependent parts to callers, kill unused arguments

    Signed-off-by: Al Viro

    Al Viro
     
  • Signed-off-by: Al Viro

    Al Viro
     
  • Signed-off-by: Al Viro

    Al Viro
     
  • Signed-off-by: Al Viro

    Al Viro
     
  • Since 3.2.12 and 3.3, some systems are failing to boot with a BUG_ON.
    Some other systems using the pata_jmicron driver fail to boot because no
    disks are detected. Passing pcie_aspm=force on the kernel command line
    works around it.

    The cause: commit 4949be16822e ("PCI: ignore pre-1.1 ASPM quirking when
    ASPM is disabled") changed the behaviour of pcie_aspm_sanity_check() to
    always return 0 if aspm is disabled, in order to avoid cases where we
    changed ASPM state on pre-PCIe 1.1 devices.

    This skipped the secondary function of pcie_aspm_sanity_check which was
    to avoid us enabling ASPM on devices that had non-PCIe children, causing
    trouble later on. Move the aspm_disabled check so we continue to honour
    that scenario.

    Addresses https://bugzilla.kernel.org/show_bug.cgi?id=42979 and
    http://bugs.debian.org/665420

    Reported-by: Romain Francoise # kernel panic
    Reported-by: Chris Holland # disk detection trouble
    Signed-off-by: Matthew Garrett
    Cc: stable@vger.kernel.org
    Tested-by: Hatem Masmoudi # Dell Latitude E5520
    Tested-by: janek # pata_jmicron with JMB362/JMB363
    [jn: with more symptoms in log message]
    Signed-off-by: Jonathan Nieder
    Signed-off-by: Linus Torvalds

    Matthew Garrett
     
  • Now that all the slow-path code is gone from these functions, we can
    inline them into the main caller - avc_has_perm_flags().

    Now the compiler can see that 'avc' is allocated on the stack for this
    case, which helps register pressure a bit. It also actually shrinks the
    total stack frame, because the stack frame that avc_has_perm_flags()
    always needed (for that 'avc' allocation) is now sufficient for the
    inlined functions too.

    Inlining isn't bad - but mindless inlining of cold code (see the
    previous commit) is.

    Signed-off-by: Linus Torvalds

    Linus Torvalds
     
  • The selinux AVC paths remain some of the hottest (and deepest) codepaths
    at filename lookup time, and we make it worse by having the slow path
    cases take up I$ and stack space even when they don't trigger. Gcc
    tends to always want to inline functions that are just called once -
    never mind that this might make for slower and worse code in the caller.

    So this tries to improve on it a bit by making the slow-path cases
    explicitly separate functions that are marked noinline, causing gcc to
    at least no longer allocate stack space for them unless they are
    actually called. It also seems to help register allocation a tiny bit,
    since gcc now doesn't take the slow case code into account.

    Uninlining the slow path may also allow us to inline the remaining hot
    path into the one caller that actually matters: avc_has_perm_flags().
    I'll have to look at that separately, but both avc_audit() and
    avc_has_perm_noaudit() are now small and lean enough that inlining them
    may make sense.

    Signed-off-by: Linus Torvalds

    Linus Torvalds
     

31 Mar, 2012

6 commits