14 Apr, 2012

40 commits

  • Greg Kroah-Hartman
     
  • commit 011afa1ed8c408d694957d2474d89dc81a60b70c upstream.

    This reverts commit c1afdaff90538ef085b756454f12b29575411214.

    Users have reported connection failures in 3.3.1 and suspend/resume
    failures in 3.4-rcX. Revert this commit for now - PS IDLE can be
    fixed in a clean manner later on.

    Signed-off-by: Sujith Manoharan
    Signed-off-by: John W. Linville
    Signed-off-by: Greg Kroah-Hartman

    Sujith Manoharan
     
  • commit 79549c6dfda0603dba9a70a53467ce62d9335c33 upstream.

    keyctl_session_to_parent(task) sets ->replacement_session_keyring,
    it should be processed and cleared by key_replace_session_keyring().

    However, this task can fork before it notices TIF_NOTIFY_RESUME and
    the new child gets the bogus ->replacement_session_keyring copied by
    dup_task_struct(). This is obviously wrong and, if nothing else, this
    leads to put_cred(already_freed_cred).

    change copy_creds() to clear this member. If copy_process() fails
    before this point the wrong ->replacement_session_keyring doesn't
    matter, exit_creds() won't be called.

    Signed-off-by: Oleg Nesterov
    Acked-by: David Howells
    Signed-off-by: Linus Torvalds
    Signed-off-by: Greg Kroah-Hartman

    Oleg Nesterov
     
  • commit 45145b67f5895ff92207cffd74e65460a87920b2 upstream.

    Commit 7e07222 breaks DVBFE_ALGO_HW tuning after a retune is requested,
    which causes bad tuning on my TBS 6920.

    [ 0.769091] pci 0000:06:00.0: [14f1:8852] type 0 class 0x000400
    [ 19.733530] CORE cx23885[0]: subsystem: 6920:8888, board: TurboSight TBS 6920 [card=14,autodetected]
    [ 762.824912] cx24116_load_firmware: FW version 1.23.86.1

    7e0722215a510921cbb73ab4c37477d4dcb91bf8 [media] dvb-core: Don't pass DVBv3 parameters on tune() fops

    Although re_tune is set to true when FESTATE_RETUNE occurs, it is never
    set back to false which the old code used to do when !FESTATE_RETUNE.

    This patch sets re_tune to false if !(state & FESTATE_RETUNE).

    $ szap-s2 -a 2 "Channel 5"
    reading channels from file '/home/simon/.szap/channels.conf'
    zapping to 247 'Channel 5':
    delivery DVB-S, modulation QPSK
    sat 0, frequency 10964 MHz H, symbolrate 22000000, coderate 5/6, rolloff 0.35
    vpid 0x092a, apid 0x092b, sid 0x092d
    using '/dev/dvb/adapter2/frontend0' and '/dev/dvb/adapter2/demux0'
    status 1f | signal cf40 | snr 0000 | ber 00000000 | unc 00000000 | FE_HAS_LOCK
    status 1f | signal cf40 | snr eccd | ber 00000000 | unc 00000000 | FE_HAS_LOCK
    status 1f | signal cf40 | snr 0000 | ber 00000000 | unc 00000000 | FE_HAS_LOCK
    status 1f | signal cf40 | snr 0000 | ber 00000000 | unc 00000000 | FE_HAS_LOCK
    status 1f | signal cf40 | snr eccd | ber 00000000 | unc 00000000 | FE_HAS_LOCK
    status 1f | signal cf40 | snr 0000 | ber 00000000 | unc 00000000 | FE_HAS_LOCK
    status 1f | signal cf40 | snr 0000 | ber 00000000 | unc 00000000 | FE_HAS_LOCK
    status 1f | signal cf40 | snr eb33 | ber 00000000 | unc 00000000 | FE_HAS_LOCK
    status 1f | signal cf40 | snr eccd | ber 00000000 | unc 00000000 | FE_HAS_LOCK
    status 1f | signal cf40 | snr eccd | ber 00000000 | unc 00000000 | FE_HAS_LOCK
    status 1f | signal cf40 | snr 0000 | ber 00000000 | unc 00000000 | FE_HAS_LOCK
    status 1f | signal cf40 | snr eccd | ber 00000000 | unc 00000000 | FE_HAS_LOCK
    status 1f | signal cf40 | snr eccd | ber 00000000 | unc 00000000 | FE_HAS_LOCK
    status 1f | signal cf40 | snr 0000 | ber 00000000 | unc 00000000 | FE_HAS_LOCK
    status 1f | signal cec0 | snr eccd | ber 00000000 | unc 00000000 | FE_HAS_LOCK
    status 1f | signal cec0 | snr 0000 | ber 00000000 | unc 00000000 | FE_HAS_LOCK

    Signed-off-by: Simon Arlott
    Signed-off-by: Mauro Carvalho Chehab

    Simon Arlott
     
  • commit 556a0442e08a8bc8541587a349cbf26ed14ec6de upstream.

    The commit e399ce77e6e has broken the DVB ABI for xine:

    The problem is that xine is expecting every event after a successful
    FE_SET_FRONTEND ioctl to have a non-zero frequency parameter, regardless
    of whether the tuning process has LOCKed yet. What used to happen is
    that the events inherited the initial tuning parameters from the
    FE_SET_FRONTEND call. However, the fepriv->parameters_out struct is now
    not initialised until the status contains the FE_HAS_LOCK bit.

    You might argue that this behaviour is intentional, except that if an
    application other than xine uses the DVB adapter and manages to set the
    parameters_out.frequency field to something other than zero, then xine
    no longer has any problems until either the adapter is replugged or the
    kernel modules reloaded. This can only mean that the
    fepriv->parameters_out struct still contains the (stale) tuning
    information from the previous application.

    Signed-off-by: Chris Rankin
    Signed-off-by: Mauro Carvalho Chehab
    Signed-off-by: Greg Kroah-Hartman

    Chris Rankin
     
  • commit 8abe05c6eb358967f16bce8a02c88d57c82cfbd6 upstream.

    Commit d4a2eca "ASoC: Tegra I2S: Remove dependency on pdev->id" changed
    the prototype of tegra_i2s_debug_add, but didn't update the dummy inline
    used when !CONFIG_DEBUG_FS. Fix that.

    Signed-off-by: Stephen Warren
    Signed-off-by: Mark Brown
    Signed-off-by: Greg Kroah-Hartman

    Stephen Warren
     
  • commit 1f99e44cf059d2ed43c5a0724fa738b83800f725 upstream.

    ak4642 out_tlv is +12.0dB to -115.0 dB, and it supports mute.
    But current settings didn't care +1 step for mute.
    This patch adds it

    Signed-off-by: Kuninori Morimoto
    Signed-off-by: Mark Brown
    Signed-off-by: Greg Kroah-Hartman

    Kuninori Morimoto
     
  • commit ed0ee0ce0a3224dab5caa088a5f8b6df25924276 upstream.

    The driver frees the clock samples buffer before stopping the video
    buffers queue. If a DQBUF call arrives in-between,
    uvc_video_clock_update() will be called with a NULL clock samples
    buffer, leading to a crash. This occurs very frequently when using the
    webcam with the flash browser plugin.

    Move clock initialization/cleanup to uvc_video_enable() in order to free
    the clock samples buffer after the queue is stopped. Make sure the clock
    is reset at resume time to avoid miscalculating timestamps.

    Signed-off-by: Laurent Pinchart
    Signed-off-by: Mauro Carvalho Chehab
    Signed-off-by: Greg Kroah-Hartman

    Laurent Pinchart
     
  • commit 275029353953c2117941ade84f02a2303912fad1 upstream.

    Starting with v3.2 Jonathan reports that Xen crashes loading the ioatdma
    driver. A debug run shows:

    ioatdma 0000:00:16.4: desc[0]: (0x300cc7000->0x300cc7040) cookie: 0 flags: 0x2 ctl: 0x29 (op: 0 int_en: 1 compl: 1)
    ...
    ioatdma 0000:00:16.4: ioat_get_current_completion: phys_complete: 0xcc7000

    ...which shows that in this environment GFP_KERNEL memory may be backed
    by a 64-bit dma address. This breaks the driver's assumption that an
    unsigned long should be able to contain the physical address for
    descriptor memory. Switch to dma_addr_t which beyond being the right
    size, is the true type for the data i.e. an io-virtual address
    inidicating the engine's last processed descriptor.

    Reported-by: Jonathan Nieder
    Reported-by: William Dauchy
    Tested-by: William Dauchy
    Tested-by: Dave Jiang
    Signed-off-by: Dan Williams
    Signed-off-by: Greg Kroah-Hartman

    Dan Williams
     
  • commit a2daf263107ba3eb6db33931881731fa51c95045 upstream.

    Added Vendor/Device Id of Motorola Rokr E6 (22b8:6027) so it can be
    recognized by the "zaurus" USBNet driver.
    Applies to Linux 3.2.13 and 2.6.39.4.
    Signed-off-by: Guan Xin
    Signed-off-by: David S. Miller
    Signed-off-by: Greg Kroah-Hartman

    Guan Xin
     
  • commit 3f8349e6e98ba0455437724589072523865eae5e upstream.

    TWL6030 family of PMIC use a shadow interrupt status register
    while kernel processes the current interrupt event.
    However, any write(0 or 1) to register INT_STS_A, INT_STS_B or
    INT_STS_C clears all 3 interrupt status registers.

    Since clear of the interrupt is done on 32k clk, depending on I2C
    bus speed, we could in-adverently clear the status of a interrupt
    status pending on shadow register in the current implementation.
    This is due to the fact that multi-byte i2c write operation into
    three seperate status register could result in multiple load
    and clear of status and result in lost interrupts.

    Instead, doing a single byte write to INT_STS_A register with 0x0
    will clear all three interrupt status registers without the related
    risk.

    Acked-by: Santosh Shilimkar
    Signed-off-by: Nishanth Menon
    Signed-off-by: Samuel Ortiz
    Signed-off-by: Greg Kroah-Hartman

    Nishanth Menon
     
  • commit 9993bc635d01a6ee7f6b833b4ee65ce7c06350b1 upstream.

    When a machine boots up, the TSC generally gets reset. However,
    when kexec is used to boot into a kernel, the TSC value would be
    carried over from the previous kernel. The computation of
    cycns_offset in set_cyc2ns_scale is prone to an overflow, if the
    machine has been up more than 208 days prior to the kexec. The
    overflow happens when we multiply *scale, even though there is
    enough room to store the final answer.

    We fix this issue by decomposing tsc_now into the quotient and
    remainder of division by CYC2NS_SCALE_FACTOR and then performing
    the multiplication separately on the two components.

    Refactor code to share the calculation with the previous
    fix in __cycles_2_ns().

    Signed-off-by: Salman Qazi
    Acked-by: John Stultz
    Acked-by: Peter Zijlstra
    Cc: Paul Turner
    Cc: john stultz
    Link: http://lkml.kernel.org/r/20120310004027.19291.88460.stgit@dungbeetle.mtv.corp.google.com
    Signed-off-by: Ingo Molnar
    Cc: Mike Galbraith
    Signed-off-by: Greg Kroah-Hartman

    Salman Qazi
     
  • commit 20e0fa98b751facf9a1101edaefbc19c82616a68 upstream.

    _copy_from_pages() used to copy data from the temporary buffer to the
    user passed buffer is passed the wrong size parameter when copying
    data. res.acl_len contains both the bitmap and acl lenghts while
    acl_len contains the acl length after adjusting for the bitmap size.

    Signed-off-by: Sachin Prabhu
    Signed-off-by: Trond Myklebust
    Cc: Josh Boyer
    Signed-off-by: Greg Kroah-Hartman

    Sachin Prabhu
     
  • commit 5719b81988f3c24ff694dc3a37e35b35630a3966 upstream.

    The wireless rfkill should charged by sony-laptop but not acer-wmi.
    So, add Sony's SNY5001 acpi device to blacklist in acer-wmi.

    Tested on Sony Vaio

    Cc: Carlos Corbacho
    Cc: Matthew Garrett
    Cc: Mattia Dongili
    Cc: Dimitris N
    Tested-by: Dimitris N
    Signed-off-by: Lee, Chun-Yi
    Signed-off-by: Matthew Garrett
    Signed-off-by: Greg Kroah-Hartman

    Lee, Chun-Yi
     
  • This reverts commit a998dc2fa76f496d2944f0602b920d1d10d7467d
    [73d63d038ee9f769f5e5b46792d227fe20e442c5 upstream]

    It causes problems, so needs to be reverted from 3.2-stable for now.

    Reported-by: Konrad Rzeszutek Wilk
    Cc: Jon Dufresne
    Cc: Suresh Siddha
    Cc:
    Cc: Josh Boyer
    Cc: Ingo Molnar
    Cc: Teck Choon Giam
    Cc: Ben Guthro
    Signed-off-by: Greg Kroah-Hartman

    Greg Kroah-Hartman
     
  • commit 883a649b737cdbe3ede7e50f3f939fd706ed5c4e upstream.

    This il->vif is dereferenced in different part of iwlegacy code, so do
    not nullify it. This should fix random crashes observed in companion
    with microcode errors i.e. crash in il3945_config_ap().

    Additionally this should address also
    WARNING: at drivers/net/wireless/iwlegacy/common.c:4656 il_mac_remove_interface
    at least one of the possible reasons of that warning.

    Signed-off-by: Stanislaw Gruszka
    Signed-off-by: John W. Linville
    Signed-off-by: Greg Kroah-Hartman

    Stanislaw Gruszka
     
  • commit df91e49477a9be15921cb2854e1d12a3bdb5e425 upstream.

    Userspace can pass in arbitrary combinations of MS_* flags to mount().

    If both MS_BIND and one of MS_SHARED/MS_PRIVATE/MS_SLAVE/MS_UNBINDABLE are
    passed, device name which should be checked for MS_BIND was not checked because
    MS_SHARED/MS_PRIVATE/MS_SLAVE/MS_UNBINDABLE had higher priority than MS_BIND.

    If both one of MS_BIND/MS_MOVE and MS_REMOUNT are passed, device name which
    should not be checked for MS_REMOUNT was checked because MS_BIND/MS_MOVE had
    higher priority than MS_REMOUNT.

    Fix these bugs by changing priority to MS_REMOUNT -> MS_BIND ->
    MS_SHARED/MS_PRIVATE/MS_SLAVE/MS_UNBINDABLE -> MS_MOVE as with do_mount() does.

    Also, unconditionally return -EINVAL if more than one of
    MS_SHARED/MS_PRIVATE/MS_SLAVE/MS_UNBINDABLE is passed so that TOMOYO will not
    generate inaccurate audit logs, for commit 7a2e8a8f "VFS: Sanity check mount
    flags passed to change_mnt_propagation()" clarified that these flags must be
    exclusively passed.

    Signed-off-by: Tetsuo Handa
    Signed-off-by: James Morris
    Signed-off-by: Greg Kroah-Hartman

    Tetsuo Handa
     
  • commit 83dbbdbb38666e20a75fad2294cf1df77c52f121 upstream.

    The task handoff notifier leaks task_struct since it never gets freed
    after the callback returns NOTIFY_OK, which means it is responsible for
    doing so.

    It turns out the lowmemorykiller actually doesn't need this notifier at
    all. It's used to prevent unnecessary killing by waiting for a thread
    to exit as a result of lowmem_shrink(), however, it's possible to do
    this in the same way the kernel oom killer works by setting TIF_MEMDIE
    and avoid killing if we're still waiting for it to exit.

    The kernel oom killer will already automatically set TIF_MEMDIE for
    threads that are attempting to allocate memory that have a fatal signal.
    The thread selected by lowmem_shrink() will have such a signal after the
    lowmemorykiller sends it a SIGKILL, so this won't result in an
    unnecessary use of memory reserves for the thread to exit.

    This has the added benefit that we don't have to rely on
    CONFIG_PROFILING to prevent needlessly killing tasks.

    Reported-by: Werner Landgraf
    Signed-off-by: David Rientjes
    Acked-by: Colin Cross
    Signed-off-by: Linus Torvalds
    Signed-off-by: Greg Kroah-Hartman

    David Rientjes
     
  • commit e536b62095301271d974983044a011c29fcb2ea2 upstream.

    Add __devinit annotation for da9052_spi_probe to fix below build warning:

    WARNING: drivers/built-in.o(.text+0x349b4): Section mismatch in reference from the function da9052_spi_probe() to the function .devinit.text:da9052_device_init()
    The function da9052_spi_probe() references
    the function __devinit da9052_device_init().
    This is often because da9052_spi_probe lacks a __devinit
    annotation or the annotation of da9052_device_init is wrong.

    Also add __devexit annotation for da9052_spi_remove because we have
    __devexit_p around it in the remove callback.

    Signed-off-by: Axel Lin
    Signed-off-by: Samuel Ortiz
    Signed-off-by: Greg Kroah-Hartman

    Axel Lin
     
  • commit 258f742635360175564e9470eb060ff4d4b984e7 upstream.

    Commit f02e8a6596b7 ("module: Sort exported symbols") sorts symbols
    placing each of them in its own elf section. This sorting and merging
    into the canonical sections are done by the linker.

    Unfortunately modpost to generate Module.symvers file parses vmlinux.o
    (which is not linked yet) and all modules object files (which aren't
    linked yet). These aren't sanitized by the linker yet. That breaks
    modpost that can't detect license properly for modules.

    This patch makes modpost aware of the new exported symbols structure.

    [ This above is a slightly corrected version of the explanation of the
    problem, copied from commit 62a2635610db ("modpost: Fix modpost's
    license checking V3"). That commit fixed the problem for module
    object files, but not for vmlinux.o. This patch fixes modpost for
    vmlinux.o. ]

    Signed-off-by: Frank Rowand
    Signed-off-by: Alessio Igor Bogani
    Signed-off-by: Linus Torvalds
    Signed-off-by: Greg Kroah-Hartman

    Frank Rowand
     
  • commit c04c1b9ee8f30c7a3a25e20e406247003f634ebe upstream.

    If there are no nodes in the cache, nodes will be 0, so calculating
    "registers / nodes" will cause division by zero.

    Signed-off-by: Stephen Warren
    Signed-off-by: Mark Brown
    Signed-off-by: Greg Kroah-Hartman

    Stephen Warren
     
  • commit 620f6e8e855d6d447688a5f67a4e176944a084e8 upstream.

    Commit bfdc0b4 adds code to restrict access to dmesg_restrict,
    however, it incorrectly alters kptr_restrict rather than
    dmesg_restrict.

    The original patch from Richard Weinberger
    (https://lkml.org/lkml/2011/3/14/362) alters dmesg_restrict as
    expected, and so the patch seems to have been misapplied.

    This adds the CAP_SYS_ADMIN check to both dmesg_restrict and
    kptr_restrict, since both are sensitive.

    Reported-by: Phillip Lougher
    Signed-off-by: Kees Cook
    Acked-by: Serge Hallyn
    Acked-by: Richard Weinberger
    Signed-off-by: James Morris
    Signed-off-by: Greg Kroah-Hartman

    Kees Cook
     
  • commit 06383f10c49f507220594a455c6491ca6f8c94ab upstream.

    Avoid freeing a registered tpg structure if an alloc_workqueue call
    fails. This fixes a bug where the failure was leaking memory associated
    with se_portal_group setup during the original core_tpg_register() call.

    Signed-off-by: Mark Rustad
    Acked-by: Kiran Patil
    Signed-off-by: Nicholas Bellinger
    Signed-off-by: Greg Kroah-Hartman

    Mark Rustad
     
  • commit e1c4038282c7586c3544542b37872c434669d3ac upstream.

    Add abort flag and use it to terminate processing when an exchange
    is timed out or is reset. The abort flag is used in place of the
    transport_generic_free_cmd function call in the reset and timeout
    cases, because calling that function in that context would free
    memory that was in use. The aborted flag allows the lifetime to
    be managed in a more normal way, while truncating the processing.

    This change eliminates a source of memory corruption which
    manifested in a variety of ugly ways.

    (nab: Drop unused struct fc_exch *ep in ft_recv_seq)

    Signed-off-by: Mark Rustad
    Acked-by: Kiran Patil
    Signed-off-by: Nicholas Bellinger
    Signed-off-by: Greg Kroah-Hartman

    Mark Rustad
     
  • commit 66292ad92c6d3f2f1c137a1c826b331ca8595dfd upstream.

    The HSMCI operates at a rate of up to Master Clock divided by two.
    Moreover previous calculation can cause overflows and so wrong
    timeouts.

    Signed-off-by: Ludovic Desroches
    Acked-by: Nicolas Ferre
    Signed-off-by: Chris Ball
    Signed-off-by: Greg Kroah-Hartman

    Ludovic Desroches
     
  • commit 8c2fc8e413ecc2c96b696e28d4eb1bc6cee8dc84 upstream.

    This patch fixes a compile error in drivers/mmc/host/sdhci-dove.c
    by including the linux/module.h file.

    Signed-off-by: Alf Høgemark
    Signed-off-by: Chris Ball
    Signed-off-by: Greg Kroah-Hartman

    Alf Høgemark
     
  • commit e841a7c69b708eeaf784fd517978006e8319b03a upstream.

    Neil Brown reports that commit 35cd133c

    PM: Run the driver callback directly if the subsystem one is not there

    breaks suspend for his libertas wifi, because SDIO has a protocol
    where the suspend method can return -ENOSYS and this means "There is
    no point in suspending, just turn me off". Moreover, the suspend
    methods provided by SDIO drivers are not supposed to be called by
    the PM core or bus-level suspend routines (which aren't presend for
    SDIO). Instead, when the SDIO core gets to suspend the device's
    ancestor, it calls the device driver's suspend function, catches the
    ENOSYS, and turns the device off.

    The commit above breaks the SDIO core's assumption that the device
    drivers' callbacks won't be executed if it doesn't provide any
    bus-level callbacks. If fact, however, this assumption has never
    been really satisfied, because device class or device type suspend
    might very well use the driver's callback even without that commit.

    The simplest way to address this problem is to make the SDIO core
    tell the PM core to ignore driver callbacks, for example by providing
    no-operation suspend/resume callbacks at the bus level for it,
    which is implemented by this change.

    Reported-and-tested-by: Neil Brown
    Signed-off-by: Rafael J. Wysocki
    Signed-off-by: Chris Ball
    Signed-off-by: Greg Kroah-Hartman

    Rafael J. Wysocki
     
  • commit cca0355a09b1bfe9f8985285199a346e13cacf39 upstream.

    Due to an error while handling vbus_pin_active_low in ohci-at91 driver,
    the specification of this property was not good in devices/board files.

    Signed-off-by: Nicolas Ferre
    Acked-by: Jean-Christophe PLAGNIOL-VILLARD
    Signed-off-by: Greg Kroah-Hartman

    Nicolas Ferre
     
  • commit 1e7caf8bcf1b49eae152ad7cf442775472dd587c upstream.

    The information is not properly taken into account
    for {get|set}_power() functions.

    Signed-off-by: Nicolas Ferre
    Acked-by: Jean-Christophe PLAGNIOL-VILLARD
    Acked-by: Alan Stern
    Signed-off-by: Greg Kroah-Hartman

    Nicolas Ferre
     
  • commit 66189be74ff5f9f3fd6444315b85be210d07cef2 upstream.

    We can deadlock if we have a write oplock and two processes
    use the same file handle. In this case the first process can't
    unlock its lock if the second process blocked on the lock in the
    same time.

    Fix it by using posix_lock_file rather than posix_lock_file_wait
    under cinode->lock_mutex. If we request a blocking lock and
    posix_lock_file indicates that there is another lock that prevents
    us, wait untill that lock is released and restart our call.

    Acked-by: Jeff Layton
    Signed-off-by: Pavel Shilovsky
    Signed-off-by: Steve French
    Signed-off-by: Greg Kroah-Hartman

    Pavel Shilovsky
     
  • commit 3751d3e85cf693e10e2c47c03c8caa65e171099b upstream.

    There has long been a limitation using software breakpoints with a
    kernel compiled with CONFIG_DEBUG_RODATA going back to 2.6.26. For
    this particular patch, it will apply cleanly and has been tested all
    the way back to 2.6.36.

    The kprobes code uses the text_poke() function which accommodates
    writing a breakpoint into a read-only page. The x86 kgdb code can
    solve the problem similarly by overriding the default breakpoint
    set/remove routines and using text_poke() directly.

    The x86 kgdb code will first attempt to use the traditional
    probe_kernel_write(), and next try using a the text_poke() function.
    The break point install method is tracked such that the correct break
    point removal routine will get called later on.

    Cc: x86@kernel.org
    Cc: Thomas Gleixner
    Cc: Ingo Molnar
    Cc: H. Peter Anvin
    Inspried-by: Masami Hiramatsu
    Signed-off-by: Jason Wessel
    Signed-off-by: Greg Kroah-Hartman

    Jason Wessel
     
  • commit 23bbd8e346f1ef3fc1219c79cea53d8d52b207d8 upstream.

    The do_fork and sys_open tests have never worked properly on anything
    other than a UP configuration with the kgdb test suite. This is
    because the test suite did not fully implement the behavior of a real
    debugger. A real debugger tracks the state of what thread it asked to
    single step and can correctly continue other threads of execution or
    conditionally stop while waiting for the original thread single step
    request to return.

    Below is a simple method to cause a fatal kernel oops with the kgdb
    test suite on a 2 processor ARM system:

    while [ 1 ] ; do ls > /dev/null 2> /dev/null; done&
    while [ 1 ] ; do ls > /dev/null 2> /dev/null; done&
    echo V1I1F100 > /sys/module/kgdbts/parameters/kgdbts

    Very soon after starting the test the kernel will start warning with
    messages like:

    kgdbts: BP mismatch c002487c expected c0024878
    ------------[ cut here ]------------
    WARNING: at drivers/misc/kgdbts.c:317 check_and_rewind_pc+0x9c/0xc4()
    [] (check_and_rewind_pc+0x9c/0xc4)
    [] (validate_simple_test+0x3c/0xc4)
    [] (run_simple_test+0x1e8/0x274)

    The kernel will eventually recovers, but the test suite has completely
    failed to test anything useful.

    This patch implements behavior similar to a real debugger that does
    not rely on hardware single stepping by using only software planted
    breakpoints.

    In order to mimic a real debugger, the kgdb test suite now tracks the
    most recent thread that was continued (cont_thread_id), with the
    intent to single step just this thread. When the response to the
    single step request stops in a different thread that hit the original
    break point that thread will now get continued, while the debugger
    waits for the thread with the single step pending. Here is a high
    level description of the sequence of events.

    cont_instead_of_sstep = 0;

    1) set breakpoint at do_fork
    2) continue
    3) Save the thread id where we stop to cont_thread_id
    4) Remove breakpoint at do_fork
    5) Reset the PC if needed depending on kernel exception type
    6) soft single step
    7) Check where we stopped
    if current thread != cont_thread_id {
    if (here for more than 2 times for the same thead) {
    ### must be a really busy system, start test again ###
    goto step 1
    }
    goto step 5
    } else {
    cont_instead_of_sstep = 0;
    }
    8) clean up and run test again if needed
    9) Clear out any threads that were waiting on a break point at the
    point in time the test is ended with get_cont_catch(). This
    happens sometimes because breakpoints are used in place of single
    stepping and some threads could have been in the debugger exception
    handling queue because breakpoints were hit concurrently on
    different CPUs. This also means we wait at least one second before
    unplumbing the debugger connection at the very end, so as respond
    to any debug threads waiting to be serviced.

    Signed-off-by: Jason Wessel
    Signed-off-by: Greg Kroah-Hartman

    Jason Wessel
     
  • commit 486c5987a00a89d56c2c04c506417ef8f823ca2e upstream.

    The do_fork and sys_open tests have never worked properly on anything
    other than a UP configuration with the kgdb test suite. This is
    because the test suite did not fully implement the behavior of a real
    debugger. A real debugger tracks the state of what thread it asked to
    single step and can correctly continue other threads of execution or
    conditionally stop while waiting for the original thread single step
    request to return.

    Below is a simple method to cause a fatal kernel oops with the kgdb
    test suite on a 4 processor x86 system:

    while [ 1 ] ; do ls > /dev/null 2> /dev/null; done&
    while [ 1 ] ; do ls > /dev/null 2> /dev/null; done&
    while [ 1 ] ; do ls > /dev/null 2> /dev/null; done&
    while [ 1 ] ; do ls > /dev/null 2> /dev/null; done&
    echo V1I1F1000 > /sys/module/kgdbts/parameters/kgdbts

    Very soon after starting the test the kernel will oops with a message like:

    kgdbts: BP mismatch 3b7da66480 expected ffffffff8106a590
    WARNING: at drivers/misc/kgdbts.c:303 check_and_rewind_pc+0xe0/0x100()
    Call Trace:
    [] check_and_rewind_pc+0xe0/0x100
    [] validate_simple_test+0x25/0xc0
    [] run_simple_test+0x107/0x2c0
    [] kgdbts_put_char+0x18/0x20

    The warn will turn to a hard kernel crash shortly after that because
    the pc will not get properly rewound to the right value after hitting
    a breakpoint leading to a hard lockup.

    This change is broken up into 2 pieces because archs that have hw
    single stepping (2.6.26 and up) need different changes than archs that
    do not have hw single stepping (3.0 and up). This change implements
    the correct behavior for an arch that supports hw single stepping.

    A minor defect was fixed where sys_open should be do_sys_open
    for the sys_open break point test. This solves the problem of running
    a 64 bit with a 32 bit user space. The sys_open() never gets called
    when using the 32 bit file system for the kgdb testsuite because the
    32 bit binaries invoke the compat_sys_open() call leading to the test
    never completing.

    In order to mimic a real debugger, the kgdb test suite now tracks the
    most recent thread that was continued (cont_thread_id), with the
    intent to single step just this thread. When the response to the
    single step request stops in a different thread that hit the original
    break point that thread will now get continued, while the debugger
    waits for the thread with the single step pending. Here is a high
    level description of the sequence of events.

    cont_instead_of_sstep = 0;

    1) set breakpoint at do_fork
    2) continue
    3) Save the thread id where we stop to cont_thread_id
    4) Remove breakpoint at do_fork
    5) Reset the PC if needed depending on kernel exception type
    6) if (cont_instead_of_sstep) { continue } else { single step }
    7) Check where we stopped
    if current thread != cont_thread_id {
    cont_instead_of_sstep = 1;
    goto step 5
    } else {
    cont_instead_of_sstep = 0;
    }
    8) clean up and run test again if needed

    Signed-off-by: Jason Wessel
    Signed-off-by: Greg Kroah-Hartman

    Jason Wessel
     
  • commit 456ca7ff24841bf2d2a2dfd690fe7d42ef70d932 upstream.

    On x86 the kgdb test suite will oops when the kernel is compiled with
    CONFIG_DEBUG_RODATA and you run the tests after boot time. This is
    regression has existed since 2.6.26 by commit: b33cb815 (kgdbts: Use
    HW breakpoints with CONFIG_DEBUG_RODATA).

    The test suite can use hw breakpoints for all the tests, but it has to
    execute the hardware breakpoint specific tests first in order to
    determine that the hw breakpoints actually work. Specifically the
    very first test causes an oops:

    # echo V1I1 > /sys/module/kgdbts/parameters/kgdbts
    kgdb: Registered I/O driver kgdbts.
    kgdbts:RUN plant and detach test

    Entering kdb (current=0xffff880017aa9320, pid 1078) on processor 0 due to Keyboard Entry
    [0]kdb> kgdbts: ERROR PUT: end of test buffer on 'plant_and_detach_test' line 1 expected OK got $E14#aa
    WARNING: at drivers/misc/kgdbts.c:730 run_simple_test+0x151/0x2c0()
    [...oops clipped...]

    This commit re-orders the running of the tests and puts the RODATA
    check into its own function so as to correctly avoid the kernel oops
    by detecting and using the hw breakpoints.

    Signed-off-by: Jason Wessel
    Signed-off-by: Greg Kroah-Hartman

    Jason Wessel
     
  • commit 98b54aa1a2241b59372468bd1e9c2d207bdba54b upstream.

    There is extra state information that needs to be exposed in the
    kgdb_bpt structure for tracking how a breakpoint was installed. The
    debug_core only uses the the probe_kernel_write() to install
    breakpoints, but this is not enough for all the archs. Some arch such
    as x86 need to use text_poke() in order to install a breakpoint into a
    read only page.

    Passing the kgdb_bpt structure to kgdb_arch_set_breakpoint() and
    kgdb_arch_remove_breakpoint() allows other archs to set the type
    variable which indicates how the breakpoint was installed.

    Signed-off-by: Jason Wessel
    Signed-off-by: Greg Kroah-Hartman

    Jason Wessel
     
  • commit 247bc03742545fec2f79939a3b9f738392a0f7b4 upstream.

    There is a race condition between the freezer and request_firmware()
    such that if request_firmware() is run on one CPU and
    freeze_processes() is run on another CPU and usermodehelper_disable()
    called by it succeeds to grab umhelper_sem for writing before
    usermodehelper_read_trylock() called from request_firmware()
    acquires it for reading, the request_firmware() will fail and
    trigger a WARN_ON() complaining that it was called at a wrong time.
    However, in fact, it wasn't called at a wrong time and
    freeze_processes() simply happened to be executed simultaneously.

    To avoid this race, at least in some cases, modify
    usermodehelper_read_trylock() so that it doesn't fail if the
    freezing of tasks has just started and hasn't been completed yet.
    Instead, during the freezing of tasks, it will try to freeze the
    task that has called it so that it can wait until user space is
    thawed without triggering the scary warning.

    For this purpose, change usermodehelper_disabled so that it can
    take three different values, UMH_ENABLED (0), UMH_FREEZING and
    UMH_DISABLED. The first one means that usermode helpers are
    enabled, the last one means "hard disable" (i.e. the system is not
    ready for usermode helpers to be used) and the second one
    is reserved for the freezer. Namely, when freeze_processes() is
    started, it sets usermodehelper_disabled to UMH_FREEZING which
    tells usermodehelper_read_trylock() that it shouldn't fail just
    yet and should call try_to_freeze() if woken up and cannot
    return immediately. This way all freezable tasks that happen
    to call request_firmware() right before freeze_processes() is
    started and lose the race for umhelper_sem with it will be
    frozen and will sleep until thaw_processes() unsets
    usermodehelper_disabled. [For the non-freezable callers of
    request_firmware() the race for umhelper_sem against
    freeze_processes() is unfortunately unavoidable.]

    Reported-by: Stephen Boyd
    Signed-off-by: Rafael J. Wysocki
    Acked-by: Greg Kroah-Hartman
    Signed-off-by: Greg Kroah-Hartman

    Rafael J. Wysocki
     
  • commit 1e73203cd1157a03facc41ffb54050f5b28e55bd upstream.

    The core suspend/hibernation code calls usermodehelper_disable() to
    avoid race conditions between the freezer and the starting of
    usermode helpers and each code path has to do that on its own.
    However, it is always called right before freeze_processes()
    and usermodehelper_enable() is always called right after
    thaw_processes(). For this reason, to avoid code duplication and
    to make the connection between usermodehelper_disable() and the
    freezer more visible, make freeze_processes() call it and remove the
    direct usermodehelper_disable() and usermodehelper_enable() calls
    from all suspend/hibernation code paths.

    Signed-off-by: Rafael J. Wysocki
    Acked-by: Greg Kroah-Hartman
    Signed-off-by: Greg Kroah-Hartman

    Rafael J. Wysocki
     
  • commit 7b5179ac14dbad945647ac9e76bbbf14ed9e0dbe upstream.

    There is no reason to call usermodehelper_disable() before creating
    memory bitmaps in hibernate() and software_resume(), so call it right
    before freeze_processes(), in accordance with the other suspend and
    hibernation code. Consequently, call usermodehelper_enable() right
    after the thawing of tasks rather than after freeing the memory
    bitmaps.

    Signed-off-by: Rafael J. Wysocki
    Acked-by: Greg Kroah-Hartman
    Signed-off-by: Greg Kroah-Hartman

    Rafael J. Wysocki
     
  • commit f2791d733a2f06997b573d1a3cfde21e6f529826 upstream.

    This patch (as1535) fixes a bug in the runtime PM core. When a
    runtime suspend attempt completes, whether successfully or not, the
    device's power.wait_queue is supposed to be signalled. But this
    doesn't happen in the failure pathway of rpm_suspend() when another
    autosuspend attempt is rescheduled. As a result, a task can get stuck
    indefinitely on the wait queue (I have seen this happen in testing).

    The patch fixes the problem by moving the wake_up_all() call up near
    the start of the failure code.

    Signed-off-by: Alan Stern
    Signed-off-by: Rafael J. Wysocki
    Signed-off-by: Greg Kroah-Hartman

    Alan Stern
     
  • commit 9b78c1da60b3c62ccdd1509f0902ad19ceaf776b upstream.

    If firmware is requested asynchronously, by calling
    request_firmware_nowait(), there is no reason to fail the request
    (and warn the user) when the system is (presumably temporarily)
    unready to handle it (because user space is not available yet or
    frozen). For this reason, introduce an alternative routine for
    read-locking umhelper_sem, usermodehelper_read_lock_wait(), that
    will wait for usermodehelper_disabled to be unset (possibly with
    a timeout) and make request_firmware_work_func() use it instead of
    usermodehelper_read_trylock().

    Accordingly, modify request_firmware() so that it uses
    usermodehelper_read_trylock() to acquire umhelper_sem and remove
    the code related to that lock from _request_firmware().

    Signed-off-by: Rafael J. Wysocki
    Acked-by: Greg Kroah-Hartman
    Signed-off-by: Greg Kroah-Hartman

    Rafael J. Wysocki