18 Oct, 2015

5 commits


15 Oct, 2015

3 commits

  • Prevent clean ext3 filesystems from mounting by default with the ext2
    driver (with no journal!) by putting ext4 ahead of ext2 in the default
    probe order. This will have the effect of mounting ext2 filesystems
    with ext4.ko by default, which is a safer failure than hoping the user
    notices that their journalled ext3 is now running without a journal!

    Users who require ext2.ko for ext2 can either disable ext4.ko or
    explicitly request ext2 via "mount -t ext2" or "rootfstype=ext2".

    Signed-off-by: Darrick J. Wong
    Signed-off-by: Theodore Ts'o

    Darrick J. Wong
     
  • Change the journal's checksum functions to gate on whether or not the
    crc32c driver is loaded, and gate the loading on the superblock bits.
    This prevents a journal crash if someone loads a journal in no-csum
    mode and then randomizes the superblock, thus flipping on the feature
    bits.

    Tested-By: Nikolay Borisov
    Reported-by: Nikolay Borisov
    Signed-off-by: Darrick J. Wong
    Signed-off-by: Theodore Ts'o

    Darrick J. Wong
     
  • If there is a error while copying data from userspace into the page
    cache during a write(2) system call, in data=journal mode, in
    ext4_journalled_write_end() were using page_zero_new_buffers() from
    fs/buffer.c. Unfortunately, this sets the buffer dirty flag, which is
    no good if journalling is enabled. This is a long-standing bug that
    goes back for years and years in ext3, but a combination of (a)
    data=journal not being very common, (b) in many case it only results
    in a warning message. and (c) only very rarely causes the kernel hang,
    means that we only really noticed this as a problem when commit
    998ef75ddb caused this failure to happen frequently enough to cause
    generic/208 to fail when run in data=journal mode.

    The fix is to have our own version of this function that doesn't call
    mark_dirty_buffer(), since we will end up calling
    ext4_handle_dirty_metadata() on the buffer head(s) in questions very
    shortly afterwards in ext4_journalled_write_end().

    Thanks to Dave Hansen and Linus Torvalds for helping to identify the
    root cause of the problem.

    Signed-off-by: Theodore Ts'o
    Reviewed-by: Jan Kara

    Theodore Ts'o
     

03 Oct, 2015

5 commits

  • Fix multiple bugs in ext4_encrypted_zeroout(), including one that
    could cause us to write an encrypted zero page to the wrong location
    on disk, potentially causing data and file system corruption.
    Fortunately, this tends to only show up in stress tests, but even with
    these fixes, we are seeing some test failures with generic/127 --- but
    these are now caused by data failures instead of metadata corruption.

    Since ext4_encrypted_zeroout() is only used for some optimizations to
    keep the extent tree from being too fragmented, and
    ext4_encrypted_zeroout() itself isn't all that optimized from a time
    or IOPS perspective, disable the extent tree optimization for
    encrypted inodes for now. This prevents the data corruption issues
    reported by generic/127 until we can figure out what's going wrong.

    Signed-off-by: Theodore Ts'o
    Cc: stable@vger.kernel.org

    Theodore Ts'o
     
  • Buggy (or hostile) userspace should not be able to cause the kernel to
    crash.

    Signed-off-by: Theodore Ts'o
    Cc: stable@vger.kernel.org

    Theodore Ts'o
     
  • Since ext4_page_crypto() doesn't need an encryption context (at least
    not any more), this allows us to simplify a number function signature
    and also allows us to avoid needing to allocate a context in
    ext4_block_write_begin(). It also means we no longer need a separate
    ext4_decrypt_one() function.

    Signed-off-by: Theodore Ts'o

    Theodore Ts'o
     
  • In cases where the file system block size is the same as the page
    size, and ext4_writepage() is asked to write out a page which is
    either has the unwritten bit set in the extent tree, or which does not
    yet have a block assigned due to delayed allocation, we can bail out
    early and, unlocking the page earlier and avoiding a round trip
    through ext4_bio_write_page() with the attendant calls to
    set_page_writeback() and redirty_page_for_writeback().

    Signed-off-by: Theodore Ts'o

    Theodore Ts'o
     
  • There are times when ext4_bio_write_page() is called even though we
    don't actually need to do any I/O. This happens when ext4_writepage()
    gets called by the jbd2 commit path when an inode needs to force its
    pages written out in order to provide data=ordered guarantees --- and
    a page is backed by an unwritten (e.g., uninitialized) block on disk,
    or if delayed allocation means the page's backing store hasn't been
    allocated yet. In that case, we need to skip the call to
    ext4_encrypt_page(), since in addition to wasting CPU, it leads to a
    bounce page and an ext4 crypto context getting leaked.

    Signed-off-by: Theodore Ts'o
    Cc: stable@vger.kernel.org

    Theodore Ts'o
     

24 Sep, 2015

3 commits


21 Sep, 2015

1 commit


20 Sep, 2015

9 commits

  • Pull ARM fixes from Russell King:
    "Three fixes and a resulting cleanup for -rc2:

    - Andre Przywara reported that he was seeing a warning with the new
    cast inside DMA_ERROR_CODE's definition, and fixed the incorrect
    use.

    - Doug Anderson noticed that kgdb causes a "scheduling while atomic"
    bug.

    - OMAP5 folk noticed that their Thumb-2 compiled X servers crashed
    when enabling support to cover ARMv6 CPUs due to a kernel bug
    leaking some conditional context into the signal handler"

    * 'fixes' of git://ftp.arm.linux.org.uk/~rmk/linux-arm:
    ARM: 8425/1: kgdb: Don't try to stop the machine when setting breakpoints
    ARM: 8437/1: dma-mapping: fix build warning with new DMA_ERROR_CODE definition
    ARM: get rid of needless #if in signal handling code
    ARM: fix Thumb2 signal handling when ARMv6 is enabled

    Linus Torvalds
     
  • …/git/shuah/linux-kselftest

    Pull kselftest fixes from Shuah Khan:
    "This update contains 7 fixes for problems ranging from build failurs
    to incorrect error reporting"

    * tag 'linux-kselftest-4.3-rc2' of git://git.kernel.org/pub/scm/linux/kernel/git/shuah/linux-kselftest:
    selftests: exec: revert to default emit rule
    selftests: change install command to rsync
    selftests: mqueue: simplify the Makefile
    selftests: mqueue: allow extra cflags
    selftests: rename jump label to static_keys
    selftests/seccomp: add support for s390
    seltests/zram: fix syntax error

    Linus Torvalds
     
  • Pull power management and ACPI updates from Rafael Wysocki:
    "Included are: a somewhat late devfreq update which however is mostly
    fixes and cleanups with one new thing only (the PPMUv2 support on
    Exynos5433), an ACPI cpufreq driver fixup and two ACPI core cleanups
    related to preprocessor directives.

    Specifics:

    - Fix a memory allocation size in the devfreq core (Xiaolong Ye).

    - Fix a mistake in the exynos-ppmu DT binding (Javier Martinez
    Canillas).

    - Add support for PPMUv2 ((Platform Performance Monitoring Unit
    version 2.0) on the Exynos5433 SoCs (Chanwoo Choi).

    - Fix a type casting bug in the Exynos PPMU code (MyungJoo Ham).

    - Assorted devfreq code cleanups and optimizations (Javi Merino,
    MyungJoo Ham, Viresh Kumar).

    - Fix up the ACPI cpufreq driver to use a more lightweight way to get
    to its private data in the ->get() callback (Rafael J Wysocki).

    - Fix a CONFIG_ prefix bug in one of the ACPI drivers and make the
    ACPI subsystem use IS_ENABLED() instead of #ifdefs in function
    bodies (Sudeep Holla)"

    * tag 'pm+acpi-4.3-rc2' of git://git.kernel.org/pub/scm/linux/kernel/git/rafael/linux-pm:
    cpufreq: acpi-cpufreq: Use cpufreq_cpu_get_raw() in ->get()
    ACPI: Eliminate CONFIG_.*{, _MODULE} #ifdef in favor of IS_ENABLED()
    ACPI: int340x_thermal: add missing CONFIG_ prefix
    PM / devfreq: Fix incorrect type issue.
    PM / devfreq: tegra: Update governor to use devfreq_update_stats()
    PM / devfreq: comments for get_dev_status usage updated
    PM / devfreq: drop comment about thermal setting max_freq
    PM / devfreq: cache the last call to get_dev_status()
    PM / devfreq: Drop unlikely before IS_ERR(_OR_NULL)
    PM / devfreq: exynos-ppmu: bit-wise operation bugfix.
    PM / devfreq: exynos-ppmu: Update documentation to support PPMUv2
    PM / devfreq: exynos-ppmu: Add the support of PPMUv2 for Exynos5433
    PM / devfreq: event: Remove incorrect property in exynos-ppmu DT binding

    Linus Torvalds
     
  • Pull clk fixes from Stephen Boyd:
    "A few driver fixes for tegra, rockchip, and st SoCs and a two-liner in
    the framework to avoid oops when get_parent ops return out of range
    values on tegra platforms"

    * tag 'clk-fixes-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/clk/linux:
    drivers: clk: st: Rename st_pll3200c32_407_c0_x into st_pll3200c32_cx_x
    clk: check for invalid parent index of orphans in __clk_init()
    clk: tegra: dfll: Properly protect OPP list
    clk: rockchip: add critical clock for rk3368

    Linus Torvalds
     
  • …git/j.anaszewski/linux-leds

    Pull LED fixes from Jacek Anaszewski:
    - fix module autoload for six OF platform drivers (aat1290, bcm6328,
    bcm6358, ktd2692, max77693, ns2)
    - aat1290: add missing static modifier
    - ipaq-micro: add missing LEDS_CLASS dependency
    - lp55xx: correct Kconfig dependecy for f/w user helper

    * tag 'led-fixes-for-v4.3-rc2' of git://git.kernel.org/pub/scm/linux/kernel/git/j.anaszewski/linux-leds:
    leds:lp55xx: Correct Kconfig dependency for f/w user helper
    leds: leds-ipaq-micro: Add LEDS_CLASS dependency
    leds: aat1290: add 'static' modifier to init_mm_current_scale
    leds: leds-ns2: Fix module autoload for OF platform driver
    leds: max77693: Fix module autoload for OF platform driver
    leds: ktd2692: Fix module autoload for OF platform driver
    leds: bcm6358: Fix module autoload for OF platform driver
    leds: bcm6328: Fix module autoload for OF platform driver
    leds: aat1290: Fix module autoload for OF platform driver

    Linus Torvalds
     
  • Pull rdma fixes from Doug Ledford:
    "The new hfi1 driver in staging/rdma has had a number of fixup patches
    since being added to the tree. This is the first batch of those fixes"

    * tag 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/dledford/rdma:
    IB/hfi: Properly set permissions for user device files
    IB/hfi1: mask vs shift confusion
    IB/hfi1: clean up some defines
    IB/hfi1: info leak in get_ctxt_info()
    IB/hfi1: fix a locking bug
    IB/hfi1: checking for NULL instead of IS_ERR
    IB/hfi1: fix sdma_descq_cnt parameter parsing
    IB/hfi1: fix copy_to/from_user() error handling
    IB/hfi1: fix pstateinfo from returning improperly byteswapped value

    Linus Torvalds
     
  • Pull libnvdimm fixes from Dan Williams:

    - a boot regression (since v4.2) fix for some ARM configurations from
    Tyler

    - regression (since v4.1) fixes for mkfs.xfs on a DAX enabled device
    from Jeff. These are tagged for -stable.

    - a pair of locking fixes from Axel that are hidden from lockdep since
    they involve device_lock(). The "btt" one is tagged for -stable, the
    other only applies to the new "pfn" mechanism in v4.3.

    - a fix for the pmem ->rw_page() path to use wmb_pmem() from Ross.

    * 'libnvdimm-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/nvdimm/nvdimm:
    mm: fix type cast in __pfn_to_phys()
    pmem: add proper fencing to pmem_rw_page()
    libnvdimm: pfn_devs: Fix locking in namespace_store
    libnvdimm: btt_devs: Fix locking in namespace_store
    blockdev: don't set S_DAX for misaligned partitions
    dax: fix O_DIRECT I/O to the last block of a blockdev

    Linus Torvalds
     
  • Pull block updates from Jens Axboe:
    "This is a bit bigger than it should be, but I could (did) not want to
    send it off last week due to both wanting extra testing, and expecting
    a fix for the bounce regression as well. In any case, this contains:

    - Fix for the blk-merge.c compilation warning on gcc 5.x from me.

    - A set of back/front SG gap merge fixes, from me and from Sagi.
    This ensures that we honor SG gapping for integrity payloads as
    well.

    - Two small fixes for null_blk from Matias, fixing a leak and a
    capacity propagation issue.

    - A blkcg fix from Tejun, fixing a NULL dereference.

    - A fast clone optimization from Ming, fixing a performance
    regression since the arbitrarily sized bio's were introduced.

    - Also from Ming, a regression fix for bouncing IOs"

    * 'for-linus' of git://git.kernel.dk/linux-block:
    block: fix bounce_end_io
    block: blk-merge: fast-clone bio when splitting rw bios
    block: blkg_destroy_all() should clear q->root_blkg and ->root_rl.blkg
    block: Copy a user iovec if it includes gaps
    block: Refuse adding appending a gapped integrity page to a bio
    block: Refuse request/bio merges with gaps in the integrity payload
    block: Check for gaps on front and back merges
    null_blk: fix wrong capacity when bs is not 512 bytes
    null_blk: fix memory leak on cleanup
    block: fix bogus compiler warnings in blk-merge.c

    Linus Torvalds
     
  • Commit 505a666ee3fc ("writeback: plug writeback in wb_writeback() and
    writeback_inodes_wb()") has us holding a plug during writeback_sb_inodes,
    which increases the merge rate when relatively contiguous small files
    are written by the filesystem. It helps both on flash and spindles.

    For an fs_mark workload creating 4K files in parallel across 8 drives,
    this commit improves performance ~9% more by unplugging before calling
    cond_resched(). cond_resched() doesn't trigger an implicit unplug, so
    explicitly getting the IO down to the device before scheduling reduces
    latencies for anyone waiting on clean pages.

    It also cuts down on how often we use kblockd to unplug, which means
    less work bouncing from one workqueue to another.

    Many more details about how we got here:

    https://lkml.org/lkml/2015/9/11/570

    Signed-off-by: Chris Mason
    Signed-off-by: Linus Torvalds

    Chris Mason
     

19 Sep, 2015

5 commits

  • The various definitions of __pfn_to_phys() have been consolidated to
    use a generic macro in include/asm-generic/memory_model.h. This hit
    mainline in the form of 012dcef3f058 "mm: move __phys_to_pfn and
    __pfn_to_phys to asm/generic/memory_model.h". When the generic macro
    was implemented the type cast to phys_addr_t was dropped which caused
    boot regressions on ARM platforms with more than 4GB of memory and
    LPAE enabled.

    It was suggested to use PFN_PHYS() defined in include/linux/pfn.h
    as provides the correct logic and avoids further duplication.

    Reported-by: kernelci.org bot
    Suggested-by: Dan Williams
    Signed-off-by: Tyler Baker
    Signed-off-by: Dan Williams

    Tyler Baker
     
  • * acpi-bus:
    ACPI: Eliminate CONFIG_.*{, _MODULE} #ifdef in favor of IS_ENABLED()
    ACPI: int340x_thermal: add missing CONFIG_ prefix

    Rafael J. Wysocki
     
  • * pm-cpufreq:
    cpufreq: acpi-cpufreq: Use cpufreq_cpu_get_raw() in ->get()

    * pm-devfreq:
    PM / devfreq: Fix incorrect type issue.
    PM / devfreq: tegra: Update governor to use devfreq_update_stats()
    PM / devfreq: comments for get_dev_status usage updated
    PM / devfreq: drop comment about thermal setting max_freq
    PM / devfreq: cache the last call to get_dev_status()
    PM / devfreq: Drop unlikely before IS_ERR(_OR_NULL)
    PM / devfreq: exynos-ppmu: bit-wise operation bugfix.
    PM / devfreq: exynos-ppmu: Update documentation to support PPMUv2
    PM / devfreq: exynos-ppmu: Add the support of PPMUv2 for Exynos5433
    PM / devfreq: event: Remove incorrect property in exynos-ppmu DT binding

    Rafael J. Wysocki
     
  • Pull virtio fixes and cleanups from Michael Tsirkin:
    "This fixes the virtio-test tool, and improves the error handling for
    virtio-ccw"

    * tag 'for_linus' of git://git.kernel.org/pub/scm/linux/kernel/git/mst/vhost:
    virtio/s390: handle failures of READ_VQ_CONF ccw
    tools/virtio: propagate V=X to kernel build
    vhost: move features to core
    tools/virtio: fix build after 4.2 changes

    Linus Torvalds
     
  • Pull KVM fixes from Paolo Bonzini:
    "Mostly stable material, a lot of ARM fixes"

    * tag 'for-linus' of git://git.kernel.org/pub/scm/virt/kvm/kvm: (22 commits)
    sched: access local runqueue directly in single_task_running
    arm/arm64: KVM: Remove 'config KVM_ARM_MAX_VCPUS'
    arm64: KVM: Remove all traces of the ThumbEE registers
    arm: KVM: Disable virtual timer even if the guest is not using it
    arm64: KVM: Disable virtual timer even if the guest is not using it
    arm/arm64: KVM: vgic: Check for !irqchip_in_kernel() when mapping resources
    KVM: s390: Replace incorrect atomic_or with atomic_andnot
    arm: KVM: Fix incorrect device to IPA mapping
    arm64: KVM: Fix user access for debug registers
    KVM: vmx: fix VPID is 0000H in non-root operation
    KVM: add halt_attempted_poll to VCPU stats
    kvm: fix zero length mmio searching
    kvm: fix double free for fast mmio eventfd
    kvm: factor out core eventfd assign/deassign logic
    kvm: don't try to register to KVM_FAST_MMIO_BUS for non mmio eventfd
    KVM: make the declaration of functions within 80 characters
    KVM: arm64: add workaround for Cortex-A57 erratum #852523
    KVM: fix polling for guest halt continued even if disable it
    arm/arm64: KVM: Fix PSCI affinity info return value for non valid cores
    arm64: KVM: set {v,}TCR_EL2 RES1 bits
    ...

    Linus Torvalds
     

18 Sep, 2015

9 commits