Eric Lee / linux-smarc-t335x-v3.2

21 Jun, 2010

1 commit

9e495db1a cfq: fix recursive call in cfq_blkiocg_update_completion_stats()

e98ef89b has a typo, causing cfq_blkiocg_update_completion_stats()
to call itself instead of blkiocg_update_completion_stats().
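
The shape of this bug can be sketched in plain C. This is a simplified userspace model, not the kernel code: the function names mirror the commit message, but the single-argument signature and the `completed` counter are assumptions made for illustration.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for the blkio group stats; not the kernel struct. */
struct blkio_group {
	uint64_t completed;
};

/* Models the accounting function provided by the blkio cgroup layer. */
static void blkiocg_update_completion_stats(struct blkio_group *blkg)
{
	assert(blkg != NULL);
	blkg->completed++;
}

/*
 * CFQ-side wrapper. With the typo, the body called
 * cfq_blkiocg_update_completion_stats() again, so the function
 * recursed into itself. The corrected body forwards to the
 * blkiocg_ function instead.
 */
static void cfq_blkiocg_update_completion_stats(struct blkio_group *blkg)
{
	blkiocg_update_completion_stats(blkg);
}
```

Calling the wrapper once on a zeroed group should leave `completed` at 1 in this model.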

Reported-by: Ingo Molnar
Signed-off-by: Jens Axboe

Jens Axboe
2010-06-21 15:10:55 +0800

19 Jun, 2010

1 commit

e98ef89b3 cfq-iosched: Fixed boot warning with BLK_CGROUP=y and CFQ_GROUP_IOSCHED=n

Hi Jens,

A few days back Ingo noticed a CFQ boot-time warning. This patch fixes it.
The issue here is that with CFQ_GROUP_IOSCHED=n, CFQ should not really
be making blkio stat related calls.
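
One common way to keep such calls out of a build is to compile the wrappers down to empty stubs when the option is off. The following is a minimal sketch under stated assumptions, not the patch itself: `CFQ_GROUP_IOSCHED_ENABLED` is a hypothetical stand-in for the Kconfig symbol, and the struct and one-argument signatures are simplified for illustration.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for the blkio group stats; not the kernel struct. */
struct blkio_group {
	uint64_t io_added;
};

/* Models the blkio accounting call that needs an initialized group. */
static inline void blkiocg_update_io_add_stats(struct blkio_group *blkg)
{
	assert(blkg != NULL);
	blkg->io_added++;
}

#ifdef CFQ_GROUP_IOSCHED_ENABLED	/* hypothetical stand-in for the config option */
/* Group scheduling enabled: forward to the blkio layer. */
static inline void cfq_blkiocg_update_io_add_stats(struct blkio_group *blkg)
{
	blkiocg_update_io_add_stats(blkg);
}
#else
/* Group scheduling disabled: the wrapper is an empty stub. */
static inline void cfq_blkiocg_update_io_add_stats(struct blkio_group *blkg)
{
	(void)blkg;
}
#endif
```

With the macro undefined, the wrapper never touches the group, so uninitialized state such as a spinlock inside it is never reached.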

> Hm, it's still not entirely fixed, as of 2.6.35-rc2-00131-g7908a9e. With some
> configs i get bad spinlock warnings during bootup:
>
> [ 28.968013] initcall net_olddevs_init+0x0/0x82 returned 0 after 93750 usecs
> [ 28.972003] calling b44_init+0x0/0x55 @ 1
> [ 28.976009] bus: 'pci': add driver b44
> [ 28.976374] sda:
> [ 28.978157] BUG: spinlock bad magic on CPU#1, async/0/117
> [ 28.980000] lock: 7e1c5bbc, .magic: 00000000, .owner: /-1, .owner_cpu: 0
> [ 28.980000] Pid: 117, comm: async/0 Not tainted 2.6.35-rc2-tip-01092-g010e7ef-dirty #8183
> [ 28.980000] Call Trace:
> [ 28.980000] [] ? printk+0x20/0x24
> [ 28.980000] [] spin_bug+0x7c/0x87
> [ 28.980000] [] do_raw_spin_lock+0x1e/0x123
> [ 28.980000] [] ? _raw_spin_lock_irqsave+0x12/0x20
> [ 28.980000] [] _raw_spin_lock_irqsave+0x1a/0x20
> [ 28.980000] [] blkiocg_update_io_add_stats+0x25/0xfb
> [ 28.980000] [] ? cfq_prio_tree_add+0xb1/0xc1
> [ 28.980000] [] cfq_insert_request+0x8c/0x425

Signed-off-by: Vivek Goyal
Signed-off-by: Jens Axboe

Vivek Goyal
2010-06-19 01:57:47 +0800

18 Jun, 2010

1 commit

c10b61f09 cfq: Don't allow queue merges for queues that have no process references

Hi,

A user reported a kernel bug when running a particular program that did
the following:

- created 32 threads
- each thread took a mutex, grabbed a global offset, added a buffer size
  to that offset, released the lock
- read from the given offset in the file
- created a new thread to do the same
- exited

The result is that cfq's close cooperator logic would trigger, as the
threads were issuing I/O within the mean seek distance of one another.
This workload managed to routinely trigger a use after free bug when
walking the list of merge candidates for a particular cfqq
(cfqq->new_cfqq). The logic used for merging queues looks like this:

static void cfq_setup_merge(struct cfq_queue *cfqq, struct cfq_queue *new_cfqq)
{
	int process_refs, new_process_refs;
	struct cfq_queue *__cfqq;

	/* Avoid a circular list and skip interim queue merges */
	while ((__cfqq = new_cfqq->new_cfqq)) {
		if (__cfqq == cfqq)
			return;
		new_cfqq = __cfqq;
	}

	process_refs = cfqq_process_refs(cfqq);
	/*
	 * If the process for the cfqq has gone away, there is no
	 * sense in merging the queues.
	 */
	if (process_refs == 0)
		return;

	/*
	 * Merge in the direction of the lesser amount of work.
	 */
	new_process_refs = cfqq_process_refs(new_cfqq);
	if (new_process_refs >= process_refs) {
		cfqq->new_cfqq = new_cfqq;
		atomic_add(process_refs, &new_cfqq->ref);
	} else {
		new_cfqq->new_cfqq = cfqq;
		atomic_add(new_process_refs, &cfqq->ref);
	}
}

When a merge candidate is found, we add the process references for the
queue with fewer references to the queue with more. The actual merging
of queues happens when a new request is issued for a given cfqq. In the
case of the test program, it only does a single pread call to read in
1MB, so the actual merge never happens.

Normally, this is fine, as when the queue exits, we simply drop the
references we took on the other cfqqs in the merge chain:

	/*
	 * If this queue was scheduled to merge with another queue, be
	 * sure to drop the reference taken on that queue (and others in
	 * the merge chain). See cfq_setup_merge and cfq_merge_cfqqs.
	 */
	__cfqq = cfqq->new_cfqq;
	while (__cfqq) {
		if (__cfqq == cfqq) {
			WARN(1, "cfqq->new_cfqq loop detected\n");
			break;
		}
		next = __cfqq->new_cfqq;
		cfq_put_queue(__cfqq);
		__cfqq = next;
	}

However, there is a hole in this logic. Consider the following (and
keep in mind that each I/O keeps a reference to the cfqq):

q1->new_cfqq = q2 // q2 now has 2 process references
q3->new_cfqq = q2 // q2 now has 3 process references

// the process associated with q2 exits
// q2 now has 2 process references

// queue 1 exits, drops its reference on q2
// q2 now has 1 process reference

// q3 exits, so has 0 process references, and hence drops its references
// to q2, which leaves q2 also with 0 process references

q4 comes along and wants to merge with q3

q3->new_cfqq still points at q2! We follow that link and end up at an
already freed cfqq.

So, the fix is to not follow a merge chain if the top-most queue does
not have a process reference, otherwise any queue in the chain could be
already freed. I also changed the logic to disallow merging with a
queue that does not have any process references. Previously, we did
this check for one of the merge candidates, but not the other. That
doesn't really make sense.
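
The two checks described above can be modelled in userspace C. This is a sketch under stated assumptions, not the attached patch: `struct queue`, `process_refs()` and `setup_merge()` are hypothetical simplifications of `struct cfq_queue`, `cfqq_process_refs()` and `cfq_setup_merge()`, plain integers replace the atomic counters, and the return value exists only so the outcome can be observed.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical, simplified model of a cfq_queue. */
struct queue {
	int refs;		/* total references (processes plus in-flight I/O) */
	int io_refs;		/* references held by in-flight I/O */
	struct queue *new_q;	/* merge target, like cfqq->new_cfqq */
};

/* References held by processes only. */
static int process_refs(const struct queue *q)
{
	return q->refs - q->io_refs;
}

/* Returns 1 if a merge was scheduled, 0 if it was refused. */
static int setup_merge(struct queue *q, struct queue *new_q)
{
	struct queue *next;
	int refs, new_refs;

	assert(q != NULL && new_q != NULL);

	/*
	 * Do not follow the chain from a queue with no process
	 * references: queues further along may already be freed.
	 */
	if (process_refs(new_q) == 0)
		return 0;

	/* Avoid a circular list and skip interim queue merges. */
	while ((next = new_q->new_q) != NULL) {
		if (next == q)
			return 0;
		new_q = next;
	}

	/* Refuse the merge if either candidate has no process references. */
	refs = process_refs(q);
	new_refs = process_refs(new_q);
	if (refs == 0 || new_refs == 0)
		return 0;

	/* Point the queue with fewer process references at the other one. */
	if (new_refs >= refs) {
		q->new_q = new_q;
		new_q->refs += refs;
	} else {
		new_q->new_q = q;
		q->refs += new_refs;
	}
	return 1;
}
```

In this model, a queue like q4 that asks to merge with a q3 holding no process references is turned away before q3's stale link is ever read.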

Without the attached patch, my system would BUG within a couple of
seconds of running the reproducer program. With the patch applied, my
system ran the program for over an hour without issues.

This addresses the following bugzilla:
https://bugzilla.kernel.org/show_bug.cgi?id=16217

Thanks a ton to Phil Carns for providing the bug report and an excellent
reproducer.

[ Note for stable: this applies to 2.6.32/33/34 ].

Signed-off-by: Jeff Moyer
Reported-by: Phil Carns
Cc: stable@kernel.org
Signed-off-by: Jens Axboe

Jeff Moyer
2010-06-18 02:17:35 +0800

17 Jun, 2010

1 commit

fbbf05569 block: fix DISCARD_BARRIER requests

Filesystems assume that DISCARD_BARRIER requests are full barriers, so that
they don't have to track in-progress discard operations when submitting new
I/O. But currently we only treat them as elevator barriers, which don't
actually do the necessary queue drains.

Also remove the unlikely around both the DISCARD and BARRIER requests -
they happen far too often for a static mispredict.

Signed-off-by: Christoph Hellwig
Signed-off-by: Jens Axboe

Christoph Hellwig
2010-06-17 16:10:53 +0800

15 Jun, 2010

1 commit

79600aadc cciss: set SCSI max cmd len to 16, as default is wrong

Signed-off-by: Stephen M. Cameron
Cc: Mike Miller
Signed-off-by: Andrew Morton
Signed-off-by: Jens Axboe

Stephen M. Cameron
2010-06-15 14:12:34 +0800

14 Jun, 2010

4 commits

552618d12 cpqarray: fix two more wrong section type

cpqarray_register_ctlr() and cpqarray_eisa_detect() also
need to be marked as __devinit.

Signed-off-by: Jens Axboe

Jens Axboe
2010-06-14 21:21:33 +0800
d4a3895f5 cpqarray: fix wrong __init type on pci probe function

It needs to be __devinit, not __init.

Signed-off-by: Jens Axboe

Jens Axboe
2010-06-14 18:55:09 +0800
575f55201 Merge branch 'for-jens' of git://git.drbd.org/linux-2.6-drbd into for-linus

Jens Axboe
2010-06-14 18:54:57 +0800
dc66c74de drbd: Fixed a race between disk-attach and unexpected state changes

This was a very hard-to-trigger race condition.

If we got a state packet from the peer after drbd_nl_disk() had
already changed the disk state to D_NEGOTIATING, but before
after_state_ch() was run by the worker, then receive_state()
might call drbd_sync_handshake(), which in turn crashed
when accessing p_uuid.

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg

Philipp Reisner
2010-06-14 18:19:41 +0800

12 Jun, 2010

27 commits

7e27d6e77 Linux 2.6.35-rc3

Linus Torvalds
2010-06-12 10:14:04 +0800
4cea8706c Merge git://git.kernel.org/pub/scm/linux/kernel/git/davem/net-2.6

* git://git.kernel.org/pub/scm/linux/kernel/git/davem/net-2.6:
wimax/i2400m: fix missing endian correction read in fw loader
net8139: fix a race at the end of NAPI
pktgen: Fix accuracy of inter-packet delay.
pkt_sched: gen_estimator: add a new lock
net: deliver skbs on inactive slaves to exact matches
ipv6: fix ICMP6_MIB_OUTERRORS
r8169: fix mdio_read and update mdio_write according to hw specs
gianfar: Revive the driver for eTSEC devices (disable timestamping)
caif: fix a couple range checks
phylib: Add support for the LXT973 phy.
net: Print num_rx_queues imbalance warning only when there are allocated queues

Linus Torvalds
2010-06-12 05:20:03 +0800
7ae1277a5 Merge branch 'pm-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/rafael/suspend-2.6

* 'pm-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/rafael/suspend-2.6:
PM / x86: Save/restore MISC_ENABLE register

Linus Torvalds
2010-06-12 05:19:45 +0800
b25b550bb Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/mason/btrfs-unstable

* 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/mason/btrfs-unstable:
Btrfs: The file argument for fsync() is never null
Btrfs: handle ERR_PTR from posix_acl_from_xattr()
Btrfs: avoid BUG when dropping root and reference in same transaction
Btrfs: prohibit a operation of changing acl's mask when noacl mount option used
Btrfs: should add a permission check for setfacl
Btrfs: btrfs_lookup_dir_item() can return ERR_PTR
Btrfs: btrfs_read_fs_root_no_name() returns ERR_PTRs
Btrfs: unwind after btrfs_start_transaction() errors
Btrfs: btrfs_iget() returns ERR_PTR
Btrfs: handle kzalloc() failure in open_ctree()
Btrfs: handle error returns from btrfs_lookup_dir_item()
Btrfs: Fix BUG_ON for fs converted from extN
Btrfs: Fix null dereference in relocation.c
Btrfs: fix remap_file_pages error
Btrfs: uninitialized data is check_path_shared()
Btrfs: fix fallocate regression
Btrfs: fix loop device on top of btrfs

Linus Torvalds
2010-06-12 05:18:47 +0800
eda054770 Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/jbarnes/pci-2.6

* 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/jbarnes/pci-2.6:
PCI: clear bridge resource range if BIOS assigned bad one
PCI: hotplug/cpqphp, fix NULL dereference
Revert "PCI: create function symlinks in /sys/bus/pci/slots/N/"
PCI: change resource collision messages from KERN_ERR to KERN_INFO

Linus Torvalds
2010-06-12 05:15:44 +0800
837c4ef13 PCI: clear bridge resource range if BIOS assigned bad one

Yannick found that video does not work with 2.6.34. The cause of this
bug was that the BIOS had assigned the wrong range to the PCI bridge
above the video device. Before 2.6.34 the kernel would have shrunk
the size of the bridge window, but since
d65245c PCI: don't shrink bridge resources
the kernel will avoid shrinking BIOS ranges.

So zero out the old range if we fail to claim it at boot time; this will
cause us to allocate a new range at startup, restoring the 2.6.34
behavior.

Fixes regression https://bugzilla.kernel.org/show_bug.cgi?id=16009.

Reported-by: Yannick
Acked-by: Bjorn Helgaas
Signed-off-by: Yinghai Lu
Signed-off-by: Jesse Barnes

Yinghai Lu
2010-06-12 04:24:51 +0800
a7ef7d1f5 PCI: hotplug/cpqphp, fix NULL dereference

There are devices out there which are PCI Hot-plug controllers with
Compaq PCI IDs, but are not bridges, and hence have pdev->subordinate
set to NULL. But cpqphp expects the pointer to be non-NULL.

Add a check to the probe function to avoid oopses like:
BUG: unable to handle kernel NULL pointer dereference at 00000050
IP: [] cpqhpc_probe+0x951/0x1120 [cpqphp]
*pdpt = 0000000033779001 *pde = 0000000000000000
...

The device here was:
00:0b.0 PCI Hot-plug controller [0804]: Compaq Computer Corporation PCI Hotplug Controller [0e11:a0f7] (rev 11)
Subsystem: Compaq Computer Corporation Device [0e11:a2f8]

Signed-off-by: Jiri Slaby
Cc: Greg KH
Signed-off-by: Jesse Barnes

Jiri Slaby
2010-06-12 04:10:21 +0800
3be434f02 Revert "PCI: create function symlinks in /sys/bus/pci/slots/N/"

This reverts commit 75568f8094eb0333e9c2109b23cbc8b82d318a3c.

Since they're just a convenience anyway, remove these symlinks since
they're causing duplicate filename errors in the wild.

Acked-by: Alex Chiang
Signed-off-by: Jesse Barnes

Jesse Barnes
2010-06-12 04:08:37 +0800
f6d440dae PCI: change resource collision messages from KERN_ERR to KERN_INFO

We can often deal with PCI resource issues by moving devices around. In
that case, there's no point in alarming the user with messages like these.
There are many bug reports where the message itself is the only problem,
e.g., https://bugs.launchpad.net/ubuntu/+source/linux/+bug/413419 .

Signed-off-by: Bjorn Helgaas
Signed-off-by: Jesse Barnes

Bjorn Helgaas
2010-06-12 04:08:14 +0800
6f902af40 Btrfs: The file argument for fsync() is never null

The "file" argument for fsync is never null so we can remove this check.

What drew my attention here is that 7ea8085910e: "drop unused dentry
argument to ->fsync" introduced an unconditional dereference at the
start of the function and that generated a smatch warning.

Signed-off-by: Dan Carpenter
Signed-off-by: Chris Mason

Dan Carpenter
2010-06-12 03:57:40 +0800
834e74759 Btrfs: handle ERR_PTR from posix_acl_from_xattr()

posix_acl_from_xattr() returns both ERR_PTRs and null, but it's OK to
pass null values to set_cached_acl()
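
The distinction between an error pointer and a null pointer can be sketched in userspace C. This is a minimal model of the kernel's ERR_PTR idiom under stated assumptions, not the btrfs change: the three helpers imitate the kernel macros of the same names, while `lookup()` and `use_lookup()` are hypothetical functions invented for the example.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <stdint.h>

#define MAX_ERRNO 4095

/* Userspace imitations of the kernel helpers of the same names. */
static inline void *ERR_PTR(long error)
{
	return (void *)(intptr_t)error;
}

static inline long PTR_ERR(const void *ptr)
{
	return (long)(intptr_t)ptr;
}

static inline int IS_ERR(const void *ptr)
{
	/* Error codes occupy the top MAX_ERRNO values of the address space. */
	return (uintptr_t)ptr >= (uintptr_t)-MAX_ERRNO;
}

static int table[1] = { 42 };

/* Hypothetical lookup: key 0 is found, key 1 is absent, anything else fails. */
static int *lookup(int key)
{
	if (key == 0)
		return &table[0];
	if (key == 1)
		return NULL;
	return ERR_PTR(-EINVAL);
}

/* Caller that treats "absent" (NULL) and "failed" (ERR_PTR) differently. */
static long use_lookup(int key, int *out)
{
	int *p = lookup(key);

	assert(out != NULL);
	if (IS_ERR(p))
		return PTR_ERR(p);	/* propagate the error code */
	*out = p ? *p : 0;		/* NULL is a valid "nothing there" result */
	return 0;
}
```

Note that a NULL check alone would let the error pointer through, and an IS_ERR() check alone would let NULL through, which is why callers of such functions need to consider both.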

Signed-off-by: Dan Carpenter
Signed-off-by: Chris Mason

Dan Carpenter
2010-06-12 03:57:39 +0800
15e700009 Btrfs: avoid BUG when dropping root and reference in same transaction

If btrfs_ioctl_snap_destroy() deletes a snapshot but finishes
with end_transaction(), the cleaner kthread may come in and
drop the root in the same transaction. If that's the case, the
root's refs still == 1 in the tree when btrfs_del_root() deletes
the item, because commit_fs_roots() hasn't updated it yet (that
happens during the commit).

This wasn't a problem before only because
btrfs_ioctl_snap_destroy() would commit the transaction before dropping
the dentry reference, so the dead root wouldn't get queued up until
after the fs root item was updated in the btree.

Since it is not an error to drop the root reference and the root in the
same transaction, just drop the BUG_ON() in btrfs_del_root().

Signed-off-by: Sage Weil
Signed-off-by: Chris Mason

Sage Weil
2010-06-12 03:57:39 +0800
731e3d1b4 Btrfs: prohibit a operation of changing acl's mask when noacl mount option used

When using the POSIX File System Test Suite (pjd-fstest) to test btrfs,
some setfacl cases failed when the noacl mount option was used.
I simplified the commands used in pjd-fstest, and the following steps
can reproduce it.
------------------------
# cd btrfs-part/
# mkdir aaa
# setfacl -m m::rw aaa
Signed-off-by: Chris Mason

Shi Weihua
2010-06-12 03:57:38 +0800
2f26afba4 Btrfs: should add a permission check for setfacl

On btrfs, do the following
------------------
# su user1
# cd btrfs-part/
# touch aaa
# getfacl aaa
# file: aaa
# owner: user1
# group: user1
user::rw-
group::rw-
other::r--
# su user2
# cd btrfs-part/
# setfacl -m u::rwx aaa
# getfacl aaa
# file: aaa
# owner: user1
# group: user1
user::rwx
Signed-off-by: Chris Mason

Shi Weihua
2010-06-12 03:57:37 +0800
cf1e99a4e Btrfs: btrfs_lookup_dir_item() can return ERR_PTR

btrfs_lookup_dir_item() can return either ERR_PTRs or null.

Signed-off-by: Dan Carpenter
Signed-off-by: Chris Mason

Dan Carpenter
2010-06-12 03:57:37 +0800
3140c9a34 Btrfs: btrfs_read_fs_root_no_name() returns ERR_PTRs

btrfs_read_fs_root_no_name() returns ERR_PTRs on error so I added a
check for that. It's not clear to me if it can also return NULL
pointers or not so I left the original NULL pointer check as is.

Signed-off-by: Dan Carpenter
Signed-off-by: Chris Mason

Dan Carpenter
2010-06-12 03:57:36 +0800
d327099a2 Btrfs: unwind after btrfs_start_transaction() errors

This was added by a22285a6a3: "Btrfs: Integrate metadata reservation
with start_transaction". If we goto out here then we skip all the
unwinding and there are locks still held etc.

Signed-off-by: Dan Carpenter
Signed-off-by: Chris Mason

Dan Carpenter
2010-06-12 03:57:35 +0800
4cbd1149f Btrfs: btrfs_iget() returns ERR_PTR

btrfs_iget() returns an ERR_PTR() on failure and not null.

Signed-off-by: Dan Carpenter
Signed-off-by: Chris Mason

Dan Carpenter
2010-06-12 03:57:35 +0800
676e4c863 Btrfs: handle kzalloc() failure in open_ctree()

Unwind and return -ENOMEM if the allocation fails here.

Signed-off-by: Dan Carpenter
Signed-off-by: Chris Mason

Dan Carpenter
2010-06-12 03:57:34 +0800
fb4f6f910 Btrfs: handle error returns from btrfs_lookup_dir_item()

If btrfs_lookup_dir_item() fails, we can just let the mount fail
with an error.

Signed-off-by: Dan Carpenter
Signed-off-by: Chris Mason

Dan Carpenter
2010-06-12 03:57:33 +0800
3bf84a5a8 Btrfs: Fix BUG_ON for fs converted from extN

Tree blocks can live in data block groups in a filesystem converted from
extN, so it's easy to trigger the BUG_ON.

Signed-off-by: Yan Zheng
Signed-off-by: Chris Mason

Yan, Zheng
2010-06-12 03:48:35 +0800
046f264f6 Btrfs: Fix null dereference in relocation.c

Fix a potential null dereference in relocation.c

Signed-off-by: Yan Zheng
Acked-by: Dan Carpenter
Signed-off-by: Chris Mason

Yan, Zheng
2010-06-12 03:48:34 +0800
e79aa8671 Merge branch 'wimax-2.6.35.y' of git://git.kernel.org/pub/scm/linux/kernel/git/inaky/wimax

David S. Miller
2010-06-12 03:38:23 +0800
a385a53e6 wimax/i2400m: fix missing endian correction read in fw loader

i2400m_fw_hdr_check() was accessing the hardware field
bcf_hdr->module_type (32-bit little endian) without converting it to
host byte order.
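
Kernel code normally performs this conversion with le32_to_cpu(). The effect can be sketched in portable userspace C; `le32_decode()` below is a hypothetical helper written for this example, not a driver or kernel function.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/*
 * Decode a 32-bit little-endian field from raw bytes. Building the
 * value from individual bytes gives the same result on little-endian
 * and big-endian hosts, which reading the field directly does not.
 */
static uint32_t le32_decode(const uint8_t *b)
{
	assert(b != NULL);
	return (uint32_t)b[0] |
	       ((uint32_t)b[1] << 8) |
	       ((uint32_t)b[2] << 16) |
	       ((uint32_t)b[3] << 24);
}
```

On a big-endian host, reading such a field without conversion would yield the bytes in reversed significance, so a comparison against an expected constant would fail even for a valid header.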

Reported-by: Данилин Михаил

Signed-off-by: Inaky Perez-Gonzalez

Inaky Perez-Gonzalez
2010-06-12 02:51:20 +0800
891a9894e Merge branch 'rc-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/mmarek/kbuild-2.6

* 'rc-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/mmarek/kbuild-2.6:
kbuild: Create output directory in Makefile.modbuiltin
kbuild: Generate modules.builtin in make modules

Linus Torvalds
2010-06-12 00:55:50 +0800
f1f6ea352 Merge branch 'urgent' of git://git.kernel.org/pub/scm/linux/kernel/git/brodo/pcmcia-2.6

* 'urgent' of git://git.kernel.org/pub/scm/linux/kernel/git/brodo/pcmcia-2.6:
pcmcia: avoid validate_cis failure on CIS override
pcmcia: dev_node removal bugfix
pcmcia: yenta_socket.c Remove extra #ifdef CONFIG_YENTA_TI
pcmcia: only keep saved I365_CSCINT flag if there is no PCI irq

Linus Torvalds
2010-06-12 00:55:21 +0800
63c70a0d7 Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/sage/ceph-client

* 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/sage/ceph-client:
ceph: try to send partial cap release on cap message on missing inode
ceph: release cap on import if we don't have the inode
ceph: fix misleading/incorrect debug message
ceph: fix atomic64_t initialization on ia64
ceph: fix lease revocation when seq doesn't match
ceph: fix f_namelen reported by statfs
ceph: fix memory leak in statfs
ceph: fix d_subdirs ordering problem

Linus Torvalds
2010-06-12 00:52:23 +0800

11 Jun, 2010

4 commits

058a457ef Btrfs: fix remap_file_pages error

When we use remap_file_pages() to remap a file, it always returns an
error. This is because btrfs didn't set VM_CAN_NONLINEAR for the vma.

Signed-off-by: Miao Xie
Signed-off-by: Chris Mason

Miao Xie
2010-06-11 23:46:12 +0800
0e4dcbef1 Btrfs: uninitialized data is check_path_shared()

refs can be used with uninitialized data if btrfs_lookup_extent_info()
fails on the first pass through the loop. In the original code, if that
happens, check_path_shared() probably returns 1; this patch makes it
always return 1 in that case, for safety.

Signed-off-by: Dan Carpenter
Signed-off-by: Chris Mason

Dan Carpenter
2010-06-11 23:46:12 +0800
836097797 Btrfs: fix fallocate regression

Seems that when btrfs_fallocate was converted to use the new ENOSPC stuff we
dropped passing the mode to the function that actually does the preallocation.
This breaks anybody who wants to use FALLOC_FL_KEEP_SIZE. Thanks,

Signed-off-by: Josef Bacik
Signed-off-by: Chris Mason

Josef Bacik
2010-06-11 23:46:12 +0800
4a001071d Btrfs: fix loop device on top of btrfs

We cannot use the loop device which has been connected to a file in the btrf

The steps to reproduce are as follows:
# dd if=/dev/zero of=vdev0 bs=1M count=1024
# losetup /dev/loop0 vdev0
# mkfs.btrfs /dev/loop0
...
failed to zero device start -5

The reason is that the btrfs don't implement either ->write_begin or ->write
the VFS API, so we fix it by setting ->write to do_sync_write().

Signed-off-by: Miao Xie
Signed-off-by: Chris Mason

Miao Xie
2010-06-11 23:46:11 +0800