Eric Lee / smarc-fsl-linux-kernel

12 Mar, 2011

4 commits

167400d34 blk-cgroup: Add unaccounted time to timeslice_used. ... Browse Code »

There are two kind of times that tasks are not charged for: the first
seek and the extra time slice used over the allocated timeslice. Both
of these exported as a new unaccounted_time stat.

I think it would be good to have this reported in 'time' as well, but
that is probably a separate discussion.

Signed-off-by: Justin TerAvest
Signed-off-by: Jens Axboe

Justin TerAvest
2011-03-12 23:54:00 +0800
1f940bdfc block: fixup plugging stubs for !CONFIG_BLOCK ... Browse Code »

They used an older prototype, fix it up.

Reported-by: Randy Dunlap
Signed-off-by: Jens Axboe

Jens Axboe
2011-03-12 03:17:08 +0800
eba2ed9c9 block: remove obsolete comments for blkdev_issue_zeroout. ... Browse Code »

barrier is already removed, so remove the obsolete comments
in blkdev_issue_zeroout.

Cc: Jens Axboe
Signed-off-by: Tao Ma
Signed-off-by: Jens Axboe

Tao Ma
2011-03-12 03:13:54 +0800
805f6b5e1 blktrace: Use rq->cmd_flags directly in blk_add_trace_rq. ... Browse Code »

In blk_add_trace_rq, we only chose the minor 2 bits from
request's cmd_flags and did some check for discard.
so most of other flags(e.g, REQ_SYNC) are missing.

For example, with a sync write after blkparse we get:
8,16 1 1 0.001776503 7509 A WS 1349632 + 1024 cmd_flags directly to __blk_add_trace.

With this patch, after a sync write we get:
8,16 1 1 0.001776900 5425 A WS 1189888 + 1024
Acked-by: Jeff Moyer
Signed-off-by: Jens Axboe

Tao Ma
2011-03-12 03:11:59 +0800

10 Mar, 2011

29 commits

4c63f5646 Merge branch 'for-2.6.39/stack-plug' into for-2.6.39/core ... Browse Code »

Conflicts:
block/blk-core.c
block/blk-flush.c
drivers/md/raid1.c
drivers/md/raid10.c
drivers/md/raid5.c
fs/nilfs2/btnode.c
fs/nilfs2/mdt.c

Signed-off-by: Jens Axboe

Jens Axboe
2011-03-10 15:58:35 +0800
69d60eb96 blk-throttle: Use blk_plug in throttle dispatch ... Browse Code »

Use plug in throttle dispatch also as we are dispatching a bunch of
bios in throttle context and some of them might merge.

Signed-off-by: Vivek Goyal
Signed-off-by: Jens Axboe

Vivek Goyal
2011-03-10 15:52:27 +0800
721a9602e block: kill off REQ_UNPLUG ... Browse Code »

With the plugging now being explicitly controlled by the
submitter, callers need not pass down unplugging hints
to the block layer. If they want to unplug, it's because they
manually plugged on their own - in which case, they should just
unplug at will.

Signed-off-by: Jens Axboe

Jens Axboe
2011-03-10 15:52:27 +0800
cf15900e1 aio: remove request submission batching ... Browse Code »

This should be useless now that we have on-stack plugging. So lets just
kill it.

Signed-off-by: Jens Axboe

Jens Axboe
2011-03-10 15:52:27 +0800
9f5b94254 fs: make aio plug ... Browse Code »

Signed-off-by: Shaohua Li
Signed-off-by: Jens Axboe

Shaohua Li
2011-03-10 15:52:27 +0800
2ed1a6bcf fs: make mpage read/write_pages() plug ... Browse Code »

Signed-off-by: Jens Axboe

Jens Axboe
2011-03-10 15:52:26 +0800
5b417b187 read-ahead: use plugging ... Browse Code »

Signed-off-by: Jens Axboe

Jens Axboe
2011-03-10 15:52:26 +0800
55602dd66 fs: make generic file read/write functions plug ... Browse Code »

Signed-off-by: Jens Axboe

Jens Axboe
2011-03-10 15:52:26 +0800
7eaceacca block: remove per-queue plugging ... Browse Code »
88

Code has been converted over to the new explicit on-stack plugging,
and delay users have been converted to use the new API for that.
So lets kill off the old plugging along with aops->sync_page().

Signed-off-by: Jens Axboe

Jens Axboe
2011-03-10 15:52:07 +0800
73c101011 block: initial patch for on-stack per-task plugging ... Browse Code »

This patch adds support for creating a queuing context outside
of the queue itself. This enables us to batch up pieces of IO
before grabbing the block device queue lock and submitting them to
the IO scheduler.

The context is created on the stack of the process and assigned in
the task structure, so that we can auto-unplug it if we hit a schedule
event.

The current queue plugging happens implicitly if IO is submitted to
an empty device, yet callers have to remember to unplug that IO when
they are going to wait for it. This is an ugly API and has caused bugs
in the past. Additionally, it requires hacks in the vm (->sync_page()
callback) to handle that logic. By switching to an explicit plugging
scheme we make the API a lot nicer and can get rid of the ->sync_page()
hack in the vm.

Signed-off-by: Jens Axboe

Jens Axboe
2011-03-10 15:45:54 +0800
a488e7497 scsi: convert to blk_delay_queue() ... Browse Code »

It was always abuse to reuse the plugging infrastructure for this,
convert it to the (new) real API for delaying queueing a bit. A
default delay of 3 msec is defined, to match the previous
behaviour.

Signed-off-by: Jens Axboe

Jens Axboe
2011-03-10 15:45:54 +0800
0a41e90bb ide-cd: convert to blk_delay_queue() for a short pause ... Browse Code »

It was always abuse to reuse the plugging infrastructure for this,
convert it to the (new) real API for delaying queueing a bit.

Signed-off-by: Jens Axboe
Acked-by: David S. Miller

Jens Axboe
2011-03-10 15:45:54 +0800
3cca6dc1c block: add API for delaying work/request_fn a little bit ... Browse Code »

Currently we use plugging for that, but as plugging is going away,
we need an alternative mechanism.

Signed-off-by: Jens Axboe

Jens Axboe
2011-03-10 15:45:54 +0800
cafb0bfca staging: Convert to bdops->check_events() ... Browse Code »

Convert two staging drivers - blkvsc_drv and cyasblkdev_block - from
->media_changed() to ->check_events(). The former always indicated
media changed while the latter always indicated media not changed.
Not sure what the drivers are trying to achieve but keep the original
behavior.

Signed-off-by: Tejun Heo
Acked-by: Greg Kroah-Hartman
Cc: Jens Axboe
Cc: Kay Sievers

Tejun Heo
2011-03-10 02:54:29 +0800
3c0d20609 pktcdvd: Convert to bdops->check_events() ... Browse Code »

Convert from ->media_changed() to ->check_events().

pktcdvd needs to forward all event related operations to the
underlying device. Forward ->check_events() instead of
->media_changed() and inherit disk->[async_]events.

Signed-off-by: Tejun Heo
Cc: Jens Axboe
Cc: Kay Sievers
Cc: Peter Osterlund

Tejun Heo
2011-03-10 02:54:28 +0800
6fac80e3a umem: Drop dummy ->media_changed() ... Browse Code »

umem doesn't implement media changed detection and there's no need to
implement dummy callback anymore. Remove it.

Signed-off-by: Tejun Heo
Cc: Jens Axboe
Cc: Kay Sievers

Tejun Heo
2011-03-10 02:54:28 +0800
ffe80cea3 s390/tape_block: Convert to bdops->check_events() ... Browse Code »

Convert from ->media_changed() to ->check_events().

s390/tape_block buffers media changed state and clears it on
revalidation. It will behave correctly with kernel event polling.

Signed-off-by: Tejun Heo
Cc: Jens Axboe
Cc: Kay Sievers
Cc: Martin Schwidefsky
Cc: Heiko Carstens

Tejun Heo
2011-03-10 02:54:28 +0800
f47350fde i2o_block: Convert to bdops->check_events() ... Browse Code »

Convert from ->media_changed() to ->check_events().

i2o_block buffers media changed state and clears it after reporting.
It will behave correctly with kernel event polling.

Signed-off-by: Tejun Heo
Cc: Jens Axboe
Cc: Kay Sievers
Cc: Markus Lidel

Tejun Heo
2011-03-10 02:54:28 +0800
3a200911a xsysace: Convert to bdops->check_events() ... Browse Code »

Convert from ->media_changed() to ->check_events().

xsysace buffers media changed state and clears it on revalidation. It
will behave correctly with kernel event polling.

Signed-off-by: Tejun Heo
Acked-by: Grant Likely
Cc: Jens Axboe
Cc: Kay Sievers

Tejun Heo
2011-03-10 02:54:28 +0800
aaa7c0154 ub: Convert to bdops->check_events() ... Browse Code »

Convert from ->media_changed() to ->check_events().

ub buffers media changed state and clears it on revalidation. It will
behave correctly with kernel event polling.

Signed-off-by: Tejun Heo
Cc: Jens Axboe
Cc: Kay Sievers
Cc: Pete Zaitcev

Tejun Heo
2011-03-10 02:54:28 +0800
4bbde7778 swim[3]: Convert to bdops->check_events() ... Browse Code »

Convert from ->media_changed() to ->check_events().

Both swim and swim3 buffer media changed state and clear it on
revalidation. They will behave correctly with kernel event polling.

Signed-off-by: Tejun Heo
Cc: Jens Axboe
Cc: Kay Sievers
Cc: Laurent Vivier
Cc: Benjamin Herrenschmidt

Tejun Heo
2011-03-10 02:54:28 +0800
507daea22 dac960: Convert to bdops->check_events() ... Browse Code »

Convert from ->media_changed() to ->check_events().

DAC960 media change notification seems to be one way (once set, never
cleared) and will generate spurious events when polled once the
condition triggers.

Signed-off-by: Tejun Heo
Cc: Jens Axboe
Cc: Kay Sievers

Tejun Heo
2011-03-10 02:54:28 +0800
b1b56b93f paride: Convert to bdops->check_events() ... Browse Code »

Convert paride drivers from ->media_changed() to ->check_events().

pcd and pd buffer and clear events after reporting; however, pf
unconditionally reports MEDIA_CHANGE and will generate spurious events
when polled.

Signed-off-by: Tejun Heo
Cc: Jens Axboe
Cc: Kay Sievers
Cc: Tim Waugh

Tejun Heo
2011-03-10 02:54:28 +0800
1c27030bd gdrom,viocd: Convert to bdops->check_events() ... Browse Code »

Convert gdrom and viocd from ->media_changed() to ->check_events().

It's unclear how the conditions are cleared and it's possible that it
may generate spurious events when polled.

Signed-off-by: Tejun Heo
Cc: Jens Axboe
Cc: Kay Sievers

Tejun Heo
2011-03-10 02:54:28 +0800
1a8a74f03 floppy,{ami|ata}flop: Convert to bdops->check_events() ... Browse Code »

Convert the floppy drivers from ->media_changed() to ->check_events().
Both floppy and ataflop buffer media changed state bit and clear them
on revalidation and will behave correctly with kernel event polling.

I can't tell how amiflop clears its event and it's possible that it
may generate spurious events when polled.

Signed-off-by: Tejun Heo
Cc: Jens Axboe
Cc: Kay Sievers

Tejun Heo
2011-03-10 02:54:27 +0800
5b03a1b14 ide: Convert to bdops->check_events() ... Browse Code »

Convert ->media_changed() to the new ->check_events() method. The
conversion is mostly mechanical. The only notable change is that
cdrom now doesn't generate any event if @slot_nr isn't CDSL_CURRENT.
It used to return -EINVAL which would be treated as media changed. As
media changer isn't supported anyway, this doesn't make any
difference.

This makes ide emit the standard disk events and allows kernel event
polling. Currently, only MEDIA_CHANGE event is implemented. Adding
support for EJECT_REQUEST shouldn't be difficult; however, given that
ide driver is already deprecated, it probably is best to leave it
alone.

Signed-off-by: Tejun Heo
Acked-by: Jens Axboe
Cc: Kay Sievers
Cc: "David S. Miller"
Cc: linux-ide@vger.kernel.org

Tejun Heo
2011-03-10 02:54:27 +0800
69e02c59a block: Don't check events while open is in progress ... Browse Code »

Not all block drivers clear events immediately after reporting. Some
do so in ->revalidate_disk() or other steps during ->open(). There is
a slim chance event poll may happen between the clearing event check
from check_disk_change() and the actual clearing of the events which
would result in spurious events.

Block event checks while block device open is in progress. There is
no need to kick explicit event check afterwards as events are always
checked during open.

-v2: The original patch could have called disk_unblock_events() with
an already released or %NULL @disk causing oops. Fixed by making
sure references are put after disk_unblock_events() is called.
It also makes the error path of __blkdev_get() a bit simpler.
This problem was reported by Jens.

Signed-off-by: Tejun Heo
Cc: Jens Axboe
Cc: Kay Sievers

Tejun Heo
2011-03-10 02:54:27 +0800
6936217cc block: Don't check events on close unless it was blocked ... Browse Code »

The block event mechanism currently always checks events when the
device is being closed regardless of the open mode. The intention was
to allow detection of EJECT_REQUEST when a device is closed whether
disk event polling is enabled or not.

This is unnecessary as, for devices of interest, events are checked
from either userland or kernel and in the former case ->check_events()
is performed on open of each poll attempt anyway. Furthermore, this
unconditional event check on close makes the code susceptible to event
loop if the block driver doesn't clear reported events correctly - an
event triggers userland to open and close the device which in turn
causes another event, rinse and repeat.

Check events on close only if it was blocked by excl write open.

Signed-off-by: Tejun Heo
Cc: Jens Axboe
Cc: Kay Sievers

Tejun Heo
2011-03-10 02:54:27 +0800
facc31ddc block: Don't implicitly trigger event check on disk_unblock_events() ... Browse Code »

Currently, disk_unblock_events() implicitly kick event check if the
block count reaches zero. This behavior is not described in the
comment and hinders with future changes. Make the unblocker
explicitly check events by calling disk_check_events() as necessary.

This patch doesn't cause any behavior difference.

Signed-off-by: Tejun Heo
Cc: Jens Axboe
Cc: Kay Sievers

Tejun Heo
2011-03-10 02:54:27 +0800

09 Mar, 2011

1 commit

df457f845 blk-cgroup: Lower minimum weight from 100 to 10. ... Browse Code »

We've found that we still get good, useful isolation at weights this
low. I'd like to adjust the minimum so that any other changes can take
these values into account.

Signed-off-by: Justin TerAvest
Acked-by: Vivek Goyal
Signed-off-by: Jens Axboe

Justin TerAvest
2011-03-09 02:45:00 +0800

08 Mar, 2011

3 commits

df6771402 block: biovec_slab vs. CONFIG_BLK_DEV_INTEGRITY ... Browse Code »

The block integrity subsystem no longer uses the bio_vec slabs so this
code can safely be compiled in.

Signed-off-by: Martin K. Petersen
Signed-off-by: Jens Axboe

Martin K. Petersen
2011-03-08 15:28:01 +0800
de701c74a blk-throttle: Some cleanups and race fixes in limit update code ... Browse Code »

When throttle group limits are updated through cgroups, a thread is
woken up to process these updates. While reviewing that code, oleg noted
couple of race conditions existed in the code and he also suggested that
code can be simplified.

This patch fixes the races simplifies the code based on Oleg's suggestions:

- Use xchg().
- Introduced a common function throtl_update_blkio_group_common()
which is shared now by all iops/bps update functions.

Reviewed-by: Oleg Nesterov
Reviewed-by: Paul E. McKenney
Signed-off-by: Vivek Goyal

Fixed a merge issue, throtl_schedule_delayed_work() takes throtl_data
as the argument now, not the queue.

Signed-off-by: Jens Axboe

Vivek Goyal
2011-03-08 04:09:32 +0800
231d704b4 blk-throttle: process limit change only through one function ... Browse Code »

With the help of cgroup interface one can go and upate the bps/iops
limits of existing group. Once the limits are udpated, a thread is
woken up to see if some blocked group needs recalculation based on new
limits and needs to be requeued.

There was also a piece of code where I was checking for group limit
update when a fresh bio comes in. This patch gets rid of that piece of
code and keeps processing the limit change at one place
throtl_process_limit_change(). It just keeps the code simple and easy
to understand.

Signed-off-by: Vivek Goyal
Signed-off-by: Jens Axboe

Vivek Goyal
2011-03-08 04:05:14 +0800

07 Mar, 2011

3 commits

b873c5d69 Merge branch 'block-for-2.6.39-core' of ssh://master.kernel.org/pub/scm/linux/ke… ... Browse Code »

…rnel/git/tj/misc into for-2.6.39/core

Jens Axboe
2011-03-07 16:40:21 +0800
a60327107 cfq-iosched: Fix update_vdisktime logic ... Browse Code »

The update_vdisktime logic is broken since commit
b54ce60eb7f61f8e314b8b241b0469eda3bb1d42, st->min_vdisktime never makes
a progress. Fix it.

Thanks Vivek for pointing it out.

Signed-off-by: Gui Jianfeng
Acked-by: Vivek Goyal
Signed-off-by: Jens Axboe

Gui Jianfeng
2011-03-07 16:28:09 +0800
ef8a41df8 cfq-iosched: give busy sync queue no dispatch limit ... Browse Code »

If there are a sync and an async queue and the sync queue's think time
is small, we can ignore the sync queue's dispatch quantum. Because the
sync queue will always preempt the async queue, we don't need to care
about async's latency. This can fix a performance regression of
aiostress test, which is introduced by commit f8ae6e3eb825. The issue
should exist even without the commit, but the commit amplifies the
impact.

The initial post does the same optimization for RT queue too, but since
I have no real workload for it, Vivek suggests to drop it.

Signed-off-by: Shaohua Li
Reviewed-by: Gui Jianfeng
Signed-off-by: Jens Axboe

Shaohua Li
2011-03-07 16:26:29 +0800