24 Jul, 2007

1 commit

  • Some of the code has been gradually transitioned to using the proper
    struct request_queue, but there's lots left. So do a full sweep of
    the kernel and get rid of this typedef and replace its uses with
    the proper type.
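
    For illustration (editor's sketch, not taken from the patch itself; the
    function name is made up), a typical site changes like this:

    /* before: the old typedef from <linux/blkdev.h> */
    typedef struct request_queue request_queue_t;
    static void example_request_fn(request_queue_t *q);

    /* after: the typedef is gone, the struct type is used directly */
    static void example_request_fn(struct request_queue *q);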

    Signed-off-by: Jens Axboe

    Jens Axboe
     

20 Jul, 2007

1 commit

  • Slab destructors were no longer supported after Christoph's
    c59def9f222d44bb7e2f0a559f2906191a0862d7 change. They've been
    BUGs for both slab and slub, and slob never supported them
    either.

    This rips out support for the dtor pointer from kmem_cache_create()
    completely and fixes up every single callsite in the kernel (there were
    about 224, not including the slab allocator definitions themselves,
    or the documentation references).
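
    A typical callsite conversion looks like this (illustrative sketch; the
    cache and ctor names are made up):

    /* before: trailing dtor argument, which was almost always NULL */
    foo_cache = kmem_cache_create("foo", sizeof(struct foo), 0,
                                  SLAB_HWCACHE_ALIGN, foo_ctor, NULL);

    /* after: the dtor parameter is gone from kmem_cache_create() */
    foo_cache = kmem_cache_create("foo", sizeof(struct foo), 0,
                                  SLAB_HWCACHE_ALIGN, foo_ctor);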

    Signed-off-by: Paul Mundt

    Paul Mundt
     

10 Jul, 2007

1 commit


08 May, 2007

1 commit

  • This patch provides a new macro

    KMEM_CACHE(<struct>, <flags>)

    to simplify slab creation. KMEM_CACHE creates a slab with the name of the
    struct, with the size of the struct and with the alignment of the struct.
    Additional slab flags may be specified if necessary.

    Example

    struct test_slab {
            int a, b, c;
            struct list_head list;
    } __cacheline_aligned_in_smp;

    test_slab_cache = KMEM_CACHE(test_slab, SLAB_PANIC);

    will create a new slab named "test_slab" of size sizeof(struct test_slab),
    aligned to the alignment of struct test_slab. If creation fails, the kernel
    panics.
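
    For reference, the macro itself expands to a kmem_cache_create() call of
    roughly this shape (sketch; at this point ctor and dtor are passed as NULL):

    #define KMEM_CACHE(__struct, __flags) kmem_cache_create(#__struct, \
            sizeof(struct __struct), __alignof__(struct __struct), \
            (__flags), NULL, NULL)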

    Signed-off-by: Christoph Lameter
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Christoph Lameter
     

30 Apr, 2007

1 commit

  • Currently we scale the mempool sizes depending on memory installed
    in the machine, except for the bio pool itself which sits at a fixed
    256 entry pre-allocation.

    There's really no point in "optimizing" this OOM path; we just need
    enough preallocated entries to make progress. A single unit is enough;
    let's scale it down to 2 just to be on the safe side.

    This patch saves ~150kb of pinned kernel memory on a 32-bit box.
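
    The change itself essentially boils down to shrinking the preallocation
    constant in fs/bio.c (sketch):

    #define BIO_POOL_SIZE 2    /* was 256 */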

    Signed-off-by: Jens Axboe

    Jens Axboe
     

14 Dec, 2006

1 commit

  • Implement a block device specific .direct_IO method instead of going
    through the generic direct_io_worker for block devices.

    direct_io_worker() is fairly complex because it needs to handle O_DIRECT on
    filesystems, where it must perform block allocation, hole detection,
    extending the file on write, and tons of other corner cases. The end result
    is that it takes tons of CPU time to submit an I/O.

    For a block device, block allocation is much simpler, and a tight triple
    loop can be written to iterate over each iovec and each page within the
    iovec in order to construct/prepare a bio structure and then submit it to
    the block layer. This significantly speeds up O_DIRECT on block devices.
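
    Roughly, the loop has this shape (editor's sketch with a made-up
    get_user_page_at() helper and no error handling; not the actual
    fs/block_dev.c code):

    for (seg = 0; seg < nr_segs; seg++) {                 /* each iovec */
            unsigned long addr = (unsigned long)iov[seg].iov_base;
            size_t len = iov[seg].iov_len;

            while (len) {                                 /* each page in the iovec */
                    unsigned int off = offset_in_page(addr);
                    unsigned int bytes = min_t(size_t, len, PAGE_SIZE - off);
                    struct page *page = get_user_page_at(addr);

                    if (bio_add_page(bio, page, bytes, off) < bytes) {
                            submit_bio(rw, bio);          /* bio full, send it down */
                            bio = bio_alloc(GFP_KERNEL, nr_pages);
                            bio_add_page(bio, page, bytes, off);
                    }
                    addr += bytes;
                    len -= bytes;
            }
    }
    submit_bio(rw, bio);                                  /* submit the final bio */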

    [akpm@osdl.org: small speedup]
    Signed-off-by: Ken Chen
    Cc: Christoph Hellwig
    Cc: Zach Brown
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Chen, Kenneth W
     

08 Dec, 2006

1 commit

  • Replace all uses of kmem_cache_t with struct kmem_cache.

    The patch was generated using the following script:

    #!/bin/sh
    #
    # Replace one string by another in all the kernel sources.
    #

    set -e

    for file in `find * -name "*.c" -o -name "*.h" | xargs grep -l $1`; do
            quilt add $file
            sed -e "1,\$s/$1/$2/g" $file >/tmp/$$
            mv /tmp/$$ $file
            quilt refresh
    done

    The script was run like this

    sh replace kmem_cache_t "struct kmem_cache"
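
    The net effect on a typical declaration (illustrative):

    kmem_cache_t *foo_cache;           /* before */
    struct kmem_cache *foo_cache;      /* after  */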

    Signed-off-by: Christoph Lameter
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Christoph Lameter
     

05 Dec, 2006

1 commit


01 Dec, 2006

2 commits

  • This patch modifies blk_rq_map/unmap_user() and the cdrom and scsi_ioctl.c
    users so that they support requests larger than a single bio by chaining
    bios together.

    Signed-off-by: Mike Christie
    Signed-off-by: Jens Axboe

    Mike Christie
     
  • The target mode support is mapping in bios using bio_map_user. The
    current targets do not need their len to be aligned with a queue limit
    so this check is causing some problems. Note: pointers passed into the
    kernel are properly aligned by userspace tgt code, so the uaddr check
    in bio_map_user is ok.

    The major user, blk_rq_map_user, checks the len before mapping,
    so it is not affected by this patch.

    And the semi-newly added user blk_rq_map_user_iov has been failing
    out when the len is not aligned properly, so either people have been
    good and not sending misaligned lens, or that path is not used very
    often and this change will not be very dangerous. st and sg do not
    check the length, and we have not seen any problem reports from those
    more widely used paths, so this patch should be fairly safe - for -mm
    and wider testing at least.

    Signed-off-by: Mike Christie
    Signed-off-by: FUJITA Tomonori
    Signed-off-by: James Bottomley
    Signed-off-by: Jens Axboe

    Mike Christie
     

22 Nov, 2006

1 commit

  • Pass the work_struct pointer to the work function rather than context data.
    The work function can use container_of() to work out the data.

    For the cases where the container of the work_struct may go away the moment the
    pending bit is cleared, it is made possible to defer the release of the
    structure by deferring the clearing of the pending bit.

    To make this work, an extra flag is introduced into the management side of the
    work_struct. This governs auto-release of the structure upon execution.

    Ordinarily, the work queue executor would release the work_struct for further
    scheduling or deallocation by clearing the pending bit prior to jumping to the
    work function. This means that, unless the driver makes some guarantee itself
    that the work_struct won't go away, the work function may not access anything
    else in the work_struct or its container lest they be deallocated. This is a
    problem if the auxiliary data is taken away (as done by the last patch).

    However, if the pending bit is *not* cleared before jumping to the work
    function, then the work function *may* access the work_struct and its container
    with no problems. But then the work function must itself release the
    work_struct by calling work_release().

    In most cases, automatic release is fine, so this is the default. Special
    initiators exist for the non-auto-release case (ending in _NAR).
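
    As an illustration of the new convention (sketch; the struct and function
    names are made up), a work function now receives the work_struct pointer
    and recovers its container:

    struct my_device {
            int foo;
            struct work_struct work;
    };

    static void my_work_fn(struct work_struct *work)
    {
            struct my_device *dev = container_of(work, struct my_device, work);
            /* use dev; no separate context pointer is passed any more */
    }

    /* elsewhere: INIT_WORK(&dev->work, my_work_fn); */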

    Signed-Off-By: David Howells

    David Howells
     

12 Oct, 2006

1 commit

  • - Calculate a variable in bvec_alloc_bs() only once it is needed, not
    earlier (bio.o down from 18408 to 18376 bytes, 32 bytes saved, probably
    due to data locality improvements).

    - Init variable idx to silence a gcc warning which already existed in the
    unmodified original base file (bvec_alloc_bs() handles idx correctly, so
    there's no need for the warning):

    fs/bio.c: In function `bio_alloc_bioset':
    fs/bio.c:169: warning: `idx' may be used uninitialized in this function

    Signed-off-by: Andreas Mohr
    Acked-by: Jens Axboe
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Andreas Mohr
     

01 Oct, 2006

2 commits


18 Jun, 2006

1 commit


24 May, 2006

1 commit


27 Mar, 2006

3 commits

  • Modify well over a dozen mempool users to call mempool_create_slab_pool()
    rather than calling mempool_create() with extra arguments, saving about 30
    lines of code and increasing readability.
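
    A typical conversion (illustrative; the pool size and cache name are made
    up):

    /* before */
    pool = mempool_create(16, mempool_alloc_slab, mempool_free_slab, foo_cachep);

    /* after */
    pool = mempool_create_slab_pool(16, foo_cachep);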

    Signed-off-by: Matthew Dobson
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Matthew Dobson
     
  • This patch changes several mempool users, all of which are basically just
    wrappers around kmalloc(), to use the common mempool_kmalloc/kfree, rather
    than their own wrapper function, removing a bunch of duplicated code.
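
    Likewise for a kmalloc-backed pool (illustrative; the wrapper names are
    made up):

    /* before: driver-private wrappers around kmalloc()/kfree() */
    pool = mempool_create(16, my_mempool_alloc, my_mempool_free, (void *)size);

    /* after: common helpers */
    pool = mempool_create_kmalloc_pool(16, size);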

    Signed-off-by: Matthew Dobson
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Matthew Dobson
     
  • While hunting with oprofile on an SMP platform, I discovered that dentry
    lookups were slowed down because d_hash_mask, d_hash_shift and
    dentry_hashtable were in a cache line that contained inodes_stat. So each
    time inodes_stat is changed by a cpu, other cpus have to refill their
    cache line.

    This patch moves some variables to the __read_mostly section, in order to
    avoid false sharing. RCU dentry lookups can go full speed.
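
    The annotation is simply added to the declarations, e.g. (roughly, from
    fs/dcache.c):

    static unsigned int d_hash_mask __read_mostly;
    static unsigned int d_hash_shift __read_mostly;
    static struct hlist_head *dentry_hashtable __read_mostly;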

    Signed-off-by: Eric Dumazet
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Eric Dumazet
     

26 Mar, 2006

1 commit


24 Mar, 2006

1 commit


23 Mar, 2006

1 commit

  • The biovec default mempool limit of 256 entries results in over 3MB of RAM
    being permanently pinned, even on systems with only 128MB of RAM. Since
    mempool tries to allocate from the system pool first, it makes sense to
    reduce the size of the mempool fallbacks to a more reasonable limit of 1-5
    entries -- enough for the system to be able to make progress even under
    load.

    Signed-off-by: Benjamin LaHaise
    Acked-by: Jens Axboe
    Cc: Matt Mackall
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Benjamin LaHaise
     

31 Jan, 2006

1 commit


15 Jan, 2006

1 commit


09 Jan, 2006

1 commit


06 Jan, 2006

1 commit


16 Dec, 2005

1 commit

  • - export __blk_put_request and blk_execute_rq_nowait,
    needed for async REQ_BLOCK_PC requests
    - separate max_hw_sectors and max_sectors for block/scsi_ioctl.c and
    SG_IO bio.c helpers per Jens's last comments. Since block/scsi_ioctl.c
    SG_IO was already testing against max_sectors and SCSI-ml was setting
    max_sectors and max_hw_sectors to the same value, this does not change any
    SCSI SG_IO behavior. It only prepares ll_rw_blk.c, scsi_ioctl.c and bio.c
    for when SCSI-ml begins to set a valid max_hw_sectors for all LLDs. Today,
    if an LLD does not set it, SCSI-ml sets it to a safe default, and some
    LLDs set it to an artificially low value to overcome memory and feedback
    issues.

    Note: Since we now cap max_sectors to BLK_DEF_MAX_SECTORS, which is 1024,
    drivers that used to call blk_queue_max_sectors with a large value of
    max_sectors will now see the fs requests capped to BLK_DEF_MAX_SECTORS.
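
    Sketch of the resulting split (not the literal ll_rw_blk.c code):

    q->max_hw_sectors = max_sectors;
    q->max_sectors = min_t(unsigned int, max_sectors, BLK_DEF_MAX_SECTORS);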

    Signed-off-by: Mike Christie
    Signed-off-by: James Bottomley

    Mike Christie
     

15 Dec, 2005

1 commit

  • Add scsi helpers to create really-large-requests and convert
    scsi-ml to scsi_execute_async().

    Per Jens's previous comments, I placed this function in scsi_lib.c.
    I made it follow all the queue's limits - I think I did at least :), so
    I removed the warning on the function header.

    I think the scsi_execute_* functions should eventually take a request_queue
    and be placed some place where the dm-multipath hw_handler can use them
    if that failover code is going to stay in the kernel. That conversion
    patch will be sent in another mail though.

    Signed-off-by: Mike Christie
    Signed-off-by: James Bottomley

    Mike Christie
     

28 Oct, 2005

1 commit

  • - ->releasepage() annotated (s/int/gfp_t), instances updated
    - missing gfp_t in fs/* added
    - fixed misannotation from the original sweep caught by bitwise checks:
    XFS used __nocast both for gfp_t and for flags used by XFS allocator.
    The latter left with unsigned int __nocast; we might want to add a
    different type for those but for now let's leave them alone. That,
    BTW, is a case when __nocast use had been actively confusing - it had
    been used in the same code for two different and similar types, with
    no way to catch misuses. Switch of gfp_t to bitwise had caught that
    immediately...

    One tricky bit is left alone to be dealt with later - mapping->flags is
    a mix of gfp_t and error indications. Left alone for now.
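
    The ->releasepage() annotation, for example, changes the method's prototype
    in struct address_space_operations from

    int (*releasepage) (struct page *, int);

    to

    int (*releasepage) (struct page *, gfp_t);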

    Signed-off-by: Al Viro
    Signed-off-by: Linus Torvalds

    Al Viro
     

09 Oct, 2005

1 commit

  • - added typedef unsigned int __nocast gfp_t;

    - replaced __nocast uses for gfp flags with gfp_t - it gives exactly
    the same warnings as far as sparse is concerned, doesn't change
    generated code (from gcc point of view we replaced unsigned int with
    typedef) and documents what's going on far better.
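
    A typical prototype before and after (illustrative):

    void *kmalloc(size_t size, unsigned int __nocast flags);   /* before */
    void *kmalloc(size_t size, gfp_t flags);                   /* after  */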

    Signed-off-by: Al Viro
    Signed-off-by: Linus Torvalds

    Al Viro
     

10 Sep, 2005

1 commit


08 Sep, 2005

2 commits


28 Aug, 2005

1 commit


08 Aug, 2005

1 commit


28 Jul, 2005

1 commit

  • Fix bug introduced in 2.6.11-rc2: when we clone a BIO we need to copy over the
    current index into it as well.

    It corrupts data with some MD setups.

    See http://bugzilla.kernel.org/show_bug.cgi?id=4946

    Huuuuuuuuge thanks to Matthew Stapleton for doggedly
    chasing this one down.
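
    The fix boils down to also copying the index when cloning, roughly (sketch,
    not the literal diff):

    bio->bi_idx = bio_src->bi_idx;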

    Acked-by: Jens Axboe
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Andrew Morton
     

08 Jul, 2005

1 commit

  • Add a new section called ".data.read_mostly" for data items that are read
    frequently and rarely written to like cpumaps etc.

    If these maps are placed in the .data section then these frequently read
    items may end up in cache lines with data that is frequently updated. In
    that case all processors in an SMP system must needlessly reload, again
    and again, the cache lines containing those frequently used variables.

    The ability to share these cachelines will allow each cpu in an SMP system
    to keep local copies of those shared cachelines thereby optimizing
    performance.
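
    The section is used via a simple attribute macro that architectures can
    hook up, roughly:

    #define __read_mostly __attribute__((__section__(".data.read_mostly")))

    plus a matching .data.read_mostly entry in the linker scripts.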

    Signed-off-by: Alok N Kataria
    Signed-off-by: Shobhit Dayal
    Signed-off-by: Christoph Lameter
    Signed-off-by: Shai Fultheim
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Christoph Lameter
     

20 Jun, 2005

3 commits