Eric Lee / smarc-fsl-linux-kernel

07 Jan, 2006

40 commits

93c8cad03 [PATCH] md: export rdev->data_offset via sysfs

Signed-off-by: Neil Brown
Acked-by: Greg KH
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:09 +0800
014236d2b [PATCH] md: expose device slot information via sysfs

Thus the role that a device has in an array can be viewed and set.

Signed-off-by: Neil Brown
Acked-by: Greg KH
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:09 +0800
2bf071bf5 [PATCH] md: keep better track of dev/array size when assembling md arrays

Move the checks - that dev size is never less than array size - into
bind_rdev_to_array to make sure it always happens properly (there is one place
where currently it doesn't).

Also reject any superblock which claims an array size smaller than the device
in question can hold.

Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:09 +0800
da943b991 [PATCH] md: allow md/raid_disks to be settable

If array is active, try to reshape, else just set the value.

Signed-off-by: Neil Brown
Acked-by: Greg KH
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:09 +0800
4dbcdc751 [PATCH] md: count corrected read errors per drive

Store this total in superblock (as appropriate), and make it available to
userspace via sysfs.

Signed-off-by: Neil Brown
Acked-by: Greg KH
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:09 +0800
d9d166c2a [PATCH] md: allow array level to be set textually via sysfs

Signed-off-by: Neil Brown
Acked-by: Greg KH
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:09 +0800
8bb93aaca [PATCH] md: expose md metadata format in sysfs

Allow it to be set to a particular version, or 'none'.

Signed-off-by: Neil Brown
Acked-by: Greg KH
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:08 +0800
a35b0d695 [PATCH] md: allow md array component size to be accessed and set via sysfs

Signed-off-by: Neil Brown
Acked-by: Greg KH
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:08 +0800
3b34380ae [PATCH] md: allow chunk_size to be settable through sysfs

... only before array is started of course.

Signed-off-by: Neil Brown
Acked-by: Greg KH
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:08 +0800
03c902e17 [PATCH] md: fix rdev->pending counts in raid1

When we do a user-requested check/repair, we lose count of the outstanding
requests...

Also make sure that when anything is written to md/sync_action, the
RECOVERY_NEEDED flag is set and the thread is woken up so any changes take
effect.

Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:08 +0800
c708443c0 [PATCH] md: make sure bitmap updates are visible through filesystem

When we update a page_cache page in the kernel, we need to call
flush_dcache_page or userspace might not see the change.

Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:08 +0800
07dbd3772 [PATCH] drivers/md/md.c: make md_new_event() static

Make the needlessly global function md_new_event() static.

Signed-off-by: Adrian Bunk
Cc: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Adrian Bunk
2006-01-07 00:34:07 +0800
2989ddbd6 [PATCH] md: make a couple of names in md.c static

... because they aren't used outside md.c

Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:07 +0800
f188593ee [PATCH] md: fix typo in comment

Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:07 +0800
bce74dac0 [PATCH] md: helper function to match commands written to sysfs files

Commands written to sysfs files may, or may not, be \n terminated. We want to
accept either case. For this we use cmd_match.
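
A minimal userspace sketch of such a helper, assuming the behaviour described
above (exact match, with one optional trailing newline on the written
command). The in-kernel cmd_match may differ in detail.

```c
/* Return 1 if cmd equals str, ignoring one optional trailing '\n' on cmd. */
int cmd_match(const char *cmd, const char *str)
{
	/* Walk both strings for as long as they agree. */
	while (*cmd && *str && *cmd == *str) {
		cmd++;
		str++;
	}
	/* Tolerate a single newline at the end of the written command. */
	if (*cmd == '\n')
		cmd++;
	/* Both strings must be fully consumed for a match. */
	return *cmd == '\0' && *str == '\0';
}
```

With this, both "check" and "check\n" match the keyword "check", while
"checkx" and "che" do not.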

Signed-off-by: Neil Brown
Acked-by: Greg KH
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:07 +0800
1345b1d8a [PATCH] md: define and use safe_put_page for md

md sometimes calls put_page on NULL pointers (treating it like kfree). This is
not safe, so define and use a 'safe_put_page' which checks for NULL.
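
The idea can be sketched in userspace as follows. The struct page and
put_page below are hypothetical stand-ins (a bare reference count), not the
kernel definitions; only the NULL check in safe_put_page reflects the text
above.

```c
#include <stddef.h>

/* Hypothetical stand-in for the kernel's struct page: just a refcount. */
struct page {
	int refcount;
};

/* Stand-in for put_page(): drops one reference; must not be given NULL. */
void put_page(struct page *p)
{
	p->refcount--;
}

/* NULL-tolerant wrapper, in the spirit of kfree(NULL) being a no-op. */
void safe_put_page(struct page *p)
{
	if (p)
		put_page(p);
}
```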

Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:07 +0800
7dd5d34c6 [PATCH] md: remove inappropriate limits in md/bitmap configuration.

The kernel should not be imposing these policy limits: The time between
bitmap updates should certainly be allowed to be more than 15 seconds, and
if someone wants a bitmap chunk size in excess of 4MB, the kernel isn't the
place to stop them.

Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:07 +0800
097426f68 [PATCH] md: fix possible problem in raid1/raid10 error overwriting

The code to overwrite/reread for addressing read errors in raid1/raid10
currently assumes that the read will not alter the buffer which could be used
to write to the next device. This is not a safe assumption to make.

So we split the loops into an overwrite loop and a separate re-read loop, so
that the writing is complete before reading is attempted.
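
A rough userspace model of the two-phase ordering, assuming in-memory
"mirrors" in place of real devices. struct mirror, dev_write, dev_read and
fix_read_error are hypothetical names for illustration and are not taken from
the raid1/raid10 code.

```c
#include <string.h>

#define BLKSZ 8

/* Hypothetical in-memory mirror; 'faulty' makes reads and writes fail. */
struct mirror {
	unsigned char block[BLKSZ];
	int faulty;
};

static int dev_write(struct mirror *m, const unsigned char *buf)
{
	if (m->faulty)
		return -1;
	memcpy(m->block, buf, BLKSZ);
	return 0;
}

static int dev_read(struct mirror *m, unsigned char *buf)
{
	if (m->faulty)
		return -1;
	memcpy(buf, m->block, BLKSZ);
	return 0;
}

/*
 * Overwrite a block on every mirror with the data in buf, then verify.
 * failed[i] is set to 1 for each mirror that fails either phase.
 * Returns the number of failed mirrors.
 */
int fix_read_error(struct mirror *m, int n, unsigned char *buf, int failed[])
{
	int i, nfailed = 0;

	/* Phase 1: write the good data to every mirror. */
	for (i = 0; i < n; i++)
		failed[i] = dev_write(&m[i], buf) != 0;

	/*
	 * Phase 2: re-read. The re-read reuses buf, which is acceptable only
	 * because every write has already been issued from it.
	 */
	for (i = 0; i < n; i++) {
		if (!failed[i] && dev_read(&m[i], buf) != 0)
			failed[i] = 1;
		nfailed += failed[i];
	}
	return nfailed;
}
```

Interleaving write and re-read in one loop would let a re-read on mirror i
modify buf before it is written to mirror i+1, which is the hazard described
above.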

Cc: Paul Clements
Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:06 +0800
2604b703b [PATCH] md: remove personality numbering from md

md supports multiple different RAID levels, each being implemented by a
'personality' (which is often in a separate module).

These personalities have fairly artificial 'numbers'. The numbers
are used to:
1- provide an index into an array where the various personalities
are recorded
2- identify the module (via an alias) which implements a particular
personality.

Neither of these uses really justifies the existence of personality numbers.
The array can be replaced by a linked list which is searched (array lookup
only happens very rarely). Module identification can be done using an alias
based on level rather than 'personality' number.

The current 'raid5' module supports two levels (4 and 5) but only one
personality. This slight awkwardness (which was handled in the mapping from
level to personality) can be better handled by allowing raid5 to register 2
personalities.
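
As an illustration only, a level-keyed registry along these lines could look
like the sketch below. The struct layout and the names register_personality
and find_personality are assumptions made for this example and are simplified
from anything the kernel would actually use (no locking, no module handling).

```c
#include <stddef.h>

/* Hypothetical, simplified personality descriptor keyed by RAID level. */
struct personality {
	const char *name;
	int level;
	struct personality *next;	/* singly linked registry */
};

static struct personality *pers_list;	/* head of the registry */

/* Add a personality to the front of the list. */
void register_personality(struct personality *p)
{
	p->next = pers_list;
	pers_list = p;
}

/* Linear search by level; lookups are rare, so O(n) is acceptable. */
struct personality *find_personality(int level)
{
	struct personality *p;

	for (p = pers_list; p; p = p->next)
		if (p->level == level)
			return p;
	return NULL;
}
```

Under this scheme one module can simply register two entries, for example
one with level 4 and one with level 5, and the core needs no table of known
personalities.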

With this change in place, the core md module does not need to have an
exhaustive list of all possible personalities, so other personalities can be
added independently.

This patch also moves the check for chunksize being non-zero into the ->run
routines for the personalities that need it, rather than having it in core-md.
This has a side effect of allowing 'faulty' and 'linear' not to have a
chunk-size set.

Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:06 +0800
a24a8dd85 [PATCH] md: break out of a loop that doesn't need to run to completion

Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:06 +0800
a8745db23 [PATCH] md: convert recently exported symbol to GPL

...because that seems to be the preferred practice these days.

Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:06 +0800
ea03aff93 [PATCH] md: convert various kmap calls to kmap_atomic

Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:06 +0800
fccddba06 [PATCH] md: tidy up raid5/6 hash table code

- replace open-coded hash chain with hlist macros

- Fix hash-table size at one page - it is already quite generous, so there
will never be a need to use multiple pages, so no need for __get_free_pages

No functional change.

Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:06 +0800
9ffae0cf3 [PATCH] md: convert md to use kzalloc throughout

Replace multiple kmalloc/memset pairs with kzalloc calls.

Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:05 +0800
2d1f3b5d1 [PATCH] md: clean up 'page' related names in md

Substitute:

page_cache_get -> get_page
page_cache_release -> put_page
PAGE_CACHE_SHIFT -> PAGE_SHIFT
PAGE_CACHE_SIZE -> PAGE_SIZE
PAGE_CACHE_MASK -> PAGE_MASK
__free_page -> put_page

because we aren't using the page cache, we are just using pages.

Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:05 +0800
d7603b7e3 [PATCH] md: make /proc/mdstat pollable

With this patch it is possible to poll /proc/mdstat to detect arrays appearing
or disappearing, to detect failures, recovery starting, recovery completing,
and devices being added and removed.

It is similar to the poll-ability of /proc/mounts, though different in that:

We always report that the file is readable (because face it, it is, even if
only for EOF).

We report POLLPRI when there is a change so that select() can detect
it as an exceptional event. Not only are these exceptional events, but
that is the mechanism that the current 'mdadm' uses to watch for events
(It also polls after a timeout).
(We also report POLLERR like /proc/mounts).

Finally, we only reset the per-file event counter when the start of the file
is read, rather than when poll() returns an event. This is more robust as it
means that an fd will continue to report activity to poll/select until the
program clearly responds to that activity.
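
A userspace watcher built on this behaviour might look roughly like the
sketch below. The function name wait_for_md_event and its return convention
are assumptions for illustration; only the use of POLLPRI/POLLERR and the
read-from-start step come from the description above.

```c
#include <fcntl.h>
#include <poll.h>
#include <unistd.h>

/*
 * Wait up to timeout_ms for a change on an mdstat-like file.
 * Returns 1 if a change was signalled, 0 on timeout, -1 on error.
 */
int wait_for_md_event(const char *path, int timeout_ms)
{
	char buf[4096];
	struct pollfd pfd;
	int ret;

	pfd.fd = open(path, O_RDONLY);
	if (pfd.fd < 0)
		return -1;

	/* Read from the start of the file, which resets the event counter. */
	while (read(pfd.fd, buf, sizeof(buf)) > 0)
		;

	/* Ask only for POLLPRI: the file always reports itself readable. */
	pfd.events = POLLPRI;
	pfd.revents = 0;
	ret = poll(&pfd, 1, timeout_ms);
	close(pfd.fd);

	if (ret < 0)
		return -1;
	return (ret > 0 && (pfd.revents & (POLLPRI | POLLERR))) ? 1 : 0;
}
```

A caller would typically loop on wait_for_md_event("/proc/mdstat", ...) and
re-parse the file each time it returns 1.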

md_new_event takes an 'mddev' which isn't currently used, but it will be soon.

Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:05 +0800
0eb3ff12a [PATCH] md: raid10 read-error handling - resync and read-only

Add in correct read-error handling for resync and read-only situations.

When read-only, we don't over-write, so we need to mark the failed drive in
the r10_bio so we don't re-try it. During resync, we always read all blocks,
so if there is a read error, we simply over-write it with the good block that
we found (assuming we found one).

Note that the recovery case still isn't handled in an interesting way. There
is nothing useful to do for the 2-copies case. If there are 3 or more copies,
then we could try reading from one of the non-missing copies, but this is a
bit complicated and very rarely would be used, so I'm leaving it for now.

Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:05 +0800
4443ae10c [PATCH] md: auto-correct correctable read errors in raid10

Largely just a cross-port from raid1.

Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:05 +0800
220946c90 [PATCH] md: make sure read error on last working drive of raid1 actually returns failure

We are inadvertently setting the R1BIO_Uptodate bit on read errors when we
decide not to try correcting (because there are no other working devices).
This means that the read error is reported to the client as success.

Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:04 +0800
d11c171e6 [PATCH] md: allow raid1 to check consistency

When performing a user-requested 'check' or 'repair', we read all readable
devices, and compare the contents. We only write to blocks which had read
errors, or blocks with content that differs from the first good device found.
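
The selection rule can be sketched in userspace as below, assuming each
mirror's copy of a block is already in memory together with a flag saying
whether the read succeeded. plan_repair and its parameters are hypothetical
names for this example, not part of the raid1 code.

```c
#include <stddef.h>
#include <string.h>

/*
 * data[i] is the block as read from device i; read_ok[i] is 0 on read error.
 * On return, needs_write[i] is 1 for every device that should be rewritten.
 * Returns the index of the first good device, or -1 if none was readable.
 */
int plan_repair(const unsigned char *const data[], const int read_ok[],
		int needs_write[], size_t ndev, size_t blksz)
{
	int first_good = -1;
	size_t i;

	/* Find the first device whose read succeeded; it is the reference. */
	for (i = 0; i < ndev; i++) {
		needs_write[i] = 0;
		if (first_good < 0 && read_ok[i])
			first_good = (int)i;
	}
	if (first_good < 0)
		return -1;

	for (i = 0; i < ndev; i++) {
		if (!read_ok[i])
			needs_write[i] = 1;	/* read error: overwrite */
		else if (memcmp(data[i], data[first_good], blksz) != 0)
			needs_write[i] = 1;	/* differs from reference */
	}
	return first_good;
}
```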

Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:04 +0800
18f08819f [PATCH] md: support check-without-repair of raid10 arrays

Also keep count of the number of errors found.

Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:04 +0800
9910f16af [PATCH] md: fix up some rdev rcu locking in raid5/6

There is this "FIXME" comment with a typo in it!! that has been annoying me
for days, so I just had to remove it.

conf->disks[i].rdev should only be accessed if
- we know we hold a reference or
- the mddev->reconfig_sem is down or
- we have a rcu_readlock

handle_stripe was referencing rdev in three places without any of these. For
the first two, get an rcu_readlock. For the last, the same access
(md_sync_acct call) is made a little later after the rdev has been claimed
under an rcu_readlock, if R5_Syncio is set. So just use that access...
However R5_Syncio isn't really needed as the 'syncing' variable contains the
same information. So use that instead.

Issues, comment, and fix are identical in raid5 and raid6.

Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:04 +0800
cf30a473a [PATCH] md: handle errors when read-only

Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:04 +0800
69382e853 [PATCH] md: better handling for read error in raid1 during resync

Handling of read errors during resync is separate from handling of read errors
during normal IO in raid1. A previous patch added support for read errors
during normal IO. This one adds support for read errors during resync or
recovery.

The key differences are that we don't need to freeze the array, because the
normal handling of resync means that this part of the array will be idle
except for resync, and the read/overwrite/re-read is needed in a separate
piece of code.

Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:04 +0800
3e198f782 [PATCH] md: tidyup some issues with raid1 resync and prepare for catching read errors

We are dereferencing ->rdev without an rcu lock!

Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:03 +0800
ddaf22aba [PATCH] md: attempt to auto-correct read errors in raid1

On a read-error we suspend the array, then synchronously read the block from
other devices until we find one where we can read it. Then we try writing the
good data back everywhere and make sure it works. If any write or subsequent
read fails, only then do we fail the device out of the array.

To be able to suspend the array, we need to also keep track of how many
requests are queued for handling by raid1d.

Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:03 +0800
d69762e98 [PATCH] md: improve handling of read errors with raid6

This is a simple port of matching functionality across from raid5. If we get a
read error, we don't kick the drive straight away, but try to over-write with
good data first.

Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:03 +0800
ca65b73bd [PATCH] md: fix raid6 resync check/repair code

raid6 currently does not check the P/Q syndromes when doing a resync; it just
calculates the correct value and writes it. Doing the check can reduce writes
(often to 0) for a resync, and it is needed to properly implement the

echo check > sync_action

operation.

This patch implements the appropriate checks and tidies up some related code.

It also allows raid6 user-requested resync to bypass the intent bitmap.

Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:03 +0800
6cce3b23f [PATCH] md: write intent bitmap support for raid10

Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:03 +0800
b15c2e57f [PATCH] md: move bitmap_create to after md array has been initialised

This is important because bitmap_create uses
mddev->resync_max_sectors
and that doesn't have a valid value until after the array
has been initialised (with pers->run()).
[It doesn't make a difference for current personalities that
support bitmaps, but will make a difference for raid10]

This has the added advantage of meaning we can move the thread->timeout
manipulation inside the bitmap.c code instead of sprinkling identical code
throughout all personalities.

Signed-off-by: Neil Brown
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

NeilBrown
2006-01-07 00:34:03 +0800