23 Dec, 2011

1 commit

  • In RAID1, a replacement is much like a normal device, so we just
    double the size of the relevant arrays and look at all possible
    devices for reads and writes.

    This means that the array looks like it is now double the size in some
    way - we need to be careful about that.
    In particular, when checking whether the array is still degraded while
    creating a recovery request, we need to consider only the first 'half'
    - i.e. the real (non-replacement) devices (see the sketch below).

    Signed-off-by: NeilBrown

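    A minimal user-space sketch of that check - not the kernel code; the
    struct and field names here are invented for illustration - showing
    that only the first raid_disks slots (the real devices) are consulted
    when counting how degraded the array is:

        struct mirror_info {
            int present;        /* non-zero if a working device is here */
        };

        /* mirrors[] holds 2 * raid_disks entries: slots [0, raid_disks)
         * are the real devices, the rest are their replacements. */
        struct r1conf_model {
            int raid_disks;
            struct mirror_info mirrors[2 * 8];
        };

        /* Count missing devices by looking at the first 'half' only; a
         * working replacement must not hide a failed original. */
        int count_degraded(const struct r1conf_model *conf)
        {
            int i, missing = 0;

            for (i = 0; i < conf->raid_disks; i++)  /* not 2 * raid_disks */
                if (!conf->mirrors[i].present)
                    missing++;
            return missing;
        }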

11 Oct, 2011

7 commits


07 Oct, 2011

1 commit


28 Jul, 2011

4 commits

  • When we get a write error (in the data area, not in metadata),
    update the badblock log rather than failing the whole device.

    As the write may well span many blocks, we try writing each block
    individually and log only the ones which fail (see the sketch below).

    Signed-off-by: NeilBrown
    Reviewed-by: Namhyung Kim

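    A rough user-space model of that retry loop - write_one_block() and
    record_badblock() are hypothetical stand-ins for the real driver
    machinery, not md functions:

        #include <stdbool.h>

        bool write_one_block(int disk, long long sector, int block_sectors);
        void record_badblock(int disk, long long sector, int block_sectors);

        /* On a failed multi-block write, retry block by block and record
         * only the blocks that really are bad; the device stays in the
         * array instead of being failed outright. */
        void handle_write_error(int disk, long long start, int nr_blocks,
                                int block_sectors)
        {
            int i;

            for (i = 0; i < nr_blocks; i++) {
                long long sector = start + (long long)i * block_sectors;

                if (!write_one_block(disk, sector, block_sectors))
                    record_badblock(disk, sector, block_sectors);
            }
        }
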
  • When performing write-behind we allocate pages to store the data
    during write.
    Previously we just kept a list of pages. Now we keep a list of
    bi_vecs, each of which includes an offset and size (see the sketch
    below).
    This means that the r1bio has complete information to create a new
    bio which will be needed for retrying after write errors.

    Signed-off-by: NeilBrown
    Reviewed-by: Namhyung Kim

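    A sketch of the data-structure change, using made-up type names rather
    than the kernel's own struct bio_vec and r1_bio:

        struct page;                    /* opaque in this model */

        /* Before: only the pages themselves were remembered. */
        struct behind_pages_old {
            struct page **pages;
            int nr;
        };

        /* After: offset and length are kept as well, mirroring a bio_vec,
         * so a fresh bio can be rebuilt from this information alone when
         * a write has to be retried after an error. */
        struct behind_vec {
            struct page  *bv_page;
            unsigned int bv_len;
            unsigned int bv_offset;
        };

        struct behind_pages_new {
            struct behind_vec *bvecs;
            int nr;
        };
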
  • If we succeed in writing to a block that was recorded as
    being bad, we clear the bad-block record.

    This requires some delayed handling, as the bad-block-list update has
    to happen in process context (see the sketch below).

    Signed-off-by: NeilBrown
    Reviewed-by: Namhyung Kim

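    A simplified model of that deferral - the field and function names are
    invented, and the real code routes this through the r1_bio state and
    the per-array daemon thread:

        #include <stdbool.h>

        struct r1bio_model {
            bool clear_bad_block;       /* set from the end_io path */
        };

        /* Completion (interrupt) context: only note what happened and let
         * the daemon thread be woken; no list manipulation here. */
        void on_write_done(struct r1bio_model *r1_bio, bool success,
                           bool target_was_bad)
        {
            if (success && target_was_bad)
                r1_bio->clear_bad_block = true;
        }

        /* Daemon thread (process context): safe to take locks and resize
         * the bad-block list, so the record is cleared here. */
        void daemon_handle(struct r1bio_model *r1_bio)
        {
            if (r1_bio->clear_bad_block) {
                /* clear the bad-block record for the written range */
                r1_bio->clear_bad_block = false;
            }
        }
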
  • Now that we have a bad block list, we should not read from those
    blocks.
    There are several main parts to this:
    1/ read_balance needs to check for bad blocks, and return not only
    the chosen device, but also how many good blocks are available
    there.
    2/ fix_read_error needs to avoid trying to read from bad blocks.
    3/ read submission must be ready to issue multiple reads to
    different devices as different bad blocks on different devices
    could mean that a single large read cannot be served by any one
    device, but can still be served by the array.
    This requires keeping a count of the number of outstanding requests
    per bio. This count is stored in 'bi_phys_segments' (a sketch of the
    read splitting follows below).
    4/ retrying a read also needs to be ready to submit a smaller read
    and queue another request for the rest.

    This does not yet handle bad blocks when reading to perform resync,
    recovery, or check.

    'md_trim_bio' will also be used for RAID10, so put it in md.c and
    export it.

    Signed-off-by: NeilBrown

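    A user-space sketch of point 3/ above, with invented names (the real
    code works with struct bio, read_balance() and the r1_bio): if the
    chosen device only has good sectors for part of the read, submit that
    much, bump the per-bio count of outstanding sub-requests, and go
    around again for the remainder, possibly picking a different device:

        /* What a read_balance-style chooser reports: which disk to use
         * and how many sectors are good there before a bad block. */
        struct read_choice {
            int disk;
            long long good_sectors;
        };

        struct bio_model {
            long long sector;
            long long sectors;
            int outstanding;        /* plays the role of bi_phys_segments */
        };

        struct read_choice choose_disk(long long sector, long long sectors);
        void submit_read(struct bio_model *bio, int disk,
                         long long sector, long long sectors);

        /* Serve a read that no single device may be able to satisfy
         * whole, but that the array as a whole still can. */
        void raid_read(struct bio_model *bio)
        {
            long long done = 0;

            while (done < bio->sectors) {
                long long sector = bio->sector + done;
                long long left = bio->sectors - done;
                struct read_choice c = choose_disk(sector, left);
                long long chunk = left < c.good_sectors ? left
                                                        : c.good_sectors;

                if (chunk <= 0)
                    break;              /* nothing can serve this part */

                bio->outstanding++;     /* one more sub-read in flight */
                submit_read(bio, c.disk, sector, chunk);
                done += chunk;
            }
        }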

27 Jul, 2011

1 commit

  • If we hit a read error while recovering a mirror, we want to abort the
    recovery without necessarily failing the disk - as having a disk with
    a read error is better than not having an array at all.

    Currently this is managed with a per-array flag "recovery_disabled"
    and is only implemented for RAID1. For RAID10 we will need finer
    grained control as we might want to disable recovery for individual
    devices separately.

    So push more of the decision making into the personality.
    'recovery_disabled' is now a 'cookie' which is copied when the
    personality wants to disable recovery, and which is changed when a
    device is added to the array, as that is the trigger to 'try recovery
    again' (see the sketch below).

    This will allow RAID10 to get the control that it needs.

    Signed-off-by: NeilBrown

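    A small model of the cookie scheme with invented type names - the
    personality snapshots the array-wide value when it gives up, and a
    later mismatch (caused, for example, by adding a device) means
    recovery is worth trying again:

        struct array_model {
            int recovery_disabled;      /* bumped when a device is added */
        };

        struct personality_model {
            int recovery_disabled;      /* snapshot taken when giving up */
        };

        /* Personality hits an unrecoverable error during recovery:
         * remember the current cookie instead of failing the device. */
        void disable_recovery(struct personality_model *conf,
                              const struct array_model *mddev)
        {
            conf->recovery_disabled = mddev->recovery_disabled;
        }

        /* Adding a device changes the cookie ... */
        void device_added(struct array_model *mddev)
        {
            mddev->recovery_disabled++;
        }

        /* ... which makes recovery worth attempting again. */
        int recovery_allowed(const struct personality_model *conf,
                             const struct array_model *mddev)
        {
            return conf->recovery_disabled != mddev->recovery_disabled;
        }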

08 Jun, 2011

1 commit


11 May, 2011

1 commit

  • The current handling and freeing of the write-behind pages is a bit
    fragile. We only keep the list of allocated pages in each bio, so the
    bio has to still be valid when the pages are freed, which is clumsy.

    So simply store the allocated page list in the r1_bio so it can easily
    be found and freed when we are finished with the r1_bio (see the
    sketch below).

    Signed-off-by: NeilBrown

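    An illustrative user-space model of the ownership change (names
    invented; in this model the pages come from malloc, unlike the
    kernel): the r1_bio carries the page list itself, so freeing needs
    nothing else to still exist:

        #include <stdlib.h>

        struct page_model { void *data; };

        /* The r1_bio owns the write-behind pages directly. */
        struct r1bio_pages_model {
            struct page_model **behind_pages;
            int behind_page_count;
        };

        /* Freeing needs only the r1_bio - no bio has to be kept alive
         * just so its page list can be walked. */
        void free_behind_pages(struct r1bio_pages_model *r1_bio)
        {
            int i;

            for (i = 0; i < r1_bio->behind_page_count; i++)
                free(r1_bio->behind_pages[i]);
            free(r1_bio->behind_pages);
            r1_bio->behind_pages = NULL;
            r1_bio->behind_page_count = 0;
        }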

29 Oct, 2010

1 commit


10 Sep, 2010

1 commit

  • This patch converts md to support REQ_FLUSH/FUA instead of the now
    deprecated REQ_HARDBARRIER. In the core part (md.c), the following
    changes are notable.

    * Unlike REQ_HARDBARRIER, REQ_FLUSH/FUA don't interfere with
    processing of other requests and thus there is no reason to mark the
    queue congested while FLUSH/FUA is in progress.

    * REQ_FLUSH/FUA failures are final and its users don't need retry
    logic. Retry logic is removed.

    * Preflush needs to be issued to all member devices but FUA writes can
    be handled the same way as other writes - their processing can be
    deferred to request_queue of member devices. md_barrier_request()
    is renamed to md_flush_request() and simplified accordingly.

    For linear, raid0 and multipath, the core changes are enough. raid1,
    5 and 10 need the following conversions.

    * raid1: Handling of FLUSH/FUA bios can simply be deferred to the
    request_queues of member devices. Barrier related logic removed (see
    the sketch below).

    * raid5: Queue draining logic dropped. FUA bit is propagated through
    biodrain and stripe reconstruction such that all the updated parts
    of the stripe are written out with FUA writes if any of the dirtying
    writes was FUA. preread_active_stripes handling in make_request()
    is updated as suggested by Neil Brown.

    * raid10: FUA bit needs to be propagated to write clones.

    linear, raid0, 1, 5 and 10 tested.

    Signed-off-by: Tejun Heo
    Reviewed-by: Neil Brown
    Signed-off-by: Jens Axboe

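    A much-simplified, RAID1-flavoured model of that split - the flag
    values and helpers below are invented stand-ins, not the block layer's
    REQ_FLUSH/REQ_FUA machinery: a preflush is forwarded to every member
    device, while the FUA bit simply stays on the write that is handed
    down to each member's request_queue:

        #define MODEL_FLUSH  (1u << 0)   /* stand-in for REQ_FLUSH */
        #define MODEL_FUA    (1u << 1)   /* stand-in for REQ_FUA */

        struct member { int id; };

        void issue_flush(struct member *dev);
        void issue_write(struct member *dev, unsigned int flags,
                         long long sector, long long sectors);

        /* Preflush goes to every member first; the write itself, FUA bit
         * included, is just passed down - the members' own queues handle
         * it, so md needs no draining or retry logic of its own. */
        void model_make_request(struct member *members, int nr_members,
                                unsigned int flags, long long sector,
                                long long sectors)
        {
            int i;

            if (flags & MODEL_FLUSH)
                for (i = 0; i < nr_members; i++)
                    issue_flush(&members[i]);

            if (sectors)
                for (i = 0; i < nr_members; i++)
                    issue_write(&members[i], flags & MODEL_FUA,
                                sector, sectors);
        }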

14 Dec, 2009

1 commit


16 Jun, 2009

1 commit

  • Having a macro just to cast a void* isn't really helpful.
    I would much rather see that we are simply dereferencing ->private
    than have to know what the macro does.

    So open code the macro everywhere and remove the pointless cast (see
    the sketch below).

    Signed-off-by: NeilBrown

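    The change in miniature, with made-up type names (the macro removed
    from md was along the lines of mddev_to_conf()):

        struct conf_model  { int nr_disks; };
        struct mddev_model { void *private; };

        /* Before: a macro whose only job is to cast away the void pointer. */
        #define mddev_to_conf(mddev) ((struct conf_model *)(mddev)->private)

        void before_and_after(struct mddev_model *mddev)
        {
            /* Old style: the reader has to know what the macro expands to. */
            struct conf_model *a = mddev_to_conf(mddev);

            /* Open-coded: assigning from void * needs no cast in C, and
             * it is obvious that ->private is what is being used. */
            struct conf_model *b = mddev->private;

            (void)a;
            (void)b;
        }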

31 Mar, 2009

2 commits