11 Feb, 2020

1 commit

  • commit 44d8ebf436399a40fcd10dd31b29d37823d62fcc upstream.

    Ensure that the pool is locked during calls to __commit_transaction and
    __destroy_persistent_data_objects. This is just being consistent with
    locking; in reality dm_pool_metadata_close is called once the pool is
    being destroyed, so access to the pool shouldn't be contended.

    Also, rename __pmd_write_lock to pmd_write_lock_in_core, dropping the
    alias of the same name (there was no need for it), and use
    pmd_write_lock_in_core directly in dm_pool_commit_metadata.

    In addition, verify that the pool is locked in __commit_transaction().
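
    The locking discipline described above can be sketched as a toy Python
    model (names and structure are illustrative; the kernel uses the
    rw-semaphore pmd->root_lock, not a Python Lock):

```python
import threading

class PoolMetadata:
    """Toy model of the dm_pool_metadata locking discipline (not kernel code)."""
    def __init__(self):
        self.root_lock = threading.Lock()
        self.committed = False

    def __commit_transaction(self):
        # Mirrors the new check: the caller must already hold the lock.
        assert self.root_lock.locked(), "pool must be locked during commit"
        self.committed = True
        return 0

    def __destroy_persistent_data_objects(self):
        assert self.root_lock.locked(), "pool must be locked during destroy"

    def close(self):
        # dm_pool_metadata_close now takes the lock around both calls.
        with self.root_lock:
            self.__commit_transaction()
            self.__destroy_persistent_data_objects()

pmd = PoolMetadata()
pmd.close()
```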

    Fixes: 873f258becca ("dm thin metadata: do not write metadata if no changes occurred")
    Cc: stable@vger.kernel.org
    Signed-off-by: Mike Snitzer
    Signed-off-by: Greg Kroah-Hartman

    Mike Snitzer
     

21 Dec, 2019

1 commit

  • commit ecda7c0280e6b3398459dc589b9a41c1adb45529 upstream.

    Add support for one pre-commit callback which is run right before the
    metadata are committed.

    This allows the thin provisioning target to run a callback before the
    metadata are committed and is required by the next commit.
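
    A minimal sketch of the single pre-commit callback, as a toy Python
    model (method names are illustrative, not the kernel API):

```python
class PoolMetadata:
    """Toy model: one pre-commit callback, run right before commit."""
    def __init__(self):
        self.pre_commit_fn = None
        self.log = []

    def register_pre_commit_callback(self, fn):
        self.pre_commit_fn = fn     # only a single callback is supported

    def commit(self):
        if self.pre_commit_fn:
            self.pre_commit_fn()    # runs right before metadata are committed
        self.log.append("commit")

pmd = PoolMetadata()
# e.g. the thin-pool target registering its hook:
pmd.register_pre_commit_callback(lambda: pmd.log.append("pre-commit"))
pmd.commit()
```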

    Cc: stable@vger.kernel.org
    Signed-off-by: Nikos Tsironis
    Acked-by: Joe Thornber
    Signed-off-by: Mike Snitzer
    Signed-off-by: Greg Kroah-Hartman

    Nikos Tsironis
     

03 Jul, 2019

1 commit

  • Check if in fail_io mode at start of dm_pool_metadata_set_needs_check().
    Otherwise dm_pool_metadata_set_needs_check()'s superblock_lock() can
    crash in dm_bm_write_lock() while accessing the block manager object
    that was previously destroyed as part of a failed
    dm_pool_abort_metadata() that ultimately set fail_io to begin with.

    Also, update DMERR() message to more accurately describe
    superblock_lock() failure.
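
    The guard can be modeled as follows (a toy sketch with illustrative
    names; -22 stands in for an -EINVAL-style error code):

```python
class PoolMetadata:
    """Toy model of the fail_io early-return guard (not kernel code)."""
    def __init__(self):
        self.fail_io = False
        self.bm = object()            # stands in for the block manager
        self.needs_check = False

    def abort_metadata_failed(self):
        # A failed dm_pool_abort_metadata() destroys the block manager
        # and is what sets fail_io to begin with.
        self.bm = None
        self.fail_io = True

    def set_needs_check(self):
        if self.fail_io:              # the new guard: bail out first
            return -22
        # Without the guard we would dereference the destroyed block
        # manager (the dm_bm_write_lock() crash described above).
        assert self.bm is not None
        self.needs_check = True
        return 0

pmd = PoolMetadata()
pmd.abort_metadata_failed()
rc = pmd.set_needs_check()
```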

    Cc: stable@vger.kernel.org
    Reported-by: Zdenek Kabelac
    Signed-off-by: Mike Snitzer

    Mike Snitzer
     

19 Apr, 2019

3 commits

  • Otherwise, just activating a thin-pool and thin device and then
    deactivating them will cause the thin-pool metadata to be written
    (e.g. superblock updated) -- even though no metadata was actually
    changed.

    Add 'in_service' flag to struct dm_pool_metadata and set it in
    pmd_write_lock() because all on-disk metadata changes must take a write
    lock of pmd->root_lock. Once 'in_service' is set it is never cleared.
    __commit_transaction() will return 0 if 'in_service' is not set.
    dm_pool_commit_metadata() is updated to use __pmd_write_lock() so that
    it isn't the sole reason for putting a thin-pool in service.

    Also fix dm_pool_commit_metadata() to open the next transaction if the
    return from __commit_transaction() is 0. It is unclear why the early
    return for a return of 0 ever made sense, given that dm-io's
    async_io(), as used by bufio, always returns 0.
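
    The 'in_service' scheme can be sketched as a toy Python model (names
    are illustrative; the actual locking of pmd->root_lock is elided):

```python
class PoolMetadata:
    """Toy model of the 'in_service' flag; real locking is elided."""
    def __init__(self):
        self.in_service = False
        self.on_disk_writes = 0

    def pmd_write_lock(self):
        # All on-disk metadata changes take this lock, so it is the one
        # place that marks the pool as in service. Never cleared.
        self.in_service = True

    def pmd_write_lock_in_core(self):
        # In-core-only variant: does not put the pool in service.
        pass

    def commit_transaction(self):
        if not self.in_service:
            return 0              # nothing changed on disk: skip the write
        self.on_disk_writes += 1
        return 0

pmd = PoolMetadata()
pmd.pmd_write_lock_in_core()      # e.g. a bare dm_pool_commit_metadata()
assert pmd.commit_transaction() == 0 and pmd.on_disk_writes == 0
pmd.pmd_write_lock()              # a real metadata change
```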

    Signed-off-by: Mike Snitzer

    Mike Snitzer
     
  • No functional change, but this prepares to hook off of pmd_write_lock()
    with additional functionality (as provided in next commit).

    Suggested-by: Joe Thornber
    Signed-off-by: Mike Snitzer

    Mike Snitzer
     
  • Fix __reserve_metadata_snap() to return early if __commit_transaction()
    fails.

    Signed-off-by: Mike Snitzer

    Mike Snitzer
     

16 Jan, 2019

1 commit

  • Commit 00a0ea33b495 ("dm thin: do not queue freed thin mapping for next
    stage processing") changed process_prepared_discard_passdown_pt1() to
    increment all the blocks being discarded until after the passdown had
    completed to avoid them being prematurely reused.

    IO issued to a thin device that breaks sharing with a snapshot, followed
    by a discard issued to snapshot(s) that previously shared the block(s),
    results in passdown_double_checking_shared_status() being called to
    iterate through the blocks, double checking that their reference count
    is zero and issuing the passdown if so. A side effect of commit
    00a0ea33b495 is that passdown_double_checking_shared_status() was
    broken.

    Fix this by checking if the block reference count is greater than 1.
    Also, rename dm_pool_block_is_used() to dm_pool_block_is_shared().
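
    The renamed predicate amounts to a reference-count comparison (a toy
    sketch; the kernel reads the count from the pool's space map):

```python
# Hypothetical per-block reference counts after the passdown increments
# described above (block -> count).
ref_counts = {0: 0, 1: 1, 2: 2, 3: 5}

def block_is_shared(block):
    # dm_pool_block_is_shared() stand-in: shared means count > 1,
    # not merely "used" (count > 0).
    return ref_counts[block] > 1

# Passdown is only allowed for blocks that are not shared:
passdown_ok = [b for b in ref_counts if not block_is_shared(b)]
```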

    Fixes: 00a0ea33b495 ("dm thin: do not queue freed thin mapping for next stage processing")
    Cc: stable@vger.kernel.org # 4.9+
    Reported-by: ryan.p.norwood@gmail.com
    Signed-off-by: Joe Thornber
    Signed-off-by: Mike Snitzer

    Joe Thornber
     

11 Sep, 2018

1 commit

  • Committing a transaction can consume some metadata of its own, so we
    now reserve a small amount of metadata to cover this. Free metadata
    reported by the kernel will not include this reserve.

    If any of the reserve has been used after a commit we enter a new
    internal state PM_OUT_OF_METADATA_SPACE. This is reported as
    PM_READ_ONLY, so no userland changes are needed. If the metadata
    device is resized the pool will move back to PM_WRITE.

    These changes mean we never need to abort and roll back a transaction
    due to running out of metadata space. This is particularly important
    because there have been a handful of reports of data corruption against
    DM thin-provisioning that can all be attributed to the thin-pool having
    run out of metadata space.
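
    The reserve and state handling can be sketched as a toy model (state
    names match the description; the reserve size and threshold logic are
    illustrative):

```python
# Toy model of the metadata reserve scheme (not kernel code).
PM_WRITE, PM_OUT_OF_METADATA_SPACE, PM_READ_ONLY = range(3)
RESERVE = 4                 # blocks held back from the reported free count

class Pool:
    def __init__(self, nr_free):
        self.nr_free = nr_free
        self.state = PM_WRITE

    def reported_free(self):
        return max(0, self.nr_free - RESERVE)   # userland never sees the reserve

    def commit(self, used):
        self.nr_free -= used
        if self.nr_free < RESERVE:              # commit dipped into the reserve
            self.state = PM_OUT_OF_METADATA_SPACE

    def status_mode(self):
        # The new internal state is reported as read-only, so no userland
        # changes are needed.
        if self.state == PM_OUT_OF_METADATA_SPACE:
            return PM_READ_ONLY
        return self.state

    def resize_metadata_dev(self, extra):
        self.nr_free += extra
        if self.nr_free >= RESERVE:             # resized: back to write mode
            self.state = PM_WRITE

pool = Pool(nr_free=6)
pool.commit(used=3)         # only 3 blocks left: inside the reserve
```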

    Signed-off-by: Joe Thornber
    Signed-off-by: Mike Snitzer

    Joe Thornber
     

23 Jun, 2018

1 commit

  • Commit 5a32083d03fb5 ("dm: take care to copy the space map roots before
    locking the superblock") properly removed the calls to dm_sm_root_size()
    from __write_initial_superblock(). But the dm_sm_root_size() calls were
    left dangling in __commit_transaction().

    Fixes: 5a32083d03fb5 ("dm: take care to copy the space map roots before locking the superblock")
    Signed-off-by: Mike Snitzer

    Mike Snitzer
     

17 Jan, 2018

1 commit

  • For btree removal, there is a corner case in which a single thread
    could take 6 locks, which is more than THIN_MAX_CONCURRENT_LOCKS (5)
    and leads to deadlock.

    A btree removal might eventually call
    rebalance_children()->rebalance3() to rebalance entries of three
    neighbor child nodes when shadow_spine has already acquired two
    write locks. In rebalance3(), it tries to shadow and acquire the
    write locks of all three child nodes. However, shadowing a child
    node requires acquiring a read lock of the original child node and
    a write lock of the new block. Although the read lock will be
    released after block shadowing, shadowing the third child node
    in rebalance3() could still take the sixth lock.
    (2 write locks for the shadow_spine +
    2 write locks for the first two child nodes' shadows +
    1 write lock for the last child node's shadow +
    1 read lock for the last child node)
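
    The parenthesized tally above adds up to six, one more than the limit:

```python
# Worst-case concurrent locks during rebalance3(), per the breakdown above.
THIN_MAX_CONCURRENT_LOCKS = 5

locks_held = (
    2 +  # write locks held by the shadow_spine
    2 +  # write locks for the first two child nodes' shadows
    1 +  # write lock for the last child node's shadow
    1    # read lock on the last child node while shadowing it
)
over_limit = locks_held > THIN_MAX_CONCURRENT_LOCKS
```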

    Cc: stable@vger.kernel.org
    Signed-off-by: Dennis Yang
    Acked-by: Joe Thornber
    Signed-off-by: Mike Snitzer

    Dennis Yang
     

21 Jul, 2016

1 commit

  • The discard passdown was being issued after the block was unmapped,
    which meant the block could be reprovisioned whilst the passdown discard
    was still in flight.

    We can only identify unshared blocks (safe to pass a discard down to)
    once they're unmapped and their ref count hits zero. Block ref counts
    are now used to guard against concurrent allocation of these blocks
    while they are being discarded. So now we unmap the block, issue the
    passdown discard, and immediately increment the ref counts for the
    regions that have been discarded via passdown (this is safe because
    allocation occurs within the same thread). We then decrement the ref
    counts once the passdown discard IO is complete -- signaling that
    these blocks may now be allocated.
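
    The new ordering can be sketched as a toy model (illustrative; the
    real code uses the pool's space map and completes the passdown
    asynchronously):

```python
# Toy model of the fixed discard-passdown ordering (not kernel code).
class Block:
    def __init__(self):
        self.ref = 1              # mapped by one thin device

def discard(block, issue_passdown):
    block.ref -= 1                # unmap
    if block.ref == 0:            # unshared: safe to pass the discard down
        block.ref += 1            # guard against reallocation while in flight
        issue_passdown()
        block.ref -= 1            # passdown complete: may be allocated again

events = []
b = Block()
discard(b, lambda: events.append(("passdown", b.ref)))
```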

    This fixes the potential for corruption that was reported here:
    https://www.redhat.com/archives/dm-devel/2016-June/msg00311.html

    Reported-by: Dennis Yang
    Signed-off-by: Joe Thornber
    Signed-off-by: Mike Snitzer

    Joe Thornber
     

10 Dec, 2015

3 commits

  • Refactor dm_thin_find_mapped_range() so that it takes the metadata
    read lock itself, rather than relying on the finer-grained locking
    that is pushed down inside dm_thin_find_next_mapped_block() and
    dm_thin_find_block().

    Signed-off-by: Joe Thornber
    Signed-off-by: Mike Snitzer

    Joe Thornber
     
  • Use dm_btree_lookup_next() to more quickly discard partially mapped
    volumes.

    Signed-off-by: Joe Thornber
    Signed-off-by: Mike Snitzer

    Joe Thornber
     
  • When you take a metadata snapshot the btree roots for the mapping and
    details tree need to have their reference counts incremented so they
    persist for the lifetime of the metadata snap.

    The roots being incremented were those currently written in the
    superblock, which could possibly be out of date if concurrent IO is
    triggering new mappings, breaking of sharing, etc.

    Fix this by performing a commit with the metadata lock held while taking
    a metadata snapshot.

    Signed-off-by: Joe Thornber
    Signed-off-by: Mike Snitzer
    Cc: stable@vger.kernel.org

    Joe Thornber
     

03 Dec, 2015

1 commit

  • dm_btree_remove_leaves() only unmaps a contiguous region, so we need a
    loop in __remove_range() to handle ranges that contain multiple
    regions.

    A new btree function, dm_btree_lookup_next(), is introduced which can
    more efficiently skip over regions of the thin device that aren't
    mapped. __remove_range() uses dm_btree_lookup_next() on each iteration
    of its loop.

    Also, improve description of dm_btree_remove_leaves().
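
    The loop can be sketched as a toy model, with a sorted dict standing
    in for the mapping btree (names are illustrative):

```python
# Toy model of the __remove_range() loop (not kernel code).
mappings = {5: "a", 6: "b", 7: "c", 100: "d", 101: "e"}   # block -> mapping

def lookup_next(begin):
    # dm_btree_lookup_next() stand-in: first mapped key >= begin,
    # skipping unmapped holes in one step.
    keys = sorted(k for k in mappings if k >= begin)
    return keys[0] if keys else None

def remove_contiguous(begin):
    # dm_btree_remove_leaves() stand-in: unmaps one contiguous run only.
    while begin in mappings:
        del mappings[begin]
        begin += 1
    return begin

def remove_range(begin, end):
    while True:
        begin = lookup_next(begin)        # cheaply skip unmapped regions
        if begin is None or begin >= end:
            break
        begin = remove_contiguous(begin)  # one region per iteration

remove_range(0, 1000)                     # range spanning two regions
```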

    Fixes: 6550f075 ("dm thin metadata: add dm_thin_remove_range()")
    Signed-off-by: Joe Thornber
    Signed-off-by: Mike Snitzer
    Cc: stable@vger.kernel.org # 4.1+

    Joe Thornber
     

16 Jul, 2014

1 commit

  • The block size for the thin-pool's data device must remain fixed for
    the life of the thin-pool. Disallow any attempt to change the
    thin-pool's data block size.

    It should be noted that attempting to change the data block size via
    thin-pool table reload will be ignored as a side-effect of the thin-pool
    handover that the thin-pool target does during thin-pool table reload.

    Here is an example outcome of attempting to load a thin-pool table that
    reduced the thin-pool's data block size from 1024K to 512K.

    Before:
    kernel: device-mapper: thin: 253:4: growing the data device from 204800 to 409600 blocks

    After:
    kernel: device-mapper: thin metadata: changing the data block size (from 2048 to 1024) is not supported
    kernel: device-mapper: table: 253:4: thin-pool: Error creating metadata object
    kernel: device-mapper: ioctl: error adding target to table

    Signed-off-by: Mike Snitzer
    Acked-by: Joe Thornber
    Cc: stable@vger.kernel.org

    Mike Snitzer
     

06 Mar, 2014

1 commit

  • If a thin metadata operation fails, the current transaction will
    abort, which can cause data loss in IO layers up the stack
    (e.g. filesystems). As such, set THIN_METADATA_NEEDS_CHECK_FLAG in the
    thin metadata's superblock, which:
    1) requires the user verify the thin metadata is consistent (e.g. use
    thin_check, etc)
    2) suggests the user verify the thin data is consistent (e.g. use fsck)

    The only way to clear the superblock's THIN_METADATA_NEEDS_CHECK_FLAG is
    to run thin_repair.

    On metadata operation failure: abort current metadata transaction, set
    pool in read-only mode, and now set the needs_check flag.

    As part of this change, constraints are introduced or relaxed:
    * don't allow a pool to transition to write mode if needs_check is set
    * don't allow data or metadata space to be resized if needs_check is set
    * if a thin pool's metadata space is exhausted: the kernel will now
    force the user to take the pool offline for repair before the kernel
    will allow the metadata space to be extended.

    Also, update Documentation to include information about when the thin
    provisioning target commits metadata, how it handles metadata failures
    and running out of space.

    Signed-off-by: Mike Snitzer
    Signed-off-by: Joe Thornber

    Mike Snitzer
     

28 Feb, 2014

1 commit

  • It was always intended that a user could provide a thin metadata device
    that is larger than the max supported by the on-disk format. The extra
    space would just go unused.

    Unfortunately that never worked. If the user attempted to use a larger
    metadata device on creation they would get an error like the following:

    device-mapper: space map common: space map too large
    device-mapper: transaction manager: couldn't create metadata space map
    device-mapper: thin metadata: tm_create_with_sm failed
    device-mapper: table: 252:17: thin-pool: Error creating metadata object
    device-mapper: ioctl: error adding target to table

    Fix this by allowing the initial metadata space map creation to cap its
    size at the max number of blocks supported (DM_SM_METADATA_MAX_BLOCKS).
    get_metadata_dev_size() must also impose DM_SM_METADATA_MAX_BLOCKS (via
    THIN_METADATA_MAX_SECTORS), otherwise extending metadata would cap at
    THIN_METADATA_MAX_SECTORS_WARNING (which is larger than supported).

    Also, the calculation for THIN_METADATA_MAX_SECTORS didn't account for
    the size of the disk_bitmap_header. So the supported maximum metadata
    size is a bit smaller (reduced from 33423360 to 33292800 sectors).
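
    The two figures can be reproduced from the on-disk format constants
    (a sketch; assumes 4KiB metadata blocks, a 16-byte
    disk_bitmap_header, 2 bits per entry (4 entries per byte), and at
    most 255 bitmap index entries -- constant names are illustrative):

```python
# Reproduce the old and corrected THIN_METADATA_MAX_SECTORS values.
BLOCK_SIZE = 4096            # metadata block size in bytes
BITMAP_HEADER = 16           # bytes consumed by disk_bitmap_header
ENTRIES_PER_BYTE = 4         # 2 bits of refcount per block entry
MAX_BITMAPS = 255            # max bitmap index entries in the space map
SECTORS_PER_BLOCK = BLOCK_SIZE // 512

old_entries = BLOCK_SIZE * ENTRIES_PER_BYTE                    # header ignored
new_entries = (BLOCK_SIZE - BITMAP_HEADER) * ENTRIES_PER_BYTE  # header counted

old_max_sectors = MAX_BITMAPS * old_entries * SECTORS_PER_BLOCK
new_max_sectors = MAX_BITMAPS * new_entries * SECTORS_PER_BLOCK
```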

    Lastly, remove the "excess space will not be used" warning message from
    get_metadata_dev_size(); it resulted in printing the warning multiple
    times. Factor out warn_if_metadata_device_too_big(), call it from
    pool_ctr() and maybe_resize_metadata_dev().

    Signed-off-by: Mike Snitzer
    Acked-by: Joe Thornber

    Mike Snitzer
     

18 Feb, 2014

1 commit

  • Commit 905e51b ("dm thin: commit outstanding data every second")
    introduced a periodic commit. This commit occurs regardless of whether
    any thin devices have made changes.

    Fix the periodic commit to check if any of a pool's thin devices have
    changed using dm_pool_changed_this_transaction().
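
    The fix can be sketched as a toy model (illustrative names; the real
    check is dm_pool_changed_this_transaction() on the pool's metadata):

```python
class Pool:
    """Toy model of the fixed periodic commit (not kernel code)."""
    def __init__(self):
        self.changed_this_transaction = False
        self.commits = 0

    def thin_write(self):
        self.changed_this_transaction = True

    def periodic_commit(self):
        # Runs every second; now a no-op unless something changed.
        if not self.changed_this_transaction:
            return
        self.commits += 1
        self.changed_this_transaction = False

pool = Pool()
pool.periodic_commit()      # idle second: no commit issued
pool.thin_write()
pool.periodic_commit()      # dirty second: exactly one commit
```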

    Reported-by: Alexander Larsson
    Signed-off-by: Mike Snitzer
    Acked-by: Joe Thornber
    Cc: stable@vger.kernel.org

    Mike Snitzer
     

07 Jan, 2014

1 commit

  • If a snapshot is created and later deleted the origin dm_thin_device's
    snapshotted_time will have been updated to reflect the snapshot's
    creation time. The 'shared' flag in the dm_thin_lookup_result struct
    returned from dm_thin_find_block() is an approximation based on
    snapshotted_time -- this is done to avoid O(n), or worse, time
    complexity. In this case, the shared flag would be true.

    But because the 'shared' flag reflects an approximation a block can be
    incorrectly assumed to be shared (e.g. false positive for 'shared'
    because the snapshot no longer exists). This could result in discards
    issued to a thin device not being passed down to the pool's underlying
    data device.

    To fix this we double check that a thin block is really still in-use
    after a mapping is removed using dm_pool_block_is_used(). If the
    reference count for a block is now zero the discard is allowed to be
    passed down.

    Also add a 'definitely_not_shared' member to the dm_thin_new_mapping
    structure -- this reflects that the 'shared' flag in the response from
    dm_thin_find_block() can only be taken as definitive when false is
    returned.
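
    The asymmetry of the 'shared' flag and the double check can be
    sketched as a toy model (field and function names are illustrative):

```python
# Toy model: 'shared' is an approximation that is only definitive when
# False, so re-verify via the reference count before withholding passdown.
class Thin:
    def __init__(self):
        self.snapshotted_time = 0

def find_block(thin, mapping_time, ref_count):
    # Blocks mapped before the last snapshot look shared -- even if that
    # snapshot has since been deleted (a possible false positive).
    shared = mapping_time <= thin.snapshotted_time
    return {"shared": shared, "ref_count": ref_count}

def may_pass_down_discard(result):
    if not result["shared"]:
        return True                     # definitely not shared
    return result["ref_count"] == 0     # double check after unmapping

thin = Thin()
thin.snapshotted_time = 5               # a snapshot was taken at time 5
# The snapshot was later deleted, so this block's count dropped to 0:
stale = find_block(thin, mapping_time=3, ref_count=0)
```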

    Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1043527

    Signed-off-by: Joe Thornber
    Signed-off-by: Mike Snitzer
    Cc: stable@vger.kernel.org

    Joe Thornber
     

11 Dec, 2013

1 commit

  • A thin-pool may be in read-only mode because the pool's data or metadata
    space was exhausted. To allow for recovery, by adding more space to the
    pool, we must allow a pool to transition from PM_READ_ONLY to PM_WRITE
    mode. Otherwise, running out of space will render the pool permanently
    read-only.

    Signed-off-by: Joe Thornber
    Signed-off-by: Mike Snitzer
    Cc: stable@vger.kernel.org

    Joe Thornber
     

02 Mar, 2013

1 commit