Eric Lee / smarc-fsl-linux-kernel

14 Jun, 2016

2 commits

dd4f699da drbd: when receiving P_TRIM, zero-out partial unaligned chunks

We can avoid spurious data divergence caused by partially-ignored
discards on certain backends with discard_zeroes_data=0, if we
translate partial unaligned discard requests into explicit zero-out.

The relevant use case is LVM/DM thin.

If on different nodes, DRBD is backed by devices with differing
discard characteristics, discards may lead to data divergence
(old data or garbage left over on one backend, zeroes due to
unmapped areas on the other backend). Online verify would now
potentially report tons of spurious differences.

While probably harmless for most use cases (fstrim on a file system),
DRBD cannot have that: it would violate our promise to upper layers
that our data instances on the nodes are identical.

To be correct and play safe (make sure data is identical on both copies),
we would have to disable discard support, if our local backend (on a
Primary) does not support "discard_zeroes_data=true".

We'd also have to translate discards to explicit zero-out on the
receiving (typically: Secondary) side, unless the receiving side
supports "discard_zeroes_data=true".

Which both would allocate those blocks, instead of unmapping them,
in contrast with expectations.

LVM/DM thin does set discard_zeroes_data=0,
because it silently ignores discards to partial chunks.

We can work around this by checking the alignment first.
For unaligned (wrt. alignment and granularity) or too small discards,
we zero-out the initial (and/or) trailing unaligned partial chunks,
but discard all the aligned full chunks.
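
The split described above can be sketched as follows. This is a minimal illustration, not DRBD's actual code: the struct, the function name, and the choice of sectors as the unit are assumptions made for the example. A range that contains no full aligned chunk is zeroed out entirely.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical result type: three sub-ranges, all in sectors. */
struct discard_split {
    uint64_t head_zero_start, head_zero_len; /* leading partial chunk: zero-out */
    uint64_t discard_start, discard_len;     /* aligned full chunks: discard */
    uint64_t tail_zero_start, tail_zero_len; /* trailing partial chunk: zero-out */
};

/* Split [start, start + len) by the backend's discard granularity and
 * alignment. Chunk boundaries are the positions x with
 * x % granularity == alignment. */
static struct discard_split split_discard(uint64_t start, uint64_t len,
                                          uint64_t granularity,
                                          uint64_t alignment)
{
    struct discard_split s = {0};
    uint64_t end = start + len;
    uint64_t first, last;

    assert(granularity > 0);
    alignment %= granularity;

    /* First chunk boundary at or after start. */
    first = start + (granularity + alignment - start % granularity) % granularity;

    /* Too small to contain one full chunk: zero out the whole range. */
    if (first >= end || end - first < granularity) {
        s.head_zero_start = start;
        s.head_zero_len = len;
        return s;
    }

    /* Last chunk boundary at or before end. */
    last = first + (end - first) / granularity * granularity;

    s.head_zero_start = start;
    s.head_zero_len = first - start;
    s.discard_start = first;
    s.discard_len = last - first;
    s.tail_zero_start = last;
    s.tail_zero_len = end - last;
    return s;
}
```

For example, with a granularity of 8 and alignment 0, a request for sectors 3 to 22 zeroes sectors 3 to 7, discards sectors 8 to 15, and zeroes sectors 16 to 22.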

At least for LVM/DM thin, the result is effectively "discard_zeroes_data=1".

Arguably it should behave this way internally, by default,
and we'll try to make that happen.

But our workaround is still valid for already deployed setups,
and for other devices that may behave this way.

Setting discard-zeroes-if-aligned=yes will allow DRBD to use
discards, and to announce discard_zeroes_data=true, even on
backends that announce discard_zeroes_data=false.

Setting discard-zeroes-if-aligned=no will cause DRBD to always
fall-back to zero-out on the receiving side, and to not even
announce discard capabilities on the Primary, if the respective
backend announces discard_zeroes_data=false.

We used to ignore the discard_zeroes_data setting completely.
To not break established and expected behaviour, and suddenly
cause fstrim on thin-provisioned LVs to run out-of-space,
instead of freeing up space, the default value is "yes".

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg
Signed-off-by: Jens Axboe

Lars Ellenberg
2016-06-14 11:43:05 +0800
a5ca66c41 drbd: Introduce new disk config option rs-discard-granularity

As long as the value is 0 the feature is disabled. When it is set
to a positive value, DRBD limits and aligns its resync requests
to the rs-discard-granularity setting. If the sync source detects
all zeros in such a block, the resync target discards the range
on disk.
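
A rough sketch of the decision described above, under stated assumptions: the enum, the function names, and the rule that only blocks of exactly rs-discard-granularity bytes qualify are illustrative choices, not taken from the DRBD source.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical outcome of inspecting one resync block. */
enum rs_action { RS_WRITE_DATA, RS_DISCARD };

/* Return true if every byte in the buffer is zero. */
static bool block_is_all_zero(const unsigned char *buf, size_t len)
{
    for (size_t i = 0; i < len; i++)
        if (buf[i] != 0)
            return false;
    return true;
}

/* rs_discard_granularity == 0 disables the feature. Otherwise a block of
 * exactly that size that contains only zeros is discarded on the target
 * instead of being written (assumption: partial blocks are always written). */
static enum rs_action resync_action(const unsigned char *buf, size_t len,
                                    size_t rs_discard_granularity)
{
    assert(buf != NULL);
    if (rs_discard_granularity == 0)
        return RS_WRITE_DATA;
    if (len != rs_discard_granularity)
        return RS_WRITE_DATA;
    return block_is_all_zero(buf, len) ? RS_DISCARD : RS_WRITE_DATA;
}
```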

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg
Signed-off-by: Jens Axboe

Philipp Reisner
2016-06-14 11:43:04 +0800

26 Nov, 2015

2 commits

a55bbd375 drbd: Backport the "status" command

The status command originates from the drbd9 code base. While for now we
keep the status information in /proc/drbd available, this commit
allows the user base to gracefully migrate their monitoring
infrastructure to the new status reporting interface.

In drbd9 no status information is exposed through /proc/drbd.

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg
Signed-off-by: Jens Axboe

Andreas Gruenbacher
2015-11-26 00:22:00 +0800
a29728463 drbd: Backport the "events2" command

The events2 command originates from drbd-9 development. It features
more information but requires an incompatible change in output
format.
Therefore the previous events command continues to exist, and the new
improved events2 command becomes available now.

This prepares the user-base for a later switch to the complete
drbd9 code base.

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg
Signed-off-by: Jens Axboe

Andreas Gruenbacher
2015-11-26 00:22:00 +0800

11 Jul, 2014

2 commits

5d0b17f1a drbd: New net configuration option socket-check-timeout

In setups involving a DRBD-proxy and connections that experience a lot of
buffer-bloat, it might be necessary to set ping-timeout to an
unusually high value. By default DRBD uses the same value to wait until a newly
established TCP connection is considered stable. Since the DRBD-proxy is usually
located in the same data center, such a long wait time may hinder DRBD's
connect process.

In such setups socket-check-timeout should be set to
at least the round trip time between DRBD and DRBD-proxy, i.e. in most
cases to 1.
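
One way to picture the relationship between the two options is the small sketch below. It assumes that a socket-check-timeout of 0 means "not set, fall back to ping-timeout" and that both values share the same unit; neither detail is stated in the commit message, and the function name is hypothetical.

```c
#include <assert.h>

/* Pick the wait time used to decide that a newly established connection
 * is stable. Assumption: 0 means "option not set", in which case the
 * ping-timeout value is used, as was the behaviour before this option. */
static unsigned int effective_socket_check_timeout(unsigned int socket_check_timeout,
                                                   unsigned int ping_timeout)
{
    assert(ping_timeout > 0);
    return socket_check_timeout ? socket_check_timeout : ping_timeout;
}
```

With a large ping-timeout and socket-check-timeout set to 1, the connect path would wait for 1 unit instead of the large ping-timeout value.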

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg

Philipp Reisner
2014-07-11 00:35:01 +0800
aaaba3457 drbd: implement csums-after-crash-only

Checksum based resync trades CPU cycles for network bandwidth,
in situations where we expect much of the to-be-resynced blocks
to be actually identical on both sides already.

In a "network hiccup" scenario, it won't help:
all to-be-resynced blocks will typically be different.

The use case is for the resync of *potentially* different blocks
after crash recovery -- the crash recovery had marked larger areas
(those covered by the activity log) as need-to-be-resynced,
just in case. Most of those blocks will be identical.

This option makes it possible to configure checksum based resync,
but only actually use it for the first resync after primary crash.
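
The resulting decision can be sketched like this. The function and parameter names are hypothetical; the logic simply restates the paragraph above.

```c
#include <assert.h>
#include <stdbool.h>

/* Decide whether a resync should use checksums.
 * - no checksum algorithm configured: never
 * - csums-after-crash-only unset: always, whenever an algorithm is configured
 * - csums-after-crash-only set: only for the first resync after a primary crash */
static bool use_checksum_resync(bool csums_alg_configured,
                                bool csums_after_crash_only,
                                bool first_resync_after_primary_crash)
{
    if (!csums_alg_configured)
        return false;
    if (!csums_after_crash_only)
        return true;
    return first_resync_after_primary_crash;
}
```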

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg

Lars Ellenberg
2014-07-11 00:35:00 +0800

17 Feb, 2014

2 commits

f44d0436d drbd: Define the size of res_opts->cpu_mask in a single place

Signed-off-by: Andreas Gruenbacher
Signed-off-by: Philipp Reisner

Andreas Gruenbacher
2014-02-17 23:46:48 +0800
05a10ec79 drbd: Improve some function and variable naming

Rename functions
conn_destroy() -> drbd_destroy_connection(),
drbd_minor_destroy() -> drbd_destroy_device()
drbd_adm_add_minor() -> drbd_adm_add_minor()
drbd_adm_delete_minor() -> drbd_adm_del_minor()

Rename global variable minors to drbd_devices

Signed-off-by: Andreas Gruenbacher
Signed-off-by: Philipp Reisner

Andreas Gruenbacher
2014-02-17 23:44:52 +0800

28 Jun, 2013

1 commit

d752b2696 drbd: Allow online change of al-stripes and al-stripe-size

Allow changing the AL layout with a resize operation. For that,
the resize command gets two new fields: al_stripes and al_stripe_size.

In order to make the operation crash safe:
1) Lock out all IO and MD-IO
2) Write the super block with MDF_PRIMARY_IND clear
3) Write the bitmap to the new location (all zeros, since
we allow this only while connected)
4) Initialize the new AL-area
5) Write the super block with the restored MDF_PRIMARY_IND.
6) Unfreeze all IO

Since the AL-layout has no influence on the protocol, this operation
needs to be performed on both sides of a resource (if intended).

Signed-off-by: Andreas Gruenbacher
Signed-off-by: Philipp Reisner
Signed-off-by: Jens Axboe

Philipp Reisner
2013-06-28 22:04:36 +0800

09 Nov, 2012

2 commits

3174f8c50 drbd: pass some more information to userspace.

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg

Philipp Marek
2012-11-09 21:05:45 +0800
58ffa580a drbd: introduce stop-sector to online verify

We now can schedule only a specific range of sectors for online verify,
or interrupt a running verify without interrupting the connection.

Had to bump the protocol version differently, we are now 101.
Added verify_can_do_stop_sector() { protocol >= 97 && protocol != 100; }
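
Written out as a standalone function, the check quoted above looks roughly like this. The parameter name is an assumption; in the driver the value would come from the negotiated protocol version of the connection.

```c
#include <assert.h>
#include <stdbool.h>

/* A peer understands stop-sector if it speaks protocol 97 or newer,
 * with the single exception of protocol version 100. */
static bool verify_can_do_stop_sector(int protocol)
{
    return protocol >= 97 && protocol != 100;
}
```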

Also, the return value convention for worker callbacks has changed,
we returned "true/false" for "keep the connection up" in 8.3,
we return 0 for success and
Signed-off-by: Lars Ellenberg

Lars Ellenberg
2012-11-09 21:05:32 +0800

08 Nov, 2012

22 commits

9a51ab1c1 drbd: New disk option al-updates

By disabling al-updates one might increase performance. The price for
that is that when a crashed primary (that had al-updates disabled)
is reintegrated, it will receive a full resync instead of a
bitmap-based resync.

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg

Philipp Reisner
2012-11-08 23:58:31 +0800
380207d08 drbd: Load balancing of read requests

New config option for the disk section "read-balancing", with
the values: prefer-local, prefer-remote, round-robin, when-congested-remote.

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg

Philipp Reisner
2012-11-08 23:58:10 +0800
cdfda633d drbd: detach from frozen backing device

* drbd-8.3:
documentation: Documented detach's --force and disk's --disk-timeout
drbd: Implemented the disk-timeout option
drbd: Force flag for the detach operation
drbd: Allow new IOs while the local disk is in FAILED state
drbd: Bitmap IO functions can not return prematurely if the disk breaks
drbd: Added a kref to bm_aio_ctx
drbd: Hold a reference to ldev while doing meta-data IO
drbd: Keep a reference to the bio until the completion handler finished
drbd: Implemented wait_until_done_or_disk_failure()
drbd: Replaced md_io_mutex by an atomic: md_io_in_use
drbd: moved md_io into mdev
drbd: Immediately allow completion of IOs, that wait for IO completions on a failed disk
drbd: Keep a reference to barrier acked requests

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg

Philipp Reisner
2012-11-08 23:57:50 +0800
6dff29022 drbd: Rename --dry-run to --tentative

drbdadm already has a --dry-run option, so this option cannot directly be
passed through to drbdsetup. Rename the drbdsetup option to resolve this
conflict.

For backward compatibility, make --dry-run an alias of --tentative.

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg

Andreas Gruenbacher
2012-11-08 23:57:47 +0800
089c075d8 drbd: Convert the generic netlink interface to accept connection endpoints

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg

Andreas Gruenbacher
2012-11-08 23:57:46 +0800
7c3063cc6 drbd: Also need to check for DRBD_GENLA_F_MANDATORY flags before nla_find_nested()

This is done by introducing drbd_nla_find_nested() which handles the flag
before calling nla_find_nested().

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg

Andreas Gruenbacher
2012-11-08 23:57:45 +0800
789c1b626 drbd: Use the terminology suggested by the command names in the source code and messages

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg

Andreas Gruenbacher
2012-11-08 23:57:44 +0800
5f9359201 drbd: Make drbd's use of netlink attribute flags less confusing

Make it more clear in the flag names which flags are internal to drbd, and
which are not.

The check for mandatory attributes is the only extension visible at the netlink
layer. Attributes with this flag set would look like unknown attributes to
some kernel versions. The netlink layer would ignore them and also skip
consistency checks on the attribute type and length. To avoid this, we check
for mandatory attributes first, remove the mandatory flag, and then process the
attributes normally.
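
The "check first, strip the flag, then process normally" order can be sketched in plain C as below. The attribute struct, the flag bit value, and the rule that an unknown mandatory attribute is rejected are assumptions made for this example; the real code operates on struct nlattr inside the kernel.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define EXAMPLE_F_MANDATORY (1u << 14) /* hypothetical flag bit in the type field */

/* Simplified stand-in for a netlink attribute header. */
struct example_attr {
    uint16_t type;
    uint16_t len;
};

/* Pre-pass over the attributes: for each one carrying the mandatory flag,
 * clear the flag so later generic parsing sees a plain, known type.
 * Returns -1 if a mandatory attribute has a type we do not know
 * (assumed policy), 0 otherwise. */
static int strip_mandatory_flags(struct example_attr *attrs, size_t n,
                                 uint16_t max_known_type)
{
    assert(attrs != NULL || n == 0);
    for (size_t i = 0; i < n; i++) {
        if (attrs[i].type & EXAMPLE_F_MANDATORY) {
            attrs[i].type = (uint16_t)(attrs[i].type & ~EXAMPLE_F_MANDATORY);
            if (attrs[i].type > max_known_type)
                return -1;
        }
    }
    return 0;
}
```

After this pass, the usual type and length validation applies to every attribute, because none of them looks like an unknown type any more.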

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg

Andreas Gruenbacher
2012-11-08 23:55:55 +0800
3a45abd57 drbd: Convert resync-after into a signed netlink field

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg

Andreas Gruenbacher
2012-11-08 23:55:51 +0800
95f8efd08 drbd: Fix the upper limit of resync-after

The 32-bit resync_after netlink field takes a device minor number as
parameter, which is no longer limited to 255. We cannot statically
verify which device numbers are valid, so set the upper limit to the
highest possible signed 32-bit integer.

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg

Andreas Gruenbacher
2012-11-08 23:55:51 +0800
69ef82dea drbd: Refer to connect-int consistently throughout the code

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg

Andreas Gruenbacher
2012-11-08 23:55:50 +0800
6394b9358 drbd: Refer to resync-rate consistently throughout the code

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg

Andreas Gruenbacher
2012-11-08 23:55:50 +0800
6139f60dc drbd: Rename the want_lose field/flag to discard_my_data

This is what it is called in config files and on the command line as
well.

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg

Andreas Gruenbacher
2012-11-08 23:55:49 +0800
7bac3e6f7 drbd: Also define the default values of boolean flags in a single place

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg

Andreas Gruenbacher
2012-11-08 23:55:49 +0800
bb77d34ec drbd: Turn no-tcp-cork into tcp-cork={yes|no}

Change the --no-tcp-cork drbdsetup command line option as well as
the no_cork netlink packet.

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg

Andreas Gruenbacher
2012-11-08 23:55:46 +0800
e544046ab drbd: Turn no-md-flushes into md-flushes={yes|no}

Change the --no-md-flushes drbdsetup command line option as well as
the no_md_flush netlink packet.

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg

Andreas Gruenbacher
2012-11-08 23:55:46 +0800
d0c980e23 drbd: Turn no-disk-drain into disk-drain={yes|no}

Change the --no-disk-drain drbdsetup command line option as well as
the no_disk_drain netlink packet.

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg

Andreas Gruenbacher
2012-11-08 23:55:46 +0800
66b2f6b9c drbd: Turn no-disk-flushes into disk-flushes={yes|no}

Change the --no-disk-flushes drbdsetup command line option as well as
the no_disk_flush netlink packet.

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg

Andreas Gruenbacher
2012-11-08 23:55:45 +0800
563e4cf25 drbd: Introduce __s32_field in the genetlink macro magic

...and drop explicit typecasts (int)meta_dev_idx < 0.

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg

Lars Ellenberg
2012-11-08 23:55:43 +0800
b966b5dd8 drbd: Generate the drbd_set_*_defaults() functions from drbd_genl.h

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg

Andreas Gruenbacher
2012-11-08 23:55:38 +0800
0c8e36d9b drbd: Introduce protocol version 100 headers

The 8 byte header finally becomes too small. With the protocol 100 header we
have 16 bit for the volume number, proper 32 bit for the data length, and
32 bit for further extensions in the future.
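
A possible layout matching that description is sketched below. Only the 16-bit volume, the 32-bit length, and the 32 reserved bits come from the text above; the magic and command fields, their order, the struct name, and the resulting 16-byte total are assumptions for illustration.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Illustrative on-the-wire header; field order and the magic/command
 * fields are assumed, not taken from the commit message. */
struct example_header100 {
    uint32_t magic;   /* assumed: identifies the header version */
    uint16_t volume;  /* 16 bit for the volume number */
    uint16_t command; /* assumed: packet type */
    uint32_t length;  /* proper 32 bit for the data length */
    uint32_t pad;     /* 32 bit reserved for future extensions */
};

/* With natural alignment the fields pack without holes. */
static_assert(sizeof(struct example_header100) == 16,
              "example header is expected to be 16 bytes");
```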

Previous versions of drbd are using version 80 headers for all packets
short enough for protocol 80. They support both header versions in
worker context, but only version 80 headers in asynchronous context.
For backwards compatibility, continue to use version 80 headers for
short packets before protocol version 100.

From protocol version 100 on, use the same header version for all
packets.

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg

Andreas Gruenbacher
2012-11-08 23:45:10 +0800
f399002e6 drbd: distribute former syncer_conf settings to disk, connection, and resource level

This commit breaks the API again.

Move per-volume former syncer options into disk_conf.
Move per-connection former syncer options into net_conf.
Renamed the remaining sync_conf to res_opts.

Syncer settings have been changeable at runtime, so we need to prepare
for these settings to be runtime-changeable in their new home as well.

Introduce new configuration operations, and share the netlink attribute
between "attach" (create new disk) and "disk-opts" (change options).
Same for "connect" and "net-opts".

Some fields cannot be changed at runtime, however.
Introduce a new flag GENLA_F_INVARIANT to be able to trigger on that in
the generated validation and assignment functions.

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg

Lars Ellenberg
2012-11-08 23:44:20 +0800

04 Nov, 2012

1 commit

85f75dd76 drbd: introduce in-kernel "down" command

This greatly simplifies deconfiguration of whole resources.

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg

Lars Ellenberg
2012-11-04 07:16:23 +0800

14 Oct, 2011

1 commit

ec2c35ac1 drbd: prepare the transition from connector to genetlink

This adds the new API header and helper files.

Signed-off-by: Philipp Reisner
Signed-off-by: Lars Ellenberg

Lars Ellenberg
2011-10-14 22:48:08 +0800