Eric Lee / smarc-fsl-linux-kernel

03 Apr, 2019

1 commit

d952c337b btrfs: don't report readahead errors and don't update statistics ... Browse Code »

commit 0cc068e6ee59c1fffbfa977d8bf868b7551d80ac upstream.

As readahead is an optimization, all errors are usually filtered out,
but still properly handled when the real read call is done. The commit
5e9d398240b2 ("btrfs: readpages() should submit IO as read-ahead") added
REQ_RAHEAD to readpages() because that's only used for readahead
(despite what one would expect from the callback name).

This causes a flood of messages and inflated read error stats, so skip
reporting in case it's readahead.

Bugzilla: https://bugzilla.kernel.org/show_bug.cgi?id=202403
Reported-by: LimeTech
Fixes: 5e9d398240b2 ("btrfs: readpages() should submit IO as read-ahead")
CC: stable@vger.kernel.org # 4.19+
Signed-off-by: David Sterba
Signed-off-by: Greg Kroah-Hartman

David Sterba
2019-04-03 12:26:21 +0800

24 Mar, 2019

1 commit

1a00f7fd0 btrfs: ensure that a DUP or RAID1 block group has exactly two stripes ... Browse Code »

commit 349ae63f40638a28c6fce52e8447c2d14b84cc0c upstream.

We recently had a customer issue with a corrupted filesystem. When
trying to mount this image btrfs panicked with a division by zero in
calc_stripe_length().

The corrupt chunk had a 'num_stripes' value of 1. calc_stripe_length()
takes this value and divides it by the number of copies the RAID profile
is expected to have to calculate the amount of data stripes. As a DUP
profile is expected to have 2 copies this division resulted in 1/2 = 0.
Later then the 'data_stripes' variable is used as a divisor in the
stripe length calculation which results in a division by 0 and thus a
kernel panic.

When encountering a filesystem with a DUP block group and a
'num_stripes' value unequal to 2, refuse mounting as the image is
corrupted and will lead to unexpected behaviour.

Code inspection showed a RAID1 block group has the same issues.

Fixes: e06cd3dd7cea ("Btrfs: add validadtion checks for chunk loading")
CC: stable@vger.kernel.org # 4.4+
Reviewed-by: Qu Wenruo
Reviewed-by: Nikolay Borisov
Signed-off-by: Johannes Thumshirn
Reviewed-by: David Sterba
Signed-off-by: David Sterba
Signed-off-by: Greg Kroah-Hartman

Johannes Thumshirn
2019-03-24 03:10:00 +0800

13 Feb, 2019

1 commit

3733632e8 btrfs: harden agaist duplicate fsid on scanned devices ... Browse Code »

[ Upstream commit a9261d4125c97ce8624e9941b75dee1b43ad5df9 ]

It's not that impossible to imagine that a device OR a btrfs image is
copied just by using the dd or the cp command. Which in case both the
copies of the btrfs will have the same fsid. If on the system with
automount enabled, the copied FS gets scanned.

We have a known bug in btrfs, that we let the device path be changed
after the device has been mounted. So using this loop hole the new
copied device would appears as if its mounted immediately after it's
been copied.

For example:

Initially.. /dev/mmcblk0p4 is mounted as /

$ lsblk
NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINT
mmcblk0 179:0 0 29.2G 0 disk
|-mmcblk0p4 179:4 0 4G 0 part /
|-mmcblk0p2 179:2 0 500M 0 part /boot
|-mmcblk0p3 179:3 0 256M 0 part [SWAP]
`-mmcblk0p1 179:1 0 256M 0 part /boot/efi

$ btrfs fi show
Label: none uuid: 07892354-ddaa-4443-90ea-f76a06accaba
Total devices 1 FS bytes used 1.40GiB
devid 1 size 4.00GiB used 3.00GiB path /dev/mmcblk0p4

Copy mmcblk0 to sda

$ dd if=/dev/mmcblk0 of=/dev/sda

And immediately after the copy completes the change in the device
superblock is notified which the automount scans using btrfs device scan
and the new device sda becomes the mounted root device.

$ lsblk
NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINT
sda 8:0 1 14.9G 0 disk
|-sda4 8:4 1 4G 0 part /
|-sda2 8:2 1 500M 0 part
|-sda3 8:3 1 256M 0 part
`-sda1 8:1 1 256M 0 part
mmcblk0 179:0 0 29.2G 0 disk
|-mmcblk0p4 179:4 0 4G 0 part
|-mmcblk0p2 179:2 0 500M 0 part /boot
|-mmcblk0p3 179:3 0 256M 0 part [SWAP]
`-mmcblk0p1 179:1 0 256M 0 part /boot/efi

$ btrfs fi show /
Label: none uuid: 07892354-ddaa-4443-90ea-f76a06accaba
Total devices 1 FS bytes used 1.40GiB
devid 1 size 4.00GiB used 3.00GiB path /dev/sda4

The bug is quite nasty that you can't either unmount /dev/sda4 or
/dev/mmcblk0p4. And the problem does not get solved until you take sda
out of the system on to another system to change its fsid using the
'btrfstune -u' command.

Signed-off-by: Anand Jain
Reviewed-by: David Sterba
Signed-off-by: David Sterba
Signed-off-by: Sasha Levin

Anand Jain
2019-02-13 02:47:10 +0800

26 Jan, 2019

2 commits

720b86a53 btrfs: alloc_chunk: fix more DUP stripe size handling ... Browse Code »

[ Upstream commit baf92114c7e6dd6124aa3d506e4bc4b694da3bc3 ]

Commit 92e222df7b "btrfs: alloc_chunk: fix DUP stripe size handling"
fixed calculating the stripe_size for a new DUP chunk.

However, the same calculation reappears a bit later, and that one was
not changed yet. The resulting bug that is exposed is that the newly
allocated device extents ('stripes') can have a few MiB overlap with the
next thing stored after them, which is another device extent or the end
of the disk.

The scenario in which this can happen is:
* The block device for the filesystem is less than 10GiB in size.
* The amount of contiguous free unallocated disk space chosen to use for
chunk allocation is 20% of the total device size, or a few MiB more or
less.

An example:
- The filesystem device is 7880MiB (max_chunk_size gets set to 788MiB)
- There's 1578MiB unallocated raw disk space left in one contiguous
piece.

In this case stripe_size is first calculated as 789MiB, (half of
1578MiB).

Since 789MiB (stripe_size * data_stripes) > 788MiB (max_chunk_size), we
enter the if block. Now stripe_size value is immediately overwritten
while calculating an adjusted value based on max_chunk_size, which ends
up as 788MiB.

Next, the value is rounded up to a 16MiB boundary, 800MiB, which is
actually more than the value we had before. However, the last comparison
fails to detect this, because it's comparing the value with the total
amount of free space, which is about twice the size of stripe_size.

In the example above, this means that the resulting raw disk space being
allocated is 1600MiB, while only a gap of 1578MiB has been found. The
second device extent object for this DUP chunk will overlap for 22MiB
with whatever comes next.

The underlying problem here is that the stripe_size is reused all the
time for different things. So, when entering the code in the if block,
stripe_size is immediately overwritten with something else. If later we
decide we want to have the previous value back, then the logic to
compute it was copy pasted in again.

With this change, the value in stripe_size is not unnecessarily
destroyed, so the duplicated calculation is not needed any more.

Signed-off-by: Hans van Kranenburg
Signed-off-by: David Sterba
Signed-off-by: Sasha Levin

Hans van Kranenburg
2019-01-26 16:32:39 +0800
bb5717a4a btrfs: volumes: Make sure there is no overlap of dev extents at mount time ... Browse Code »

[ Upstream commit 5eb193812a42dc49331f25137a38dfef9612d3e4 ]

Enhance btrfs_verify_dev_extents() to remember previous checked dev
extents, so it can verify no dev extents can overlap.

Analysis from Hans:

"Imagine allocating a DATA|DUP chunk.

In the chunk allocator, we first set...
max_stripe_size = SZ_1G;
max_chunk_size = BTRFS_MAX_DATA_CHUNK_SIZE
... which is 10GiB.

Then...
/* we don't want a chunk larger than 10% of writeable space */
max_chunk_size = min(div_factor(fs_devices->total_rw_bytes, 1),
max_chunk_size);

Imagine we only have one 7880MiB block device in this filesystem. Now
max_chunk_size is down to 788MiB.

The next step in the code is to search for max_stripe_size * dev_stripes
amount of free space on the device, which is in our example 1GiB * 2 =
2GiB. Imagine the device has exactly 1578MiB free in one contiguous
piece. This amount of bytes will be put in devices_info[ndevs - 1].max_avail

Next we recalculate the stripe_size (which is actually the device extent
length), based on the actual maximum amount of available raw disk space:
stripe_size = div_u64(devices_info[ndevs - 1].max_avail, dev_stripes);

stripe_size is now 789MiB

Next we do...
data_stripes = num_stripes / ncopies
...where data_stripes ends up as 1, because num_stripes is 2 (the amount
of device extents we're going to have), and DUP has ncopies 2.

Next there's a check...
if (stripe_size * data_stripes > max_chunk_size)
...which matches because 789MiB * 1 > 788MiB.

We go into the if code, and next is...
stripe_size = div_u64(max_chunk_size, data_stripes);
...which resets stripe_size to max_chunk_size: 788MiB

Next is a fun one...
/* bump the answer up to a 16MB boundary */
stripe_size = round_up(stripe_size, SZ_16M);
...which changes stripe_size from 788MiB to 800MiB.

We're not done changing stripe_size yet...
/* But don't go higher than the limits we found while searching
* for free extents
*/
stripe_size = min(devices_info[ndevs - 1].max_avail,
stripe_size);

This is bad. max_avail is twice the stripe_size (we need to fit 2 device
extents on the same device for DUP).

The result here is that 800MiB < 1578MiB, so it's unchanged. However,
the resulting DUP chunk will need 1600MiB disk space, which isn't there,
and the second dev_extent might extend into the next thing (next
dev_extent? end of device?) for 22MiB.

The last shown line of code relies on a situation where there's twice
the value of stripe_size present as value for the variable stripe_size
when it's DUP. This was actually the case before commit 92e222df7b
"btrfs: alloc_chunk: fix DUP stripe size handling", from which I quote:
"[...] in the meantime there's a check to see if the stripe_size does
not exceed max_chunk_size. Since during this check stripe_size is twice
the amount as intended, the check will reduce the stripe_size to
max_chunk_size if the actual correct to be used stripe_size is more than
half the amount of max_chunk_size."

In the previous version of the code, the 16MiB alignment (why is this
done, by the way?) would result in a 50% chance that it would actually
do an 8MiB alignment for the individual dev_extents, since it was
operating on double the size. Does this matter?

Does it matter that stripe_size can be set to anything which is not
16MiB aligned because of the amount of remaining available disk space
which is just taken?

What is the main purpose of this round_up?

The most straightforward thing to do seems something like...
stripe_size = min(
div_u64(devices_info[ndevs - 1].max_avail, dev_stripes),
stripe_size
)
..just putting half of the max_avail into stripe_size."

Link: https://lore.kernel.org/linux-btrfs/b3461a38-e5f8-f41d-c67c-2efac8129054@mendix.com/
Reported-by: Hans van Kranenburg
Signed-off-by: Qu Wenruo
[ add analysis from report ]
Signed-off-by: David Sterba
Signed-off-by: Sasha Levin

Qu Wenruo
2019-01-26 16:32:39 +0800

17 Jan, 2019

1 commit

829431a2a Btrfs: fix access to available allocation bits when starting balance ... Browse Code »

commit 5a8067c0d17feb7579db0476191417b441a8996e upstream.

The available allocation bits members from struct btrfs_fs_info are
protected by a sequence lock, and when starting balance we access them
incorrectly in two different ways:

1) In the read sequence lock loop at btrfs_balance() we use the values we
read from fs_info->avail_*_alloc_bits and we can immediately do actions
that have side effects and can not be undone (printing a message and
jumping to a label). This is wrong because a retry might be needed, so
our actions must not have side effects and must be repeatable as long
as read_seqretry() returns a non-zero value. In other words, we were
essentially ignoring the sequence lock;

2) Right below the read sequence lock loop, we were reading the values
from avail_metadata_alloc_bits and avail_data_alloc_bits without any
protection from concurrent writers, that is, reading them outside of
the read sequence lock critical section.

So fix this by making sure we only read the available allocation bits
while in a read sequence lock critical section and that what we do in the
critical section is repeatable (has nothing that can not be undone) so
that any eventual retry that is needed is handled properly.

Fixes: de98ced9e743 ("Btrfs: use seqlock to protect fs_info->avail_{data, metadata, system}_alloc_bits")
Fixes: 14506127979a ("btrfs: fix a bogus warning when converting only data or metadata")
Reviewed-by: Nikolay Borisov
Signed-off-by: Filipe Manana
Reviewed-by: David Sterba
Signed-off-by: David Sterba
Signed-off-by: Greg Kroah-Hartman

Filipe Manana
2019-01-17 05:04:37 +0800

23 Aug, 2018

1 commit

801660b04 btrfs: btrfs_shrink_device should call commit transaction at the end ... Browse Code »

Test case btrfs/164 reports use-after-free:

[ 6712.084324] general protection fault: 0000 [#1] PREEMPT SMP
..
[ 6712.195423] btrfs_update_commit_device_size+0x75/0xf0 [btrfs]
[ 6712.201424] btrfs_commit_transaction+0x57d/0xa90 [btrfs]
[ 6712.206999] btrfs_rm_device+0x627/0x850 [btrfs]
[ 6712.211800] btrfs_ioctl+0x2b03/0x3120 [btrfs]

Reason for this is that btrfs_shrink_device adds the resized device to
the fs_devices::resized_devices after it has called the last commit
transaction.

So the list fs_devices::resized_devices is not empty when
btrfs_shrink_device returns. Now the parent function
btrfs_rm_device calls:

btrfs_close_bdev(device);
call_rcu(&device->rcu, free_device_rcu);

and then does the transactio ncommit. It goes through the
fs_devices::resized_devices in btrfs_update_commit_device_size and
leads to use-after-free.

Fix this by making sure btrfs_shrink_device calls the last needed
btrfs_commit_transaction before the return. This is consistent with what
the grow counterpart does and this makes sure the on-disk state is
persistent when the function returns.

Reported-by: Lu Fengqi
Tested-by: Lu Fengqi
Signed-off-by: Anand Jain
Reviewed-by: David Sterba
[ update changelog ]
Signed-off-by: David Sterba

Anand Jain
2018-08-23 23:37:27 +0800

06 Aug, 2018

33 commits

39379faaa btrfs: revert fs_devices state on error of btrfs_init_new_device ... Browse Code »

When btrfs hits error after modifying fs_devices in
btrfs_init_new_device() (such as btrfs_add_dev_item() returns error), it
leaves everything as is, but frees allocated btrfs_device. As a result,
fs_devices->devices and fs_devices->alloc_list contain already freed
btrfs_device, leading to later use-after-free bug.

Error path also messes the things like ->num_devices. While they go back
to the original value by unscanning btrfs devices, it is safe to revert
them here.

Fixes: 79787eaab461 ("btrfs: replace many BUG_ONs with proper error handling")
Signed-off-by: Naohiro Aota
Reviewed-by: Filipe Manana
Signed-off-by: David Sterba

Naohiro Aota
2018-08-06 19:13:04 +0800
64f64f43c btrfs: Exit gracefully when chunk map cannot be inserted to the tree ... Browse Code »

It's entirely possible that a crafted btrfs image contains overlapping
chunks.

Although we can't detect such problem by tree-checker, it's not a
catastrophic problem, current extent map can already detect such problem
and return -EEXIST.

We just only need to exit gracefully and fail the mount.

Reported-by: Xu Wen
Link: https://bugzilla.kernel.org/show_bug.cgi?id=200409
Signed-off-by: Qu Wenruo
Reviewed-by: David Sterba
Signed-off-by: David Sterba

Qu Wenruo
2018-08-06 19:13:03 +0800
cf90d884b btrfs: Introduce mount time chunk <-> dev extent mapping check ... Browse Code »

This patch will introduce chunk dev extent mapping check, to protect
us against invalid dev extents or chunks.

Since chunk mapping is the fundamental infrastructure of btrfs, extra
check at mount time could prevent a lot of unexpected behavior (BUG_ON).

Reported-by: Xu Wen
Link: https://bugzilla.kernel.org/show_bug.cgi?id=200403
Link: https://bugzilla.kernel.org/show_bug.cgi?id=200407
Signed-off-by: Qu Wenruo
Reviewed-by: Su Yue
Reviewed-by: David Sterba
Signed-off-by: David Sterba

Qu Wenruo
2018-08-06 19:13:03 +0800
672d59904 btrfs: Use wrapper macro for rcu string to remove duplicate code ... Browse Code »

Cleanup patch and no functional changes.

Signed-off-by: Misono Tomohiro
Reviewed-by: Qu Wenruo
Reviewed-by: David Sterba
Signed-off-by: David Sterba

Misono Tomohiro
2018-08-06 19:13:02 +0800
97aff912a btrfs: Remove fs_info from btrfs_finish_chunk_alloc ... Browse Code »

It can be referenced from the passed transaction handle.

Signed-off-by: Nikolay Borisov
Reviewed-by: Lu Fengqi
Signed-off-by: David Sterba

Nikolay Borisov
2018-08-06 19:12:58 +0800
f4208794d btrfs: Remove fs_info form btrfs_free_chunk ... Browse Code »

It can be referenced from the passed transaction handle.

Signed-off-by: Nikolay Borisov
Reviewed-by: Lu Fengqi
Signed-off-by: David Sterba

Nikolay Borisov
2018-08-06 19:12:57 +0800
4f5ad7bd6 btrfs: Remove fs_info from btrfs_destroy_dev_replace_tgtdev ... Browse Code »

This function is always passed a well-formed tgtdevice so the fs_info
can be referenced from there.

Signed-off-by: Nikolay Borisov
Reviewed-by: Lu Fengqi
Signed-off-by: David Sterba

Nikolay Borisov
2018-08-06 19:12:57 +0800
d6507cf1e btrfs: Remove fs_info from btrfs_assign_next_active_device ... Browse Code »

It can be referenced from the passed 'device' argument which is always
a well-formed device.

Signed-off-by: Nikolay Borisov
Reviewed-by: Lu Fengqi
Reviewed-by: David Sterba
Signed-off-by: David Sterba

Nikolay Borisov
2018-08-06 19:12:57 +0800
5495f195f btrfs: remove fs_info argument from update_dev_stat_item ... Browse Code »

It can be referenced from the passed transaction handle.

Signed-off-by: Nikolay Borisov
Reviewed-by: Lu Fengqi
Signed-off-by: David Sterba

Nikolay Borisov
2018-08-06 19:12:57 +0800
68a9db5f2 btrfs: Remove fs_info from btrfs_rm_dev_replace_remove_srcdev ... Browse Code »

It can be referenced from the passed srcdev argument.

Signed-off-by: Nikolay Borisov
Reviewed-by: Lu Fengqi
Signed-off-by: David Sterba

Nikolay Borisov
2018-08-06 19:12:57 +0800
8e87e8562 btrfs: Remove fs_info argument from btrfs_add_dev_item ... Browse Code »

It can be referenced form the passed transaction handle.

Signed-off-by: Nikolay Borisov
Reviewed-by: Lu Fengqi
Signed-off-by: David Sterba

Nikolay Borisov
2018-08-06 19:12:56 +0800
315409b00 btrfs: validate type when reading a chunk ... Browse Code »

Reported in https://bugzilla.kernel.org/show_bug.cgi?id=199839, with an
image that has an invalid chunk type but does not return an error.

Add chunk type check in btrfs_check_chunk_valid, to detect the wrong
type combinations.

Link: https://bugzilla.kernel.org/show_bug.cgi?id=199839
Reported-by: Xu Wen
Reviewed-by: Qu Wenruo
Signed-off-by: Gu Jinxiang
Signed-off-by: David Sterba

Gu Jinxiang
2018-08-06 19:12:55 +0800
46df06b85 btrfs: refactor block group replication factor calculation to a helper ... Browse Code »

There are many places that open code the duplicity factor of the block
group profiles, create a common helper. This can be easily extended for
more copies.

Signed-off-by: David Sterba

David Sterba
2018-08-06 19:12:53 +0800
321a4bf72 btrfs: use the assigned fs_devices instead of the dereference ... Browse Code »

We have assigned the %fs_info->fs_devices in %fs_devices as its not
modified just use it for the mutex_lock().

Signed-off-by: Anand Jain
Signed-off-by: David Sterba

Anand Jain
2018-08-06 19:12:53 +0800
36350e95a btrfs: return device pointer from btrfs_scan_one_device ... Browse Code »

Return device pointer (with the IS_ERR semantics) from
btrfs_scan_one_device so we don't have to return in through pointer.

And since btrfs_fs_devices can be obtained from btrfs_device, return that.

Signed-off-by: Gu Jinxiang
Reviewed-by: Nikolay Borisov
Reviewed-by: David Sterba
[ fixed conflics after recent changes to btrfs_scan_one_device ]
Signed-off-by: David Sterba

Gu Jinxiang
2018-08-06 19:12:48 +0800
f5194e34c btrfs: lift uuid_mutex to callers of btrfs_open_devices ... Browse Code »

Prepartory work to fix race between mount and device scan.

The callers will have to manage the critical section, eg. mount wants to
scan and then call btrfs_open_devices without the ioctl scan walking in
and modifying the fs devices in the meantime.

Reviewed-by: Anand Jain
Signed-off-by: David Sterba

David Sterba
2018-08-06 19:12:47 +0800
899f9307c btrfs: lift uuid_mutex to callers of btrfs_scan_one_device ... Browse Code »

Prepartory work to fix race between mount and device scan.

The callers will have to manage the critical section, eg. mount wants to
scan and then call btrfs_open_devices without the ioctl scan walking in
and modifying the fs devices in the meantime.

Reviewed-by: Anand Jain
Signed-off-by: David Sterba

David Sterba
2018-08-06 19:12:47 +0800
7bcb8164a btrfs: use device_list_mutex when removing stale devices ... Browse Code »

btrfs_free_stale_devices() finds a stale (not opened) device matching
path in the fs_uuid list. We are already under uuid_mutex so when we
check for each fs_devices, hold the device_list_mutex too.

Signed-off-by: Anand Jain
Reviewed-by: David Sterba
Signed-off-by: David Sterba

Anand Jain
2018-08-06 19:12:47 +0800
fa6d2ae54 btrfs: rename local devices for fs_devices in btrfs_free_stale_devices( ... Browse Code »

Over the years we named %fs_devices and %devices to represent the
struct btrfs_fs_devices and the struct btrfs_device. So follow the same
scheme here too. No functional changes.

Signed-off-by: Anand Jain
Signed-off-by: David Sterba

Anand Jain
2018-08-06 19:12:47 +0800
9c6d173ea btrfs: extend locked section when adding a new device in device_list_add ... Browse Code »

Make sure the device_list_lock is held the whole time:

* when the device is being looked up
* new device is initialized and put to the list
* the list counters are updated (fs_devices::opened, fs_devices::total_devices)

Signed-off-by: Anand Jain
[ update changelog ]
Reviewed-by: David Sterba
Signed-off-by: David Sterba

Anand Jain
2018-08-06 19:12:46 +0800
4306a9744 btrfs: do btrfs_free_stale_devices outside of device_list_add ... Browse Code »

btrfs_free_stale_devices() looks for device path reused for another
filesystem, and deletes the older fs_devices::device entry.

In preparation to handle locking in device_list_add, move
btrfs_free_stale_devices outside as these two functions serve a
different purpose.

Signed-off-by: Anand Jain
Reviewed-by: David Sterba
Signed-off-by: David Sterba

Anand Jain
2018-08-06 19:12:46 +0800
959b1c046 btrfs: close devices without offloading to a temporary list ... Browse Code »

Since commit 88c14590cdd6 ("btrfs: use RCU in btrfs_show_devname for
device list traversal") btrfs_show_devname no longer takes
device_list_mutex. As such the deadlock that 0ccd05285e7f ("btrfs: fix a
possible umount deadlock") aimed to fix no longer exists, we can free
the devices immediatelly and remove the code that does the pending work.

Signed-off-by: Nikolay Borisov
Reviewed-by: Anand Jain
[ update changelog ]
Reviewed-by: David Sterba
Signed-off-by: David Sterba

Nikolay Borisov
2018-08-06 19:12:46 +0800
621567a28 btrfs: Remove unused function btrfs_account_dev_extents_size ... Browse Code »

This function is not used since the alloc_start parameter has been
obsoleted in commit 0d0c71b317207082856 ("btrfs: obsolete and remove
mount option alloc_start").

Signed-off-by: Qu Wenruo
Reviewed-by: Nikolay Borisov
Reviewed-by: David Sterba
Signed-off-by: David Sterba

Qu Wenruo
2018-08-06 19:12:46 +0800
b4993e64f btrfs: fix in-memory value of total_devices after seed device deletion ... Browse Code »

In case of deleting the seed device the %cur_devices (seed) and the
%fs_devices (parent) are different. Now, as the parent
fs_devices::total_devices also maintains the total number of devices
including the seed device, so decrement its in-memory value for the
successful seed delete. We are already updating its corresponding
on-disk btrfs_super_block::number_devices value.

Signed-off-by: Anand Jain
Signed-off-by: David Sterba

Anand Jain
2018-08-06 19:12:45 +0800
d7f663fa3 btrfs: prune unused includes ... Browse Code »

Remove includes if none of the interfaces and exports is used in the
given source file.

Signed-off-by: David Sterba

David Sterba
2018-08-06 19:12:43 +0800
694c51fb2 btrfs: drop unnecessary variable in btrfs_init_new_device ... Browse Code »

There is only usage of the declared devices variable, instead use its
value directly.

Signed-off-by: Anand Jain
Reviewed-by: Nikolay Borisov
Signed-off-by: David Sterba

Anand Jain
2018-08-06 19:12:42 +0800
5da54bc13 btrfs: use a temporary variable for fs_devices in btrfs_init_new_device ... Browse Code »

There are many instances of the %fs_info->fs_devices pointer
dereferences, use a temporary variable instead.

Signed-off-by: Anand Jain
Signed-off-by: David Sterba

Anand Jain
2018-08-06 19:12:42 +0800
fce466eab btrfs: tree-checker: Verify block_group_item ... Browse Code »

A crafted image with invalid block group items could make free space cache
code to cause panic.

We could detect such invalid block group item by checking:
1) Item size
Known fixed value.
2) Block group size (key.offset)
We have an upper limit on block group item (10G)
3) Chunk objectid
Known fixed value.
4) Type
Only 4 valid type values, DATA, METADATA, SYSTEM and DATA|METADATA.
No more than 1 bit set for profile type.
5) Used space
No more than the block group size.

This should allow btrfs to detect and refuse to mount the crafted image.

Link: https://bugzilla.kernel.org/show_bug.cgi?id=199849
Reported-by: Xu Wen
Signed-off-by: Qu Wenruo
Reviewed-by: Gu Jinxiang
Reviewed-by: Nikolay Borisov
Tested-by: Gu Jinxiang
Reviewed-by: David Sterba
Signed-off-by: David Sterba

Qu Wenruo
2018-08-06 19:12:41 +0800
43a7e99db btrfs: Remove fs_info from btrfs_force_chunk_alloc ... Browse Code »

It can be referenced from the passed transaction handle.

Signed-off-by: Nikolay Borisov
Reviewed-by: Qu Wenruo
Signed-off-by: David Sterba

Nikolay Borisov
2018-08-06 19:12:38 +0800
451a2c130 btrfs: Remove fs_info from check_system_chunk ... Browse Code »

It can be referenced from trans since the function is always called
within a transaction.

Signed-off-by: Nikolay Borisov
Reviewed-by: Qu Wenruo
Signed-off-by: David Sterba

Nikolay Borisov
2018-08-06 19:12:36 +0800
c216b2039 btrfs: Remove fs_info from btrfs_alloc_chunk ... Browse Code »

It can be referenced from trans since the function is always called
within a transaction.

Signed-off-by: Nikolay Borisov
Reviewed-by: Qu Wenruo
Signed-off-by: David Sterba

Nikolay Borisov
2018-08-06 19:12:36 +0800
5a98ec014 btrfs: Remove fs_info from btrfs_remove_block_group ... Browse Code »

This function is always called with a valid transaction handle from
where we can reference fs_info. No functional changes.

Signed-off-by: Nikolay Borisov
Reviewed-by: Qu Wenruo
Signed-off-by: David Sterba

Nikolay Borisov
2018-08-06 19:12:34 +0800
e7e02096d btrfs: Remove fs_info from btrfs_make_block_group ... Browse Code »

This function is always called with a valid transaction handle from
where we can reference the fs_info. No functional changes.

Signed-off-by: Nikolay Borisov
Reviewed-by: Qu Wenruo
Signed-off-by: David Sterba

Nikolay Borisov
2018-08-06 19:12:34 +0800