Eric Lee / smarc-ti-linux-kernel | Embedian Git Server

05 Oct, 2014

1 commit

b277da0a8 block: disable entropy contributions for nonrot devices ... Browse Code »
13

Clear QUEUE_FLAG_ADD_RANDOM in all block drivers that set
QUEUE_FLAG_NONROT.

Historically, all block devices have automatically made entropy
contributions. But as previously stated in commit e2e1a148 ("block: add
sysfs knob for turning off disk entropy contributions"):
- On SSD disks, the completion times aren't as random as they
are for rotational drives. So it's questionable whether they
should contribute to the random pool in the first place.
- Calling add_disk_randomness() has a lot of overhead.

There are more reliable sources for randomness than non-rotational block
devices. From a security perspective it is better to err on the side of
caution than to allow entropy contributions from unreliable "random"
sources.

Signed-off-by: Mike Snitzer
Signed-off-by: Jens Axboe

Mike Snitzer
2014-10-05 00:55:32 +0800

05 Aug, 2014

22 commits

0781c8748 bcache: Drop unneeded blk_sync_queue() calls ... Browse Code »

this is needed for the queue/block device we created (it's done by
blk_cleanup_queue() which we do call) - but calling it for the block devices we
only opened is pointless.

Change-Id: I53dfded14ed15b9581d10ca8399d5e1b3abbf9f2

Kent Overstreet
2014-08-05 06:23:04 +0800
789d21dbd bcache: add mutex lock for bch_is_open ... Browse Code »

Since bch_is_open will iterate linked list bch_cache_sets and
uncached_devices, it needs bch_register_lock.

Signed-off-by: Jianjian Huo

Jianjian Huo
2014-08-05 06:23:04 +0800
5b25abade bcache: Correct printing of btree_gc_max_duration_ms ... Browse Code »

time_stats::btree_gc_max_duration_mc is not bit shifted by 8

Fixes BUG #138

Change-Id: I44fc6e1d0579674016acc533f1a546b080e5371a
Signed-off-by: Surbhi Palande

Surbhi Palande
2014-08-05 06:23:04 +0800
2452cc890 bcache: try to set b->parent properly ... Browse Code »

bcache_flash_dev.ktest would reliably crash with 8k and 16k bucket size
before; now it passes.

Change-Id: Ib542232235e39298c3a7548fe52b645cabb823d1

Slava Pestov
2014-08-05 06:23:04 +0800
c9a78332b bcache: fix memory corruption in init error path ... Browse Code »

If register_cache_set() failed, we would touch ca->set after
it had already been freed. Also, fix an assertion to catch
this.

Change-Id: I748e5f5b223e2d9b2602075dec2f997cced2394d

Slava Pestov
2014-08-05 06:23:04 +0800
bf0c55c98 bcache: fix crash with incomplete cache set ... Browse Code »

Change-Id: I6abde52afe917633480caaf4e2518f42a816d886

Slava Pestov
2014-08-05 06:23:04 +0800
d83353b31 bcache: Fix more early shutdown bugs ... Browse Code »

Signed-off-by: Kent Overstreet

Kent Overstreet
2014-08-05 06:23:04 +0800
400ffaa2a bcache: fix use-after-free in btree_gc_coalesce() ... Browse Code »

If we goto out_nocoalesce after we free new_nodes[0], we end up freeing
new_nodes[0] again. This was generating a lockdep warning. The fix is
to set new_nodes[0] to NULL, since the out_nocoalesce path safely
ignores NULL entries in the new_nodes array.

This regression was introduced in 2d7f9531.

Change-Id: I76564d7257800583214376b4bacf236cda90c89c

Slava Pestov
2014-08-05 06:23:04 +0800
6b708de64 bcache: Fix an infinite loop in journal replay ... Browse Code »

When running with multiple cache devices, if one of the devices has a completely
empty journal but we'd already found some journal entries on a previosu device
we'd go into an infinite loop.

Change-Id: I1dcdc0d738192746de28f40e8b08825b0dea5e2b
Signed-off-by: Kent Overstreet

Kent Overstreet
2014-08-05 06:23:03 +0800
913dc33fb bcache: fix crash in bcache_btree_node_alloc_fail tracepoint ... Browse Code »

'b' was NULL.

Change-Id: Icac0fd04afa2d23f213d96d51afd53374e6dd0c0

Slava Pestov
2014-08-05 06:23:03 +0800
60ae81eee bcache: bcache_write tracepoint was crashing ... Browse Code »

Signed-off-by: Kent Overstreet

Slava Pestov
2014-08-05 06:23:03 +0800
8e0948080 bcache: fix typo in bch_bkey_equal_header ... Browse Code »

Signed-off-by: Kent Overstreet

Slava Pestov
2014-08-05 06:23:03 +0800
501d52a90 bcache: Allocate bounce buffers with GFP_NOWAIT ... Browse Code »

There's no point in blocking on these allocations, since our fallback paths will
probably go faster than blocking.

Change-Id: I733ca202c25cb36bde02607a0a60552229a4241c

Kent Overstreet
2014-08-05 06:23:03 +0800
bcf090e00 bcache: Make sure to pass GFP_WAIT to mempool_alloc() ... Browse Code »
3

this was very wrong - mempool_alloc() only guarantees success with GFP_WAIT.
bcache uses GFP_NOWAIT in various other places where we have a fallback,
circuits must've gotten crossed when writing this code or something.

Signed-off-by: Kent Overstreet

Kent Overstreet
2014-08-05 06:23:03 +0800
9e5c35351 bcache: fix uninterruptible sleep in writeback thread ... Browse Code »

There were two issues here:

- writeback thread did not start until the device first became dirty
- writeback thread used uninterruptible sleep once running

Without this patch I see kernel warnings printed and a load average of
1.52 after booting my test VM. With this patch the warnings are gone and
the load average is near 0.00 as expected.

Signed-off-by: Kent Overstreet

Slava Pestov
2014-08-05 06:23:03 +0800
c5aa4a315 bcache: wait for buckets when allocating new btree root ... Browse Code »

Tested:
- sometimes bcache_tier test would hang on startup with a failure
to allocate the btree root -- no longer seeing this

Signed-off-by: Kent Overstreet

Slava Pestov
2014-08-05 06:23:03 +0800
a664d0f05 bcache: fix crash on shutdown in passthrough mode ... Browse Code »

We never started the writeback thread in this case, so don't stop it.

Slava Pestov
2014-08-05 06:23:03 +0800
e5112201c bcache: fix lockdep warnings on shutdown Browse Code »

Slava Pestov
2014-08-05 06:23:03 +0800
8b326d3a2 bcache allocator: send discards with correct size Browse Code »

Slava Pestov
2014-08-05 06:23:03 +0800
dbd810ab6 bcache: Fix to remove the rcu_sched stalls. ... Browse Code »

while loop was executing infinitely.
This fix ends the while loop gracefully.

Signed-off-by: Surbhi Palande
Signed-off-by: Kent Overstreet

Surbhi Palande
2014-08-05 06:23:02 +0800
9aa61a992 bcache: Fix a journal replay bug ... Browse Code »

journal replay wansn't validating pointers with bch_extent_invalid() before
derefing, fixed

Signed-off-by: Kent Overstreet

Kent Overstreet
2014-08-05 06:23:02 +0800
5b1016e62 bcache: Fix a bug when detaching ... Browse Code »

After detaching a backing device from a cache set, a bit wasn't getting
reset meaning the second detach wouldn't work correctly.

Signed-off-by: Kent Overstreet

Kent Overstreet
2014-08-05 06:23:02 +0800

18 Apr, 2014

1 commit

4e857c58e arch: Mass conversion of smp_mb__*() ... Browse Code »

Mostly scripted conversion of the smp_mb__* barriers.

Signed-off-by: Peter Zijlstra
Acked-by: Paul E. McKenney
Link: http://lkml.kernel.org/n/tip-55dhyhocezdw1dg7u19hmh1u@git.kernel.org
Cc: Linus Torvalds
Cc: linux-arch@vger.kernel.org
Signed-off-by: Ingo Molnar

Peter Zijlstra
2014-04-18 20:20:48 +0800

19 Mar, 2014

16 commits

cb8511495 bcache: remove nested function usage ... Browse Code »

Uninlined nested functions can cause crashes when using ftrace, as they don't
follow the normal calling convention and confuse the ftrace function graph
tracer as it examines the stack.

Also, nested functions are supported as a gcc extension, but may fail on other
compilers (e.g. llvm).

Signed-off-by: John Sheu

John Sheu
2014-03-19 03:39:28 +0800
3a2fd9d50 bcache: Kill bucket->gc_gen ... Browse Code »

gc_gen was a temporary used to recalculate last_gc, but since we only need
bucket->last_gc when gc isn't running (gc_mark_valid = 1), we can just update
last_gc directly.

Signed-off-by: Kent Overstreet

Kent Overstreet
2014-03-19 03:24:54 +0800
2531d9ee6 bcache: Kill unused freelist ... Browse Code »

This was originally added as at optimization that for various reasons isn't
needed anymore, but it does add a lot of nasty corner cases (and it was
responsible for some recently fixed bugs). Just get rid of it now.

Signed-off-by: Kent Overstreet

Kent Overstreet
2014-03-19 03:23:36 +0800
0a63b66db bcache: Rework btree cache reserve handling ... Browse Code »

This changes the bucket allocation reserves to use _real_ reserves - separate
freelists - instead of watermarks, which if nothing else makes the current code
saner to reason about and is going to be important in the future when we add
support for multiple btrees.

It also adds btree_check_reserve(), which checks (and locks) the reserves for
both bucket allocation and memory allocation for btree nodes; the old code just
kinda sorta assumed that since (e.g. for btree node splits) it had the root
locked and that meant no other threads could try to make use of the same
reserve; this technically should have been ok for memory allocation (we should
always have a reserve for memory allocation (the btree node cache is used as a
reserve and we preallocate it)), but multiple btrees will mean that locking the
root won't be sufficient anymore, and for the bucket allocation reserve it was
technically possible for the old code to deadlock.

Signed-off-by: Kent Overstreet

Kent Overstreet
2014-03-19 03:23:35 +0800
56b30770b bcache: Kill btree_io_wq ... Browse Code »

With the locking rework in the last patch, this shouldn't be needed anymore -
btree_node_write_work() only takes b->write_lock which is never held for very
long.

Signed-off-by: Kent Overstreet

Kent Overstreet
2014-03-19 03:23:35 +0800
2a285686c bcache: btree locking rework ... Browse Code »

Add a new lock, b->write_lock, which is required to actually modify - or write -
a btree node; this lock is only held for short durations.

This means we can write out a btree node without taking b->lock, which _is_ held
for long durations - solving a deadlock when btree_flush_write() (from the
journalling code) is called with a btree node locked.

Right now just occurs in bch_btree_set_root(), but with an upcoming journalling
rework is going to happen a lot more.

This also turns b->lock is now more of a read/intent lock instead of a
read/write lock - but not completely, since it still blocks readers. May turn it
into a real intent lock at some point in the future.

Signed-off-by: Kent Overstreet

Kent Overstreet
2014-03-19 03:23:35 +0800
05335cff9 bcache: Fix a race when freeing btree nodes ... Browse Code »

This isn't a bulletproof fix; btree_node_free() -> bch_bucket_free() puts the
bucket on the unused freelist, where it can be reused right away without any
ordering requirements. It would be better to wait on at least a journal write to
go down before reusing the bucket. bch_btree_set_root() does this, and inserting
into non leaf nodes is completely synchronous so we should be ok, but future
patches are just going to get rid of the unused freelist - it was needed in the
past for various reasons but shouldn't be anymore.

Signed-off-by: Kent Overstreet

Kent Overstreet
2014-03-19 03:23:34 +0800
4fe6a8167 bcache: Add a real GC_MARK_RECLAIMABLE ... Browse Code »

This means the garbage collection code can better check for data and metadata
pointers to the same buckets.

Signed-off-by: Kent Overstreet

Kent Overstreet
2014-03-19 03:22:36 +0800
c13f3af92 bcache: Add bch_keylist_init_single() ... Browse Code »

This will potentially save us an allocation when we've got inode/dirent bkeys
that don't fit in the keylist's inline keys.

Signed-off-by: Kent Overstreet

Kent Overstreet
2014-03-19 03:22:36 +0800
157540205 bcache: Improve priority_stats ... Browse Code »

Break down data into clean data/dirty data/metadata.

Signed-off-by: Kent Overstreet

Kent Overstreet
2014-03-19 03:22:35 +0800
7159b1ad3 bcache: Better alloc tracepoints ... Browse Code »

Change the invalidate tracepoint to indicate how much data we're invalidating,
and change the alloc tracepoints to indicate what offset they're for.

Signed-off-by: Kent Overstreet

Kent Overstreet
2014-03-19 03:22:35 +0800
3f5e0a34d bcache: Kill dead cgroup code ... Browse Code »

This hasn't been used or even enabled in ages.

Signed-off-by: Kent Overstreet

Kent Overstreet
2014-03-19 03:22:35 +0800
3f6ef3811 bcache: stop moving_gc marking buckets that can't be moved. ... Browse Code »

Signed-off-by: Nicholas Swenson

Nicholas Swenson
2014-03-19 03:22:34 +0800
10d9dcf6e bcache: Fix moving_pred() ... Browse Code »

Avoid a potential null pointer deref (e.g. from check keys for cache misses)

Signed-off-by: Kent Overstreet

Kent Overstreet
2014-03-19 03:22:34 +0800
da415a096 bcache: Fix moving_gc deadlocking with a foreground write ... Browse Code »

Deadlock happened because a foreground write slept, waiting for a bucket
to be allocated. Normally the gc would mark buckets available for invalidation.
But the moving_gc was stuck waiting for outstanding writes to complete.
These writes used the bcache_wq, the same queue foreground writes used.

This fix gives moving_gc its own work queue, so it was still finish moving
even if foreground writes are stuck waiting for allocation. It also makes
work queue a parameter to the data_insert path, so moving_gc can use its
workqueue for writes.

Signed-off-by: Nicholas Swenson
Signed-off-by: Kent Overstreet

Nicholas Swenson
2014-03-19 03:22:33 +0800
90db6919f bcache: Fix discard granularity ... Browse Code »

blk_stack_limits() doesn't like a discard granularity of 0.

Signed-off-by: Kent Overstreet

Kent Overstreet
2014-03-19 03:22:33 +0800