Eric Lee / smarc-fsl-linux-kernel

19 Dec, 2011

1 commit

60e07cf51 ext4: do not reference pa_inode from group_pa ... Browse Code »

pa_inode in group_pa is set NULL in ext4_mb_new_group_pa, so
pa_inode should be not referenced.

Reported-by: Wu Fengguang
Signed-off-by: Yongqiang Yang
Signed-off-by: "Theodore Ts'o"

Yongqiang Yang
2011-12-19 04:49:54 +0800

26 Oct, 2011

4 commits

0a10da73e ext4: fix a wrong comment in __mb_check_buddy() ... Browse Code »

The comment says the bit should be 0, but the after code assert the
bit to be 1. This makes people confused, so fix it.

Signed-off-by: Robin Dong
Signed-off-by: "Theodore Ts'o"

Robin Dong
2011-10-26 20:48:54 +0800
b051d8dc4 ext4: remove unused variable in mb_find_extent() ... Browse Code »

The variable 'ord' in function mb_find_extent() is redundant, so
remove it.

Signed-off-by: Robin Dong
Signed-off-by: "Theodore Ts'o"

Robin Dong
2011-10-26 17:30:30 +0800
66a83cde4 ext4: remove unused variable in ext4_mb_generate_from_pa() ... Browse Code »

The variable 'count' in function ext4_mb_generate_from_pa() looks
useless, so remove it.

Signed-off-by: Robin Dong
Signed-off-by: "Theodore Ts'o"

Robin Dong
2011-10-26 17:29:21 +0800
ebbe02779 ext4: use stream-alloc when mb_group_prealloc set to zero ... Browse Code »

The kernel will crash on

ext4_mb_mark_diskspace_used:
BUG_ON(ac->ac_b_ex.fe_len
Signed-off-by: "Theodore Ts'o"

Robin Dong
2011-10-26 17:14:27 +0800

21 Oct, 2011

1 commit

45dc63e7d ext4: Allow quota file use root reservation ... Browse Code »

Quota file is fs's metadata, so it is reasonable to permit use
root resevation if necessary. This patch fix 265'th xfstest failure

Signed-off-by: Dmitry Monakhov
Signed-off-by: "Theodore Ts'o"

Dmitry Monakhov
2011-10-21 08:07:23 +0800

06 Oct, 2011

1 commit

7aa0baeab ext4: Free resources in ext4_mb_init()'s error paths ... Browse Code »

In commit 79a77c5ac, we move ext4_mb_init_backend after the allocation
of s_locality_group to avoid memory leak in error path, but there are
still some other error paths in ext4_mb_init that need to do the same
work. So this patch adds all the error patch for ext4_mb_init. And all
the pointers are reset to NULL in case the caller may double free them.

Signed-off-by: Tao Ma
Signed-off-by: "Theodore Ts'o"

Tao Ma
2011-10-06 22:22:28 +0800

10 Sep, 2011

10 commits

e7d5f3156 ext4: rename ext4_claim_free_blocks() to ext4_claim_free_clusters() ... Browse Code »

This function really claims a number of free clusters, not blocks, so
rename it so it's clearer what's going on.

Signed-off-by: "Theodore Ts'o"

Theodore Ts'o
2011-09-10 07:14:51 +0800
cff1dfd76 ext4: rename ext4_free_blocks_after_init() to ext4_free_clusters_after_init() ... Browse Code »

This function really returns the number of clusters after initializing
an uninitalized block bitmap has been initialized.

Signed-off-by: "Theodore Ts'o"

Theodore Ts'o
2011-09-10 07:12:51 +0800
021b65bb1 ext4: Rename ext4_free_blks_{count,set}() to refer to clusters ... Browse Code »

The field bg_free_blocks_count_{lo,high} in the block group
descriptor has been repurposed to hold the number of free clusters for
bigalloc functions. So rename the functions so it makes it easier to
read and audit the block allocation and block freeing code.

Note: at this point in bigalloc development we doesn't support
online resize, so this also makes it really obvious all of the places
we need to fix up to add support for online resize.

Signed-off-by: "Theodore Ts'o"

Theodore Ts'o
2011-09-10 07:08:51 +0800
7b415bf60 ext4: Fix bigalloc quota accounting and i_blocks value ... Browse Code »

With bigalloc changes, the i_blocks value was not correctly set (it was still
set to number of blocks being used, but in case of bigalloc, we want i_blocks
to represent the number of clusters being used). Since the quota subsystem sets
the i_blocks value, this patch fixes the quota accounting and makes sure that
the i_blocks value is set correctly.

Signed-off-by: Aditya Kali
Signed-off-by: "Theodore Ts'o"

Aditya Kali
2011-09-10 07:04:51 +0800
27baebb84 ext4: tune mballoc's default group prealloc size for bigalloc file systems ... Browse Code »

The default group preallocation size had been previously set to 512
blocks/clusters, regardless of the block/cluster size. This is
probably too big for large cluster sizes. So adjust the default so
that it is 2 megabytes or 32 clusters, whichever is larger.

Signed-off-by: "Theodore Ts'o"

Theodore Ts'o
2011-09-10 07:02:51 +0800
24aaa8ef4 ext4: convert the free_blocks field in s_flex_groups to be free_clusters ... Browse Code »

Convert the free_blocks to be free_clusters to make the final revised
bigalloc changes easier to read/understand.

Signed-off-by: "Theodore Ts'o"

Theodore Ts'o
2011-09-10 06:58:51 +0800
570426518 ext4: convert s_{dirty,free}blocks_counter to s_{dirty,free}clusters_counter ... Browse Code »

Convert the percpu counters s_dirtyblocks_counter and
s_freeblocks_counter in struct ext4_super_info to be
s_dirtyclusters_counter and s_freeclusters_counter.

Signed-off-by: "Theodore Ts'o"

Theodore Ts'o
2011-09-10 06:56:51 +0800
84130193e ext4: teach ext4_free_blocks() about bigalloc and clusters ... Browse Code »

The ext4_free_blocks() function now has two new flags that indicate
whether a partial cluster at the beginning or the end of the block
extents should be freed or not. That will be up the caller (i.e.,
truncate), who can figure out whether partial clusters at the
beginning or the end of a block range can be freed.

We also have to update the ext4_mb_free_metadata() and
release_blocks_on_commit() machinery to be cluster-based, since it is
used by ext4_free_blocks().

Signed-off-by: "Theodore Ts'o"

Theodore Ts'o
2011-09-10 06:50:51 +0800
53accfa9f ext4: teach mballoc preallocation code about bigalloc clusters ... Browse Code »

In most of mballoc.c, we do everything in units of clusters, since the
block allocation bitmaps and buddy bitmaps are all denominated in
clusters. The one place where we do deal with absolute block numbers
is in the code that handles the preallocation regions, since in the
case of inode-based preallocation regions, the start of the
preallocation region can't be relative to the beginning of the group.

So this adds a bit of complexity, where pa_pstart and pa_lstart are
block numbers, while pa_free, pa_len, and fe_len are denominated in
units of clusters.

Signed-off-by: "Theodore Ts'o"

Theodore Ts'o
2011-09-10 06:48:51 +0800
7137d7a48 ext4: convert instances of EXT4_BLOCKS_PER_GROUP to EXT4_CLUSTERS_PER_GROUP ... Browse Code »

Change the places in fs/ext4/mballoc.c where EXT4_BLOCKS_PER_GROUP are
used to indicate the number of bits in a block bitmap (which is really
a cluster allocation bitmap in bigalloc file systems). There are
still some places in the ext4 codebase where usage of
EXT4_BLOCKS_PER_GROUP needs to be audited/fixed, in code paths that
aren't used given the initial restricted assumptions for bigalloc.
These will need to be fixed before we can relax those restrictions.

Signed-off-by: "Theodore Ts'o"

Theodore Ts'o
2011-09-10 06:38:51 +0800

02 Aug, 2011

3 commits

79a77c5ac ext4: prevent memory leaks from ext4_mb_init_backend() on error path ... Browse Code »

In ext4_mb_init(), if the s_locality_group allocation fails it will
currently cause the allocations made in ext4_mb_init_backend() to
be leaked. Moving the ext4_mb_init_backend() allocation after the
s_locality_group allocation avoids that problem.

Signed-off-by: Yu Jian
Signed-off-by: Andreas Dilger
Signed-off-by: "Theodore Ts'o"

Yu Jian
2011-08-02 05:41:46 +0800
48e6061bf ext4: use EXT4_BAD_INO for buddy cache to avoid colliding with valid inode # ... Browse Code »

Signed-off-by: Yu Jian
Signed-off-by: Andreas Dilger
Signed-off-by: "Theodore Ts'o"

Yu Jian
2011-08-02 05:41:39 +0800
9d8b9ec44 ext4: use ext4_msg() instead of printk in mballoc ... Browse Code »

Signed-off-by: "Theodore Ts'o"

Theodore Ts'o
2011-08-02 05:41:35 +0800

01 Aug, 2011

1 commit

f18a5f21c ext4: use ext4_kvzalloc()/ext4_kvmalloc() for s_group_desc and s_group_info ... Browse Code »

Signed-off-by: "Theodore Ts'o"

Theodore Ts'o
2011-08-01 20:45:38 +0800

27 Jul, 2011

4 commits

c3e94d1df ext4: let setup_new_group_blocks() set multiple bits at a time ... Browse Code »

Rename mb_set_bits() to ext4_set_bits() and make it a global function
so that setup_new_group_blocks() can use it.

Signed-off-by: Yongqiang Yang
Signed-off-by: "Theodore Ts'o"

Yongqiang Yang
2011-07-27 10:05:53 +0800
4740b830e ext4: let ext4_group_add_blocks() handle 0 blocks quickly ... Browse Code »

If ext4_group_add_blocks() is called with 0 block, make it return 0
without doing any extra work.

Signed-off-by: Yongqiang Yang
Signed-off-by: "Theodore Ts'o"

Yongqiang Yang
2011-07-27 09:51:08 +0800
cc7365dfe ext4: let ext4_group_add_blocks() return an error code ... Browse Code »

This patch lets ext4_group_add_blocks() return an error code if it
fails, so that upper functions can handle error correctly.

Signed-off-by: Yongqiang Yang
Signed-off-by: "Theodore Ts'o"

Yongqiang Yang
2011-07-27 09:46:07 +0800
0529155e8 ext4: rename ext4_add_groupblocks() to ext4_group_add_blocks() ... Browse Code »

Signed-off-by: Yongqiang Yang
Signed-off-by: "Theodore Ts'o"

Yongqiang Yang
2011-07-27 09:43:56 +0800

24 Jul, 2011

2 commits

ced156e46 ext4: don't increment s_mb_buddies_generated in ext4_mb_release ... Browse Code »

In ext4_mb_release, we use s_mb_buddies_generated++. Although
the output is OK, but I don't think we need this extra ++.

Signed-off-by: Tao Ma
Signed-off-by: "Theodore Ts'o"

Tao Ma
2011-07-24 04:18:05 +0800
529da704a ext4: remove unnecessary ext4_get_group_info in ext4_mb_load_buddy ... Browse Code »

ext4_mb_load_buddy() calls ext4_get_group_info() for setting both
"grp" and "e4b->bd_info", but it could do "e4b->bd_info = grp".

Reported-by: Andreas Dilger
Signed-off-by: Tao Ma
Signed-off-by: "Theodore Ts'o"

Tao Ma
2011-07-24 04:07:26 +0800

18 Jul, 2011

1 commit

d7a1fee13 ext4: make the preallocation size be a multiple of stripe size ... Browse Code »

Previously, if a stripe width was provided, then it would be used
as the preallocation granularity, with no santiy checking and no
way to override this. Now, mb_prealloc_size defaults to the smallest
multiple of stripe size that is greater than or equal to the old
default mb_prealloc_size, and this can be overridden with the sysfs
interface.

Signed-off-by: Dan Ehrenberg
Signed-off-by: "Theodore Ts'o"

Dan Ehrenberg
2011-07-18 09:11:30 +0800

12 Jul, 2011

2 commits

caaf7a29d ext4: Fix a double free of sbi->s_group_info in ext4_mb_init_backend ... Browse Code »

If we meet with an error in ext4_mb_add_groupinfo, we kfree
sbi->s_group_info[group >> EXT4_DESC_PER_BLOCK_BITS(sb)], but fail to
reset it to NULL. So the caller ext4_mb_init_backend will try to kfree
it again and causes a double free. So fix it by resetting it to NULL.

Some typo in comments of mballoc.c are also changed.

Signed-off-by: Tao Ma
Signed-off-by: "Theodore Ts'o"

Tao Ma
2011-07-12 06:42:42 +0800
823ba01fc ext4: fix a race which could leak memory in ext4_groupinfo_create_slab() ... Browse Code »

In ext4_groupinfo_create_slab, we create ext4_groupinfo_caches within
ext4_grpinfo_slab_create_mutex, but set it outside the lock, and there
does exist some case that we may create it twice and causes a memory
leak. So set it before we call mutex_unlock.

Signed-off-by: Tao Ma
Signed-off-by: "Theodore Ts'o"

Tao Ma
2011-07-12 06:26:01 +0800

11 Jul, 2011

6 commits

22612283f ext4: Change the wrong param comment for ext4_trim_all_free ... Browse Code »

at ext4_trim_all_free() comment, there is no longer an @e4b parameter,
instead it is @group.

Reported-by: Andreas Dilger
Signed-off-by: Tao Ma
Signed-off-by: "Theodore Ts'o"

Tao Ma
2011-07-11 12:04:34 +0800
3d56b8d2c ext4: Speed up FITRIM by recording flags in ext4_group_info ... Browse Code »

In ext4, when FITRIM is called every time, we iterate all the
groups and do trim one by one. It is a bit time wasting if the
group has been trimmed and there is no change since the last
trim.

So this patch adds a new flag in ext4_group_info->bb_state to
indicate that the group has been trimmed, and it will be cleared
if some blocks is freed(in release_blocks_on_commit). Another
trim_minlen is added in ext4_sb_info to record the last minlen
we use to trim the volume, so that if the caller provide a small
one, we will go on the trim regardless of the bb_state.

A simple test with my intel x25m ssd:
df -h shows:
/dev/sdb1 40G 21G 17G 56% /mnt/ext4
Block size: 4096

run the FITRIM with the following parameter:
range.start = 0;
range.len = UINT64_MAX;
range.minlen = 1048576;

without the patch:
[root@boyu-tm linux-2.6]# time ./ftrim /mnt/ext4/a
real 0m5.505s
user 0m0.000s
sys 0m1.224s
[root@boyu-tm linux-2.6]# time ./ftrim /mnt/ext4/a
real 0m5.359s
user 0m0.000s
sys 0m1.178s
[root@boyu-tm linux-2.6]# time ./ftrim /mnt/ext4/a
real 0m5.228s
user 0m0.000s
sys 0m1.151s

with the patch:
[root@boyu-tm linux-2.6]# time ./ftrim /mnt/ext4/a
real 0m5.625s
user 0m0.000s
sys 0m1.269s
[root@boyu-tm linux-2.6]# time ./ftrim /mnt/ext4/a
real 0m0.002s
user 0m0.000s
sys 0m0.001s
[root@boyu-tm linux-2.6]# time ./ftrim /mnt/ext4/a
real 0m0.002s
user 0m0.000s
sys 0m0.001s

A big improvement for the 2nd and 3rd run.

Even after I delete some big image files, it is still much
faster than iterating the whole disk.

[root@boyu-tm test]# time ./ftrim /mnt/ext4/a
real 0m1.217s
user 0m0.000s
sys 0m0.196s

Cc: Lukas Czerner
Reviewed-by: Andreas Dilger
Signed-off-by: Tao Ma
Signed-off-by: "Theodore Ts'o"

Tao Ma
2011-07-11 12:03:38 +0800
b3d4c2b10 ext4: Add new ext4 trim tracepoints ... Browse Code »

Add ext4_trim_extent and ext4_trim_all_free.

Reviewed-by: Lukas Czerner
Signed-off-by: Tao Ma
Signed-off-by: "Theodore Ts'o"

Tao Ma
2011-07-11 12:01:52 +0800
169ddc3ec ext4: speed up group trim with the right free block count ... Browse Code »

When we trim some free blocks in a group of ext4, we need to
calculate the free blocks properly and check whether there are
enough freed blocks left for us to trim. Current solution will
only calculate free spaces if they are large for a trim which
isn't appropriate.

Let us see a small example:
a group has 1.5M free which are 300k, 300k, 300k, 300k, 300k.
And minblocks is 1M. With current solution, we have to iterate
the whole group since these 300k will never be subtracted from
1.5M. But actually we should exit after we find the first 2
free spaces since the left 3 chunks only sum up to 900K if we
subtract the first 600K although they can't be trimed.

Reviewed-by: Andreas Dilger
Signed-off-by: Tao Ma
Signed-off-by: "Theodore Ts'o"

Tao Ma
2011-07-11 12:00:07 +0800
22f104574 ext4: fix trim length underflow with small trim length ... Browse Code »

In 0f0a25b, we adjust 'len' with s_first_data_block - start, but
it could underflow in case blocksize=1K, fstrim_range.len=512 and
fstrim_range.start = 0. In this case, when we run the code:
len -= first_data_blk - start; len will be underflow to -1ULL.
In the end, although we are safe that last_group check later will limit
the trim to the whole volume, but that isn't what the user really want.

So this patch fix it. It also adds the check for 'start' like ext3 so that
we can break immediately if the start is invalid.

Cc: Lukas Czerner
Signed-off-by: Tao Ma
Signed-off-by: "Theodore Ts'o"

Tao Ma
2011-07-11 11:52:37 +0800
7132de744 ext4: fix i_blocks/quota accounting when extent insertion fails ... Browse Code »
1

The current implementation of ext4_free_blocks() always calls
dquot_free_block This looks quite sensible in the most cases: blocks
to be freed are associated with inode and were accounted in quota and
i_blocks some time ago.

However, there is a case when blocks to free were not accounted by the
time calling ext4_free_blocks() yet:

1. delalloc is on, write_begin pre-allocated some space in quota
2. write-back happens, ext4 allocates some blocks in ext4_ext_map_blocks()
3. then ext4_ext_map_blocks() gets an error (e.g. ENOSPC) from
ext4_ext_insert_extent() and calls ext4_free_blocks().

In this scenario, ext4_free_blocks() calls dquot_free_block() who, in
turn, decrements i_blocks for blocks which were not accounted yet (due
to delalloc) After clean umount, e2fsck reports something like:

> Inode 21, i_blocks is 5080, should be 5128. Fix?
because i_blocks was erroneously decremented as explained above.

The patch fixes the problem by passing the new flag
EXT4_FREE_BLOCKS_NO_QUOT_UPDATE to ext4_free_blocks(), to request
that the dquot_free_block() call be skipped.

Signed-off-by: Maxim Patlasov
Signed-off-by: "Theodore Ts'o"
Cc: stable@kernel.org

Maxim Patlasov
2011-07-11 07:37:48 +0800

28 Jun, 2011

1 commit

9331b6261 ext4: quiet 'unused variables' compile warnings ... Browse Code »

Unused variables was deleted.

Signed-off-by: Yongqiang Yang
Signed-off-by: "Theodore Ts'o"

Yongqiang Yang
2011-06-28 22:19:05 +0800

06 Jun, 2011

2 commits

a9c667f8f ext4: fixed tracepoints cleanup ... Browse Code »

While creating fixed tracepoints for ext3, basically by porting them
from ext4, I found a lot of useless retyping, wrong type usage, useless
variable passing and other inconsistencies in the ext4 fixed tracepoint
code.

This patch cleans the fixed tracepoint code for ext4 and also simplify
some of them.

Signed-off-by: Lukas Czerner
Signed-off-by: "Theodore Ts'o"

Lukas Czerner
2011-06-06 21:51:52 +0800
5def13602 ext4: correct comments for ext4_free_blocks() ... Browse Code »

metadata is not parameter of ext4_free_blocks() any more.

Signed-off-by: Yongqiang Yang
Signed-off-by: "Theodore Ts'o"

Yongqiang Yang
2011-06-06 11:26:40 +0800

25 May, 2011

1 commit

55f020db6 ext4: add flag to ext4_has_free_blocks ... Browse Code »

This patch adds an allocation request flag to the ext4_has_free_blocks
function which enables the use of reserved blocks. This will allow a
punch hole to proceed even if the disk is full. Punching a hole may
require additional blocks to first split the extents.

Because ext4_has_free_blocks is a low level function, the flag needs
to be passed down through several functions listed below:

ext4_ext_insert_extent
ext4_ext_create_new_leaf
ext4_ext_grow_indepth
ext4_ext_split
ext4_ext_new_meta_block
ext4_mb_new_blocks
ext4_claim_free_blocks
ext4_has_free_blocks

[ext4 punch hole patch series 1/5 v7]

Signed-off-by: Allison Henderson
Signed-off-by: "Theodore Ts'o"
Reviewed-by: Mingming Cao

Allison Henderson
2011-05-25 19:41:26 +0800