Eric Lee / smarc-ti-linux-kernel | Embedian Git Server

29 Dec, 2008

1 commit

b3a6ffe16 Get rid of CONFIG_LSF ... Browse Code »

We have two seperate config entries for large devices/files. One
is CONFIG_LBD that guards just the devices, the other is CONFIG_LSF
that handles large files. This doesn't make a lot of sense, you typically
want both or none. So get rid of CONFIG_LSF and change CONFIG_LBD wording
to indicate that it covers both.

Acked-by: Jean Delvare
Signed-off-by: Jens Axboe

Jens Axboe
2008-12-29 15:29:51 +0800

25 Dec, 2008

1 commit

cbacc2c7f Merge branch 'next' into for-linus Browse Code »

James Morris
2008-12-25 08:40:09 +0800

11 Dec, 2008

2 commits

02d211688 revert "percpu_counter: new function percpu_counter_sum_and_set" ... Browse Code »

Revert

commit e8ced39d5e8911c662d4d69a342b9d053eaaac4e
Author: Mingming Cao
Date: Fri Jul 11 19:27:31 2008 -0400

percpu_counter: new function percpu_counter_sum_and_set

As described in

revert "percpu counter: clean up percpu_counter_sum_and_set()"

the new percpu_counter_sum_and_set() is racy against updates to the
cpu-local accumulators on other CPUs. Revert that change.

This means that ext4 will be slow again. But correct.

Reported-by: Eric Dumazet
Cc: "David S. Miller"
Cc: Peter Zijlstra
Cc: Mingming Cao
Cc:
Cc: [2.6.27.x]
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Andrew Morton
2008-12-11 00:01:52 +0800
71c5576fb revert "percpu counter: clean up percpu_counter_sum_and_set()" ... Browse Code »

Revert

commit 1f7c14c62ce63805f9574664a6c6de3633d4a354
Author: Mingming Cao
Date: Thu Oct 9 12:50:59 2008 -0400

percpu counter: clean up percpu_counter_sum_and_set()

Before this patch we had the following:

percpu_counter_sum(): return the percpu_counter's value

percpu_counter_sum_and_set(): return the percpu_counter's value, copying
that value into the central value and zeroing the per-cpu counters before
returning.

After this patch, percpu_counter_sum_and_set() has gone, and
percpu_counter_sum() gets the old percpu_counter_sum_and_set()
functionality.

Problem is, as Eric points out, the old percpu_counter_sum_and_set()
functionality was racy and wrong. It zeroes out counters on "other" cpus,
without holding any locks which will prevent races agaist updates from
those other CPUS.

This patch reverts 1f7c14c62ce63805f9574664a6c6de3633d4a354. This means
that percpu_counter_sum_and_set() still has the race, but
percpu_counter_sum() does not.

Note that this is not a simple revert - ext4 has since started using
percpu_counter_sum() for its dirty_blocks counter as well.

Note that this revert patch changes percpu_counter_sum() semantics.

Before the patch, a call to percpu_counter_sum() will bring the counter's
central counter mostly up-to-date, so a following percpu_counter_read()
will return a close value.

After this patch, a call to percpu_counter_sum() will leave the counter's
central accumulator unaltered, so a subsequent call to
percpu_counter_read() can now return a significantly inaccurate result.

If there is any code in the tree which was introduced after
e8ced39d5e8911c662d4d69a342b9d053eaaac4e was merged, and which depends
upon the new percpu_counter_sum() semantics, that code will break.

Reported-by: Eric Dumazet
Cc: "David S. Miller"
Cc: Peter Zijlstra
Cc: Mingming Cao
Cc:
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Andrew Morton
2008-12-11 00:01:52 +0800

14 Nov, 2008

2 commits

2b8289256 Merge branch 'master' into next ... Browse Code »

Conflicts:
security/keys/internal.h
security/keys/process_keys.c
security/keys/request_key.c

Fixed conflicts above by using the non 'tsk' versions.

Signed-off-by: James Morris

James Morris
2008-11-14 08:29:12 +0800
4c9c544e4 CRED: Wrap task credential accesses in the Ext4 filesystem ... Browse Code »

Wrap access to task credentials so that they can be separated more easily from
the task_struct during the introduction of COW creds.

Change most current->(|e|s|fs)[ug]id to current_(|e|s|fs)[ug]id().

Change some task->e?[ug]id to task_e?[ug]id(). In some places it makes more
sense to use RCU directly rather than a convenient wrapper; these will be
addressed by later patches.

Signed-off-by: David Howells
Reviewed-by: James Morris
Acked-by: Serge Hallyn
Cc: Stephen Tweedie
Cc: Andrew Morton
Cc: adilger@sun.com
Cc: linux-ext4@vger.kernel.org
Signed-off-by: James Morris

David Howells
2008-11-14 07:38:51 +0800

07 Nov, 2008

3 commits

23712a9c2 ext4: add checksum calculation when clearing UNINIT flag in ext4_new_inode ... Browse Code »

When initializing an uninitialized block group in ext4_new_inode(),
its block group checksum must be re-calculated. This fixes a race
when several threads try to allocate a new inode in an UNINIT'd group.

There is some question whether we need to be initializing the block
bitmap in ext4_new_inode() at all, but for now, if we are going to
init the block group, let's eliminate the race.

Signed-off-by: Frederic Bohe
Signed-off-by: "Theodore Ts'o"

Frederic Bohe
2008-11-07 22:21:01 +0800
ed9b3e337 ext4: Mark the buffer_heads as dirty and uptodate after prepare_write ... Browse Code »

We need to make sure we mark the buffer_heads as dirty and uptodate
so that block_write_full_page write them correctly.

This fixes mmap corruptions that can occur in low memory situations.

Signed-off-by: Aneesh Kumar K.V
Signed-off-by: "Theodore Ts'o"

Aneesh Kumar K.V
2008-11-07 22:06:45 +0800
ac51d8370 ext4: calculate journal credits correctly ... Browse Code »

This fixes a 2.6.27 regression which was introduced in commit a02908f1.

We weren't passing the chunk parameter down to the two subections,
ext4_indirect_trans_blocks() and ext4_ext_index_trans_blocks(), with
the result that massively overestimate the amount of credits needed by
ext4_da_writepages, especially in the non-extents case. This causes
failures especially on /boot partitions, which tend to be small and
non-extent using since GRUB doesn't handle extents.

This patch fixes the bug reported by Joseph Fannin at:
http://bugzilla.kernel.org/show_bug.cgi?id=11964

Signed-off-by: "Theodore Ts'o"

Theodore Ts'o
2008-11-07 05:49:36 +0800

04 Nov, 2008

3 commits

14ce0cb41 ext4: wait on all pending commits in ext4_sync_fs() ... Browse Code »

In ext4_sync_fs, we only wait for a commit to finish if we started it,
but there may be one already in progress which will not be synced.

In the case of a data=ordered umount with pending long symlinks which
are delayed due to a long list of other I/O on the backing block
device, this causes the buffer associated with the long symlinks to
not be moved to the inode dirty list in the second phase of
fsync_super. Then, before they can be dirtied again, kjournald exits,
seeing the UMOUNT flag and the dirty pages are never written to the
backing block device, causing long symlink corruption and exposing new
or previously freed block data to userspace.

To ensure all commits are synced, we flush all journal commits now
when sync_fs'ing ext4.

Signed-off-by: Arthur Jones
Signed-off-by: Andrew Morton
Signed-off-by: "Theodore Ts'o"
Cc: Eric Sandeen
Cc:

Theodore Ts'o
2008-11-04 07:10:55 +0800
d94e99a64 ext4: Convert to host order before using the values. ... Browse Code »

Use le16_to_cpu to read the s_reserved_gdt_blocks values
from super block.

Signed-off-by: Aneesh Kumar K.V
Signed-off-by: "Theodore Ts'o"

Aneesh Kumar K.V
2008-11-04 22:11:26 +0800
ae2d9fb18 ext4: fix missing ext4_unlock_group in error path ... Browse Code »

If we try to free a block which is already freed, the code was
returning without first unlocking the group.

Signed-off-by: Aneesh Kumar K.V
Signed-off-by: "Theodore Ts'o"

Aneesh Kumar K.V
2008-11-04 22:10:50 +0800

28 Oct, 2008

3 commits

a996031c8 delay capable() check in ext4_has_free_blocks() ... Browse Code »

As reported by Eric Paris, the capable() check in ext4_has_free_blocks()
sometimes causes SELinux denials.

We can rearrange the logic so that we only try to use the root-reserved
blocks when necessary, and even then we can move the capable() test
to last, to avoid the check most of the time.

Signed-off-by: Eric Sandeen
Reviewed-by: Mingming Cao
Signed-off-by: "Theodore Ts'o"

Eric Sandeen
2008-10-28 12:08:17 +0800
8c3bf8a01 merge ext4_claim_free_blocks & ext4_has_free_blocks ... Browse Code »

Mingming pointed out that ext4_claim_free_blocks & ext4_has_free_blocks
are largely cut & pasted; they can be collapsed/merged as follows.

Signed-off-by: Eric Sandeen
Reviewed-by: Mingming Cao
Signed-off-by: "Theodore Ts'o"

Eric Sandeen
2008-10-28 12:08:12 +0800
ef2cabf7c ext4: fix a bug accessing freed memory in ext4_abort ... Browse Code »

Vegard Nossum reported a bug which accesses freed memory (found via
kmemcheck). When journal has been aborted, ext4_put_super() calls
ext4_abort() after freeing the journal_t object, and then ext4_abort()
accesses it. This patch fix it.

Signed-off-by: Hidehiro Kawai
Acked-by: Jan Kara
Signed-off-by: "Theodore Ts'o"

Hidehiro Kawai
2008-10-28 10:53:05 +0800

26 Oct, 2008

1 commit

3c37fc86d ext4: Fix duplicate entries returned from getdents() system call ... Browse Code »

Fix a regression caused by commit d0156417, "ext4: fix ext4_dx_readdir
hash collision handling", where deleting files in a large directory
(requiring more than one getdents system call), results in some
filenames being returned twice. This was caused by a failure to
update info->curr_hash and info->curr_minor_hash, so that if the
directory had gotten modified since the last getdents() system call
(as would be the case if the user is running "rm -r" or "git clean"),
a directory entry would get returned twice to the userspace.

Signed-off-by: "Theodore Ts'o"

This patch fixes the bug reported by Markus Trippelsdorf at:
http://bugzilla.kernel.org/show_bug.cgi?id=11844

Signed-off-by: "Theodore Ts'o"
Tested-by: Markus Trippelsdorf

Theodore Ts'o
2008-10-26 10:37:55 +0800

24 Oct, 2008

2 commits

3856d30de ext4: remove unused variable in ext4_get_parent ... Browse Code »

Signed-off-by: Christoph Hellwig
[ All users removed in "switch all filesystems over to d_obtain_alias",
aka commit 440037287c5ebb07033ab927ca16bb68c291d309 ]
Signed-off-by: Linus Torvalds

Christoph Hellwig
2008-10-24 03:03:23 +0800
224848564 Merge git://git.kernel.org/pub/scm/linux/kernel/git/viro/bdev ... Browse Code »

* git://git.kernel.org/pub/scm/linux/kernel/git/viro/bdev: (66 commits)
[PATCH] kill the rest of struct file propagation in block ioctls
[PATCH] get rid of struct file use in blkdev_ioctl() BLKBSZSET
[PATCH] get rid of blkdev_locked_ioctl()
[PATCH] get rid of blkdev_driver_ioctl()
[PATCH] sanitize blkdev_get() and friends
[PATCH] remember mode of reiserfs journal
[PATCH] propagate mode through swsusp_close()
[PATCH] propagate mode through open_bdev_excl/close_bdev_excl
[PATCH] pass fmode_t to blkdev_put()
[PATCH] kill the unused bsize on the send side of /dev/loop
[PATCH] trim file propagation in block/compat_ioctl.c
[PATCH] end of methods switch: remove the old ones
[PATCH] switch sr
[PATCH] switch sd
[PATCH] switch ide-scsi
[PATCH] switch tape_block
[PATCH] switch dcssblk
[PATCH] switch dasd
[PATCH] switch mtd_blkdevs
[PATCH] switch mmc
...

Linus Torvalds
2008-10-24 01:23:07 +0800

23 Oct, 2008

2 commits

440037287 [PATCH] switch all filesystems over to d_obtain_alias ... Browse Code »

Switch all users of d_alloc_anon to d_obtain_alias.

Signed-off-by: Christoph Hellwig
Signed-off-by: Al Viro

Christoph Hellwig
2008-10-23 17:13:01 +0800
8264613de [PATCH] switch quota_on-related stuff to kern_path() ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2008-10-23 17:12:44 +0800

21 Oct, 2008

2 commits

9a1c35427 [PATCH] pass fmode_t to blkdev_put() ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2008-10-21 19:48:58 +0800
6da0b38f4 fs/Kconfig: move ext2, ext3, ext4, JBD, JBD2 out ... Browse Code »

Use fs/*/Kconfig more, which is good because everything related to one
filesystem is in one place and fs/Kconfig is quite fat.

Signed-off-by: Alexey Dobriyan
Signed-off-by: Linus Torvalds

Alexey Dobriyan
2008-10-21 02:43:59 +0800

18 Oct, 2008

1 commit

0b09923ea ext4: Remove compile warnings when building w/o CONFIG_PROC_FS ... Browse Code »

Signed-off-by: Manish Katiyar
Signed-off-by: "Theodore Ts'o"

Manish Katiyar
2008-10-18 02:58:45 +0800

17 Oct, 2008

4 commits

f287a1a56 ext4: Remove automatic enabling of the HUGE_FILE feature flag ... Browse Code »

If the HUGE_FILE feature flag is not set, don't allow the creation of
large files, instead of automatically enabling the feature flag.
Recent versions of mke2fs will set the HUGE_FILE flag automatically
anyway for ext4 filesystems.

Signed-off-by: "Theodore Ts'o"

Theodore Ts'o
2008-10-17 10:50:48 +0800
3e624fc72 ext4: Replace hackish ext4_mb_poll_new_transaction with commit callback ... Browse Code »

The multiblock allocator needs to be able to release blocks (and issue
a blkdev discard request) when the transaction which freed those
blocks is committed. Previously this was done via a polling mechanism
when blocks are allocated or freed. A much better way of doing things
is to create a jbd2 callback function and attaching the list of blocks
to be freed directly to the transaction structure.

Signed-off-by: "Theodore Ts'o"

Theodore Ts'o
2008-10-17 08:00:24 +0800
01436ef2e ext4: Remove unused mount options: nomballoc, mballoc, nocheck ... Browse Code »

These mount options don't actually do anything any more, so remove
them.

Signed-off-by: "Theodore Ts'o"

Theodore Ts'o
2008-10-17 19:22:35 +0800
5128273a3 ext4: Add missing newlines to printk messages ... Browse Code »

There are some newlines missing in ext4_check_descriptors, which
cause the printk level to be printed out when the next printk call
is made:

[ 778.847265] EXT4-fs: ext4_check_descriptors: Block bitmap for group 0
not in group (block 1509949442)!EXT4-fs: group descriptors corrupted!
[ 802.646630] EXT4-fs: ext4_check_descriptors: Inode bitmap for group 0
not in group (block 9043971)!EXT4-fs: group descriptors corrupted!

Signed-off-by: Eric Sesterhenn
Signed-off-by: "Theodore Ts'o"

Eric Sesterhenn
2008-10-17 21:16:19 +0800

16 Oct, 2008

3 commits

22208dedb ext4: Fix file fragmentation during large file write. ... Browse Code »

The range_cyclic writeback mode uses the address_space writeback_index
as the start index for writeback. With delayed allocation we were
updating writeback_index wrongly resulting in highly fragmented file.
This patch reduces the number of extents reduced from 4000 to 27 for a
3GB file.

Signed-off-by: Aneesh Kumar K.V
Signed-off-by: Theodore Ts'o

Aneesh Kumar K.V
2008-10-16 22:10:36 +0800
8a0aba733 ext4: let the block device know when unused blocks can be discarded ... Browse Code »

Let the block device know when unused blocks can be discarded, using
the new sb_issue_discard() interface.

Signed-off-by: "Theodore Ts'o"

Theodore Ts'o
2008-10-16 22:06:27 +0800
c894058d6 ext4: Use an rbtree for tracking blocks freed during transaction. ... Browse Code »

With this patch we track the block freed during a transaction using
red-black tree. We also make sure contiguous blocks freed are collected
in one node in the tree.

Signed-off-by: Aneesh Kumar K.V
Signed-off-by: Theodore Ts'o

Aneesh Kumar K.V
2008-10-16 22:14:27 +0800

14 Oct, 2008

3 commits

af6f029d3 ext4: Use tag dirty lookup during mpage_da_submit_io ... Browse Code »

This enables us to drop the range_cont writeback mode
use from ext4_da_writepages.

Signed-off-by: Aneesh Kumar K.V

Aneesh Kumar K.V
2008-10-14 21:20:19 +0800
688f05a01 ext4: Free ext4_prealloc_space using kmem_cache_free ... Browse Code »

We should use kmem_cache_free to free memory allocated
via kmem_cache_alloc

Signed-off-by: Aneesh Kumar K.V
Signed-off-by: Theodore Ts'o

Aneesh Kumar K.V
2008-10-14 00:14:14 +0800
a447c0932 vfs: Use const for kernel parser table ... Browse Code »

This is a much better version of a previous patch to make the parser
tables constant. Rather than changing the typedef, we put the "const" in
all the various places where its required, allowing the __initconst
exception for nfsroot which was the cause of the previous trouble.

This was posted for review some time ago and I believe its been in -mm
since then.

Signed-off-by: Steven Whitehouse
Cc: Alexander Viro
Signed-off-by: Linus Torvalds

Steven Whitehouse
2008-10-14 01:10:37 +0800

13 Oct, 2008

1 commit

3244fcb1a ext4: fix build failure without procfs ... Browse Code »

fs/ext4/super.c: In function 'ext4_fill_super':
fs/ext4/super.c:2226: error: 'ext4_ui_proc_fops' undeclared (first use
in this function)
fs/ext4/super.c:2226: error: (Each undeclared identifier is reported
only once
fs/ext4/super.c:2226: error: for each function it appears in.)

Signed-off-by: Alexander Beregalov
Signed-off-by: Theodore Ts'o

Alexander Beregalov
2008-10-13 05:27:49 +0800

11 Oct, 2008

5 commits

a1aebc1e2 ext4: Don't reuse released data blocks until transaction commits ... Browse Code »

We need to make sure we don't reuse the data blocks released
during the transaction untill the transaction commits. We force
this mode only for ordered and journalled mode. Writeback mode
already don't provided data consistency.

Signed-off-by: Aneesh Kumar K.V
Signed-off-by: Theodore Ts'o

Aneesh Kumar K.V
2008-10-11 08:13:31 +0800
c2774d84f ext4: Do mballoc init before doing filesystem recovery ... Browse Code »

During filesystem recovery we may be doing a truncate
which expects some of the mballoc data structures to
be initialized. So do ext4_mb_init before recovery.

Signed-off-by: Aneesh Kumar K.V
Signed-off-by: Theodore Ts'o

Aneesh Kumar K.V
2008-10-11 08:07:20 +0800
5bf5683a3 ext4: add an option to control error handling on file data ... Browse Code »

If the journal doesn't abort when it gets an IO error in file data
blocks, the file data corruption will spread silently. Because
most of applications and commands do buffered writes without fsync(),
they don't notice the IO error. It's scary for mission critical
systems. On the other hand, if the journal aborts whenever it gets
an IO error in file data blocks, the system will easily become
inoperable. So this patch introduces a filesystem option to
determine whether it aborts the journal or just call printk() when
it gets an IO error in file data.

If you mount an ext4 fs with data_err=abort option, it aborts on file
data write error. If you mount it with data_err=ignore, it doesn't
abort, just call printk(). data_err=ignore is the default.

Here is the corresponding patch of the ext3 version:
http://kerneltrap.org/mailarchive/linux-kernel/2008/9/9/3239374

Signed-off-by: Hidehiro Kawai
Signed-off-by: Theodore Ts'o

Hidehiro Kawai
2008-10-11 10:12:43 +0800
7ffe1ea89 ext4: add checks for errors from jbd2 ... Browse Code »

If the journal has aborted due to a checkpointing failure, we
have to keep the contents of the journal space. Otherwise, the
filesystem will lose uncheckpointed metadata completely and
become inconsistent. To avoid this, we need to keep needs_recovery
flag if checkpoint has failed.

With this patch, ext4_put_super() detects a checkpointing failure
from the return value of journal_destroy(), then it invokes
ext4_abort() to make the filesystem read only and keep
needs_recovery flag. Errors from jbd2_journal_flush() are also
handled by this patch in some places.

Signed-off-by: Hidehiro Kawai
Signed-off-by: Theodore Ts'o

Hidehiro Kawai
2008-10-11 08:29:21 +0800
03010a335 ext4: Rename ext4dev to ext4 ... Browse Code »

The ext4 filesystem is getting stable enough that it's time to drop
the "dev" prefix. Also remove the requirement for the TEST_FILESYS
flag.

Signed-off-by: "Theodore Ts'o"

Theodore Ts'o
2008-10-11 08:02:48 +0800

07 Oct, 2008

1 commit

39d80c33a ext4: Avoid double dirtying of super block in ext4_put_super() ... Browse Code »

While reading code I noticed that ext4_put_super() dirties the
superblock bh twice. It is always done in ext4_commit_super()
too. Remove the redundant dirty operation.
Should be a nop semantically.

Signed-off-by: Andi Kleen

Andi Kleen
2008-10-07 09:37:44 +0800