Eric Lee / smarc-fsl-linux-kernel

20 Jul, 2007

3 commits

d0217ac04 mm: fault feedback #1 ... Browse Code »

Change ->fault prototype. We now return an int, which contains
VM_FAULT_xxx code in the low byte, and FAULT_RET_xxx code in the next byte.
FAULT_RET_ code tells the VM whether a page was found, whether it has been
locked, and potentially other things. This is not quite the way he wanted
it yet, but that's changed in the next patch (which requires changes to
arch code).

This means we no longer set VM_CAN_INVALIDATE in the vma in order to say
that a page is locked which requires filemap_nopage to go away (because we
can no longer remain backward compatible without that flag), but we were
going to do that anyway.

struct fault_data is renamed to struct vm_fault as Linus asked. address
is now a void __user * that we should firmly encourage drivers not to use
without really good reason.

The page is now returned via a page pointer in the vm_fault struct.

Signed-off-by: Nick Piggin
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Nick Piggin
2007-07-20 01:04:41 +0800
54cb8821d mm: merge populate and nopage into fault (fixes nonlinear) ... Browse Code »

Nonlinear mappings are (AFAIKS) simply a virtual memory concept that encodes
the virtual address -> file offset differently from linear mappings.

->populate is a layering violation because the filesystem/pagecache code
should need to know anything about the virtual memory mapping. The hitch here
is that the ->nopage handler didn't pass down enough information (ie. pgoff).
But it is more logical to pass pgoff rather than have the ->nopage function
calculate it itself anyway (because that's a similar layering violation).

Having the populate handler install the pte itself is likewise a nasty thing
to be doing.

This patch introduces a new fault handler that replaces ->nopage and
->populate and (later) ->nopfn. Most of the old mechanism is still in place
so there is a lot of duplication and nice cleanups that can be removed if
everyone switches over.

The rationale for doing this in the first place is that nonlinear mappings are
subject to the pagefault vs invalidate/truncate race too, and it seemed stupid
to duplicate the synchronisation logic rather than just consolidate the two.

After this patch, MAP_NONBLOCK no longer sets up ptes for pages present in
pagecache. Seems like a fringe functionality anyway.

NOPAGE_REFAULT is removed. This should be implemented with ->fault, and no
users have hit mainline yet.

[akpm@linux-foundation.org: cleanup]
[randy.dunlap@oracle.com: doc. fixes for readahead]
[akpm@linux-foundation.org: build fix]
Signed-off-by: Nick Piggin
Signed-off-by: Randy Dunlap
Cc: Mark Fasheh
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Nick Piggin
2007-07-20 01:04:41 +0800
d00806b18 mm: fix fault vs invalidate race for linear mappings ... Browse Code »

Fix the race between invalidate_inode_pages and do_no_page.

Andrea Arcangeli identified a subtle race between invalidation of pages from
pagecache with userspace mappings, and do_no_page.

The issue is that invalidation has to shoot down all mappings to the page,
before it can be discarded from the pagecache. Between shooting down ptes to
a particular page, and actually dropping the struct page from the pagecache,
do_no_page from any process might fault on that page and establish a new
mapping to the page just before it gets discarded from the pagecache.

The most common case where such invalidation is used is in file truncation.
This case was catered for by doing a sort of open-coded seqlock between the
file's i_size, and its truncate_count.

Truncation will decrease i_size, then increment truncate_count before
unmapping userspace pages; do_no_page will read truncate_count, then find the
page if it is within i_size, and then check truncate_count under the page
table lock and back out and retry if it had subsequently been changed (ptl
will serialise against unmapping, and ensure a potentially updated
truncate_count is actually visible).

Complexity and documentation issues aside, the locking protocol fails in the
case where we would like to invalidate pagecache inside i_size. do_no_page
can come in anytime and filemap_nopage is not aware of the invalidation in
progress (as it is when it is outside i_size). The end result is that
dangling (->mapping == NULL) pages that appear to be from a particular file
may be mapped into userspace with nonsense data. Valid mappings to the same
place will see a different page.

Andrea implemented two working fixes, one using a real seqlock, another using
a page->flags bit. He also proposed using the page lock in do_no_page, but
that was initially considered too heavyweight. However, it is not a global or
per-file lock, and the page cacheline is modified in do_no_page to increment
_count and _mapcount anyway, so a further modification should not be a large
performance hit. Scalability is not an issue.

This patch implements this latter approach. ->nopage implementations return
with the page locked if it is possible for their underlying file to be
invalidated (in that case, they must set a special vm_flags bit to indicate
so). do_no_page only unlocks the page after setting up the mapping
completely. invalidation is excluded because it holds the page lock during
invalidation of each page (and ensures that the page is not mapped while
holding the lock).

This also allows significant simplifications in do_no_page, because we have
the page locked in the right place in the pagecache from the start.

Signed-off-by: Nick Piggin
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Nick Piggin
2007-07-20 01:04:41 +0800

19 Jul, 2007

1 commit

60446067b gfs2: stop giving out non-cluster-coherent leases ... Browse Code »

Since gfs2 can't prevent conflicting opens or leases on other nodes, we
probably shouldn't allow it to give out leases at all.

Put the newly defined lease operation into use in gfs2 by turning off
lease, unless we're using the "nolock' locking module (in which case all
locking is local anyway).

Signed-off-by: Marc Eshel
Signed-off-by: J. Bruce Fields
Cc: Steven Whitehouse

Marc Eshel
2007-07-19 07:17:19 +0800

11 Jul, 2007

1 commit

1b21f458d Merge git://git.kernel.org/pub/scm/linux/kernel/git/steve/gfs2-2.6-nmw ... Browse Code »

* git://git.kernel.org/pub/scm/linux/kernel/git/steve/gfs2-2.6-nmw: (57 commits)
[GFS2] Accept old format NFS filehandles
[GFS2] Small fixes to logging code
[DLM] dump more lock values
[GFS2] Remove i_mode passing from NFS File Handle
[GFS2] Obtaining no_formal_ino from directory entry
[GFS2] git-gfs2-nmw-build-fix
[GFS2] System won't suspend with GFS2 file system mounted
[GFS2] remounting w/o acl option leaves acls enabled
[GFS2] inode size inconsistency
[DLM] Telnet to port 21064 can stop all lockspaces
[GFS2] Fix gfs2_block_truncate_page err return
[GFS2] Addendum to the journaled file/unmount patch
[GFS2] Simplify multiple glock aquisition
[GFS2] assertion failure after writing to journaled file, umount
[GFS2] Use zero_user_page() in stuffed_readpage()
[GFS2] Remove bogus '\0' in rgrp.c
[GFS2] Journaled file write/unstuff bug
[DLM] don't require FS flag on all nodes
[GFS2] Fix deallocation issues
[GFS2] return conflicts for GETLK
...

Linus Torvalds
2007-07-11 04:56:13 +0800

10 Jul, 2007

1 commit

5ffc4ef45 sendfile: remove .sendfile from filesystems that use generic_file_sendfile() ... Browse Code »

They can use generic_file_splice_read() instead. Since sys_sendfile() now
prefers that, there should be no change in behaviour.

Signed-off-by: Jens Axboe

Jens Axboe
2007-07-10 14:04:13 +0800

09 Jul, 2007

1 commit

dbb7cae2a [GFS2] Clean up inode number handling ... Browse Code »

This patch cleans up the inode number handling code. The main difference
is that instead of looking up the inodes using a struct gfs2_inum_host
we now use just the no_addr member of this structure. The tests relating
to no_formal_ino can then be done by the calling code. This has
advantages in that we want to do different things in different code
paths if the no_formal_ino doesn't match. In the NFS patch we want to
return -ESTALE, but in the ->lookup() path, its a bug in the fs if the
no_formal_ino doesn't match and thus we can withdraw in this case.

In order to later fix bz #201012, we need to be able to look up an inode
without knowing no_formal_ino, as the only information that is known to
us is the on-disk location of the inode in question.

This patch will also help us to fix bz #236099 at a later date by
cleaning up a lot of the code in that area.

There are no user visible changes as a result of this patch and there
are no changes to the on-disk format either.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2007-07-09 15:22:24 +0800

09 May, 2007

1 commit

e63340ae6 header cleaning: don't include smp_lock.h when not used ... Browse Code »

Remove includes of where it is not used/needed.
Suggested by Al Viro.

Builds cleanly on x86_64, i386, alpha, ia64, powerpc, sparc,
sparc64, and arm (all 59 defconfigs).

Signed-off-by: Randy Dunlap
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Randy Dunlap
2007-05-09 02:15:07 +0800

07 May, 2007

2 commits

586759f03 gfs2: nfs lock support for gfs2 ... Browse Code »

Add NFS lock support to GFS2.

Signed-off-by: Marc Eshel
Signed-off-by: J. Bruce Fields
Acked-by: Steven Whitehouse

Marc Eshel
2007-05-07 08:38:50 +0800
9d6a8c5c2 locks: give posix_test_lock same interface as ->lock ... Browse Code »

posix_test_lock() and ->lock() do the same job but have gratuitously
different interfaces. Modify posix_test_lock() so the two agree,
simplifying some code in the process.

Signed-off-by: Marc Eshel
Signed-off-by: "J. Bruce Fields"

Marc Eshel
2007-05-07 05:39:00 +0800

15 Feb, 2007

1 commit

cd354f1ae [PATCH] remove many unneeded #includes of sched.h ... Browse Code »

After Al Viro (finally) succeeded in removing the sched.h #include in module.h
recently, it makes sense again to remove other superfluous sched.h includes.
There are quite a lot of files which include it but don't actually need
anything defined in there. Presumably these includes were once needed for
macros that used to live in sched.h, but moved to other header files in the
course of cleaning it up.

To ease the pain, this time I did not fiddle with any header files and only
removed #includes from .c-files, which tend to cause less trouble.

Compile tested against 2.6.20-rc2 and 2.6.20-rc2-mm2 (with offsets) on alpha,
arm, i386, ia64, mips, powerpc, and x86_64 with allnoconfig, defconfig,
allmodconfig, and allyesconfig as well as a few randconfigs on x86_64 and all
configs in arch/arm/configs on arm. I also checked that no new warnings were
introduced by the patch (actually, some warnings are removed that were emitted
by unnecessarily included header files).

Signed-off-by: Tim Schmielau
Acked-by: Russell King
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Tim Schmielau
2007-02-15 00:09:54 +0800

06 Feb, 2007

1 commit

3699e3a44 [GFS2] Clean up/speed up readdir ... Browse Code »

This removes the extra filldir callback which gfs2 was using to
enclose an attempt at readahead for inodes during readdir. The
code was too complicated and also hurts performance badly in the
case that the getdents64/readdir call isn't being followed by
stat() and it wasn't even getting it right all the time when it
was.

As a result, on my test box an "ls" of a directory containing 250000
files fell from about 7mins (freshly mounted, so nothing cached) to
between about 15 to 25 seconds. When the directory content was cached,
the time taken fell from about 3mins to about 4 or 5 seconds.

Interestingly in the cached case, running "ls -l" once reduced the time
taken for subsequent runs of "ls" to about 6 secs even without this
patch. Now it turns out that there was a special case of glocks being
used for prefetching the metadata, but because of the timeouts for these
locks (set to 10 secs) the metadata was being timed out before it was
being used and this the prefetch code was constantly trying to prefetch
the same data over and over.

Calling "ls -l" meant that the inodes were brought into memory and once
the inodes are cached, the glocks are not disposed of until the inodes
are pushed out of the cache, thus extending the lifetime of the glocks,
and thus bringing down the time for subsequent runs of "ls"
considerably.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2007-02-06 02:37:04 +0800

09 Dec, 2006

1 commit

81454098f [PATCH] struct path: convert gfs2 ... Browse Code »

Signed-off-by: Josef Sipek
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Josef Sipek
2006-12-09 00:28:45 +0800

07 Dec, 2006

1 commit

34126f9f4 [GFS2] Change gfs2_fsync() to use write_inode_now() ... Browse Code »

This is a bit better than the previous version of gfs2_fsync()
although it would be better still if we were able to call a
function which only wrote the inode & metadata. Its no big deal
though that this will potentially write the data as well since
the VFS has already done that before calling gfs2_fsync(). I've
also added a comment to explain whats going on here.

Signed-off-by: Steven Whitehouse
Cc: Andrew Morton

Steven Whitehouse
2006-12-07 22:13:14 +0800

30 Nov, 2006

7 commits

33c3de328 [GFS2] Don't flush everything on fdatasync ... Browse Code »

The gfs2_fsync() function was doing a journal flush on each
and every call. While this is correct, its also a lot of
overhead. This patch means that on fdatasync flushes we
rely on the VFS to flush the data for us and we don't do
a journal flush unless we really need to.

We have to do a journal flush for stuffed files though because
they have the data and the inode metadata in the same block.
Journaled files also need a journal flush too of course.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-11-30 23:37:44 +0800
dcd247995 [GFS2] Remove unused function from inode.c ... Browse Code »

The gfs2_glock_nq_m_atime function is unused in so far as its only
ever called with num_gh = 1, and this falls through to the
gfs2_glock_nq_atime function, so we might as well call that directly.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-11-30 23:35:57 +0800
6b124d8db [GFS2] Only set inode flags when required ... Browse Code »

We were setting the inode flags from GFS2's flags far too often, even when they
couldn't possibly have changed. This patch reduces the amount of flag
setting going on so that we do it only when the inode is read in or
when the flags have changed. The create case is covered by the "when
the inode is read in" case.

This also fixes a bug where we didn't set S_SYNC correctly.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-11-30 23:34:45 +0800
b60623c23 [GFS2] Shrink gfs2_inode (3) - di_mode ... Browse Code »

This removes the duplicate di_mode field in favour of using the
inode->i_mode field. This saves 4 bytes.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-11-30 23:34:14 +0800
539e5d6b7 [GFS2] Change argument of gfs2_dinode_out ... Browse Code »

Everywhere this was called, a struct gfs2_inode was available,
but despite that, it was always called with a struct gfs2_dinode
as an argument. By making this change it paves the way to start
eliminating fields duplicated between the kernel's struct inode
and the struct gfs2_dinode.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-11-30 23:33:54 +0800
9c9ab3d54 [GFS2] gfs2 __user misannotation fix ... Browse Code »

Signed-off-by: Al Viro
Signed-off-by: Steven Whitehouse

Al Viro
2006-11-30 23:33:49 +0800
629a21e7e [GFS2] split and annotate gfs2_inum ... Browse Code »

Signed-off-by: Al Viro
Signed-off-by: Steven Whitehouse

Al Viro
2006-11-30 23:33:32 +0800

02 Oct, 2006

4 commits

128e5ebaf [GFS2] Remove iflags.h, use FS_ ... Browse Code »

Update GFS2 in the light of David Howells' patch:

[PATCH] BLOCK: Move common FS-specific ioctls to linux/fs.h [try #6]
36695673b012096228ebdc1b39a6a5850daa474e

which calls the filesystem independant flags FS_..._FL. As a result
we no longer need the flags.h file and the conversion routine is
moved into the GFS2 source code.

Userland programs which used to include iflags.h should now include
fs.h and use the new flag names.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-10-02 23:24:43 +0800
d00223f16 [GFS2] Fix code style/indent in ops_file.c ... Browse Code »

Fix a couple of minor issues.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-10-02 22:28:05 +0800
930cc237d [GFS2] streamline-generic_file_-interfaces-and-filemap gfs fix ... Browse Code »

Fix GFS for streamline-generic_file_-interfaces-and-filemap.patch

Signed-off-by: Andrew Morton
Signed-off-by: Steven Whitehouse

Andrew Morton
2006-10-02 21:02:54 +0800
9c9eb21ee [GFS2] Remove readv/writev methods and use aio_read/aio_write instead (gfs bits) ... Browse Code »

This patch removes readv() and writev() methods and replaces them with
aio_read()/aio_write() methods.

Signed-off-by: Badari Pulavarty
Signed-off-by: Christoph Hellwig
Signed-off-by: Andrew Morton
Signed-off-by: Steven Whitehouse

Badari Pulavarty
2006-10-02 21:02:12 +0800

25 Sep, 2006

1 commit

907b9bceb [GFS2/DLM] Fix trailing whitespace ... Browse Code »

As per Andrew Morton's request, removed trailing whitespace.

Cc: Andrew Morton
Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-09-25 21:26:04 +0800

20 Sep, 2006

1 commit

f0e522a90 [GFS2] Remove "NFS only" readdir path ... Browse Code »

This code path shouldn't be needed, so remove it for now. This
tidys things up.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-09-20 04:41:11 +0800

19 Sep, 2006

1 commit

7d308590a [GFS2] Export lm_interface to kernel headers ... Browse Code »

lm_interface.h has a few out of the tree clients such as GFS1
and userland tools.

Right now, these clients keeps a copy of the file in their build tree
that can go out of sync.

Move lm_interface.h to include/linux, export it to userland and
clean up fs/gfs2 to use the new location.

Signed-off-by: Fabio M. Di Nitto
Signed-off-by: Steven Whitehouse

Fabio Massimo Di Nitto
2006-09-19 20:45:18 +0800

05 Sep, 2006

4 commits

26c1a5741 [GFS2] More code style updates ... Browse Code »

As per Jan Engelhardt's fifth email. This has most of the changes
recommended, which is the removal of casts which are not required,
some indenting fixes and similar.

Cc: Jan Engelhardt
Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-09-05 03:32:10 +0800
dd538c832 [GFS2] Spelling sentinal -> sentinel ... Browse Code »

A spelling mistake (one of mine).

Cc: Jan Engelhardt
Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-09-05 02:53:30 +0800
cd915493f [GFS2] Change all types to uX style ... Browse Code »

This makes all fixed size types have consistent names.

Cc: Jan Engelhardt
Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-09-05 00:49:07 +0800
a91ea69ff [GFS2] Align all labels against LH side ... Browse Code »

This makes everything consistent.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-09-05 00:04:26 +0800

01 Sep, 2006

1 commit

e9fc2aa09 [GFS2] Update copyright, tidy up incore.h ... Browse Code »

As per comments from Jan Engelhardt this
updates the copyright message to say "version" in full rather than
"v.2". Also incore.h has been updated to remove forward structure
declarations which are not required.

The gfs2_quota_lvb structure has now had endianess annotations added
to it. Also quota.c has been updated so that we now store the
lvb data locally in endian independant format to avoid needing
a structure in host endianess too. As a result the endianess
conversions are done as required at various points and thus the
conversion routines in lvb.[ch] are no longer required. I've
moved the one remaining constant in lvb.h thats used into lm.h
and removed the unused lvb.[ch].

I have not changed the HIF_ constants. That is left to a later patch
which I hope will unify the gh_flags and gh_iflags fields of the
struct gfs2_holder.

Cc: Jan Engelhardt
Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-09-01 23:05:15 +0800

01 Aug, 2006

1 commit

420b9e5e4 [GFS2] Tidy up in various files ... Browse Code »

Tidy up some files and remove an unused routine in meta_io.h. Also
added a bit of extra debugging in meta_io.h.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-08-01 03:42:17 +0800

26 Jul, 2006

1 commit

a9e5f4d07 [GFS2] Alter direct I/O path ... Browse Code »

As per comments received, alter the GFS2 direct I/O path so that
it uses the standard read functions "out of the box". Needs a
small change to one of the VFS functions. This reduces the size
of the code quite a lot and also removes the need for one new export.

Some more work remains to be done, but this is the bones of the
thing.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-07-26 05:24:12 +0800

21 Jul, 2006

1 commit

52f341cf7 [GFS2] gfs2_set_flags double locking patch ... Browse Code »

traced the "umount hang due to spurious glock" issue that I was having
with gfs2meta. It's in the do_gfs2_set_flags function, which does a
gfs2_holder_init as well as a gfs2_glock_nq_init (increases ref count by
2 instead of 1).

Signed-off-by: Abhijith Das
Signed-off-by: Steven Whitehouse

Abhijith Das
2006-07-21 14:03:21 +0800

11 Jul, 2006

1 commit

4340fe625 [GFS2] Add generation number ... Browse Code »

This adds a generation number for the eventual use of NFS to the
ondisk inode. Its backward compatible with the current code since
it doesn't really matter what the generation number is to start with,
and indeed since its set to zero, due to it being taken from padding
in both the inode and rgrp header, it should be fine.

The eventual plan is to use this rather than no_formal_ino in the
NFS filehandles. At that point no_formal_ino will be unused.

At the same time we also add a releasepages call back to the
"normal" address space for gfs2 inodes. Also I've removed a
one-linrer function thats not required any more.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-07-11 21:46:33 +0800

04 Jul, 2006

1 commit

b0dd9308b [GFS2] Mark file_operations const ... Browse Code »

As per Arjan's patches:
http://www.kernel.org/git/?p=linux/kernel/git/steve/gfs2-2.6.git;a=commitdiff;h=99ac48f54a91d02140c497edc31dc57d4bc5c85d
and
http://www.kernel.org/git/?p=linux/kernel/git/steve/gfs2-2.6.git;a=commitdiff;h=4b6f5d20b04dcbc3d888555522b90ba6d36c4106

make the GFS2 file_operations structures const.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-07-04 01:47:02 +0800

24 Jun, 2006

1 commit

a53311d4d [GFS2] Use generic_file_sendfile directly ... Browse Code »

Don't use a wrapper for generic_file_sendfile but call it
directly.

Cc: Christoph Hellwig
Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-06-24 04:16:29 +0800

22 Jun, 2006

1 commit

faf450ef4 [GFS2] Remove gfs2_repermission ... Browse Code »

gfs2_repermission is just a wrapper for permission, so remove it and
call permission directly where required.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-06-22 22:59:10 +0800