Eric Lee / linux-smarc-t335x-v3.2

21 May, 2010

1 commit

ed4878e8a GFS2: Rework reclaiming unlinked dinodes ... Browse Code »

The previous patch I wrote for reclaiming unlinked dinodes
had some shortcomings and did not prevent all hangs.
This version is much cleaner and more logical, and has
passed very difficult testing. Sorry for the churn.

Signed-off-by: Bob Peterson
Signed-off-by: Steven Whitehouse

Bob Peterson
2010-05-21 23:11:36 +0800

14 Apr, 2010

1 commit

1a0eae884 GFS2: glock livelock ... Browse Code »

This patch fixes a couple gfs2 problems with the reclaiming of
unlinked dinodes. First, there were a couple of livelocks where
everything would come to a halt waiting for a glock that was
seemingly held by a process that no longer existed. In fact, the
process did exist, it just had the wrong pid number in the holder
information. Second, there was a lock ordering problem between
inode locking and glock locking. Third, glock/inode contention
could sometimes cause inodes to be improperly marked invalid by
iget_failed.

Signed-off-by: Bob Peterson

Bob Peterson
2010-04-14 23:48:05 +0800

22 May, 2009

4 commits

87ec21741 GFS2: Move gfs2_unlink_ok into ops_inode.c ... Browse Code »

Another function which is only called from one ops_inode.c so
we move it and make it static.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2009-05-22 17:54:50 +0800
536baf02f GFS2: Move gfs2_readlinki into ops_inode.c ... Browse Code »

Move gfs2_readlinki into ops_inode.c and make it static

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2009-05-22 17:48:59 +0800
2286dbfad GFS2: Move gfs2_rmdiri into ops_inode.c ... Browse Code »

Move gfs2_rmdiri() into ops_inode.c and make it static.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2009-05-22 17:45:09 +0800
b1e71b062 GFS2: Clean up some file names ... Browse Code »

This patch renames the ops_*.c files which have no counterpart
without the ops_ prefix in order to shorten the name and make
it more readable. In addition, ops_address.h (which was very
small) is moved into inode.h and inode.h is cleaned up by
adding extern where required.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2009-05-22 17:01:55 +0800

15 Apr, 2009

1 commit

10d219880 GFS2: cleanup file_operations mess ... Browse Code »

Remove the weird pointer to file_operations mess and replace it with
straight-forward defining of the lockinginstance names to the _nolock
variants.

Signed-off-by: Christoph Hellwig
Signed-off-by: Steven Whitehouse

Christoph Hellwig
2009-04-15 17:17:18 +0800

24 Mar, 2009

1 commit

f057f6cdf GFS2: Merge lock_dlm module into GFS2 ... Browse Code »

This is the big patch that I've been working on for some time
now. There are many reasons for wanting to make this change
such as:
o Reducing overhead by eliminating duplicated fields between structures
o Simplifcation of the code (reduces the code size by a fair bit)
o The locking interface is now the DLM interface itself as proposed
some time ago.
o Fewer lookups of glocks when processing replies from the DLM
o Fewer memory allocations/deallocations for each glock
o Scope to do further optimisations in the future (but this patch is
more than big enough for now!)

Please note that (a) this patch relates to the lock_dlm module and
not the DLM itself, that is still a separate module; and (b) that
we retain the ability to build GFS2 as a standalone single node
filesystem with out requiring the DLM.

This patch needs a lot of testing, hence my keeping it I restarted
my -git tree after the last merge window. That way, this has the maximum
exposure before its merged. This is (modulo a few minor bug fixes) the
same patch that I've been posting on and off the the last three months
and its passed a number of different tests so far.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2009-03-24 19:21:14 +0800

05 Jan, 2009

2 commits

383f01fbf GFS2: Banish struct gfs2_dinode_host ... Browse Code »

The final field in gfs2_dinode_host was the i_flags field. Thats
renamed to i_diskflags in order to avoid confusion with the existing
inode flags, and moved into the inode proper at a suitable location
to avoid creating a "hole".

At that point struct gfs2_dinode_host is no longer needed and as
promised (quite some time ago!) it can now be removed completely.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2009-01-05 15:38:59 +0800
b27605837 GFS2: Rationalise header files ... Browse Code »

Move the contents of some headers which contained very
little into more sensible places, and remove the original
header files. This should make it easier to find things.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2009-01-05 15:38:48 +0800

18 Sep, 2008

1 commit

719ee3446 GFS2: high time to take some time over atime ... Browse Code »

Until now, we've used the same scheme as GFS1 for atime. This has failed
since atime is a per vfsmnt flag, not a per fs flag and as such the
"noatime" flag was not getting passed down to the filesystems. This
patch removes all the "special casing" around atime updates and we
simply use the VFS's atime code.

The net result is that GFS2 will now support all the same atime related
mount options of any other filesystem on a per-vfsmnt basis. We do lose
the "lazy atime" updates, but we gain "relatime". We could add lazy
atime to the VFS at a later date, if there is a requirement for that
variant still - I suspect relatime will be enough.

Also we lose about 100 lines of code after this patch has been applied,
and I have a suspicion that it will speed things up a bit, even when
atime is "on". So it seems like a nice clean up as well.

From a user perspective, everything stays the same except the loss of
the per-fs atime quantum tweekable (ought to be per-vfsmnt at the very
least, and to be honest I don't think anybody ever used it) and that a
number of options which were ignored before now work correctly.

Please let me know if you've got any comments. I'm pushing this out
early so that you can all see what my plans are.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2008-09-18 20:53:59 +0800

27 Aug, 2008

1 commit

0188d6c58 GFS2: Fix & clean up GFS2 rename ... Browse Code »

This patch fixes a locking issue in the rename code by ensuring that we hold
the per sb rename lock over both directory and "other" renames which involve
different parent directories.

At the same time, this moved the (only called from one place) function
gfs2_ok_to_move into the file that its called from, so we can mark it
static. This should make a code a bit easier to follow.

Signed-off-by: Steven Whitehouse
Cc: Peter Staubach

Steven Whitehouse
2008-08-27 20:33:10 +0800

27 Jul, 2008

1 commit

a569c711f [PATCH] don't pass nameidata to gfs2_lookupi() ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2008-07-27 08:53:36 +0800

10 Jul, 2008

1 commit

a93a6ce24 [GFS2] Remove unused declaration ... Browse Code »

The implementation of gfs2_inode_attr_in is removed.
So remove its declaration.

Signed-off-by: Li Xiaodong
Signed-off-by: Steven Whitehouse

Li Xiaodong
2008-07-10 23:22:23 +0800

03 Jul, 2008

1 commit

f58ba8891 [GFS2] don't call permission() ... Browse Code »

GFS2 calls permission() to verify permissions after locks on the files
have been taken.

For this it's sufficient to call gfs2_permission() instead. This
results in the following changes:

- IS_RDONLY() check is not performed
- IS_IMMUTABLE() check is not performed
- devcgroup_inode_permission() is not called
- security_inode_permission() is not called

IS_RDONLY() should be unnecessary anyway, as the per-mount read-only
flag should provide protection against read-only remounts during
operations. do_gfs2_set_flags() has been fixed to perform
mnt_want_write()/mnt_drop_write() to protect against remounting
read-only.

IS_IMMUTABLE has been added to gfs2_permission()

Repeating the security checks seems to be pointless, as they don't
normally change, and if they do, it's independent of the filesystem
state.

Signed-off-by: Miklos Szeredi
Signed-off-by: Steven Whitehouse

Miklos Szeredi
2008-07-03 17:22:01 +0800

31 Mar, 2008

2 commits

77658aad2 [GFS2] Eliminate (almost) duplicate field from gfs2_inode ... Browse Code »

The blocks counter is almost a duplicate of the i_blocks
field in the VFS inode. The only difference is that i_blocks
can be only 32bits long for 32bit arch without large single file
support. Since GFS2 doesn't handle the non-large single file
case (for 32 bit anyway) this adds a new config dependency on
64BIT || LSF. This has always been the case, however we've never
explicitly said so before.

Even if we do add support for the non-LSF case, we will still
not require this field to be duplicated since we will not be
able to access oversized files anyway.

So the net result of all this is that we shave 8 bytes from a gfs2_inode
and get our config deps correct.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2008-03-31 17:40:55 +0800
ecc30c791 [GFS2] Streamline indirect pointer tree height calculation ... Browse Code »

This patch improves the calculation of the tree height in order to reduce
the number of operations which are carried out on each call to gfs2_block_map.
In the common case, we now make a single comparison, rather than calculating
the required tree height from scratch each time. Also in the case that the
tree does need some extra height, we start from the current height rather from
zero when we work out what the new height ought to be.

In addition the di_height field is moved into the inode proper and reduced
in size to a u8 since the value must be between 0 and GFS2_MAX_META_HEIGHT (10).

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2008-03-31 17:39:46 +0800

25 Jan, 2008

2 commits

5561093e2 [GFS2] Introduce gfs2_set_aops() ... Browse Code »

Just like ext3 we now have three sets of address space operations
to cover the cases of writeback, ordered and journalled data
writes. This means that the individual operations can now become
less complicated as we are able to remove some of the tests for
file data mode from the code.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2008-01-25 16:07:23 +0800
bf36a7131 [GFS2] Add gfs2_is_writeback() ... Browse Code »

This adds a function "gfs2_is_writeback()" along the lines of the
existing "gfs2_is_jdata()" in order to clean up the code and make
the various tests for the inode mode more obvious. It also fixes
the PageChecked() logic where we were resetting the flag too early
in the case of an error path.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2008-01-25 16:07:21 +0800

10 Oct, 2007

1 commit

7a9f53b3c [GFS2] Alternate gfs2_iget to avoid looking up inodes being freed ... Browse Code »

There is a possible deadlock between two processes on the same node, where one
process is deleting an inode, and another process is looking for allocated but
unused inodes to delete in order to create more space.

process A does an iput() on inode X, and it's i_count drops to 0. This causes
iput_final() to be called, which puts an inode into state I_FREEING at
generic_delete_inode(). There no point between when iput_final() is called, and
when I_FREEING is set where GFS2 could acquire any glocks. Once I_FREEING is
set, no other process on that node can successfully look up that inode until
the delete finishes.

process B locks the the resource group for the same inode in get_local_rgrp(),
which is called by gfs2_inplace_reserve_i()

process A tries to lock the resource group for the inode in
gfs2_dinode_dealloc(), but it's already locked by process B

process B waits in find_inode for the inode to have the I_FREEING state cleared.

Deadlock.

This patch solves the problem by adding an alternative to gfs2_iget(),
gfs2_iget_skip(), that simply skips any inodes that are in the I_FREEING
state.o The alternate test function is just like the original one, except that
it fails if the inode is being freed, and sets a skipped flag. The alternate
set function is just like the original, except that it fails if the skipped
flag is set. Only try_rgrp_unlink() calls gfs2_iget_skip() instead of
gfs2_iget().

Signed-off-by: Benjamin E. Marzinski
Signed-off-by: Steven Whitehouse

Benjamin Marzinski
2007-10-10 15:56:29 +0800

09 Jul, 2007

4 commits

35dcc52e3 [GFS2] Remove i_mode passing from NFS File Handle ... Browse Code »

GFS2 has been passing i_mode within NFS File Handle. Other than the
wrong assumption that there is always room for this extra 16 bit value,
the current gfs2_get_dentry doesn't really need the i_mode to work
correctly. Note that GFS2 NFS code does go thru the same lookup code
path as direct file access route (where the mode is obtained from name
lookup) but gfs2_get_dentry() is coded for different purpose. It is not
used during lookup time. It is part of the file access procedure call.
When the call is invoked, if on-disk inode is not in-memory, it has to
be read-in. This makes i_mode passing a useless overhead.

Signed-off-by: S. Wendy Cheng
Signed-off-by: Steven Whitehouse

Wendy Cheng
2007-07-09 15:24:11 +0800
bb9bcf061 [GFS2] Obtaining no_formal_ino from directory entry ... Browse Code »

GFS2 lookup code doesn't ask for inode shared glock. This implies during
in-memory inode creation for existing file, GFS2 will not disk-read in
the inode contents. This leaves no_formal_ino un-initialized during
lookup time. The un-initialized no_formal_ino is subsequently encoded
into file handle. Clients will get ESTALE error whenever it tries to
access these files.

Signed-off-by: S. Wendy Cheng
Signed-off-by: Steven Whitehouse

Wendy Cheng
2007-07-09 15:24:08 +0800
bb8d8a6f5 [GFS2] Fix sign problem in quota/statfs and cleanup _host structures ... Browse Code »

This patch fixes some sign issues which were accidentally introduced
into the quota & statfs code during the endianess annotation process.
Also included is a general clean up which moves all of the _host
structures out of gfs2_ondisk.h (where they should not have been to
start with) and into the places where they are actually used (often only
one place). Also those _host structures which are not required any more
are removed entirely (which is the eventual plan for all of them).

The conversion routines from ondisk.c are also moved into the places
where they are actually used, which for almost every one, was just one
single place, so all those are now static functions. This also cleans up
the end of gfs2_ondisk.h which no longer needs the #ifdef __KERNEL__.

The net result is a reduction of about 100 lines of code, many functions
now marked static plus the bug fixes as mentioned above. For good
measure I ran the code through sparse after making these changes to
check that there are no warnings generated.

This fixes Red Hat bz #239686

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2007-07-09 15:23:10 +0800
dbb7cae2a [GFS2] Clean up inode number handling ... Browse Code »

This patch cleans up the inode number handling code. The main difference
is that instead of looking up the inodes using a struct gfs2_inum_host
we now use just the no_addr member of this structure. The tests relating
to no_formal_ino can then be done by the calling code. This has
advantages in that we want to do different things in different code
paths if the no_formal_ino doesn't match. In the NFS patch we want to
return -ESTALE, but in the ->lookup() path, its a bug in the fs if the
no_formal_ino doesn't match and thus we can withdraw in this case.

In order to later fix bz #201012, we need to be able to look up an inode
without knowing no_formal_ino, as the only information that is known to
us is the on-disk location of the inode in question.

This patch will also help us to fix bz #236099 at a later date by
cleaning up a lot of the code in that area.

There are no user visible changes as a result of this patch and there
are no changes to the on-disk format either.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2007-07-09 15:22:24 +0800

30 Nov, 2006

6 commits

dcd247995 [GFS2] Remove unused function from inode.c ... Browse Code »

The gfs2_glock_nq_m_atime function is unused in so far as its only
ever called with num_gh = 1, and this falls through to the
gfs2_glock_nq_atime function, so we might as well call that directly.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-11-30 23:35:57 +0800
9e2dbdac3 [GFS2] Remove gfs2_inode_attr_in ... Browse Code »

This function wasn't really doing the right thing. There was no need
to update the inode size at this point and the updating of the
i_blocks field has now been moved to the places where di_blocks is
updated. A result of this patch and some those preceeding it is that
unlocking a glock is now a much more efficient process, since there
is no longer any requirement to copy data from the gfs2 inode into
the vfs inode at this point.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-11-30 23:34:52 +0800
1a7b1eed5 [GFS2] Shrink gfs2_inode (6) - di_atime/di_mtime/di_ctime ... Browse Code »

Remove the di_[amc]time fields and use inode->i_[amc]time
fields instead. This saves 24 bytes from the gfs2_inode.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-11-30 23:34:23 +0800
b60623c23 [GFS2] Shrink gfs2_inode (3) - di_mode ... Browse Code »

This removes the duplicate di_mode field in favour of using the
inode->i_mode field. This saves 4 bytes.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-11-30 23:34:14 +0800
e7f14f4d0 [GFS2] Shrink gfs2_inode (2) - di_major/di_minor ... Browse Code »

This removes the device numbers from this structure by using
inode->i_rdev instead. It also cleans up the code in gfs2_mknod.
It results in shrinking the gfs2_inode by 8 bytes.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-11-30 23:34:11 +0800
629a21e7e [GFS2] split and annotate gfs2_inum ... Browse Code »

Signed-off-by: Al Viro
Signed-off-by: Steven Whitehouse

Al Viro
2006-11-30 23:33:32 +0800

25 Sep, 2006

1 commit

907b9bceb [GFS2/DLM] Fix trailing whitespace ... Browse Code »

As per Andrew Morton's request, removed trailing whitespace.

Cc: Andrew Morton
Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-09-25 21:26:04 +0800

01 Sep, 2006

1 commit

e9fc2aa09 [GFS2] Update copyright, tidy up incore.h ... Browse Code »

As per comments from Jan Engelhardt this
updates the copyright message to say "version" in full rather than
"v.2". Also incore.h has been updated to remove forward structure
declarations which are not required.

The gfs2_quota_lvb structure has now had endianess annotations added
to it. Also quota.c has been updated so that we now store the
lvb data locally in endian independant format to avoid needing
a structure in host endianess too. As a result the endianess
conversions are done as required at various points and thus the
conversion routines in lvb.[ch] are no longer required. I've
moved the one remaining constant in lvb.h thats used into lm.h
and removed the unused lvb.[ch].

I have not changed the HIF_ constants. That is left to a later patch
which I hope will unify the gh_flags and gh_iflags fields of the
struct gfs2_holder.

Cc: Jan Engelhardt
Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-09-01 23:05:15 +0800

22 Jun, 2006

1 commit

faf450ef4 [GFS2] Remove gfs2_repermission ... Browse Code »

gfs2_repermission is just a wrapper for permission, so remove it and
call permission directly where required.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-06-22 22:59:10 +0800

15 Jun, 2006

1 commit

feaa7bba0 [GFS2] Fix unlinked file handling ... Browse Code »

This patch fixes the way we have been dealing with unlinked,
but still open files. It removes all limits (other than memory
for inodes, as per every other filesystem) on numbers of these
which we can support on GFS2. It also means that (like other
fs) its the responsibility of the last process to close the file
to deallocate the storage, rather than the person who did the
unlinking. Note that with GFS2, those two events might take place
on different nodes.

Also there are a number of other changes:

o We use the Linux inode subsystem as it was intended to be
used, wrt allocating GFS2 inodes
o The Linux inode cache is now the point which we use for
local enforcement of only holding one copy of the inode in
core at once (previous to this we used the glock layer).
o We no longer use the unlinked "special" file. We just ignore it
completely. This makes unlinking more efficient.
o We now use the 4th block allocation state. The previously unused
state is used to track unlinked but still open inodes.
o gfs2_inoded is no longer needed
o Several fields are now no longer needed (and removed) from the in
core struct gfs2_inode
o Several fields are no longer needed (and removed) from the in core
superblock

There are a number of future possible optimisations and clean ups
which have been made possible by this patch.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-06-15 03:32:57 +0800

19 May, 2006

1 commit

3a8a9a103 [GFS2] Update copyright date to 2006 ... Browse Code »

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-05-19 03:09:15 +0800

28 Apr, 2006

1 commit

363275216 [GFS2] Reordering in deallocation to avoid recursive locking ... Browse Code »

Despite my earlier careful search, there was a recursive lock left
in the deallocation code. This removes it. It also should speed up
deallocation be reducing the number of locking operations which take
place by using two "try lock" operations on the two locks involved in
inode deallocation which allows us to grab the locks out of order
(compared with NFS which grabs the inode lock first and the iopen
lock later). It is ok for us to fail while doing this since if it
does fail it means that someone else is still using the inode and
thus it wouldn't be possible to deallocate anyway.

This fixes the bug reported to me by Rob Kenna.

Cc: Rob Kenna
Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-04-28 22:46:21 +0800

21 Mar, 2006

1 commit

c752666c1 [GFS2] Fix bug in directory code and tidy up ... Browse Code »

Due to a typo, the dir leaf split operation was (for the first
split in a directory) writing the new hash vaules at the
wrong offset. This is now fixed.

Also some other tidy ups are included:

- We use GFS2's hash function for dentries (see ops_dentry.c) so that
we don't have to keep recalculating the hash values.
- A lot of common code is eliminated between the various directory
lookup routines.
- Better error checking on directory lookup (previously different
routines checked for different errors)
- The leaf split operation has a couple of redundant operations
removed from it, so it should be faster.

There is still further scope for further clean ups in the directory
code, and readdir in particular could do with slimming down a bit.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-03-21 01:30:04 +0800

02 Mar, 2006

1 commit

c9fd43078 [GFS2] Tidy up mount code. ... Browse Code »

We no longer lookup ".gfs2_admin" in the root directory in order to
find it, but instead use the inode number given in the superblock.
Both the root directory and the admin directory are now looked up using
the same routine, so the redundant code is removed.

Also, there is no longer a reference to the root inode in the
GFS2 super block. When required this can be retreived via
sb->s_root->d_inode instead.

Assuming that we introduce a metadata filesystem type for GFS, then
this is a first step towards that goal.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-03-02 04:31:02 +0800

28 Feb, 2006

1 commit

568f4c965 [GFS2] 80 Column audit of GFS2 ... Browse Code »

Requested by:
Prarit Bhargava

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-02-28 01:00:42 +0800

13 Feb, 2006

1 commit

7359a19cc [GFS2] Fix for root inode ref count bug ... Browse Code »

Umount is now working correctly again. The bug was due to
not getting an extra ref count when mounting the fs. We
should have bumped it by two (once for the internal pointer
to the root inode from the super block and once for the
inode hanging off the dcache entry for root).

Also this patch tidys up the code dealing with looking up
and creating inodes. We now pass Linux inodes (with gfs2_inodes
attached) rather than the other way around and this reduces code
duplication in various places.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-02-13 20:27:43 +0800