Eric Lee / smarc-fsl-linux-kernel

03 Aug, 2016

1 commit

47a9a5279 GFS2: use BIT() macro ... Browse Code »

Replace 1 << value shift by more explicit BIT() macro

Also fixes two bare unsigned definitions:

WARNING: Prefer 'unsigned int' to bare use of 'unsigned'
+ unsigned hsize = BIT(ip->i_depth);

Signed-off-by: Fabian Frederick
Signed-off-by: Bob Peterson

Fabian Frederick
2016-08-03 01:05:27 +0800

27 Jun, 2016

2 commits

ec5ec66ba gfs2: Get rid of gfs2_ilookup ... Browse Code »

Now that gfs2_lookup_by_inum only takes the inode glock for new inodes
(and not for cached inodes anymore), there no longer is a need to
optimize the cached-inode case in gfs2_get_dentry or delete_work_func,
and gfs2_ilookup can be removed.

In addition, gfs2_get_dentry wasn't checking the GFS2_DIF_SYSTEM flag in
i_diskflags in the gfs2_ilookup case (see gfs2_lookup_by_inum); this
inconsistency goes away as well.

Signed-off-by: Andreas Gruenbacher
Signed-off-by: Bob Peterson

Andreas Gruenbacher
2016-06-27 22:47:08 +0800
3ce37b2cb gfs2: Fix gfs2_lookup_by_inum lock inversion ... Browse Code »

The current gfs2_lookup_by_inum takes the glock of a presumed inode
identified by block number, verifies that the block is indeed an inode,
and then instantiates and reads the new inode via gfs2_inode_lookup.

However, instantiating a new inode may block on freeing a previous
instance of that inode (__wait_on_freeing_inode), and freeing an inode
requires to take the glock already held, leading to lock inversion and
deadlock.

Fix this by first instantiating the new inode, then verifying that the
block is an inode (if required), and then reading in the new inode, all
in gfs2_inode_lookup.

If the block we are looking for is not an inode, we discard the new
inode via iget_failed, which marks inodes as bad and unhashes them.
Other tasks waiting on that inode will get back a bad inode back from
ilookup or iget_locked; in that case, retry the lookup.

Signed-off-by: Andreas Gruenbacher
Signed-off-by: Bob Peterson

Andreas Gruenbacher
2016-06-27 22:47:07 +0800

15 Mar, 2016

2 commits

73b462d28 GFS2: Eliminate parameter non_block on gfs2_inode_lookup ... Browse Code »

Now that we're not filtering out I_FREEING inodes from our lookups
anymore, we can eliminate the non_block parameter from the lookup
function.

Signed-off-by: Bob Peterson
Acked-by: Steven Whitehouse

Bob Peterson
2016-03-15 22:46:50 +0800
ff34245d5 GFS2: Don't filter out I_FREEING inodes anymore ... Browse Code »

This patch basically reverts a very old patch from 2008,
7a9f53b3c1875bef22ad4588e818bc046ef183da, with the title
"Alternate gfs2_iget to avoid looking up inodes being freed".
The original patch was designed to avoid a deadlock caused by lock
ordering with try_rgrp_unlink. The patch forced the function to not
find inodes that were being removed by VFS. The problem is, that
made it impossible for nodes to delete their own unlinked dinodes
after a certain point in time, because the inode needed was not found
by this filtering process. There is no longer a need for the patch,
since function try_rgrp_unlink no longer locks the inode: All it does
is queue the glock onto the delete work_queue, so there should be no
more deadlock.

Signed-off-by: Bob Peterson
Signed-off-by: Steven Whitehouse

Bob Peterson
2016-03-15 22:46:45 +0800

14 Jun, 2013

1 commit

6d4ade986 GFS2: Add atomic_open support ... Browse Code »

I've restricted atomic_open to only operate on regular files, although
I still don't understand why atomic_open should not be possible also for
directories on GFS2. That can always be added in later though, if it
makes sense.

The ->atomic_open function can be passed negative dentries, which
in most cases means either ENOENT (->lookup) or a call to d_instantiate
(->create). In the GFS2 case though, we need to actually perform the
look up, since we do not know whether there has been a new inode created
on another node. The look up calls d_splice_alias which then tries to
rehash the dentry - so the solution here is to simply check for that
in d_splice_alias. The same issue is likely to affect any other cluster
filesystem implementing ->atomic_open

Signed-off-by: Steven Whitehouse
Cc: Al Viro
Cc: "J. Bruce Fields"
Cc: Jeff Layton

Steven Whitehouse
2013-06-14 18:17:15 +0800

24 Apr, 2012

2 commits

4306629e1 GFS2: Remove unused argument from gfs2_internal_read ... Browse Code »

gfs2_internal_read accepts an unused ra_state argument, left over from
when we did readahead on the rindex. Since there are currently no plans
to add back this readahead, this patch removes the ra_state parameter
and updates the functions which call gfs2_internal_read accordingly.

Signed-off-by: Andrew Price
Signed-off-by: Steven Whitehouse

Andrew Price
2012-04-24 23:44:37 +0800
b120193e3 GFS2: make function gfs2_page_add_databufs static ... Browse Code »

This patch makes function gfs2_page_add_databufs static.

Signed-off-by: Bob Peterson
Signed-off-by: Steven Whitehouse

Bob Peterson
2012-04-24 23:44:28 +0800

21 Oct, 2011

1 commit

ab9bbda02 GFS2: Use ->dirty_inode() ... Browse Code »

The aim of this patch is to use the newly enhanced ->dirty_inode()
super block operation to deal with atime updates, rather than
piggy backing that code into ->write_inode() as is currently
done.

The net result is a simplification of the code in various places
and a reduction of the number of gfs2_dinode_out() calls since
this is now implied by ->dirty_inode().

Some of the mark_inode_dirty() calls have been moved under glocks
in order to take advantage of then being able to avoid locking in
->dirty_inode() when we already have suitable locks.

One consequence is that generic_write_end() now correctly deals
with file size updates, so that we do not need a separate check
for that afterwards. This also, indirectly, means that fdatasync
should work correctly on GFS2 - the current code always syncs the
metadata whether it needs to or not.

Has survived testing with postmark (with and without atime) and
also fsx.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2011-10-21 19:39:26 +0800

20 Jul, 2011

1 commit

10556cb21 ->permission() sanitizing: don't pass flags to ->permission() ... Browse Code »

not used by the instances anymore.

Signed-off-by: Al Viro

Al Viro
2011-07-20 13:43:24 +0800

13 May, 2011

1 commit

160b4026d GFS2: Clean up symlink creation ... Browse Code »

This moves the symlink specific parts of inode creation
into the function where we initialise the rest of the
dinode. As a result we have one less place where we need
to look up the inode's buffer.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2011-05-13 17:34:59 +0800

09 May, 2011

2 commits

94fb763b1 GFS2: Remove gfs2_dinode_print() function ... Browse Code »

This function was intended for debugging purposes, but it is not very
useful. If we want to know what is on disk then all we need is a
block number and gfs2_edit can give us much better information about
what is there. Otherwise, if we are interested in what is stored in
the in-core inode, it doesn't help us out there either.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2011-05-09 23:44:29 +0800
3d6ecb7d1 GFS2: When adding a new dir entry, inc link count if it is a subdir ... Browse Code »

This adds an increment of the link count when we add a new directory
entry, if that entry is itself a directory. This means that we no
longer need separate code to perform this operation.

Now that both adding and removing directory entries automatically
update the parent directory's link count if required, that makes
the code shorter and simpler than before.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2011-05-09 23:43:53 +0800

20 Apr, 2011

2 commits

4667a0ec3 GFS2: Make writeback more responsive to system conditions ... Browse Code »

This patch adds writeback_control to writing back the AIL
list. This means that we can then take advantage of the
information we get in ->write_inode() in order to set off
some pre-emptive writeback.

In addition, the AIL code is cleaned up a bit to make it
a bit simpler to understand.

There is still more which can usefully be done in this area,
but this is a good start at least.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2011-04-20 16:01:37 +0800
f42ab0852 GFS2: Optimise glock lru and end of life inodes ... Browse Code »

The GLF_LRU flag introduced in the previous patch can be
used to check if a glock is on the lru list when a new
holder is queued and if so remove it, without having first
to get the lru_lock.

The main purpose of this patch however is to optimise the
glocks left over when an inode at end of life is being
evicted. Previously such glocks were left with the GLF_LFLUSH
flag set, so that when reclaimed, each one required a log flush.
This patch resets the GLF_LFLUSH flag when there is nothing
left to flush thus preventing later log flushes as glocks are
reused or demoted.

In order to do this, we need to keep track of the number of
revokes which are outstanding, and also to clear the GLF_LFLUSH
bit after a log commit when only revokes have been processed.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2011-04-20 16:01:17 +0800

18 Apr, 2011

1 commit

44ad37d69 GFS2: filesystem hang caused by incorrect lock order ... Browse Code »

This patch fixes a deadlock in GFS2 where two processes are trying
to reclaim an unlinked dinode:
One holds the inode glock and calls gfs2_lookup_by_inum trying to look
up the inode, which it can't, due to I_FREEING. The other has set
I_FREEING from vfs and is at the beginning of gfs2_delete_inode
waiting for the glock, which is held by the first. The solution is to
add a new non_block parameter to the gfs2_iget function that causes it
to return -ENOENT if the inode is being freed.

Signed-off-by: Bob Peterson
Signed-off-by: Steven Whitehouse

Bob Peterson
2011-04-18 22:23:50 +0800

18 Jan, 2011

1 commit

24d9765fc GFS2: Fix error path in gfs2_lookup_by_inum() ... Browse Code »

In the (impossible, except if there is fs corruption) error path
in gfs2_lookup_by_inum() if the call to gfs2_inode_refresh()
fails, it was leaving the function by calling iput() rather
than iget_failed(). This would cause future lookups of the same
inode to block forever.

This patch fixes the problem by moving the call to gfs2_inode_refresh()
into gfs2_inode_lookup() where iget_failed() is part of the error path
already. Also this cleans up some unreachable code and makes
gfs2_set_iop() static.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2011-01-18 22:49:08 +0800

07 Jan, 2011

1 commit

b74c79e99 fs: provide rcu-walk aware permission i_ops ... Browse Code »

Signed-off-by: Nick Piggin

Nick Piggin
2011-01-07 14:50:29 +0800

15 Nov, 2010

1 commit

044b9414c GFS2: Fix inode deallocation race ... Browse Code »

This area of the code has always been a bit delicate due to the
subtleties of lock ordering. The problem is that for "normal"
alloc/dealloc, we always grab the inode locks first and the rgrp lock
later.

In order to ensure no races in looking up the unlinked, but still
allocated inodes, we need to hold the rgrp lock when we do the lookup,
which means that we can't take the inode glock.

The solution is to borrow the technique already used by NFS to solve
what is essentially the same problem (given an inode number, look up
the inode carefully, checking that it really is in the expected
state).

We cannot do that directly from the allocation code (lock ordering
again) so we give the job to the pre-existing delete workqueue and
carry on with the allocation as normal.

If we find there is no space, we do a journal flush (required anyway
if space from a deallocation is to be released) which should block
against the pending deallocations, so we should always get the space
back.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2010-11-15 20:44:42 +0800

20 Sep, 2010

2 commits

3921120e7 GFS2: fallocate support ... Browse Code »

This patch adds support for fallocate to gfs2. Since the gfs2 does not support
uninitialized data blocks, it must write out zeros to all the blocks. However,
since it does not need to lock any pages to read from, gfs2 can write out the
zero blocks much more efficiently. On a moderately full filesystem, fallocate
works around 5 times faster on average. The fallocate call also allows gfs2 to
add blocks to the file without changing the filesize, which will make it
possible for gfs2 to preallocate space for the rindex file, so that gfs2 can
grow a completely full filesystem.

Signed-off-by: Benjamin Marzinski
Signed-off-by: Steven Whitehouse

Benjamin Marzinski
2010-09-20 18:19:17 +0800
a2e0f7993 GFS2: Remove i_disksize ... Browse Code »

With the update of the truncate code, ip->i_disksize and
inode->i_size are merely copies of each other. This means
we can remove ip->i_disksize and use inode->i_size exclusively
reducing the size of a GFS2 inode by 8 bytes.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2010-09-20 18:18:29 +0800

21 May, 2010

1 commit

ed4878e8a GFS2: Rework reclaiming unlinked dinodes ... Browse Code »

The previous patch I wrote for reclaiming unlinked dinodes
had some shortcomings and did not prevent all hangs.
This version is much cleaner and more logical, and has
passed very difficult testing. Sorry for the churn.

Signed-off-by: Bob Peterson
Signed-off-by: Steven Whitehouse

Bob Peterson
2010-05-21 23:11:36 +0800

14 Apr, 2010

1 commit

1a0eae884 GFS2: glock livelock ... Browse Code »

This patch fixes a couple gfs2 problems with the reclaiming of
unlinked dinodes. First, there were a couple of livelocks where
everything would come to a halt waiting for a glock that was
seemingly held by a process that no longer existed. In fact, the
process did exist, it just had the wrong pid number in the holder
information. Second, there was a lock ordering problem between
inode locking and glock locking. Third, glock/inode contention
could sometimes cause inodes to be improperly marked invalid by
iget_failed.

Signed-off-by: Bob Peterson

Bob Peterson
2010-04-14 23:48:05 +0800

22 May, 2009

4 commits

87ec21741 GFS2: Move gfs2_unlink_ok into ops_inode.c ... Browse Code »

Another function which is only called from one ops_inode.c so
we move it and make it static.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2009-05-22 17:54:50 +0800
536baf02f GFS2: Move gfs2_readlinki into ops_inode.c ... Browse Code »

Move gfs2_readlinki into ops_inode.c and make it static

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2009-05-22 17:48:59 +0800
2286dbfad GFS2: Move gfs2_rmdiri into ops_inode.c ... Browse Code »

Move gfs2_rmdiri() into ops_inode.c and make it static.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2009-05-22 17:45:09 +0800
b1e71b062 GFS2: Clean up some file names ... Browse Code »

This patch renames the ops_*.c files which have no counterpart
without the ops_ prefix in order to shorten the name and make
it more readable. In addition, ops_address.h (which was very
small) is moved into inode.h and inode.h is cleaned up by
adding extern where required.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2009-05-22 17:01:55 +0800

15 Apr, 2009

1 commit

10d219880 GFS2: cleanup file_operations mess ... Browse Code »

Remove the weird pointer to file_operations mess and replace it with
straight-forward defining of the lockinginstance names to the _nolock
variants.

Signed-off-by: Christoph Hellwig
Signed-off-by: Steven Whitehouse

Christoph Hellwig
2009-04-15 17:17:18 +0800

24 Mar, 2009

1 commit

f057f6cdf GFS2: Merge lock_dlm module into GFS2 ... Browse Code »

This is the big patch that I've been working on for some time
now. There are many reasons for wanting to make this change
such as:
o Reducing overhead by eliminating duplicated fields between structures
o Simplifcation of the code (reduces the code size by a fair bit)
o The locking interface is now the DLM interface itself as proposed
some time ago.
o Fewer lookups of glocks when processing replies from the DLM
o Fewer memory allocations/deallocations for each glock
o Scope to do further optimisations in the future (but this patch is
more than big enough for now!)

Please note that (a) this patch relates to the lock_dlm module and
not the DLM itself, that is still a separate module; and (b) that
we retain the ability to build GFS2 as a standalone single node
filesystem with out requiring the DLM.

This patch needs a lot of testing, hence my keeping it I restarted
my -git tree after the last merge window. That way, this has the maximum
exposure before its merged. This is (modulo a few minor bug fixes) the
same patch that I've been posting on and off the the last three months
and its passed a number of different tests so far.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2009-03-24 19:21:14 +0800

05 Jan, 2009

2 commits

383f01fbf GFS2: Banish struct gfs2_dinode_host ... Browse Code »

The final field in gfs2_dinode_host was the i_flags field. Thats
renamed to i_diskflags in order to avoid confusion with the existing
inode flags, and moved into the inode proper at a suitable location
to avoid creating a "hole".

At that point struct gfs2_dinode_host is no longer needed and as
promised (quite some time ago!) it can now be removed completely.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2009-01-05 15:38:59 +0800
b27605837 GFS2: Rationalise header files ... Browse Code »

Move the contents of some headers which contained very
little into more sensible places, and remove the original
header files. This should make it easier to find things.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2009-01-05 15:38:48 +0800

18 Sep, 2008

1 commit

719ee3446 GFS2: high time to take some time over atime ... Browse Code »

Until now, we've used the same scheme as GFS1 for atime. This has failed
since atime is a per vfsmnt flag, not a per fs flag and as such the
"noatime" flag was not getting passed down to the filesystems. This
patch removes all the "special casing" around atime updates and we
simply use the VFS's atime code.

The net result is that GFS2 will now support all the same atime related
mount options of any other filesystem on a per-vfsmnt basis. We do lose
the "lazy atime" updates, but we gain "relatime". We could add lazy
atime to the VFS at a later date, if there is a requirement for that
variant still - I suspect relatime will be enough.

Also we lose about 100 lines of code after this patch has been applied,
and I have a suspicion that it will speed things up a bit, even when
atime is "on". So it seems like a nice clean up as well.

From a user perspective, everything stays the same except the loss of
the per-fs atime quantum tweekable (ought to be per-vfsmnt at the very
least, and to be honest I don't think anybody ever used it) and that a
number of options which were ignored before now work correctly.

Please let me know if you've got any comments. I'm pushing this out
early so that you can all see what my plans are.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2008-09-18 20:53:59 +0800

27 Aug, 2008

1 commit

0188d6c58 GFS2: Fix & clean up GFS2 rename ... Browse Code »

This patch fixes a locking issue in the rename code by ensuring that we hold
the per sb rename lock over both directory and "other" renames which involve
different parent directories.

At the same time, this moved the (only called from one place) function
gfs2_ok_to_move into the file that its called from, so we can mark it
static. This should make a code a bit easier to follow.

Signed-off-by: Steven Whitehouse
Cc: Peter Staubach

Steven Whitehouse
2008-08-27 20:33:10 +0800

27 Jul, 2008

1 commit

a569c711f [PATCH] don't pass nameidata to gfs2_lookupi() ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2008-07-27 08:53:36 +0800

10 Jul, 2008

1 commit

a93a6ce24 [GFS2] Remove unused declaration ... Browse Code »

The implementation of gfs2_inode_attr_in is removed.
So remove its declaration.

Signed-off-by: Li Xiaodong
Signed-off-by: Steven Whitehouse

Li Xiaodong
2008-07-10 23:22:23 +0800

03 Jul, 2008

1 commit

f58ba8891 [GFS2] don't call permission() ... Browse Code »

GFS2 calls permission() to verify permissions after locks on the files
have been taken.

For this it's sufficient to call gfs2_permission() instead. This
results in the following changes:

- IS_RDONLY() check is not performed
- IS_IMMUTABLE() check is not performed
- devcgroup_inode_permission() is not called
- security_inode_permission() is not called

IS_RDONLY() should be unnecessary anyway, as the per-mount read-only
flag should provide protection against read-only remounts during
operations. do_gfs2_set_flags() has been fixed to perform
mnt_want_write()/mnt_drop_write() to protect against remounting
read-only.

IS_IMMUTABLE has been added to gfs2_permission()

Repeating the security checks seems to be pointless, as they don't
normally change, and if they do, it's independent of the filesystem
state.

Signed-off-by: Miklos Szeredi
Signed-off-by: Steven Whitehouse

Miklos Szeredi
2008-07-03 17:22:01 +0800

31 Mar, 2008

2 commits

77658aad2 [GFS2] Eliminate (almost) duplicate field from gfs2_inode ... Browse Code »

The blocks counter is almost a duplicate of the i_blocks
field in the VFS inode. The only difference is that i_blocks
can be only 32bits long for 32bit arch without large single file
support. Since GFS2 doesn't handle the non-large single file
case (for 32 bit anyway) this adds a new config dependency on
64BIT || LSF. This has always been the case, however we've never
explicitly said so before.

Even if we do add support for the non-LSF case, we will still
not require this field to be duplicated since we will not be
able to access oversized files anyway.

So the net result of all this is that we shave 8 bytes from a gfs2_inode
and get our config deps correct.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2008-03-31 17:40:55 +0800
ecc30c791 [GFS2] Streamline indirect pointer tree height calculation ... Browse Code »

This patch improves the calculation of the tree height in order to reduce
the number of operations which are carried out on each call to gfs2_block_map.
In the common case, we now make a single comparison, rather than calculating
the required tree height from scratch each time. Also in the case that the
tree does need some extra height, we start from the current height rather from
zero when we work out what the new height ought to be.

In addition the di_height field is moved into the inode proper and reduced
in size to a u8 since the value must be between 0 and GFS2_MAX_META_HEIGHT (10).

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2008-03-31 17:39:46 +0800

25 Jan, 2008

2 commits

5561093e2 [GFS2] Introduce gfs2_set_aops() ... Browse Code »

Just like ext3 we now have three sets of address space operations
to cover the cases of writeback, ordered and journalled data
writes. This means that the individual operations can now become
less complicated as we are able to remove some of the tests for
file data mode from the code.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2008-01-25 16:07:23 +0800
bf36a7131 [GFS2] Add gfs2_is_writeback() ... Browse Code »

This adds a function "gfs2_is_writeback()" along the lines of the
existing "gfs2_is_jdata()" in order to clean up the code and make
the various tests for the inode mode more obvious. It also fixes
the PageChecked() logic where we were resetting the flag too early
in the case of an error path.

Signed-off-by: Steven Whitehouse

Steven Whitehouse
2008-01-25 16:07:21 +0800