Eric Lee / smarc-fsl-linux-kernel

26 Jan, 2008

5 commits

cf8e06f1a [PATCH 1/2] ocfs2: add flock lock type ... Browse Code »

This adds a new dlmglue lock type which is intended to back flock()
requests.

Since these locks are driven from userspace, usage rules are much more
liberal than the typical Ocfs2 internal cluster lock. As a result, we can't
make use of most dlmglue features - lock caching and lock level
optimizations in particular. Additionally, userspace is free to deadlock
itself, so we have to deal with that in the same way as the rest of the
kernel - by allowing a signal to abort a lock request.

In order to keep ocfs2_cluster_lock() complexity down, ocfs2_file_lock()
does it's own dlm coordination. We still use the same helper functions
though, so duplicated code is kept to a minimum.

Signed-off-by: Mark Fasheh

Mark Fasheh
2008-01-26 07:05:43 +0800
e63aecb65 ocfs2: Rename ocfs2_meta_[un]lock ... Browse Code »

Call this the "inode_lock" now, since it covers both data and meta data.
This patch makes no functional changes.

Signed-off-by: Mark Fasheh

Mark Fasheh
2008-01-26 06:46:01 +0800
c934a92d0 ocfs2: Remove data locks ... Browse Code »

The meta lock now covers both meta data and data, so this just removes the
now-redundant data lock.

Combining locks saves us a round of lock mastery per inode and one less lock
to ping between nodes during read/write.

We don't lose much - since meta locks were always held before a data lock
(and at the same level) ordered writeout mode (the default) ensured that
flushing for the meta data lock also pushed out data anyways.

Signed-off-by: Mark Fasheh

Mark Fasheh
2008-01-26 06:45:57 +0800
f1f540688 ocfs2: Add data downconvert worker to inode lock ... Browse Code »

In order to extend inode lock coverage to inode data, we use the same data
downconvert worker with only a small modification to only do work for
regular files.

Signed-off-by: Mark Fasheh

Mark Fasheh
2008-01-26 06:45:54 +0800
34d024f84 ocfs2: Remove mount/unmount votes ... Browse Code »

The node maps that are set/unset by these votes are no longer relevant, thus
we can remove the mount and umount votes. Since those are the last two
remaining votes, we can also remove the entire vote infrastructure.

The vote thread has been renamed to the downconvert thread, and the small
amount of functionality related to managing it has been moved into
fs/ocfs2/dlmglue.c. All references to votes have been removed or updated.

Signed-off-by: Mark Fasheh

Mark Fasheh
2008-01-26 06:45:34 +0800

07 Nov, 2007

2 commits

019d1b224 ocfs2: Create locks at initially requested level ... Browse Code »

If we have not yet created a cluster lock, ocfs2_cluster_lock() will
first create it at NLMODE, and then convert the lock to either PRMODE or
EXMODE (whichever is requested).

Change ocfs2_cluster_lock() to just create the lock at the initially
requested level. ocfs2_locking_ast() handles this case fine, so the only
update required was in setup of locking state. This should reduce the number
of network messages required for a new lock by one, providing an incremental
performance enhancement.

Signed-off-by: Mark Fasheh

Mark Fasheh
2007-11-07 07:31:45 +0800
3cf0c507d [PATCH] Fix priority mistakes in fs/ocfs2/{alloc.c, dlmglue.c} ... Browse Code »

Fixes priority mistakes similar to '!x & y'

Signed-off-by: Roel Kluin
Signed-off-by: Mark Fasheh

Roel Kluin
2007-11-07 07:31:39 +0800

13 Oct, 2007

1 commit

15b1e36bd ocfs2: Structure updates for inline data ... Browse Code »

Add the disk, network and memory structures needed to support data in inode.

Struct ocfs2_inline_data is defined and embedded in ocfs2_dinode for storing
inline data.

A new inode field, i_dyn_features, is added to facilitate tracking of
dynamic inode state. Since it will be used often, we want to mirror it on
ocfs2_inode_info, and transfer it via the meta data lvb.

Signed-off-by: Mark Fasheh
Reviewed-by: Joel Becker

Mark Fasheh
2007-10-13 02:54:39 +0800

11 Jul, 2007

1 commit

800deef3f [PATCH] ocfs2: use list_for_each_entry where benefical ... Browse Code »

Signed-off-by: Christoph Hellwig
Signed-off-by: Mark Fasheh

Christoph Hellwig
2007-07-11 08:19:49 +0800

09 May, 2007

1 commit

e63340ae6 header cleaning: don't include smp_lock.h when not used ... Browse Code »

Remove includes of where it is not used/needed.
Suggested by Al Viro.

Builds cleanly on x86_64, i386, alpha, ia64, powerpc, sparc,
sparc64, and arm (all 59 defconfigs).

Signed-off-by: Randy Dunlap
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Randy Dunlap
2007-05-09 02:15:07 +0800

03 May, 2007

1 commit

6cb129f56 [PATCH] fs/ocfs2/: make 3 functions static ... Browse Code »

This patch makes the following needlessly global functions static:
- aops.c: ocfs2_write_data_page()
- dlmglue.c: ocfs2_dump_meta_lvb_info()
- file.c: ocfs2_set_inode_size()

Signed-off-by: Adrian Bunk
Signed-off-by: Andrew Morton
Signed-off-by: Mark Fasheh

Adrian Bunk
2007-05-03 06:07:27 +0800

27 Apr, 2007

5 commits

834189788 ocfs2: Cache extent records ... Browse Code »

The extent map code was ripped out earlier because of an inability to deal
with holes. This patch adds back a simpler caching scheme requiring far less
code.

Our old extent map caching was designed back when meta data block caching in
Ocfs2 didn't work very well, resulting in many disk reads. These days our
metadata caching is much better, resulting in no un-necessary disk reads. As
a result, extent caching doesn't have to be as fancy, nor does it have to
cache as many extents. Keeping the last 3 extents seen should be sufficient
to give us a small performance boost on some streaming workloads.

Signed-off-by: Mark Fasheh

Mark Fasheh
2007-04-27 06:10:40 +0800
8110b073a ocfs2: Fix up i_blocks calculation to know about holes ... Browse Code »

Older file systems which didn't support holes did a dumb calculation of
i_blocks based on i_size. This is no longer accurate, so fix things up to
take actual allocation into account.

Signed-off-by: Mark Fasheh

Mark Fasheh
2007-04-27 06:07:40 +0800
363041a5f ocfs2: temporarily remove extent map caching ... Browse Code »

The code in extent_map.c is not prepared to deal with a subtree being
rotated between lookups. This can happen when filling holes in sparse files.
Instead of a lengthy patch to update the code (which would likely lose the
benefit of caching subtree roots), we remove most of the algorithms and
implement a simple path based lookup. A less ambitious extent caching scheme
will be added in a later patch.

Signed-off-by: Mark Fasheh

Mark Fasheh
2007-04-27 06:01:31 +0800
500086300 ocfs2: Remove delete inode vote ... Browse Code »

Ocfs2 currently does cluster-wide node messaging to check the open state of
an inode during delete. This patch removes that mechanism in favor of an
inode cluster lock which is taken at shared read when an inode is first read
and dropped in clear_inode(). This allows a deleting node to test the
liveness of an inode by attempting to take an exclusive lock.

Signed-off-by: Tiger Yang
Signed-off-by: Mark Fasheh

Tiger Yang
2007-04-27 05:39:48 +0800
be9e986b8 ocfs2: Local mounts should skip inode updates ... Browse Code »

We don't want the extent map and uptodate cache destruction in
ocfs2_meta_lock_update() on a local mount, so skip that.

This fixes several bugs with uptodate being cleared on buffers and extent
maps being corrupted.

Signed-off-by: Mark Fasheh

Mark Fasheh
2007-04-27 04:35:21 +0800

29 Dec, 2006

1 commit

7f4a2a97e ocfs2: always unmap in ocfs2_data_convert_worker() ... Browse Code »

Mmap-heavy clustered workloads were sometimes finding stale data on mmap
reads. The solution is to call unmap_mapping_range() on any down convert of
a data lock.

Signed-off-by: Mark Fasheh

Mark Fasheh
2006-12-29 08:38:59 +0800

08 Dec, 2006

1 commit

c271c5c22 ocfs2: local mounts ... Browse Code »

This allows users to format an ocfs2 file system with a special flag,
OCFS2_FEATURE_INCOMPAT_LOCAL_MOUNT. When the file system sees this flag, it
will not use any cluster services, nor will it require a cluster
configuration, thus acting like a 'local' file system.

Signed-off-by: Sunil Mushran
Signed-off-by: Mark Fasheh

Sunil Mushran
2006-12-08 09:37:53 +0800

02 Dec, 2006

4 commits

7f1a37e31 ocfs2: core atime update functions ... Browse Code »

This patch adds the core routines for updating atime in ocfs2.

Signed-off-by: Tiger Yang
Signed-off-by: Mark Fasheh

Tiger Yang
2006-12-02 10:28:51 +0800
4bcec1847 ocfs2: remove unused handle argument from ocfs2_meta_lock_full() ... Browse Code »

Now that this is unused and all callers pass NULL, we can safely remove it.

Signed-off-by: Mark Fasheh

Mark Fasheh
2006-12-02 10:28:05 +0800
daf29e9cd ocfs2: remove unused ocfs2_handle_add_lock() ... Browse Code »

This gets us rid of a slab we no longer need, as well as removing the
majority of what's left on ocfs2_journal_handle.

ocfs2_commit_unstarted_handle() has no more real work to do, so remove that
function too.

Signed-off-by: Mark Fasheh

Mark Fasheh
2006-12-02 10:27:58 +0800
da66116ee [2.6 patch] make ocfs2_create_new_lock() static ... Browse Code »

This patch makes the needlessly global ocfs2_create_new_lock() static.

Signed-off-by: Adrian Bunk
Signed-off-by: Mark Fasheh

Adrian Bunk
2006-12-02 10:26:50 +0800

27 Sep, 2006

1 commit

8e18e2941 [PATCH] inode_diet: Replace inode.u.generic_ip with inode.i_private ... Browse Code »

The following patches reduce the size of the VFS inode structure by 28 bytes
on a UP x86. (It would be more on an x86_64 system). This is a 10% reduction
in the inode size on a UP kernel that is configured in a production mode
(i.e., with no spinlock or other debugging functions enabled; if you want to
save memory taken up by in-core inodes, the first thing you should do is
disable the debugging options; they are responsible for a huge amount of bloat
in the VFS inode structure).

This patch:

The filesystem or device-specific pointer in the inode is inside a union,
which is pretty pointless given that all 30+ users of this field have been
using the void pointer. Get rid of the union and rename it to i_private, with
a comment to explain who is allowed to use the void pointer. This is just a
cleanup, but it allows us to reuse the union 'u' for something something where
the union will actually be used.

[judith@osdl.org: powerpc build fix]
Signed-off-by: "Theodore Ts'o"
Signed-off-by: Judith Lebzelter
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Theodore Ts'o
2006-09-27 23:26:17 +0800

25 Sep, 2006

17 commits

0d5dc6c2d ocfs2: Teach ocfs2_drop_lock() to use ->set_lvb() callback ... Browse Code »

With this, we don't need to pass an additional struct with function pointer.

Now that the callbacks are fully used, comment the remaining API.

Signed-off-by: Mark Fasheh

Mark Fasheh
2006-09-25 04:50:48 +0800
b5e500e23 ocfs2: Remove ->unblock lockres operation ... Browse Code »

Have ocfs2_process_blocked_lock() call ocfs2_generic_unblock_lock(), which
gets to be ocfs2_unblock_lock() now that it's the only possible unblock
function.

Remove the ->unblock() callback from the structure, and all lock type
specific unblock functions.

Signed-off-by: Mark Fasheh

Mark Fasheh
2006-09-25 04:50:48 +0800
cc567d89b ocfs2: move downconvert worker to lockres ops ... Browse Code »

This way lock types don't have to manually pass it to
ocfs2_generic_unblock_lock().

Signed-off-by: Mark Fasheh

Mark Fasheh
2006-09-25 04:50:48 +0800
08280f11d ocfs2: Remove unused dlmglue functions ... Browse Code »

The meta data unblocking code no longer needs ocfs2_do_unblock_meta() or
ocfs2_can_downconvert_meta_lock(), so remove them.

Signed-off-by: Mark Fasheh

Mark Fasheh
2006-09-25 04:50:48 +0800
810d5aeba ocfs2: Have the metadata lock use generic dlmglue functions ... Browse Code »

Fill in the ->check_downconvert and ->set_lvb callbacks with meta data
specific operations and switch ocfs2_unblock_meta() to call
ocfs2_generic_unblock_lock()

Signed-off-by: Mark Fasheh

Mark Fasheh
2006-09-25 04:50:47 +0800
5ef0d4ea0 ocfs2: Add ->set_lvb callback in dlmglue ... Browse Code »

This allows a lock type to set the value block before downconvert.

Signed-off-by: Mark Fasheh

Mark Fasheh
2006-09-25 04:50:47 +0800
16d5b9567 ocfs2: Add ->check_downconvert callback in dlmglue ... Browse Code »

This will allow lock types to force a requeue of a lock downconvert.

Signed-off-by: Mark Fasheh

Mark Fasheh
2006-09-25 04:50:47 +0800
f7fbfdd1f ocfs2: Check for refreshing locks in generic unblock function ... Browse Code »

Tidy up the exit path a bit too.

Signed-off-by: Mark Fasheh

Mark Fasheh
2006-09-25 04:50:47 +0800
b80fc012e ocfs2: don't unconditionally pass LVB flags ... Browse Code »

Allow a lock type to specifiy whether it makes use of the LVB. The only type
which does this right now is the meta data lock. This should save us some
space on network messages since they won't have to needlessly transmit value
blocks.

Signed-off-by: Mark Fasheh

Mark Fasheh
2006-09-25 04:50:47 +0800
aa2623ad8 ocfs2: combine inode and generic blocking AST functions ... Browse Code »

There is extremely little difference between the two now. We can remove the
callback from ocfs2_lock_res_ops as well.

Signed-off-by: Mark Fasheh

Mark Fasheh
2006-09-25 04:50:46 +0800
54a7e7552 ocfs2: Add ->get_osb() dlmglue locking operation ... Browse Code »

Will be used to find the ocfs2_super structure from a given lockres.

Signed-off-by: Mark Fasheh

Mark Fasheh
2006-09-25 04:50:46 +0800
2a45f2d13 ocfs2: remove ->unlock_ast() callback from ocfs2_lock_res_ops ... Browse Code »

This was always defined to the same function in all locks, so clean things
up by removing and passing ocfs2_unlock_ast() directly to the DLM.

Signed-off-by: Mark Fasheh

Mark Fasheh
2006-09-25 04:50:46 +0800
e92d57df2 ocfs2: combine inode and generic AST functions ... Browse Code »

There is extremely little difference between the two now. We can remove the
callback from ocfs2_lock_res_ops as well.

Signed-off-by: Mark Fasheh

Mark Fasheh
2006-09-25 04:50:46 +0800
f625c9793 ocfs2: Clean up lock resource refresh flags ... Browse Code »

Use of the refresh mechanism is lock-type wide, so move knowledge of that to
the ocfs2_lock_res_ops structure.

Signed-off-by: Mark Fasheh

Mark Fasheh
2006-09-25 04:50:46 +0800
24c19ef40 ocfs2: Remove i_generation from inode lock names ... Browse Code »

OCFS2 puts inode meta data in the "lock value block" provided by the DLM.
Typically, i_generation is encoded in the lock name so that a deleted inode
on and a new one in the same block don't share the same lvb.

Unfortunately, that scheme means that the read in ocfs2_read_locked_inode()
is potentially thrown away as soon as the meta data lock is taken - we
cannot encode the lock name without first knowing i_generation, which
requires a disk read.

This patch encodes i_generation in the inode meta data lvb, and removes the
value from the inode meta data lock name. This way, the read can be covered
by a lock, and at the same time we can distinguish between an up to date and
a stale LVB.

This will help cold-cache stat(2) performance in particular.

Since this patch changes the protocol version, we take the opportunity to do
a minor re-organization of two of the LVB fields.

Signed-off-by: Mark Fasheh

Mark Fasheh
2006-09-25 04:50:46 +0800
f9e2d82e6 ocfs2: Encode i_generation in the meta data lvb ... Browse Code »

When i_generation is removed from the lockname, this will help us determine
whether a meta data lvb has information that is in sync with the local
struct inode.

Signed-off-by: Mark Fasheh

Mark Fasheh
2006-09-25 04:50:45 +0800
4d3b83f73 ocfs2: Free up some space in the lvb ... Browse Code »

lvb_version doesn't need to be a whole 32 bits. Make it an 8 bit field to
free up some space. This should be backwards compatible until we use one of
the fields, in which case we'd bump the lvb version anyway.

Signed-off-by: Mark Fasheh

Mark Fasheh
2006-09-25 04:50:45 +0800