Doug / smarc-fsl-linux-kernel | Embedian Git Server

23 Dec, 2010

4 commits

e453039f8 ocfs2/cluster: Track process message timing stats for each socket ... Browse Code »

Tracks total time taken to process messages received on a socket.

Signed-off-by: Sunil Mushran
Signed-off-by: Joel Becker

Sunil Mushran
2010-12-23 10:38:10 +0800
3c193b380 ocfs2/cluster: Track send message timing stats for each socket ... Browse Code »

Tracks total send and status times for all messages sent on a socket.

Signed-off-by: Sunil Mushran
Signed-off-by: Joel Becker

Sunil Mushran
2010-12-23 10:38:09 +0800
ff1becbf8 ocfs2/cluster: Use ktime instead of timeval in struct o2net_sock_container ... Browse Code »

Replace time trackers in struct o2net_sock_container from struct timeval to
union ktime.

Signed-off-by: Sunil Mushran
Signed-off-by: Joel Becker

Sunil Mushran
2010-12-23 10:37:57 +0800
3f9c14fab ocfs2/cluster: Replace timeval with ktime in struct o2net_send_tracking ... Browse Code »

Replace time trackers in struct o2net_send_tracking from struct timeval to
union ktime.

Signed-off-by: Sunil Mushran
Signed-off-by: Joel Becker

Sunil Mushran
2010-12-23 10:34:49 +0800

21 Sep, 2010

1 commit

817f2c842 Fix various typos of valid in comments ... Browse Code »

Fix various typos of valid.

Signed-off-by: Nikanth Karthikesan
Signed-off-by: Jiri Kosina

Nikanth Karthikesan
2010-09-21 23:04:50 +0800

26 Jan, 2010

1 commit

2bd632165 ocfs2/trivial: Remove trailing whitespaces ... Browse Code »

Patch removes trailing whitespaces.

Signed-off-by: Sunil Mushran
Signed-off-by: Joel Becker

Sunil Mushran
2010-01-26 11:20:51 +0800

23 Aug, 2008

1 commit

18496e80f [PATCH] ocfs2/cluster/tcp.c: make some functions static ... Browse Code »

Commit 0f475b2abed6cbccee1da20a0bef2895eb2a0edd (ocfs2/net: Silence build
warnings) made sense as far as it fixed compile warnings, but it was not
required that it made the functions global.

Signed-off-by: Adrian Bunk
Signed-off-by: Mark Fasheh

Adrian Bunk
2008-08-23 01:56:40 +0800

31 May, 2008

1 commit

0f475b2ab [PATCH 3/3] ocfs2/net: Silence build warnings ... Browse Code »

This patch silences the build warnings concerning o2net_init_nst()
and friends when building without CONFIG_DEBUG_FS enabled.

Signed-off-by: Sunil Mushran
Signed-off-by: Mark Fasheh

Sunil Mushran
2008-05-31 06:15:12 +0800

18 Apr, 2008

2 commits

2309e9e04 ocfs2/net: Add debug interface to o2net ... Browse Code »

This patch exposes o2net information via debugfs. The information includes
the list of sockets (sock_containers) as well as the list of outstanding
messages (send_tracking). Useful for o2dlm debugging.

(This patch is derived from an earlier one written by Zach Brown that
exposed the same information via /proc.)

[Mark: checkpatch fixes]

Signed-off-by: Sunil Mushran
Reviewed-by: Joel Becker
Signed-off-by: Mark Fasheh

Sunil Mushran
2008-04-18 23:56:20 +0800
5cc3bf278 ocfs2: Reconnect after idle time out. ... Browse Code »

Currently, o2net connects to a node on hb_up and disconnects on
hb_down and net timeout.

It disconnects on net timeout is ok, but it should attempt to
reconnect back. This is because sometimes nodes get overloaded
enough that the network connection breaks but the disk hb does not.
And if we get into that situation, we either fence (unnecessarily)
or wait for its disk hb to die (and sometimes hang in the process).

So in this updated scheme, when the network disconnects, we keep
attempting to reconnect till we succeed or we get a disk hb down
event.

If the other node is really dead, then we will eventually get a
node down event. If not, we should be able to connect again and
continue.

Signed-off-by: Tao Ma
Signed-off-by: Mark Fasheh

Tao Ma
2008-04-18 23:56:10 +0800

07 Feb, 2008

1 commit

d24fbcda0 ocfs2: Negotiate locking protocol versions. ... Browse Code »

Currently, when ocfs2 nodes connect via TCP, they advertise their
compatibility level. If the versions do not match, two nodes cannot speak
to each other and they disconnect. As a result, this provides no forward or
backwards compatibility.

This patch implements a simple protocol negotiation at the dlm level by
introducing a major/minor version number scheme for entities that
communicate. Specifically, o2dlm has a major/minor version for interaction
with o2dlm on other nodes, and ocfs2 itself has a major/minor version for
interacting with the filesystem on other nodes.

This will allow rolling upgrades of ocfs2 clusters when changes to the
locking or network protocols can be done in a backwards compatible manner.
In those cases, only the minor number is changed and the negotatied protocol
minor is returned from dlm join. In the far less likely event that a
required protocol change makes backwards compatibility impossible, we simply
bump the major number.

Signed-off-by: Joel Becker
Signed-off-by: Mark Fasheh

Joel Becker
2008-02-07 08:11:29 +0800

26 Jan, 2008

2 commits

c934a92d0 ocfs2: Remove data locks ... Browse Code »

The meta lock now covers both meta data and data, so this just removes the
now-redundant data lock.

Combining locks saves us a round of lock mastery per inode and one less lock
to ping between nodes during read/write.

We don't lose much - since meta locks were always held before a data lock
(and at the same level) ordered writeout mode (the default) ensured that
flushing for the meta data lock also pushed out data anyways.

Signed-off-by: Mark Fasheh

Mark Fasheh
2008-01-26 06:45:57 +0800
34d024f84 ocfs2: Remove mount/unmount votes ... Browse Code »

The node maps that are set/unset by these votes are no longer relevant, thus
we can remove the mount and umount votes. Since those are the last two
remaining votes, we can also remove the entire vote infrastructure.

The vote thread has been renamed to the downconvert thread, and the small
amount of functionality related to managing it has been moved into
fs/ocfs2/dlmglue.c. All references to votes have been removed or updated.

Signed-off-by: Mark Fasheh

Mark Fasheh
2008-01-26 06:45:34 +0800

27 Apr, 2007

1 commit

500086300 ocfs2: Remove delete inode vote ... Browse Code »

Ocfs2 currently does cluster-wide node messaging to check the open state of
an inode during delete. This patch removes that mechanism in favor of an
inode cluster lock which is taken at shared read when an inode is first read
and dropped in clear_inode(). This allows a deleting node to test the
liveness of an inode by attempting to take an exclusive lock.

Signed-off-by: Tiger Yang
Signed-off-by: Mark Fasheh

Tiger Yang
2007-04-27 05:39:48 +0800

08 Feb, 2007

4 commits

925037bcb ocfs2: introduce sc->sc_send_lock to protect outbound outbound messages ... Browse Code »

When there is a lot of multithreaded I/O usage, two threads can collide
while sending out a message to the other nodes. This is due to the lack of
locking between threads while sending out the messages.

When a connected TCP send(), sendto(), or sendmsg() arrives in the Linux
kernel, it eventually comes through tcp_sendmsg(). tcp_sendmsg() protects
itself by acquiring a lock at invocation by calling lock_sock().
tcp_sendmsg() then loops over the buffers in the iovec, allocating
associated sk_buff's and cache pages for use in the actual send. As it does
so, it pushes the data out to tcp for actual transmission. However, if one
of those allocation fails (because a large number of large sends is being
processed, for example), it must wait for memory to become available. It
does so by jumping to wait_for_sndbuf or wait_for_memory, both of which
eventually cause a call to sk_stream_wait_memory(). sk_stream_wait_memory()
contains a code path that calls sk_wait_event(). Finally, sk_wait_event()
contains the call to release_sock().

The following patch adds a lock to the socket container in order to
properly serialize outbound requests.

From: Zhen Wei
Acked-by: Jeff Mahoney
Signed-off-by: Mark Fasheh

Zhen Wei
2007-02-08 04:15:11 +0800
1faf28945 ocfs2_dlm: disallow a domain join if node maps mismatch ... Browse Code »

There is a small window where a joining node may not see the node(s) that
just died but are still part of the domain. To fix this, we must disallow
join requests if the joining node has a different node map.

A new field node_map is added to dlm_query_join_request to send the current
nodes nodemap along with join request. On the receiving end the nodes that
are part of the cluster verifies if this new node sees all the nodes that
are still part of the cluster. They disallow the join if the maps mismatch.

Signed-off-by: Srinivas Eeda
Signed-off-by: Mark Fasheh

Srinivas Eeda
2007-02-08 04:09:14 +0800
d74c9803a ocfs2: Added post handler callable function in o2net message handler ... Browse Code »

Currently o2net allows one handler function per message type. This
patch adds the ability to call another function to be called after
the handler has returned the message to the other node.

Handlers are now given the option of returning a context (in the form of a
void **) which will be passed back into the post message handler function.

Signed-off-by: Kurt Hackel
Signed-off-by: Sunil Mushran
Signed-off-by: Mark Fasheh

Kurt Hackel
2007-02-08 04:06:56 +0800
ba2bf2185 ocfs2_dlm: fix cluster-wide refcounting of lock resources ... Browse Code »

This was previously broken and migration of some locks had to be temporarily
disabled. We use a new (and backward-incompatible) set of network messages
to account for all references to a lock resources held across the cluster.
once these are all freed, the master node may then free the lock resource
memory once its local references are dropped.

Signed-off-by: Kurt Hackel
Signed-off-by: Mark Fasheh

Kurt Hackel
2007-02-08 03:53:07 +0800

12 Dec, 2006

1 commit

828ae6afb [patch 3/3] OCFS2 Configurable timeouts - Protocol changes ... Browse Code »

Modify the OCFS2 handshake to ensure essential timeouts are configured
identically on all nodes.

Only allow changes when there are no connected peers

Improves the logic in o2net_advance_rx() which broke now that
sizeof(struct o2net_handshake) is greater than sizeof(struct o2net_msg)

Included is the field for userspace-heartbeat timeout to avoid the need for
further protocol changes.

Uses a global spinlock to ensure the decisions to update configfs entries
are made on the correct value. The region covered by the spinlock when
incrementing the counter is much larger as this is the more critical case.

Small cleanup contributed by Adrian Bunk

Signed-off-by: Andrew Beekhof
Signed-off-by: Mark Fasheh

Andrew Beekhof
2006-12-12 06:26:44 +0800

08 Dec, 2006

1 commit

b5dd80304 [patch 2/3] OCFS2 Configurable timeouts ... Browse Code »

Allow configuration of OCFS2 timeouts from userspace via configfs

Signed-off-by: Andrew Beekhof
Signed-off-by: Mark Fasheh

Jeff Mahoney
2006-12-08 10:13:20 +0800

22 Nov, 2006

1 commit

c4028958b WorkStruct: make allyesconfig ... Browse Code »

Fix up for make allyesconfig.

Signed-Off-By: David Howells

David Howells
2006-11-22 22:57:56 +0800

25 Sep, 2006

2 commits

24c19ef40 ocfs2: Remove i_generation from inode lock names ... Browse Code »

OCFS2 puts inode meta data in the "lock value block" provided by the DLM.
Typically, i_generation is encoded in the lock name so that a deleted inode
on and a new one in the same block don't share the same lvb.

Unfortunately, that scheme means that the read in ocfs2_read_locked_inode()
is potentially thrown away as soon as the meta data lock is taken - we
cannot encode the lock name without first knowing i_generation, which
requires a disk read.

This patch encodes i_generation in the inode meta data lvb, and removes the
value from the inode meta data lock name. This way, the read can be covered
by a lock, and at the same time we can distinguish between an up to date and
a stale LVB.

This will help cold-cache stat(2) performance in particular.

Since this patch changes the protocol version, we take the opportunity to do
a minor re-organization of two of the LVB fields.

Signed-off-by: Mark Fasheh

Mark Fasheh
2006-09-25 04:50:46 +0800
379dfe9d0 ocfs2: Hook rest of the file system into dentry locking API ... Browse Code »

Actually replace the vote calls with the new dentry operations. Make any
necessary adjustments to get the scheme to work.

Signed-off-by: Mark Fasheh

Mark Fasheh
2006-09-25 04:50:43 +0800

04 Jan, 2006

1 commit

98211489d [PATCH] OCFS2: The Second Oracle Cluster Filesystem ... Browse Code »

Node messaging via tcp. Used by the dlm and the file system for point
to point communication between nodes.

Signed-off-by: Mark Fasheh
Signed-off-by: Kurt Hackel

Zach Brown
2006-01-04 03:45:46 +0800