Eric Lee / smarc-fsl-linux-kernel

16 May, 2009

1 commit

748285ccf dlm: use more NOFS allocation ... Browse Code »

Change some GFP_KERNEL allocations to use either GFP_NOFS or
ls_allocation (when available) which the fs sets to GFP_NOFS.
The point is to prevent allocations from going back into the
cluster fs in places where that might lead to deadlock.

Signed-off-by: David Teigland

David Teigland
2009-05-16 00:24:59 +0800

15 May, 2009

1 commit

391fbdc5d dlm: connect to nodes earlier ... Browse Code »

Make network connections to other nodes earlier, in the context of
dlm_recoverd. This avoids connecting to nodes from dlm_send where we
try to avoid allocations which could possibly deadlock if memory reclaim
goes into the cluster fs which may try to do a dlm operation.

Signed-off-by: Christine Caulfield
Signed-off-by: David Teigland

Christine Caulfield
2009-05-15 22:34:12 +0800

12 Mar, 2009

1 commit

5e9ccc372 dlm: replace idr with hash table for connections ... Browse Code »

Integer nodeids can be too large for the idr code; use a hash
table instead.

Signed-off-by: Christine Caulfield
Signed-off-by: David Teigland

Christine Caulfield
2009-03-12 01:20:58 +0800

29 Jan, 2009

2 commits

2cf12c0bf dlm: comment typo fixes ... Browse Code »

Signed-off-by: Joe Perches
Signed-off-by: David Teigland

Joe Perches
2009-01-29 02:56:07 +0800
44ad532b3 dlm: use ipv6_addr_copy ... Browse Code »

Signed-off-by: Joe Perches
Signed-off-by: David Teigland

Joe Perches
2009-01-29 02:56:02 +0800

24 Dec, 2008

2 commits

1521848cb dlm: remove kmap/kunmap ... Browse Code »

The pages used in lowcomms are not highmem, so kmap is not necessary.

Cc: Christine Caulfield
Signed-off-by: Steven Whitehouse
Signed-off-by: David Teigland

Steven Whitehouse
2008-12-24 00:16:01 +0800
d6d7b702a dlm: fix up memory allocation flags ... Browse Code »

Use ls_allocation for memory allocations, which a cluster fs sets to
GFP_NOFS. Use GFP_NOFS for allocations when no lockspace struct is
available. Taking dlm locks needs to avoid calling back into the
cluster fs because write-out can require taking dlm locks.

Cc: Christine Caulfield
Signed-off-by: Steven Whitehouse
Signed-off-by: David Teigland

Steven Whitehouse
2008-12-24 00:15:40 +0800

15 Jul, 2008

1 commit

311f6fc77 dlm: release socket on error ... Browse Code »

It seems that `sock' allocated by sock_create_kern in
tcp_connect_to_sock() of dlm/fs/lowcomms.c is not released if
dlm_nodeid_to_addr an error.

Acked-by: Christine Caulfield
Signed-off-by: Masatake YAMATO
Signed-off-by: David Teigland

Masatake YAMATO
2008-07-15 02:56:59 +0800

20 May, 2008

2 commits

0035a4b14 dlm: tcp_connect_to_sock should check for -EINVAL, not EINVAL ... Browse Code »

Signed-off-by: Marcin Slusarz
Cc: Christine Caulfield
Cc: David Teigland
Cc: cluster-devel@redhat.com
Signed-off-by: David Teigland

Marcin Slusarz
2008-05-20 04:37:27 +0800
7a936ce71 dlm: convert connections_lock in a mutex ... Browse Code »

The semaphore connections_lock is used as a mutex. Convert it to the mutex
API.

Signed-off-by: Matthias Kaehlcke
Cc: Christine Caulfield
Cc: David Teigland
Cc: Steven Whitehouse
Signed-off-by: Andrew Morton
Signed-off-by: David Teigland

Matthias Kaehlcke
2008-05-20 04:37:27 +0800

30 Jan, 2008

2 commits

39bd4177d dlm: close othercons ... Browse Code »

This patch addresses a problem introduced with the last round of
lowcomms patches where the 'othercon' connections do not get freed when
the DLM shuts down.

This results in the error message
"slab error in kmem_cache_destroy(): cache `dlm_conn': Can't free all
objects"

and the DLM cannot be restarted without a system reboot.

See bz#428119

Signed-off-by: Patrick Caulfield
Signed-off-by: Fabio M. Di Nitto
Signed-off-by: David Teigland

Patrick Caulfeld
2008-01-30 07:17:32 +0800
6bd8fedaa dlm: bind connections from known local address when using TCP ... Browse Code »

A common problem occurs when multiple IP addresses within the same
subnet are assigned to the same NIC. If we make a connection attempt to
another address on the same subnet as one of those addresses, the
connection attempt will not necessarily be routed from the address we
want.

In the case of the DLM, the other nodes will quickly drop the connection
attempt, causing problems.

This patch makes the DLM bind to the local address it acquired from the
cluster manager when using TCP prior to making a connection, obviating
the need for administrators to "fix" their systems or use clever routing
tricks.

Signed-off-by: Lon Hohberger
Signed-off-by: Patrick Caulfield
Signed-off-by: David Teigland

Lon Hohberger
2008-01-30 06:44:25 +0800

07 Nov, 2007

1 commit

df61c9526 [DLM] lowcomms: Do not muck with sysctl_rmem_max. ... Browse Code »

Use SO_RCVBUFFORCE instead.

Signed-off-by: David S. Miller

David S. Miller
2007-11-07 20:11:42 +0800

10 Oct, 2007

2 commits

d66f8277f [DLM] Make dlm_sendd cond_resched more ... Browse Code »

Under high recovery loads dlm_sendd can monopolise the CPU and cause soft lockups.

This one extra and one moved cond_resched() make it yield a little more during
such times keeping work moving.

Signed-Off-By: Patrick Caulfield
Signed-off-by: Steven Whitehouse

Patrick Caulfield
2007-10-10 15:56:19 +0800
61d96be0f [DLM] Fix lowcomms socket closing ... Browse Code »

This patch fixes the slight mess made in lowcomms closing by previous patches
and fixes all sorts of DLM hangs.

Signed-Off-By: Patrick Caulfield
Signed-off-by: Steven Whitehouse

Patrick Caulfield
2007-10-10 15:55:39 +0800

14 Aug, 2007

3 commits

9e5f2825a [DLM] More othercon fixes ... Browse Code »

The last patch to clean out 'othercon' structures only fixed half the problem.
The attached addresses the other situations too, and fixes bz#238490

Signed-Off-By: Patrick Caulfield
Signed-off-by: Steven Whitehouse

Patrick Caulfield
2007-08-14 17:30:36 +0800
01c8cab25 [DLM] zero unused parts of sockaddr_storage ... Browse Code »

When we build a sockaddr_storage for an IP address, clear the unused parts as
they could be used for node comparisons.

I have seen this occasionally make sctp connections fail.

Signed-Off-By: Patrick Caulfield
Signed-off-by: Steven Whitehouse

Patrick Caulfield
2007-08-14 17:29:27 +0800
25720c2d7 [DLM] Clear othercon pointers when a connection is closed ... Browse Code »

This patch clears the othercon pointer and frees the memory when a connnection
is closed. This could cause a small memory leak when nodes leave the cluster.

Signed-Off-By: Patrick Caulfield
Signed-off-by: Steven Whitehouse

Patrick Caulfield
2007-08-14 17:28:05 +0800

20 Jul, 2007

1 commit

20c2df83d mm: Remove slab destructors from kmem_cache_create(). ... Browse Code »

Slab destructors were no longer supported after Christoph's
c59def9f222d44bb7e2f0a559f2906191a0862d7 change. They've been
BUGs for both slab and slub, and slob never supported them
either.

This rips out support for the dtor pointer from kmem_cache_create()
completely and fixes up every single callsite in the kernel (there were
about 224, not including the slab allocator definitions themselves,
or the documentation references).

Signed-off-by: Paul Mundt

Paul Mundt
2007-07-20 09:11:58 +0800

09 Jul, 2007

3 commits

f4fadb23c [GFS2] git-gfs2-nmw-build-fix ... Browse Code »

Cc: Steven Whitehouse
Signed-off-by: Andrew Morton
Signed-off-by: Steven Whitehouse

akpm@linux-foundation.org
2007-07-09 15:24:06 +0800
97d848365 [DLM] Telnet to port 21064 can stop all lockspaces ... Browse Code »

This patch fixes Red Hat bz#245892

Opening a tcp connection from a cluster member to another cluster member
targeting the dlm port it is enough to stop every dlm operation in the cluster.
This means that GFS and rgmanager will hang.

Signed-Off-By: Patrick Caulfield
Signed-off-by: Steven Whitehouse

Patrick Caulfield
2007-07-09 15:23:57 +0800
afb853fb4 [DLM] fix socket shutdown ... Browse Code »

This patch clears the user_data of active sockets as part of cleanup.
This prevents any late-arriving data from trying to add jobs to the work
queue while we are tidying up.

Signed-Off-By: Patrick Caulfield
Signed-Off-By: David Teigland
Signed-off-by: Steven Whitehouse

Patrick Caulfield
2007-07-09 15:23:05 +0800

01 May, 2007

4 commits

617e82e10 [DLM] lowcomms style ... Browse Code »

Replace some printk with log_print, and fix some simple cases of lines
over 80. Also, return -ENOTCONN if lowcomms_start fails due to no local
IP address being available.

Signed-off-by: David Teigland
Signed-off-by: Steven Whitehouse

David Teigland
2007-05-01 16:11:51 +0800
30d3a2373 [DLM] Lowcomms nodeid range & initialisation fixes ... Browse Code »

Fix a few range & initialization bugs in lowcomms.
- max_nodeid is really the highest nodeid encountered, so all loops must include
it in their iterations.
- clean dlm_local_count & connection_idr so we can do a clean restart.
- Remove a spurious BUG_ON

Signed-Off-By: Patrick Caulfield
Signed-off-by: Steven Whitehouse

Patrick Caulfield
2007-05-01 16:11:41 +0800
2439fe507 [DLM] Fix dlm_lowcoms_stop hang ... Browse Code »

When you attempt to release a lockspace in DLM, it will hang trying to down a
semaphore that has already been downed. The attached patch fixes the problem.

Signed-off-by: Josef Bacik
Signed-off-by: Steven Whitehouse
Cc: Patrick Caulfield

Josef Bacik
2007-05-01 16:11:38 +0800
6ed7257b4 [DLM] Consolidate transport protocols ... Browse Code »

This patch consolidates the TCP & SCTP protocols for the DLM into a single file
and makes it switchable at run-time (well, at least before the DLM actually
starts up!)

For RHEL5 this patch requires Neil Horman's patch that expands the in-kernel
socket API but that has already been twice ACKed so it should be OK.

The patch adds a new lowcomms.c file that replaces the existing lowcomms-sctp.c
& lowcomms-tcp.c files.

Signed-off-By: Patrick Caulfield
Signed-off-by: Steven Whitehouse

Patrick Caulfield
2007-05-01 16:11:23 +0800

30 Nov, 2006

1 commit

fdda387f7 [DLM] Add support for tcp communications ... Browse Code »

The following patch adds a TCP based communications layer
to the DLM which is compile time selectable. The existing SCTP
layer gives the advantage of allowing multihoming, whereas
the TCP layer has been heavily tested in previous versions of
the DLM and is known to be robust and therefore can be used as
a baseline for performance testing.

Signed-off-by: Patrick Caulfield
Signed-off-by: Steven Whitehouse

Patrick Caulfield
2006-11-30 23:35:00 +0800

20 Oct, 2006

1 commit

42fb00838 [DLM] fix iovec length in recvmsg ... Browse Code »

I didn't spot that the msg_iovlen was set to 2 if there
were two elements in the iovec but left at zero if not :(

I think this might be why bob was still seeing trouble.

Signed-Off-By: Patrick Caulfield
Signed-off-by: Steven Whitehouse

Patrick Caulfield
2006-10-20 21:13:10 +0800

13 Oct, 2006

1 commit

4c5e1b1a8 [DLM] fix iovec length in recvmsg ... Browse Code »

The DLM always passes the iovec length as 1, this is wrong when the circular
buffer wraps round.

Signed-Off-By: Patrick Caulfield
Signed-off-by: Steven Whitehouse

Patrick Caulfield
2006-10-13 05:11:33 +0800

10 Oct, 2006

1 commit

38d6fd26e [PATCH] dlm gfp_t annotations ... Browse Code »

Signed-off-by: Al Viro
Signed-off-by: Linus Torvalds

Al Viro
2006-10-10 05:19:08 +0800

11 Aug, 2006

1 commit

fcc8abc8d [DLM] move kmap to after spin_unlock ... Browse Code »

Doing the kmap() while holding the spinlock was causing recursive spinlock
problems. It seems the kmap was scheduling, although there was no warning
as I'd expect. Patrick, do we need locking around the kmap?

Signed-off-by: David Teigland
Signed-off-by: Steven Whitehouse

David Teigland
2006-08-11 21:44:00 +0800

19 Jun, 2006

1 commit

7d5513d58 [DLM] init rwsem earlier ... Browse Code »

The nodeinfo_lock rwsem needs to be initialized when the module is loaded
instead of when the dlm is first used.

Signed-off-by: David Teigland
Signed-off-by: Steven Whitehouse

David Teigland
2006-06-19 21:15:38 +0800

26 May, 2006

1 commit

47c96298c [GFS2] Change name due to local_nodeid being a macro ... Browse Code »

Change names of local_nodeid to dlm_local_nodeid to prevent a
namespace collision. Changed other local variable to match.

Cc: David Teigland
Signed-off-by: Steven Whitehouse

Steven Whitehouse
2006-05-26 05:43:14 +0800

28 Apr, 2006

1 commit

1c032c031 [DLM] PATCH 2/3 dlm: lowcomms close ... Browse Code »

When a node is removed from a lockspace configuration, close our
connection to it, clearing any remaining messages for it.

Signed-off-by: David Teigland
Signed-off-by: Patrick Caulfield
Signed-off-by: Steven Whitehouse

David Teigland
2006-04-28 22:50:41 +0800

18 Jan, 2006

1 commit

e7fd41792 [DLM] The core of the DLM for GFS2/CLVM ... Browse Code »

This is the core of the distributed lock manager which is required
to use GFS2 as a cluster filesystem. It is also used by CLVM and
can be used as a standalone lock manager independantly of either
of these two projects.

It implements VAX-style locking modes.

Signed-off-by: David Teigland
Signed-off-by: Steve Whitehouse

David Teigland
2006-01-18 17:30:29 +0800