Eric Lee / smarc-fsl-linux-kernel

23 Nov, 2011

1 commit

4e3fd7a06 net: remove ipv6_addr_copy() ... Browse Code »

C assignment can handle struct in6_addr copying.

Signed-off-by: Alexey Dobriyan
Signed-off-by: David S. Miller

Alexey Dobriyan
2011-11-23 05:43:32 +0800

07 Jul, 2011

1 commit

bcaadf5c1 dlm: dump address of unknown node ... Browse Code »

When the dlm fails to make a network connection to another
node, include the address of the node in the error message.

Signed-off-by: Masatake YAMATO
Signed-off-by: David Teigland

Masatake YAMATO
2011-07-07 05:37:23 +0800

31 Mar, 2011

1 commit

25985edce Fix common misspellings ... Browse Code »

Fixes generated by 'codespell' and manually reviewed.

Signed-off-by: Lucas De Marchi

Lucas De Marchi
2011-03-31 22:26:23 +0800

11 Mar, 2011

1 commit

e43f055a9 dlm: use alloc_workqueue function ... Browse Code »

Replaces deprecated create_singlethread_workqueue().

Signed-off-by: David Teigland

David Teigland
2011-03-11 03:22:34 +0800

12 Feb, 2011

1 commit

6b155c8fd dlm: use single thread workqueues ... Browse Code »

The recent commit to use cmwq for send and recv threads
dcce240ead802d42b1e45ad2fcb2ed4a399cb255 introduced problems,
apparently due to multiple workqueue threads. Single threads
make the problems go away, so return to that until we fully
understand the concurrency issues with multiple threads.

Signed-off-by: David Teigland

David Teigland
2011-02-12 06:50:47 +0800

14 Dec, 2010

1 commit

b9d410527 dlm: sanitize work_start() in lowcomms.c ... Browse Code »

The create_workqueue() returns NULL if failed rather than ERR_PTR().
Fix error checking and remove unnecessary variable 'error'.

Signed-off-by: Namhyung Kim
Cc: Tejun Heo
Signed-off-by: David Teigland

Namhyung Kim
2010-12-14 03:42:24 +0800

13 Nov, 2010

3 commits

f92c8dd7a dlm: reduce cond_resched during send ... Browse Code »

Calling cond_resched() after every send can unnecessarily
degrade performance. Go back to an old method of scheduling
after 25 messages.

Signed-off-by: Bob Peterson
Signed-off-by: David Teigland

Bob Peterson
2010-11-13 01:15:20 +0800
cb2d45da8 dlm: use TCP_NODELAY ... Browse Code »

Nagling doesn't help and can sometimes hurt dlm comms.

Signed-off-by: David Teigland

David Teigland
2010-11-13 01:12:55 +0800
dcce240ea dlm: Use cmwq for send and receive workqueues ... Browse Code »

So far as I can tell, there is no reason to use a single-threaded
send workqueue for dlm, since it may need to send to several sockets
concurrently. Both workqueues are set to WQ_MEM_RECLAIM to avoid
any possible deadlocks, WQ_HIGHPRI since locking traffic is highly
latency sensitive (and to avoid a priority inversion wrt GFS2's
glock_workqueue) and WQ_FREEZABLE just in case someone needs to do
that (even though with current cluster infrastructure, it doesn't
make sense as the node will most likely land up ejected from the
cluster) in the future.

Signed-off-by: Steven Whitehouse
Cc: Tejun Heo
Signed-off-by: David Teigland

Steven Whitehouse
2010-11-13 01:08:03 +0800

12 Nov, 2010

1 commit

b36930dd5 dlm: Handle application limited situations properly. ... Browse Code »

In the normal regime where an application uses non-blocking I/O
writes on a socket, they will handle -EAGAIN and use poll() to
wait for send space.

They don't actually sleep on the socket I/O write.

But kernel level RPC layers that do socket I/O operations directly
and key off of -EAGAIN on the write() to "try again later" don't
use poll(), they instead have their own sleeping mechanism and
rely upon ->sk_write_space() to trigger the wakeup.

So they do effectively sleep on the write(), but this mechanism
alone does not let the socket layers know what's going on.

Therefore they must emulate what would have happened, otherwise
TCP cannot possibly see that the connection is application window
size limited.

Handle this, therefore, like SUNRPC by setting SOCK_NOSPACE and
bumping the ->sk_write_count as needed when we hit the send buffer
limits.

This should make TCP send buffer size auto-tuning and the
->sk_write_space() callback invocations actually happen.

Signed-off-by: David S. Miller
Signed-off-by: David Teigland

David Miller
2010-11-12 03:05:12 +0800

06 Aug, 2010

1 commit

f70cb33b9 fs/dlm: Drop unnecessary null test ... Browse Code »

hlist_for_each_entry binds its first argument to a non-null value, and thus
any null test on the value of that argument is superfluous.

The semantic patch that makes this change is as follows:
(http://coccinelle.lip6.fr/)

//
@@
iterator I;
expression x,E,E1,E2;
statement S,S1,S2;
@@

I(x,...) { }
//

Signed-off-by: Julia Lawall
Signed-off-by: David Teigland

Julia Lawall
2010-08-06 03:23:45 +0800

30 Mar, 2010

1 commit

5a0e3ad6a include cleanup: Update gfp.h and slab.h includes to prepare for breaking implic… ... Browse Code »

…it slab.h inclusion from percpu.h

percpu.h is included by sched.h and module.h and thus ends up being
included when building most .c files. percpu.h includes slab.h which
in turn includes gfp.h making everything defined by the two files
universally available and complicating inclusion dependencies.

percpu.h -> slab.h dependency is about to be removed. Prepare for
this change by updating users of gfp and slab facilities include those
headers directly instead of assuming availability. As this conversion
needs to touch large number of source files, the following script is
used as the basis of conversion.

http://userweb.kernel.org/~tj/misc/slabh-sweep.py

The script does the followings.

* Scan files for gfp and slab usages and update includes such that
only the necessary includes are there. ie. if only gfp is used,
gfp.h, if slab is used, slab.h.

* When the script inserts a new include, it looks at the include
blocks and try to put the new include such that its order conforms
to its surrounding. It's put in the include block which contains
core kernel includes, in the same order that the rest are ordered -
alphabetical, Christmas tree, rev-Xmas-tree or at the end if there
doesn't seem to be any matching order.

* If the script can't find a place to put a new include (mostly
because the file doesn't have fitting include block), it prints out
an error message indicating which .h file needs to be added to the
file.

The conversion was done in the following steps.

1. The initial automatic conversion of all .c files updated slightly
over 4000 files, deleting around 700 includes and adding ~480 gfp.h
and ~3000 slab.h inclusions. The script emitted errors for ~400
files.

2. Each error was manually checked. Some didn't need the inclusion,
some needed manual addition while adding it to implementation .h or
embedding .c file was more appropriate for others. This step added
inclusions to around 150 files.

3. The script was run again and the output was compared to the edits
from #2 to make sure no file was left behind.

4. Several build tests were done and a couple of problems were fixed.
e.g. lib/decompress_*.c used malloc/free() wrappers around slab
APIs requiring slab.h to be added manually.

5. The script was run on all .h files but without automatically
editing them as sprinkling gfp.h and slab.h inclusions around .h
files could easily lead to inclusion dependency hell. Most gfp.h
inclusion directives were ignored as stuff from gfp.h was usually
wildly available and often used in preprocessor macros. Each
slab.h inclusion directive was examined and added manually as
necessary.

6. percpu.h was updated not to include slab.h.

7. Build test were done on the following configurations and failures
were fixed. CONFIG_GCOV_KERNEL was turned off for all tests (as my
distributed build env didn't work with gcov compiles) and a few
more options had to be turned off depending on archs to make things
build (like ipr on powerpc/64 which failed due to missing writeq).

* x86 and x86_64 UP and SMP allmodconfig and a custom test config.
* powerpc and powerpc64 SMP allmodconfig
* sparc and sparc64 SMP allmodconfig
* ia64 SMP allmodconfig
* s390 SMP allmodconfig
* alpha SMP allmodconfig
* um on x86_64 SMP allmodconfig

8. percpu.h modifications were reverted so that it could be applied as
a separate patch and serve as bisection point.

Given the fact that I had only a couple of failures from tests on step
6, I'm fairly confident about the coverage of this conversion patch.
If there is a breakage, it's likely to be something in one of the arch
headers which should be easily discoverable easily on most builds of
the specific arch.

Signed-off-by: Tejun Heo <tj@kernel.org>
Guess-its-ok-by: Christoph Lameter <cl@linux-foundation.org>
Cc: Ingo Molnar <mingo@redhat.com>
Cc: Lee Schermerhorn <Lee.Schermerhorn@hp.com>

Tejun Heo
2010-03-30 21:02:32 +0800

01 Dec, 2009

1 commit

573c24c4a dlm: always use GFP_NOFS ... Browse Code »

Replace all GFP_KERNEL and ls_allocation with GFP_NOFS.
ls_allocation would be GFP_KERNEL for userland lockspaces
and GFP_NOFS for file system lockspaces.

It was discovered that any lockspaces on the system can
affect all others by triggering memory reclaim in the
file system which could in turn call back into the dlm
to acquire locks, deadlocking dlm threads that were
shared by all lockspaces, like dlm_recv.

Signed-off-by: David Teigland

David Teigland
2009-12-01 06:34:43 +0800

01 Oct, 2009

2 commits

6861f3507 dlm: fix socket fd translation ... Browse Code »

The code to set up sctp sockets was not using the sockfd_lookup()
and sockfd_put() routines to translate an fd to a socket. The
direct fget and fput calls were resulting in error messages from
alloc_fd().

Also clean up two log messages and remove a third, related to
setting up sctp associations.

Signed-off-by: David Teigland

David Teigland
2009-10-01 01:19:44 +0800
04bedd79a dlm: fix lowcomms_connect_node for sctp ... Browse Code »

The recently added dlm_lowcomms_connect_node() from
391fbdc5d527149578490db2f1619951d91f3561 does not work
when using SCTP instead of TCP. The sctp connection code
has nothing to do without data to send. Check for no data
in the sctp connection code and do nothing instead of
triggering a BUG. Also have connect_node() do nothing
when the protocol is sctp.

Signed-off-by: David Teigland

David Teigland
2009-10-01 01:19:44 +0800

25 Aug, 2009

2 commits

1329e3f2c dlm: use kernel_sendpage ... Browse Code »

Using kernel_sendpage() is cleaner and safer than following
sock->ops ourselves.

Signed-off-by: Paolo Bonzini
Signed-off-by: David Teigland

Paolo Bonzini
2009-08-25 02:18:04 +0800
063c4c996 dlm: fix connection close handling ... Browse Code »

Closing a connection to a node can create problems if there are
outstanding messages for that node. The problems include dlm_send
spinning attempting to reconnect, or BUG from tcp_connect_to_sock()
attempting to use a partially closed connection.

To cleanly close a connection, we now first attempt to send any pending
messages, cancel any remaining workqueue work, and flag the connection
as closed to avoid reconnect attempts.

Signed-off-by: Lars Marowsky-Bree
Signed-off-by: Christine Caulfield
Signed-off-by: David Teigland

Lars Marowsky-Bree
2009-08-25 02:13:56 +0800

19 Aug, 2009

1 commit

b5711b8e5 dlm: fix double-release of socket in error exit path ... Browse Code »

The last correction to the tcp_connect_to_sock error exit path,
commit a89d63a159b1ba5833be2bef00adf8ad8caac8be, can free an already
freed socket, due to collision with a previous (incomplete) attempt
to fix the same issue, commit 311f6fc77c51926dbdfbeab0a5d88d70f01fa3f4.

Signed-off-by: Casey Dahlin
Signed-off-by: David Teigland

Casey Dahlin
2009-08-19 04:09:24 +0800

15 Jul, 2009

1 commit

a89d63a15 dlm: free socket in error exit path ... Browse Code »

In the tcp_connect_to_sock() error exit path, the socket
allocated at the top of the function was not being freed.

Signed-off-by: Casey Dahlin
Signed-off-by: David Teigland

Casey Dahlin
2009-07-15 01:28:43 +0800

16 May, 2009

1 commit

748285ccf dlm: use more NOFS allocation ... Browse Code »

Change some GFP_KERNEL allocations to use either GFP_NOFS or
ls_allocation (when available) which the fs sets to GFP_NOFS.
The point is to prevent allocations from going back into the
cluster fs in places where that might lead to deadlock.

Signed-off-by: David Teigland

David Teigland
2009-05-16 00:24:59 +0800

15 May, 2009

1 commit

391fbdc5d dlm: connect to nodes earlier ... Browse Code »

Make network connections to other nodes earlier, in the context of
dlm_recoverd. This avoids connecting to nodes from dlm_send where we
try to avoid allocations which could possibly deadlock if memory reclaim
goes into the cluster fs which may try to do a dlm operation.

Signed-off-by: Christine Caulfield
Signed-off-by: David Teigland

Christine Caulfield
2009-05-15 22:34:12 +0800

12 Mar, 2009

1 commit

5e9ccc372 dlm: replace idr with hash table for connections ... Browse Code »

Integer nodeids can be too large for the idr code; use a hash
table instead.

Signed-off-by: Christine Caulfield
Signed-off-by: David Teigland

Christine Caulfield
2009-03-12 01:20:58 +0800

29 Jan, 2009

2 commits

2cf12c0bf dlm: comment typo fixes ... Browse Code »

Signed-off-by: Joe Perches
Signed-off-by: David Teigland

Joe Perches
2009-01-29 02:56:07 +0800
44ad532b3 dlm: use ipv6_addr_copy ... Browse Code »

Signed-off-by: Joe Perches
Signed-off-by: David Teigland

Joe Perches
2009-01-29 02:56:02 +0800

24 Dec, 2008

2 commits

1521848cb dlm: remove kmap/kunmap ... Browse Code »

The pages used in lowcomms are not highmem, so kmap is not necessary.

Cc: Christine Caulfield
Signed-off-by: Steven Whitehouse
Signed-off-by: David Teigland

Steven Whitehouse
2008-12-24 00:16:01 +0800
d6d7b702a dlm: fix up memory allocation flags ... Browse Code »

Use ls_allocation for memory allocations, which a cluster fs sets to
GFP_NOFS. Use GFP_NOFS for allocations when no lockspace struct is
available. Taking dlm locks needs to avoid calling back into the
cluster fs because write-out can require taking dlm locks.

Cc: Christine Caulfield
Signed-off-by: Steven Whitehouse
Signed-off-by: David Teigland

Steven Whitehouse
2008-12-24 00:15:40 +0800

15 Jul, 2008

1 commit

311f6fc77 dlm: release socket on error ... Browse Code »

It seems that `sock' allocated by sock_create_kern in
tcp_connect_to_sock() of dlm/fs/lowcomms.c is not released if
dlm_nodeid_to_addr an error.

Acked-by: Christine Caulfield
Signed-off-by: Masatake YAMATO
Signed-off-by: David Teigland

Masatake YAMATO
2008-07-15 02:56:59 +0800

20 May, 2008

2 commits

0035a4b14 dlm: tcp_connect_to_sock should check for -EINVAL, not EINVAL ... Browse Code »

Signed-off-by: Marcin Slusarz
Cc: Christine Caulfield
Cc: David Teigland
Cc: cluster-devel@redhat.com
Signed-off-by: David Teigland

Marcin Slusarz
2008-05-20 04:37:27 +0800
7a936ce71 dlm: convert connections_lock in a mutex ... Browse Code »

The semaphore connections_lock is used as a mutex. Convert it to the mutex
API.

Signed-off-by: Matthias Kaehlcke
Cc: Christine Caulfield
Cc: David Teigland
Cc: Steven Whitehouse
Signed-off-by: Andrew Morton
Signed-off-by: David Teigland

Matthias Kaehlcke
2008-05-20 04:37:27 +0800

30 Jan, 2008

2 commits

39bd4177d dlm: close othercons ... Browse Code »

This patch addresses a problem introduced with the last round of
lowcomms patches where the 'othercon' connections do not get freed when
the DLM shuts down.

This results in the error message
"slab error in kmem_cache_destroy(): cache `dlm_conn': Can't free all
objects"

and the DLM cannot be restarted without a system reboot.

See bz#428119

Signed-off-by: Patrick Caulfield
Signed-off-by: Fabio M. Di Nitto
Signed-off-by: David Teigland

Patrick Caulfeld
2008-01-30 07:17:32 +0800
6bd8fedaa dlm: bind connections from known local address when using TCP ... Browse Code »

A common problem occurs when multiple IP addresses within the same
subnet are assigned to the same NIC. If we make a connection attempt to
another address on the same subnet as one of those addresses, the
connection attempt will not necessarily be routed from the address we
want.

In the case of the DLM, the other nodes will quickly drop the connection
attempt, causing problems.

This patch makes the DLM bind to the local address it acquired from the
cluster manager when using TCP prior to making a connection, obviating
the need for administrators to "fix" their systems or use clever routing
tricks.

Signed-off-by: Lon Hohberger
Signed-off-by: Patrick Caulfield
Signed-off-by: David Teigland

Lon Hohberger
2008-01-30 06:44:25 +0800

07 Nov, 2007

1 commit

df61c9526 [DLM] lowcomms: Do not muck with sysctl_rmem_max. ... Browse Code »

Use SO_RCVBUFFORCE instead.

Signed-off-by: David S. Miller

David S. Miller
2007-11-07 20:11:42 +0800

10 Oct, 2007

2 commits

d66f8277f [DLM] Make dlm_sendd cond_resched more ... Browse Code »

Under high recovery loads dlm_sendd can monopolise the CPU and cause soft lockups.

This one extra and one moved cond_resched() make it yield a little more during
such times keeping work moving.

Signed-Off-By: Patrick Caulfield
Signed-off-by: Steven Whitehouse

Patrick Caulfield
2007-10-10 15:56:19 +0800
61d96be0f [DLM] Fix lowcomms socket closing ... Browse Code »

This patch fixes the slight mess made in lowcomms closing by previous patches
and fixes all sorts of DLM hangs.

Signed-Off-By: Patrick Caulfield
Signed-off-by: Steven Whitehouse

Patrick Caulfield
2007-10-10 15:55:39 +0800

14 Aug, 2007

3 commits

9e5f2825a [DLM] More othercon fixes ... Browse Code »

The last patch to clean out 'othercon' structures only fixed half the problem.
The attached addresses the other situations too, and fixes bz#238490

Signed-Off-By: Patrick Caulfield
Signed-off-by: Steven Whitehouse

Patrick Caulfield
2007-08-14 17:30:36 +0800
01c8cab25 [DLM] zero unused parts of sockaddr_storage ... Browse Code »

When we build a sockaddr_storage for an IP address, clear the unused parts as
they could be used for node comparisons.

I have seen this occasionally make sctp connections fail.

Signed-Off-By: Patrick Caulfield
Signed-off-by: Steven Whitehouse

Patrick Caulfield
2007-08-14 17:29:27 +0800
25720c2d7 [DLM] Clear othercon pointers when a connection is closed ... Browse Code »

This patch clears the othercon pointer and frees the memory when a connnection
is closed. This could cause a small memory leak when nodes leave the cluster.

Signed-Off-By: Patrick Caulfield
Signed-off-by: Steven Whitehouse

Patrick Caulfield
2007-08-14 17:28:05 +0800

20 Jul, 2007

1 commit

20c2df83d mm: Remove slab destructors from kmem_cache_create(). ... Browse Code »

Slab destructors were no longer supported after Christoph's
c59def9f222d44bb7e2f0a559f2906191a0862d7 change. They've been
BUGs for both slab and slub, and slob never supported them
either.

This rips out support for the dtor pointer from kmem_cache_create()
completely and fixes up every single callsite in the kernel (there were
about 224, not including the slab allocator definitions themselves,
or the documentation references).

Signed-off-by: Paul Mundt

Paul Mundt
2007-07-20 09:11:58 +0800

09 Jul, 2007

2 commits

f4fadb23c [GFS2] git-gfs2-nmw-build-fix ... Browse Code »

Cc: Steven Whitehouse
Signed-off-by: Andrew Morton
Signed-off-by: Steven Whitehouse

akpm@linux-foundation.org
2007-07-09 15:24:06 +0800
97d848365 [DLM] Telnet to port 21064 can stop all lockspaces ... Browse Code »

This patch fixes Red Hat bz#245892

Opening a tcp connection from a cluster member to another cluster member
targeting the dlm port it is enough to stop every dlm operation in the cluster.
This means that GFS and rgmanager will hang.

Signed-Off-By: Patrick Caulfield
Signed-off-by: Steven Whitehouse

Patrick Caulfield
2007-07-09 15:23:57 +0800