09 Sep, 2010

30 commits

  • Add two CMSGs for masked versions of cswp and fadd. The args
    struct is modified to use a union for the different atomic op types'
    arguments. Change IB to do masked atomic ops. The atomic op type
    in rds_message is similarly unionized.

    Signed-off-by: Andy Grover

    Andy Grover
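
    A minimal sketch of the union idea above, with illustrative field names
    rather than the exact uapi layout (__u64 comes from <linux/types.h>):

    struct atomic_op_args {
            __u64 local_addr;       /* user address that receives the old value */
            __u64 remote_addr;      /* remote address the atomic op is applied to */
            union {
                    struct { __u64 add; } fadd;
                    struct { __u64 compare, swap; } cswp;
                    struct { __u64 add, nocarry_mask; } m_fadd;
                    struct { __u64 compare, swap,
                                   compare_mask, swap_mask; } m_cswp;
            };
    };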
     
  • This prints the constant identifier for work completion status and rdma
    cm event types, like we already do for IB event types.

    A core string array helper is added that each string type uses.

    Signed-off-by: Zach Brown

    Zach Brown
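
    A hedged sketch of that pattern; the array and helper names here are
    illustrative, only the IB_WC_* constants are real:

    static const char * const wc_status_strings[] = {
            [IB_WC_SUCCESS]     = "success",
            [IB_WC_LOC_LEN_ERR] = "local length error",
            /* ... one entry per IB_WC_* status ... */
    };

    static const char *str_from_array(const char * const *array,
                                      size_t len, unsigned int index)
    {
            if (index < len && array[index])
                    return array[index];
            return "unknown";
    }

    /* e.g. str_from_array(wc_status_strings,
     *                     ARRAY_SIZE(wc_status_strings), wc->status) */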
     
  • Right now there's nothing to stop the various paths that use
    rs->rs_transport from racing with rmmod and executing freed transport
    code. The simple fix is to have binding to a transport also hold a
    reference to the transport's module, removing this class of races.

    We already had an unused t_owner field which was set for the modular
    transports and which wasn't set for the built-in loop transport.

    Signed-off-by: Zach Brown

    Zach Brown
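
    A sketch of the idea, assuming a bind-time helper along these lines (the
    patch itself may structure it differently):

    static int rds_bind_transport(struct rds_sock *rs,
                                  struct rds_transport *trans)
    {
            /* Pin the transport module so rmmod can't free the code that
             * rs->rs_transport points into. try_module_get(NULL) succeeds,
             * which covers the built-in loop transport's unset t_owner. */
            if (!try_module_get(trans->t_owner))
                    return -ENODEV;

            rs->rs_transport = trans;
            return 0;
    }

    The matching module_put(trans->t_owner) then happens when the socket is
    released.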
     
  • rs_transport is now also used by the rdma paths once the socket is
    bound. We don't need this stale comment to tell us what cscope can.

    Signed-off-by: Zach Brown

    Zach Brown
     
  • The trivial amount of memory saved isn't worth the cost of dealing with section
    mismatches.

    Signed-off-by: Zach Brown

    Zach Brown
     
  • rds_send_xmit() was changed to hold an interrupt masking spinlock instead of a
    mutex so that it could be called from the IB receive tasklet path. This broke
    the TCP transport because its xmit method can block and because it masks and
    unmasks interrupts itself.

    This patch serializes callers to rds_send_xmit() with a simple bit instead of
    the current spinlock or previous mutex. This enables rds_send_xmit() to be
    called from any context and to call functions which block. Getting rid of the
    c_send_lock exposes the bare c_lock acquisitions which are changed to block
    interrupts.

    A waitqueue is added so that rds_conn_shutdown() can wait for callers to leave
    rds_send_xmit() before tearing down partial send state. This lets us get rid
    of c_senders.

    rds_send_xmit() is changed to check the conn state after acquiring the
    RDS_IN_XMIT bit to resolve races with the shutdown path. Previously both
    worked with the conn state and then the lock in the same order, allowing them
    to race and execute the paths concurrently.

    rds_send_reset() isn't racing with rds_send_xmit() now that rds_conn_shutdown()
    properly ensures that rds_send_xmit() can't start once the conn state has been
    changed. We can remove its previous use of the spinlock.

    Finally, c_send_generation is redundant. Callers can race to test the c_flags
    bit by simply retrying instead of racing to test the c_send_generation atomic.

    Signed-off-by: Zach Brown

    Zach Brown
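
    A hedged sketch of the bit-and-waitqueue pattern; RDS_IN_XMIT and c_flags
    come from the text above, while the waitqueue and helper names are
    assumptions:

    static int acquire_in_xmit(struct rds_connection *conn)
    {
            /* only one caller at a time gets to run rds_send_xmit() */
            return test_and_set_bit(RDS_IN_XMIT, &conn->c_flags) == 0;
    }

    static void release_in_xmit(struct rds_connection *conn)
    {
            clear_bit(RDS_IN_XMIT, &conn->c_flags);
            smp_mb__after_clear_bit();
            wake_up_all(&conn->c_waitq);
    }

    /* rds_conn_shutdown() can then drain senders with:
     *      wait_event(conn->c_waitq,
     *                 !test_bit(RDS_IN_XMIT, &conn->c_flags));
     */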
     
  • rds_send_acked_before() wasn't blocking interrupts when acquiring c_lock from
    user context but nothing calls it. Rather than fix its use of c_lock we just
    remove the function.

    Signed-off-by: Zach Brown

    Zach Brown
     
  • A few paths had the same block of code to queue a connection's connect work if
    it was in the right state. Let's move this into a helper function.

    Signed-off-by: Zach Brown

    Zach Brown
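
    A sketch of what such a helper can look like; the identifiers follow the
    RDS code but should be read as assumptions rather than quotes from the
    patch:

    static void rds_conn_connect_if_down(struct rds_connection *conn)
    {
            if (rds_conn_state(conn) == RDS_CONN_DOWN &&
                !test_and_set_bit(RDS_RECONNECT_PENDING, &conn->c_flags))
                    queue_delayed_work(rds_wq, &conn->c_conn_w, 0);
    }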
     
  • This is the first in a long line of patches that tries to fix races
    between RDS connection shutdown and RDS traffic.

    Here we are maintaining a count of active senders to make sure
    the connection doesn't go away while they are using it.

    Signed-off-by: Chris Mason

    Chris Mason
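
    A minimal sketch of the counter, assuming an atomic_t field named
    c_senders (the name also shows up in a later entry above):

    /* entering the send path */
    atomic_inc(&conn->c_senders);

    /* ... transmit work ... */

    /* leaving the send path */
    atomic_dec(&conn->c_senders);

    /* Shutdown then waits for atomic_read(&conn->c_senders) to reach zero
     * before it frees per-connection send state. */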
     
  • The RDS bind lookups are somewhat expensive in terms of CPU
    time and locking overhead. This commit changes them into a
    faster RCU-based hash table instead of the rbtrees they were using
    before.

    On large NUMA systems it is a significant improvement.

    Signed-off-by: Chris Mason

    Chris Mason
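
    A hedged sketch of an RCU read-side bind lookup over hash buckets of
    hlists, written against the current three-argument
    hlist_for_each_entry_rcu(); the bucket and reference helpers are
    hypothetical:

    static struct rds_sock *rds_bind_lookup(__be32 addr, __be16 port)
    {
            struct hlist_head *head = rds_bind_bucket(addr, port);  /* hypothetical */
            struct rds_sock *rs;

            rcu_read_lock();
            hlist_for_each_entry_rcu(rs, head, rs_bound_node) {
                    if (rs->rs_bound_addr == addr &&
                        rs->rs_bound_port == port) {
                            rds_sock_addref(rs);    /* hypothetical ref helper */
                            rcu_read_unlock();
                            return rs;
                    }
            }
            rcu_read_unlock();
            return NULL;
    }

    Writers still take a lock and use hlist_add_head_rcu()/hlist_del_init_rcu()
    so readers never see a half-updated bucket.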
     
  • This removes a global waitqueue used to wait for rds messages
    and replaces it with a waitqueue inside the rds_message struct.

    The global waitqueue turns into a global lock and significantly
    bottlenecks operations on large machines.

    Signed-off-by: Chris Mason

    Chris Mason
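
    A sketch of the per-message wait, with assumed field and flag names:

    static void rds_message_wait(struct rds_message *rm)
    {
            /* rm->m_flush_wait is the new per-message wait_queue_head_t */
            wait_event(rm->m_flush_wait,
                       !test_bit(RDS_MSG_MAPPED, &rm->m_flags));
    }

    static void rds_message_unmapped(struct rds_message *rm)
    {
            clear_bit(RDS_MSG_MAPPED, &rm->m_flags);
            smp_mb__after_clear_bit();
            wake_up(&rm->m_flush_wait);
    }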
     
  • rds_send_xmit is required to loop around after it releases the lock
    because someone else could have done a trylock, found someone working on the
    list and backed off.

    But, once we drop our lock, it is possible that someone else does come
    in and make progress on the list. We should detect this and not loop
    around if another process is actually working on the list.

    This patch adds a generation counter that is bumped every time we
    get the lock and do some send work. If the retry notices someone else
    has bumped the generation counter, it does not need to loop around and
    continue working.

    Signed-off-by: Chris Mason
    Signed-off-by: Andy Grover

    Chris Mason
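
    A sketch of the check as a fragment; c_send_generation is named in a later
    entry above, and the surrounding details are illustrative:

    /* with c_send_lock held, before doing send work */
    send_gen = atomic_inc_return(&conn->c_send_generation);

    /* ... send work, then the lock is dropped ... */

    /* Loop around only if nobody else bumped the counter in the meantime;
     * if they did, they are (or were) making progress on the list. */
    if (!list_empty(&conn->c_send_queue) &&
        send_gen == atomic_read(&conn->c_send_generation))
            goto restart;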
     
  • Signed-off-by: Andy Grover

    Andy Grover
     
  • This change allows us to call rds_send_xmit() from a tasklet,
    which is crucial to our new operating model.

    * Change c_send_lock to a spinlock
    * Update stats fields "sem_" to "_lock"
    * Remove unneeded rds_conn_is_sending()

    About locking between shutdown and send -- send checks if the
    connection is up. Shutdown puts the connection into
    DISCONNECTING. After this, all threads entering send will exit
    immediately. However, a thread could be *in* send_xmit(), so
    shutdown acquires the c_send_lock to ensure everyone is out
    before proceeding with connection shutdown.

    Signed-off-by: Andy Grover

    Andy Grover
     
  • We now ask the transport to give us a rm for the congestion
    map, and then we handle it normally. Previously, the
    transport defined a function that we would call to send
    a congestion map.

    Convert TCP and loop transports to new cong map method.

    Signed-off-by: Andy Grover

    Andy Grover
     
  • Previously, RDS would wait until the final send WR had completed
    and then handle cleanup. With silent ops, we do not know
    if an atomic, rdma, or data op will be last. This patch
    handles any of these cases by keeping a pointer to the last
    op in the message in m_last_op.

    When the TX completion event fires, rds dispatches to per-op-type
    cleanup functions, and then does whole-message cleanup if the op
    that completed equals m_last_op.

    This patch also moves towards having op-specific functions take
    the op struct, instead of the overall rm struct.

    rds_ib_connection has a pointer to keep track of a partially-
    completed data send operation. This patch changes it from an
    rds_message pointer to the narrower rm_data_op pointer, and
    modifies places that use this pointer as needed.

    Signed-off-by: Andy Grover

    Andy Grover
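
    A hedged sketch of the completion dispatch; m_last_op is from the text
    above, while the op-type constants and unmap helpers are illustrative:

    static void rds_ib_complete_op(struct rds_ib_connection *ic,
                                   struct rds_message *rm,
                                   void *op, int op_type, int wc_status)
    {
            /* per-op-type cleanup first */
            switch (op_type) {
            case RDS_OP_DATA:
                    unmap_data_op(ic, op, wc_status);
                    break;
            case RDS_OP_RDMA:
                    unmap_rdma_op(ic, op, wc_status);
                    break;
            case RDS_OP_ATOMIC:
                    unmap_atomic_op(ic, op, wc_status);
                    break;
            }

            /* whole-message cleanup only when the op that just completed is
             * the one recorded as last, whatever its type */
            if (op == rm->m_last_op)
                    rds_message_put(rm);    /* stand-in for the full rm teardown */
    }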
     
  • Add a flag to the API so users can indicate they want
    silent operations. This is needed because silent ops
    cannot be used with USE_ONCE MRs, so we can't just
    assume silent.

    Also, change send_xmit to do atomic op before rdma op if
    both are present, and centralize the hairy logic to determine if
    we want to attempt silent, or not.

    Signed-off-by: Andy Grover

    Andy Grover
     
  • Also, add a comment.

    Signed-off-by: Andy Grover

    Andy Grover
     
  • Simplify rds_send_xmit().

    Send a congestion map (via xmit_cong_map) without
    decrementing send_quota.

    Move resetting of conn xmit variables to end of loop.

    Update comments.

    Implement a special case to turn off sending an rds header
    when there is an atomic op and no other data.

    Signed-off-by: Andy Grover

    Andy Grover
     
  • For consistency.

    Signed-off-by: Andy Grover

    Andy Grover
     
  • A big changeset, but it's all pretty dumb.

    struct rds_rdma_op was already embedded in struct rm_rdma_op.
    Remove rds_rdma_op and put its members in rm_rdma_op. Rename
    members with "op_" prefix instead of "r_", for consistency.

    Of course this breaks a lot, so fixup the code accordingly.

    Signed-off-by: Andy Grover

    Andy Grover
     
  • Add atomic_free_op function, analogous to rdma_free_op,
    and call it in rds_message_purge().

    Signed-off-by: Andy Grover

    Andy Grover
     
  • Signed-off-by: Andy Grover

    Andy Grover
     
  • Signed-off-by: Andy Grover

    Andy Grover
     
  • Implement a CMSG-based interface to do FADD and CSWP ops.

    Alter send routines to handle atomic ops.

    Add atomic counters to stats.

    Add xmit_atomic() to struct rds_transport

    Inline rds_ib_send_unmap_rdma into unmap_rm

    Signed-off-by: Andy Grover

    Andy Grover
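
    A hedged sketch of the sendmsg-side hookup; xmit_atomic() is named above,
    while the cmsg constants and the handler follow the RDS uapi/code but
    should be read as assumptions:

    /* in the cmsg walk inside rds_sendmsg(), roughly: */
    switch (cmsg->cmsg_type) {
    case RDS_CMSG_ATOMIC_CSWP:
    case RDS_CMSG_ATOMIC_FADD:
            ret = rds_cmsg_atomic(rs, rm, cmsg);    /* fill in rm's atomic op */
            break;
    /* ... existing RDMA and congestion-monitor cases ... */
    }

    /* new transport hook, called from the send path:
     *      int (*xmit_atomic)(struct rds_connection *conn,
     *                         struct rm_atomic_op *op);
     */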
     
  • This eliminates a separate memory alloc, although
    it is now necessary to add an "r_active" flag, since
    it is no longer possible to use the m_rdma_op pointer as an
    indicator of whether an rdma op is present.

    rdma SGs allocated from rm sg pool.

    rds_rm_size also gets bigger. It's a little inefficient to
    run through CMSGs twice, but it makes later steps a lot smoother.

    Signed-off-by: Andy Grover

    Andy Grover
     
  • RDMA is now an intrinsic part of RDS, so it's easier to just have
    a single header.

    Signed-off-by: Andy Grover

    Andy Grover
     
  • r_m_copy_from_user used to allocate the rm as well as kernel
    buffers for the data, and then copy the data in. Now, sendmsg()
    allocates the rm, although the data buffer alloc still happens
    in r_m_copy_from_user.

    SGs are still allocated with rm, but now r_m_alloc_sgs() is
    used to reserve them. This allows multiple SG lists to be
    allocated from the one rm -- this is important once we also
    want to alloc our rdma sgl from this pool.

    Signed-off-by: Andy Grover

    Andy Grover
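
    A sketch of the reservation helper; the pool fields are assumptions about
    the layout, not quotes from the patch:

    static struct scatterlist *rds_message_alloc_sgs(struct rds_message *rm,
                                                     int nents)
    {
            struct scatterlist *sg_first;

            if (rm->m_used_sgs + nents > rm->m_total_sgs)
                    return NULL;                    /* the rm's sg pool is spent */

            sg_first = &rm->m_sg_pool[rm->m_used_sgs];      /* hypothetical array */
            sg_init_table(sg_first, nents);
            rm->m_used_sgs += nents;
            return sg_first;
    }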
     
  • Clearly separate rdma-related variables in rm from data-related ones.
    This is in anticipation of adding atomic support.

    Signed-off-by: Andy Grover

    Andy Grover
     
  • This fits better in connection.c, rather than threads.c.

    Signed-off-by: Andy Grover

    Andy Grover
     

21 Apr, 2010

1 commit

  • Define a new function to return the waitqueue of a "struct sock".

    static inline wait_queue_head_t *sk_sleep(struct sock *sk)
    {
            return sk->sk_sleep;
    }

    Replace all read occurrences of sk_sleep with a call to this function.

    Needed for a future RCU conversion: sk_sleep won't be a directly
    available field anymore.

    Signed-off-by: Eric Dumazet
    Signed-off-by: David S. Miller

    Eric Dumazet
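
    A typical call-site conversion then looks like this (one of the common
    wakeup patterns, shown for illustration):

    /* before */
    if (sk->sk_sleep && waitqueue_active(sk->sk_sleep))
            wake_up_interruptible(sk->sk_sleep);

    /* after */
    if (sk_sleep(sk) && waitqueue_active(sk_sleep(sk)))
            wake_up_interruptible(sk_sleep(sk));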
     

17 Mar, 2010

1 commit

  • rds_poll_waitq's listeners will be awoken if we receive a congestion
    notification. Bad performance may result because *all* polled sockets
    contend for this single lock. However, it should not be necessary to
    wake pollers when a congestion update arrives if they have never
    experienced congestion, and not putting these on the waitq will
    hopefully greatly reduce contention.

    Signed-off-by: Andy Grover
    Signed-off-by: David S. Miller

    Andy Grover
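
    A minimal sketch of the gating inside rds_poll(); the per-socket flag and
    the rds_sk_to_rs() accessor follow the RDS code but are assumptions here:

    static unsigned int rds_poll(struct file *file, struct socket *sock,
                                 poll_table *wait)
    {
            struct rds_sock *rs = rds_sk_to_rs(sock->sk);
            unsigned int mask = 0;

            /* Only sockets that have actually seen congestion sleep on the
             * shared rds_poll_waitq, so a congestion update no longer wakes
             * and serializes every polling socket in the system. */
            if (rs->rs_seen_congestion)
                    poll_wait(file, &rds_poll_waitq, wait);

            /* ... the usual per-socket readiness checks fill in mask ... */
            return mask;
    }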
     

24 Aug, 2009

1 commit

  • Now that transports can be loaded in arbitrary order,
    it is important for rds_trans_get_preferred() to look
    for them in a particular order, instead of walking the list
    until it finds a transport that works for a given address.
    Now, each transport registers for a specific transport slot,
    and these are ordered so that preferred transports come first,
    and then if they are not loaded, other transports are queried.

    Signed-off-by: Andy Grover
    Signed-off-by: David S. Miller

    Andy Grover
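
    A sketch of slot-ordered selection; the identifiers follow the RDS code
    (RDS_TRANS_COUNT, laddr_check) but the body is illustrative:

    static struct rds_transport *transports[RDS_TRANS_COUNT];

    struct rds_transport *rds_trans_get_preferred(__be32 addr)
    {
            struct rds_transport *ret = NULL;
            int i;

            /* Slots are ordered by preference, so the first registered
             * transport that accepts this local address wins. */
            for (i = 0; i < RDS_TRANS_COUNT; i++) {
                    if (transports[i] &&
                        transports[i]->laddr_check(addr) == 0) {
                            ret = transports[i];
                            break;
                    }
            }
            return ret;
    }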
     

22 Apr, 2009

1 commit

  • In non-SMP mode, the variable section attribute specified by DECLARE_PER_CPU()
    does not agree with that specified by DEFINE_PER_CPU(). This means that
    architectures that have a small data section references relative to a base
    register may throw up linkage errors due to too great a displacement between
    where the base register points and the per-CPU variable.

    On FRV, the .h declaration says that the variable is in the .sdata section, but
    the .c definition says it's actually in the .data section. The linker throws
    up the following errors:

    kernel/built-in.o: In function `release_task':
    kernel/exit.c:78: relocation truncated to fit: R_FRV_GPREL12 against symbol `per_cpu__process_counts' defined in .data section in kernel/built-in.o
    kernel/exit.c:78: relocation truncated to fit: R_FRV_GPREL12 against symbol `per_cpu__process_counts' defined in .data section in kernel/built-in.o

    To fix this, DECLARE_PER_CPU() should simply apply the same section attribute
    as does DEFINE_PER_CPU(). However, this is made slightly more complex by
    virtue of the fact that there are several variants on DEFINE, so these need to
    be matched by variants on DECLARE.

    Signed-off-by: David Howells
    Signed-off-by: Linus Torvalds

    David Howells
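
    A simplified illustration of the requirement; these are not the kernel's
    actual per-cpu macros, and the MY_ prefix marks them as hypothetical:

    /* The declaration and the definition must place the variable in the same
     * section, or base-register-relative relocations such as FRV's GPREL12
     * may not reach it. */
    #define MY_DECLARE_PER_CPU(type, name) \
            extern __attribute__((section(".data.percpu"))) \
            __typeof__(type) per_cpu__##name

    #define MY_DEFINE_PER_CPU(type, name) \
            __attribute__((section(".data.percpu"))) \
            __typeof__(type) per_cpu__##name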
     

02 Apr, 2009

1 commit

  • We have a 64bit value that needs to be set atomically.
    This is easy and quick on all 64bit archs, and can also be done
    on x86/32 with set_64bit() (uses cmpxchg8b). However other
    32b archs don't have this.

    I actually changed this to the current state in preparation for
    mainline because the old way (using a spinlock on 32b) resulted in
    unsightly #ifdefs in the code. But obviously, being correct takes
    precedence.

    Signed-off-by: Andy Grover
    Signed-off-by: David S. Miller

    Andy Grover
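
    One portable direction (a sketch of an option, not necessarily the fix
    that landed) is atomic64_t, whose generic fallback protects the value
    with a lock on 32-bit architectures that lack a native 64-bit cmpxchg:

    #include <linux/atomic.h>

    static atomic64_t next_seq;     /* hypothetical 64-bit value */

    static void set_seq(u64 seq)
    {
            atomic64_set(&next_seq, seq);   /* atomic on 32-bit and 64-bit */
    }

    static u64 get_seq(void)
    {
            return atomic64_read(&next_seq);
    }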
     

27 Feb, 2009

1 commit