Eric Lee / smarc-fsl-linux-kernel

18 Jul, 2011

4 commits

34006cee2 SUNRPC: Replace xprt->resend and xprt->sending with a priority queue ... Browse Code »

Signed-off-by: Trond Myklebust

Trond Myklebust
2011-07-18 06:11:34 +0800
d9ba131d8 SUNRPC: Support dynamic slot allocation for TCP connections ... Browse Code »
43

Allow the number of available slots to grow with the TCP window size.

Signed-off-by: Trond Myklebust

Trond Myklebust
2011-07-18 06:11:30 +0800
21de0a955 SUNRPC: Clean up the slot table allocation ... Browse Code »

Signed-off-by: Trond Myklebust

Trond Myklebust
2011-07-18 04:57:32 +0800
43cedbf0e SUNRPC: Ensure that we grab the XPRT_LOCK before calling xprt_alloc_slot ... Browse Code »
43

This throttles the allocation of new slots when the socket is busy
reconnecting and/or is out of buffer space.

Signed-off-by: Trond Myklebust

Trond Myklebust
2011-07-18 04:01:03 +0800

15 Jul, 2011

1 commit

9e00abc3c SUNRPC: sunrpc should not explicitly depend on NFS config options ... Browse Code »

Change explicit references to CONFIG_NFS_V4_1 to implicit ones
Get rid of the unnecessary defines in backchannel_rqst.c and
bc_svc.c: the Makefile takes care of those dependency.

Signed-off-by: Trond Myklebust

Trond Myklebust
2011-07-15 21:12:23 +0800

28 May, 2011

1 commit

176e21ee2 SUNRPC: Support for RPC over AF_LOCAL transports ... Browse Code »

TI-RPC introduces the capability of performing RPC over AF_LOCAL
sockets. It uses this mainly for registering and unregistering
local RPC services securely with the local rpcbind, but we could
also conceivably use it as a generic upcall mechanism.

This patch provides a client-side only implementation for the moment.
We might also consider a server-side implementation to provide
AF_LOCAL access to NLM (for statd downcalls, and such like).

Autobinding is not supported on kernel AF_LOCAL transports at this
time. Kernel ULPs must specify the pathname of the remote endpoint
when an AF_LOCAL transport is created. rpcbind supports registering
services available via AF_LOCAL, so the kernel could handle it with
some adjustment to ->rpcbind and ->set_port. But we don't need this
feature for doing upcalls via well-known named sockets.

This has not been tested with ULPs that move a substantial amount of
data. Thus, I can't attest to how robust the write_space and
congestion management logic is.

Signed-off-by: Chuck Lever
Signed-off-by: Trond Myklebust

Chuck Lever
2011-05-28 05:42:47 +0800

18 Mar, 2011

1 commit

a8de240a9 SUNRPC: Convert struct rpc_xprt to use atomic_t counters ... Browse Code »

Signed-off-by: Trond Myklebust

Trond Myklebust
2011-03-18 00:38:59 +0800

12 Jan, 2011

1 commit

f0418aa4b rpc: allow xprt_class->setup to return a preexisting xprt ... Browse Code »

This allows us to reuse the xprt associated with a server connection if
one has already been set up.

Signed-off-by: J. Bruce Fields

J. Bruce Fields
2011-01-12 04:04:10 +0800

02 Oct, 2010

4 commits

37aa21337 sunrpc: Tag rpc_xprt with net ... Browse Code »

The net is known from the xprt_create and this tagging will also
give un the context in the conntection workers where real sockets
are created.

Signed-off-by: Pavel Emelyanov
Signed-off-by: J. Bruce Fields

Pavel Emelyanov
2010-10-02 05:18:58 +0800
9a23e332e sunrpc: Add net to xprt_create ... Browse Code »

Signed-off-by: Pavel Emelyanov
Signed-off-by: J. Bruce Fields

Pavel Emelyanov
2010-10-02 05:18:57 +0800
e204e621b sunrpc: Factor out rpc_xprt freeing ... Browse Code »

Signed-off-by: Pavel Emelyanov
Signed-off-by: J. Bruce Fields

Pavel Emelyanov
2010-10-02 05:18:53 +0800
bd1722d43 sunrpc: Factor out rpc_xprt allocation ... Browse Code »

Signed-off-by: Pavel Emelyanov
Signed-off-by: J. Bruce Fields

Pavel Emelyanov
2010-10-02 05:18:52 +0800

04 Aug, 2010

1 commit

a17c2153d SUNRPC: Move the bound cred to struct rpc_rqst ... Browse Code »

This will allow us to save the original generic cred in rpc_message, so
that if we migrate from one server to another, we can generate a new bound
cred without having to punt back to the NFS layer.

Signed-off-by: Trond Myklebust

Trond Myklebust
2010-08-04 20:54:09 +0800

15 May, 2010

4 commits

d60dbb20a SUNRPC: Move the task->tk_bytes_sent and tk_rtt to struct rpc_rqst ... Browse Code »

It seems strange to maintain stats for bytes_sent in one structure, and
bytes received in another. Try to assemble all the RPC request-related
stats in struct rpc_rqst

Signed-off-by: Trond Myklebust

Trond Myklebust
2010-05-15 03:09:36 +0800
ff8399709 SUNRPC: Replace jiffies-based metrics with ktime-based metrics ... Browse Code »

Currently RPC performance metrics that tabulate elapsed time use
jiffies time values. This is problematic on systems that use slow
jiffies (for instance 100HZ systems built for paravirtualized
environments). It is also a problem for computing precise latency
statistics for advanced network transports, such as InfiniBand,
that can have round-trip latencies significanly faster than a single
clock tick.

For the RPC client, adopt the high resolution time stamp mechanism
already used by the network layer and blktrace: ktime.

We use ktime format time stamps for all internal computations, and
convert to milliseconds for presentation. As a result, we need only
addition operations in the performance critical paths; multiply/divide
is required only for presentation.

We could report RTT metrics in microseconds. In fact the mountstats
format is versioned to accomodate exactly this kind of interface
improvement.

For now, however, we'll stay with millisecond precision for
presentation to maintain backwards compatibility with the handful of
currently deployed user space tools. At a later point, we'll move to
an API such as BDI_STATS where a finer timestamp precision can be
reported.

Signed-off-by: Chuck Lever
Signed-off-by: Trond Myklebust

Chuck Lever
2010-05-15 03:09:33 +0800
bbc72cea5 SUNRPC: RPC metrics and RTT estimator should use same RTT value ... Browse Code »

Compute an RPC request's RTT once, and use that value both for reporting
RPC metrics, and for adjusting the RTT context used by the RPC client's RTT
estimator algorithm.

Signed-off-by: Chuck Lever
Signed-off-by: Trond Myklebust

Chuck Lever
2010-05-15 03:09:32 +0800
a8ce4a8f3 SUNRPC: Fail over more quickly on connect errors ... Browse Code »

We should not allow soft tasks to wait for longer than the major timeout
period when waiting for a reconnect to occur.

Remove the field xprt->connect_timeout since it has been obsoleted by
xprt->reestablish_timeout.

Signed-off-by: Trond Myklebust

Trond Myklebust
2010-05-15 03:09:30 +0800

14 Sep, 2009

1 commit

f300baba5 nfsd41: sunrpc: add new xprt class for nfsv4.1 backchannel ... Browse Code »

[sunrpc: change idle timeout value for the backchannel]
Signed-off-by: Alexandros Batsakis
Signed-off-by: Benny Halevy
Acked-by: Trond Myklebust
Signed-off-by: J. Bruce Fields

Alexandros Batsakis
2009-09-14 03:46:15 +0800

12 Sep, 2009

1 commit

4cfc7e601 nfsd41: sunrpc: Added rpc server-side backchannel handling ... Browse Code »

When the call direction is a reply, copy the xid and call direction into the
req->rq_private_buf.head[0].iov_base otherwise rpc_verify_header returns
rpc_garbage.

Signed-off-by: Rahul Iyer
Signed-off-by: Mike Sager
Signed-off-by: Marc Eshel
Signed-off-by: Benny Halevy
Signed-off-by: Ricardo Labiaga
Signed-off-by: Andy Adamson
Signed-off-by: Benny Halevy
[get rid of CONFIG_NFSD_V4_1]
[sunrpc: refactoring of svc_tcp_recvfrom]
[nfsd41: sunrpc: create common send routine for the fore and the back channels]
[nfsd41: sunrpc: Use free_page() to free server backchannel pages]
[nfsd41: sunrpc: Document server backchannel locking]
[nfsd41: sunrpc: remove bc_connect_worker()]
[nfsd41: sunrpc: Define xprt_server_backchannel()[
[nfsd41: sunrpc: remove bc_close and bc_init_auto_disconnect dummy functions]
[nfsd41: sunrpc: eliminate unneeded switch statement in xs_setup_tcp()]
[nfsd41: sunrpc: Don't auto close the server backchannel connection]
[nfsd41: sunrpc: Remove unused functions]
Signed-off-by: Alexandros Batsakis
Signed-off-by: Ricardo Labiaga
Signed-off-by: Benny Halevy
[nfsd41: change bc_sock to bc_xprt]
[nfsd41: sunrpc: move struct rpc_buffer def into a common header file]
[nfsd41: sunrpc: use rpc_sleep in bc_send_request so not to block on mutex]
[removed cosmetic changes]
Signed-off-by: Benny Halevy
[sunrpc: add new xprt class for nfsv4.1 backchannel]
[sunrpc: v2.1 change handling of auto_close and init_auto_disconnect operations for the nfsv4.1 backchannel]
Signed-off-by: Alexandros Batsakis
[reverted more cosmetic leftovers]
[got rid of xprt_server_backchannel]
[separated "nfsd41: sunrpc: add new xprt class for nfsv4.1 backchannel"]
Signed-off-by: Benny Halevy
Cc: Trond Myklebust
[sunrpc: change idle timeout value for the backchannel]
Signed-off-by: Alexandros Batsakis
Signed-off-by: Benny Halevy
Acked-by: Trond Myklebust
Signed-off-by: J. Bruce Fields

Rahul Iyer
2009-09-12 03:04:16 +0800

10 Aug, 2009

2 commits

c740eff84 SUNRPC: Kill RPC_DISPLAY_ALL ... Browse Code »

At some point, I recall that rpc_pipe_fs used RPC_DISPLAY_ALL.
Currently there are no uses of RPC_DISPLAY_ALL outside the transport
modules themselves, so we can safely get rid of it.

Signed-off-by: Chuck Lever
Signed-off-by: Trond Myklebust

Chuck Lever
2009-08-10 03:09:46 +0800
ba809130b SUNRPC: Remove duplicate universal address generation ... Browse Code »

RPC universal address generation is currently done in several places:
rpcb_clnt.c, nfs4proc.c xprtsock.c, and xprtrdma.c. Remove the
redundant cases that convert a socket address to a universal
address. The nfs4proc.c case takes a pre-formatted presentation
address string, not a socket address, so we'll leave that one.

Because the new uaddr constructor uses the recently introduced
rpc_ntop(), it now supports proper "::" shorthanding for IPv6
addresses. This allows the kernel to register properly formed
universal addresses with the local rpcbind service, in _all_ cases.

The kernel can now also send properly formed universal addresses in
RPCB_GETADDR requests, and support link-local properly when
encoding and decoding IPv6 addresses.

Signed-off-by: Chuck Lever
Signed-off-by: Trond Myklebust

Chuck Lever
2009-08-10 03:09:35 +0800

18 Jun, 2009

4 commits

dd2b63d04 nfs41: Rename rq_received to rq_reply_bytes_recvd ... Browse Code »

The 'rq_received' member of 'struct rpc_rqst' is used to track when we
have received a reply to our request. With v4.1, the backchannel
can now accept callback requests over the existing connection. Rename
this field to make it clear that it is only used for tracking reply bytes
and not all bytes received on the connection.

Signed-off-by: Ricardo Labiaga
Signed-off-by: Benny Halevy

Ricardo Labiaga
2009-06-18 05:11:40 +0800
55ae1aabf nfs41: Add backchannel processing support to RPC state machine ... Browse Code »

Adds rpc_run_bc_task() which is called by the NFS callback service to
process backchannel requests. It performs similar work to rpc_run_task()
though "schedules" the backchannel task to be executed starting at the
call_trasmit state in the RPC state machine.

It also introduces some miscellaneous updates to the argument validation,
call_transmit, and transport cleanup functions to take into account
that there are now forechannel and backchannel tasks.

Backchannel requests do not carry an RPC message structure, since the
payload has already been XDR encoded using the existing NFSv4 callback
mechanism.

Introduce a new transmit state for the client to reply on to backchannel
requests. This new state simply reserves the transport and issues the
reply. In case of a connection related error, disconnects the transport and
drops the reply. It requires the forechannel to re-establish the connection
and the server to retransmit the request, as stated in NFSv4.1 section
2.9.2 "Client and Server Transport Behavior".

Note: There is no need to loop attempting to reserve the transport. If EAGAIN
is returned by xprt_prepare_transmit(), return with tk_status == 0,
setting tk_action to call_bc_transmit. rpc_execute() will invoke it again
after the task is taken off the sleep queue.

[nfs41: rpc_run_bc_task() need not be exported outside RPC module]
[nfs41: New call_bc_transmit RPC state]
Signed-off-by: Ricardo Labiaga
Signed-off-by: Benny Halevy
[nfs41: Backchannel: No need to loop in call_bc_transmit()]
Signed-off-by: Andy Adamson
Signed-off-by: Ricardo Labiaga
Signed-off-by: Benny Halevy
[rpc_count_iostats incorrectly exits early]
Signed-off-by: Ricardo Labiaga
Signed-off-by: Benny Halevy
[Convert rpc_reply_expected() to inline function]
[Remove unnecessary BUG_ON()]
[Rename variable]
Signed-off-by: Ricardo Labiaga
Signed-off-by: Benny Halevy

Ricardo Labiaga
2009-06-18 05:11:24 +0800
fb7a0b9ad nfs41: New backchannel helper routines ... Browse Code »

This patch introduces support to setup the callback xprt on the client side.
It allocates/ destroys the preallocated memory structures used to process
backchannel requests.

At setup time, xprt_setup_backchannel() is invoked to allocate one or
more rpc_rqst structures and substructures. This ensures that they
are available when an RPC callback arrives. The rpc_rqst structures
are maintained in a linked list attached to the rpc_xprt structure.
We keep track of the number of allocations so that they can be correctly
removed when the channel is destroyed.

When an RPC callback arrives, xprt_alloc_bc_request() is invoked to
obtain a preallocated rpc_rqst structure. An rpc_xprt structure is
returned, and its RPC_BC_PREALLOC_IN_USE bit is set in
rpc_xprt->bc_flags. The structure is removed from the the list
since it is now in use, and it will be later added back when its
user is done with it.

After the RPC callback replies, the rpc_rqst structure is returned
by invoking xprt_free_bc_request(). This clears the
RPC_BC_PREALLOC_IN_USE bit and adds it back to the list, allowing it
to be reused by a subsequent RPC callback request.

To be consistent with the reception of RPC messages, the backchannel requests
should be placed into the 'struct rpc_rqst' rq_rcv_buf, which is then in turn
copied to the 'struct rpc_rqst' rq_private_buf.

[nfs41: Preallocate rpc_rqst receive buffer for handling callbacks]
Signed-off-by: Ricardo Labiaga
Signed-off-by: Benny Halevy
[Update copyright notice and explain page allocation]
Signed-off-by: Ricardo Labiaga
Signed-off-by: Benny Halevy

Ricardo Labiaga
2009-06-18 04:06:14 +0800
56632b5bf nfs41: client callback structures ... Browse Code »

Adds new list of rpc_xprt structures, and a readers/writers lock to
protect the list. The list is used to preallocate resources for
the backchannel during backchannel requests. Callbacks are not
expected to cause significant latency, so only one callback will
be allowed at this time.

It also adds a pointer to the NFS callback service so that
requests can be directed to it for processing.

New callback members added to svc_serv. The NFSv4.1 callback service will
sleep on the svc_serv->svc_cb_waitq until new callback requests arrive.
The request will be queued in svc_serv->svc_cb_list. This patch adds this
list, the sleep queue and spinlock to svc_serv.

[nfs41: NFSv4.1 callback support]
Signed-off-by: Ricardo Labiaga
Signed-off-by: Benny Halevy

Ricardo Labiaga
2009-06-18 04:06:13 +0800

03 May, 2009

1 commit

f75e6745a SUNRPC: Fix the problem of EADDRNOTAVAIL syslog floods on reconnect ... Browse Code »

See http://bugzilla.kernel.org/show_bug.cgi?id=13034

If the port gets into a TIME_WAIT state, then we cannot reconnect without
binding to a new port.

Tested-by: Petr Vandrovec
Tested-by: Jean Delvare
Signed-off-by: Trond Myklebust
Signed-off-by: Linus Torvalds

Trond Myklebust
2009-05-03 07:35:08 +0800

20 Mar, 2009

1 commit

7d1e8255c SUNRPC: Add the equivalent of the linger and linger2 timeouts to RPC sockets ... Browse Code »

This fixes a regression against FreeBSD servers as reported by Tomas
Kasparek. Apparently when using RPC over a TCP socket, the FreeBSD servers
don't ever react to the client closing the socket, and so commit
e06799f958bf7f9f8fae15f0c6f519953fb0257c (SUNRPC: Use shutdown() instead of
close() when disconnecting a TCP socket) causes the setup to hang forever
whenever the client attempts to close and then reconnect.

We break the deadlock by adding a 'linger2' style timeout to the socket,
after which, the client will abort the connection using a TCP 'RST'.

The default timeout is set to 15 seconds. A subsequent patch will put it
under user control by means of a systctl.

Signed-off-by: Trond Myklebust

Trond Myklebust
2009-03-20 03:17:34 +0800

12 Mar, 2009

1 commit

441e3e242 SUNRPC: dynamically load RPC transport modules on-demand ... Browse Code »

Provide an api to attempt to load any necessary kernel RPC
client transport module automatically. By convention, the
desired module name is "xprt"+"transport name". For example,
when NFS mounting with "-o proto=rdma", attempt to load the
"xprtrdma" module.

Signed-off-by: Tom Talpey
Cc: Chuck Lever
Signed-off-by: Trond Myklebust

Tom Talpey
2009-03-12 02:37:56 +0800

24 Dec, 2008

1 commit

c977a2ef4 sunrpc: get rid of rpc_rqst.rq_bufsize ... Browse Code »

rq_bufsize is not used.

Signed-off-by: Mike Sager
Signed-off-by: Benny Halevy
Signed-off-by: Trond Myklebust

Benny Halevy
2008-12-24 05:06:13 +0800

20 Apr, 2008

2 commits

7c1d71cf5 SUNRPC: Don't disconnect more than once if retransmitting NFSv4 requests ... Browse Code »

NFSv4 requires us to ensure that we break the TCP connection before we're
allowed to retransmit a request. However in the case where we're
retransmitting several requests that have been sent on the same
connection, we need to ensure that we don't interfere with the attempt to
reconnect and/or break the connection again once it has been established.

We therefore introduce a 'connection' cookie that is bumped every time a
connection is broken. This allows requests to track if they need to force a
disconnection.

Signed-off-by: Trond Myklebust

Trond Myklebust
2008-04-20 04:55:12 +0800
b6ddf64ff SUNRPC: Fix up xprt_write_space() ... Browse Code »

The rest of the networking layer uses SOCK_ASYNC_NOSPACE to signal whether
or not we have someone waiting for buffer memory. Convert the SUNRPC layer
to use the same idiom.
Remove the unlikely()s in xs_udp_write_space and xs_tcp_write_space. In
fact, the most common case will be that there is nobody waiting for buffer
space.

SOCK_NOSPACE is there to tell the TCP layer whether or not the cwnd was
limited by the application window. Ensure that we follow the same idiom as
the rest of the networking layer here too.

Finally, ensure that we clear SOCK_ASYNC_NOSPACE once we wake up, so that
write_space() doesn't keep waking things up on xprt->pending.

Signed-off-by: Trond Myklebust

Trond Myklebust
2008-04-20 04:52:44 +0800

30 Jan, 2008

6 commits

b454ae906 SUNRPC: fewer conditionals in the format_ip_address routines ... Browse Code »

Clean up: have the set up routines explicitly pass the strings to be used
for the transport name and NETID. This removes a number of conditionals
and dependencies on rpc_xprt.prot, which is overloaded.

Tighten up type checking on the address_strings array while we're at it.

Signed-off-by: Chuck Lever
Signed-off-by: Trond Myklebust

Chuck Lever
2008-01-30 15:06:04 +0800
ba7392bb3 SUNRPC: Add support for per-client timeout values ... Browse Code »

In order to be able to support setting the timeo and retrans parameters on
a per-mountpoint basis, we move the rpc_timeout structure into the
rpc_clnt.

Signed-off-by: Trond Myklebust

Trond Myklebust
2008-01-30 15:05:59 +0800
2881ae74e SUNRPC: Clean up the transport timeout initialisation ... Browse Code »

Signed-off-by: Trond Myklebust

Trond Myklebust
2008-01-30 15:05:58 +0800
62da3b248 SUNRPC: Rename xprt_disconnect() ... Browse Code »

xprt_disconnect() should really only be called when the transport shutdown
is completed, and it is time to wake up any pending tasks. Rename it to
xprt_disconnect_done() in order to reflect the semantical change.

Signed-off-by: Trond Myklebust

Trond Myklebust
2008-01-30 15:05:27 +0800
3b948ae5b SUNRPC: Allow the client to detect if the TCP connection is closed ... Browse Code »

Add an xprt->state bit to enable the TCP ->state_change() method to signal
whether or not the TCP connection is in the process of closing down.
This will to be used by the reconnection logic in a separate patch.

Signed-off-by: Trond Myklebust

Trond Myklebust
2008-01-30 15:05:25 +0800
66af1e558 SUNRPC: Fix a race in xs_tcp_state_change() ... Browse Code »

When scheduling the autoclose RPC call, we want to ensure that we don't
race against the test_bit() call in xprt_clear_locked().

Signed-off-by: Trond Myklebust

Trond Myklebust
2008-01-30 15:05:24 +0800

10 Oct, 2007

3 commits

4fa016eb2 NFS/SUNRPC: support transport protocol naming ... Browse Code »

To prepare for including non-sockets-based RPC transports, select
RPC transports by an identifier (to be used in following patches).

Signed-off-by: Tom Talpey
Signed-off-by: Trond Myklebust

\"Talpey, Thomas\
2007-10-10 05:17:50 +0800
49c36fcc4 SUNRPC: rearrange RPC sockets definitions ... Browse Code »

To prepare for including non-sockets-based RPC transports, move the
sockets-dependent definitions into their own file.

Signed-off-by: Tom Talpey
Signed-off-by: Trond Myklebust

\"Talpey, Thomas\
2007-10-10 05:17:48 +0800
3c341b0b9 SUNRPC: rename the rpc_xprtsock_create structure ... Browse Code »

To prepare for including non-sockets-based RPC transports, change the
overly suggestive name of the transport creation arguments struct.

Signed-off-by: Tom Talpey
Signed-off-by: Trond Myklebust

\"Talpey, Thomas\
2007-10-10 05:17:45 +0800