Eric Lee / smarc-fsl-linux-kernel

23 Nov, 2011

1 commit

24ca9a847 SUNRPC: Ensure we return EAGAIN in xs_nospace if congestion is cleared ... Browse Code »
1

By returning '0' instead of 'EAGAIN' when the tests in xs_nospace() fail
to find evidence of socket congestion, we are making the RPC engine believe
that the message was incorrectly sent and so it disconnects the socket
instead of just retrying.

The bug appears to have been introduced by commit
5e3771ce2d6a69e10fcc870cdf226d121d868491 (SUNRPC: Ensure that xs_nospace
return values are propagated).

Reported-by: Andrew Cooper
Signed-off-by: Trond Myklebust
Cc: stable@vger.kernel.org [>= 2.6.30]
Tested-by: Andrew Cooper

Trond Myklebust
2011-11-23 05:55:27 +0800

11 Nov, 2011

1 commit

2aa13531b SUNRPC: destroy freshly allocated transport in case of sockaddr init error ... Browse Code »

Otherwise we will leak xprt structure and struct net reference.

Signed-off-by: Stanislav Kinsbursky
Signed-off-by: Trond Myklebust

Stanislav Kinsbursky
2011-11-11 03:50:07 +0800

18 Jul, 2011

2 commits

d9ba131d8 SUNRPC: Support dynamic slot allocation for TCP connections ... Browse Code »
43

Allow the number of available slots to grow with the TCP window size.

Signed-off-by: Trond Myklebust

Trond Myklebust
2011-07-18 06:11:30 +0800
43cedbf0e SUNRPC: Ensure that we grab the XPRT_LOCK before calling xprt_alloc_slot ... Browse Code »
43

This throttles the allocation of new slots when the socket is busy
reconnecting and/or is out of buffer space.

Signed-off-by: Trond Myklebust

Trond Myklebust
2011-07-18 04:01:03 +0800

15 Jul, 2011

1 commit

9e00abc3c SUNRPC: sunrpc should not explicitly depend on NFS config options ... Browse Code »

Change explicit references to CONFIG_NFS_V4_1 to implicit ones
Get rid of the unnecessary defines in backchannel_rqst.c and
bc_svc.c: the Makefile takes care of those dependency.

Signed-off-by: Trond Myklebust

Trond Myklebust
2011-07-15 21:12:23 +0800

28 May, 2011

3 commits

176e21ee2 SUNRPC: Support for RPC over AF_LOCAL transports ... Browse Code »

TI-RPC introduces the capability of performing RPC over AF_LOCAL
sockets. It uses this mainly for registering and unregistering
local RPC services securely with the local rpcbind, but we could
also conceivably use it as a generic upcall mechanism.

This patch provides a client-side only implementation for the moment.
We might also consider a server-side implementation to provide
AF_LOCAL access to NLM (for statd downcalls, and such like).

Autobinding is not supported on kernel AF_LOCAL transports at this
time. Kernel ULPs must specify the pathname of the remote endpoint
when an AF_LOCAL transport is created. rpcbind supports registering
services available via AF_LOCAL, so the kernel could handle it with
some adjustment to ->rpcbind and ->set_port. But we don't need this
feature for doing upcalls via well-known named sockets.

This has not been tested with ULPs that move a substantial amount of
data. Thus, I can't attest to how robust the write_space and
congestion management logic is.

Signed-off-by: Chuck Lever
Signed-off-by: Trond Myklebust

Chuck Lever
2011-05-28 05:42:47 +0800
61677eeec SUNRPC: Rename xs_encode_tcp_fragment_header() ... Browse Code »

Clean up: Use a more generic name for xs_encode_tcp_fragment_header();
it's appropriate to use for all stream transport types. We're about
to add new stream transport.

Also, move it to a place where it is more easily shared amongst the
various send_request methods. And finally, replace the "htonl" macro
invocation with its modern equivalent.

Signed-off-by: Chuck Lever
Signed-off-by: Trond Myklebust

Chuck Lever
2011-05-28 05:42:47 +0800
fe19a96b1 SUNRPC: Deal with the lack of a SYN_SENT sk->sk_state_change callback... ... Browse Code »

The TCP connection state code depends on the state_change() callback
being called when the SYN_SENT state is set. However the networking layer
doesn't actually call us back in that case.

Signed-off-by: Trond Myklebust
Cc: stable@kernel.org

Trond Myklebust
2011-05-28 05:42:00 +0800

31 Mar, 2011

1 commit

25985edce Fix common misspellings ... Browse Code »

Fixes generated by 'codespell' and manually reviewed.

Signed-off-by: Lucas De Marchi

Lucas De Marchi
2011-03-31 22:26:23 +0800

23 Mar, 2011

1 commit

246408dcd SUNRPC: Never reuse the socket port after an xs_close() ... Browse Code »

If we call xs_close(), we're in one of two situations:
- Autoclose, which means we don't expect to resend a request
- bind+connect failed, which probably means the port is in use

Signed-off-by: Trond Myklebust
Cc: stable@kernel.org

Trond Myklebust
2011-03-23 06:42:33 +0800

11 Mar, 2011

1 commit

4cea288aa sunrpc: Propagate errors from xs_bind() through xs_create_sock() ... Browse Code »

xs_create_sock() is supposed to return a pointer or an ERR_PTR-encoded
error, but it currently returns 0 if xs_bind() fails.

Signed-off-by: Ben Hutchings
Cc: stable@kernel.org [v2.6.37]
Signed-off-by: Trond Myklebust

Ben Hutchings
2011-03-11 04:04:58 +0800

15 Jan, 2011

1 commit

18bce371a Merge branch 'for-2.6.38' of git://linux-nfs.org/~bfields/linux ... Browse Code »

* 'for-2.6.38' of git://linux-nfs.org/~bfields/linux: (62 commits)
nfsd4: fix callback restarting
nfsd: break lease on unlink, link, and rename
nfsd4: break lease on nfsd setattr
nfsd: don't support msnfs export option
nfsd4: initialize cb_per_client
nfsd4: allow restarting callbacks
nfsd4: simplify nfsd4_cb_prepare
nfsd4: give out delegations more quickly in 4.1 case
nfsd4: add helper function to run callbacks
nfsd4: make sure sequence flags are set after destroy_session
nfsd4: re-probe callback on connection loss
nfsd4: set sequence flag when backchannel is down
nfsd4: keep finer-grained callback status
rpc: allow xprt_class->setup to return a preexisting xprt
rpc: keep backchannel xprt as long as server connection
rpc: move sk_bc_xprt to svc_xprt
nfsd4: allow backchannel recovery
nfsd4: support BIND_CONN_TO_SESSION
nfsd4: modify session list under cl_lock
Documentation: fl_mylease no longer exists
...

Fix up conflicts in fs/nfsd/vfs.c with the vfs-scale work. The
vfs-scale work touched some msnfs cases, and this merge removes support
for that entirely, so the conflict was trivial to resolve.

Linus Torvalds
2011-01-15 05:17:26 +0800

12 Jan, 2011

3 commits

f0418aa4b rpc: allow xprt_class->setup to return a preexisting xprt ... Browse Code »

This allows us to reuse the xprt associated with a server connection if
one has already been set up.

Signed-off-by: J. Bruce Fields

J. Bruce Fields
2011-01-12 04:04:10 +0800
99de8ea96 rpc: keep backchannel xprt as long as server connection ... Browse Code »

Multiple backchannels can share the same tcp connection; from rfc 5661 section
2.10.3.1:

A connection's association with a session is not exclusive. A
connection associated with the channel(s) of one session may be
simultaneously associated with the channel(s) of other sessions
including sessions associated with other client IDs.

However, multiple backchannels share a connection, they must all share
the same xid stream (hence the same rpc_xprt); the only way we have to
match replies with calls at the rpc layer is using the xid.

So, keep the rpc_xprt around as long as the connection lasts, in case
we're asked to use the connection as a backchannel again.

Requests to create new backchannel clients over a given server
connection should results in creating new clients that reuse the
existing rpc_xprt.

But to start, just reject attempts to associate multiple rpc_xprt's with
the same underlying bc_xprt.

Signed-off-by: J. Bruce Fields

J. Bruce Fields
2011-01-12 04:04:10 +0800
d75faea33 rpc: move sk_bc_xprt to svc_xprt ... Browse Code »

This seems obviously transport-level information even if it's currently
used only by the server socket code.

Signed-off-by: J. Bruce Fields

J. Bruce Fields
2011-01-12 04:04:10 +0800

15 Dec, 2010

1 commit

afe2c511f workqueue: convert cancel_rearming_delayed_work[queue]() users to cancel_delayed_work_sync() ... Browse Code »

cancel_rearming_delayed_work[queue]() has been superceded by
cancel_delayed_work_sync() quite some time ago. Convert all the
in-kernel users. The conversions are completely equivalent and
trivial.

Signed-off-by: Tejun Heo
Acked-by: "David S. Miller"
Acked-by: Greg Kroah-Hartman
Acked-by: Evgeniy Polyakov
Cc: Jeff Garzik
Cc: Benjamin Herrenschmidt
Cc: Mauro Carvalho Chehab
Cc: netdev@vger.kernel.org
Cc: Anton Vorontsov
Cc: David Woodhouse
Cc: "J. Bruce Fields"
Cc: Neil Brown
Cc: Alex Elder
Cc: xfs-masters@oss.sgi.com
Cc: Christoph Lameter
Cc: Pekka Enberg
Cc: Andrew Morton
Cc: netfilter-devel@vger.kernel.org
Cc: Trond Myklebust
Cc: linux-nfs@vger.kernel.org

Tejun Heo
2010-12-15 17:56:11 +0800

27 Oct, 2010

1 commit

4390110fe Merge branch 'for-2.6.37' of git://linux-nfs.org/~bfields/linux ... Browse Code »

* 'for-2.6.37' of git://linux-nfs.org/~bfields/linux: (99 commits)
svcrpc: svc_tcp_sendto XPT_DEAD check is redundant
svcrpc: no need for XPT_DEAD check in svc_xprt_enqueue
svcrpc: assume svc_delete_xprt() called only once
svcrpc: never clear XPT_BUSY on dead xprt
nfsd4: fix connection allocation in sequence()
nfsd4: only require krb5 principal for NFSv4.0 callbacks
nfsd4: move minorversion to client
nfsd4: delay session removal till free_client
nfsd4: separate callback change and callback probe
nfsd4: callback program number is per-session
nfsd4: track backchannel connections
nfsd4: confirm only on succesful create_session
nfsd4: make backchannel sequence number per-session
nfsd4: use client pointer to backchannel session
nfsd4: move callback setup into session init code
nfsd4: don't cache seq_misordered replies
SUNRPC: Properly initialize sock_xprt.srcaddr in all cases
SUNRPC: Use conventional switch statement when reclassifying sockets
sunrpc/xprtrdma: clean up workqueue usage
sunrpc: Turn list_for_each-s into the ..._entry-s
...

Fix up trivial conflicts (two different deprecation notices added in
separate branches) in Documentation/feature-removal-schedule.txt

Linus Torvalds
2010-10-27 00:55:25 +0800

21 Oct, 2010

2 commits

924768508 SUNRPC: Properly initialize sock_xprt.srcaddr in all cases ... Browse Code »

The source address field in the transport's sock_xprt is initialized
ONLY IF the RPC application passed a pointer to a source address
during the call to rpc_create(). However, xs_bind() subsequently uses
the value of this field without regard to whether the source address
was initialized during transport creation or not.

So far we've been lucky: the uninitialized value of this field is
zeroes. xs_bind(), until recently, used only the sin[6]_addr field in
this sockaddr, and all zeroes is a valid value for this: it means
ANYADDR. This is a happy coincidence.

However, xs_bind() now wants to use the sa_family field as well, and
expects it to be initialized to something other than zero.

Therefore, the source address sockaddr field should be fully
initialized at transport create time in _every_ case, not just when
the RPC application wants to use a specific bind address.

Bruce added a workaround for this missing initialization by adjusting
commit 6bc9638a, but the "right" way to do this is to ensure that the
source address sockaddr is always correctly initialized from the
get-go.

This patch doesn't introduce a behavior change. It's simply a
clean-up of Bruce's fix, to prevent future problems of this kind. It
may look like overkill, but

a) it clearly documents the default initial value of this field,

b) it doesn't assume that the sockaddr_storage memory is first
initialized to any particular value, and

c) it will fail verbosely if some unknown address family is passed
in

Originally introduced by commit d3bc9a1d.

Signed-off-by: Chuck Lever
Signed-off-by: J. Bruce Fields

Chuck Lever
2010-10-21 22:11:47 +0800
4232e8634 SUNRPC: Use conventional switch statement when reclassifying sockets ... Browse Code »

Clean up.

Defensive coding: If "family" is ever something that is neither
AF_INET nor AF_INET6, xs_reclassify_socket6() is not the appropriate
default action. Choose to do nothing in that case.

Introduced by commit 6bc9638a.

Signed-off-by: Chuck Lever
Signed-off-by: J. Bruce Fields

Chuck Lever
2010-10-21 22:11:46 +0800

19 Oct, 2010

14 commits

50fa0d40a sunrpc: Remove dead "else" branch from bc xprt creation ... Browse Code »

Since the xprt in question is forcibly set to be bound the else
branch of this check is unneeded.

Signed-off-by: Pavel Emelyanov
Reviewed-by: Chuck Lever
Signed-off-by: J. Bruce Fields

Pavel Emelyanov
2010-10-19 22:48:16 +0800
8c14ff2aa sunrpc: Remove UDP worker wrappers ... Browse Code »

Same for UDP sockets creation paths.

Signed-off-by: Pavel Emelyanov
Reviewed-by: Chuck Lever
Signed-off-by: J. Bruce Fields

Pavel Emelyanov
2010-10-19 22:48:15 +0800
cdd518d52 sunrpc: Remove TCP worker wrappers ... Browse Code »

The v4 and the v6 wrappers only pass the respective family
to the xs_tcp_setup_socket. This family can be taken from the
xprt's sockaddr.

Signed-off-by: Pavel Emelyanov
Reviewed-by: Chuck Lever
Signed-off-by: J. Bruce Fields

Pavel Emelyanov
2010-10-19 22:48:15 +0800
7dfe1fc36 sunrpc: Pass family to setup_socket calls ... Browse Code »

Now we have a single socket creation routine and can call it
directly from the setup_socket routines.

Signed-off-by: Pavel Emelyanov
Reviewed-by: Chuck Lever
Signed-off-by: J. Bruce Fields

Pavel Emelyanov
2010-10-19 22:48:15 +0800
6bc9638ab sunrpc: Merge xs_create_sock code ... Browse Code »

After xs_bind is merged it's easy to merge its callers.

Signed-off-by: Pavel Emelyanov
Reviewed-by: Chuck Lever
[bfields@redhat.com: fix address family initialization]
Signed-off-by: J. Bruce Fields

Pavel Emelyanov
2010-10-19 22:48:15 +0800
beb59b682 sunrpc: Merge the xs_bind code ... Browse Code »

There's the only difference betseen the xs_bind4 and the
xs_bind6 - the size of sockaddr structure they use.

Fortunatelly its size can be indirectly get from the transport.

Change since v1:
* use sockaddr_storage instead of sockaddr
* use rpc_set_port instead of manual port assigning

Signed-off-by: Pavel Emelyanov
Reviewed-by: Chuck Lever
[bfields@redhat.com: fix address family initialization]
Signed-off-by: J. Bruce Fields

Pavel Emelyanov
2010-10-19 22:48:15 +0800
573018c07 sunrpc: Call xs_create_sockX directly from setup_socket ... Browse Code »

Remove now unneeded wrappers that just add type and protocol
to socket creation callback.

Signed-off-by: Pavel Emelyanov
Reviewed-by: Chuck Lever
Signed-off-by: J. Bruce Fields

Pavel Emelyanov
2010-10-19 22:48:15 +0800
22d44a7d8 sunrpc: Factor out v6 sockets creation ... Browse Code »

Same patch for v6 protocols.

Signed-off-by: Pavel Emelyanov
Reviewed-by: Chuck Lever
Signed-off-by: J. Bruce Fields

Pavel Emelyanov
2010-10-19 22:48:15 +0800
22f793268 sunrpc: Factor out v4 sockets creation ... Browse Code »

The UDPv4 and TCPv4 socket creation callbacks now look very similar.

Signed-off-by: Pavel Emelyanov
Reviewed-by: Chuck Lever
Signed-off-by: J. Bruce Fields

Pavel Emelyanov
2010-10-19 22:48:14 +0800
b65c03106 sunrpc: Factor out udp sockets creation ... Browse Code »

Make it look like the TCP sockets creation.
Unfortunately the git diff made the patch look messy :(

Signed-off-by: Pavel Emelyanov
Reviewed-by: Chuck Lever
Signed-off-by: J. Bruce Fields

Pavel Emelyanov
2010-10-19 22:48:14 +0800
58dddac9c sunrpc: Remove duplicate xprt/transport arguments from calls ... Browse Code »

The xs_tcp_reuse_connection takes the xprt only to pass it down
to the xs_abort_connection. The later one can get it from the given
transport itself.

Signed-off-by: Pavel Emelyanov
Reviewed-by: Chuck Lever
Signed-off-by: J. Bruce Fields

Pavel Emelyanov
2010-10-19 22:48:14 +0800
a9f5f0f7b sunrpc: Get xprt pointer once in xs_tcp_setup_socket ... Browse Code »

Signed-off-by: Pavel Emelyanov
Reviewed-by: Chuck Lever
Signed-off-by: J. Bruce Fields

Pavel Emelyanov
2010-10-19 22:48:14 +0800
baaf4e487 sunrpc: Remove unused sock arg from xs_next_srcport ... Browse Code »

Signed-off-by: Pavel Emelyanov
Reviewed-by: Chuck Lever
Signed-off-by: J. Bruce Fields

Pavel Emelyanov
2010-10-19 22:48:14 +0800
5d4ec9329 sunrpc: Remove unused sock arg from xs_get_srcport ... Browse Code »

Signed-off-by: Pavel Emelyanov
Reviewed-by: Chuck Lever
Signed-off-by: J. Bruce Fields

Pavel Emelyanov
2010-10-19 22:48:14 +0800

02 Oct, 2010

4 commits

14ec63c33 sunrpc: Create sockets in net namespaces ... Browse Code »

The context is already known in all the sock_create callers.

Signed-off-by: Pavel Emelyanov
Signed-off-by: J. Bruce Fields

Pavel Emelyanov
2010-10-02 05:19:00 +0800
37aa21337 sunrpc: Tag rpc_xprt with net ... Browse Code »

The net is known from the xprt_create and this tagging will also
give un the context in the conntection workers where real sockets
are created.

Signed-off-by: Pavel Emelyanov
Signed-off-by: J. Bruce Fields

Pavel Emelyanov
2010-10-02 05:18:58 +0800
e204e621b sunrpc: Factor out rpc_xprt freeing ... Browse Code »

Signed-off-by: Pavel Emelyanov
Signed-off-by: J. Bruce Fields

Pavel Emelyanov
2010-10-02 05:18:53 +0800
bd1722d43 sunrpc: Factor out rpc_xprt allocation ... Browse Code »

Signed-off-by: Pavel Emelyanov
Signed-off-by: J. Bruce Fields

Pavel Emelyanov
2010-10-02 05:18:52 +0800

25 Sep, 2010

1 commit

f064af1e5 net: fix a lockdep splat ... Browse Code »

We have for each socket :

One spinlock (sk_slock.slock)
One rwlock (sk_callback_lock)

Possible scenarios are :

(A) (this is used in net/sunrpc/xprtsock.c)
read_lock(&sk->sk_callback_lock) (without blocking BH)

spin_lock(&sk->sk_slock.slock);
...
read_lock(&sk->sk_callback_lock);
...

(B)
write_lock_bh(&sk->sk_callback_lock)
stuff
write_unlock_bh(&sk->sk_callback_lock)

(C)
spin_lock_bh(&sk->sk_slock)
...
write_lock_bh(&sk->sk_callback_lock)
stuff
write_unlock_bh(&sk->sk_callback_lock)
spin_unlock_bh(&sk->sk_slock)

This (C) case conflicts with (A) :

CPU1 [A] CPU2 [C]
read_lock(callback_lock)
spin_lock_bh(slock)

We have one problematic (C) use case in inet_csk_listen_stop() :

local_bh_disable();
bh_lock_sock(child); // spin_lock_bh(&sk->sk_slock)
WARN_ON(sock_owned_by_user(child));
...
sock_orphan(child); // write_lock_bh(&sk->sk_callback_lock)

lockdep is not happy with this, as reported by Tetsuo Handa

It seems only way to deal with this is to use read_lock_bh(callbacklock)
everywhere.

Thanks to Jarek for pointing a bug in my first attempt and suggesting
this solution.

Reported-by: Tetsuo Handa
Tested-by: Tetsuo Handa
Signed-off-by: Eric Dumazet
CC: Jarek Poplawski
Tested-by: Eric Dumazet
Signed-off-by: David S. Miller

Eric Dumazet
2010-09-25 13:26:10 +0800

19 Aug, 2010

1 commit

763008c43 Merge branch 'bugfixes' of git://git.linux-nfs.org/projects/trondmy/nfs-2.6 ... Browse Code »

* 'bugfixes' of git://git.linux-nfs.org/projects/trondmy/nfs-2.6:
NFS: Fix an Oops in the NFSv4 atomic open code
NFS: Fix the selection of security flavours in Kconfig
NFS: fix the return value of nfs_file_fsync()
rpcrdma: Fix SQ size calculation when memreg is FRMR
xprtrdma: Do not truncate iova_start values in frmr registrations.
nfs: Remove redundant NULL check upon kfree()
nfs: Add "lookupcache" to displayed mount options
NFS: allow close-to-open cache semantics to apply to root of NFS filesystem
SUNRPC: fix NFS client over TCP hangs due to packet loss (Bug 16494)

Linus Torvalds
2010-08-19 06:45:23 +0800

11 Aug, 2010

1 commit

9bbb9e5a3 param: use ops in struct kernel_param, rather than get and set fns directly ... Browse Code »

This is more kernel-ish, saves some space, and also allows us to
expand the ops without breaking all the callers who are happy for the
new members to be NULL.

The few places which defined their own param types are changed to the
new scheme (more which crept in recently fixed in following patches).

Since we're touching them anyway, we change get() and set() to take a
const struct kernel_param (which they really are). This causes some
harmless warnings until we fix them (in following patches).

To reduce churn, module_param_call creates the ops struct so the callers
don't have to change (and casts the functions to reduce warnings).
The modern version which takes an ops struct is called module_param_cb.

Signed-off-by: Rusty Russell
Reviewed-by: Takashi Iwai
Tested-by: Phil Carmody
Cc: "David S. Miller"
Cc: Ville Syrjala
Cc: Dmitry Torokhov
Cc: Alessandro Rubini
Cc: Michal Januszewski
Cc: Trond Myklebust
Cc: "J. Bruce Fields"
Cc: Neil Brown
Cc: linux-kernel@vger.kernel.org
Cc: linux-input@vger.kernel.org
Cc: linux-fbdev-devel@lists.sourceforge.net
Cc: linux-nfs@vger.kernel.org
Cc: netdev@vger.kernel.org

Rusty Russell
2010-08-11 21:34:13 +0800