Eric Lee / smarc-fsl-linux-kernel

13 Oct, 2016

2 commits

07096f612 rxrpc: Fix checking of error from ip6_route_output()

ip6_route_output() doesn't return a negative error when it fails, rather
the ->error field of the returned dst_entry struct needs to be checked.

Reported-by: Dan Carpenter
Fixes: 75b54cb57ca3 ("rxrpc: Add IPv6 support")
Signed-off-by: David Howells

David Howells
2016-10-13 15:43:17 +0800
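
As a rough userspace illustration of the 07096f612 fix above: the sketch below models a lookup that always returns an entry and reports failure through its ->error field. struct dst_entry is cut down to that one field, and model_route_output() and model_lookup() are hypothetical names, not kernel functions.

```c
#include <errno.h>

/* Hypothetical stand-in for the kernel's struct dst_entry; only the
 * field that matters here is modelled. */
struct dst_entry {
	int error;	/* 0 on success, negative errno on failure */
};

/* Hypothetical model of a lookup that, like ip6_route_output(), always
 * hands back an entry and reports failure through ->error. */
static struct dst_entry *model_route_output(int reachable)
{
	static struct dst_entry ok = { .error = 0 };
	static struct dst_entry unreachable = { .error = -ENETUNREACH };

	return reachable ? &ok : &unreachable;
}

/* Returns 0 if a route was found, or a negative errno otherwise. */
static int model_lookup(int reachable)
{
	struct dst_entry *dst = model_route_output(reachable);

	/* Testing the pointer alone would never catch a failure, because
	 * the pointer is valid in both cases. */
	if (dst->error)
		return dst->error;
	return 0;
}
```

Here model_lookup(0) yields -ENETUNREACH even though the returned pointer is perfectly valid.
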
54fde4234 rxrpc: Fix checker warning by not passing always-zero value to ERR_PTR()

Fix the following checker warning:

net/rxrpc/call_object.c:279 rxrpc_new_client_call()
warn: passing zero to 'ERR_PTR'

where a value that's always zero is passed to ERR_PTR() so that it can be
passed to a tracepoint in an auxiliary pointer field.

Just pass NULL instead to the tracepoint.

Fixes: a84a46d73050 ("rxrpc: Add some additional call tracing")
Reported-by: Dan Carpenter
Signed-off-by: David Howells

David Howells
2016-10-13 15:39:52 +0800
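
To see why the checker complains in 54fde4234 above, here is a userspace approximation of the kernel's ERR_PTR() helpers (the real ones live in include/linux/err.h; these copies are simplified and are only a sketch). Passing zero produces a plain NULL pointer, which is not an error pointer at all.

```c
#include <stddef.h>
#include <stdint.h>

#define MAX_ERRNO 4095

/* Simplified userspace copy: encode a negative errno in a pointer. */
static void *ERR_PTR(long error)
{
	return (void *)(intptr_t)error;
}

/* True only for pointers in the top MAX_ERRNO addresses. */
static int IS_ERR(const void *ptr)
{
	return (uintptr_t)ptr >= (uintptr_t)-MAX_ERRNO;
}

/* Recover the errno value from an error pointer. */
static long PTR_ERR(const void *ptr)
{
	return (long)(intptr_t)ptr;
}
```

So ERR_PTR(0) is indistinguishable from NULL, and passing NULL directly says what is meant.
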

06 Oct, 2016

12 commits

bf7d620ab rxrpc: Don't request an ACK on the last DATA packet of a call's Tx phase

Don't request an ACK on the last DATA packet of a call's Tx phase as for a
client there will be a reply packet or some sort of ACK to shift phase. If
the ACK is requested, OpenAFS sends a REQUESTED-ACK ACK with soft-ACKs in
it and doesn't follow up with a hard-ACK.

If we don't set the flag, OpenAFS will send a DELAY ACK that hard-ACKs the
reply data, thereby allowing the call to terminate cleanly.

Signed-off-by: David Howells

David Howells
2016-10-06 15:11:51 +0800
9749fd2be rxrpc: Need to produce an ACK for service op if op takes a long time

We need to generate a DELAY ACK from the service end of an operation if we
start doing the actual operation work and it takes longer than expected.
This will hard-ACK the request data and allow the client to release its
resources.

To make this work:

(1) We have to set the ack timer and propose an ACK when the call moves to
the RXRPC_CALL_SERVER_ACK_REQUEST state, and clear the pending ACK and
cancel the timer when we start transmitting the reply (the first DATA
packet of the reply implicitly ACKs the request phase).

(2) It must be possible to set the timer when the caller is holding
call->state_lock, so split the lock-getting part of the timer function
out.

(3) Add trace notes for the ACK we're requesting and the timer we clear.

Signed-off-by: David Howells

David Howells
2016-10-06 15:11:50 +0800
cf69207af rxrpc: Return negative error code to kernel service

In rxrpc_kernel_recv_data(), when we return the error number incurred by a
failed call, we must negate it before returning it as it's stored as
positive (that's what we have to pass back to userspace).

Signed-off-by: David Howells

David Howells
2016-10-06 15:11:50 +0800
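
The sign convention in cf69207af above can be sketched as follows. struct model_call and model_kernel_recv_result() are hypothetical names used only for illustration. The one assumption carried over from the commit text is that the call's error is stored as a positive errno value.

```c
#include <errno.h>

/* Hypothetical, cut-down call record: the error is stored as a positive
 * errno value because that is the form passed back to userspace. */
struct model_call {
	int error;	/* e.g. ECONNABORTED, stored positive */
};

/* In-kernel callers expect the usual negative-errno convention, so the
 * stored value is negated on the way out. */
static int model_kernel_recv_result(const struct model_call *call)
{
	return -call->error;
}
```
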
94bc669ef rxrpc: Add missing notification

The call's background processor work item needs to notify the socket when
it completes a call so that recvmsg() or the AFS fs can deal with it.
Without this, call expiry isn't handled.

Signed-off-by: David Howells

David Howells
2016-10-06 15:11:50 +0800
d7833d009 rxrpc: Queue the call on expiry

When a call expires, it must be queued for the background processor to deal
with; otherwise, a service call that is improperly terminated will just sit
there awaiting an ACK and won't expire.

Signed-off-by: David Howells

David Howells
2016-10-06 15:11:50 +0800
b3156274c rxrpc: Partially handle OpenAFS's improper termination of calls

OpenAFS doesn't always correctly terminate client calls that it makes -
this includes calls the OpenAFS servers make to the cache manager service.
It should end the client call with either:

(1) An ACK that has firstPacket set to one greater than the seq number of
the reply DATA packet with the LAST_PACKET flag set (thereby
hard-ACK'ing all packets). nAcks should be 0 and acks[] should be
empty (ie. no soft-ACKs).

(2) An ACKALL packet.

OpenAFS, though, may send an ACK packet with firstPacket set to the last
seq number or less and soft-ACKs listed for all packets up to and including
the last DATA packet.

The transmitter, however, is obliged to keep the call live and the
soft-ACK'd DATA packets around until they're hard-ACK'd as the receiver is
permitted to drop any merely soft-ACK'd packet and request retransmission
by sending an ACK packet with a NACK in it.

Further, OpenAFS will also terminate a client call by beginning the next
client call on the same connection channel. This implicitly completes the
previous call.

This patch handles implicit ACK of a call on a channel by the reception of
the first packet of the next call on that channel.

If another call doesn't come along to implicitly ACK a call, then we have
to time the call out. There are some bugs there that will be addressed in
subsequent patches.

Signed-off-by: David Howells

David Howells
2016-10-06 15:11:49 +0800
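
Termination condition (1) in b3156274c above can be written as a small predicate. This is a sketch under simplifying assumptions: struct model_ack is a hypothetical, reduced view of an ACK packet, model_ack_completes_call() is not a kernel function, and sequence-number wrap is ignored.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical, simplified view of the ACK fields that matter here. */
struct model_ack {
	uint32_t first_packet;	/* firstPacket: first seq not yet hard-ACK'd */
	uint8_t  n_acks;	/* nAcks: number of soft-ACK/NACK entries */
};

/* True if the ACK hard-ACKs everything up to and including last_seq,
 * the sequence number of the DATA packet carrying LAST_PACKET. */
static bool model_ack_completes_call(const struct model_ack *ack,
				     uint32_t last_seq)
{
	return ack->first_packet == last_seq + 1 && ack->n_acks == 0;
}
```

An ACK with firstPacket at or below the last sequence number and the remaining packets only soft-ACK'd fails this test, which is the OpenAFS behaviour the commit describes.
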
a5af7e1fc rxrpc: Fix loss of PING RESPONSE ACK production due to PING ACKs

Separate the output of PING ACKs from the output of other sorts of ACK so
that if we receive a PING ACK and schedule transmission of a PING RESPONSE
ACK, the response doesn't get cancelled by a PING ACK we happen to be
scheduling transmission of at the same time.

If a PING RESPONSE gets lost, the other side might just sit there waiting
for it and refuse to proceed otherwise.

Signed-off-by: David Howells

David Howells
2016-10-06 15:11:49 +0800
26cb02aa6 rxrpc: Fix warning by splitting rxrpc_send_call_packet()

Split rxrpc_send_call_packet() to separate ACK generation (which is more
complicated) from ABORT generation. This simplifies the code a bit and
fixes the following warning:

In file included from ../net/rxrpc/output.c:20:0:
net/rxrpc/output.c: In function 'rxrpc_send_call_packet':
net/rxrpc/ar-internal.h:1187:27: error: 'top' may be used uninitialized in this function [-Werror=maybe-uninitialized]
net/rxrpc/output.c:103:24: note: 'top' was declared here
net/rxrpc/output.c:225:25: error: 'hard_ack' may be used uninitialized in this function [-Werror=maybe-uninitialized]

Reported-by: Arnd Bergmann
Signed-off-by: David Howells

David Howells
2016-10-06 15:11:49 +0800
a9f312d98 rxrpc: Only ping for lost reply in client call

When a reply is deemed lost, we send a ping to find out whether the other
end received all the request data packets we sent. This should be limited
to client calls; we shouldn't do it on service calls.

Signed-off-by: David Howells

David Howells
2016-10-06 15:11:49 +0800
7212a57e8 rxrpc: Fix oops on incoming call to serviceless endpoint

If a call comes in to a local endpoint that isn't listening for any
incoming calls at the moment, an oops will happen. We need to check that
the local endpoint's service pointer isn't NULL before we dereference it.

Signed-off-by: David Howells

David Howells
2016-10-06 15:11:49 +0800
19c0dbd54 rxrpc: Fix duplicate const

Remove a duplicate const keyword.

Signed-off-by: David Howells

David Howells
2016-10-06 15:11:48 +0800
b63452c11 rxrpc: Accesses of rxrpc_local::service need to be RCU managed

struct rxrpc_local->service is marked __rcu - this means that accesses of
it need to be managed using RCU wrappers. There are two such places in
rxrpc_release_sock() where the value is checked and cleared. Fix this by
using the appropriate wrappers.

Signed-off-by: David Howells

David Howells
2016-10-06 15:11:48 +0800

30 Sep, 2016

12 commits

405dea1de rxrpc: Fix the call timer handling

The call timer's concept of an inactive call timeout (of which there are
three) is one that has the same expiration time as the call expiration
timeout (the expiration timer is never inactive). However, I'm not
resetting the timeouts when they expire, leading to repeated processing of
expired timeouts when other timeout events occur.

Fix this by:

(1) Move the timer expiry detection into rxrpc_set_timer() inside the
locked section. This means that if a timeout is set that will expire
immediately, we deal with it immediately.

(2) If a timeout is at or before now then it has expired. When an expiry
is detected, an event is raised, the timeout is automatically
inactivated and the event processor is queued.

(3) If a timeout is at or after the expiry timeout then it is inactive.
Inactive timeouts do not contribute to the timer setting.

(4) The call timer callback can now just call rxrpc_set_timer() to handle
things.

(5) The call processor work function now checks the event flags rather
than checking the timeouts directly.

Signed-off-by: David Howells

David Howells
2016-09-30 21:40:11 +0800
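
Rules (2) and (3) of 405dea1de above can be sketched in userspace C as below. This is only a model under assumptions: the three timeouts are taken to be ACK, resend and expiry, times are plain 64-bit counters, and every name (model_timed_call, model_set_timer(), the MODEL_EV_* flags) is hypothetical. Locking and queueing of the event processor are left out.

```c
#include <stdint.h>

enum {
	MODEL_EV_ACK	 = 1 << 0,
	MODEL_EV_RESEND	 = 1 << 1,
	MODEL_EV_EXPIRED = 1 << 2,
};

struct model_timed_call {
	uint64_t ack_at;
	uint64_t resend_at;
	uint64_t expire_at;	/* never inactive */
	unsigned int events;	/* raised MODEL_EV_* flags */
};

/* Apply the rules to one timeout and fold it into the next timer value. */
static void model_check_timeout(struct model_timed_call *call, uint64_t *at,
				unsigned int event, uint64_t now,
				uint64_t *next)
{
	if (*at >= call->expire_at)
		return;			/* rule (3): inactive */
	if (*at <= now) {
		call->events |= event;	/* rule (2): expired, raise event */
		*at = call->expire_at;	/* ... and inactivate the timeout */
	} else if (*at < *next) {
		*next = *at;		/* still pending: may set the timer */
	}
}

/* Returns the time at which the timer should next fire. */
static uint64_t model_set_timer(struct model_timed_call *call, uint64_t now)
{
	uint64_t next = call->expire_at;

	model_check_timeout(call, &call->ack_at, MODEL_EV_ACK, now, &next);
	model_check_timeout(call, &call->resend_at, MODEL_EV_RESEND, now, &next);
	if (call->expire_at <= now)
		call->events |= MODEL_EV_EXPIRED;
	return next;
}
```

Because an expired timeout is moved to the expiry time as soon as its event is raised, a second call to model_set_timer() does not raise the same event again, which is the repeated processing the commit sets out to stop.
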
df0adc788 rxrpc: Keep the call timeouts as ktimes rather than jiffies

Keep the call timeouts as ktimes rather than jiffies so that they can be
expressed as functions of RTT.

Signed-off-by: David Howells

David Howells
2016-09-30 21:40:11 +0800
c31410ea0 rxrpc: Remove error from struct rxrpc_skb_priv as it is unused

Remove error from struct rxrpc_skb_priv as it is no longer used.

Signed-off-by: David Howells

David Howells
2016-09-30 21:39:32 +0800
775e5b71d rxrpc: The offset field in struct rxrpc_skb_priv is unnecessary

The offset field in struct rxrpc_skb_priv is unnecessary as the value can
always be calculated.

Signed-off-by: David Howells

David Howells
2016-09-30 21:39:28 +0800
085111509 rxrpc: Reduce ssthresh to peer's receive window

When we receive an ACK from the peer that tells us what the peer's receive
window (rwind) is, we should reduce ssthresh to rwind if rwind is smaller
than ssthresh.

Signed-off-by: David Howells

David Howells
2016-09-30 21:38:59 +0800
8782def20 rxrpc: Switch to Congestion Avoidance mode at cwnd==ssthresh

Switch to Congestion Avoidance mode at cwnd == ssthresh rather than relying
on cwnd getting incremented beyond ssthresh and the window size, the mode
being shifted and then cwnd being corrected.

We need to make sure we switch into CA mode so that we stop marking every
packet for ACK.

Signed-off-by: David Howells

David Howells
2016-09-30 21:38:56 +0800
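
A packet-counted sketch of 8782def20 and 085111509 above: ssthresh is clamped to the peer's receive window, and the mode flips as soon as cwnd reaches ssthresh. The struct, the function names and the one-packet-per-ACK growth are illustrative assumptions, not the rxrpc implementation.

```c
enum model_cong_mode { MODEL_SLOW_START, MODEL_CONGEST_AVOIDANCE };

/* Hypothetical congestion state, counted in packets rather than bytes. */
struct model_cong {
	enum model_cong_mode mode;
	unsigned int cwnd;
	unsigned int ssthresh;
};

/* Reduce ssthresh to the peer's receive window if that is smaller. */
static void model_note_rwind(struct model_cong *c, unsigned int rwind)
{
	if (rwind < c->ssthresh)
		c->ssthresh = rwind;
}

/* Account for one newly ACK'd packet while in slow start. */
static void model_packet_acked(struct model_cong *c)
{
	if (c->mode != MODEL_SLOW_START)
		return;
	if (c->cwnd < c->ssthresh)
		c->cwnd++;
	/* Switch once cwnd reaches ssthresh rather than waiting for cwnd
	 * to overshoot; >= also covers ssthresh having been clamped below
	 * cwnd by model_note_rwind(). */
	if (c->cwnd >= c->ssthresh)
		c->mode = MODEL_CONGEST_AVOIDANCE;
}
```
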
ed1e8679d rxrpc: Note serial number being ACK'd in the congestion management trace

Note the serial number of the packet being ACK'd in the congestion
management trace rather than the serial number of the ACK packet. Whilst
the serial number of the ACK packet is useful for matching ACK packets in
the output of wireshark, the serial number that the ACK is in response to
is of more use in working out how different trace lines relate.

Signed-off-by: David Howells

David Howells
2016-09-30 05:57:47 +0800
b112a6708 rxrpc: Request more ACKs in slow-start mode

Set the request-ACK on more DATA packets whilst we're in slow start mode so
that we get sufficient ACKs back to supply information to configure the
window.

Signed-off-by: David Howells

David Howells
2016-09-30 05:57:47 +0800
1e9e5c952 rxrpc: Reduce the rxrpc_local::services list to a pointer

Reduce the rxrpc_local::services list to just a pointer as we don't permit
multiple service endpoints to bind to a single transport endpoint (this is
excluded by rxrpc_lookup_local()).

The reason we don't allow this is that if you send a request to an AFS
filesystem service, it will try to talk back to your cache manager on the
port you sent from (this is how file change notifications are handled). To
prevent someone from stealing your CM callbacks, we don't let AF_RXRPC
sockets share a UDP socket if at least one of them has a service bound.

Signed-off-by: David Howells

David Howells
2016-09-30 05:57:47 +0800
2629c7fa7 rxrpc: When activating client conn channels, do state check inside lock

In rxrpc_activate_channels(), the connection cache state is checked outside
of the lock, which means it can change whilst we're waking calls up,
thereby changing whether or not we're allowed to wake calls up.

Fix this by moving the check inside the locked region. The check to see if
all the channels are currently busy can stay outside of the locked region.

Whilst we're at it:

(1) Split the locked section out into its own function so that we can call
it from other places in a later patch.

(2) Determine the mask of channels dependent on the state as we're going
to add another state in a later patch that will restrict the number of
simultaneous calls to 1 on a connection.

Signed-off-by: David Howells

David Howells
2016-09-30 05:57:47 +0800
a1767077b rxrpc: Make Tx loss-injection go through normal return and adjust tracing

In rxrpc_send_data_packet() make the loss-injection path return through the
same code as the transmission path so that the RTT determination is
initiated and any future timer shuffling will be done, despite the packet
having been binned.

Whilst we're at it:

(1) Add to the tx_data tracepoint an indication of whether or not we're
retransmitting a data packet.

(2) When we're deciding whether or not to request an ACK, rather than
checking if we're in fast-retransmit mode check instead if we're
retransmitting.

(3) Don't invoke the lose_skb tracepoint when losing a Tx packet as we're
not altering the sk_buff refcount nor are we just seeing it after
getting it off the Tx list.

(4) The rxrpc_skb_tx_lost note is then no longer used so remove it.

(5) rxrpc_lose_skb() no longer needs to deal with rxrpc_skb_tx_lost.

Signed-off-by: David Howells

David Howells
2016-09-30 05:37:15 +0800
8732db67c rxrpc: Fix exclusive client connections

Exclusive connections are currently reusable (which they shouldn't be)
because rxrpc_alloc_client_connection() checks the exclusive flag in the
rxrpc_connection struct before it's initialised from the function
parameters. This means that the DONT_REUSE flag doesn't get set.

Fix this by checking the function parameters for the exclusive flag.

Signed-off-by: David Howells

David Howells
2016-09-30 05:37:15 +0800
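
The ordering bug fixed by 8732db67c above can be reproduced in miniature. The structs and function names below are hypothetical stand-ins, and the sketch assumes the connection record starts out zeroed, so the flag read from it before initialisation is always false.

```c
#include <stdbool.h>
#include <string.h>

/* Hypothetical, cut-down connection parameters and connection record. */
struct model_conn_params {
	bool exclusive;
};

struct model_conn {
	struct model_conn_params params;
	bool dont_reuse;
};

/* Buggy ordering: reads conn->params.exclusive before the parameters
 * have been copied in, so the test never fires. */
static void model_alloc_buggy(struct model_conn *conn,
			      const struct model_conn_params *cp)
{
	memset(conn, 0, sizeof(*conn));
	if (conn->params.exclusive)
		conn->dont_reuse = true;
	conn->params = *cp;
}

/* Fixed: test the caller's parameters directly. */
static void model_alloc_fixed(struct model_conn *conn,
			      const struct model_conn_params *cp)
{
	memset(conn, 0, sizeof(*conn));
	if (cp->exclusive)
		conn->dont_reuse = true;
	conn->params = *cp;
}
```
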

25 Sep, 2016

8 commits

57494343c rxrpc: Implement slow-start

Implement RxRPC slow-start, which is similar to RFC 5681 for TCP. A
tracepoint is added to log the state of the congestion management algorithm
and the decisions it makes.

Notes:

(1) Since we send fixed-size DATA packets (apart from the final packet in
each phase), counters and calculations are in terms of packets rather
than bytes.

(2) The ACK packet carries the equivalent of TCP SACK.

(3) The FLIGHT_SIZE calculation in RFC 5681 doesn't seem particularly
suited to SACK of a small number of packets. It seems that, almost
inevitably, by the time three 'duplicate' ACKs have been seen, we have
narrowed the loss down to one or two missing packets, and the
FLIGHT_SIZE calculation ends up as 2.

(4) In rxrpc_resend(), if there was no data that apparently needed
retransmission, we transmit a PING ACK to ask the peer to tell us what
its Rx window state is.

Signed-off-by: David Howells

David Howells
2016-09-25 06:49:46 +0800
0d967960d rxrpc: Schedule an ACK if the reply to a client call appears overdue

If we've sent all the request data in a client call but haven't seen any
sign of the reply data yet, schedule an ACK to be sent to the server to
find out if the reply data got lost.

If the server hasn't yet hard-ACK'd the request data, we send a PING ACK to
demand a response to find out whether we need to retransmit.

If the server says it has received all of the data, we send an IDLE ACK to
tell the server that we haven't received anything in the receive phase as
yet.

To make this work, a non-immediate PING ACK must carry a delay. I've chosen
the same as the IDLE ACK for the moment.

Signed-off-by: David Howells

David Howells
2016-09-25 06:49:46 +0800
31a1b9895 rxrpc: Generate a summary of the ACK state for later use

Generate a summary of the Tx buffer packet state when an ACK is received
for use in a later patch that does congestion management.

Signed-off-by: David Howells

David Howells
2016-09-25 06:49:46 +0800
df0562a72 rxrpc: Delay the resend timer to allow for nsec->jiffies conv error

When determining the resend timer value, we have a value in nsec but the
timer is in jiffies which may be a million or more times more coarse.
nsecs_to_jiffies() rounds down - which means that the resend timeout
expressed as jiffies is very likely earlier than the one expressed as
nanoseconds from which it was derived.

The problem is that rxrpc_resend() gets triggered by the timer, but can't
then find anything to resend yet. It sets the timer again - but gets
kicked off immediately again and again until the nanosecond-based expiry
time is reached and we actually retransmit.

Fix this by adding 1 to the jiffies-based resend_at value to counteract the
rounding and make sure that the timer happens after the nanosecond-based
expiry is passed.

Alternatives would be to adjust the timestamp on the packets to align
with the jiffy scale or to switch back to using jiffies-based timestamps.

Signed-off-by: David Howells

David Howells
2016-09-25 06:49:46 +0800
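
The rounding problem in df0562a72 above is easy to see with numbers. The sketch below assumes a tick rate of 250 Hz (MODEL_HZ is an assumption, not taken from the commit) and uses a plain integer division to stand in for the round-down behaviour the commit attributes to nsecs_to_jiffies(). All model_* names are hypothetical.

```c
#include <stdint.h>

#define MODEL_HZ		250ULL	/* assumed tick rate */
#define MODEL_NSEC_PER_JIFFY	(1000000000ULL / MODEL_HZ)

/* Rounds down, as the commit describes nsecs_to_jiffies() doing. */
static uint64_t model_nsecs_to_jiffies(uint64_t ns)
{
	return ns / MODEL_NSEC_PER_JIFFY;
}

/* Jiffies-based resend time with one extra jiffy added to counteract
 * the rounding of the nanosecond delay. */
static uint64_t model_resend_at(uint64_t now_jiffies, uint64_t delay_ns)
{
	return now_jiffies + model_nsecs_to_jiffies(delay_ns) + 1;
}
```

With these assumptions a delay of 9,999,999 ns converts to 2 jiffies (8 ms), which is earlier than the nanosecond deadline. Adding one makes it 3 jiffies (12 ms), which is past it.
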
dd7c1ee59 rxrpc: Reinitialise the call ACK and timer state for client reply phase

Clear the ACK reason, ACK timer and resend timer when entering the client
reply phase when the first DATA packet is received. New ACKs will be
proposed once the data is queued.

The resend timer is no longer relevant and we need to cancel ACKs scheduled
to probe for a lost reply.

Signed-off-by: David Howells

David Howells
2016-09-25 06:49:46 +0800
b69d94d79 rxrpc: Include the last reply DATA serial number in the final ACK

In a client call, include the serial number of the last DATA packet of the
reply in the final ACK.

Signed-off-by: David Howells

David Howells
2016-09-25 06:49:46 +0800
a7056c5ba rxrpc: Send an immediate ACK if we fill in a hole

Send an immediate ACK if we fill in a hole in the buffer left by an
out-of-sequence packet. This may allow the congestion management in the peer
to avoid a retransmission if packets got reordered on the wire.

Signed-off-by: David Howells

David Howells
2016-09-25 06:49:46 +0800
805b21b92 rxrpc: Send an ACK after every few DATA packets we receive

Send an ACK if we haven't sent one for the last two packets we've received.
This keeps the other end apprised of where we've got to - which is
important if they're doing slow-start.

We do this in recvmsg so that we can dispatch a packet directly without the
need to wake up the background thread.

This should possibly be made configurable in future.

Signed-off-by: David Howells

David Howells
2016-09-25 01:05:26 +0800
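
One way to picture the policy in 805b21b92 above is a simple counter of packets consumed since the last ACK. This is an illustrative model only: struct model_rx and model_packet_consumed() are hypothetical, and the actual patch may track the state differently.

```c
#include <stdbool.h>

/* Hypothetical receive-side state. */
struct model_rx {
	unsigned int since_last_ack;	/* packets consumed without an ACK */
};

/* Called for each DATA packet consumed by recvmsg; returns true if an
 * ACK should be sent now, i.e. two packets have gone by without one. */
static bool model_packet_consumed(struct model_rx *rx)
{
	if (++rx->since_last_ack < 2)
		return false;
	rx->since_last_ack = 0;
	return true;
}
```
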

23 Sep, 2016

6 commits

c6672e3fe rxrpc: Add a tracepoint to log which packets will be retransmitted

Add a tracepoint to log in rxrpc_resend() which packets will be
retransmitted. Note that if a positive ACK comes in whilst we have dropped
the lock to retransmit another packet, the actual retransmission may not
happen, though some of the effects will (such as altering the congestion
management).

Signed-off-by: David Howells

David Howells
2016-09-23 22:49:19 +0800
9c7ad4344 rxrpc: Add tracepoint for ACK proposal

Add a tracepoint to log proposed ACKs, including whether the proposal is
used to update a pending ACK or is discarded in favour of an earlier,
higher priority ACK.

Whilst we're at it, get rid of the rxrpc_acks() function and access the
name array directly. We do, however, need to validate the ACK reason
number given to trace_rxrpc_rx_ack() to make sure we don't overrun the
array.

Signed-off-by: David Howells

David Howells
2016-09-23 22:49:19 +0800
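
The bounds check mentioned in 9c7ad4344 above might look like the following when a name array is indexed directly by a value taken from the wire. The table contents, model_ack_names and model_ack_name() are assumptions made for illustration. The point is only that out-of-range reasons are mapped to a catch-all slot before indexing.

```c
#include <stddef.h>

/* Illustrative name table indexed by ACK reason; the final slot is a
 * catch-all for out-of-range values. The strings are assumptions. */
static const char *const model_ack_names[] = {
	"---", "REQ", "DUP", "OOS", "WIN", "MEM", "PNG", "PNR", "DLY", "IDL",
	"-?-",
};

/* Index of the catch-all slot. */
#define MODEL_ACK_INVALID \
	(sizeof(model_ack_names) / sizeof(model_ack_names[0]) - 1)

/* Validate the reason number before using it as an array index. */
static const char *model_ack_name(unsigned int reason)
{
	if (reason >= MODEL_ACK_INVALID)
		reason = MODEL_ACK_INVALID;
	return model_ack_names[reason];
}
```
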
89b475abd rxrpc: Add a tracepoint to log injected Rx packet loss

Add a tracepoint to log received packets that get discarded due to Rx
packet loss.

Signed-off-by: David Howells

David Howells
2016-09-23 22:49:19 +0800
be832aecc rxrpc: Add data Tx tracepoint and adjust Tx ACK tracepoint

Add a tracepoint to log transmission of DATA packets (including loss
injection).

Adjust the ACK transmission tracepoint to include the packet serial number
and to line this up with the DATA transmission display.

Signed-off-by: David Howells

David Howells
2016-09-23 22:49:19 +0800
fc7ab6d29 rxrpc: Add a tracepoint for the call timer

Add a tracepoint to log call timer initiation, setting and expiry.

Signed-off-by: David Howells

David Howells
2016-09-23 22:49:19 +0800
b86e218e0 rxrpc: Don't call the tx_ack tracepoint if we don't generate an ACK

rxrpc_send_call_packet() is invoking the tx_ack tracepoint before it checks
whether there's an ACK to transmit (another thread may jump in and transmit
it).

Fix this by only invoking the tracepoint if we get a valid ACK to transmit.

Further, only allocate a serial number if we're going to actually transmit
something.

Signed-off-by: David Howells

David Howells
2016-09-23 22:49:19 +0800