08 Oct, 2008

1 commit


01 Oct, 2008

1 commit

  • Current TCP code relies on the local port of the listening socket
    being the same as the destination port of the incoming
    connection. Port redirection, used by many transparent proxying
    techniques, obviously breaks this, so we have to store the original
    destination port.

    This patch extends struct inet_request_sock and stores the incoming
    destination port value there. It also modifies the handshake code to
    use that value as the source port when sending reply packets.

    Signed-off-by: KOVACS Krisztian
    Signed-off-by: David S. Miller
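
A minimal sketch of the idea (plain Python, not kernel code; every name below except struct inet_request_sock is hypothetical): keep the SYN's original destination port on the request record, and use it, rather than the listener's bound port, as the source port of the SYN-ACK.

```python
# Illustrative scenario: a transparent proxy redirects port 80 to a local
# listener bound on 3128. The reply must use the original port (80), not
# the listener's port. All names here are made up, not kernel APIs.

def make_request_sock(syn):
    """Create a request record, keeping the SYN's destination port
    (analogous to the field the patch adds to struct inet_request_sock)."""
    return {
        "remote_addr": syn["saddr"],
        "remote_port": syn["sport"],
        "loc_port": syn["dport"],  # original destination port, pre-redirect
    }

def build_synack(req):
    """Build the reply; its source port is the stored original port."""
    return {
        "saddr_port": req["loc_port"],   # NOT the listener's bound port
        "daddr": req["remote_addr"],
        "dport": req["remote_port"],
    }

# A SYN originally sent to port 80, redirected to a proxy listening on 3128:
syn = {"saddr": "10.0.0.1", "sport": 33333, "dport": 80}
synack = build_synack(make_request_sock(syn))
print(synack["saddr_port"])  # 80, the source port the client expects
```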


23 Sep, 2008

2 commits


22 Sep, 2008

1 commit


21 Sep, 2008

4 commits

  • Most importantly, avoid clearing with a cumulative ACK. Not clearing
    means that we no longer need O(n^2) processing in the resolution of
    each fast recovery.

    Signed-off-by: Ilpo Järvinen
    Signed-off-by: David S. Miller

  • Both loops are quite similar, so they can be combined
    with little effort. As a result, forward_skb_hint becomes
    obsolete as well.

    Signed-off-by: Ilpo Järvinen
    Signed-off-by: David S. Miller

  • The main benefit is that we can then freely point
    retransmit_skb_hint anywhere we want, because there is no
    longer a need to track the counter changes involved. Since
    the hint is really used only as a terminator, any unnecessary
    work is at most a one-time walk, and if some retransmissions
    are needed beyond that point later on, the walk is not a
    complete waste of time anyway.

    Since retransmit_high must be kept valid, all lost
    markers must ensure that.

    Now I also have learned how those "holes" in the
    rexmittable skbs can appear, mtu probe does them. So
    I removed the misleading comment as well.

    Signed-off-by: Ilpo Järvinen
    Signed-off-by: David S. Miller

  • I.e., the difference between partial and full clearing no longer
    exists, since the SACK optimizations were dropped by the sacktag
    rewrite.

    Signed-off-by: Ilpo Järvinen
    Signed-off-by: David S. Miller


09 Sep, 2008

1 commit


04 Sep, 2008

1 commit


19 Jul, 2008

2 commits

  • This should fix the following bugs:
    * Connections with MD5 signatures produce invalid packets whenever SACK
    options are included
    * MD5 signatures are counted twice in the MSS calculations

    Behaviour changes:
    * A SYN with MD5 + SACK + TS elicits a SYNACK with MD5 + SACK

    This is because we can't fit any SACK blocks in a packet with MD5 + TS
    options. There was discussion about disabling SACK rather than TS in
    order to fit in better with old, buggy kernels, but that was deemed to
    be unnecessary.

    * SYNs with MD5 don't include a TS option

    See above.

    Additionally, it removes a bunch of duplicated logic for calculating
    options, which should help avoid this sort of issue in the future.

    Signed-off-by: Adam Langley
    Signed-off-by: David S. Miller
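
The option-space arithmetic behind the behaviour changes can be checked directly. A sketch: the 40-byte ceiling comes from TCP's 4-bit data offset, and the aligned sizes below follow the kernel's TCPOLEN_* constants (option formats from RFCs 1323, 2018 and 2385).

```python
# Why MD5 + timestamps leaves no room for SACK blocks.
MAX_TCP_OPTION_SPACE = 40
TCPOLEN_MD5SIG_ALIGNED = 20    # kind 19, length 18, padded to 20 bytes
TCPOLEN_TSTAMP_ALIGNED = 12    # kind 8, length 10, sent with two NOPs
TCPOLEN_SACK_BASE_ALIGNED = 4  # SACK option header, padded
TCPOLEN_SACK_PERBLOCK = 8      # each block: two 32-bit sequence numbers

remaining = (MAX_TCP_OPTION_SPACE
             - TCPOLEN_MD5SIG_ALIGNED
             - TCPOLEN_TSTAMP_ALIGNED)
max_sack_blocks = (remaining - TCPOLEN_SACK_BASE_ALIGNED) // TCPOLEN_SACK_PERBLOCK
print(max_sack_blocks)  # 0: no SACK blocks fit alongside MD5 + TS

# Dropping the timestamp option instead makes room for SACK blocks,
# which is why a SYN with MD5 + SACK + TS elicits MD5 + SACK:
max_blocks_without_ts = ((MAX_TCP_OPTION_SPACE - TCPOLEN_MD5SIG_ALIGNED
                          - TCPOLEN_SACK_BASE_ALIGNED)
                         // TCPOLEN_SACK_PERBLOCK)
print(max_blocks_without_ts)  # 2
```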

  • Currently, the MD5 code assumes that the SKBs are linear and, in the case
    that they aren't, happily goes off and hashes off the end of the SKB and
    into random memory.

    Reported by Stephen Hemminger in [1]. Advice thanks to Stephen and Evgeniy
    Polyakov. Also includes a couple of missed route_caps from Stephen's patch
    in [2].

    [1] http://marc.info/?l=linux-netdev&m=121445989106145&w=2
    [2] http://marc.info/?l=linux-netdev&m=121459157816964&w=2

    Signed-off-by: Adam Langley
    Acked-by: Stephen Hemminger
    Signed-off-by: David S. Miller
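
The bug class is easy to illustrate outside the kernel. A hedged sketch with hashlib, where a list of byte strings stands in for an skb's head plus paged frags:

```python
import hashlib

# Fragmented data must be fed to the hash piece by piece; treating the
# first fragment as if it were the whole (linear) buffer would read past
# its end into unrelated memory in the kernel case.
fragments = [b"TCP ", b"segment ", b"payload"]

digest = hashlib.md5()
for frag in fragments:          # walk every fragment, as the fix does
    digest.update(frag)

# Feeding fragments in order is equivalent to hashing the linearized data.
assert digest.digest() == hashlib.md5(b"".join(fragments)).digest()
```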


18 Jul, 2008

1 commit


17 Jul, 2008

8 commits


15 Jun, 2008

1 commit


14 Jun, 2008

1 commit


13 Jun, 2008

1 commit

  • This reverts two changesets, ec3c0982a2dd1e671bad8e9d26c28dcba0039d87
    ("[TCP]: TCP_DEFER_ACCEPT updates - process as established") and
    the follow-on bug fix 9ae27e0adbf471c7a6b80102e38e1d5a346b3b38
    ("tcp: Fix slab corruption with ipv6 and tcp6fuzz").

    This change causes several problems, first reported by Ingo Molnar
    as a distcc-over-loopback regression where connections were getting
    stuck.

    Ilpo Järvinen first spotted the locking problems. The new function
    added by this code, tcp_defer_accept_check(), only has the
    child socket locked, yet it is modifying state of the parent
    listening socket.

    Fixing that is non-trivial at best, because we can't simply just grab
    the parent listening socket lock at this point, because it would
    create an ABBA deadlock. The normal ordering is parent listening
    socket --> child socket, but this code path would require the
    reverse lock ordering.

    Next is a problem noticed by Vitaliy Gusev, who noted:

    ----------------------------------------
    >--- a/net/ipv4/tcp_timer.c
    >+++ b/net/ipv4/tcp_timer.c
    >@@ -481,6 +481,11 @@ static void tcp_keepalive_timer (unsigned long data)
    >                goto death;
    >        }
    >
    >+       if (tp->defer_tcp_accept.request && sk->sk_state == TCP_ESTABLISHED) {
    >+               tcp_send_active_reset(sk, GFP_ATOMIC);
    >+               goto death;

    Here socket sk is not attached to listening socket's request queue. tcp_done()
    will not call inet_csk_destroy_sock() (and tcp_v4_destroy_sock() which should
    release this sk) as socket is not DEAD. Therefore socket sk will be lost for
    freeing.
    ----------------------------------------

    Finally, Alexey Kuznetsov argues that there might not even be any
    real value or advantage to these new semantics even if we fix all
    of the bugs:

    ----------------------------------------
    Hiding sockets with only out-of-order data from accept() is the
    only thing which is impossible with the old approach. Is this really
    so valuable? My opinion: no, this is nothing but a new loophole
    to consume memory without control.
    ----------------------------------------

    So revert this thing for now.

    Signed-off-by: David S. Miller
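
The ABBA ordering problem described above can be sketched with two ordinary locks (illustrative Python, not the kernel's socket locking primitives):

```python
import threading

listener_lock = threading.Lock()  # stands in for the parent listening socket
child_lock = threading.Lock()     # stands in for the child socket

def accept_path():
    # normal ordering: parent listening socket --> child socket
    with listener_lock:
        with child_lock:
            pass

def defer_accept_check_path():
    # the reverted code holds only the child lock; taking the parent
    # lock here reverses the ordering, so racing with accept_path() on
    # another CPU can leave each side waiting forever for the other
    with child_lock:
        with listener_lock:
            pass

# Run serially, both paths complete; run concurrently, each thread can
# grab its first lock and then block on the other's -- the ABBA deadlock.
accept_path()
defer_accept_check_path()
```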


12 Jun, 2008

4 commits


11 Jun, 2008

1 commit

  • The tcp_unhash() method in include/net/tcp.h is no longer needed,
    as the unhash method in the tcp_prot structure is now inet_unhash
    (instead of tcp_unhash, as in the past); see the tcp_prot structure
    in net/ipv4/tcp_ipv4.c.

    So, this patch removes the tcp_unhash() declaration from
    include/net/tcp.h.

    Signed-off-by: Rami Rosen
    Signed-off-by: David S. Miller


16 Apr, 2008

1 commit


14 Apr, 2008

6 commits


10 Apr, 2008

1 commit

  • Allow the use of SACK and window scaling when syncookies are used
    and the client supports tcp timestamps. Options are encoded into
    the timestamp sent in the syn-ack and restored from the timestamp
    echo when the ack is received.

    Based on earlier work by Glenn Griffin.
    This patch avoids increasing the size of structs by encoding TCP
    options into the least significant bits of the timestamp and
    by not using any 'timestamp offset'.

    The downside is that the timestamp sent in the packet after the synack
    will increase by several seconds.

    Changes since v1:
    * Don't duplicate the timestamp echo decoding function; put it into
      ipv4/syncookie.c and have ipv6/syncookies.c use it.
    * Feedback from Glenn Griffin: fix a line indented with spaces, kill
      a redundant if ().

    Reviewed-by: Hagen Paul Pfeifer
    Signed-off-by: Florian Westphal
    Signed-off-by: David S. Miller
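
The encoding idea can be sketched as bit packing (illustrative Python; the field widths and helper names are assumptions, not the patch's exact layout):

```python
TSBITS = 6                 # low bits of the timestamp reserved for options
TSMASK = (1 << TSBITS) - 1

def cookie_init_timestamp(now, wscale, sack_ok):
    """Overwrite the timestamp's low bits with the negotiated options."""
    options = (wscale & 0xF) | (int(sack_ok) << 4)
    ts = (now & ~TSMASK) | options
    if ts < now:           # never let the sent timestamp run backwards,
        ts += TSMASK + 1   # so it may jump ahead by several seconds
    return ts

def cookie_decode_options(ts_echo):
    """Recover the options from the timestamp echoed in the final ACK."""
    options = ts_echo & TSMASK
    return options & 0xF, bool(options >> 4)   # (wscale, sack_ok)

ts = cookie_init_timestamp(now=1_000_000, wscale=7, sack_ok=True)
assert ts >= 1_000_000
assert cookie_decode_options(ts) == (7, True)
```

Because the low bits carry options rather than clock ticks, the timestamp sent after the SYN-ACK can jump by several seconds, which is the downside the message mentions.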


08 Apr, 2008

1 commit

  • This fixes Bugzilla #10384.

    tcp_simple_retransmit increments L without any check whatsoever
    for overflowing S+L when Reno is in use.

    The simplest scenario I can currently think of is rather
    complex in practice (there might be some more straightforward
    cases though): if the MSS is reduced during MTU probing, it
    may end up marking everything lost, and if some duplicate ACKs
    arrived prior to that, sacked_out will be non-zero as well,
    leading to S+L > packets_out. tcp_clean_rtx_queue on the next
    cumulative ACK or tcp_fastretrans_alert on the next duplicate
    ACK will fix the S counter.

    A more straightforward (but questionable) solution would be to
    just call tcp_reset_reno_sack() in tcp_simple_retransmit, but
    that would negatively impact the probe's retransmission; i.e.,
    the retransmissions would not occur if some duplicate ACKs
    had arrived.

    So I had to add Reno sacked_out resetting to the CA_Loss state
    when the first cumulative ACK arrives (this stale sacked_out
    might actually be the explanation for the reports of left_out
    overflows in kernels prior to 2.6.23 and the S+L overflow reports
    on 2.6.24). However, this alone won't be enough to fix kernels
    before 2.6.24, because it builds on top of commit
    1b6d427bb7e ([TCP]: Reduce sacked_out with reno when purging
    write_queue) to keep the sacked_out from overflowing.

    Signed-off-by: Ilpo Järvinen
    Reported-by: Alessandro Suardi
    Signed-off-by: David S. Miller
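
The invariant at stake can be sketched numerically (illustrative Python; S is sacked_out, L is lost_out, and the numbers are made up):

```python
def left_out_ok(sacked_out, lost_out, packets_out):
    """The invariant the patch restores: S + L must never exceed the
    number of packets outstanding."""
    return sacked_out + lost_out <= packets_out

packets_out = 10
sacked_out = 3            # stale Reno count from earlier duplicate ACKs

# tcp_simple_retransmit marking everything lost without touching S:
lost_out = packets_out
assert not left_out_ok(sacked_out, lost_out, packets_out)  # S+L overflow

# The fix: reset the Reno sacked_out (in the real patch, when the first
# cumulative ACK arrives in the CA_Loss state).
sacked_out = 0
assert left_out_ok(sacked_out, lost_out, packets_out)
```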


24 Mar, 2008

1 commit

  • The first u32 copied from syncookie_secret is overwritten by the
    minute counter four lines below. After adjusting the destination
    address, the size of syncookie_secret can be reduced accordingly.

    AFAICS, the only other user of syncookie_secret[] is the ipv6
    syncookie support. Because ipv6 syncookies only grab 44 bytes from
    syncookie_secret[], this shouldn't affect them in any way.

    With fixes from Glenn Griffin.

    Signed-off-by: Florian Westphal
    Acked-by: Glenn Griffin
    Signed-off-by: David S. Miller
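
The layout change can be sketched in a few lines (illustrative Python; the real code fills an array of u32s used as the hash input, and the sizes here are made up):

```python
SECRET_WORDS = 17          # illustrative pre-patch size, in 32-bit words

def hash_input_before(secret, minute_count):
    tmp = list(secret)     # copies secret[0] ...
    tmp[0] = minute_count  # ... which is immediately overwritten here
    return tmp

def hash_input_after(secret, minute_count):
    # Put the counter first and copy the (now one-word-smaller) secret
    # after it; the hash input is identical for the same secret tail.
    return [minute_count] + list(secret)

old_secret = list(range(SECRET_WORDS))
new_secret = old_secret[1:]            # the first u32 was dead weight
assert hash_input_before(old_secret, 99) == hash_input_after(new_secret, 99)
```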
