Doug / smarc-fsl-linux-kernel | Embedian Git Server

20 Sep, 2012

2 commits

d4915c087 ipv6: make ip6_frag_nqueues() and ip6_frag_mem() static inline ... Browse Code »

Cc: Herbert Xu
Cc: Michal Kubeček
Cc: David Miller
Signed-off-by: Cong Wang
Signed-off-by: David S. Miller

Amerigo Wang
2012-09-20 05:23:28 +0800
b836c99fd ipv6: unify conntrack reassembly expire code with standard one ... Browse Code »

Two years ago, Shan Wei tried to fix this:
http://patchwork.ozlabs.org/patch/43905/

The problem is that RFC2460 requires an ICMP Time
Exceeded -- Fragment Reassembly Time Exceeded message should be
sent to the source of that fragment, if the defragmentation
times out.

"
If insufficient fragments are received to complete reassembly of a
packet within 60 seconds of the reception of the first-arriving
fragment of that packet, reassembly of that packet must be
abandoned and all the fragments that have been received for that
packet must be discarded. If the first fragment (i.e., the one
with a Fragment Offset of zero) has been received, an ICMP Time
Exceeded -- Fragment Reassembly Time Exceeded message should be
sent to the source of that fragment.
"

As Herbert suggested, we could actually use the standard IPv6
reassembly code which follows RFC2460.

With this patch applied, I can see ICMP Time Exceeded sent
from the receiver when the sender sent out 3/4 fragmented
IPv6 UDP packet.

Cc: Herbert Xu
Cc: Michal Kubeček
Cc: David Miller
Cc: Hideaki YOSHIFUJI
Cc: Patrick McHardy
Cc: Pablo Neira Ayuso
Cc: netfilter-devel@vger.kernel.org
Signed-off-by: Cong Wang
Signed-off-by: David S. Miller

Amerigo Wang
2012-09-20 05:23:28 +0800

25 Aug, 2012

1 commit

e6acb3848 Merge branch 'for-next' of git://git.kernel.org/pub/scm/linux/kernel/git/ebiederm/user-namespace ... Browse Code »

This is an initial merge in of Eric Biederman's work to start adding
user namespace support to the networking.

Signed-off-by: David S. Miller

David S. Miller
2012-08-25 06:54:37 +0800

15 Aug, 2012

2 commits

4f82f4573 net ip6 flowlabel: Make owner a union of struct pid * and kuid_t ... Browse Code »

Correct a long standing omission and use struct pid in the owner
field of struct ip6_flowlabel when the share type is IPV6_FL_S_PROCESS.
This guarantees we don't have issues when pid wraparound occurs.

Use a kuid_t in the owner field of struct ip6_flowlabel when the
share type is IPV6_FL_S_USER to add user namespace support.

In /proc/net/ip6_flowlabel capture the current pid namespace when
opening the file and release the pid namespace when the file is
closed ensuring we print the pid owner value that is meaning to
the reader of the file. Similarly use from_kuid_munged to print
uid values that are meaningful to the reader of the file.

This requires exporting pid_nr_ns so that ipv6 can continue to built
as a module. Yoiks what silliness

Acked-by: David S. Miller
Acked-by: Serge Hallyn
Signed-off-by: Eric W. Biederman

Eric W. Biederman
2012-08-15 12:49:25 +0800
c12b395a4 gre: Support GRE over IPv6 ... Browse Code »

GRE over IPv6 implementation.

Signed-off-by: Dmitry Kozlov
Signed-off-by: David S. Miller

xeb@mail.ru
2012-08-15 05:28:32 +0800

19 Jul, 2012

1 commit

ddbe50320 ipv6: add ipv6_addr_hash() helper ... Browse Code »

Introduce ipv6_addr_hash() helper doing a XOR on all bits
of an IPv6 address, with an optimized x86_64 version.

Use it in flow dissector, as suggested by Andrew McGregor,
to reduce hash collision probabilities in fq_codel (and other
users of flow dissector)

Use it in ip6_tunnel.c and use more bit shuffling, as suggested
by David Laight, as existing hash was ignoring most of them.

Use it in sunrpc and use more bit shuffling, using hash_32().

Use it in net/ipv6/addrconf.c, using hash_32() as well.

As a cleanup, use it in net/ipv4/tcp_metrics.c

Signed-off-by: Eric Dumazet
Reported-by: Andrew McGregor
Cc: Dave Taht
Cc: Tom Herbert
Cc: David Laight
Cc: Joe Perches
Signed-off-by: David S. Miller

Eric Dumazet
2012-07-19 02:28:46 +0800

12 Jul, 2012

1 commit

b94f1c090 ipv6: Use icmpv6_notify() to propagate redirect, instead of rt6_redirect(). ... Browse Code »

And delete rt6_redirect(), since it is no longer used.

Signed-off-by: David S. Miller

David S. Miller
2012-07-12 15:33:37 +0800

11 Jul, 2012

1 commit

1a203cb33 ipv6: optimize ipv6 addresses compares ... Browse Code »

On 64 bit arches having efficient unaligned accesses (eg x86_64) we can
use long words to reduce number of instructions for free.

Joe Perches suggested to change ipv6_masked_addr_cmp() to return a bool
instead of 'int', to make sure ipv6_masked_addr_cmp() cannot be used
in a sorting function.

Signed-off-by: Eric Dumazet
Cc: Joe Perches
Signed-off-by: David S. Miller

Eric Dumazet
2012-07-11 14:13:46 +0800

19 May, 2012

1 commit

a50feda54 ipv6: bool/const conversions phase2 ... Browse Code »

Mostly bool conversions, some inline removals and const additions.

Signed-off-by: Eric Dumazet
Signed-off-by: David S. Miller

Eric Dumazet
2012-05-19 13:08:16 +0800

18 May, 2012

2 commits

92113bfde ipv6: bool conversions phase1 ... Browse Code »

ipv6_opt_accepted() returns a bool, and can use const pointers

ipv6_addr_equal(), ipv6_addr_any(), ipv6_addr_loopback(),
ipv6_addr_orchid() return a bool.

Signed-off-by: Eric Dumazet
Signed-off-by: David S. Miller

Eric Dumazet
2012-05-18 14:24:13 +0800
cbc264cac ip_frag: struct inet_frags match() method returns a bool ... Browse Code »

- match() method returns a boolean
- return (A && B && C && D) -> return A && B && C && D
- fix indentation

Signed-off-by: Eric Dumazet

Eric Dumazet
2012-05-18 13:40:27 +0800

21 Apr, 2012

2 commits

a5347fe36 net: Delete all remaining instances of ctl_path ... Browse Code »

We don't use struct ctl_path anymore so delete the exported constants.

Signed-off-by: Eric W. Biederman
Acked-by: Pavel Emelyanov
Signed-off-by: David S. Miller

Eric W. Biederman
2012-04-21 09:22:30 +0800
4e5ca7854 net ipv4: Remove the unneeded registration of an empty net/ipv4/neigh ... Browse Code »

sysctl no longer requires explicit creation of directories. The neigh
directory is always populated with at least a default entry so this
won't cause any user visible changes.

Delete the ipv4_path and the ipv4_skeleton these are no longer needed.

Directly register the ipv4_route_table.

And since I am an idiot remove the header definitions that I should
have removed in the previous patch.

Signed-off-by: Eric W. Biederman
Acked-by: Pavel Emelyanov
Signed-off-by: David S. Miller

Eric W. Biederman
2012-04-21 09:21:18 +0800

16 Apr, 2012

1 commit

95c961747 net: cleanup unsigned to unsigned int ... Browse Code »

Use of "unsigned int" is preferred to bare "unsigned" in net tree.

Signed-off-by: Eric Dumazet
Signed-off-by: David S. Miller

Eric Dumazet
2012-04-16 00:44:40 +0800

04 Dec, 2011

1 commit

75f2811c6 ipv6: Add fragment reporting to ipv6_skip_exthdr(). ... Browse Code »

While parsing through IPv6 extension headers, fragment headers are
skipped making them invisible to the caller. This reports the
fragment offset of the last header in order to make it possible to
determine whether the packet is fragmented and, if so whether it is
a first or last fragment.

Signed-off-by: Jesse Gross

Jesse Gross
2011-12-04 01:35:10 +0800

23 Nov, 2011

1 commit

4e3fd7a06 net: remove ipv6_addr_copy() ... Browse Code »

C assignment can handle struct in6_addr copying.

Signed-off-by: Alexey Dobriyan
Signed-off-by: David S. Miller

Alexey Dobriyan
2011-11-23 05:43:32 +0800

14 Nov, 2011

1 commit

2a24444f8 ipv6: reduce percpu needs for icmpv6msg mibs ... Browse Code »

Reading /proc/net/snmp6 on a machine with a lot of cpus is very
expensive (can be ~88000 us).

This is because ICMPV6MSG MIB uses 4096 bytes per cpu, and folding
values for all possible cpus can read 16 Mbytes of memory (32MBytes on
non x86 arches)

ICMP messages are not considered as fast path on a typical server, and
eventually few cpus handle them anyway. We can afford an atomic
operation instead of using percpu data.

This saves 4096 bytes per cpu and per network namespace.

Signed-off-by: Eric Dumazet
Signed-off-by: David S. Miller

Eric Dumazet
2011-11-14 13:12:26 +0800

27 Oct, 2011

1 commit

b903d324b ipv6: tcp: fix TCLASS value in ACK messages sent from TIME_WAIT ... Browse Code »

commit 66b13d99d96a (ipv4: tcp: fix TOS value in ACK messages sent from
TIME_WAIT) fixed IPv4 only.

This part is for the IPv6 side, adding a tclass param to ip6_xmit()

We alias tw_tclass and tw_tos, if socket family is INET6.

[ if sockets is ipv4-mapped, only IP_TOS socket option is used to fill
TOS field, TCLASS is not taken into account ]

Signed-off-by: Eric Dumazet
Signed-off-by: David S. Miller

Eric Dumazet
2011-10-27 12:44:35 +0800

22 Jul, 2011

1 commit

87c48fa3b ipv6: make fragment identifications less predictable ... Browse Code »

IPv6 fragment identification generation is way beyond what we use for
IPv4 : It uses a single generator. Its not scalable and allows DOS
attacks.

Now inetpeer is IPv6 aware, we can use it to provide a more secure and
scalable frag ident generator (per destination, instead of system wide)

This patch :
1) defines a new secure_ipv6_id() helper
2) extends inet_getid() to provide 32bit results
3) extends ipv6_select_ident() with a new dest parameter

Reported-by: Fernando Gont
Signed-off-by: Eric Dumazet
Signed-off-by: David S. Miller

Eric Dumazet
2011-07-22 12:25:58 +0800

20 May, 2011

1 commit

be281e554 ipv6: reduce per device ICMP mib sizes ... Browse Code »

ipv6 has per device ICMP SNMP counters, taking too much space because
they use percpu storage.

needed size per device is :
(512+4)*sizeof(long)*number_of_possible_cpus*2

On a 32bit kernel, 16 possible cpus, this wastes more than 64kbytes of
memory per ipv6 enabled network device, taken in vmalloc pool.

Since ICMP messages are rare, just use shared counters (atomic_long_t)

Per network space ICMP counters are still using percpu memory, we might
also convert them to shared counters in a future patch.

Signed-off-by: Eric Dumazet
CC: Denys Fedoryshchenko
Signed-off-by: David S. Miller

Eric Dumazet
2011-05-20 04:21:22 +0800

25 Apr, 2011

1 commit

2a9e95070 net: Remove __KERNEL__ cpp checks from include/net ... Browse Code »

These header files are never installed to user consumption, so any
__KERNEL__ cpp checks are superfluous.

Projects should also not copy these files into their userland utility
sources and try to use them there. If they insist on doing so, the
onus is on them to sanitize the headers as needed.

Signed-off-by: David S. Miller

David S. Miller
2011-04-25 01:54:56 +0800

23 Apr, 2011

1 commit

b71d1d426 inet: constify ip headers and in6_addr ... Browse Code »

Add const qualifiers to structs iphdr, ipv6hdr and in6_addr pointers
where possible, to make code intention more obvious.

Signed-off-by: Eric Dumazet
Signed-off-by: David S. Miller

Eric Dumazet
2011-04-23 02:04:14 +0800

13 Mar, 2011

1 commit

4c9483b2f ipv6: Convert to use flowi6 where applicable. ... Browse Code »

Signed-off-by: David S. Miller

David S. Miller
2011-03-13 07:08:54 +0800

04 Mar, 2011

1 commit

0a0e9ae1b Merge branch 'master' of master.kernel.org:/pub/scm/linux/kernel/git/davem/net-2.6 ... Browse Code »

Conflicts:
drivers/net/bnx2x/bnx2x.h

David S. Miller
2011-03-04 13:27:42 +0800

02 Mar, 2011

4 commits

2774c131b xfrm: Handle blackhole route creation via afinfo. ... Browse Code »

That way we don't have to potentially do this in every xfrm_lookup()
caller.

Signed-off-by: David S. Miller

David S. Miller
2011-03-02 06:59:04 +0800
69ead7afd ipv6: Normalize arguments to ip6_dst_blackhole(). ... Browse Code »

Return a dst pointer which is potentitally error encoded.

Don't pass original dst pointer by reference, pass a struct net
instead of a socket, and elide the flow argument since it is
unnecessary.

Signed-off-by: David S. Miller

David S. Miller
2011-03-02 06:45:33 +0800
a1414715f ipv6: Change final dst lookup arg name to "can_sleep" ... Browse Code »

Since it indicates whether we are invoked from a sleepable
context or not.

Signed-off-by: David S. Miller

David S. Miller
2011-03-02 06:32:04 +0800
68d0c6d34 ipv6: Consolidate route lookup sequences. ... Browse Code »

Route lookups follow a general pattern in the ipv6 code wherein
we first find the non-IPSEC route, potentially override the
flow destination address due to ipv6 options settings, and then
finally make an IPSEC search using either xfrm_lookup() or
__xfrm_lookup().

__xfrm_lookup() is used when we want to generate a blackhole route
if the key manager needs to resolve the IPSEC rules (in this case
-EREMOTE is returned and the original 'dst' is left unchanged).

Otherwise plain xfrm_lookup() is used and when asynchronous IPSEC
resolution is necessary, we simply fail the lookup completely.

All of these cases are encapsulated into two routines,
ip6_dst_lookup_flow and ip6_sk_dst_lookup_flow. The latter of which
handles unconnected UDP datagram sockets.

Signed-off-by: David S. Miller

David S. Miller
2011-03-02 05:19:07 +0800

23 Feb, 2011

1 commit

5ced13396 ipv6: Add IPv6 multicast address flag defines ... Browse Code »

This commit adds the missing IPv6 multicast address flag defines to
complement the already existing multicast address scope defines and to
be able to check these flags nicely in the future.

Signed-off-by: Linus Lüssing
Signed-off-by: David S. Miller

Linus Lüssing
2011-02-23 02:07:27 +0800

24 Sep, 2010

1 commit

a02cec215 net: return operator cleanup ... Browse Code »

Change "return (EXPR);" to "return EXPR;"

return is not a function, parentheses are not required.

Signed-off-by: Eric Dumazet
Signed-off-by: David S. Miller

Eric Dumazet
2010-09-24 05:33:39 +0800

01 Jul, 2010

1 commit

4ce3c183f snmp: 64bit ipstats_mib for all arches ... Browse Code »

/proc/net/snmp and /proc/net/netstat expose SNMP counters.

Width of these counters is either 32 or 64 bits, depending on the size
of "unsigned long" in kernel.

This means user program parsing these files must already be prepared to
deal with 64bit values, regardless of user program being 32 or 64 bit.

This patch introduces 64bit snmp values for IPSTAT mib, where some
counters can wrap pretty fast if they are 32bit wide.

# netstat -s|egrep "InOctets|OutOctets"
InOctets: 244068329096
OutOctets: 244069348848

Signed-off-by: Eric Dumazet
Signed-off-by: David S. Miller

Eric Dumazet
2010-07-01 04:31:19 +0800

02 Jun, 2010

1 commit

20c59de2e ipv6: Refactor update of IPv6 flowi destination address for srcrt (RH) option ... Browse Code »

There are more than a dozen occurrences of following code in the
IPv6 stack:

if (opt && opt->srcrt) {
struct rt0_hdr *rt0 = (struct rt0_hdr *) opt->srcrt;
ipv6_addr_copy(&final, &fl.fl6_dst);
ipv6_addr_copy(&fl.fl6_dst, rt0->addr);
final_p = &final;
}

Replace those with a helper. Note that the helper overrides final_p
in all cases. This is ok as final_p was previously initialized to
NULL when declared.

Signed-off-by: Arnaud Ebalard
Signed-off-by: David S. Miller

Arnaud Ebalard
2010-06-02 22:08:31 +0800

25 May, 2010

1 commit

4be929be3 kernel-wide: replace USHORT_MAX, SHORT_MAX and SHORT_MIN with USHRT_MAX, SHRT_MAX and SHRT_MIN ... Browse Code »

- C99 knows about USHRT_MAX/SHRT_MAX/SHRT_MIN, not
USHORT_MAX/SHORT_MAX/SHORT_MIN.

- Make SHRT_MIN of type s16, not int, for consistency.

[akpm@linux-foundation.org: fix drivers/dma/timb_dma.c]
[akpm@linux-foundation.org: fix security/keys/keyring.c]
Signed-off-by: Alexey Dobriyan
Acked-by: WANG Cong
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Alexey Dobriyan
2010-05-25 23:07:02 +0800

24 Apr, 2010

2 commits

4b340ae20 IPv6: Complete IPV6_DONTFRAG support ... Browse Code »

Finally add support to detect a local IPV6_DONTFRAG event
and return the relevant data to the user if they've enabled
IPV6_RECVPATHMTU on the socket. The next recvmsg() will
return no data, but have an IPV6_PATHMTU as ancillary data.

Signed-off-by: Brian Haley
Signed-off-by: David S. Miller

Brian Haley
2010-04-24 14:35:29 +0800
13b52cd44 IPv6: Add dontfrag argument to relevant functions ... Browse Code »

Add dontfrag argument to relevant functions for
IPV6_DONTFRAG support, as well as allowing the value
to be passed-in via ancillary cmsg data.

Signed-off-by: Brian Haley
Signed-off-by: David S. Miller

Brian Haley
2010-04-24 14:35:28 +0800

16 Apr, 2010

1 commit

4e15ed4d9 net: replace ipfragok with skb->local_df ... Browse Code »

As Herbert Xu said: we should be able to simply replace ipfragok
with skb->local_df. commit f88037(sctp: Drop ipfargok in sctp_xmit function)
has droped ipfragok and set local_df value properly.

The patch kills the ipfragok parameter of .queue_xmit().

Signed-off-by: Shan Wei
Signed-off-by: David S. Miller

Shan Wei
2010-04-16 14:36:37 +0800

31 Mar, 2010

1 commit

d57b8fb8a ipv6: Use __fls() instead of fls() in __ipv6_addr_diff(). ... Browse Code »

Because we have ensured that the argument is non-zero,
it is better to use __fls() and generate better code.

Signed-off-by: YOSHIFUJI Hideaki
Signed-off-by: David S. Miller

YOSHIFUJI Hideaki / 吉藤英明
2010-03-31 14:28:46 +0800

26 Feb, 2010

1 commit

45bb00609 ipv6: Remove IPV6_ADDR_RESERVED ... Browse Code »

RFC 4291 section 2.4 states that all uncategorized addresses
should be considered as Global Unicast.

This will remove IPV6_ADDR_RESERVED completely
and return IPV6_ADDR_UNICAST in ipv6_addr_type() instead.

Signed-off-by: Ulrich Weber
Signed-off-by: David S. Miller

Ulrich Weber
2010-02-26 19:59:07 +0800

17 Feb, 2010

1 commit

9874c41cd ipv6.h: reassembly: replace calculated magic number with multiplication ... Browse Code »

On Tue, 2010-02-16 at 16:47 +0100, Patrick McHardy wrote:
> Joe Perches wrote:
> >> @@ -246,6 +246,8 @@ extern int ipv6_opt_accepted(struct sock *sk, struct sk_buff *skb);
> >> int ip6_frag_nqueues(struct net *net);
> >> int ip6_frag_mem(struct net *net);
> >>
> >> +#define IPV6_FRAG_HIGH_THRESH 262144 /* == 256*1024 */
> >> +#define IPV6_FRAG_LOW_THRESH 196608 /* == 192*1024 */
> >> #define IPV6_FRAG_TIMEOUT (60*HZ) /* 60 seconds */
> >
> > 196608 isn't a number I want to remember.
> > Is this better as:
> >
> > #define IPV6_FRAG_HIGH_THRESH (256 * 1024) /* 262144 */
> > #define IPV6_FRAG_LOW_THRESH (192 * 1024) /* 196608 */
>
> Please send a patch, I'll apply it once these patches are in Dave's
> tree.

Signed-off-by: Joe Perches
Signed-off-by: David S. Miller

Joe Perches
2010-02-17 16:03:28 +0800

16 Feb, 2010

1 commit

5d0aa2ccd netfilter: nf_conntrack: add support for "conntrack zones" ... Browse Code »

Normally, each connection needs a unique identity. Conntrack zones allow
to specify a numerical zone using the CT target, connections in different
zones can use the same identity.

Example:

iptables -t raw -A PREROUTING -i veth0 -j CT --zone 1
iptables -t raw -A OUTPUT -o veth1 -j CT --zone 1

Signed-off-by: Patrick McHardy

Patrick McHardy
2010-02-16 01:13:33 +0800