Eric Lee / smarc-ti-linux-kernel | Embedian Git Server

10 Dec, 2014

1 commit

f69e6d131 ip_generic_getfrag, udplite_getfrag: switch to passing msghdr ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2014-12-10 05:28:22 +0800

09 Dec, 2014

1 commit

60c04aecd udp: Neaten and reduce size of compute_score functions ... Browse Code »

The compute_score functions are a bit difficult to read.

Neaten them a bit to reduce object sizes and make them a
bit more intelligible.

Return early to avoid indentation and avoid unnecessary
initializations.

(allyesconfig, but w/ -O2 and no profiling)

$ size net/ipv[46]/udp.o.*
text data bss dec hex filename
28680 1184 25 29889 74c1 net/ipv4/udp.o.new
28756 1184 25 29965 750d net/ipv4/udp.o.old
17600 1010 2 18612 48b4 net/ipv6/udp.o.new
17632 1010 2 18644 48d4 net/ipv6/udp.o.old

Signed-off-by: Joe Perches
Acked-by: Eric Dumazet
Signed-off-by: David S. Miller

Joe Perches
2014-12-09 09:28:47 +0800

24 Nov, 2014

1 commit

227158db1 new helper: skb_copy_and_csum_datagram_msg() ... Browse Code »

Signed-off-by: Al Viro

Al Viro
2014-11-24 17:28:44 +0800

13 Nov, 2014

1 commit

4243cdc2c udp: Neaten function pointer calls and add braces ... Browse Code »

Standardize function pointer uses.

Convert calling style from:
(*foo)(args...);
to:
foo(args...);

Other miscellanea:

o Add braces around loops with single ifs on multiple lines
o Realign arguments around these functions
o Invert logic in if to return immediately.

Signed-off-by: Joe Perches
Signed-off-by: David S. Miller

Joe Perches
2014-11-13 03:51:59 +0800

12 Nov, 2014

2 commits

ba7a46f16 net: Convert LIMIT_NETDEBUG to net_dbg_ratelimited ... Browse Code »

Use the more common dynamic_debug capable net_dbg_ratelimited
and remove the LIMIT_NETDEBUG macro.

All messages are still ratelimited.

Some KERN_ uses are changed to KERN_DEBUG.

This may have some negative impact on messages that were
emitted at KERN_INFO that are not not enabled at all unless
DEBUG is defined or dynamic_debug is enabled. Even so,
these messages are now _not_ emitted by default.

This also eliminates the use of the net_msg_warn sysctl
"/proc/sys/net/core/warnings". For backward compatibility,
the sysctl is not removed, but it has no function. The extern
declaration of net_msg_warn is removed from sock.h and made
static in net/core/sysctl_net_core.c

Miscellanea:

o Update the sysctl documentation
o Remove the embedded uses of pr_fmt
o Coalesce format fragments
o Realign arguments

Signed-off-by: Joe Perches
Signed-off-by: David S. Miller

Joe Perches
2014-11-12 03:10:31 +0800
2c8c56e15 net: introduce SO_INCOMING_CPU ... Browse Code »
5

Alternative to RPS/RFS is to use hardware support for multiple
queues.

Then split a set of million of sockets into worker threads, each
one using epoll() to manage events on its own socket pool.

Ideally, we want one thread per RX/TX queue/cpu, but we have no way to
know after accept() or connect() on which queue/cpu a socket is managed.

We normally use one cpu per RX queue (IRQ smp_affinity being properly
set), so remembering on socket structure which cpu delivered last packet
is enough to solve the problem.

After accept(), connect(), or even file descriptor passing around
processes, applications can use :

int cpu;
socklen_t len = sizeof(cpu);

getsockopt(fd, SOL_SOCKET, SO_INCOMING_CPU, &cpu, &len);

And use this information to put the socket into the right silo
for optimal performance, as all networking stack should run
on the appropriate cpu, without need to send IPI (RPS/RFS).

Signed-off-by: Eric Dumazet
Signed-off-by: David S. Miller

Eric Dumazet
2014-11-12 02:00:06 +0800

08 Nov, 2014

1 commit

36cbb2452 udp: Increment UDP_MIB_IGNOREDMULTI for arriving unmatched multicasts ... Browse Code »
5

As NIC multicast filtering isn't perfect, and some platforms are
quite content to spew broadcasts, we should not trigger an event
for skb:kfree_skb when we do not have a match for such an incoming
datagram. We do though want to avoid sweeping the matter under the
rug entirely, so increment a suitable statistic.

This incorporates feedback from David L. Stevens, Karl Neiss and Eric
Dumazet.

V3 - use bool per David Miller

Signed-off-by: Rick Jones
Signed-off-by: David S. Miller

Rick Jones
2014-11-08 04:45:50 +0800

06 Nov, 2014

1 commit

51f3d02b9 net: Add and use skb_copy_datagram_msg() helper. ... Browse Code »

This encapsulates all of the skb_copy_datagram_iovec() callers
with call argument signature "skb, offset, msghdr->msg_iov, length".

When we move to iov_iters in the networking, the iov_iter object will
sit in the msghdr.

Having a helper like this means there will be less places to touch
during that transformation.

Based upon descriptions and patch from Al Viro.

Signed-off-by: David S. Miller

David S. Miller
2014-11-06 05:46:40 +0800

05 Nov, 2014

2 commits

6cf1093e5 udp: remove blank line between set and test ... Browse Code »

Suggested-by: Joe Perches
Signed-off-by: Fabian Frederick
Signed-off-by: David S. Miller

Fabian Frederick
2014-11-05 06:12:10 +0800
c18450a52 udp: remove else after return ... Browse Code »

else is unnecessary after return 0 in __udp4_lib_rcv()

Signed-off-by: Fabian Frederick
Signed-off-by: David S. Miller

Fabian Frederick
2014-11-05 04:13:18 +0800

06 Sep, 2014

1 commit

82eabd9eb net: merge cases where sock_efree and sock_edemux are the same function ... Browse Code »

Since sock_efree and sock_demux are essentially the same code for non-TCP
sockets and the case where CONFIG_INET is not defined we can combine the
code or replace the call to sock_edemux in several spots. As a result we
can avoid a bit of unnecessary code or code duplication.

Signed-off-by: Alexander Duyck
Signed-off-by: David S. Miller

Alexander Duyck
2014-09-06 08:43:45 +0800

02 Sep, 2014

1 commit

2abb7cdc0 udp: Add support for doing checksum unnecessary conversion ... Browse Code »
26

Add support for doing CHECKSUM_UNNECESSARY to CHECKSUM_COMPLETE
conversion in UDP tunneling path.

In the normal UDP path, we call skb_checksum_try_convert after locating
the UDP socket. The check is that checksum conversion is enabled for
the socket (new flag in UDP socket) and that checksum field is
non-zero.

In the UDP GRO path, we call skb_gro_checksum_try_convert after
checksum is validated and checksum field is non-zero. Since this is
already in GRO we assume that checksum conversion is always wanted.

Signed-off-by: Tom Herbert
Signed-off-by: David S. Miller

Tom Herbert
2014-09-02 12:36:28 +0800

25 Aug, 2014

1 commit

57c67ff4b udp: additional GRO support ... Browse Code »
26

Implement GRO for UDPv6. Add UDP checksum verification in gro_receive
for both UDP4 and UDP6 calling skb_gro_checksum_validate_zero_check.

Signed-off-by: Tom Herbert
Signed-off-by: David S. Miller

Tom Herbert
2014-08-25 09:09:24 +0800

24 Aug, 2014

1 commit

8fc54f689 net: use reciprocal_scale() helper ... Browse Code »
13

Replace open codings of (((u64) * ) >> 32) with reciprocal_scale().

Signed-off-by: Daniel Borkmann
Cc: Hannes Frederic Sowa
Signed-off-by: David S. Miller

Daniel Borkmann
2014-08-24 03:21:21 +0800

24 Jul, 2014

1 commit

274f482d3 sock: remove skb argument from sk_rcvqueues_full ... Browse Code »

It hasn't been used since commit 0fd7bac(net: relax rcvbuf limits).

Signed-off-by: Sorin Dumitru
Signed-off-by: David S. Miller

Sorin Dumitru
2014-07-24 04:23:06 +0800

17 Jul, 2014

3 commits

2dc41cff7 udp: Use hash2 for long hash1 chains in __udp*_lib_mcast_deliver. ... Browse Code »
2

Many multicast sources can have the same port which can result in a very
large list when hashing by port only. Hash by address and port instead
if this is the case. This makes multicast more similar to unicast.

On a 24-core machine receiving from 500 multicast sockets on the same
port, before this patch 80% of system CPU was used up by spin locking
and only ~25% of packets were successfully delivered.

With this patch, all packets are delivered and kernel overhead is ~8%
system CPU on spinlocks.

Signed-off-by: David Held
Signed-off-by: Eric Dumazet
Signed-off-by: David S. Miller

David Held
2014-07-17 14:29:52 +0800
5cf3d4619 udp: Simplify __udp*_lib_mcast_deliver. ... Browse Code »
26

Switch to using sk_nulls_for_each which shortens the code and makes it
easier to update.

Signed-off-by: David Held
Acked-by: Eric Dumazet
Signed-off-by: David S. Miller

David Held
2014-07-17 14:29:52 +0800
1a98c69af Merge git://git.kernel.org/pub/scm/linux/kernel/git/davem/net ... Browse Code »

Signed-off-by: David S. Miller

David S. Miller
2014-07-17 05:09:34 +0800

15 Jul, 2014

1 commit

155e010ed udp: Move udp_tunnel_segment into udp_offload.c ... Browse Code »

Signed-off-by: Tom Herbert
Signed-off-by: David S. Miller

Tom Herbert
2014-07-15 07:12:15 +0800

12 Jul, 2014

1 commit

a2f983f83 ipv4: remove the unnecessary variable in udp_mcast_next ... Browse Code »

Signed-off-by: Li RongQing
Signed-off-by: David S. Miller

Li RongQing
2014-07-12 05:08:17 +0800

27 Jun, 2014

1 commit

3e215c8d1 udp: Add MIB counters for rcvbuferrors ... Browse Code »

Add MIB counters for rcvbuferrors in UDP to help diagnose problems.

Signed-off-by: James M Leddy
Acked-by: Eric Dumazet
Signed-off-by: David S. Miller

James M Leddy
2014-06-27 15:20:55 +0800

14 Jun, 2014

1 commit

63c6f81cd udp: ipv4: do not waste time in __udp4_lib_mcast_demux_lookup ... Browse Code »
18

Its too easy to add thousand of UDP sockets on a particular bucket,
and slow down an innocent multicast receiver.

Early demux is supposed to be an optimization, we should avoid spending
too much time in it.

It is interesting to note __udp4_lib_demux_lookup() only tries to
match first socket in the chain.

10 is the threshold we already have in __udp4_lib_lookup() to switch
to secondary hash.

Fixes: 421b3885bf6d5 ("udp: ipv4: Add udp early demux")
Signed-off-by: Eric Dumazet
Reported-by: David Held
Cc: Shawn Bohrer
Signed-off-by: David S. Miller

Eric Dumazet
2014-06-14 06:39:24 +0800

05 Jun, 2014

3 commits

ebbe495f1 ipv4: use skb frags api in udp4_hwcsum() ... Browse Code »

Cc: "David S. Miller"
Signed-off-by: Cong Wang
Signed-off-by: David S. Miller

WANG Cong
2014-06-05 15:51:47 +0800
0f4f4ffa7 net: Add GSO support for UDP tunnels with checksum ... Browse Code »

Added a new netif feature for GSO_UDP_TUNNEL_CSUM. This indicates
that a device is capable of computing the UDP checksum in the
encapsulating header of a UDP tunnel.

Signed-off-by: Tom Herbert
Signed-off-by: David S. Miller

Tom Herbert
2014-06-05 13:46:38 +0800
af5fcba7f udp: Generic functions to set checksum ... Browse Code »

Added udp_set_csum and udp6_set_csum functions to set UDP checksums
in packets. These are for simple UDP packets such as those that might
be created in UDP tunnels.

Signed-off-by: Tom Herbert
Signed-off-by: David S. Miller

Tom Herbert
2014-06-05 13:46:38 +0800

24 May, 2014

2 commits

1c19448c9 net: Make enabling of zero UDP6 csums more restrictive ... Browse Code »

RFC 6935 permits zero checksums to be used in IPv6 however this is
recommended only for certain tunnel protocols, it does not make
checksums completely optional like they are in IPv4.

This patch restricts the use of IPv6 zero checksums that was previously
intoduced. no_check6_tx and no_check6_rx have been added to control
the use of checksums in UDP6 RX and TX path. The normal
sk_no_check_{rx,tx} settings are not used (this avoids ambiguity when
dealing with a dual stack socket).

A helper function has been added (udp_set_no_check6) which can be
called by tunnel impelmentations to all zero checksums (send on the
socket, and accept them as valid).

Signed-off-by: Tom Herbert
Signed-off-by: David S. Miller

Tom Herbert
2014-05-24 04:28:53 +0800
28448b804 net: Split sk_no_check into sk_no_check_{rx,tx} ... Browse Code »

Define separate fields in the sock structure for configuring disabling
checksums in both TX and RX-- sk_no_check_tx and sk_no_check_rx.
The SO_NO_CHECK socket option only affects sk_no_check_tx. Also,
removed UDP_CSUM_* defines since they are no longer necessary.

Signed-off-by: Tom Herbert
Signed-off-by: David S. Miller

Tom Herbert
2014-05-24 04:28:53 +0800

15 May, 2014

2 commits

c72283174 net: Use a more standard macro for INET_ADDR_COOKIE ... Browse Code »

Missing a colon on definition use is a bit odd so
change the macro for the 32 bit case to declare an
__attribute__((unused)) and __deprecated variable.

The __deprecated attribute will cause gcc to emit
an error if the variable is actually used.

Signed-off-by: Joe Perches
Signed-off-by: David S. Miller

Joe Perches
2014-05-15 04:07:23 +0800
122ff243f ipv4: make ip_local_reserved_ports per netns ... Browse Code »

ip_local_port_range is already per netns, so should ip_local_reserved_ports
be. And since it is none by default we don't actually need it when we don't
enable CONFIG_SYSCTL.

By the way, rename inet_is_reserved_local_port() to inet_is_local_reserved_port()

Cc: "David S. Miller"
Signed-off-by: Cong Wang
Signed-off-by: David S. Miller

WANG Cong
2014-05-15 03:31:45 +0800

09 May, 2014

1 commit

0a80966b1 net: Verify UDP checksum before handoff to encap ... Browse Code »

Moving validation of UDP checksum to be done in UDP not encap layer.

Signed-off-by: Tom Herbert
Signed-off-by: David S. Miller

Tom Herbert
2014-05-09 11:47:50 +0800

06 May, 2014

1 commit

ed70fcfce net: Call skb_checksum_init in IPv4 ... Browse Code »

Call skb_checksum_init instead of private functions.

Signed-off-by: Tom Herbert
Signed-off-by: David S. Miller

Tom Herbert
2014-05-06 03:26:30 +0800

20 Feb, 2014

1 commit

c8e6ad082 ipv6: honor IPV6_PKTINFO with v4 mapped addresses on sendmsg ... Browse Code »
13

In case we decide in udp6_sendmsg to send the packet down the ipv4
udp_sendmsg path because the destination is either of family AF_INET or
the destination is an ipv4 mapped ipv6 address, we don't honor the
maybe specified ipv4 mapped ipv6 address in IPV6_PKTINFO.

We simply can check for this option in ip_cmsg_send because no calls to
ipv6 module functions are needed to do so.

Reported-by: Gert Doering
Cc: Tore Anderson
Signed-off-by: Hannes Frederic Sowa
Signed-off-by: David S. Miller

Hannes Frederic Sowa
2014-02-20 05:28:42 +0800

19 Jan, 2014

1 commit

342dfc306 net: add build-time checks for msg->msg_name size ... Browse Code »

This is a follow-up patch to f3d3342602f8bc ("net: rework recvmsg
handler msg_name and msg_namelen logic").

DECLARE_SOCKADDR validates that the structure we use for writing the
name information to is not larger than the buffer which is reserved
for msg->msg_name (which is 128 bytes). Also use DECLARE_SOCKADDR
consistently in sendmsg code paths.

Signed-off-by: Steffen Hurrle
Suggested-by: Hannes Frederic Sowa
Acked-by: Hannes Frederic Sowa
Signed-off-by: David S. Miller

Steffen Hurrle
2014-01-19 15:04:16 +0800

15 Jan, 2014

1 commit

63862b5be net: replace macros net_random and net_srandom with direct calls to prandom ... Browse Code »

This patch removes the net_random and net_srandom macros and replaces
them with direct calls to the prandom ones. As new commits only seem to
use prandom_u32 there is no use to keep them around.
This change makes it easier to grep for users of prandom_u32.

Signed-off-by: Aruna-Hewapathirane
Suggested-by: Hannes Frederic Sowa
Acked-by: Hannes Frederic Sowa
Signed-off-by: David S. Miller

Aruna-Hewapathirane
2014-01-15 07:15:25 +0800

07 Jan, 2014

1 commit

56a4342df Merge branch 'master' of git://git.kernel.org/pub/scm/linux/kernel/git/davem/net ... Browse Code »

Conflicts:
drivers/net/ethernet/qlogic/qlcnic/qlcnic_sriov_pf.c
net/ipv6/ip6_tunnel.c
net/ipv6/ip6_vti.c

ipv6 tunnel statistic bug fixes conflicting with consolidation into
generic sw per-cpu net stats.

qlogic conflict between queue counting bug fix and the addition
of multiple MAC address support.

Signed-off-by: David S. Miller

David S. Miller
2014-01-07 06:37:45 +0800

03 Jan, 2014

1 commit

7a7ffbabf ipv4: fix tunneled VM traffic over hw VXLAN/GRE GSO NIC ... Browse Code »
13

VM to VM GSO traffic is broken if it goes through VXLAN or GRE
tunnel and the physical NIC on the host supports hardware VXLAN/GRE
GSO offload (e.g. bnx2x and next-gen mlx4).

Two issues -
(VXLAN) VM traffic has SKB_GSO_DODGY and SKB_GSO_UDP_TUNNEL with
SKB_GSO_TCP/UDP set depending on the inner protocol. GSO header
integrity check fails in udp4_ufo_fragment if inner protocol is
TCP. Also gso_segs is calculated incorrectly using skb->len that
includes tunnel header. Fix: robust check should only be applied
to the inner packet.

(VXLAN & GRE) Once GSO header integrity check passes, NULL segs
is returned and the original skb is sent to hardware. However the
tunnel header is already pulled. Fix: tunnel header needs to be
restored so that hardware can perform GSO properly on the original
packet.

Signed-off-by: Wei-Chun Chao
Signed-off-by: David S. Miller

Wei-Chun Chao
2014-01-03 08:06:47 +0800

20 Dec, 2013

1 commit

1669cb985 Merge branch 'master' of git://git.kernel.org/pub/scm/linux/kernel/git/klassert/ipsec-next ... Browse Code »

Steffen Klassert says:

====================
pull request (net-next): ipsec-next 2013-12-19

1) Use the user supplied policy index instead of a generated one
if present. From Fan Du.

2) Make xfrm migration namespace aware. From Fan Du.

3) Make the xfrm state and policy locks namespace aware. From Fan Du.

4) Remove ancient sleeping when the SA is in acquire state,
we now queue packets to the policy instead. This replaces the
sleeping code.

5) Remove FLOWI_FLAG_CAN_SLEEP. This was used to notify xfrm about the
posibility to sleep. The sleeping code is gone, so remove it.

6) Check user specified spi for IPComp. Thr spi for IPcomp is only
16 bit wide, so check for a valid value. From Fan Du.

7) Export verify_userspi_info to check for valid user supplied spi ranges
with pfkey and netlink. From Fan Du.

8) RFC3173 states that if the total size of a compressed payload and the IPComp
header is not smaller than the size of the original payload, the IP datagram
must be sent in the original non-compressed form. These packets are dropped
by the inbound policy check because they are not transformed. Document the need
to set 'level use' for IPcomp to receive such packets anyway. From Fan Du.

Please pull or let me know if there are problems.
====================

Signed-off-by: David S. Miller

David S. Miller
2013-12-20 07:37:49 +0800

18 Dec, 2013

1 commit

e47eb5dfb udp: ipv4: do not use sk_dst_lock from softirq context ... Browse Code »
19

Using sk_dst_lock from softirq context is not supported right now.

Instead of adding BH protection everywhere,
udp_sk_rx_dst_set() can instead use xchg(), as suggested
by David.

Reported-by: Fengguang Wu
Fixes: 975022310233 ("udp: ipv4: must add synchronization in udp_sk_rx_dst_set()")
Signed-off-by: Eric Dumazet
Signed-off-by: David S. Miller

Eric Dumazet
2013-12-18 03:50:58 +0800

12 Dec, 2013

2 commits

975022310 udp: ipv4: must add synchronization in udp_sk_rx_dst_set() ... Browse Code »

Unlike TCP, UDP input path does not hold the socket lock.

Before messing with sk->sk_rx_dst, we must use a spinlock, otherwise
multiple cpus could leak a refcount.

This patch also takes care of renewing a stale dst entry.
(When the sk->sk_rx_dst would not be used by IP early demux)

Fixes: 421b3885bf6d ("udp: ipv4: Add udp early demux")
Signed-off-by: Eric Dumazet
Cc: Shawn Bohrer
Signed-off-by: David S. Miller

Eric Dumazet
2013-12-12 09:21:10 +0800
610438b74 udp: ipv4: fix potential use after free in udp_v4_early_demux() ... Browse Code »

pskb_may_pull() can reallocate skb->head, we need to move the
initialization of iph and uh pointers after its call.

Fixes: 421b3885bf6d ("udp: ipv4: Add udp early demux")
Signed-off-by: Eric Dumazet
Cc: Shawn Bohrer
Signed-off-by: David S. Miller

Eric Dumazet
2013-12-12 05:10:14 +0800