Eric Lee / smarc-ti-linux-kernel | Embedian Git Server

12 Apr, 2014

1 commit

676d23690 net: Fix use after free by removing length arg from sk_data_ready callbacks. ... Browse Code »
13

Several spots in the kernel perform a sequence like:

skb_queue_tail(&sk->s_receive_queue, skb);
sk->sk_data_ready(sk, skb->len);

But at the moment we place the SKB onto the socket receive queue it
can be consumed and freed up. So this skb->len access is potentially
to freed up memory.

Furthermore, the skb->len can be modified by the consumer so it is
possible that the value isn't accurate.

And finally, no actual implementation of this callback actually uses
the length argument. And since nobody actually cared about it's
value, lots of call sites pass arbitrary values in such as '0' and
even '1'.

So just remove the length argument from the callback, that way there
is no confusion whatsoever and all of these use-after-free cases get
fixed as a side effect.

Based upon a patch by Eric Dumazet and his suggestion to audit this
issue tree-wide.

Signed-off-by: David S. Miller

David S. Miller
2014-04-12 04:15:36 +0800

11 Mar, 2014

1 commit

9063e21fb netlink: autosize skb lengthes ... Browse Code »
2

One known problem with netlink is the fact that NLMSG_GOODSIZE is
really small on PAGE_SIZE==4096 architectures, and it is difficult
to know in advance what buffer size is used by the application.

This patch adds an automatic learning of the size.

First netlink message will still be limited to ~4K, but if user used
bigger buffers, then following messages will be able to use up to 16KB.

This speedups dump() operations by a large factor and should be safe
for legacy applications.

Signed-off-by: Eric Dumazet
Cc: Thomas Graf
Acked-by: Thomas Graf
Signed-off-by: David S. Miller

Eric Dumazet
2014-03-11 01:56:26 +0800

06 Mar, 2014

1 commit

67ddc87f1 Merge git://git.kernel.org/pub/scm/linux/kernel/git/davem/net ... Browse Code »

Conflicts:
drivers/net/wireless/ath/ath9k/recv.c
drivers/net/wireless/mwifiex/pcie.c
net/ipv6/sit.c

The SIT driver conflict consists of a bug fix being done by hand
in 'net' (missing u64_stats_init()) whilst in 'net-next' a helper
was created (netdev_alloc_pcpu_stats()) which takes care of this.

The two wireless conflicts were overlapping changes.

Signed-off-by: David S. Miller

David S. Miller
2014-03-06 09:32:02 +0800

26 Feb, 2014

1 commit

46833a86f net: Fix permission check in netlink_connect() ... Browse Code »

netlink_sendmsg() was changed to prevent non-root processes from sending
messages with dst_pid != 0.
netlink_connect() however still only checks if nladdr->nl_groups is set.
This patch modifies netlink_connect() to check for the same condition.

Signed-off-by: Mike Pecovnik
Signed-off-by: David S. Miller

Mike Pecovnik
2014-02-26 07:35:14 +0800

18 Feb, 2014

1 commit

23b456729 netlink: fix checkpatch errors space and "foo *bar" ... Browse Code »

ERROR: spaces required and "(foo*)" should be "(foo *)"

Signed-off-by: Wang Yufen
Signed-off-by: David S. Miller

Wang Yufen
2014-02-18 05:57:28 +0800

19 Jan, 2014

1 commit

342dfc306 net: add build-time checks for msg->msg_name size ... Browse Code »

This is a follow-up patch to f3d3342602f8bc ("net: rework recvmsg
handler msg_name and msg_namelen logic").

DECLARE_SOCKADDR validates that the structure we use for writing the
name information to is not larger than the buffer which is reserved
for msg->msg_name (which is 128 bytes). Also use DECLARE_SOCKADDR
consistently in sendmsg code paths.

Signed-off-by: Steffen Hurrle
Suggested-by: Hannes Frederic Sowa
Acked-by: Hannes Frederic Sowa
Signed-off-by: David S. Miller

Steffen Hurrle
2014-01-19 15:04:16 +0800

07 Jan, 2014

3 commits

39b6b2992 Merge branch 'master' of git://git.kernel.org/pub/scm/linux/kernel/git/jesse/openvswitch ... Browse Code »

Jesse Gross says:

====================
[GIT net-next] Open vSwitch

Open vSwitch changes for net-next/3.14. Highlights are:
* Performance improvements in the mechanism to get packets to userspace
using memory mapped netlink and skb zero copy where appropriate.
* Per-cpu flow stats in situations where flows are likely to be shared
across CPUs. Standard flow stats are used in other situations to save
memory and allocation time.
* A handful of code cleanups and rationalization.
====================

Signed-off-by: David S. Miller

David S. Miller
2014-01-07 08:48:38 +0800
aae9f0e22 netlink: Avoid netlink mmap alloc if msg size exceeds frame size ... Browse Code »

An insufficent ring frame size configuration can lead to an
unnecessary skb allocation for every Netlink message. Check frame
size before taking the queue lock and allocating the skb and
re-check with lock to be safe.

Signed-off-by: Thomas Graf
Reviewed-by: Daniel Borkmann
Signed-off-by: Jesse Gross

Thomas Graf
2014-01-07 07:52:06 +0800
bb9b18fb5 genl: Add genlmsg_new_unicast() for unicast message allocation ... Browse Code »
2

Allocates a new sk_buff large enough to cover the specified payload
plus required Netlink headers. Will check receiving socket for
memory mapped i/o capability and use it if enabled. Will fall back
to non-mapped skb if message size exceeds the frame size of the ring.

Signed-of-by: Thomas Graf
Reviewed-by: Daniel Borkmann
Signed-off-by: Jesse Gross

Thomas Graf
2014-01-07 07:51:53 +0800

02 Jan, 2014

1 commit

2173f8d95 netlink: cleanup tap related functions ... Browse Code »

Cleanups in netlink_tap code
* remove unused function netlink_clear_multicast_users
* make local function static

Signed-off-by: Stephen Hemminger
Reviewed-by: Johannes Berg
Signed-off-by: David S. Miller

stephen hemminger
2014-01-02 12:43:36 +0800

01 Jan, 2014

2 commits

604d13c97 netlink: specify netlink packet direction for nlmon ... Browse Code »

In order to facilitate development for netlink protocol dissector,
fill the unused field skb->pkt_type of the cloned skb with a hint
of the address space of the new owner (receiver) socket in the
notion of "to kernel" resp. "to user".

At the time we invoke __netlink_deliver_tap_skb(), we already have
set the new skb owner via netlink_skb_set_owner_r(), so we can use
that for netlink_is_kernel() probing.

In normal PF_PACKET network traffic, this field denotes if the
packet is destined for us (PACKET_HOST), if it's broadcast
(PACKET_BROADCAST), etc.

As we only have 3 bit reserved, we can use the value (= 6) of
PACKET_FASTROUTE as it's _not used_ anywhere in the whole kernel
and not supported anywhere, and packets of such type were never
exposed to user space, so there are no overlapping users of such
kind. Thus, as wished, that seems the only way to make both
PACKET_* values non-overlapping and therefore device agnostic.

By using those two flags for netlink skbs on nlmon devices, they
can be made available and picked up via sll_pkttype (previously
unused in netlink context) in struct sockaddr_ll. We now have
these two directions:

- PACKET_USER (= 6) -> to user space
- PACKET_KERNEL (= 7) -> to kernel space

Partial `ip a` example strace for sa_family=AF_NETLINK with
detected nl msg direction:

syscall: direction:
sendto(3, ...) = 40 /* to kernel */
recvmsg(3, ...) = 3404 /* to user */
recvmsg(3, ...) = 1120 /* to user */
recvmsg(3, ...) = 20 /* to user */
sendto(3, ...) = 40 /* to kernel */
recvmsg(3, ...) = 168 /* to user */
recvmsg(3, ...) = 144 /* to user */
recvmsg(3, ...) = 20 /* to user */

Signed-off-by: Daniel Borkmann
Signed-off-by: Jakub Zawadzki
Signed-off-by: David S. Miller

Daniel Borkmann
2014-01-01 03:31:43 +0800
73bfd370c netlink: only do not deliver to tap when both sides are kernel sks ... Browse Code »

We should also deliver packets to nlmon devices when we are in
netlink_unicast_kernel(), and only one of the {src,dst} sockets
is user sk and the other one kernel sk. That's e.g. the case in
netlink diag, netlink route, etc. Still, forbid to deliver messages
from kernel to kernel sks.

Signed-off-by: Daniel Borkmann
Signed-off-by: Jakub Zawadzki
Signed-off-by: David S. Miller

Daniel Borkmann
2014-01-01 03:31:43 +0800

29 Nov, 2013

2 commits

5e53e689b genetlink/pmcraid: use proper genetlink multicast API ... Browse Code »

The pmcraid driver is abusing the genetlink API and is using its
family ID as the multicast group ID, which is invalid and may
belong to somebody else (and likely will.)

Make it use the correct API, but since this may already be used
as-is by userspace, reserve a family ID for this code and also
reserve that group ID to not break userspace assumptions.

My previous patch broke event delivery in the driver as I missed
that it wasn't using the right API and forgot to update it later
in my series.

While changing this, I noticed that the genetlink code could use
the static group ID instead of a strcmp(), so also do that for
the VFS_DQUOT family.

Cc: Anil Ravindranath
Cc: "James E.J. Bottomley"
Signed-off-by: Johannes Berg
Signed-off-by: David S. Miller

Johannes Berg
2013-11-29 07:26:30 +0800
0f0e2159c genetlink: Fix uninitialized variable in genl_validate_assign_mc_groups() ... Browse Code »

net/netlink/genetlink.c: In function ‘genl_validate_assign_mc_groups’:
net/netlink/genetlink.c:217: warning: ‘err’ may be used uninitialized in this
function

Commit 2a94fe48f32ccf7321450a2cc07f2b724a444e5b ("genetlink: make multicast
groups const, prevent abuse") split genl_register_mc_group() in multiple
functions, but dropped the initialization of err.

Initialize err to zero to fix this.

Signed-off-by: Geert Uytterhoeven
Signed-off-by: David S. Miller

Geert Uytterhoeven
2013-11-29 07:24:07 +0800

22 Nov, 2013

1 commit

220815a96 genetlink: fix genlmsg_multicast() bug ... Browse Code »

Unfortunately, I introduced a tremendously stupid bug into
genlmsg_multicast() when doing all those multicast group
changes: it adjusts the group number, but then passes it
to genlmsg_multicast_netns() which does that again.

Somehow, my tests failed to catch this, so add a warning
into genlmsg_multicast_netns() and remove the offending
group ID adjustment.

Also add a warning to the similar code in other functions
so people who misuse them are more loudly warned.

Signed-off-by: Johannes Berg
Signed-off-by: David S. Miller

Johannes Berg
2013-11-22 02:09:43 +0800

21 Nov, 2013

1 commit

f3d334260 net: rework recvmsg handler msg_name and msg_namelen logic ... Browse Code »
22

This patch now always passes msg->msg_namelen as 0. recvmsg handlers must
set msg_namelen to the proper size
Suggested-by: Eric Dumazet
Signed-off-by: Hannes Frederic Sowa
Signed-off-by: David S. Miller

Hannes Frederic Sowa
2013-11-21 10:52:30 +0800

20 Nov, 2013

8 commits

2a94fe48f genetlink: make multicast groups const, prevent abuse ... Browse Code »

Register generic netlink multicast groups as an array with
the family and give them contiguous group IDs. Then instead
of passing the global group ID to the various functions that
send messages, pass the ID relative to the family - for most
families that's just 0 because the only have one group.

This avoids the list_head and ID in each group, adding a new
field for the mcast group ID offset to the family.

At the same time, this allows us to prevent abusing groups
again like the quota and dropmon code did, since we can now
check that a family only uses a group it owns.

Signed-off-by: Johannes Berg
Signed-off-by: David S. Miller

Johannes Berg
2013-11-20 05:39:06 +0800
68eb55031 genetlink: pass family to functions using groups ... Browse Code »

This doesn't really change anything, but prepares for the
next patch that will change the APIs to pass the group ID
within the family, rather than the global group ID.

Signed-off-by: Johannes Berg
Signed-off-by: David S. Miller

Johannes Berg
2013-11-20 05:39:06 +0800
c2ebb9084 genetlink: remove family pointer from genl_multicast_group ... Browse Code »

There's no reason to have the family pointer there since it
can just be passed internally where needed, so remove it.

Signed-off-by: Johannes Berg
Signed-off-by: David S. Miller

Johannes Berg
2013-11-20 05:39:06 +0800
06fb555a2 genetlink: remove genl_unregister_mc_group() ... Browse Code »

There are no users of this API remaining, and we'll soon
change group registration to be static (like ops are now)

Signed-off-by: Johannes Berg
Signed-off-by: David S. Miller

Johannes Berg
2013-11-20 05:39:06 +0800
2ecf7536b quota/genetlink: use proper genetlink multicast APIs ... Browse Code »

The quota code is abusing the genetlink API and is using
its family ID as the multicast group ID, which is invalid
and may belong to somebody else (and likely will.)

Make the quota code use the correct API, but since this
is already used as-is by userspace, reserve a family ID
for this code and also reserve that group ID to not break
userspace assumptions.

Acked-by: Jan Kara
Signed-off-by: Johannes Berg
Signed-off-by: David S. Miller

Johannes Berg
2013-11-20 05:39:05 +0800
e5dcecba0 drop_monitor/genetlink: use proper genetlink multicast APIs ... Browse Code »

The drop monitor code is abusing the genetlink API and is
statically using the generic netlink multicast group 1, even
if that group belongs to somebody else (which it invariably
will, since it's not reserved.)

Make the drop monitor code use the proper APIs to reserve a
group ID, but also reserve the group id 1 in generic netlink
code to preserve the userspace API. Since drop monitor can
be a module, don't clear the bit for it on unregistration.

Acked-by: Neil Horman
Signed-off-by: Johannes Berg
Signed-off-by: David S. Miller

Johannes Berg
2013-11-20 05:39:05 +0800
c53ed7423 genetlink: only pass array to genl_register_family_with_ops() ... Browse Code »

As suggested by David Miller, make genl_register_family_with_ops()
a macro and pass only the array, evaluating ARRAY_SIZE() in the
macro, this is a little safer.

The openvswitch has some indirection, assing ops/n_ops directly in
that code. This might ultimately just assign the pointers in the
family initializations, saving the struct genl_family_and_ops and
code (once mcast groups are handled differently.)

Signed-off-by: Johannes Berg
Signed-off-by: David S. Miller

Johannes Berg
2013-11-20 05:39:05 +0800
840e93f2e netlink: fix documentation typo in netlink_set_err() ... Browse Code »

The parameter is just 'group', not 'groups', fix the documentation typo.

Signed-off-by: Johannes Berg
Signed-off-by: David S. Miller

Johannes Berg
2013-11-20 04:07:01 +0800

19 Nov, 2013

1 commit

029b234fb genetlink: rename shadowed variable ... Browse Code »

Sparse pointed out that the new flags variable I had added
shadowed an existing one, rename the new one to avoid that,
making the code clearer.

Signed-off-by: Johannes Berg
Signed-off-by: David S. Miller

Johannes Berg
2013-11-19 04:34:00 +0800

16 Nov, 2013

1 commit

568508aa0 genetlink: unify registration functions ... Browse Code »

Now that the ops assignment is just two variables rather than a
long list iteration etc., there's no reason to separately export
__genl_register_family() and __genl_register_family_with_ops().

Unify the two functions into __genl_register_family() and make
genl_register_family_with_ops() call it after assigning the ops.

Signed-off-by: Johannes Berg
Signed-off-by: David S. Miller

Johannes Berg
2013-11-16 09:50:23 +0800

15 Nov, 2013

3 commits

f84f771d9 genetlink: allow making ops const ... Browse Code »

Allow making the ops array const by not modifying the ops
flags on registration but rather only when ops are sent
out in the family information.

No users are updated yet except for the pre_doit/post_doit
calls in wireless (the only ones that exist now.)

Signed-off-by: Johannes Berg
Signed-off-by: David S. Miller

Johannes Berg
2013-11-15 06:10:41 +0800
d91824c08 genetlink: register family ops as array ... Browse Code »

Instead of using a linked list, use an array. This reduces
the data size needed by the users of genetlink, for example
in wireless (net/wireless/nl80211.c) on 64-bit it frees up
over 1K of data space.

Remove the attempted sending of CTRL_CMD_NEWOPS ctrl event
since genl_ctrl_event(CTRL_CMD_NEWOPS, ...) only returns
-EINVAL anyway, therefore no such event could ever be sent.

Signed-off-by: Johannes Berg
Signed-off-by: David S. Miller

Johannes Berg
2013-11-15 06:10:41 +0800
3686ec5e8 genetlink: remove genl_register_ops/genl_unregister_ops ... Browse Code »

genl_register_ops() is still needed for internal registration,
but is no longer available to users of the API.

Signed-off-by: Johannes Berg
Signed-off-by: David S. Miller

Johannes Berg
2013-11-15 06:10:40 +0800

07 Sep, 2013

1 commit

5ffd5cddd net: netlink: filter particular protocols from analyzers ... Browse Code »

Fix finer-grained control and let only a whitelist of allowed netlink
protocols pass, in our case related to networking. If later on, other
subsystems decide they want to add their protocol as well to the list
of allowed protocols they shall simply add it. While at it, we also
need to tell what protocol is in use otherwise BPF_S_ANC_PROTOCOL can
not pick it up (as it's not filled out).

Signed-off-by: Daniel Borkmann
Signed-off-by: David S. Miller

Daniel Borkmann
2013-09-07 02:43:48 +0800

06 Sep, 2013

1 commit

06c54055b Merge git://git.kernel.org/pub/scm/linux/kernel/git/davem/net ... Browse Code »

Conflicts:
drivers/net/ethernet/stmicro/stmmac/stmmac_platform.c
net/bridge/br_multicast.c
net/ipv6/sit.c

The conflicts were minor:

1) sit.c changes overlap with change to ip_tunnel_xmit() signature.

2) br_multicast.c had an overlap between computing max_delay using
msecs_to_jiffies and turning MLDV2_MRC() into an inline function
with a name using lowercase instead of uppercase letters.

3) stmmac had two overlapping changes, one which conditionally allocated
and hooked up a dma_cfg based upon the presence of the pbl OF property,
and another one handling store-and-forward DMA made. The latter of
which should not go into the new of_find_property() basic block.

Signed-off-by: David S. Miller

David S. Miller
2013-09-06 02:58:52 +0800

29 Aug, 2013

2 commits

33c6b1f6b genl: Hold reference on correct module while netlink-dump. ... Browse Code »

netlink dump operations take module as parameter to hold
reference for entire netlink dump duration.
Currently it holds ref only on genl module which is not correct
when we use ops registered to genl from another module.
Following patch adds module pointer to genl_ops so that netlink
can hold ref count on it.

CC: Jesse Gross
CC: Johannes Berg
Signed-off-by: Pravin B Shelar
Signed-off-by: David S. Miller

Pravin B Shelar
2013-08-29 05:19:17 +0800
9b96309c5 genl: Fix genl dumpit() locking. ... Browse Code »

In case of genl-family with parallel ops off, dumpif() callback
is expected to run under genl_lock, But commit def3117493eafd9df
(genl: Allow concurrent genl callbacks.) changed this behaviour
where only first dumpit() op was called under genl-lock.
For subsequent dump, only nlk->cb_lock was taken.
Following patch fixes it by defining locked dumpit() and done()
callback which takes care of genl-locking.

CC: Jesse Gross
CC: Johannes Berg
Signed-off-by: Pravin B Shelar
Signed-off-by: David S. Miller

Pravin B Shelar
2013-08-29 05:19:17 +0800

27 Aug, 2013

1 commit

b05930f5d Merge git://git.kernel.org/pub/scm/linux/kernel/git/davem/net ... Browse Code »

Conflicts:
drivers/net/wireless/iwlwifi/pcie/trans.c
include/linux/inetdevice.h

The inetdevice.h conflict involves moving the IPV4_DEVCONF values
into a UAPI header, overlapping additions of some new entries.

The iwlwifi conflict is a context overlap.

Signed-off-by: David S. Miller

David S. Miller
2013-08-27 04:37:08 +0800

23 Aug, 2013

1 commit

9d47b3805 Revert "genetlink: fix family dump race" ... Browse Code »

This reverts commit 58ad436fcf49810aa006016107f494c9ac9013db.

It turns out that the change introduced a potential deadlock
by causing a locking dependency with netlink's cb_mutex. I
can't seem to find a way to resolve this without doing major
changes to the locking, so revert this.

Signed-off-by: Johannes Berg
Acked-by: Pravin B Shelar
Signed-off-by: David S. Miller

Johannes Berg
2013-08-23 04:24:02 +0800

17 Aug, 2013

1 commit

2ff1cf12c Merge git://git.kernel.org/pub/scm/linux/kernel/git/davem/net Browse Code »

David S. Miller
2013-08-17 06:37:26 +0800

16 Aug, 2013

1 commit

16b304f34 netlink: Eliminate kmalloc in netlink dump operation. ... Browse Code »
2

Following patch stores struct netlink_callback in netlink_sock
to avoid allocating and freeing it on every netlink dump msg.
Only one dump operation is allowed for a given socket at a time
therefore we can safely convert cb pointer to cb struct inside
netlink_sock.

Signed-off-by: Pravin B Shelar
Signed-off-by: David S. Miller

Pravin B Shelar
2013-08-16 06:51:20 +0800

13 Aug, 2013

1 commit

58ad436fc genetlink: fix family dump race ... Browse Code »

When dumping generic netlink families, only the first dump call
is locked with genl_lock(), which protects the list of families,
and thus subsequent calls can access the data without locking,
racing against family addition/removal. This can cause a crash.
Fix it - the locking needs to be conditional because the first
time around it's already locked.

A similar bug was reported to me on an old kernel (3.4.47) but
the exact scenario that happened there is no longer possible,
on those kernels the first round wasn't locked either. Looking
at the current code I found the race described above, which had
also existed on the old kernel.

Cc: stable@vger.kernel.org
Reported-by: Andrei Otcheretianski
Signed-off-by: Johannes Berg
Signed-off-by: David S. Miller

Johannes Berg
2013-08-13 15:57:06 +0800

04 Aug, 2013

1 commit

0e76a3a58 Merge git://git.kernel.org/pub/scm/linux/kernel/git/davem/net ... Browse Code »

Merge net into net-next to setup some infrastructure Eric
Dumazet needs for usbnet changes.

Signed-off-by: David S. Miller

David S. Miller
2013-08-04 12:36:46 +0800

03 Aug, 2013

1 commit

8a849bb7f net: netlink: minor: remove unused pointer in alloc_pg_vec ... Browse Code »

Variable ptr is being assigned, but never used, so just remove it.

Signed-off-by: Daniel Borkmann
Signed-off-by: David S. Miller

Daniel Borkmann
2013-08-03 06:26:12 +0800