Eric Lee / smarc-fsl-linux-kernel

11 Oct, 2007

35 commits

cd40b7d39 [NET]: make netlink user -> kernel interface synchronious ... Browse Code »

This patch make processing netlink user -> kernel messages synchronious.
This change was inspired by the talk with Alexey Kuznetsov about current
netlink messages processing. He says that he was badly wrong when introduced
asynchronious user -> kernel communication.

The call netlink_unicast is the only path to send message to the kernel
netlink socket. But, unfortunately, it is also used to send data to the
user.

Before this change the user message has been attached to the socket queue
and sk->sk_data_ready was called. The process has been blocked until all
pending messages were processed. The bad thing is that this processing
may occur in the arbitrary process context.

This patch changes nlk->data_ready callback to get 1 skb and force packet
processing right in the netlink_unicast.

Kernel -> user path in netlink_unicast remains untouched.

EINTR processing for in netlink_run_queue was changed. It forces rtnl_lock
drop, but the process remains in the cycle until the message will be fully
processed. So, there is no need to use this kludges now.

Signed-off-by: Denis V. Lunev
Acked-by: Alexey Kuznetsov
Signed-off-by: David S. Miller

Denis V. Lunev
2007-10-11 12:15:29 +0800
b7c6538cd [IPSEC]: Move state lock into x->type->output ... Browse Code »

This patch releases the lock on the state before calling x->type->output.
It also adds the lock to the spots where they're currently needed.

Most of those places (all except mip6) are expected to disappear with
async crypto.

Signed-off-by: Herbert Xu
Signed-off-by: David S. Miller

Herbert Xu
2007-10-11 07:55:03 +0800
050f009e1 [IPSEC]: Lock state when copying non-atomic fields to user-space ... Browse Code »

This patch adds locking so that when we're copying non-atomic fields such as
life-time or coaddr to user-space we don't get a partial result.

For af_key I've changed every instance of pfkey_xfrm_state2msg apart from
expiration notification to include the keys and life-times. This is in-line
with XFRM behaviour.

The actual cases affected are:

* pfkey_getspi: No change as we don't have any keys to copy.
* key_notify_sa:
+ ADD/UPD: This wouldn't work otherwise.
+ DEL: It can't hurt.

Signed-off-by: Herbert Xu
Signed-off-by: David S. Miller

Herbert Xu
2007-10-11 07:55:02 +0800
68325d3b1 [XFRM] user: Move attribute copying code into copy_to_user_state_extra ... Browse Code »

Here's a good example of code duplication leading to code rot. The
notification patch did its own netlink message creation for xfrm states.
It duplicated code that was already in dump_one_state. Guess what, the
next time (and the time after) when someone updated dump_one_state the
notification path got zilch.

This patch moves that code from dump_one_state to copy_to_user_state_extra
and uses it in xfrm_notify_sa too. Unfortunately whoever updates this
still needs to update xfrm_sa_len since the notification path wants to
know the exact size for allocation.

At least I've added a comment saying so and if someone still forgest, we'll
have a WARN_ON telling us so.

I also changed the security size calculation to use xfrm_user_sec_ctx since
that's what we actually put into the skb. However it makes no practical
difference since it has the same size as xfrm_sec_ctx.

Signed-off-by: Herbert Xu
Signed-off-by: David S. Miller

Herbert Xu
2007-10-11 07:55:02 +0800
658b219e9 [IPSEC]: Move common code into xfrm_alloc_spi ... Browse Code »

This patch moves some common code that conceptually belongs to the xfrm core
from af_key/xfrm_user into xfrm_alloc_spi.

In particular, the spin lock on the state is now taken inside xfrm_alloc_spi.
Previously it also protected the construction of the response PF_KEY/XFRM
messages to user-space. This is inconsistent as other identical constructions
are not protected by the state lock. This is bad because they in fact should
be protected but only in certain spots (so as not to hold the lock for too
long which may cause packet drops).

The SPI byte order conversion has also been moved.

Signed-off-by: Herbert Xu
Signed-off-by: David S. Miller

Herbert Xu
2007-10-11 07:55:01 +0800
75ba28c63 [IPSEC]: Remove gratuitous km wake-up events on ACQUIRE ... Browse Code »

There is no point in waking people up when creating/updating larval states
because they'll just go back to sleep again as larval states by definition
cannot be found by xfrm_state_find.

We should only wake them up when the larvals mature or die.

Signed-off-by: Herbert Xu
Signed-off-by: David S. Miller

Herbert Xu
2007-10-11 07:55:01 +0800
007f0211a [IPSEC]: Store IPv6 nh pointer in mac_header on output ... Browse Code »

Current the x->mode->output functions store the IPv6 nh pointer in the
skb network header. This is inconvenient because the network header then
has to be fixed up before the packet can leave the IPsec stack. The mac
header field is unused on output so we can use that to store this instead.

This patch does that and removes the network header fix-up in xfrm_output.

It also uses ipv6_hdr where appropriate in the x->type->output functions.

There is also a minor clean-up in esp4 to make it use the same code as
esp6 to help any subsequent effort to merge the two.

Lastly it kills two redundant skb_set_* statements in BEET that were
simply copied over from transport mode.

Signed-off-by: Herbert Xu
Signed-off-by: David S. Miller

Herbert Xu
2007-10-11 07:55:00 +0800
1ecafede8 [IPSEC]: Remove bogus ref count in xfrm_secpath_reject ... Browse Code »

Constructs of the form

xfrm_state_hold(x);
foo(x);
xfrm_state_put(x);

tend to be broken because foo is either synchronous where this is totally
unnecessary or if foo is asynchronous then the reference count is in the
wrong spot.

In the case of xfrm_secpath_reject, the function is synchronous and therefore
we should just kill the reference count.

Signed-off-by: Herbert Xu
Signed-off-by: David S. Miller

Herbert Xu
2007-10-11 07:54:59 +0800
45b17f48e [IPSEC]: Move RO-specific output code into xfrm6_mode_ro.c ... Browse Code »

The lastused update check in xfrm_output can be done just as well in
the mode output function which is specific to RO.

Signed-off-by: Herbert Xu
Signed-off-by: David S. Miller

Herbert Xu
2007-10-11 07:54:56 +0800
cdf7e668d [IPSEC]: Unexport xfrm_replay_notify ... Browse Code »

Now that the only callers of xfrm_replay_notify are in xfrm, we can remove
the export.

This patch also removes xfrm_aevent_doreplay since it's now called in just
one spot.

Signed-off-by: Herbert Xu
Signed-off-by: David S. Miller

Herbert Xu
2007-10-11 07:54:55 +0800
436a0a402 [IPSEC]: Move output replay code into xfrm_output ... Browse Code »

The replay counter is one of only two remaining things in the output code
that requires a lock on the xfrm state (the other being the crypto). This
patch moves it into the generic xfrm_output so we can remove the lock from
the transforms themselves.

Signed-off-by: Herbert Xu
Signed-off-by: David S. Miller

Herbert Xu
2007-10-11 07:54:54 +0800
83815dea4 [IPSEC]: Move xfrm_state_check into xfrm_output.c ... Browse Code »

The functions xfrm_state_check and xfrm_state_check_space are only used by
the output code in xfrm_output.c so we can move them over.

Signed-off-by: Herbert Xu
Signed-off-by: David S. Miller

Herbert Xu
2007-10-11 07:54:54 +0800
406ef77c8 [IPSEC]: Move common output code to xfrm_output ... Browse Code »

Most of the code in xfrm4_output_one and xfrm6_output_one are identical so
this patch moves them into a common xfrm_output function which will live
in net/xfrm.

In fact this would seem to fix a bug as on IPv4 we never reset the network
header after a transform which may upset netfilter later on.

Signed-off-by: Herbert Xu
Signed-off-by: David S. Miller

Herbert Xu
2007-10-11 07:54:53 +0800
2774c7aba [NET]: Make the loopback device per network namespace. ... Browse Code »

This patch makes loopback_dev per network namespace. Adding
code to create a different loopback device for each network
namespace and adding the code to free a loopback device
when a network namespace exits.

This patch modifies all users the loopback_dev so they
access it as init_net.loopback_dev, keeping all of the
code compiling and working. A later pass will be needed to
update the users to use something other than the initial network
namespace.

Signed-off-by: Eric W. Biederman
Signed-off-by: David S. Miller

Eric W. Biederman
2007-10-11 07:52:49 +0800
de3cb747f [NET]: Dynamically allocate the loopback device, part 1. ... Browse Code »

This patch replaces all occurences to the static variable
loopback_dev to a pointer loopback_dev. That provides the
mindless, trivial, uninteressting change part for the dynamic
allocation for the loopback.

Signed-off-by: Eric W. Biederman
Signed-off-by: Daniel Lezcano
Acked-By: Kirill Korotaev
Acked-by: Benjamin Thery
Signed-off-by: David S. Miller

Daniel Lezcano
2007-10-11 07:52:14 +0800
0cfad0755 [NETLINK]: Avoid pointer in netlink_run_queue ... Browse Code »

I was looking at Patrick's fix to inet_diag and it occured
to me that we're using a pointer argument to return values
unnecessarily in netlink_run_queue. Changing it to return
the value will allow the compiler to generate better code
since the value won't have to be memory-backed.

Signed-off-by: Herbert Xu
Signed-off-by: David S. Miller

Herbert Xu
2007-10-11 07:51:24 +0800
b4b510290 [NET]: Support multiple network namespaces with netlink ... Browse Code »

Each netlink socket will live in exactly one network namespace,
this includes the controlling kernel sockets.

This patch updates all of the existing netlink protocols
to only support the initial network namespace. Request
by clients in other namespaces will get -ECONREFUSED.
As they would if the kernel did not have the support for
that netlink protocol compiled in.

As each netlink protocol is updated to be multiple network
namespace safe it can register multiple kernel sockets
to acquire a presence in the rest of the network namespaces.

The implementation in af_netlink is a simple filter implementation
at hash table insertion and hash table look up time.

Signed-off-by: Eric W. Biederman
Signed-off-by: David S. Miller

Eric W. Biederman
2007-10-11 07:49:09 +0800
e9dc86534 [NET]: Make device event notification network namespace safe ... Browse Code »

Every user of the network device notifiers is either a protocol
stack or a pseudo device. If a protocol stack that does not have
support for multiple network namespaces receives an event for a
device that is not in the initial network namespace it quite possibly
can get confused and do the wrong thing.

To avoid problems until all of the protocol stacks are converted
this patch modifies all netdev event handlers to ignore events on
devices that are not in the initial network namespace.

As the rest of the code is made network namespace aware these
checks can be removed.

Signed-off-by: Eric W. Biederman
Signed-off-by: David S. Miller

Eric W. Biederman
2007-10-11 07:49:09 +0800
ab5f5e8b1 [XFRM]: xfrm audit calls ... Browse Code »

This patch modifies the current ipsec audit layer
by breaking it up into purpose driven audit calls.

So far, the only audit calls made are when add/delete
an SA/policy. It had been discussed to give each
key manager it's own calls to do this, but I found
there to be much redundnacy since they did the exact
same things, except for how they got auid and sid, so I
combined them. The below audit calls can be made by any
key manager. Hopefully, this is ok.

Signed-off-by: Joy Latten
Signed-off-by: David S. Miller

Joy Latten
2007-10-11 07:49:02 +0800
f7944fb19 [XFRM] policy: Replace magic number with XFRM_POLICY_OUT ... Browse Code »

Signed-off-by: Thomas Graf
Signed-off-by: David S. Miller

Thomas Graf
2007-10-11 07:48:34 +0800
fd21150a0 [XFRM] netlink: Inline attach_encap_tmpl(), attach_sec_ctx(), and attach_one_addr() ... Browse Code »

These functions are only used once and are a lot easier to understand if
inlined directly into the function.

Fixes by Masahide NAKAMURA.

Signed-off-by: Thomas Graf
Signed-off-by: David S. Miller

Thomas Graf
2007-10-11 07:48:26 +0800
15901a274 [XFRM] netlink: Remove dependency on rtnetlink ... Browse Code »

Signed-off-by: Thomas Graf
Signed-off-by: David S. Miller

Thomas Graf
2007-10-11 07:48:25 +0800
5424f32e4 [XFRM] netlink: Use nlattr instead of rtattr ... Browse Code »

Signed-off-by: Thomas Graf
Signed-off-by: David S. Miller

Thomas Graf
2007-10-11 07:48:25 +0800
35a7aa08b [XFRM] netlink: Rename attribute array from xfrma[] to attrs[] ... Browse Code »

Increases readability a lot.

Signed-off-by: Thomas Graf
Signed-off-by: David S. Miller

Thomas Graf
2007-10-11 07:48:24 +0800
fab448991 [XFRM] netlink: Enhance indexing of the attribute array ... Browse Code »

nlmsg_parse() puts attributes at array[type] so the indexing
method can be simpilfied by removing the obscuring "- 1".

Signed-off-by: Thomas Graf
Signed-off-by: David S. Miller

Thomas Graf
2007-10-11 07:48:23 +0800
cf5cb79f6 [XFRM] netlink: Establish an attribute policy ... Browse Code »

Adds a policy defining the minimal payload lengths for all the attributes
allowing for most attribute validation checks to be removed from in
the middle of the code path. Makes updates more consistent as many format
errors are recognised earlier, before any changes have been attempted.

Signed-off-by: Thomas Graf
Signed-off-by: David S. Miller

Thomas Graf
2007-10-11 07:48:23 +0800
a7bd9a45c [XFRM] netlink: Use nlmsg_parse() to parse attributes ... Browse Code »

Uses nlmsg_parse() to parse the attributes. This actually changes
behaviour as unknown attributes (type > MAXTYPE) no longer cause
an error. Instead unknown attributes will be ignored henceforth
to keep older kernels compatible with more recent userspace tools.

Signed-off-by: Thomas Graf
Signed-off-by: David S. Miller

Thomas Graf
2007-10-11 07:48:22 +0800
7deb22649 [XFRM] netlink: Use nlmsg_new() and type-safe size calculation helpers ... Browse Code »

Moves all complex message size calculation into own inlined helper
functions and makes use of the type-safe netlink interface.

Using nlmsg_new() simplifies the calculation itself as it takes care
of the netlink header length by itself.

Signed-off-by: Thomas Graf
Signed-off-by: David S. Miller

Thomas Graf
2007-10-11 07:48:22 +0800
cfbfd45a8 [XFRM] netlink: Clear up some of the CONFIG_XFRM_SUB_POLICY ifdef mess ... Browse Code »

Moves all of the SUB_POLICY ifdefs related to the attribute size
calculation into a function.

Signed-off-by: Thomas Graf
Signed-off-by: David S. Miller

Thomas Graf
2007-10-11 07:48:21 +0800
c26445acb [XFRM] netlink: Move algorithm length calculation to its own function ... Browse Code »

Adds alg_len() to calculate the properly padded length of an
algorithm attribute to simplify the code.

Signed-off-by: Thomas Graf
Signed-off-by: David S. Miller

Thomas Graf
2007-10-11 07:48:21 +0800
c0144beae [XFRM] netlink: Use nla_put()/NLA_PUT() variantes ... Browse Code »

Also makes use of copy_sec_ctx() in another place and removes
duplicated code.

Signed-off-by: Thomas Graf
Signed-off-by: David S. Miller

Thomas Graf
2007-10-11 07:48:20 +0800
082a1ad57 [XFRM] netlink: Use nlmsg_broadcast() and nlmsg_unicast() ... Browse Code »

This simplifies successful return codes from >0 to 0.

Signed-off-by: Thomas Graf
Signed-off-by: David S. Miller

Thomas Graf
2007-10-11 07:48:20 +0800
7b67c8575 [XFRM] netlink: Use nlmsg_data() instead of NLMSG_DATA() ... Browse Code »

Signed-off-by: Thomas Graf
Signed-off-by: David S. Miller

Thomas Graf
2007-10-11 07:48:19 +0800
9825069d0 [XFRM] netlink: Use nlmsg_end() and nlmsg_cancel() ... Browse Code »

Signed-off-by: Thomas Graf
Signed-off-by: David S. Miller

Thomas Graf
2007-10-11 07:48:18 +0800
79b8b7f4a [XFRM] netlink: Use nlmsg_put() instead of NLMSG_PUT() ... Browse Code »

Signed-off-by: Thomas Graf
Signed-off-by: David S. Miller

Thomas Graf
2007-10-11 07:48:18 +0800

14 Aug, 2007

1 commit

b5890d8ba [XFRM]: Clean up duplicate includes in net/xfrm/ ... Browse Code »

This patch cleans up duplicate includes in
net/xfrm/

Signed-off-by: Jesper Juhl
Signed-off-by: Andrew Morton
Signed-off-by: David S. Miller

Jesper Juhl
2007-08-14 13:52:08 +0800

02 Aug, 2007

1 commit

e6e0871cc Net/Security: fix memory leaks from security_secid_to_secctx() ... Browse Code »

The security_secid_to_secctx() function returns memory that must be freed
by a call to security_release_secctx() which was not always happening. This
patch fixes two of these problems (all that I could find in the kernel source
at present).

Signed-off-by: Paul Moore
Acked-by: Stephen Smalley
Signed-off-by: James Morris

Paul Moore
2007-08-02 23:52:26 +0800

31 Jul, 2007

2 commits

48b8d7831 [XFRM]: State selection update to use inner addresses. ... Browse Code »

This patch modifies the xfrm state selection logic to use the inner
addresses where the outer have been (incorrectly) used. This is
required for beet mode in general and interfamily setups in both
tunnel and beet mode.

Signed-off-by: Joakim Koskela
Signed-off-by: Herbert Xu
Signed-off-by: Diego Beltrami
Signed-off-by: Miika Komu
Acked-by: Patrick McHardy
Signed-off-by: David S. Miller

Joakim Koskela
2007-07-31 17:28:33 +0800
196b00362 [IPSEC]: Ensure that state inner family is set ... Browse Code »

Similar to the issue we had with template families which
specified the inner families of policies, we need to set
the inner families of states as the main xfrm user Openswan
leaves it as zero.

af_key is unaffected because the inner family is set by it
and not the KM.

Signed-off-by: Herbert Xu
Signed-off-by: David S. Miller

Herbert Xu
2007-07-31 17:28:32 +0800

20 Jul, 2007

1 commit

20c2df83d mm: Remove slab destructors from kmem_cache_create(). ... Browse Code »

Slab destructors were no longer supported after Christoph's
c59def9f222d44bb7e2f0a559f2906191a0862d7 change. They've been
BUGs for both slab and slub, and slob never supported them
either.

This rips out support for the dtor pointer from kmem_cache_create()
completely and fixes up every single callsite in the kernel (there were
about 224, not including the slab allocator definitions themselves,
or the documentation references).

Signed-off-by: Paul Mundt

Paul Mundt
2007-07-20 09:11:58 +0800