Eric Lee / smarc-fsl-linux-kernel

19 Jun, 2007

1 commit

281216177 [XFRM]: Fix MTU calculation for non-ESP SAs ... Browse Code »

My IPsec MTU optimization patch introduced a regression in MTU calculation
for non-ESP SAs, the SA's header_len needs to be subtracted from the MTU if
the transform doesn't provide a ->get_mtu() function.

Reported-and-tested-by: Marco Berizzi

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-06-19 13:30:15 +0800

08 Jun, 2007

1 commit

4aa2e62c4 xfrm: Add security check before flushing SAD/SPD ... Browse Code »

Currently we check for permission before deleting entries from SAD and
SPD, (see security_xfrm_policy_delete() security_xfrm_state_delete())
However we are not checking for authorization when flushing the SPD and
the SAD completely. It was perhaps missed in the original security hooks
patch.

This patch adds a security check when flushing entries from the SAD and
SPD. It runs the entire database and checks each entry for a denial.
If the process attempting the flush is unable to remove all of the
entries a denial is logged the the flush function returns an error
without removing anything.

This is particularly useful when a process may need to create or delete
its own xfrm entries used for things like labeled networking but that
same process should not be able to delete other entries or flush the
entire database.

Signed-off-by: Joy Latten
Signed-off-by: Eric Paris
Signed-off-by: James Morris

Joy Latten
2007-06-08 04:42:46 +0800

31 May, 2007

2 commits

aad0e0b9b [XFRM]: xfrm_larval_drop sysctl should be __read_mostly. ... Browse Code »

Signed-off-by: David S. Miller

David S. Miller
2007-05-31 16:23:24 +0800
01e67d08f [XFRM]: Allow XFRM_ACQ_EXPIRES to be tunable via sysctl. ... Browse Code »

Signed-off-by: David S. Miller

David S. Miller
2007-05-31 16:23:23 +0800

25 May, 2007

1 commit

14e50e57a [XFRM]: Allow packet drops during larval state resolution. ... Browse Code »

The current IPSEC rule resolution behavior we have does not work for a
lot of people, even though technically it's an improvement from the
-EAGAIN buisness we had before.

Right now we'll block until the key manager resolves the route. That
works for simple cases, but many folks would rather packets get
silently dropped until the key manager resolves the IPSEC rules.

We can't tell these folks to "set the socket non-blocking" because
they don't have control over the non-block setting of things like the
sockets used to resolve DNS deep inside of the resolver libraries in
libc.

With that in mind I coded up the patch below with some help from
Herbert Xu which provides packet-drop behavior during larval state
resolution, controllable via sysctl and off by default.

This lays the framework to either:

1) Make this default at some point or...

2) Move this logic into xfrm{4,6}_policy.c and implement the
ARP-like resolution queue we've all been dreaming of.
The idea would be to queue packets to the policy, then
once the larval state is resolved by the key manager we
re-resolve the route and push the packets out. The
packets would timeout if the rule didn't get resolved
in a certain amount of time.

Signed-off-by: David S. Miller

David S. Miller
2007-05-25 09:17:54 +0800

23 May, 2007

1 commit

26b8e51e9 [IPSEC]: Fix warnings with casting int to pointer ... Browse Code »

This patch adds some casts to shut up the warnings introduced by my
last patch that added a common interator function for xfrm algorightms.

Signed-off-by: Herbert Xu
Signed-off-by: David S. Miller

Herbert Xu
2007-05-23 07:12:26 +0800

20 May, 2007

1 commit

c92b3a2f1 [IPSEC] pfkey: Load specific algorithm in pfkey_add rather than all ... Browse Code »

This is a natural extension of the changeset

[XFRM]: Probe selected algorithm only.

which only removed the probe call for xfrm_user. This patch does exactly
the same thing for af_key. In other words, we load the algorithm requested
by the user rather than everything when adding xfrm states in af_key.

Signed-off-by: Herbert Xu
Signed-off-by: David S. Miller

Herbert Xu
2007-05-20 05:21:18 +0800

14 May, 2007

2 commits

6253db055 [IPSEC]: Don't warn if high-order hash resize fails ... Browse Code »

Multi-page allocations are always likely to fail. Since such failures
are expected and non-critical in xfrm_hash_alloc, we shouldn't warn about
them.

Signed-off-by: Herbert Xu
Signed-off-by: David S. Miller

Herbert Xu
2007-05-14 17:19:11 +0800
b5505c6e1 [IPSEC]: Check validity of direction in xfrm_policy_byid ... Browse Code »

The function xfrm_policy_byid takes a dir argument but finds the policy
using the index instead. We only use the dir argument to update the
policy count for that direction. Since the user can supply any value
for dir, this can corrupt our policy count.

I know this is the problem because a few days ago I was deleting
policies by hand using indicies and accidentally typed in the wrong
direction. It still deleted the policy and at the time I thought
that was cool. In retrospect it isn't such a good idea :)

I decided against letting it delete the policy anyway just in case
we ever remove the connection between indicies and direction.

Signed-off-by: Herbert Xu
Signed-off-by: David S. Miller

Herbert Xu
2007-05-14 17:15:47 +0800

05 May, 2007

2 commits

5a6d34162 [XFRM] SPD info TLV aggregation ... Browse Code »

Aggregate the SPD info TLVs.

Signed-off-by: Jamal Hadi Salim
Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Jamal Hadi Salim
2007-05-05 03:55:39 +0800
af11e3160 [XFRM] SAD info TLV aggregationx ... Browse Code »

Aggregate the SAD info TLVs.

Signed-off-by: Jamal Hadi Salim
Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Jamal Hadi Salim
2007-05-05 03:55:13 +0800

30 Apr, 2007

1 commit

157bfc250 [XFRM]: Restrict upper layer information by bundle. ... Browse Code »

On MIPv6 usage, XFRM sub policy is enabled.
When main (IPsec) and sub (MIPv6) policy selectors have the same
address set but different upper layer information (i.e. protocol
number and its ports or type/code), multiple bundle should be created.
However, currently we have issue to use the same bundle created for
the first time with all flows covered by the case.

It is useful for the bundle to have the upper layer information
to be restructured correctly if it does not match with the flow.

1. Bundle was created by two policies
Selector from another policy is added to xfrm_dst.
If the flow does not match the selector, it goes to slow path to
restructure new bundle by single policy.

2. Bundle was created by one policy
Flow cache is added to xfrm_dst as originated one. If the flow does
not match the cache, it goes to slow path to try searching another
policy.

Signed-off-by: Masahide NAKAMURA
Signed-off-by: David S. Miller

Masahide NAKAMURA
2007-04-30 15:58:09 +0800

29 Apr, 2007

1 commit

ecfd6b183 [XFRM]: Export SPD info ... Browse Code »

With this patch you can use iproute2 in user space to efficiently see
how many policies exist in different directions.

Signed-off-by: Jamal Hadi Salim
Signed-off-by: David S. Miller

Jamal Hadi Salim
2007-04-29 12:20:32 +0800

28 Apr, 2007

1 commit

1a028e507 [NET]: Revert sk_buff walker cleanups. ... Browse Code »

This reverts eefa3906283a2b60a6d02a2cda593a7d7d7946c5

The simplification made in that change works with the assumption that
the 'offset' parameter to these functions is always positive or zero,
which is not true. It can be and often is negative in order to access
SKB header values in front of skb->data.

Signed-off-by: David S. Miller

David S. Miller
2007-04-28 06:21:23 +0800

27 Apr, 2007

1 commit

566ec0344 [XFRM]: Missing bits to SAD info. ... Browse Code »

This brings the SAD info in sync with net-2.6.22/net-2.6

Signed-off-by: Jamal Hadi Salim
Signed-off-by: David S. Miller

Jamal Hadi Salim
2007-04-27 05:12:15 +0800

26 Apr, 2007

14 commits

eefa39062 [NET]: Clean up sk_buff walkers. ... Browse Code »

I noticed recently that, in skb_checksum(), "offset" and "start" are
essentially the same thing and have the same value throughout the
function, despite being computed differently. Using a single variable
allows some cleanups and makes the skb_checksum() function smaller,
more readable, and presumably marginally faster.

We appear to have many other "sk_buff walker" functions built on the
exact same model, so the cleanup applies to them, too. Here is a list
of the functions I found to be affected:

net/appletalk/ddp.c:atalk_sum_skb()
net/core/datagram.c:skb_copy_datagram_iovec()
net/core/datagram.c:skb_copy_and_csum_datagram()
net/core/skbuff.c:skb_copy_bits()
net/core/skbuff.c:skb_store_bits()
net/core/skbuff.c:skb_checksum()
net/core/skbuff.c:skb_copy_and_csum_bit()
net/core/user_dma.c:dma_skb_copy_datagram_iovec()
net/xfrm/xfrm_algo.c:skb_icv_walk()
net/xfrm/xfrm_algo.c:skb_to_sgvec()

OTOH, I admit I'm a bit surprised, the cleanup is rather obvious so I'm
really wondering if I am missing something. Can anyone please comment
on this?

Signed-off-by: Jean Delvare
Signed-off-by: David S. Miller

Jean Delvare
2007-04-26 15:44:22 +0800
28d8909bc [XFRM]: Export SAD info. ... Browse Code »

On a system with a lot of SAs, counting SAD entries chews useful
CPU time since you need to dump the whole SAD to user space;
i.e something like ip xfrm state ls | grep -i src | wc -l
I have seen taking literally minutes on a 40K SAs when the system
is swapping.
With this patch, some of the SAD info (that was already being tracked)
is exposed to user space. i.e you do:
ip xfrm state count
And you get the count; you can also pass -s to the command line and
get the hash info.

Signed-off-by: Jamal Hadi Salim
Signed-off-by: David S. Miller

Jamal Hadi Salim
2007-04-26 15:10:29 +0800
3ff50b799 [NET]: cleanup extra semicolons ... Browse Code »

Spring cleaning time...

There seems to be a lot of places in the network code that have
extra bogus semicolons after conditionals. Most commonly is a
bogus semicolon after: switch() { }

Signed-off-by: Stephen Hemminger
Signed-off-by: David S. Miller

Stephen Hemminger
2007-04-26 13:29:24 +0800
af65bdfce [NETLINK]: Switch cb_lock spinlock to mutex and allow to override it ... Browse Code »

Switch cb_lock to mutex and allow netlink kernel users to override it
with a subsystem specific mutex for consistent locking in dump callbacks.
All netlink_dump_start users have been audited not to rely on any
side-effects of the previously used spinlock.

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-04-26 13:29:03 +0800
c5c252389 [XFRM]: Optimize MTU calculation ... Browse Code »

Replace the probing based MTU estimation, which usually takes 2-3 iterations
to find a fitting value and may underestimate the MTU, by an exact calculation.

Also fix underestimation of the XFRM trailer_len, which causes unnecessary
reallocations.

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-04-26 13:28:38 +0800
716ea3a7a [NET]: Move generic skbuff stuff from XFRM code to generic code ... Browse Code »

Move generic skbuff stuff from XFRM code to generic code so that
AF_RXRPC can use it too.

The kdoc comments I've attached to the functions needs to be checked
by whoever wrote them as I had to make some guesses about the workings
of these functions.

Signed-off-By: David Howells
Signed-off-by: David S. Miller

David Howells
2007-04-26 13:28:33 +0800
c702e8047 [NETLINK]: Directly return -EINTR from netlink_dump_start() ... Browse Code »

Now that all users of netlink_dump_start() use netlink_run_queue()
to process the receive queue, it is possible to return -EINTR from
netlink_dump_start() directly, therefore simplying the callers.

Signed-off-by: Thomas Graf
Signed-off-by: David S. Miller

Thomas Graf
2007-04-26 13:27:33 +0800
1d00a4eb4 [NETLINK]: Remove error pointer from netlink message handler ... Browse Code »

The error pointer argument in netlink message handlers is used
to signal the special case where processing has to be interrupted
because a dump was started but no error happened. Instead it is
simpler and more clear to return -EINTR and have netlink_run_queue()
deal with getting the queue right.

nfnetlink passed on this error pointer to its subsystem handlers
but only uses it to signal the start of a netlink dump. Therefore
it can be removed there as well.

This patch also cleans up the error handling in the affected
message handlers to be consistent since it had to be touched anyway.

Signed-off-by: Thomas Graf
Signed-off-by: David S. Miller

Thomas Graf
2007-04-26 13:27:30 +0800
45e7ae7f7 [NETLINK]: Ignore control messages directly in netlink_run_queue() ... Browse Code »

Changes netlink_rcv_skb() to skip netlink controll messages and don't
pass them on to the message handler.

Signed-off-by: Thomas Graf
Signed-off-by: David S. Miller

Thomas Graf
2007-04-26 13:27:29 +0800
d35b68564 [NETLINK]: Ignore !NLM_F_REQUEST messages directly in netlink_run_queue() ... Browse Code »

netlink_rcv_skb() is changed to skip messages which don't have the
NLM_F_REQUEST bit to avoid every netlink family having to perform this
check on their own.

Signed-off-by: Thomas Graf
Signed-off-by: David S. Miller

Thomas Graf
2007-04-26 13:27:29 +0800
dc5fc579b [NETLINK]: Use nlmsg_trim() where appropriate ... Browse Code »

Signed-off-by: Arnaldo Carvalho de Melo
Signed-off-by: David S. Miller

Arnaldo Carvalho de Melo
2007-04-26 13:26:37 +0800
27a884dc3 [SK_BUFF]: Convert skb->tail to sk_buff_data_t ... Browse Code »

So that it is also an offset from skb->head, reduces its size from 8 to 4 bytes
on 64bit architectures, allowing us to combine the 4 bytes hole left by the
layer headers conversion, reducing struct sk_buff size to 256 bytes, i.e. 4
64byte cachelines, and since the sk_buff slab cache is SLAB_HWCACHE_ALIGN...
:-)

Many calculations that previously required that skb->{transport,network,
mac}_header be first converted to a pointer now can be done directly, being
meaningful as offsets or pointers.

Signed-off-by: Arnaldo Carvalho de Melo
Signed-off-by: David S. Miller

Arnaldo Carvalho de Melo
2007-04-26 13:26:28 +0800
9c70220b7 [SK_BUFF]: Introduce skb_transport_header(skb) ... Browse Code »

For the places where we need a pointer to the transport header, it is
still legal to touch skb->h.raw directly if just adding to,
subtracting from or setting it to another layer header.

Signed-off-by: Arnaldo Carvalho de Melo
Signed-off-by: David S. Miller

Arnaldo Carvalho de Melo
2007-04-26 13:25:31 +0800
9d729f72d [NET]: Convert xtime.tv_sec to get_seconds() ... Browse Code »

Where appropriate, convert references to xtime.tv_sec to the
get_seconds() helper function.

Signed-off-by: James Morris
Signed-off-by: David S. Miller

James Morris
2007-04-26 13:23:32 +0800

14 Apr, 2007

1 commit

661697f72 [IPSEC] XFRM_USER: kernel panic when large security contexts in ACQUIRE ... Browse Code »

When sending a security context of 50+ characters in an ACQUIRE
message, following kernel panic occurred.

kernel BUG in xfrm_send_acquire at net/xfrm/xfrm_user.c:1781!
cpu 0x3: Vector: 700 (Program Check) at [c0000000421bb2e0]
pc: c00000000033b074: .xfrm_send_acquire+0x240/0x2c8
lr: c00000000033b014: .xfrm_send_acquire+0x1e0/0x2c8
sp: c0000000421bb560
msr: 8000000000029032
current = 0xc00000000fce8f00
paca = 0xc000000000464b00
pid = 2303, comm = ping
kernel BUG in xfrm_send_acquire at net/xfrm/xfrm_user.c:1781!
enter ? for help
3:mon> t
[c0000000421bb650] c00000000033538c .km_query+0x6c/0xec
[c0000000421bb6f0] c000000000337374 .xfrm_state_find+0x7f4/0xb88
[c0000000421bb7f0] c000000000332350 .xfrm_tmpl_resolve+0xc4/0x21c
[c0000000421bb8d0] c0000000003326e8 .xfrm_lookup+0x1a0/0x5b0
[c0000000421bba00] c0000000002e6ea0 .ip_route_output_flow+0x88/0xb4
[c0000000421bbaa0] c0000000003106d8 .ip4_datagram_connect+0x218/0x374
[c0000000421bbbd0] c00000000031bc00 .inet_dgram_connect+0xac/0xd4
[c0000000421bbc60] c0000000002b11ac .sys_connect+0xd8/0x120
[c0000000421bbd90] c0000000002d38d0 .compat_sys_socketcall+0xdc/0x214
[c0000000421bbe30] c00000000000869c syscall_exit+0x0/0x40
--- Exception: c00 (System Call) at 0000000007f0ca9c
SP (fc0ef8f0) is in userspace

We are using size of security context from xfrm_policy to determine
how much space to alloc skb and then putting security context from
xfrm_state into skb. Should have been using size of security context
from xfrm_state to alloc skb. Following fix does that

Signed-off-by: Joy Latten
Acked-by: James Morris
Signed-off-by: David S. Miller

Joy Latten
2007-04-14 07:14:35 +0800

05 Apr, 2007

1 commit

4c4d51a73 [IPSEC]: Reject packets within replay window but outside the bit mask ... Browse Code »

Up until this point we've accepted replay window settings greater than
32 but our bit mask can only accomodate 32 packets. Thus any packet
with a sequence number within the window but outside the bit mask would
be accepted.

This patch causes those packets to be rejected instead.

Signed-off-by: Herbert Xu
Signed-off-by: David S. Miller

Herbert Xu
2007-04-05 15:07:39 +0800

23 Mar, 2007

1 commit

b6f99a211 [NET]: fix up misplaced inlines. ... Browse Code »

Turning up the warnings on gcc makes it emit warnings
about the placement of 'inline' in function declarations.
Here's everything that was under net/

Signed-off-by: Dave Jones
Signed-off-by: David S. Miller

Dave Jones
2007-03-23 03:27:49 +0800

20 Mar, 2007

1 commit

961995582 [XFRM]: ipsecv6 needs a space when printing audit record. ... Browse Code »

This patch adds a space between printing of the src and dst ipv6 addresses.
Otherwise, audit or other test tools may fail to process the audit
record properly because they cannot find the dst address.

Signed-off-by: Joy Latten
Signed-off-by: David S. Miller

Joy Latten
2007-03-20 15:09:47 +0800

13 Mar, 2007

1 commit

75e252d98 [XFRM]: Fix missing protocol comparison of larval SAs. ... Browse Code »

I noticed that in xfrm_state_add we look for the larval SA in a few
places without checking for protocol match. So when using both
AH and ESP, whichever one gets added first, deletes the larval SA.
It seems AH always gets added first and ESP is always the larval
SA's protocol since the xfrm->tmpl has it first. Thus causing the
additional km_query()

Adding the check eliminates accidental double SA creation.

Signed-off-by: Joy Latten
Signed-off-by: David S. Miller

Joy Latten
2007-03-13 08:14:07 +0800

08 Mar, 2007

2 commits

16bec31db [IPSEC]: xfrm audit hook misplaced in pfkey_delete and xfrm_del_sa ... Browse Code »

Inside pfkey_delete and xfrm_del_sa the audit hooks were not called if
there was any permission/security failures in attempting to do the del
operation (such as permission denied from security_xfrm_state_delete).
This patch moves the audit hook to the exit path such that all failures
(and successes) will actually get audited.

Signed-off-by: Eric Paris
Acked-by: Venkat Yekkirala
Acked-by: James Morris
Signed-off-by: David S. Miller

Eric Paris
2007-03-08 08:08:11 +0800
ef41aaa0b [IPSEC]: xfrm_policy delete security check misplaced ... Browse Code »

The security hooks to check permissions to remove an xfrm_policy were
actually done after the policy was removed. Since the unlinking and
deletion are done in xfrm_policy_by* functions this moves the hooks
inside those 2 functions. There we have all the information needed to
do the security check and it can be done before the deletion. Since
auditing requires the result of that security check err has to be passed
back and forth from the xfrm_policy_by* functions.

This patch also fixes a bug where a deletion that failed the security
check could cause improper accounting on the xfrm_policy
(xfrm_get_policy didn't have a put on the exit path for the hold taken
by xfrm_policy_by*)

It also fixes the return code when no policy is found in
xfrm_add_pol_expire. In old code (at least back in the 2.6.18 days) err
wasn't used before the return when no policy is found and so the
initialization would cause err to be ENOENT. But since err has since
been used above when we don't get a policy back from the xfrm_policy_by*
function we would always return 0 instead of the intended ENOENT. Also
fixed some white space damage in the same area.

Signed-off-by: Eric Paris
Acked-by: Venkat Yekkirala
Acked-by: James Morris
Signed-off-by: David S. Miller

Eric Paris
2007-03-08 08:08:09 +0800

01 Mar, 2007

2 commits

b08d5840d [NET]: Fix kfree(skb) ... Browse Code »

Signed-off-by: Patrick McHardy
Acked-by: Paul Moore
Signed-off-by: David S. Miller

Patrick McHardy
2007-03-01 01:42:14 +0800
3a765aa52 [XFRM] xfrm_user: Fix return values of xfrm_add_sa_expire. ... Browse Code »

As noted by Kent Yoder, this function will always return an
error. Make sure it returns zero on success.

Signed-off-by: David S. Miller

David S. Miller
2007-03-01 01:41:57 +0800

14 Feb, 2007

1 commit

928ba4169 [IPSEC]: Fix the address family to refer encap_family ... Browse Code »

Fix the address family to refer encap_family
when comparing with a kernel generated xfrm_state

Signed-off-by: Kazunori MIYAZAWA
Signed-off-by: David S. Miller

Kazunori MIYAZAWA
2007-02-14 04:57:16 +0800

13 Feb, 2007

1 commit

13fcfbb06 [XFRM]: Fix OOPSes in xfrm_audit_log(). ... Browse Code »

Make sure that this function is called correctly, and
add BUG() checking to ensure the arguments are sane.

Based upon a patch by Joy Latten.

Signed-off-by: David S. Miller

David S. Miller
2007-02-13 05:53:54 +0800