Eric Lee / smarc-fsl-linux-kernel

04 Jun, 2008

1 commit

bc3ed28ca netlink: Improve returned error codes ... Browse Code »

Make nlmsg_trim(), nlmsg_cancel(), genlmsg_cancel(), and
nla_nest_cancel() void functions.

Return -EMSGSIZE instead of -1 if the provided message buffer is not
big enough.

Signed-off-by: Thomas Graf
Signed-off-by: David S. Miller

Thomas Graf
2008-06-04 07:36:54 +0800

28 Apr, 2008

1 commit

2532386f4 Audit: collect sessionid in netlink messages ... Browse Code »

Previously I added sessionid output to all audit messages where it was
available but we still didn't know the sessionid of the sender of
netlink messages. This patch adds that information to netlink messages
so we can audit who sent netlink messages.

Signed-off-by: Eric Paris
Signed-off-by: Al Viro

Eric Paris
2008-04-28 18:18:03 +0800

19 Apr, 2008

2 commits

3925e6fc1 Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/jmorri… ... Browse Code »

…s/security-testing-2.6

* 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/jmorris/security-testing-2.6:
security: fix up documentation for security_module_enable
Security: Introduce security= boot parameter
Audit: Final renamings and cleanup
SELinux: use new audit hooks, remove redundant exports
Audit: internally use the new LSM audit hooks
LSM/Audit: Introduce generic Audit LSM hooks
SELinux: remove redundant exports
Netlink: Use generic LSM hook
Audit: use new LSM hooks instead of SELinux exports
SELinux: setup new inode/ipc getsecid hooks
LSM: Introduce inode_getsecid and ipc_getsecid hooks

Linus Torvalds
2008-04-19 09:18:30 +0800
0ce784ca7 Netlink: Use generic LSM hook ... Browse Code »

Don't use SELinux exported selinux_get_task_sid symbol.
Use the generic LSM equivalent instead.

Signed-off-by: Casey Schaufler
Signed-off-by: Ahmed S. Darwish
Acked-by: James Morris
Acked-by: David S. Miller
Reviewed-by: Paul Moore

Ahmed S. Darwish
2008-04-19 07:52:35 +0800

26 Mar, 2008

3 commits

878628fbf [NET] NETNS: Omit namespace comparision without CONFIG_NET_NS. ... Browse Code »

Introduce an inline net_eq() to compare two namespaces.
Without CONFIG_NET_NS, since no namespace other than &init_net
exists, it is always 1.

We do not need to convert 1) inline vs inline and
2) inline vs &init_net comparisons.

Signed-off-by: YOSHIFUJI Hideaki

YOSHIFUJI Hideaki
2008-03-26 03:40:00 +0800
1218854af [NET] NETNS: Omit seq_net_private->net without CONFIG_NET_NS. ... Browse Code »

Without CONFIG_NET_NS, no namespace other than &init_net exists,
no need to store net in seq_net_private.

Signed-off-by: YOSHIFUJI Hideaki

YOSHIFUJI Hideaki
2008-03-26 03:39:56 +0800
3b1e0a655 [NET] NETNS: Omit sock->sk_net without CONFIG_NET_NS. ... Browse Code »

Introduce per-sock inlines: sock_net(), sock_net_set()
and per-inet_timewait_sock inlines: twsk_net(), twsk_net_set().
Without CONFIG_NET_NS, no namespace other than &init_net exists.
Let's explicitly define them to help compiler optimizations.

Signed-off-by: YOSHIFUJI Hideaki

YOSHIFUJI Hideaki
2008-03-26 03:39:55 +0800

22 Mar, 2008

1 commit

b1153f29e netlink: make socket filters work on netlink ... Browse Code »

Make socket filters work for netlink unicast and notifications.
This is useful for applications like Zebra that get overrun with
messages that are then ignored.

Note: netlink messages are in host byte order, but packet filter
state machine operations are done as network byte order.

Signed-off-by: Stephen Hemminger
Signed-off-by: David S. Miller

Stephen Hemminger
2008-03-22 06:46:12 +0800

01 Mar, 2008

2 commits

edf020870 [NET]: Make netlink_kernel_release publically available as sk_release_kernel. ... Browse Code »

This staff will be needed for non-netlink kernel sockets, which should
also not pin a namespace like tcp_socket and icmp_socket.

Signed-off-by: Denis V. Lunev
Acked-by: Daniel Lezcano
Signed-off-by: David S. Miller

Denis V. Lunev
2008-03-01 03:18:32 +0800
9dfbec1fb [NETLINK]: No need for a separate __netlink_release call. ... Browse Code »

Merge it to netlink_kernel_release.

Signed-off-by: Denis V. Lunev
Acked-by: Daniel Lezcano
Signed-off-by: David S. Miller

Denis V. Lunev
2008-03-01 03:17:56 +0800

13 Feb, 2008

1 commit

910d6c320 [GENETLINK]: Relax dances with genl_lock. ... Browse Code »

The genl_unregister_family() calls the genl_unregister_mc_groups(),
which takes and releases the genl_lock and then locks and releases
this lock itself.

Relax this behavior, all the more so the genl_unregister_mc_groups()
is called from genl_unregister_family() only.

Signed-off-by: Pavel Emelyanov
Signed-off-by: David S. Miller

Pavel Emelyanov
2008-02-13 14:16:33 +0800

02 Feb, 2008

1 commit

0c11b9428 [PATCH] switch audit_get_loginuid() to task_struct * ... Browse Code »

all callers pass something->audit_context

Signed-off-by: Al Viro

Al Viro
2008-02-02 03:04:59 +0800

01 Feb, 2008

1 commit

23fe18669 [NETNS]: Fix race between put_net() and netlink_kernel_create(). ... Browse Code »

The comment about "race free view of the set of network
namespaces" was a bit hasty. Look (there even can be only
one CPU, as discovered by Alexey Dobriyan and Denis Lunev):

put_net()
if (atomic_dec_and_test(&net->refcnt))
/* true */
__put_net(net);
queue_work(...);

/*
* note: the net now has refcnt 0, but still in
* the global list of net namespaces
*/

== re-schedule ==

register_pernet_subsys(&some_ops);
register_pernet_operations(&some_ops);
(*some_ops)->init(net);
/*
* we call netlink_kernel_create() here
* in some places
*/
netlink_kernel_create();
sk_alloc();
get_net(net); /* refcnt = 1 */
/*
* now we drop the net refcount not to
* block the net namespace exit in the
* future (or this can be done on the
* error path)
*/
put_net(sk->sk_net);
if (atomic_dec_and_test(&...))
/*
* true. BOOOM! The net is
* scheduled for release twice
*/

When thinking on this problem, I decided, that getting and
putting the net in init callback is wrong. If some init
callback needs to have a refcount-less reference on the struct
net, _it_ has to be careful himself, rather than relying on
the infrastructure to handle this correctly.

In case of netlink_kernel_create(), the problem is that the
sk_alloc() gets the given namespace, but passing the info
that we don't want to get it inside this call is too heavy.

Instead, I propose to crate the socket inside an init_net
namespace and then re-attach it to the desired one right
after the socket is created.

After doing this, we also have to be careful on error paths
not to drop the reference on the namespace, we didn't get
the one on.

Signed-off-by: Pavel Emelyanov
Acked-by: Denis Lunev
Signed-off-by: David S. Miller

Pavel Emelyanov
2008-02-01 11:27:22 +0800

29 Jan, 2008

9 commits

01480e1cf [NETLINK]: Add nla_append() ... Browse Code »

Used to append data to a message without a header or padding.

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2008-01-29 07:11:09 +0800
775516bfa [NETNS]: Namespace stop vs 'ip r l' race. ... Browse Code »

During network namespace stop process kernel side netlink sockets
belonging to a namespace should be closed. They should not prevent
namespace to stop, so they do not increment namespace usage
counter. Though this counter will be put during last sock_put.

The raplacement of the correct netns for init_ns solves the problem
only partial as socket to be stoped until proper stop is a valid
netlink kernel socket and can be looked up by the user processes. This
is not a problem until it resides in initial namespace (no processes
inside this net), but this is not true for init_net.

So, hold the referrence for a socket, remove it from lookup tables and
only after that change namespace and perform a last put.

Signed-off-by: Denis V. Lunev
Tested-by: Alexey Dobriyan
Signed-off-by: David S. Miller

Denis V. Lunev
2008-01-29 07:08:08 +0800
b7c6ba6eb [NETNS]: Consolidate kernel netlink socket destruction. ... Browse Code »

Create a specific helper for netlink kernel socket disposal. This just
let the code look better and provides a ground for proper disposal
inside a namespace.

Signed-off-by: Denis V. Lunev
Tested-by: Alexey Dobriyan
Signed-off-by: David S. Miller

Denis V. Lunev
2008-01-29 07:08:07 +0800
869e58f87 [NETNS]: Double free in netlink_release. ... Browse Code »

Netlink protocol table is global for all namespaces. Some netlink
protocols have been virtualized, i.e. they have per/namespace netlink
socket. This difference can easily lead to double free if more than 1
namespace is started. Count the number of kernel netlink sockets to
track that this table is not used any more.

Signed-off-by: Denis V. Lunev
Tested-by: Alexey Dobriyan
Signed-off-by: David S. Miller

Denis V. Lunev
2008-01-29 07:08:05 +0800
3f2525267 [NETLINK] af_netlink: kill some bloat ... Browse Code »

net/netlink/af_netlink.c:
netlink_realloc_groups | -46
netlink_insert | -49
netlink_autobind | -94
netlink_clear_multicast_users | -48
netlink_bind | -55
netlink_setsockopt | -54
netlink_release | -86
netlink_kernel_create | -47
netlink_change_ngroups | -56
9 functions changed, 535 bytes removed, diff: -535

net/netlink/af_netlink.c:
netlink_table_ungrab | +53
1 function changed, 53 bytes added, diff: +53

net/netlink/af_netlink.o:
10 functions changed, 53 bytes added, 535 bytes removed, diff: -482

Signed-off-by: Ilpo Järvinen
Signed-off-by: David S. Miller

Ilpo Järvinen
2008-01-29 07:01:50 +0800
9a429c498 [NET]: Add some acquires/releases sparse annotations. ... Browse Code »

Add __acquires() and __releases() annotations to suppress some sparse
warnings.

example of warnings :

net/ipv4/udp.c:1555:14: warning: context imbalance in 'udp_seq_start' - wrong
count at exit
net/ipv4/udp.c:1571:13: warning: context imbalance in 'udp_seq_stop' -
unexpected unlock

Signed-off-by: Eric Dumazet
Signed-off-by: David S. Miller

Eric Dumazet
2008-01-29 07:00:31 +0800
ea72912c8 [NETLINK]: kzalloc() conversion ... Browse Code »

nl_pid_hash_alloc() is renamed to nl_pid_hash_zalloc().
It is now returning zeroed memory to its callers.

Signed-off-by: Eric Dumazet
Signed-off-by: David S. Miller

Eric Dumazet
2008-01-29 06:57:06 +0800
6ac552fdc [NETLINK]: af_netlink.c checkpatch cleanups ... Browse Code »

Fix large number of checkpatch errors.

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2008-01-29 06:55:50 +0800
e372c4140 [NET]: Consolidate net namespace related proc files creation. ... Browse Code »

Signed-off-by: Denis V. Lunev
Signed-off-by: Pavel Emelyanov
Signed-off-by: David S. Miller

Denis V. Lunev
2008-01-29 06:54:28 +0800

13 Nov, 2007

1 commit

022cbae61 [NET]: Move unneeded data to initdata section. ... Browse Code »

This patch reverts Eric's commit 2b008b0a8e96b726c603c5e1a5a7a509b5f61e35

It diets .text & .data section of the kernel if CONFIG_NET_NS is not set.
This is safe after list operations cleanup.

Signed-of-by: Denis V. Lunev
Signed-off-by: David S. Miller

Denis V. Lunev
2007-11-13 19:23:50 +0800

07 Nov, 2007

1 commit

c3d8d1e30 [NETLINK]: Fix unicast timeouts ... Browse Code »

Commit ed6dcf4a in the history.git tree broke netlink_unicast timeouts
by moving the schedule_timeout() call to a new function that doesn't
propagate the remaining timeout back to the caller. This means on each
retry we start with the full timeout again.

ipc/mqueue.c seems to actually want to wait indefinitely so this
behaviour is retained.

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-11-07 20:15:12 +0800

01 Nov, 2007

1 commit

6257ff217 [NET]: Forget the zero_it argument of sk_alloc() ... Browse Code »

Finally, the zero_it argument can be completely removed from
the callers and from the function prototype.

Besides, fix the checkpatch.pl warnings about using the
assignments inside if-s.

This patch is rather big, and it is a part of the previous one.
I splitted it wishing to make the patches more readable. Hope
this particular split helped.

Signed-off-by: Pavel Emelyanov
Signed-off-by: David S. Miller

Pavel Emelyanov
2007-11-01 15:39:31 +0800

27 Oct, 2007

1 commit

2b008b0a8 [NET]: Marking struct pernet_operations __net_initdata was inappropriate ... Browse Code »

It is not safe to to place struct pernet_operations in a special section.
We need struct pernet_operations to last until we call unregister_pernet_subsys.
Which doesn't happen until module unload.

So marking struct pernet_operations is a disaster for modules in two ways.
- We discard it before we call the exit method it points to.
- Because I keep struct pernet_operations on a linked list discarding
it for compiled in code removes elements in the middle of a linked
list and does horrible things for linked insert.

So this looks safe assuming __exit_refok is not discarded
for modules.

Signed-off-by: Eric W. Biederman
Signed-off-by: David S. Miller

Eric W. Biederman
2007-10-27 13:54:53 +0800

24 Oct, 2007

1 commit

5c58298c2 [NETLINK]: Fix ACK processing after netlink_dump_start ... Browse Code »

Revert to original netlink behavior. Do not reply with ACK if the
netlink dump has bees successfully started.

libnl has been broken by the cd40b7d3983c708aabe3d3008ec64ffce56d33b0
The following command reproduce the problem:
/nl-route-get 192.168.1.1

Signed-off-by: Denis V. Lunev
Acked-by: Thomas Graf
Signed-off-by: David S. Miller

Denis V. Lunev
2007-10-24 12:27:51 +0800

16 Oct, 2007

1 commit

f937f1f46 [NETLINK]: Don't leak 'listeners' in netlink_kernel_create() ... Browse Code »

The Coverity checker spotted that we'll leak the storage allocated
to 'listeners' in netlink_kernel_create() when the
if (!nl_table[unit].registered)
check is false.

This patch avoids the leak.

Signed-off-by: Jesper Juhl
Acked-by: "Eric W. Biederman"
Signed-off-by: David S. Miller

Jesper Juhl
2007-10-16 03:26:32 +0800

11 Oct, 2007

12 commits

cd40b7d39 [NET]: make netlink user -> kernel interface synchronious ... Browse Code »

This patch make processing netlink user -> kernel messages synchronious.
This change was inspired by the talk with Alexey Kuznetsov about current
netlink messages processing. He says that he was badly wrong when introduced
asynchronious user -> kernel communication.

The call netlink_unicast is the only path to send message to the kernel
netlink socket. But, unfortunately, it is also used to send data to the
user.

Before this change the user message has been attached to the socket queue
and sk->sk_data_ready was called. The process has been blocked until all
pending messages were processed. The bad thing is that this processing
may occur in the arbitrary process context.

This patch changes nlk->data_ready callback to get 1 skb and force packet
processing right in the netlink_unicast.

Kernel -> user path in netlink_unicast remains untouched.

EINTR processing for in netlink_run_queue was changed. It forces rtnl_lock
drop, but the process remains in the cycle until the message will be fully
processed. So, there is no need to use this kludges now.

Signed-off-by: Denis V. Lunev
Acked-by: Alexey Kuznetsov
Signed-off-by: David S. Miller

Denis V. Lunev
2007-10-11 12:15:29 +0800
aed815601 [NET]: unify netlink kernel socket recognition ... Browse Code »

There are currently two ways to determine whether the netlink socket is a
kernel one or a user one. This patch creates a single inline call for
this purpose and unifies all the calls in the af_netlink.c

No similar calls are found outside af_netlink.c.

Signed-off-by: Denis V. Lunev
Acked-by: Alexey Kuznetsov
Signed-off-by: David S. Miller

Denis V. Lunev
2007-10-11 12:14:32 +0800
7ee015e0f [NET]: cleanup 3rd argument in netlink_sendskb ... Browse Code »

netlink_sendskb does not use third argument. Clean it and save a couple of
bytes.

Signed-off-by: Denis V. Lunev
Acked-by: Alexey Kuznetsov
Signed-off-by: David S. Miller

Denis V. Lunev
2007-10-11 12:14:03 +0800
3b71535f3 [NET]: Make netlink processing routines semi-synchronious (inspired by rtnl) v2 ... Browse Code »

The code in netfilter/nfnetlink.c and in ./net/netlink/genetlink.c looks
like outdated copy/paste from rtnetlink.c. Push them into sync with the
original.

Changes from v1:
- deleted comment in nfnetlink_rcv_msg by request of Patrick McHardy

Signed-off-by: Denis V. Lunev
Acked-by: Patrick McHardy
Signed-off-by: David S. Miller

Denis V. Lunev
2007-10-11 12:13:32 +0800
cf7732e4c [NET]: Make core networking code use seq_open_private ... Browse Code »

This concerns the ipv4 and ipv6 code mostly, but also the netlink
and unix sockets.

The netlink code is an example of how to use the __seq_open_private()
call - it saves the net namespace on this private.

Signed-off-by: Pavel Emelyanov
Signed-off-by: David S. Miller

Pavel Emelyanov
2007-10-11 07:55:33 +0800
4665079cb [NETNS]: Move some code into __init section when CONFIG_NET_NS=n ... Browse Code »

With the net namespaces many code leaved the __init section,
thus making the kernel occupy more memory than it did before.
Since we have a config option that prohibits the namespace
creation, the functions that initialize/finalize some netns
stuff are simply not needed and can be freed after the boot.

Currently, this is almost not noticeable, since few calls
are no longer in __init, but when the namespaces will be
merged it will be possible to free more code. I propose to
use the __net_init, __net_exit and __net_initdata "attributes"
for functions/variables that are not used if the CONFIG_NET_NS
is not set to save more space in memory.

The exiting functions cannot just reside in the __exit section,
as noticed by David, since the init section will have
references on it and the compilation will fail due to modpost
checks. These references can exist, since the init namespace
never dies and the exit callbacks are never called. So I
introduce the __exit_refok attribute just like it is already
done with the __init_refok.

Signed-off-by: Pavel Emelyanov
Signed-off-by: David S. Miller

Pavel Emelyanov
2007-10-11 07:54:58 +0800
26ff5ddc5 [NETLINK]: the temp variable name max is ambiguous ... Browse Code »

with the macro max provided by , so changed its name
to a more proper one: limit

Signed-off-by: Denis Cheng
Signed-off-by: David S. Miller

Denis Cheng
2007-10-11 07:51:25 +0800
99406c885 [NETLINK]: use the macro min(x,y) provided by <linux/kernel.h> instead ... Browse Code »

Signed-off-by: Denis Cheng
Signed-off-by: David S. Miller

Denis Cheng
2007-10-11 07:51:25 +0800
0cfad0755 [NETLINK]: Avoid pointer in netlink_run_queue ... Browse Code »

I was looking at Patrick's fix to inet_diag and it occured
to me that we're using a pointer argument to return values
unnecessarily in netlink_run_queue. Changing it to return
the value will allow the compiler to generate better code
since the value won't have to be memory-backed.

Signed-off-by: Herbert Xu
Signed-off-by: David S. Miller

Herbert Xu
2007-10-11 07:51:24 +0800
077130c0c [NET]: Fix race when opening a proc file while a network namespace is exiting. ... Browse Code »

The problem: proc_net files remember which network namespace the are
against but do not remember hold a reference count (as that would pin
the network namespace). So we currently have a small window where
the reference count on a network namespace may be incremented when opening
a /proc file when it has already gone to zero.

To fix this introduce maybe_get_net and get_proc_net.

maybe_get_net increments the network namespace reference count only if it is
greater then zero, ensuring we don't increment a reference count after it
has gone to zero.

get_proc_net handles all of the magic to go from a proc inode to the network
namespace instance and call maybe_get_net on it.

PROC_NET the old accessor is removed so that we don't get confused and use
the wrong helper function.

Then I fix up the callers to use get_proc_net and handle the case case
where get_proc_net returns NULL. In that case I return -ENXIO because
effectively the network namespace has already gone away so the files
we are trying to access don't exist anymore.

Signed-off-by: Eric W. Biederman
Acked-by: Paul E. McKenney
Signed-off-by: David S. Miller

Eric W. Biederman
2007-10-11 07:49:22 +0800
8f4c1f9b0 [NETLINK]: Introduce nested and byteorder flag to netlink attribute ... Browse Code »

This change allows the generic attribute interface to be used within
the netfilter subsystem where this flag was initially introduced.

The byte-order flag is yet unused, it's intended use is to
allow automatic byte order convertions for all atomic types.

Signed-off-by: Thomas Graf
Signed-off-by: David S. Miller

Thomas Graf
2007-10-11 07:49:16 +0800
b4b510290 [NET]: Support multiple network namespaces with netlink ... Browse Code »

Each netlink socket will live in exactly one network namespace,
this includes the controlling kernel sockets.

This patch updates all of the existing netlink protocols
to only support the initial network namespace. Request
by clients in other namespaces will get -ECONREFUSED.
As they would if the kernel did not have the support for
that netlink protocol compiled in.

As each netlink protocol is updated to be multiple network
namespace safe it can register multiple kernel sockets
to acquire a presence in the rest of the network namespaces.

The implementation in af_netlink is a simple filter implementation
at hash table insertion and hash table look up time.

Signed-off-by: Eric W. Biederman
Signed-off-by: David S. Miller

Eric W. Biederman
2007-10-11 07:49:09 +0800