Eric Lee / smarc-fsl-linux-kernel

16 Jun, 2009

3 commits

c23cad923 [S390] PM: af_iucv power management callbacks. ... Browse Code »

Patch establishes a dummy afiucv-device to make sure af_iucv is
notified as iucv-bus device about suspend/resume.

The PM freeze callback severs all iucv pathes of connected af_iucv sockets.
The PM thaw/restore callback switches the state of all previously connected
sockets to IUCV_DISCONN.

Signed-off-by: Ursula Braun
Signed-off-by: Martin Schwidefsky

Ursula Braun
2009-06-16 16:31:19 +0800
672e405b6 [S390] pm: iucv power management callbacks. ... Browse Code »

Patch calls the PM callback functions of iucv-bus devices, which are
responsible for removal of their established iucv pathes.

The PM freeze callback for the first iucv-bus device disables all iucv
interrupts except the connection severed interrupt.
The PM freeze callback for the last iucv-bus device shuts down iucv.

The PM thaw callback for the first iucv-bus device re-enables iucv
if it has been shut down during freeze. If freezing has been interrupted,
it re-enables iucv interrupts according to the needs of iucv-exploiters.

The PM restore callback for the first iucv-bus device re-enables iucv.

Signed-off-by: Ursula Braun
Signed-off-by: Martin Schwidefsky

Ursula Braun
2009-06-16 16:31:17 +0800
6c005961c [S390] iucv: establish reboot notifier ... Browse Code »

To guarantee a proper cleanup, patch adds a reboot notifier to
the iucv base code, which disables iucv interrupts, shuts down
established iucv pathes, and removes iucv declarations for z/VM.

Checks have to be added to the iucv-API functions, whether
iucv-buffers removed at reboot time are still declared.

Signed-off-by: Ursula Braun
Signed-off-by: Martin Schwidefsky

Ursula Braun
2009-06-16 16:31:17 +0800

15 Jun, 2009

4 commits

9cbc1cb8c Merge branch 'master' of master.kernel.org:/pub/scm/linux/kernel/git/torvalds/linux-2.6 ... Browse Code »

Conflicts:
Documentation/feature-removal-schedule.txt
drivers/scsi/fcoe/fcoe.c
net/core/drop_monitor.c
net/core/net-traces.c

David S. Miller
2009-06-15 18:02:23 +0800
ca44d6e60 pkt_sched: Rename PSCHED_US2NS and PSCHED_NS2US ... Browse Code »

Let's use TICKS instead of US, so PSCHED_TICKS2NS and PSCHED_NS2TICKS
(like in PSCHED_TICKS_PER_SEC already) to avoid misleading.

Signed-off-by: Jarek Poplawski
Signed-off-by: David S. Miller

Jarek Poplawski
2009-06-15 17:31:47 +0800
e0f7cb8c8 ipv4: Fix fib_trie rebalancing ... Browse Code »

While doing trie_rebalance(): resize(), inflate(), halve() RCU free
tnodes before updating their parents. It depends on RCU delaying the
real destruction, but if RCU readers start after call_rcu() and before
parent update they could access freed memory.

It is currently prevented with preempt_disable() on the update side,
but it's not safe, except maybe classic RCU, plus it conflicts with
memory allocations with GFP_KERNEL flag used from these functions.

This patch explicitly delays freeing of tnodes by adding them to the
list, which is flushed after the update is finished.

Reported-by: Yan Zheng
Signed-off-by: Jarek Poplawski
Signed-off-by: David S. Miller

Jarek Poplawski
2009-06-15 17:31:29 +0800
489f7ab6c Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/jikos/trivial ... Browse Code »

* 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/jikos/trivial: (31 commits)
trivial: remove the trivial patch monkey's name from SubmittingPatches
trivial: Fix a typo in comment of addrconf_dad_start()
trivial: usb: fix missing space typo in doc
trivial: pci hotplug: adding __init/__exit macros to sgi_hotplug
trivial: Remove the hyphen from git commands
trivial: fix ETIMEOUT -> ETIMEDOUT typos
trivial: Kconfig: .ko is normally not included in module names
trivial: SubmittingPatches: fix typo
trivial: Documentation/dell_rbu.txt: fix typos
trivial: Fix Pavel's address in MAINTAINERS
trivial: ftrace:fix description of trace directory
trivial: unnecessary (void*) cast removal in sound/oss/msnd.c
trivial: input/misc: Fix typo in Kconfig
trivial: fix grammo in bus_for_each_dev() kerneldoc
trivial: rbtree.txt: fix rb_entry() parameters in sample code
trivial: spelling fix in ppc code comments
trivial: fix typo in bio_alloc kernel doc
trivial: Documentation/rbtree.txt: cleanup kerneldoc of rbtree.txt
trivial: Miscellaneous documentation typo fixes
trivial: fix typo milisecond/millisecond for documentation and source comments.
...

Linus Torvalds
2009-06-15 04:46:25 +0800

14 Jun, 2009

5 commits

1a097181e Bluetooth: Fix Kconfig issue with RFKILL integration ... Browse Code »

Since the re-write of the RFKILL subsystem it is no longer good to just
select RFKILL, but it is important to add a proper depends on rule.

Based on a report by Alexander Beregalov

Signed-off-by: Marcel Holtmann

Marcel Holtmann
2009-06-14 21:30:51 +0800
403dbb97f PIM-SM: namespace changes ... Browse Code »

IPv4:
- make PIM register vifs netns local
- set the netns when a PIM register vif is created
- make PIM available in all network namespaces (if CONFIG_IP_PIMSM_V2)
by adding the protocol handler when multicast routing is initialized

IPv6:
- make PIM register vifs netns local
- make PIM available in all network namespaces (if CONFIG_IPV6_PIMSM_V2)
by adding the protocol handler when multicast routing is initialized

Signed-off-by: Tom Goff
Signed-off-by: David S. Miller

Tom Goff
2009-06-14 18:16:13 +0800
e61a4b634 ipv4: update ARPD help text ... Browse Code »

Removed the statements about ARP cache size as this config option does
not affect it. The cache size is controlled by neigh_table gc thresholds.

Remove also expiremental and obsolete markings as the API originally
intended for arp caching is useful for implementing ARP-like protocols
(e.g. NHRP) in user space and has been there for a long enough time.

Signed-off-by: Timo Teras
Signed-off-by: David S. Miller

Timo Teräs
2009-06-14 14:36:32 +0800
125bb8f56 net: use a deferred timer in rt_check_expire ... Browse Code »

For the sake of power saver lovers, use a deferrable timer to fire
rt_check_expire()

As some big routers cache equilibrium depends on garbage collection
done in time, we take into account elapsed time between two
rt_check_expire() invocations to adjust the amount of slots we have to
check.

Based on an initial idea and patch from Tero Kristo

Signed-off-by: Eric Dumazet
Signed-off-by: Tero Kristo
Signed-off-by: David S. Miller

Eric Dumazet
2009-06-14 14:36:31 +0800
eaae44d24 Merge branch 'master' of git://git.kernel.org/pub/scm/linux/kernel/git/kaber/nf-next-2.6 Browse Code »

David S. Miller
2009-06-14 07:43:28 +0800

13 Jun, 2009

11 commits

3dd5d7e3b x_tables: Convert printk to pr_err ... Browse Code »

Signed-off-by: Joe Perches
Signed-off-by: Patrick McHardy

Joe Perches
2009-06-13 18:32:39 +0800
dd7669a92 netfilter: conntrack: optional reliable conntrack event delivery ... Browse Code »

This patch improves ctnetlink event reliability if one broadcast
listener has set the NETLINK_BROADCAST_ERROR socket option.

The logic is the following: if an event delivery fails, we keep
the undelivered events in the missed event cache. Once the next
packet arrives, we add the new events (if any) to the missed
events in the cache and we try a new delivery, and so on. Thus,
if ctnetlink fails to deliver an event, we try to deliver them
once we see a new packet. Therefore, we may lose state
transitions but the userspace process gets in sync at some point.

At worst case, if no events were delivered to userspace, we make
sure that destroy events are successfully delivered. Basically,
if ctnetlink fails to deliver the destroy event, we remove the
conntrack entry from the hashes and we insert them in the dying
list, which contains inactive entries. Then, the conntrack timer
is added with an extra grace timeout of random32() % 15 seconds
to trigger the event again (this grace timeout is tunable via
/proc). The use of a limited random timeout value allows
distributing the "destroy" resends, thus, avoiding accumulating
lots "destroy" events at the same time. Event delivery may
re-order but we can identify them by means of the tuple plus
the conntrack ID.

The maximum number of conntrack entries (active or inactive) is
still handled by nf_conntrack_max. Thus, we may start dropping
packets at some point if we accumulate a lot of inactive conntrack
entries that did not successfully report the destroy event to
userspace.

During my stress tests consisting of setting a very small buffer
of 2048 bytes for conntrackd and the NETLINK_BROADCAST_ERROR socket
flag, and generating lots of very small connections, I noticed
very few destroy entries on the fly waiting to be resend.

A simple way to test this patch consist of creating a lot of
entries, set a very small Netlink buffer in conntrackd (+ a patch
which is not in the git tree to set the BROADCAST_ERROR flag)
and invoke `conntrack -F'.

For expectations, no changes are introduced in this patch.
Currently, event delivery is only done for new expectations (no
events from expectation expiration, removal and confirmation).
In that case, they need a per-expectation event cache to implement
the same idea that is exposed in this patch.

This patch can be useful to provide reliable flow-accouting. We
still have to add a new conntrack extension to store the creation
and destroy time.

Signed-off-by: Pablo Neira Ayuso
Signed-off-by: Patrick McHardy

Pablo Neira Ayuso
2009-06-13 18:30:52 +0800
9858a3ae1 netfilter: conntrack: move helper destruction to nf_ct_helper_destroy() ... Browse Code »

This patch moves the helper destruction to a function that lives
in nf_conntrack_helper.c. This new function is used in the patch
to add ctnetlink reliable event delivery.

Signed-off-by: Pablo Neira Ayuso
Signed-off-by: Patrick McHardy

Pablo Neira Ayuso
2009-06-13 18:28:22 +0800
a0891aa6a netfilter: conntrack: move event caching to conntrack extension infrastructure ... Browse Code »

This patch reworks the per-cpu event caching to use the conntrack
extension infrastructure.

The main drawback is that we consume more memory per conntrack
if event delivery is enabled. This patch is required by the
reliable event delivery that follows to this patch.

BTW, this patch allows you to enable/disable event delivery via
/proc/sys/net/netfilter/nf_conntrack_events in runtime, although
you can still disable event caching as compilation option.

Signed-off-by: Pablo Neira Ayuso
Signed-off-by: Patrick McHardy

Pablo Neira Ayuso
2009-06-13 18:26:29 +0800
65cb9fda3 netfilter: nf_conntrack: use mod_timer_pending() for conntrack refresh ... Browse Code »

Use mod_timer_pending() instead of atomic sequence of del_timer()/
add_timer(). mod_timer_pending() does not rearm an inactive timer,
so we don't need the conntrack lock anymore to make sure we don't
accidentally rearm a timer of a conntrack which is in the process
of being destroyed.

With this change, we don't need to take the global lock anymore at all,
counter updates can be performed under the per-conntrack lock.

Signed-off-by: Patrick McHardy

Patrick McHardy
2009-06-13 18:21:49 +0800
266d07cb1 netfilter: nf_log: fix sleeping function called from invalid context ... Browse Code »

Fix regression introduced by 17625274 "netfilter: sysctl support of
logger choice":

BUG: sleeping function called from invalid context at /mnt/s390test/linux-2.6-tip/arch/s390/include/asm/uaccess.h:234
in_atomic(): 1, irqs_disabled(): 0, pid: 3245, name: sysctl
CPU: 1 Not tainted 2.6.30-rc8-tipjun10-02053-g39ae214 #1
Process sysctl (pid: 3245, task: 000000007f675da0, ksp: 000000007eb17cf0)
0000000000000000 000000007eb17be8 0000000000000002 0000000000000000
000000007eb17c88 000000007eb17c00 000000007eb17c00 0000000000048156
00000000003e2de8 000000007f676118 000000007eb17f10 0000000000000000
0000000000000000 000000007eb17be8 000000000000000d 000000007eb17c58
00000000003e2050 000000000001635c 000000007eb17be8 000000007eb17c30
Call Trace:
(Ý¨ show_trace+0x13a/0x148)
Ý¨ __might_sleep+0x13a/0x164
Ý¨ proc_dostring+0x134/0x22c
Ý¨ nf_log_proc_dostring+0xfc/0x188
Ý¨ proc_sys_call_handler+0xf6/0x118
Ý¨ proc_sys_read+0x26/0x34
Ý¨ vfs_read+0xac/0x158
Ý¨ SyS_read+0x56/0x88
Ý¨ sysc_noemu+0x10/0x16

Use the nf_log_mutex instead of RCU to fix this.

Reported-and-tested-by: Maran Pakkirisamy
Signed-off-by: Patrick McHardy

Patrick McHardy
2009-06-13 18:21:10 +0800
5b5481402 net: use symbolic values for ndo_start_xmit() return codes ... Browse Code »

Convert magic values 1 and -1 to NETDEV_TX_BUSY and NETDEV_TX_LOCKED respectively.

0 (NETDEV_TX_OK) is not changed to keep the noise down, except in very few cases
where its in direct proximity to one of the other values.

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2009-06-13 16:18:50 +0800
81fbbf604 net: fix network drivers ndo_start_xmit() return values (part 7) ... Browse Code »

Fix up ATM drivers that return an errno value to qdisc_restart(), causing
qdisc_restart() to print a warning an requeue/retransmit the skb.

- lec: condition can only be remedied by userspace, until that retransmissions

Compile tested only.

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2009-06-13 16:18:43 +0800
590a9887a trivial: Fix a typo in comment of addrconf_dad_start() ... Browse Code »

Signed-off-by: Masatake YAMATO
Signed-off-by: Jiri Kosina

Masatake YAMATO
2009-06-13 00:01:51 +0800
4737f0978 trivial: Kconfig: .ko is normally not included in module names ... Browse Code »

.ko is normally not included in Kconfig help, make it consistent.

Signed-off-by: Pavel Machek
Signed-off-by: Jiri Kosina

Pavel Machek
2009-06-13 00:01:50 +0800
6d60f9dfc trivial: Fix paramater/parameter typo in dmesg and source comments ... Browse Code »

Signed-off-by: Martin Olsson
Signed-off-by: Jiri Kosina

Martin Olsson
2009-06-13 00:01:46 +0800

12 Jun, 2009

8 commits

d2a7ddda9 virtio: find_vqs/del_vqs virtio operations ... Browse Code »

This replaces find_vq/del_vq with find_vqs/del_vqs virtio operations,
and updates all drivers. This is needed for MSI support, because MSI
needs to know the total number of vectors upfront.

Signed-off-by: Michael S. Tsirkin
Signed-off-by: Rusty Russell (+ lguest/9p compile fixes)

Michael S. Tsirkin
2009-06-12 20:46:36 +0800
9499f5e7e virtio: add names to virtqueue struct, mapping from devices to queues. ... Browse Code »

Add a linked list of all virtqueues for a virtio device: this helps for
debugging and is also needed for upcoming interface change.

Also, add a "name" field for clearer debug messages.

Signed-off-by: Rusty Russell

Rusty Russell
2009-06-12 20:46:36 +0800
da6782927 bridge: Simplify interface for ATM LANE ... Browse Code »

This patch changes FDB entry check for ATM LANE bridge integration.
There's no point in holding a FDB entry around SKB building.

br_fdb_get()/br_fdb_put() pair are changed into single br_fdb_test_addr()
hook that checks if the addr has FDB entry pointing to other port
to the one the request arrived on.

FDB entry refcounting is removed as it's not used anywhere else.

Signed-off-by: Michał Mirosław
Acked-by: Stephen Hemminger
Signed-off-by: David S. Miller

Michał Mirosław
2009-06-12 12:03:21 +0800
746e6ad23 [PATCH] net core: Some interface flags not returned by SIOCGIFFLAGS ... Browse Code »

Commit b00055aacdb172c05067612278ba27265fcd05ce " [NET] core: add
RFC2863 operstate" defined new interface flag values. Its
documentation specified that these flags could be accessed from user
space via SIOCGIFFLAGS. However, this does not work because the new
flags do not fit in that ioctl's argument width.

Change the documentation to match the code's behavior. Also change
the source to explicitly show the truncation. This _should_ have no
effect on executable code, and did not with gcc 4.2.4 generating x86
code.

A new ioctl could be defined to return all interface flags to user
space. However, since this has been broken for three years with no
one complaining, there doesn't seem much need. They are still
accessible via netlink.

Reported-by: "Fredrik Arnerup"
Signed-off-by: John Dykstra
Signed-off-by: David S. Miller

John Dykstra
2009-06-12 11:57:21 +0800
adf76cfe2 Merge branch 'master' of git://git.kernel.org/pub/scm/linux/kernel/git/kaber/nf-next-2.6 Browse Code »

David S. Miller
2009-06-12 11:00:44 +0800
3ee40c376 Merge branch 'linux-2.6.31.y' of git://git.kernel.org/pub/scm/linux/kernel/git/inaky/wimax Browse Code »

David S. Miller
2009-06-12 08:11:33 +0800
24992eacd netfilter: ip_tables: fix build error ... Browse Code »

Fix build error introduced by commit bb70dfa5 (netfilter: xtables:
consolidate comefrom debug cast access):

net/ipv4/netfilter/ip_tables.c: In function 'ipt_do_table':
net/ipv4/netfilter/ip_tables.c:421: error: 'comefrom' undeclared (first use in this function)
net/ipv4/netfilter/ip_tables.c:421: error: (Each undeclared identifier is reported only once
net/ipv4/netfilter/ip_tables.c:421: error: for each function it appears in.)

Signed-off-by: Patrick McHardy

Patrick McHardy
2009-06-12 07:53:09 +0800
d2f4c1054 wimax: fix warning caused by not checking retval of rfkill_set_hw_state() ... Browse Code »

Caused by an API update. The return value can be safely ignored, as
there is notthing we can do with it.

Signed-off-by: Inaky Perez-Gonzalez

Inaky Perez-Gonzalez
2009-06-12 02:12:48 +0800

11 Jun, 2009

9 commits

334a47f63 netfilter: nf_ct_tcp: fix up build after merge ... Browse Code »

Replace the last occurence of tcp_lock by the per-conntrack lock.

Signed-off-by: Patrick McHardy

Patrick McHardy
2009-06-11 22:16:09 +0800
36432dae7 Merge branch 'master' of git://git.kernel.org/pub/scm/linux/kernel/git/davem/net-next-2.6 Browse Code »

Patrick McHardy
2009-06-11 22:00:49 +0800
bb400801c Merge branch 'master' of git://git.kernel.org/pub/scm/linux/kernel/git/holtmann/bluetooth-next-2.6 Browse Code »

David S. Miller
2009-06-11 20:47:43 +0800
5ef12d98a neigh: fix state transition INCOMPLETE->FAILED via Netlink request ... Browse Code »

The current code errors out the INCOMPLETE neigh entry skb queue only from
the timer if maximum probes have been attempted and there has been no reply.
This also causes the transtion to FAILED state.

However, the neigh entry can be also updated via Netlink to inform that the
address is unavailable. Currently, neigh_update() just stops the timers and
leaves the pending skb's unreleased. This results that the clean up code in
the timer callback is never called, preventing also proper garbage collection.

This fixes neigh_update() to process the pending skb queue immediately if
INCOMPLETE -> FAILED state transtion occurs due to a Netlink request.

Signed-off-by: Timo Teras
Signed-off-by: David S. Miller

Timo Teras
2009-06-11 19:16:28 +0800
2b85a34e9 net: No more expensive sock_hold()/sock_put() on each tx ... Browse Code »

One of the problem with sock memory accounting is it uses
a pair of sock_hold()/sock_put() for each transmitted packet.

This slows down bidirectional flows because the receive path
also needs to take a refcount on socket and might use a different
cpu than transmit path or transmit completion path. So these
two atomic operations also trigger cache line bounces.

We can see this in tx or tx/rx workloads (media gateways for example),
where sock_wfree() can be in top five functions in profiles.

We use this sock_hold()/sock_put() so that sock freeing
is delayed until all tx packets are completed.

As we also update sk_wmem_alloc, we could offset sk_wmem_alloc
by one unit at init time, until sk_free() is called.
Once sk_free() is called, we atomic_dec_and_test(sk_wmem_alloc)
to decrement initial offset and atomicaly check if any packets
are in flight.

skb_set_owner_w() doesnt call sock_hold() anymore

sock_wfree() doesnt call sock_put() anymore, but check if sk_wmem_alloc
reached 0 to perform the final freeing.

Drawback is that a skb->truesize error could lead to unfreeable sockets, or
even worse, prematurely calling __sk_free() on a live socket.

Nice speedups on SMP. tbench for example, going from 2691 MB/s to 2711 MB/s
on my 8 cpu dev machine, even if tbench was not really hitting sk_refcnt
contention point. 5 % speedup on a UDP transmit workload (depends
on number of flows), lowering TX completion cpu usage.

Signed-off-by: Eric Dumazet
Signed-off-by: David S. Miller

Eric Dumazet
2009-06-11 17:55:43 +0800
e5241c448 ieee802154: Use '%Zu' printf format for size_t. ... Browse Code »

Signed-off-by: David S. Miller

David S. Miller
2009-06-11 17:10:19 +0800
84503ddd6 Merge branch 'master' of git://git.kernel.org/pub/scm/linux/kernel/git/linville/wireless-next-2.6 Browse Code »

David S. Miller
2009-06-11 14:41:43 +0800
862366118 Merge branch 'tracing-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/linux-2.6-tip ... Browse Code »

* 'tracing-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/linux-2.6-tip: (244 commits)
Revert "x86, bts: reenable ptrace branch trace support"
tracing: do not translate event helper macros in print format
ftrace/documentation: fix typo in function grapher name
tracing/events: convert block trace points to TRACE_EVENT(), fix !CONFIG_BLOCK
tracing: add protection around module events unload
tracing: add trace_seq_vprint interface
tracing: fix the block trace points print size
tracing/events: convert block trace points to TRACE_EVENT()
ring-buffer: fix ret in rb_add_time_stamp
ring-buffer: pass in lockdep class key for reader_lock
tracing: add annotation to what type of stack trace is recorded
tracing: fix multiple use of __print_flags and __print_symbolic
tracing/events: fix output format of user stack
tracing/events: fix output format of kernel stack
tracing/trace_stack: fix the number of entries in the header
ring-buffer: discard timestamps that are at the start of the buffer
ring-buffer: try to discard unneeded timestamps
ring-buffer: fix bug in ring_buffer_discard_commit
ftrace: do not profile functions when disabled
tracing: make trace pipe recognize latency format flag
...

Linus Torvalds
2009-06-11 10:53:40 +0800
2f0accc13 cfg80211: fix rfkill locking problem ... Browse Code »

rfkill currently requires a global lock within the
rfkill_register() function, and holds that lock over
calls to the set_block() methods. This means that we
cannot hold a lock around rfkill_register() that we
also require in set_block(), directly or indirectly.
Fix cfg80211 to register rfkill outside the block
locked by its global lock. Much of what cfg80211 does
in the locked block doesn't need to be locked anyway.

Reported-by: Vasanthakumar Thiagarajan
Signed-off-by: Johannes Berg
Signed-off-by: John W. Linville

Johannes Berg
2009-06-11 01:28:41 +0800