Eric Lee / smarc-fsl-linux-kernel

26 Apr, 2007

40 commits

c45d286e7 [NET]: Inline net_device_stats ... Browse Code »

Network drivers which keep stats allocate their own stats structure
then write a get_stats() function to return them. It would be nice if
this were done by default.

1) Add a new "stats" field to "struct net_device".
2) Add a new feature field to say "this driver uses the internal one"
3) Have a default "get_stats" which returns NULL if that feature not set.
4) Change callers to check result of get_stats call for NULL, not if
->get_stats is set.

This should not break backwards compatibility with older drivers, yet
allow modern drivers to shed some boilerplate code.

Lightly tested: works for a modified lguest network driver.

Signed-off-by: Rusty Russell
Signed-off-by: David S. Miller

Rusty Russell
2007-04-26 13:28:26 +0800
f85958151 [NET]: random functions can use nsec resolution instead of usec ... Browse Code »

In order to get more randomness for secure_tcpv6_sequence_number(),
secure_tcp_sequence_number(), secure_dccp_sequence_number() functions,
we can use the high resolution time services, providing nanosec
resolution.

I've also done two kmalloc()/kzalloc() conversions.

Signed-off-by: Eric Dumazet
Acked-by: James Morris
Signed-off-by: David S. Miller

Eric Dumazet
2007-04-26 13:28:25 +0800
4b19ca44c [NET] fib_rules: delay route cache flush by ip_rt_min_delay ... Browse Code »

Signed-off-by: Thomas Graf
Signed-off-by: David S. Miller

Thomas Graf
2007-04-26 13:28:24 +0800
d626f62b1 [SK_BUFF]: Introduce skb_copy_from_linear_data{_offset} ... Browse Code »

To clearly state the intent of copying from linear sk_buffs, _offset being a
overly long variant but interesting for the sake of saving some bytes.

Signed-off-by: Arnaldo Carvalho de Melo

Arnaldo Carvalho de Melo
2007-04-26 13:28:23 +0800
2a123b86e [BLUETOOTH]: Introduce skb->data accessor methods for hci_{acl,event,sco}_hdr ... Browse Code »

For consistency with other skb data accessors, reducing the number of direct
accesses to skb->data.

Signed-off-by: Arnaldo Carvalho de Melo

Arnaldo Carvalho de Melo
2007-04-26 13:28:21 +0800
03d4f879b [IPV4]: align inet_protos[] on SMP ... Browse Code »

As IPPROTO_TCP is 6, it makes sense to make sure inet_protos[] array
is properly cache line aligned to avoid false sharing on SMP.

c0680540 b peer_total
c0680544 b inet_peer_unused_head
c0680560 B inet_protos

On i386 this example, we can see that inet_protos[IPPROTO_TCP] shares
a potentially hot (and modified) cache line.

Signed-off-by: Eric Dumazet
Signed-off-by: David S. Miller

Eric Dumazet
2007-04-26 13:28:20 +0800
4103f8cd5 [TCP]: tcp_memory_pressure and tcp_socket are__read_mostly candidates ... Browse Code »

tcp_memory_pressure and tcp_socket currently share a cache line with tcp_memory_allocated, tcp_sockets_allocated.
(Very hot cache line)
It makes sense to declare these variables as __read_mostly, to avoid false sharing on SMP.

ffffffff8081d9c0 B tcp_orphan_count
ffffffff8081d9c4 B tcp_memory_allocated
ffffffff8081d9c8 B tcp_sockets_allocated
ffffffff8081d9cc B tcp_memory_pressure
ffffffff8081d9d0 b tcp_md5sig_users
ffffffff8081d9d8 b tcp_md5sig_pool
ffffffff8081d9e0 b warntime.31570
ffffffff8081d9e8 b tcp_socket

Signed-off-by: Eric Dumazet
Signed-off-by: David S. Miller

Eric Dumazet
2007-04-26 13:28:19 +0800
73417f617 [NET] fib_rules: Flush route cache after rule modifications ... Browse Code »

The results of FIB rules lookups are cached in the routing cache
except for IPv6 as no such cache exists. So far, it was the
responsibility of the user to flush the cache after modifying any
rules. This lead to many false bug reports due to misunderstanding
of this concept.

This patch automatically flushes the route cache after inserting
or deleting a rule.

Thanks to Muli Ben-Yehuda for catching a bug
in the previous patch.

Signed-off-by: Thomas Graf
Signed-off-by: David S. Miller

Thomas Graf
2007-04-26 13:28:18 +0800
be776281a [NET]: inet_ehash_secret should be __read_mostly and set only once ... Browse Code »

There is a very tiny probability that build_ehash_secret() is called
at the same time by different CPUS.

Also, using __read_mostly is a must for inet_ehash_secret

Signed-off-by: Eric Dumazet
Signed-off-by: David S. Miller

Eric Dumazet
2007-04-26 13:28:17 +0800
35fc92a9d [NET]: Allow forwarding of ip_summed except CHECKSUM_COMPLETE ... Browse Code »

Right now Xen has a horrible hack that lets it forward packets with
partial checksums. One of the reasons that CHECKSUM_PARTIAL and
CHECKSUM_COMPLETE were added is so that we can get rid of this hack
(where it creates two extra bits in the skbuff to essentially mirror
ip_summed without being destroyed by the forwarding code).

I had forgotten that I've already gone through all the deivce drivers
last time around to make sure that they're looking at ip_summed ==
CHECKSUM_PARTIAL rather than ip_summed != 0 on transmit. In any case,
I've now done that again so it should definitely be safe.

Unfortunately nobody has yet added any code to update CHECKSUM_COMPLETE
values on forward so we I'm setting that to CHECKSUM_NONE. This should
be safe to remove for bridging but I'd like to check that code path
first.

So here is the patch that lets us get rid of the hack by preserving
ip_summed (mostly) on forwarded packets.

Signed-off-by: Herbert Xu
Signed-off-by: David S. Miller

Herbert Xu
2007-04-26 13:28:16 +0800
2d771cd86 [IPV4] LVS: Allow to send ICMP unreachable responses when real-servers are removed ... Browse Code »

this is a small patch by Janusz Krzysztofik to ip_route_output_slow()
that allows VIP-less LVS linux director to generate packets
originating >From VIP if sysctl_ip_nonlocal_bind is set.

In a nutshell, the intention is for an LVS linux director to be able
to send ICMP unreachable responses to end-users when real-servers are
removed.

http://archive.linuxvirtualserver.org/html/lvs-users/2007-01/msg00106.html

Signed-off-by: Simon Horman
Signed-off-by: David S. Miller

Janusz Krzysztofik
2007-04-26 13:28:15 +0800
fa0b2d1d2 [NET] fib_rules: Add no-operation action ... Browse Code »

The use of nop rules simplifies the usage of goto rules
and adds more flexibility as they allow targets to remain
while the actual content of the branches can change easly.

Signed-off-by: Thomas Graf
Signed-off-by: David S. Miller

Thomas Graf
2007-04-26 13:28:14 +0800
2b4436830 [NET] fib_rules: Mark rules detached from the device ... Browse Code »

Rules which match against device names in their selector can
remain while the device itself disappears, in fact the device
doesn't have to present when the rule is added in the first
place. The device name is resolved by trying when the rule is
added and later by listening to NETDEV_REGISTER/UNREGISTER
notifications.

This patch adds the flag FIB_RULE_DEV_DETACHED which is set
towards userspace when a rule contains a device match which
is unresolved at the moment. This eases spotting the reason
why certain rules seem not to function properly.

Signed-off-by: Thomas Graf
Signed-off-by: David S. Miller

Thomas Graf
2007-04-26 13:28:13 +0800
0947c9fe5 [NET] fib_rules: goto rule action ... Browse Code »

This patch adds a new rule action FR_ACT_GOTO which allows
to skip a set of rules by jumping to another rule. The rule
to jump to is specified via the FRA_GOTO attribute which
carries a rule preference.

Referring to a rule which doesn't exists is explicitely allowed.
Such goto rules are marked with the flag FIB_RULE_UNRESOLVED
and will act like a rule with a non-matching selector. The rule
will become functional as soon as its target is present.

The goto action enables performance optimizations by reducing
the average number of rules that have to be passed per lookup.

Example:
0: from all lookup local
40: not from all to 192.168.23.128 goto 32766
41: from all fwmark 0xa blackhole
42: from all fwmark 0xff blackhole
32766: from all lookup main

Signed-off-by: Thomas Graf
Signed-off-by: David S. Miller

Thomas Graf
2007-04-26 13:28:12 +0800
2f7826c02 [WAN] cosa.c: Build fix. ... Browse Code »

Caused by skb_reset_mac_header() changes, missing semicolon.

Signed-off-by: David S. Miller

David S. Miller
2007-04-26 13:28:11 +0800
85795d64e [TCP] tcp_probe: improvements for net-2.6.22 ... Browse Code »

Change tcp_probe to use ktime (needed to add one export).
Add option to only get events when cwnd changes - from Doug Leith

Signed-off-by: Stephen Hemminger
Signed-off-by: David S. Miller

Stephen Hemminger
2007-04-26 13:28:10 +0800
e1c3e7ab6 [TCP]: cubic update for net-2.6.22 ... Browse Code »

The following update received from Injong updates TCP cubic to the latest
version. I am running more complete tests and will have results after 4/1.

According to Injong: the new version improves on its scalability,
fairness and stability. So in all properties, we confirmed it shows better
performance.

NCSU results (for 2.6.18 and 2.6.20) available:
http://netsrv.csc.ncsu.edu/wiki/index.php/TCP_Testing

This version is described in a new Internet draft for CUBIC.
http://www.ietf.org/internet-drafts/draft-rhee-tcp-cubic-00.txt

Signed-off-by: Stephen Hemminger
Signed-off-by: David S. Miller

Stephen Hemminger
2007-04-26 13:28:09 +0800
9af3912ec [NET] Move DF check to ip_forward ... Browse Code »

Do fragmentation check in ip_forward, similar to ipv6 forwarding.

Signed-off-by: John Heffner
Signed-off-by: David S. Miller

John Heffner
2007-04-26 13:28:07 +0800
b3da2cf37 [INET]: Use jhash + random secret for ehash. ... Browse Code »

The days are gone when this was not an issue, there are folks out
there with huge bot networks that can be used to attack the
established hash tables on remote systems.

So just like the routing cache and connection tracking
hash, use Jenkins hash with random secret input.

Signed-off-by: David S. Miller

David S. Miller
2007-04-26 13:28:06 +0800
d30045a0b [NETLINK]: introduce NLA_BINARY type ... Browse Code »

This patch introduces a new NLA_BINARY attribute policy type with the
verification of simply checking the maximum length of the payload.

It also fixes a small typo in the example.

Signed-off-by: Johannes Berg
Signed-off-by: Thomas Graf
Signed-off-by: David S. Miller

Johannes Berg
2007-04-26 13:28:05 +0800
703315712 [SCTP]: Implement SCTP_MAX_BURST socket option. ... Browse Code »

Signed-off-by: Vlad Yasevich
Signed-off-by: David S. Miller

Vlad Yasevich
2007-04-26 13:28:04 +0800
a5a35e767 [SCTP]: Implement sac_info field in SCTP_ASSOC_CHANGE notification. ... Browse Code »

As stated in the sctp socket api draft:

sac_info: variable

If the sac_state is SCTP_COMM_LOST and an ABORT chunk was received
for this association, sac_info[] contains the complete ABORT chunk as
defined in the SCTP specification RFC2960 [RFC2960] section 3.3.7.

We now save received ABORT chunks into the sac_info field and pass that
to the user.

Signed-off-by: Vlad Yasevich
Signed-off-by: David S. Miller

Vlad Yasevich
2007-04-26 13:28:03 +0800
bdf3092af [SCTP]: Honor flags when setting peer address parameters ... Browse Code »

Parameters only take effect when a corresponding flag bit is set
and a value is specified. This means we need to check the flags
in addition to checking for non-zero value.

Signed-off-by: Vlad Yasevich
Signed-off-by: David S. Miller

Vlad Yasevich
2007-04-26 13:28:02 +0800
1ae4114dc [SCTP]: Implement SCTP_ADDR_CONFIRMED state for ADDR_CHNAGE event ... Browse Code »

Signed-off-by: Vlad Yasevich
Signed-off-by: David S. Miller

Vlad Yasevich
2007-04-26 13:28:01 +0800
d49d91d79 [SCTP]: Implement SCTP_PARTIAL_DELIVERY_POINT option. ... Browse Code »

This option induces partial delivery to run as soon
as the specified amount of data has been accumulated on
the association. However, we give preference to fully
reassembled messages over PD messages. In any case,
window and buffer is freed up.

Signed-off-by: Vlad Yasevich
Signed-off-by: David S. Miller

Vlad Yasevich
2007-04-26 13:28:00 +0800
b6e1331f3 [SCTP]: Implement SCTP_FRAGMENT_INTERLEAVE socket option ... Browse Code »

This option was introduced in draft-ietf-tsvwg-sctpsocket-13. It
prevents head-of-line blocking in the case of one-to-many endpoint.
Applications enabling this option really must enable SCTP_SNDRCV event
so that they would know where the data belongs. Based on an
earlier patch by Ivan Skytte Jørgensen.

Additionally, this functionality now permits multiple associations
on the same endpoint to enter Partial Delivery. Applications should
be extra careful, when using this functionality, to track EOR indicators.

Signed-off-by: Vlad Yasevich
Signed-off-by: David S. Miller

Vlad Yasevich
2007-04-26 13:27:59 +0800
c95e93950 [NET_SCHED]: qdisc: remove unnecessary memory barriers ... Browse Code »

We're holding dev->queue_lock in qdisc_watchdog_schedule and
qdisc_watchdog_cancel, no need for the barriers.

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-04-26 13:27:58 +0800
a48b5a614 [NET_SCHED]: Unline tcf_destroy ... Browse Code »

Uninline tcf_destroy and add a helper function to destroy an entire filter
chain.

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-04-26 13:27:56 +0800
3bebcda28 [NET_SCHED]: turn PSCHED_GET_TIME into inline function ... Browse Code »

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-04-26 13:27:55 +0800
03cc45c0a [NET_SCHED]: turn PSCHED_TDIFF_SAFE into inline function ... Browse Code »

Also rename to psched_tdiff_bounded.

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-04-26 13:27:54 +0800
8edc0c31d [NET_SCHED]: kill PSCHED_TDIFF ... Browse Code »

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-04-26 13:27:53 +0800
a084980dc [NET_SCHED]: kill PSCHED_SET_PASTPERFECT/PSCHED_IS_PASTPERFECT ... Browse Code »

Use direct assignment and comparison instead.

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-04-26 13:27:51 +0800
104e08789 [NET_SCHED]: kill PSCHED_TLESS ... Browse Code »

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-04-26 13:27:50 +0800
7c59e25f3 [NET_SCHED]: kill PSCHED_TADD/PSCHED_TADD2 ... Browse Code »

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-04-26 13:27:49 +0800
26e252df1 [NET_SCHED]: kill PSCHED_AUDIT_TDIFF ... Browse Code »

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-04-26 13:27:48 +0800
76d643cd3 [NET_SCHED]: sch_netem: fix off-by-one in send time comparison ... Browse Code »

netem checks PSCHED_TLESS(cb->time_to_send, now) to find out whether it is
allowed to send a packet, which is equivalent to cb->time_to_send < now.
Use !PSCHED_TLESS(now, cb->time_to_send) instead to properly handle
cb->time_to_send == now.

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-04-26 13:27:47 +0800
c7bf5f9dc [NETFILTER] nfnetlink: netlink_run_queue() already checks for NLM_F_REQUEST ... Browse Code »

Patrick has made use of netlink_run_queue() in nfnetlink while my patches
have been waiting for net-2.6.22 to open. So this check for NLM_F_REQUEST
can go as well.

Signed-off-by: Thomas Graf
Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Thomas Graf
2007-04-26 13:27:46 +0800
de6e05c49 [NETFILTER]: nf_conntrack: kill destroy() in struct nf_conntrack for diet ... Browse Code »

The destructor per conntrack is unnecessary, then this replaces it with
system wide destructor.

Signed-off-by: Yasuyuki Kozakai
Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Yasuyuki Kozakai
2007-04-26 13:27:45 +0800
5f79e0f91 [NETFILTER]: nf_conntrack: don't use nfct in skb if conntrack is disabled ... Browse Code »

Signed-off-by: Yasuyuki Kozakai
Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Yasuyuki Kozakai
2007-04-26 13:27:44 +0800
e6f689db5 [NETFILTER]: Use setup_timer ... Browse Code »

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-04-26 13:27:43 +0800