04 Jan, 2012

1 commit

  • grp->slot_shift is between 22 and 41, so using 32-bit wide variables is
    probably a typo.

    This could explain the QFQ hangs Dave reported to me after 2^23 packets
    (23 = 64 - 41).
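
    For illustration, a minimal standalone sketch (not the kernel patch
    itself) of why a 32-bit intermediate breaks here: rounding a timestamp
    down to a slot boundary with a shift as large as 41 only works in a
    64-bit type.

    #include <stdint.h>
    #include <stdio.h>

    /* round ts down to a multiple of 2^shift, keeping all 64 bits */
    static uint64_t round_down64(uint64_t ts, unsigned int shift)
    {
            return ts & ~((1ULL << shift) - 1);
    }

    /* buggy variant: the result is truncated to 32 bits, as storing it
     * in a 32-bit wide variable would */
    static uint32_t round_down32(uint64_t ts, unsigned int shift)
    {
            return (uint32_t)(ts & ~((1ULL << shift) - 1));
    }

    int main(void)
    {
            unsigned int shift = 41;  /* upper bound from the message */
            uint64_t ts = 1ULL << 45; /* a plausible virtual timestamp */

            printf("64-bit: %llu\n",
                   (unsigned long long)round_down64(ts, shift));
            printf("32-bit: %u\n", round_down32(ts, shift)); /* prints 0 */
            return 0;
    }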

    Reported-by: Dave Taht
    Signed-off-by: Eric Dumazet
    CC: Stephen Hemminger
    CC: Dave Taht
    Signed-off-by: David S. Miller

    Eric Dumazet
     

25 Dec, 2011

1 commit

  • commit 6373a9a286 (netem: use vmalloc for distribution table) added a
    regression, since vfree() is called while holding a spinlock, with BHs
    disabled.

    Fix this by swapping the pointers inside the critical section and
    freeing the old table after the spinlock is released.

    Also add __GFP_NOWARN to the kmalloc() attempt, since we fall back to
    vmalloc().
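
    A minimal userspace analogue of the swap-then-free pattern described
    above (a pthread spinlock stands in for the qdisc lock; all names are
    illustrative):

    #include <pthread.h>
    #include <stdlib.h>

    /* assumed initialized elsewhere with pthread_spin_init() */
    static pthread_spinlock_t table_lock;
    static int *dist_table;

    void replace_table(int *new_table)
    {
            int *old;

            pthread_spin_lock(&table_lock);
            old = dist_table;
            dist_table = new_table; /* swap inside the critical section */
            pthread_spin_unlock(&table_lock);

            free(old); /* free only after the lock is released */
    }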

    Signed-off-by: Eric Dumazet
    Acked-by: Stephen Hemminger
    Signed-off-by: David S. Miller

    Eric Dumazet
     

02 Dec, 2011

1 commit

  • On Wednesday, 30 November 2011 at 14:36 -0800, Stephen Hemminger wrote:

    > (Almost) nobody uses RED because they can't figure it out.
    > According to Wikipedia, VJ says that:
    > "there are not one, but two bugs in classic RED."

    RED is useful for high-throughput routers; I doubt many linux machines
    act as such devices.

    I was considering adding Adaptive RED (Sally Floyd, Ramakrishna
    Gummadi, Scott Shenker, August 2001).

    In this version, maxp is dynamic (from 1% to 50%), and the user only
    has to set min_th (the target average queue size); max_th and wq
    (burst in linux RED) are set up automatically.

    By the way, it seems we have a small bug in red_change():

    if (skb_queue_empty(&sch->q))
            red_end_of_idle_period(&q->parms);

    First, if the queue is empty, we should call
    red_start_of_idle_period(&q->parms) instead.

    Second, since we no longer use sch->q but q->qdisc, the test is
    meaningless.

    Oh well...

    [PATCH] sch_red: fix red_change()

    Now that RED is classful, we must check q->qdisc->q.qlen, and if the
    queue is empty, we start an idle period rather than ending it.
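
    A hedged sketch of the corrected check, per the description above:

    if (q->qdisc->q.qlen == 0)
            red_start_of_idle_period(&q->parms);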

    Signed-off-by: Eric Dumazet
    Signed-off-by: David S. Miller

    Eric Dumazet
     

01 Dec, 2011

1 commit

  • We need rcu_read_lock() protection before using dst_get_neighbour(), and
    we must cache its value (pass it to __teql_resolve()).

    teql_master_xmit() is called under rcu_read_lock_bh() protection, which
    is not enough.
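
    A minimal sketch of the pattern the message describes (the extra
    parameter to __teql_resolve() is assumed from the text, not copied
    from the patch):

    rcu_read_lock();
    n = dst_get_neighbour(dst); /* cache the value under rcu protection */
    res = __teql_resolve(skb, skb_res, dev, n);
    rcu_read_unlock();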

    Signed-off-by: Eric Dumazet
    Signed-off-by: David S. Miller

    Eric Dumazet
     

25 Oct, 2011

1 commit

  • Dan Siemon would like to add tunnelling support to cls_flow

    This preliminary patch introduces the use of skb_header_pointer() to
    help with this task, while avoiding skb head reallocation caused by
    deep packet inspection.
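
    For context, a minimal sketch of the skb_header_pointer() pattern
    (nhoff, the network header offset, is illustrative): the header bytes
    are copied into a stack buffer when they are not in the linear area,
    instead of forcing a reallocation of the skb head.

    struct iphdr _iph;
    const struct iphdr *iph;

    iph = skb_header_pointer(skb, nhoff, sizeof(_iph), &_iph);
    if (iph == NULL)
            return 0; /* truncated packet */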

    Signed-off-by: Eric Dumazet
    Signed-off-by: David S. Miller

    Eric Dumazet
     

22 Sep, 2011

1 commit

  • Conflicts:
    MAINTAINERS
    drivers/net/Kconfig
    drivers/net/ethernet/broadcom/bnx2x/bnx2x_link.c
    drivers/net/ethernet/broadcom/tg3.c
    drivers/net/wireless/iwlwifi/iwl-pci.c
    drivers/net/wireless/iwlwifi/iwl-trans-tx-pcie.c
    drivers/net/wireless/rt2x00/rt2800usb.c
    drivers/net/wireless/wl12xx/main.c

    David S. Miller
     

10 Aug, 2011

1 commit

  • commit 07bd8df5df4369487812bf85a237322ff3569b77
    (sch_sfq: fix peek() implementation) changed sfq to use the generic
    peek helper.

    This makes HFSC complain about a non-work-conserving child qdisc if
    prio with an sfq child is used within hfsc:

    hfsc peeks into the prio qdisc, which then peeks into sfq. The
    returned skb is stashed in sch->gso_skb.

    Next, hfsc tries to dequeue from prio, but prio calls sfq's dequeue
    directly, which may return NULL instead of the previously peeked-at
    skb.

    Have prio call qdisc_dequeue_peeked() instead, so sfq's dequeue() is
    not called in this case.
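
    A hedged sketch of what qdisc_dequeue_peeked() does (the body is
    illustrative, not the exact kernel helper): consume the skb stashed by
    a previous peek before falling back to ->dequeue().

    static struct sk_buff *dequeue_peeked(struct Qdisc *sch)
    {
            struct sk_buff *skb = sch->gso_skb;

            if (skb) {
                    sch->gso_skb = NULL; /* hand out the peeked skb */
                    sch->q.qlen--;
            } else {
                    skb = sch->dequeue(sch);
            }
            return skb;
    }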

    Cc: Eric Dumazet
    Signed-off-by: Florian Westphal
    Signed-off-by: David S. Miller

    Florian Westphal
     

01 Aug, 2011

1 commit

  • commit 8efa88540635 (sch_sfq: avoid giving spurious NET_XMIT_CN signals)
    forgot to call qdisc_tree_decrease_qlen() to signal upper levels that a
    packet (from another flow) was dropped, leading to various problems.

    With help from Michal Soltys and Michal Pokrywka, who did a bisection.

    Bugzilla ref: https://bugzilla.kernel.org/show_bug.cgi?id=39372
    Debian ref: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=631945
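
    A hedged sketch of the missing bookkeeping (the condition and slot
    names are illustrative): when sfq drops a packet from a different flow
    while still queuing ours, parent qdiscs must be told that the tree's
    qlen shrank.

    if (dropped_slot != this_slot) {
            /* tell upper levels one packet left the tree */
            qdisc_tree_decrease_qlen(sch, 1);
            return NET_XMIT_SUCCESS; /* our own packet was queued */
    }
    return NET_XMIT_CN;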

    Reported-by: Lucas Bocchi
    Reported-and-bisected-by: Michal Pokrywka
    Signed-off-by: Eric Dumazet
    CC: Michal Soltys
    Acked-by: Patrick McHardy
    Signed-off-by: David S. Miller

    Eric Dumazet
     

27 Jun, 2011

1 commit

  • Results on a dummy device can be seen in my netconf 2011 slides. The
    results below are for a 10GigE Intel IXGBE nic on another i5 machine
    with very similar specs to the one used in the netconf 2011 results.
    It turns out this is a hell of a lot worse than dummy, so this patch
    is even more beneficial for 10G.

    Test setup:
    ----------

    The system under test sends packets out; an additional box connected
    directly drops the packets. A prio qdisc was installed on the eth
    device, with the default netdev queue length of 1000 used as-is. Each
    of the 3 prio bands was set to 100 (this did not factor into the
    results).

    5 packet runs were made and the middle 3 picked.

    results
    -------

    The "cpu" column indicates the which cpu the sample
    was taken on,
    The "Pkt runx" carries the number of packets a cpu
    dequeued when forced to be in the "dequeuer" role.
    The "avg" for each run is the number of times each
    cpu should be a "dequeuer" if the system was fair.

    3.0-rc4 (plain)
    cpu Pkt run1 Pkt run2 Pkt run3
    ================================================
    cpu0 21853354 21598183 22199900
    cpu1 431058 473476 393159
    cpu2 481975 477529 458466
    cpu3 23261406 23412299 22894315
    avg 11506948 11490372 11486460

    3.0-rc4 with patch and default weight 64
    cpu Pkt run1 Pkt run2 Pkt run3
    ================================================
    cpu0 13205312 13109359 13132333
    cpu1 10189914 10159127 10122270
    cpu2 10213871 10124367 10168722
    cpu3 13165760 13164767 13096705
    avg 11693714 11639405 11630008

    As you can see the system is still not perfect but
    is a lot better than what it was before...

    At the moment we use the old backlog weight, weight_p, which is 64
    packets. It seems reasonably fine at that value. The system could be
    made fairer by reducing weight_p (as per my presentation), but that
    would also affect the shared backlog weight. Unless deemed necessary,
    I think the default value is fine; if not, we could add yet another
    knob.
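
    A condensed sketch of the bounded dequeue loop described above
    (weight_p is real; the rest is illustrative, not the exact patch):

    void __qdisc_run_sketch(struct Qdisc *q)
    {
            int quota = weight_p; /* 64 packets by default */

            while (qdisc_restart(q)) {
                    /* yield the "dequeuer" role once the quota is
                     * spent, so other cpus get a fair turn */
                    if (--quota <= 0) {
                            __netif_schedule(q);
                            break;
                    }
            }
    }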

    Signed-off-by: Jamal Hadi Salim
    Signed-off-by: David S. Miller

    jamal
     

22 Jun, 2011

2 commits

  • There are enough instances of this:

    iph->frag_off & htons(IP_MF | IP_OFFSET)

    that a helper function is probably warranted.
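
    Such a helper would plausibly look like this (a sketch; the kernel
    ended up naming it ip_is_fragment()):

    static inline bool ip_is_fragment(const struct iphdr *iph)
    {
            /* true if More-Fragments is set or the fragment offset is
             * non-zero, i.e. this is any fragment of a datagram */
            return (iph->frag_off & htons(IP_MF | IP_OFFSET)) != 0;
    }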

    Signed-off-by: Paul Gortmaker
    Signed-off-by: David S. Miller

    Paul Gortmaker
     
  • Remove the linux/mm.h inclusion from netdevice.h -- it's unused (I've
    checked manually).

    To prevent mm.h inclusion via other channels, also extract the
    "enum dma_data_direction" definition into a separate header. This tiny
    piece is what glues netdevice.h to mm.h via the chain
    "netdevice.h => dmaengine.h => dma-mapping.h => scatterlist.h => mm.h".
    Removing mm.h from scatterlist.h was tried and found not feasible on
    most archs, so the link was cut earlier in the chain.

    Hopefully people are OK with a tiny include file.

    Note that mm_types.h is still dragged in, but that is a separate story.
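
    The extracted header really is tiny; a sketch of it (the kernel named
    it linux/dma-direction.h, and these are the standard DMA API values):

    #ifndef _LINUX_DMA_DIRECTION_H
    #define _LINUX_DMA_DIRECTION_H

    enum dma_data_direction {
            DMA_BIDIRECTIONAL = 0,
            DMA_TO_DEVICE = 1,
            DMA_FROM_DEVICE = 2,
            DMA_NONE = 3,
    };

    #endif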

    Signed-off-by: Alexey Dobriyan
    Signed-off-by: David S. Miller

    Alexey Dobriyan
     

10 Jun, 2011

1 commit

  • The message size allocated for rtnl ifinfo dumps was limited to
    a single page. This is not enough for additional interface info
    available with devices that support SR-IOV and caused a bug in
    which VF info would not be displayed if more than approximately
    40 VFs were created per interface.

    Implement a new function pointer for the rtnl_register service that will
    calculate the amount of data required for the ifinfo dump and allocate
    enough data to satisfy the request.
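
    A hedged sketch of the idea (the callback type and its use are
    assumptions based on the description, not copied from the patch):

    /* per-family callback reporting the minimum dump allocation */
    typedef u16 (*rtnl_calcit_func)(struct sk_buff *skb,
                                    struct nlmsghdr *nlh);

    /* in the dump path, size the reply from the callback instead of
     * assuming a single page is always enough */
    min_dump_alloc = calcit ? calcit(skb, nlh) : 0;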

    Signed-off-by: Greg Rose
    Signed-off-by: Jeff Kirsher

    Greg Rose
     

26 May, 2011

1 commit

  • Since commit eeaeb068f139 (sch_sfq: allow big packets and be fair),
    sfq_peek() can return a different skb than the one that would normally
    be dequeued by sfq_dequeue() [ if the current slot->allot is negative ].

    Use the generic qdisc_peek_dequeued() instead of the custom
    implementation, to get consistent results.
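
    A hedged sketch of how the generic helper keeps peek and dequeue
    consistent (illustrative body, not the exact kernel source): it
    dequeues once and stashes the skb in sch->gso_skb, so a later dequeue
    returns exactly what was peeked.

    static struct sk_buff *peek_dequeued(struct Qdisc *sch)
    {
            if (!sch->gso_skb) {
                    sch->gso_skb = sch->dequeue(sch);
                    if (sch->gso_skb)
                            sch->q.qlen++; /* still counted as queued */
            }
            return sch->gso_skb; /* may be NULL */
    }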

    Signed-off-by: Eric Dumazet
    CC: Jarek Poplawski
    CC: Patrick McHardy
    CC: Jesper Dangaard Brouer
    Signed-off-by: David S. Miller

    Eric Dumazet
     

24 May, 2011

1 commit

  • While chasing a possible net_sched bug, I found that IP fragments have
    little chance of passing through a congested SFQ qdisc:

    - Say the SFQ qdisc is full because one flow is non-responsive.
    - ip_fragment() wants to send two fragments belonging to an idle flow.
    - sfq_enqueue() queues the first packet, but sees the queue limit
      reached.
    - sfq_enqueue() drops one packet from the 'big consumer' and returns
      NET_XMIT_CN.
    - ip_fragment() cancels the remaining fragments.

    This patch restores fairness, making sure we return NET_XMIT_CN only if
    we dropped a packet from the same flow.
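
    A hedged sketch of the rule this introduces in sfq_enqueue() (the
    variables and helper are illustrative):

    x = hash(skb);                   /* slot our packet went into */
    d = drop_from_biggest_flow(sch); /* slot a packet was dropped from */
    if (d == x)
            return NET_XMIT_CN;      /* our own flow was hit */
    return NET_XMIT_SUCCESS;         /* another flow paid; ours is queued */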

    Signed-off-by: Eric Dumazet
    CC: Patrick McHardy
    CC: Jarek Poplawski
    CC: Jamal Hadi Salim
    CC: Stephen Hemminger
    Signed-off-by: David S. Miller

    Eric Dumazet
     

23 May, 2011

1 commit

  • dev_deactivate_many() issues one synchronize_rcu() call after the
    qdiscs are set to noop_qdisc.

    This call is here to make sure there are no outstanding qdisc-less
    dev_queue_xmit calls before returning to the caller.

    But in the dismantle phase, we don't have to wait, because we won't
    activate the device again, and we are going to wait one rcu grace
    period later in rollback_registered_many().

    After this patch, device dismantle uses only one synchronize_net() and
    one rcu_barrier() call, so we get a ~30% speedup and smaller RTNL
    latency.
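
    A hedged sketch of the resulting condition (the dismantle flag name is
    an assumption):

    /* wait for outstanding qdisc-less dev_queue_xmit calls only if the
     * device may run again; a dismantled device gets its rcu grace
     * period later, in rollback_registered_many() */
    if (!dev->dismantle)
            synchronize_net();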

    Signed-off-by: Eric Dumazet
    CC: Patrick McHardy
    CC: Ben Greear
    Signed-off-by: David S. Miller

    Eric Dumazet
     

21 May, 2011

1 commit

  • * git://git.kernel.org/pub/scm/linux/kernel/git/davem/net-next-2.6: (1446 commits)
    macvlan: fix panic if lowerdev in a bond
    tg3: Add braces around 5906 workaround.
    tg3: Fix NETIF_F_LOOPBACK error
    macvlan: remove one synchronize_rcu() call
    networking: NET_CLS_ROUTE4 depends on INET
    irda: Fix error propagation in ircomm_lmp_connect_response()
    irda: Kill set but unused variable 'bytes' in irlan_check_command_param()
    irda: Kill set but unused variable 'clen' in ircomm_connect_indication()
    rxrpc: Fix set but unused variable 'usage' in rxrpc_get_transport()
    be2net: Kill set but unused variable 'req' in lancer_fw_download()
    irda: Kill set but unused vars 'saddr' and 'daddr' in irlan_provider_connect_indication()
    atl1c: atl1c_resume() is only used when CONFIG_PM_SLEEP is defined.
    rxrpc: Fix set but unused variable 'usage' in rxrpc_get_peer().
    rxrpc: Kill set but unused variable 'local' in rxrpc_UDP_error_handler()
    rxrpc: Kill set but unused variable 'sp' in rxrpc_process_connection()
    rxrpc: Kill set but unused variable 'sp' in rxrpc_rotate_tx_window()
    pkt_sched: Kill set but unused variable 'protocol' in tc_classify()
    isdn: capi: Use pr_debug() instead of ifdefs.
    tg3: Update version to 3.119
    tg3: Apply rx_discards fix to 5719/5720
    ...

    Fix up trivial conflicts in arch/x86/Kconfig and net/mac80211/agg-tx.c
    as per Davem.

    Linus Torvalds
     

05 Apr, 2011

1 commit

  • This is an implementation of the Quick Fair Queue scheduler developed
    by Fabio Checconi. The same algorithm is already implemented in ipfw
    in FreeBSD. Fabio had an earlier version developed on Linux; I just
    cleaned it up. Thanks to Eric Dumazet for testing this under load.

    Signed-off-by: Stephen Hemminger
    Signed-off-by: Eric Dumazet
    Signed-off-by: David S. Miller

    stephen hemminger
     
