Eric Lee / smarc-fsl-linux-kernel

15 Jul, 2007

6 commits

c3bc7cff8 [NET_SCHED]: Kill CONFIG_NET_CLS_POLICE ... Browse Code »

The NET_CLS_ACT option is now a full replacement for NET_CLS_POLICE,
remove the old code. The config option will be kept around to select
the equivalent NET_CLS_ACT options for a short time to allow easier
upgrades.

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-07-15 15:03:05 +0800
73ca4918f [NET_SCHED]: act_api: qdisc internal reclassify support ... Browse Code »

The behaviour of NET_CLS_POLICE for TC_POLICE_RECLASSIFY was to return
it to the qdisc, which could handle it internally or ignore it. With
NET_CLS_ACT however, tc_classify starts over at the first classifier
and never returns it to the qdisc. This makes it impossible to support
qdisc-internal reclassification, which in turn makes it impossible to
remove the old NET_CLS_POLICE code without breaking compatibility since
we have two qdiscs (CBQ and ATM) that support this.

This patch adds a tc_classify_compat function that handles
reclassification the old way and changes CBQ and ATM to use it.

This again is of course not fully backwards compatible with the previous
NET_CLS_ACT behaviour. Unfortunately there is no way to fully maintain
compatibility *and* support qdisc internal reclassification with
NET_CLS_ACT, but this seems like the better choice over keeping the two
incompatible options around forever.

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-07-15 15:02:31 +0800
f6853e2df [NET_SCHED]: sch_dsmark: act_api support ... Browse Code »

Handle act_api classification results.

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-07-15 15:02:10 +0800
921008044 [NET_SCHED]: sch_atm: act_api support ... Browse Code »

Handle act_api classification results.

The ATM scheduler behaves slightly different than other schedulers
in that it only handles policer results for successful classifications,
this behaviour is retained for the act_api case.

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-07-15 15:01:49 +0800
b0188d4db [NET_SCHED]: sch_atm: Lindent ... Browse Code »

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-07-15 15:01:25 +0800
0621ed2e4 [NET_SCHED]: Revert "avoid transmit softirq on watchdog wakeup" optimization ... Browse Code »

As noticed by Ranko Zivojnovic , calling qdisc_run
from the timer handler can result in deadlock:

> CPU#0
>
> qdisc_watchdog() fires and gets dev->queue_lock
> qdisc_run()...qdisc_restart()...
> -> releases dev->queue_lock and enters dev_hard_start_xmit()
>
> CPU#1
>
> tc del qdisc dev ...
> qdisc_graft()...dev_graft_qdisc()...dev_deactivate()...
> -> grabs dev->queue_lock ...
>
> qdisc_reset()...{cbq,hfsc,htb,netem,tbf}_reset()...qdisc_watchdog_cancel()...
> -> hrtimer_cancel() - waiting for the qdisc_watchdog() to exit, while still
> holding dev->queue_lock
>
> CPU#0
>
> dev_hard_start_xmit() returns ...
> -> wants to get dev->queue_lock(!)
>
> DEADLOCK!

The entire optimization is a bit questionable IMO, it moves potentially
large parts of NET_TX_SOFTIRQ work to TIMER_SOFTIRQ/HRTIMER_SOFTIRQ,
which kind of defeats the separation of them.

Signed-off-by: Patrick McHardy
Acked-by: Ranko Zivojnovic
Signed-off-by: David S. Miller

Patrick McHardy
2007-07-15 11:49:26 +0800

12 Jul, 2007

1 commit

db3d99c09 [NET_SCHED]: ematch: module autoloading ... Browse Code »

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-07-12 10:46:26 +0800

11 Jul, 2007

10 commits

c9726d689 [NET_SCHED]: Make HTB scheduler work with TSO. ... Browse Code »

Currently the HTB scheduler does not correctly account for TSO packets
which causes large inaccuracies in the bandwidth control when using TSO.
This patch allows the HTB scheduler to work with TSO enabled devices.

Signed-off-by: Ranjit Manomohan
Signed-off-by: David S. Miller

Ranjit Manomohan
2007-07-11 13:43:16 +0800
0ba480538 [NET_SCHED]: Remove unnecessary includes ... Browse Code »

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-07-11 13:16:41 +0800
ee39e10c2 [NET_SCHED]: sch_htb: use generic estimator ... Browse Code »

Use the generic estimator instead of reimplementing (parts of) it.
For compatibility always create a default estimator for new classes.

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-07-11 13:16:39 +0800
4bdf39911 [NET_SCHED]: Remove unnecessary stats_lock pointers ... Browse Code »

Remove stats_lock pointers from qdisc-internal structures, in all cases
it points to dev->queue_lock. The only case where it is necessary is for
top-level qdiscs, where it might also point to dev->ingress_lock in case
of the ingress qdisc. Also remove it from actions completely, it always
points to the actions internal lock.

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-07-11 13:16:38 +0800
876d48aab [NET_SCHED]: Remove CONFIG_NET_ESTIMATOR option ... Browse Code »

The generic estimator is always built in anways and all the config options
does is prevent including a minimal amount of code for setting it up.
Additionally the option is already automatically selected for most cases.

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-07-11 13:16:37 +0800
d62733c8e [SCHED]: Qdisc changes and sch_rr added for multiqueue ... Browse Code »

Add the new sch_rr qdisc for multiqueue network device support. Allow
sch_prio and sch_rr to be compiled with or without multiqueue hardware
support.

sch_rr is part of sch_prio, and is referenced from MODULE_ALIAS. This
was done since sch_prio and sch_rr only differ in their dequeue
routine.

Signed-off-by: Peter P Waskiewicz Jr
Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Peter P Waskiewicz Jr
2007-07-11 13:16:22 +0800
f25f4e448 [CORE] Stack changes to add multiqueue hardware support API ... Browse Code »

Add the multiqueue hardware device support API to the core network
stack. Allow drivers to allocate multiple queues and manage them at
the netdev level if they choose to do so.

Added a new field to sk_buff, namely queue_mapping, for drivers to
know which tx_ring to select based on OS classification of the flow.

Signed-off-by: Peter P Waskiewicz Jr
Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Peter P Waskiewicz Jr
2007-07-11 13:16:21 +0800
e50c41b53 [NET]: qdisc_restart - couple of optimizations. ... Browse Code »

Changes :

- netif_queue_stopped need not be called inside qdisc_restart as
it has been called already in qdisc_run() before the first skb
is sent, and in __qdisc_run() after each intermediate skb is
sent (note : we are the only sender, so the queue cannot get
stopped while the tx lock was got in the ~LLTX case).

- BUG_ON((int) q->q.qlen < 0) was a relic from old times when -1
meant more packets are available, and __qdisc_run used to loop
when qdisc_restart() returned -1. During those days, it was
necessary to make sure that qlen is never less than zero, since
__qdisc_run would get into an infinite loop if no packets are on
the queue and this bug in qdisc was there (and worse - no more
skbs could ever get queue'd as we hold the queue lock too). With
Herbert's recent change to return values, this check is not
required. Hopefully Herbert can validate this change. If at all
this is required, it should be added to skb_dequeue (in failure
case), and not to qdisc_qlen.

Signed-off-by: Krishna Kumar
Signed-off-by: David S. Miller

Krishna Kumar
2007-07-11 13:15:36 +0800
6c1361a6f [NET]: qdisc_restart - readability changes plus one bug fix. ... Browse Code »

New changes :

- Incorporated Peter Waskiewicz's comments.
- Re-added back one warning message (on driver returning wrong value).

Previous changes :

- Converted to use switch/case code which looks neater.

- "if (ret == NETDEV_TX_LOCKED && lockless)" is buggy, and the lockless
check should be removed, since driver will return NETDEV_TX_LOCKED only
if lockless is true and driver has to do the locking. In the original
code as well as the latest code, this code can result in a bug where
if LLTX is not set for a driver (lockless == 0) but the driver is written
wrongly to do a trylock (despite LLTX being set), the driver returns
LOCKED. But since lockless is zero, the packet is requeue'd instead of
calling collision code which will issue warning and free up the skb.
Instead this skb will be retried with this driver next time, and the same
result will ensue. Removing this check will catch these driver bugs instead
of hiding the problem. I am keeping this change to readability section
since :
a. it is confusing to check two things as it is; and
b. it is difficult to keep this check in the changed 'switch' code.

- Changed some names, like try_get_tx_pkt to dev_dequeue_skb (as that is
the work being done and easier to understand) and do_dev_requeue to
dev_requeue_skb, merged handle_dev_cpu_collision and tx_islocked to
dev_handle_collision (handle_dev_cpu_collision is a small routine with only
one caller, so there is no need to have two separate routines which also
results in getting rid of two macros, etc.

- Removed an XXX comment as it should never fail (I suspect this was related
to batch skb WIP, Jamal ?). Converted some functions to original coding
style of having the return values and the function name on same line, eg
prio2list.

Signed-off-by: Krishna Kumar
Signed-off-by: David S. Miller

Krishna Kumar
2007-07-11 13:15:35 +0800
c716a81ab [NET_SCHED]: Cleanup readability of qdisc restart ... Browse Code »

Over the years this code has gotten hairier. Resulting in many long
discussions over long summer days and patches that get it wrong.
This patch helps tame that code so normal people will understand it.

Thanks to Thomas Graf, Peter J. waskiewicz Jr, and Patrick McHardy
for their valuable reviews.

Signed-off-by: Jamal Hadi Salim
Signed-off-by: David S. Miller

Jamal Hadi Salim
2007-07-11 13:06:16 +0800

08 Jun, 2007

1 commit

b00b4bf94 [NET_SCHED]: Fix filter double free ... Browse Code »

cbq and atm destroy their filters twice when destroying inner classes
during qdisc destruction.

Reported-and-tested-by: Strobl Anton

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-06-08 04:41:05 +0800

04 Jun, 2007

2 commits

75202e768 [NET]: Fix comparisons of unsigned < 0. ... Browse Code »

Recent gcc versions emit warnings when unsigned variables are
compared < 0 or >= 0.

Signed-off-by: Bill Nottingham
Signed-off-by: David S. Miller

Bill Nottingham
2007-06-04 09:08:47 +0800
60468d5b5 [NET]: Make net watchdog timers 1 sec jiffy aligned. ... Browse Code »

round_jiffies for net dev watchdog timer.

Signed-off-by: Venkatesh Pallipadi
Signed-off-by: David S. Miller

Venkatesh Pallipadi
2007-06-04 09:08:46 +0800

25 May, 2007

2 commits

2e4b3b0e8 [NET_SCHED]: sch_htb: fix event cache time calculation ... Browse Code »

The event cache time must be an absolute value, when no event exists
it is incorrectly set to 1s instead of 1s in the future.

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-05-25 07:36:56 +0800
36247f542 [NET_SCHED]: Fix qdisc_restart return value when dequeue is empty ... Browse Code »

My previous patch that changed the return value of qdisc_restart
incorrectly made the case where dequeue returns empty continue
processing packets.

This patch is based on diagnosis and fix by Patrick McHardy.

Reported-and-debugged-by: Anant Nitya

Signed-off-by: Herbert Xu
Signed-off-by: David S. Miller

Herbert Xu
2007-05-25 07:36:43 +0800

14 May, 2007

1 commit

3e5c2d3bd [NET_SCHED]: prio qdisc boundary condition ... Browse Code »

This fixes an out-of-boundary condition when the classified
band equals q->bands. Caught by Alexey

Signed-off-by: Jamal Hadi Salim
Signed-off-by: David S. Miller

Jamal Hadi Salim
2007-05-14 17:57:19 +0800

11 May, 2007

5 commits

41a23b078 [NET_SCHED]: Avoid requeue warning on dev_deactivate ... Browse Code »

When we relinquish queue_lock in qdisc_restart and then retake it for
requeueing, we might race against dev_deactivate and end up requeueing
onto noop_qdisc. This causes a warning to be printed.

This patch fixes this by checking this before we requeue. As an added
bonus, we can remove the same check in __qdisc_run which was added to
prevent dev->gso_skb from being requeued when we're shutting down.

Even though we've had to add a new conditional in its place, it's better
because it only happens on requeues rather than every single time that
qdisc_run is called.

For this to work we also need to move the clearing of gso_skb up in
dev_deactivate as now qdisc_restart can occur even after we wait for
__LINK_STATE_QDISC_RUNNING to clear (but it won't do anything as long
as the queue and gso_skb is already clear).

Signed-off-by: Herbert Xu
Signed-off-by: David S. Miller

Herbert Xu
2007-05-11 14:47:42 +0800
cce1fa36a [NET_SCHED]: Reread dev->qdisc for NETDEV_TX_OK ... Browse Code »

Now that we return the queue length after NETDEV_TX_OK we better
make sure that we have the right queue. Otherwise we can cause a
stall after a really quick dev_deactive/dev_activate.

Signed-off-by: Herbert Xu
Signed-off-by: David S. Miller

Herbert Xu
2007-05-11 14:47:41 +0800
d90df3ad0 [NET_SCHED]: Rationalise return value of qdisc_restart ... Browse Code »

The current return value scheme and associated comment was invented
back in the 20th century when we still had that tbusy flag. Things
have changed quite a bit since then (even Tony Blair is moving on
now, not to mention the new French president).

All we need to indicate now is whether the caller should continue
processing the queue. Therefore it's sufficient if we return 0 if
we want to stop and non-zero otherwise.

This is based on a patch by Krishna Kumar.

Signed-off-by: Herbert Xu
Signed-off-by: David S. Miller

Herbert Xu
2007-05-11 14:47:40 +0800
5830725f8 [NET]: Fix dev->qdisc race for NETDEV_TX_LOCKED case ... Browse Code »

When transmit fails with NETDEV_TX_LOCKED the skb is requeued
to dev->qdisc again. The dev->qdisc pointer is protected by
the queue lock which needs to be dropped when attempting to
transmit and acquired again before requeing. The problem is
that qdisc_restart() fetches the dev->qdisc pointer once and
stores it in the `q' variable which is invalidated when
dropping the queue_lock, therefore the variable needs to be
refreshed before requeueing.

Signed-off-by: Thomas Graf
Signed-off-by: David S. Miller

Thomas Graf
2007-05-11 14:47:39 +0800
4cd8c9e87 [NET_SCHED]: teql_enqueue can check limits before skb enqueue ... Browse Code »

Optimize teql_enqueue so that it first checks limits before enqueing.

Signed-off-by: Krishna Kumar
Signed-off-by: David S. Miller

Krishna Kumar
2007-05-11 14:45:10 +0800

04 May, 2007

1 commit

7562f876c [NET]: Rework dev_base via list_head (v3) ... Browse Code »

Cleanup of dev_base list use, with the aim to simplify making device
list per-namespace. In almost every occasion, use of dev_base variable
and dev->next pointer could be easily replaced by for_each_netdev
loop. A few most complicated places were converted to using
first_netdev()/next_netdev().

Signed-off-by: Pavel Emelianov
Acked-by: Kirill Korotaev
Signed-off-by: David S. Miller

Pavel Emelianov
2007-05-04 06:13:45 +0800

26 Apr, 2007

11 commits

3ff50b799 [NET]: cleanup extra semicolons ... Browse Code »

Spring cleaning time...

There seems to be a lot of places in the network code that have
extra bogus semicolons after conditionals. Most commonly is a
bogus semicolon after: switch() { }

Signed-off-by: Stephen Hemminger
Signed-off-by: David S. Miller

Stephen Hemminger
2007-04-26 13:29:24 +0800
fd44de7cc [NET_SCHED]: ingress: switch back to using ingress_lock ... Browse Code »

Switch ingress queueing back to use ingress_lock. qdisc_lock_tree now locks
both the ingress and egress qdiscs on the device. All changes to data that
might be used on both ingress and egress needs to be protected by using
qdisc_lock_tree instead of manually taking dev->queue_lock. Additionally
the qdisc stats_lock needs to be initialized to ingress_lock for ingress
qdiscs.

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-04-26 13:29:08 +0800
0463d4ae2 [NET_SCHED]: Eliminate qdisc_tree_lock ... Browse Code »

Since we're now holding the rtnl during the entire dump operation, we
can remove qdisc_tree_lock, whose only purpose is to protect dump
callbacks from concurrent changes to the qdisc tree.

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-04-26 13:29:07 +0800
c95e93950 [NET_SCHED]: qdisc: remove unnecessary memory barriers ... Browse Code »

We're holding dev->queue_lock in qdisc_watchdog_schedule and
qdisc_watchdog_cancel, no need for the barriers.

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-04-26 13:27:58 +0800
a48b5a614 [NET_SCHED]: Unline tcf_destroy ... Browse Code »

Uninline tcf_destroy and add a helper function to destroy an entire filter
chain.

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-04-26 13:27:56 +0800
3bebcda28 [NET_SCHED]: turn PSCHED_GET_TIME into inline function ... Browse Code »

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-04-26 13:27:55 +0800
03cc45c0a [NET_SCHED]: turn PSCHED_TDIFF_SAFE into inline function ... Browse Code »

Also rename to psched_tdiff_bounded.

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-04-26 13:27:54 +0800
8edc0c31d [NET_SCHED]: kill PSCHED_TDIFF ... Browse Code »

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-04-26 13:27:53 +0800
a084980dc [NET_SCHED]: kill PSCHED_SET_PASTPERFECT/PSCHED_IS_PASTPERFECT ... Browse Code »

Use direct assignment and comparison instead.

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-04-26 13:27:51 +0800
104e08789 [NET_SCHED]: kill PSCHED_TLESS ... Browse Code »

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-04-26 13:27:50 +0800
7c59e25f3 [NET_SCHED]: kill PSCHED_TADD/PSCHED_TADD2 ... Browse Code »

Signed-off-by: Patrick McHardy
Signed-off-by: David S. Miller

Patrick McHardy
2007-04-26 13:27:49 +0800