Eric Lee / smarc-fsl-linux-kernel

10 Mar, 2020

1 commit

3f95f55eb net: sched: pie: change tc_pie_xstats->prob ... Browse Code »

Commit 105e808c1da2 ("pie: remove pie_vars->accu_prob_overflows")
changes the scale of probability values in PIE from (2^64 - 1) to
(2^56 - 1). This affects the precision of tc_pie_xstats->prob in
user space.

This patch ensures user space is unaffected.

Suggested-by: Eric Dumazet
Signed-off-by: Leslie Monis
Signed-off-by: David S. Miller

Leslie Monis
2020-03-10 09:05:55 +0800

05 Mar, 2020

3 commits

105e808c1 pie: remove pie_vars->accu_prob_overflows ... Browse Code »

The variable pie_vars->accu_prob is used as an accumulator for
probability values. Since probabilty values are scaled using the
MAX_PROB macro denoting (2^64 - 1), pie_vars->accu_prob is
likely to overflow as it is of type u64.

The variable pie_vars->accu_prob_overflows counts the number of
times the variable pie_vars->accu_prob overflows.

The MAX_PROB macro needs to be equal to at least (2^39 - 1) in
order to do precise calculations without any underflow. Thus
MAX_PROB can be reduced to (2^56 - 1) without affecting the
precision in calculations drastically. Doing so will eliminate
the need for the variable pie_vars->accu_prob_overflows as the
variable pie_vars->accu_prob will never overflow.

Removing the variable pie_vars->accu_prob_overflows also reduces
the size of the structure pie_vars to exactly 64 bytes.

Signed-off-by: Mohit P. Tahiliani
Signed-off-by: Gautam Ramakrishnan
Signed-off-by: Leslie Monis
Signed-off-by: David S. Miller

Leslie Monis
2020-03-05 05:25:55 +0800
220d4ac74 pie: remove unnecessary type casting ... Browse Code »

In function pie_calculate_probability(), the variables alpha and
beta are of type u64. The variables qdelay, qdelay_old and
params->target are of type psched_time_t (which is also u64).
The explicit type casting done when calculating the value for
the variable delta is redundant and not required.

Signed-off-by: Mohit P. Tahiliani
Signed-off-by: Gautam Ramakrishnan
Signed-off-by: Leslie Monis
Signed-off-by: David S. Miller

Leslie Monis
2020-03-05 05:25:55 +0800
90baeb9dd pie: use term backlog instead of qlen ... Browse Code »

Remove ambiguity by using the term backlog instead of qlen when
representing the queue length in bytes.

Signed-off-by: Mohit P. Tahiliani
Signed-off-by: Gautam Ramakrishnan
Signed-off-by: Leslie Monis
Signed-off-by: David S. Miller

Leslie Monis
2020-03-05 05:25:55 +0800

23 Jan, 2020

5 commits

5205ea00c net: sched: pie: export symbols to be reused by FQ-PIE ... Browse Code »

This patch makes the drop_early(), calculate_probability() and
pie_process_dequeue() functions generic enough to be used by
both PIE and FQ-PIE (to be added in a future commit). The major
change here is in the way the functions take in arguments. This
patch exports these functions and makes FQ-PIE dependent on
sch_pie.

Signed-off-by: Mohit P. Tahiliani
Signed-off-by: Leslie Monis
Signed-off-by: Gautam Ramakrishnan
Signed-off-by: David S. Miller

Mohit P. Tahiliani
2020-01-23 18:38:31 +0800
00ea2fb72 net: sched: pie: fix alignment in struct instances ... Browse Code »

Make the alignment in the initialization of the struct instances
consistent in the file.

Signed-off-by: Mohit P. Tahiliani
Signed-off-by: Leslie Monis
Signed-off-by: Gautam Ramakrishnan
Signed-off-by: David S. Miller

Mohit P. Tahiliani
2020-01-23 18:38:31 +0800
55f780c4a net: sched: pie: fix commenting ... Browse Code »

Fix punctuation and logical mistakes in the comments. The
logical mistake was that "dequeue_rate" is no longer the default
way to calculate queuing delay and is not needed. The default
way to calculate queue delay was changed in commit cec2975f2b70
("net: sched: pie: enable timestamp based delay calculation").

Signed-off-by: Mohit P. Tahiliani
Signed-off-by: Leslie Monis
Signed-off-by: Gautam Ramakrishnan
Signed-off-by: David S. Miller

Mohit P. Tahiliani
2020-01-23 18:38:31 +0800
2dfb1952a pie: rearrange structure members and their initializations ... Browse Code »

Rearrange the members of the structure such that closely
referenced members appear together and/or fit in the same
cacheline. Also, change the order of their initializations to
match the order in which they appear in the structure.

Signed-off-by: Mohit P. Tahiliani
Signed-off-by: Leslie Monis
Signed-off-by: Gautam Ramakrishnan
Signed-off-by: David S. Miller

Mohit P. Tahiliani
2020-01-23 18:38:31 +0800
84bf557fb net: sched: pie: move common code to pie.h ... Browse Code »

This patch moves macros, structures and small functions common
to PIE and FQ-PIE (to be added in a future commit) from the file
net/sched/sch_pie.c to the header file include/net/pie.h.
All the moved functions are made inline.

Signed-off-by: Mohit P. Tahiliani
Signed-off-by: Leslie Monis
Signed-off-by: Gautam Ramakrishnan
Signed-off-by: David S. Miller

Mohit P. Tahiliani
2020-01-23 18:38:30 +0800

21 Nov, 2019

1 commit

cec2975f2 net: sched: pie: enable timestamp based delay calculation ... Browse Code »

RFC 8033 suggests an alternative approach to calculate the queue
delay in PIE by using a timestamp on every enqueued packet. This
patch adds an implementation of that approach and sets it as the
default method to calculate queue delay. The previous method (based
on Little's law) to calculate queue delay is set as optional.

Signed-off-by: Gautam Ramakrishnan
Signed-off-by: Leslie Monis
Signed-off-by: Mohit P. Tahiliani
Acked-by: Dave Taht
Signed-off-by: David S. Miller

Gautam Ramakrishnan
2019-11-21 04:31:45 +0800

19 Jun, 2019

1 commit

2504ba9f5 treewide: Replace GPLv2 boilerplate/reference with SPDX - rule 235 ... Browse Code »

Based on 1 normalized pattern(s):

this program is free software you can redistribute it and or modify
it under the terms of the gnu general public license as published by
the free software foundation either version 2 of the license this
program is distributed in the hope that it will be useful but
without any warranty without even the implied warranty of
merchantability or fitness for a particular purpose see the gnu
general public license for more details

extracted by the scancode license scanner the SPDX license identifier

GPL-2.0-only

has been chosen to replace the boilerplate/reference in 53 file(s).

Signed-off-by: Thomas Gleixner
Reviewed-by: Allison Randal
Reviewed-by: Alexios Zavras
Cc: linux-spdx@vger.kernel.org
Link: https://lkml.kernel.org/r/20190602204653.904365654@linutronix.de
Signed-off-by: Greg Kroah-Hartman

Thomas Gleixner
2019-06-19 23:09:07 +0800

28 Apr, 2019

2 commits

8cb081746 netlink: make validation more configurable for future strictness ... Browse Code »

We currently have two levels of strict validation:

1) liberal (default)
- undefined (type >= max) & NLA_UNSPEC attributes accepted
- attribute length >= expected accepted
- garbage at end of message accepted
2) strict (opt-in)
- NLA_UNSPEC attributes accepted
- attribute length >= expected accepted

Split out parsing strictness into four different options:
* TRAILING - check that there's no trailing data after parsing
attributes (in message or nested)
* MAXTYPE - reject attrs > max known type
* UNSPEC - reject attributes with NLA_UNSPEC policy entries
* STRICT_ATTRS - strictly validate attribute size

The default for future things should be *everything*.
The current *_strict() is a combination of TRAILING and MAXTYPE,
and is renamed to _deprecated_strict().
The current regular parsing has none of this, and is renamed to
*_parse_deprecated().

Additionally it allows us to selectively set one of the new flags
even on old policies. Notably, the UNSPEC flag could be useful in
this case, since it can be arranged (by filling in the policy) to
not be an incompatible userspace ABI change, but would then going
forward prevent forgetting attribute entries. Similar can apply
to the POLICY flag.

We end up with the following renames:
* nla_parse -> nla_parse_deprecated
* nla_parse_strict -> nla_parse_deprecated_strict
* nlmsg_parse -> nlmsg_parse_deprecated
* nlmsg_parse_strict -> nlmsg_parse_deprecated_strict
* nla_parse_nested -> nla_parse_nested_deprecated
* nla_validate_nested -> nla_validate_nested_deprecated

Using spatch, of course:
@@
expression TB, MAX, HEAD, LEN, POL, EXT;
@@
-nla_parse(TB, MAX, HEAD, LEN, POL, EXT)
+nla_parse_deprecated(TB, MAX, HEAD, LEN, POL, EXT)

@@
expression NLH, HDRLEN, TB, MAX, POL, EXT;
@@
-nlmsg_parse(NLH, HDRLEN, TB, MAX, POL, EXT)
+nlmsg_parse_deprecated(NLH, HDRLEN, TB, MAX, POL, EXT)

@@
expression NLH, HDRLEN, TB, MAX, POL, EXT;
@@
-nlmsg_parse_strict(NLH, HDRLEN, TB, MAX, POL, EXT)
+nlmsg_parse_deprecated_strict(NLH, HDRLEN, TB, MAX, POL, EXT)

@@
expression TB, MAX, NLA, POL, EXT;
@@
-nla_parse_nested(TB, MAX, NLA, POL, EXT)
+nla_parse_nested_deprecated(TB, MAX, NLA, POL, EXT)

@@
expression START, MAX, POL, EXT;
@@
-nla_validate_nested(START, MAX, POL, EXT)
+nla_validate_nested_deprecated(START, MAX, POL, EXT)

@@
expression NLH, HDRLEN, MAX, POL, EXT;
@@
-nlmsg_validate(NLH, HDRLEN, MAX, POL, EXT)
+nlmsg_validate_deprecated(NLH, HDRLEN, MAX, POL, EXT)

For this patch, don't actually add the strict, non-renamed versions
yet so that it breaks compile if I get it wrong.

Also, while at it, make nla_validate and nla_parse go down to a
common __nla_validate_parse() function to avoid code duplication.

Ultimately, this allows us to have very strict validation for every
new caller of nla_parse()/nlmsg_parse() etc as re-introduced in the
next patch, while existing things will continue to work as is.

In effect then, this adds fully strict validation for any new command.

Signed-off-by: Johannes Berg
Signed-off-by: David S. Miller

Johannes Berg
2019-04-28 05:07:21 +0800
ae0be8de9 netlink: make nla_nest_start() add NLA_F_NESTED flag ... Browse Code »

Even if the NLA_F_NESTED flag was introduced more than 11 years ago, most
netlink based interfaces (including recently added ones) are still not
setting it in kernel generated messages. Without the flag, message parsers
not aware of attribute semantics (e.g. wireshark dissector or libmnl's
mnl_nlmsg_fprintf()) cannot recognize nested attributes and won't display
the structure of their contents.

Unfortunately we cannot just add the flag everywhere as there may be
userspace applications which check nlattr::nla_type directly rather than
through a helper masking out the flags. Therefore the patch renames
nla_nest_start() to nla_nest_start_noflag() and introduces nla_nest_start()
as a wrapper adding NLA_F_NESTED. The calls which add NLA_F_NESTED manually
are rewritten to use nla_nest_start().

Except for changes in include/net/netlink.h, the patch was generated using
this semantic patch:

@@ expression E1, E2; @@
-nla_nest_start(E1, E2)
+nla_nest_start_noflag(E1, E2)

@@ expression E1, E2; @@
-nla_nest_start_noflag(E1, E2 | NLA_F_NESTED)
+nla_nest_start(E1, E2)

Signed-off-by: Michal Kubecek
Acked-by: Jiri Pirko
Acked-by: David Ahern
Signed-off-by: David S. Miller

Michal Kubecek
2019-04-28 05:03:44 +0800

01 Mar, 2019

1 commit

6c97da141 net: sched: pie: avoid slow division in drop probability decay ... Browse Code »

As per RFC 8033, it is sufficient for the drop probability
decay factor to have a value of (1 - 1/64) instead of 98%.
This avoids the need to do slow division.

Suggested-by: David Laight
Signed-off-by: Leslie Monis
Signed-off-by: David S. Miller

Leslie Monis
2019-03-01 02:35:41 +0800

27 Feb, 2019

2 commits

ff8285f81 net: sched: pie: fix 64-bit division ... Browse Code »

Use div_u64() to resolve build failures on 32-bit platforms.

Fixes: 3f7ae5f3dc52 ("net: sched: pie: add more cases to auto-tune alpha and beta")
Signed-off-by: Leslie Monis
Reported-by: Randy Dunlap
Tested-by: Randy Dunlap
Signed-off-by: David S. Miller

Leslie Monis
2019-02-27 10:55:38 +0800
24ed49002 net: sched: pie: fix mistake in reference link ... Browse Code »

Fix the incorrect reference link to RFC 8033

Signed-off-by: Leslie Monis
Signed-off-by: David S. Miller

Leslie Monis
2019-02-27 01:11:33 +0800

26 Feb, 2019

7 commits

c9d2ac5e6 net: sched: pie: update references ... Browse Code »

RFC 8033 replaces the IETF draft for PIE

Signed-off-by: Mohit P. Tahiliani
Signed-off-by: Dhaval Khandla
Signed-off-by: Hrishikesh Hiraskar
Signed-off-by: Manish Kumar B
Signed-off-by: Sachin D. Patil
Signed-off-by: Leslie Monis
Acked-by: Dave Taht
Acked-by: Jamal Hadi Salim
Signed-off-by: David S. Miller

Mohit P. Tahiliani
2019-02-26 06:21:03 +0800
95400b975 net: sched: pie: add derandomization mechanism ... Browse Code »

Random dropping of packets to achieve latency control may
introduce outlier situations where packets are dropped too
close to each other or too far from each other. This can
cause the real drop percentage to temporarily deviate from
the intended drop probability. In certain scenarios, such
as a small number of simultaneous TCP flows, these
deviations can cause significant deviations in link
utilization and queuing latency.

RFC 8033 suggests using a derandomization mechanism to avoid
these deviations.

Signed-off-by: Mohit P. Tahiliani
Signed-off-by: Dhaval Khandla
Signed-off-by: Hrishikesh Hiraskar
Signed-off-by: Manish Kumar B
Signed-off-by: Sachin D. Patil
Signed-off-by: Leslie Monis
Acked-by: Dave Taht
Acked-by: Jamal Hadi Salim
Signed-off-by: David S. Miller

Mohit P. Tahiliani
2019-02-26 06:21:03 +0800
3f7ae5f3d net: sched: pie: add more cases to auto-tune alpha and beta ... Browse Code »

The current implementation scales the local alpha and beta
variables in the calculate_probability function by the same
amount for all values of drop probability below 1%.

RFC 8033 suggests using additional cases for auto-tuning
alpha and beta when the drop probability is less than 1%.

In order to add more auto-tuning cases, MAX_PROB must be
scaled by u64 instead of u32 to prevent underflow when
scaling the local alpha and beta variables in the
calculate_probability function.

Signed-off-by: Mohit P. Tahiliani
Signed-off-by: Dhaval Khandla
Signed-off-by: Hrishikesh Hiraskar
Signed-off-by: Manish Kumar B
Signed-off-by: Sachin D. Patil
Signed-off-by: Leslie Monis
Acked-by: Dave Taht
Acked-by: Jamal Hadi Salim
Signed-off-by: David S. Miller

Mohit P. Tahiliani
2019-02-26 06:21:03 +0800
30a92ad70 net: sched: pie: change initial value of pie_vars->burst_time ... Browse Code »

RFC 8033 suggests an initial value of 150 milliseconds for
the maximum time allowed for a burst of packets.

Signed-off-by: Mohit P. Tahiliani
Signed-off-by: Dhaval Khandla
Signed-off-by: Hrishikesh Hiraskar
Signed-off-by: Manish Kumar B
Signed-off-by: Sachin D. Patil
Signed-off-by: Leslie Monis
Acked-by: Dave Taht
Acked-by: Jamal Hadi Salim
Signed-off-by: David S. Miller

Mohit P. Tahiliani
2019-02-26 06:21:03 +0800
29daa8553 net: sched: pie: change default value of pie_params->tupdate ... Browse Code »

RFC 8033 suggests a default value of 15 milliseconds for the
update interval.

Signed-off-by: Mohit P. Tahiliani
Signed-off-by: Dhaval Khandla
Signed-off-by: Hrishikesh Hiraskar
Signed-off-by: Manish Kumar B
Signed-off-by: Sachin D. Patil
Signed-off-by: Leslie Monis
Acked-by: Dave Taht
Acked-by: Jamal Hadi Salim
Signed-off-by: David S. Miller

Mohit P. Tahiliani
2019-02-26 06:21:03 +0800
abde7920d net: sched: pie: change default value of pie_params->target ... Browse Code »

RFC 8033 suggests a default value of 15 milliseconds for the
target queue delay.

Signed-off-by: Mohit P. Tahiliani
Signed-off-by: Dhaval Khandla
Signed-off-by: Hrishikesh Hiraskar
Signed-off-by: Manish Kumar B
Signed-off-by: Sachin D. Patil
Signed-off-by: Leslie Monis
Acked-by: Dave Taht
Acked-by: Jamal Hadi Salim
Signed-off-by: David S. Miller

Mohit P. Tahiliani
2019-02-26 06:21:03 +0800
575090036 net: sched: pie: change value of QUEUE_THRESHOLD ... Browse Code »

RFC 8033 recommends a value of 16384 bytes for the queue
threshold.

Signed-off-by: Mohit P. Tahiliani
Signed-off-by: Dhaval Khandla
Signed-off-by: Hrishikesh Hiraskar
Signed-off-by: Manish Kumar B
Signed-off-by: Sachin D. Patil
Signed-off-by: Leslie Monis
Acked-by: Dave Taht
Acked-by: Jamal Hadi Salim
Signed-off-by: David S. Miller

Mohit P. Tahiliani
2019-02-26 06:21:03 +0800

08 Oct, 2018

1 commit

ac4a02c5a net: sched: pie: fix coding style issues ... Browse Code »

Fix 5 warnings and 14 checks issued by checkpatch.pl:

CHECK: Logical continuations should be on the previous line
+ if ((q->vars.qdelay < q->params.target / 2)
+ && (q->vars.prob < MAX_PROB / 5))

WARNING: line over 80 characters
+ q->params.tupdate = usecs_to_jiffies(nla_get_u32(tb[TCA_PIE_TUPDATE]));

CHECK: Blank lines aren't necessary after an open brace '{'
+{
+

CHECK: braces {} should be used on all arms of this statement
+ if (qlen < QUEUE_THRESHOLD)
[...]
+ else {
[...]

CHECK: Unbalanced braces around else statement
+ else {

CHECK: No space is necessary after a cast
+ if (delta > (s32) (MAX_PROB / (100 / 2)) &&

CHECK: Unnecessary parentheses around 'qdelay == 0'
+ if ((qdelay == 0) && (qdelay_old == 0) && update_prob)

CHECK: Unnecessary parentheses around 'qdelay_old == 0'
+ if ((qdelay == 0) && (qdelay_old == 0) && update_prob)

CHECK: Unnecessary parentheses around 'q->vars.prob == 0'
+ if ((q->vars.qdelay < q->params.target / 2) &&
+ (q->vars.qdelay_old < q->params.target / 2) &&
+ (q->vars.prob == 0) &&
+ (q->vars.avg_dq_rate > 0))

CHECK: Unnecessary parentheses around 'q->vars.avg_dq_rate > 0'
+ if ((q->vars.qdelay < q->params.target / 2) &&
+ (q->vars.qdelay_old < q->params.target / 2) &&
+ (q->vars.prob == 0) &&
+ (q->vars.avg_dq_rate > 0))

CHECK: Blank lines aren't necessary before a close brace '}'
+
+}

CHECK: Comparison to NULL could be written "!opts"
+ if (opts == NULL)

CHECK: No space is necessary after a cast
+ ((u32) PSCHED_TICKS2NS(q->params.target)) /

WARNING: line over 80 characters
+ nla_put_u32(skb, TCA_PIE_TUPDATE, jiffies_to_usecs(q->params.tupdate)) ||

CHECK: Blank lines aren't necessary before a close brace '}'
+
+}

CHECK: No space is necessary after a cast
+ .delay = ((u32) PSCHED_TICKS2NS(q->vars.qdelay)) /

WARNING: Missing a blank line after declarations
+ struct sk_buff *skb;
+ skb = qdisc_dequeue_head(sch);

WARNING: Missing a blank line after declarations
+ struct pie_sched_data *q = qdisc_priv(sch);
+ qdisc_reset_queue(sch);

WARNING: Missing a blank line after declarations
+ struct pie_sched_data *q = qdisc_priv(sch);
+ q->params.tupdate = 0;

Signed-off-by: Leslie Monis
Signed-off-by: David S. Miller

Leslie Monis
2018-10-08 11:39:01 +0800

22 Dec, 2017

2 commits

2030721cc net: sched: sch: add extack for change qdisc ops ... Browse Code »

This patch adds extack support for change callback for qdisc ops
structtur to prepare per-qdisc specific changes for extack.

Cc: David Ahern
Acked-by: Jamal Hadi Salim
Signed-off-by: Alexander Aring
Signed-off-by: David S. Miller

Alexander Aring
2017-12-22 01:32:50 +0800
e63d7dfd2 net: sched: sch: add extack for init callback ... Browse Code »

This patch adds extack support for init callback to prepare per-qdisc
specific changes for extack.

Cc: David Ahern
Acked-by: Jamal Hadi Salim
Signed-off-by: Alexander Aring
Signed-off-by: David S. Miller

Alexander Aring
2017-12-22 01:32:50 +0800

18 Oct, 2017

1 commit

cdeabbb88 net: sched: Convert timers to use timer_setup() ... Browse Code »

In preparation for unconditionally passing the struct timer_list pointer to
all timer callbacks, switch to using the new timer_setup() and from_timer()
to pass the timer pointer explicitly. Add pointer back to Qdisc.

Cc: Jamal Hadi Salim
Cc: Cong Wang
Cc: Jiri Pirko
Cc: "David S. Miller"
Cc: netdev@vger.kernel.org
Signed-off-by: Kees Cook
Signed-off-by: David S. Miller

Kees Cook
2017-10-18 19:39:54 +0800

14 Apr, 2017

1 commit

fceb6435e netlink: pass extended ACK struct to parsing functions ... Browse Code »

Pass the new extended ACK reporting struct to all of the generic
netlink parsing functions. For now, pass NULL in almost all callers
(except for some in the core.)

Signed-off-by: Johannes Berg
Signed-off-by: David S. Miller

Johannes Berg
2017-04-14 01:58:22 +0800

19 Sep, 2016

2 commits

ed760cb8a sched: replace __skb_dequeue with __qdisc_dequeue_head ... Browse Code »

After previous patch these functions are identical.
Replace __skb_dequeue in qdiscs with __qdisc_dequeue_head.

Next patch will then make __qdisc_dequeue_head handle
single-linked list instead of strcut sk_buff_head argument.

Doesn't change generated code.

Signed-off-by: Florian Westphal
Signed-off-by: David S. Miller

Florian Westphal
2016-09-19 13:47:18 +0800
1486587b2 pie: use qdisc_dequeue_head wrapper ... Browse Code »

Doesn't change generated code.

Signed-off-by: Florian Westphal
Signed-off-by: David S. Miller

Florian Westphal
2016-09-19 13:47:18 +0800

26 Jun, 2016

1 commit

520ac30f4 net_sched: drop packets after root qdisc lock is released ... Browse Code »

Qdisc performance suffers when packets are dropped at enqueue()
time because drops (kfree_skb()) are done while qdisc lock is held,
delaying a dequeue() draining the queue.

Nominal throughput can be reduced by 50 % when this happens,
at a time we would like the dequeue() to proceed as fast as possible.

Even FQ is vulnerable to this problem, while one of FQ goals was
to provide some flow isolation.

This patch adds a 'struct sk_buff **to_free' parameter to all
qdisc->enqueue(), and in qdisc_drop() helper.

I measured a performance increase of up to 12 %, but this patch
is a prereq so that future batches in enqueue() can fly.

Signed-off-by: Eric Dumazet
Acked-by: Jesper Dangaard Brouer
Signed-off-by: David S. Miller

Eric Dumazet
2016-06-26 00:19:35 +0800

16 Jun, 2016

1 commit

db4879d93 net_sched: sch_pie: defer skb freeing ... Browse Code »

pie_change() can use rtnl_qdisc_drop() to benefit from
deferred freeing.

pie_reset() is already using qdisc_reset_queue()

Signed-off-by: Eric Dumazet
Signed-off-by: David S. Miller

Eric Dumazet
2016-06-16 05:08:36 +0800

01 Mar, 2016

1 commit

2ccccf5fb net_sched: update hierarchical backlog too ... Browse Code »

When the bottom qdisc decides to, for example, drop some packet,
it calls qdisc_tree_decrease_qlen() to update the queue length
for all its ancestors, we need to update the backlog too to
keep the stats on root qdisc accurate.

Cc: Jamal Hadi Salim
Acked-by: Jamal Hadi Salim
Signed-off-by: Cong Wang
Signed-off-by: David S. Miller

WANG Cong
2016-03-01 06:02:33 +0800

30 Oct, 2014

1 commit

d56109020 sch_pie: schedule the timer after all init succeed ... Browse Code »

Cc: Vijay Subramanian
Cc: David S. Miller
Signed-off-by: Cong Wang
Acked-by: Eric Dumazet

WANG Cong
2014-10-30 02:28:01 +0800

30 Sep, 2014

1 commit

25331d6ce net: sched: implement qstat helper routines ... Browse Code »

This adds helpers to manipulate qstats logic and replaces locations
that touch the counters directly. This simplifies future patches
to push qstats onto per cpu counters.

Signed-off-by: John Fastabend
Signed-off-by: David S. Miller

John Fastabend
2014-09-30 13:02:26 +0800

14 Feb, 2014

1 commit

219e288e8 net: sched: Cleanup PIE comments ... Browse Code »

Fix incorrect comment reported by Norbert Kiesel. Edit another comment to add
more details. Also add references to algorithm (IETF draft and paper) to top of
file.

Signed-off-by: Vijay Subramanian
CC: Mythili Prabhu
CC: Norbert Kiesel
Signed-off-by: David S. Miller

Vijay Subramanian
2014-02-14 07:29:58 +0800

15 Jan, 2014

1 commit

63862b5be net: replace macros net_random and net_srandom with direct calls to prandom ... Browse Code »

This patch removes the net_random and net_srandom macros and replaces
them with direct calls to the prandom ones. As new commits only seem to
use prandom_u32 there is no use to keep them around.
This change makes it easier to grep for users of prandom_u32.

Signed-off-by: Aruna-Hewapathirane
Suggested-by: Hannes Frederic Sowa
Acked-by: Hannes Frederic Sowa
Signed-off-by: David S. Miller

Aruna-Hewapathirane
2014-01-15 07:15:25 +0800

07 Jan, 2014

1 commit

d4b36210c net: pkt_sched: PIE AQM scheme ... Browse Code »

Proportional Integral controller Enhanced (PIE) is a scheduler to address the
bufferbloat problem.

>From the IETF draft below:
" Bufferbloat is a phenomenon where excess buffers in the network cause high
latency and jitter. As more and more interactive applications (e.g. voice over
IP, real time video streaming and financial transactions) run in the Internet,
high latency and jitter degrade application performance. There is a pressing
need to design intelligent queue management schemes that can control latency and
jitter; and hence provide desirable quality of service to users.

We present here a lightweight design, PIE(Proportional Integral controller
Enhanced) that can effectively control the average queueing latency to a target
value. Simulation results, theoretical analysis and Linux testbed results have
shown that PIE can ensure low latency and achieve high link utilization under
various congestion situations. The design does not require per-packet
timestamp, so it incurs very small overhead and is simple enough to implement
in both hardware and software. "

Many thanks to Dave Taht for extensive feedback, reviews, testing and
suggestions. Thanks also to Stephen Hemminger and Eric Dumazet for reviews and
suggestions. Naeem Khademi and Dave Taht independently contributed to ECN
support.

For more information, please see technical paper about PIE in the IEEE
Conference on High Performance Switching and Routing 2013. A copy of the paper
can be found at ftp://ftpeng.cisco.com/pie/.

Please also refer to the IETF draft submission at
http://tools.ietf.org/html/draft-pan-tsvwg-pie-00

All relevant code, documents and test scripts and results can be found at
ftp://ftpeng.cisco.com/pie/.

For problems with the iproute2/tc or Linux kernel code, please contact Vijay
Subramanian (vijaynsu@cisco.com or subramanian.vijay@gmail.com) Mythili Prabhu
(mysuryan@cisco.com)

Signed-off-by: Vijay Subramanian
Signed-off-by: Mythili Prabhu
CC: Dave Taht
Signed-off-by: David S. Miller

Vijay Subramanian
2014-01-07 04:13:01 +0800