20 Sep, 2007

2 commits

  • When using rt_mutex, a NULL pointer dereference occurs in
    enqueue_task_rt(). Here is the scenario:
    1) There are two threads: thread A is in fair_sched_class and
    thread B is in rt_sched_class.
    2) Thread A is boosted up to rt_sched_class, because thread A
    holds an rt_mutex lock and thread B is waiting for that lock.
    3) At this point, when thread A creates a new thread C, thread
    C gets rt_sched_class.
    4) When wake_up_new_task() is done for thread C, the priority
    of thread C is outside the RT priority range, because the
    normal priority of thread A is not an RT priority. This
    corrupts data by overflowing the rt_prio_array.
    The new thread C should be in fair_sched_class.

    The new thread must have a valid scheduler class before it is
    queued. This patch sets the appropriate scheduler class, as shown
    in the sketch below.

    Signed-off-by: Hiroshi Shimamoto
    Signed-off-by: Ingo Molnar
    Signed-off-by: Peter Zijlstra

    Hiroshi Shimamoto
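
    A minimal sketch of the fix described above, assuming it lands in the
    fork path (sched_fork()); normal_prio, rt_prio() and fair_sched_class
    are names from the scheduler code of that era, and the exact placement
    is an assumption:

        /*
         * Do not leak the parent's PI-boosted priority into the child:
         * base the child's class on the parent's normal (un-boosted)
         * priority, so a boosted fair_sched_class parent still forks a
         * fair_sched_class child.
         */
        p->prio = current->normal_prio;
        if (!rt_prio(p->prio))
                p->sched_class = &fair_sched_class;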
     
  • add /proc/sys/kernel/sched_compat_yield to make sys_sched_yield()
    more aggressive, by moving the yielding task to the last position
    in the rbtree (a sketch follows this entry).

    with sched_compat_yield=0:

    PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
    2539 mingo 20 0 1576 252 204 R 50 0.0 0:02.03 loop_yield
    2541 mingo 20 0 1576 244 196 R 50 0.0 0:02.05 loop

    with sched_compat_yield=1:

    PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
    2584 mingo 20 0 1576 248 196 R 99 0.0 0:52.45 loop
    2582 mingo 20 0 1576 256 204 R 0 0.0 0:00.00 loop_yield

    Signed-off-by: Ingo Molnar
    Signed-off-by: Peter Zijlstra

    Ingo Molnar
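
    A rough sketch of the compat-yield behaviour described above (not the
    actual patch; requeue_behind_rightmost() is a placeholder helper):

        static void yield_task_fair(struct rq *rq, struct task_struct *p)
        {
                struct cfs_rq *cfs_rq = task_cfs_rq(p);

                if (!sysctl_sched_compat_yield) {
                        /* default behaviour: a plain requeue */
                        dequeue_entity(cfs_rq, &p->se, 0);
                        enqueue_entity(cfs_rq, &p->se, 0);
                        return;
                }
                /*
                 * Aggressive (compat) behaviour: queue the yielding task
                 * behind the rightmost entry of the rbtree, so every
                 * other runnable task gets to run before it does.
                 */
                requeue_behind_rightmost(cfs_rq, &p->se);
        }

    The tunable can be flipped at run time with
    echo 1 > /proc/sys/kernel/sched_compat_yield.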
     

05 Sep, 2007

3 commits

  • rename RSR to SRR - 'RSR' is already defined on xtensa.

    found by Adrian Bunk.

    Signed-off-by: Ingo Molnar

    Ingo Molnar
     
  • the cfs_rq->wait_runtime debug/statistics counter was not maintained
    properly - fix this.

    this also removes some code:

    text data bss dec hex filename
    13420 228 1204 14852 3a04 sched.o.before
    13404 228 1204 14836 39f4 sched.o.after

    Signed-off-by: Ingo Molnar
    Signed-off-by: Peter Zijlstra

    Ingo Molnar
     
  • First fix the check
    if (*imbalance + SCHED_LOAD_SCALE_FUZZ < busiest_load_per_task)
    with this
    if (*imbalance < busiest_load_per_task)

    The current check is always false for nice-0 tasks (since
    SCHED_LOAD_SCALE_FUZZ is the same as busiest_load_per_task for
    nice-0 tasks).

    With the above change, imbalance was getting reset to 0 in that
    corner case, making the FUZZ logic fail. Fix it by not corrupting
    the imbalance, and change the imbalance only when the HT/MC
    optimization is found to be needed (see the sketch below).

    Signed-off-by: Suresh Siddha
    Signed-off-by: Ingo Molnar

    Suresh Siddha
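
    A sketch of the intended control flow (illustrative only; pwr_now and
    pwr_move stand in for the power-comparison values computed earlier in
    find_busiest_group()):

        if (*imbalance < busiest_load_per_task) {
                /*
                 * Small-imbalance corner case: only bump the imbalance
                 * when the HT/MC power comparison says that moving one
                 * whole task is a win ...
                 */
                if (pwr_move > pwr_now)
                        *imbalance = busiest_load_per_task;
                /*
                 * ... and otherwise leave *imbalance untouched instead
                 * of resetting it to 0.
                 */
        }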
     

28 Aug, 2007

1 commit

  • de-HZ-ification of the granularity defaults unearthed a pre-existing
    property of CFS: while it correctly converges to the granularity goal,
    it does not prevent run-time fluctuations in the range of
    [-gran ... 0 ... +gran].

    With the increase of the granularity due to the removal of HZ
    dependencies, this becomes visible in chew-max output (with 5 tasks
    running):

    out: 28 . 27. 32 | flu: 0 . 0 | ran: 9 . 13 | per: 37 . 40
    out: 27 . 27. 32 | flu: 0 . 0 | ran: 17 . 13 | per: 44 . 40
    out: 27 . 27. 32 | flu: 0 . 0 | ran: 9 . 13 | per: 36 . 40
    out: 29 . 27. 32 | flu: 2 . 0 | ran: 17 . 13 | per: 46 . 40
    out: 28 . 27. 32 | flu: 0 . 0 | ran: 9 . 13 | per: 37 . 40
    out: 29 . 27. 32 | flu: 0 . 0 | ran: 18 . 13 | per: 47 . 40
    out: 28 . 27. 32 | flu: 0 . 0 | ran: 9 . 13 | per: 37 . 40

    average slice is the ideal 13 msecs and the period is picture-perfect 40
    msecs. But the 'ran' field fluctuates around 13.33 msecs and there's no
    mechanism in CFS to keep that from happening: it's a perfectly valid
    solution that CFS finds.

    to fix this we add a granularity/preemption rule that knows about
    the "target latency", which makes tasks that run longer than the ideal
    latency run a bit less. The simplest approach is to simply decrease the
    preemption granularity when a task overruns its ideal latency. For this
    we have to track how much the task has executed since its last
    preemption (a sketch follows this entry).

    ( this adds a new field to task_struct, but we can eliminate that
    overhead in 2.6.24 by putting all the scheduler timestamps into an
    anonymous union. )

    with this change in place, chew-max output is fluctuation-less all
    around:

    out: 28 . 27. 39 | flu: 0 . 2 | ran: 13 . 13 | per: 41 . 40
    out: 28 . 27. 39 | flu: 0 . 2 | ran: 13 . 13 | per: 41 . 40
    out: 28 . 27. 39 | flu: 0 . 2 | ran: 13 . 13 | per: 41 . 40
    out: 28 . 27. 39 | flu: 0 . 2 | ran: 13 . 13 | per: 41 . 40
    out: 28 . 27. 39 | flu: 0 . 1 | ran: 13 . 13 | per: 41 . 40
    out: 28 . 27. 39 | flu: 0 . 1 | ran: 13 . 13 | per: 41 . 40

    this patch has no impact on any fastpath or on any globally observable
    scheduling property. (unless you have sharp enough eyes to see
    millisecond-level ruckles in glxgears smoothness :-)

    Signed-off-by: Ingo Molnar
    Signed-off-by: Peter Zijlstra
    Signed-off-by: Mike Galbraith

    Ingo Molnar
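
    A condensed sketch of the overrun rule described in this entry (field
    and variable names are illustrative, not the exact patch):

        /*
         * How long has the current task been running since it was last
         * preempted? (tracked via the new per-task timestamp)
         */
        delta_exec = curr->sum_exec_runtime - curr->prev_sum_exec_runtime;

        /*
         * If the task has already overrun its ideal slice, shrink the
         * effective preemption granularity so it is preempted earlier
         * and the [-gran ... +gran] fluctuation cancels out over time.
         */
        gran = sysctl_sched_granularity;
        if (delta_exec > ideal_slice)
                gran = 0;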
     

26 Aug, 2007

3 commits

  • runtime limit and wakeup granularity used to be a function of the
    granularity; that was incorrectly changed to sched_latency.

    Fix this by making the wakeup granularity a function of the
    min-granularity, and the runtime limit equal to the latency.

    Signed-off-by: Ingo Molnar

    Ingo Molnar
     
  • due to adaptive granularity scheduling the role of sched_granularity
    has changed to "minimum granularity", so rename the variable (and the
    tunable) accordingly.

    Signed-off-by: Ingo Molnar
    Signed-off-by: Peter Zijlstra

    Ingo Molnar
     
  • Instead of specifying the preemption granularity, specify the wanted
    latency. By fixing the granularity to a constant, the wakeup latency
    becomes a function of the number of running tasks on the rq.

    Invert this relation.

    sysctl_sched_granularity becomes a minimum for the dynamic granularity
    computed from the new sysctl_sched_latency.

    Then use this latency to make more intelligent granularity decisions:
    if fewer tasks are running, we can schedule more coarsely. This helps
    performance while still always meeting the latency target (see the
    sketch below).

    Signed-off-by: Peter Zijlstra
    Signed-off-by: Ingo Molnar

    Peter Zijlstra
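
    A sketch of the inverted relation (simplified; the real code may shape
    the curve differently):

        static u64 sched_granularity(struct cfs_rq *cfs_rq)
        {
                u64 gran = sysctl_sched_latency;

                /* spread the latency target over the runnable tasks ... */
                if (cfs_rq->nr_running > 1)
                        gran /= cfs_rq->nr_running;

                /* ... but never go below the configured minimum */
                if (gran < sysctl_sched_granularity)
                        gran = sysctl_sched_granularity;

                return gran;
        }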
     

25 Aug, 2007

2 commits

  • Remove trivial conditional branch in Linux scheduler's
    can_migrate_task() function.

    text data bss dec hex filename
    34770 2998 24 37792 93a0 sched.o.before
    34757 2998 24 37779 9393 sched.o.after

    Signed-off-by: Sven-Thorsten Dietrich
    Signed-off-by: Ingo Molnar

    Sven-Thorsten Dietrich
     
  • remove HZ dependency from the granularity default. Use 10 msec for
    the base granularity, 1 msec for wakeup granularity and 25 msec for
    batch wakeup granularity. (These defaults are close to the values
    that the default HZ=250 setting produced previously, and HZ=250 is
    the most common setting.)

    Signed-off-by: Ingo Molnar

    Ingo Molnar
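
    Expressed directly in nanoseconds, the new HZ-independent defaults
    look roughly like this (variable names and units are illustrative):

        unsigned int sysctl_sched_granularity              = 10000000;  /* 10 msec */
        unsigned int sysctl_sched_wakeup_granularity       =  1000000;  /*  1 msec */
        unsigned int sysctl_sched_batch_wakeup_granularity = 25000000;  /* 25 msec */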
     

23 Aug, 2007

5 commits

  • Michael Gerdau reported reniced task CPU usage weirdnesses.
    Such symptoms can be caused by limit underruns, so double the
    sched_runtime_limit.

    Signed-off-by: Ingo Molnar

    Ingo Molnar
     
  • Was playing with sched_smt_power_savings/sched_mc_power_savings and
    found out that while the scheduler domains are reconstructed when
    sysfs settings change, rebalance_domains() can get triggered with a
    null domain on other cpus, which sets next_balance to jiffies +
    60*HZ. This results in no idle/busy balancing for 60 seconds.

    Fix this, as sketched below.

    Signed-off-by: Suresh Siddha
    Signed-off-by: Ingo Molnar

    Suresh Siddha
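
    A sketch of the guard (close to the idea described above, though the
    exact placement in rebalance_domains() is an assumption):

        struct sched_domain *sd;
        int update_next_balance = 0;
        unsigned long next_balance = jiffies + 60*HZ;

        for_each_domain(cpu, sd) {
                /* regular balancing work goes here; it computes a
                   nearer balancing interval */
                update_next_balance = 1;
        }

        /*
         * Only push rq->next_balance out when a domain was actually
         * walked; with a NULL domain (domains being rebuilt via sysfs)
         * keep the old, near-term value so balancing resumes promptly.
         */
        if (likely(update_next_balance))
                rq->next_balance = next_balance;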
     
  • On a four package system with HT - HT load balancing optimizations were
    broken. For example, if two tasks end up running on two logical threads
    of one of the packages, scheduler is not able to pull one of the tasks
    to a completely idle package.

    In this scenario, for nice-0 tasks, the imbalance calculated by the
    scheduler will be 512 and find_busiest_queue() will return 0 (as each
    cpu's load is 1024 > imbalance and each cpu has only one task
    running); the numbers are worked out in the sketch below.

    Similarly, the MC scheduler optimizations are also fixed by this
    patch.

    [ mingo@elte.hu: restored fair balancing by increasing the fuzz and
    adding it back to the power decision, without the /2
    factor. ]

    Signed-off-by: Suresh Siddha
    Signed-off-by: Ingo Molnar

    Suresh Siddha
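
    The numbers from the HT scenario, against the single-task skip test in
    find_busiest_queue() (sketch; treat the exact condition as an
    approximation):

        /*
         * Two nice-0 tasks on two siblings of one package, another
         * package idle: SCHED_LOAD_SCALE = 1024, imbalance = 512.
         * A runqueue holding a single task whose weighted load exceeds
         * the requested imbalance is skipped:
         */
        if (rq->nr_running == 1 && wl > imbalance)
                continue;       /* 1024 > 512 -> nothing ever gets moved */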
     
  • There are two remaining gotchas (a sketch of the corrected entry
    follows below):

    - The directories have impossible permissions (they are writable).

    - The ctl_name for the kernel directory is inconsistent with
    everything else. It should be CTL_KERN.

    Signed-off-by: Eric W. Biederman
    Signed-off-by: Ingo Molnar

    Eric W. Biederman
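
    For illustration, a corrected ctl_table entry for the kernel directory
    would look roughly like this (sketch, not the actual tree):

        {
                .ctl_name = CTL_KERN,   /* consistent with everything else */
                .procname = "kernel",
                .mode     = 0555,       /* directories must not be writable */
                .child    = kern_table,
        },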
     
  • construct a more or less wall-clock time out of sched_clock(), by
    using ACPI-idle's existing knowledge about how much time we spent
    idling. This allows the rq clock to work around TSC-stops-in-C2,
    TSC-gets-corrupted-in-C3 type of problems.

    ( Besides the scheduler's statistics, this also benefits blktrace
    and printk timestamps. )

    Furthermore, the precise before-C2/C3-sleep and after-C2/C3-wakeup
    callbacks allow the scheduler to get the most out of the period
    during which the CPU has a reliable TSC. This results in slightly
    more precise task statistics.

    the ACPI bits were acked by Len.

    Signed-off-by: Ingo Molnar
    Acked-by: Len Brown

    Ingo Molnar
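
    A sketch of how an idle routine brackets a deep C-state with such
    callbacks (function names and signatures are assumptions based on the
    description above; enter_c2_or_c3() is a placeholder):

        ktime_t t1, t2;

        t1 = ktime_get();
        sched_clock_idle_sleep_event();     /* TSC about to stop or drift */
        enter_c2_or_c3();
        t2 = ktime_get();
        sched_clock_idle_wakeup_event(ktime_to_ns(ktime_sub(t2, t1)));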
     

13 Aug, 2007

2 commits


11 Aug, 2007

1 commit

  • improve the rq-clock overflow logic: limit the absolute rq->clock
    delta since the last scheduler tick, instead of limiting the delta
    itself.

    tested by Arjan van de Ven: his whole laptop was misbehaving due to
    an incorrectly calibrated cpu_khz confusing sched_clock().

    Signed-off-by: Ingo Molnar
    Signed-off-by: Arjan van de Ven

    Ingo Molnar
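
    A sketch of the clamp described above (illustrative): instead of
    limiting each individual delta, never let rq->clock run more than
    one tick ahead of the last scheduler tick:

        if (unlikely(clock + delta > rq->tick_timestamp + TICK_NSEC))
                clock = rq->tick_timestamp + TICK_NSEC;
        else
                clock += delta;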
     

09 Aug, 2007

21 commits