02 Aug, 2009

2 commits

  • We need to add the new prio to the cpupri accounting before
    removing the old prio. This is because removing the old prio
    first will open a race window where the cpu will be removed
    from pri_active. In this case the cpu will not be visible for
    RT push and pulls. This could cause an RT task to fail to
    migrate appropriately, creating a very large latency.

    This bug was found with the use of ftrace sched events and
    trace_printk.

    Signed-off-by: Steven Rostedt
    Signed-off-by: Peter Zijlstra
    LKML-Reference:
    Signed-off-by: Ingo Molnar

    Steven Rostedt
     
  • Background:

    Several race conditions in the scheduler have cropped up
    recently, which Steven and I have tracked down using ftrace.
    The most recent one turns out to be a race in how the scheduler
    determines a suitable migration target for RT tasks, introduced
    recently with commit:

    commit 68e74568fbe5854952355e942acca51f138096d9
    Date: Tue Nov 25 02:35:13 2008 +1030

    sched: convert struct cpupri_vec to cpumask_var_t.

    The original design of cpupri allowed lockless readers to
    quickly determine a best-estimate target. Races between the
    pri_active bitmap and the vec->mask were handled in the
    original code because we would detect and return "0" when this
    occurred. The design was predicated on the *effective*
    atomicity (*) of caching the result of cpus_and() between the
    cpus_allowed and the vec->mask.

    Commit 68e74568 changed the behavior such that vec->mask is
    accessed multiple times. This introduces a subtle race whose
    result is that cpupri_find() can return "1" yet hand back an
    empty bitmap.

    *) yes, we know cpus_and() is not a locked operator across the
    entire composite array, but it is implicitly atomic on a
    per-word basis which is all the design required to work.

    Implementation:

    Rather than forgoing the lockless design, or reverting to a
    stack-based cpumask_t, we simply check for when the race has
    been encountered and continue processing in the event that the
    race is hit. This renders the removal race as if the priority
    bit had been atomically cleared as well, and allows the
    algorithm to execute correctly.

    Signed-off-by: Gregory Haskins
    CC: Rusty Russell
    CC: Steven Rostedt
    Signed-off-by: Peter Zijlstra
    LKML-Reference:
    Signed-off-by: Ingo Molnar

    Gregory Haskins
     

17 Jun, 2009

1 commit

  • Those two functions no longer call alloc_bootmem_cpumask_var(),
    so no need to tag them with __init_refok.

    Signed-off-by: Li Zefan
    Acked-by: Pekka Enberg
    LKML-Reference:
    Signed-off-by: Ingo Molnar

    Li Zefan
     

12 Jun, 2009

1 commit


09 Jun, 2009

1 commit


01 Apr, 2009

1 commit


06 Jan, 2009

1 commit


25 Nov, 2008

1 commit

  • Impact: stack usage reduction, (future) size reduction for large NR_CPUS.

    Dynamically allocating cpumasks (when CONFIG_CPUMASK_OFFSTACK) saves
    space for small nr_cpu_ids but big CONFIG_NR_CPUS.

    The fact that cpupri_init is called both before and after the
    slab is available unfortunately makes for an ugly parameter.

    We also use cpumask_any_and to get rid of a temporary in cpupri_find.

    Signed-off-by: Rusty Russell
    Signed-off-by: Ingo Molnar

    Rusty Russell
     
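    The on-stack/off-stack distinction can be sketched in user-space
    C. This is a simplified model of the kernel pattern, not its
    implementation: CPUMASK_WORDS and the calloc-based allocator here
    are illustrative stand-ins.

    ```c
    #include <assert.h>
    #include <stdbool.h>
    #include <stdlib.h>
    #include <string.h>

    #define NR_CPUS 4096    /* big compile-time limit */
    #define CPUMASK_WORDS (NR_CPUS / (8 * (int)sizeof(unsigned long)))
    #define CONFIG_CPUMASK_OFFSTACK 1

    #if CONFIG_CPUMASK_OFFSTACK
    /* Off-stack: the variable is just a pointer and the mask lives on
     * the heap, so a struct embedding several masks stays small even
     * when NR_CPUS is huge but nr_cpu_ids is small. */
    typedef unsigned long *cpumask_var_t;

    static bool alloc_cpumask_var(cpumask_var_t *mask)
    {
        *mask = calloc(CPUMASK_WORDS, sizeof(unsigned long));
        return *mask != NULL;
    }

    static void free_cpumask_var(cpumask_var_t mask)
    {
        free(mask);
    }
    #else
    /* On-stack: a fixed array; "alloc" only zeroes it and cannot
     * fail. Simple, but stack-hungry for large NR_CPUS. */
    typedef unsigned long cpumask_var_t[CPUMASK_WORDS];

    static bool alloc_cpumask_var(cpumask_var_t *mask)
    {
        memset(*mask, 0, sizeof(*mask));
        return true;
    }

    static void free_cpumask_var(cpumask_var_t mask) { (void)mask; }
    #endif
    ```

    Because the off-stack variant needs a real allocator, code that
    runs before the slab is up must fall back to a bootmem-based
    allocation, which is where cpupri_init's extra parameter comes
    from.
    
    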

06 Jun, 2008

1 commit

  • The current code uses a linear algorithm which causes scaling issues
    on larger SMP machines. This patch replaces that algorithm with a
    2-dimensional bitmap to reduce latencies in the wake-up path.

    Signed-off-by: Gregory Haskins
    Acked-by: Steven Rostedt
    Signed-off-by: Ingo Molnar
    Signed-off-by: Thomas Gleixner

    Gregory Haskins
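    The payoff of the 2-dimensional bitmap is that one axis indexes
    priority levels and the other indexes CPUs, so a lookup is a
    find-first-bit over the priority bitmap plus one mask AND, rather
    than a scan over every CPU. A minimal sketch, assuming a small
    illustrative NR_PRIO and a hypothetical lowest_pri_target()
    helper (the kernel uses ~102 levels and real cpumask operations):

    ```c
    #include <assert.h>

    #define NR_PRIO 8   /* illustrative; fits one unsigned long here */

    /* Lowest set bit index, or -1 if the word is empty. */
    static int find_first_bit_sketch(unsigned long w)
    {
        return w ? __builtin_ctzl(w) : -1;
    }

    /* One 2-D lookup: walk active priority levels from lowest up and
     * return a CPU whose per-level mask intersects the task's
     * affinity. Cost is O(NR_PRIO) bit operations, independent of the
     * number of CPUs. */
    static int lowest_pri_target(const unsigned long *vec_mask, /* [NR_PRIO] */
                                 unsigned long pri_active,
                                 unsigned long allowed)
    {
        while (pri_active) {
            int pri = find_first_bit_sketch(pri_active);
            unsigned long hit = vec_mask[pri] & allowed;
            if (hit)
                return find_first_bit_sketch(hit);  /* a suitable CPU */
            pri_active &= pri_active - 1;   /* clear level, try next */
        }
        return -1;
    }
    ```

    With the old linear approach the same question required inspecting
    every runqueue, which is what hurt wake-up latency on large SMP
    machines.
    
    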