24 Jun, 2009

1 commit

  • This patch makes most !CONFIG_HAVE_SETUP_PER_CPU_AREA archs use the
    dynamic percpu allocator. The first chunk is allocated using the
    embedding helper and 8k is reserved for modules. This ensures that
    the new allocator behaves almost identically to the original allocator
    as far as static percpu variables are concerned, so it shouldn't
    introduce much breakage.

    s390 and alpha use a custom SHIFT_PERCPU_PTR() to work around the
    addressing range limit their addressing models impose. Unfortunately,
    this breaks if the address is specified using a variable, so for now
    these two archs aren't converted.
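
    As a rough illustration (a stand-alone user-space sketch, not kernel
    code; every name and size below is made up), resolving a percpu
    pointer boils down to "base address plus this cpu's offset", which is
    what the generic SHIFT_PERCPU_PTR() effectively does:

    #include <stdio.h>

    #define NR_CPUS   4
    #define UNIT_SIZE 4096                  /* size of one cpu's unit (illustrative) */

    static char percpu_area[NR_CPUS * UNIT_SIZE];   /* the "first chunk" */
    static long per_cpu_offset[NR_CPUS];            /* offset of each cpu's unit */

    /* analogous to SHIFT_PERCPU_PTR(): shift a base pointer by a cpu's offset */
    static void *shift_percpu_ptr(void *base, long off)
    {
        return (char *)base + off;
    }

    int main(void)
    {
        int *counter = (int *)percpu_area;          /* cpu 0's copy of a counter */
        int cpu;

        for (cpu = 0; cpu < NR_CPUS; cpu++)
            per_cpu_offset[cpu] = (long)cpu * UNIT_SIZE;

        for (cpu = 0; cpu < NR_CPUS; cpu++) {
            int *p = shift_percpu_ptr(counter, per_cpu_offset[cpu]);
            *p += 1;                                /* each cpu touches only its own copy */
            printf("cpu%d: %d\n", cpu, *p);
        }
        return 0;
    }

    The custom s390/alpha variants perform this same shift within their
    limited addressing range, which is what breaks once the address comes
    from a variable.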

    The following architectures are affected by this change.

    * sh
    * arm
    * cris
    * mips
    * sparc(32)
    * blackfin
    * avr32
    * parisc (broken, under investigation)
    * m32r
    * powerpc(32)

    As this change makes the dynamic allocator the default one,
    CONFIG_HAVE_DYNAMIC_PER_CPU_AREA is replaced with its inverse,
    CONFIG_HAVE_LEGACY_PER_CPU_AREA, which is added to the yet-to-be-converted
    archs. These archs implement their own setup_per_cpu_areas() and their
    conversion is not trivial.

    * powerpc(64)
    * sparc(64)
    * ia64
    * alpha
    * s390

    Boot and batch alloc/free tests were run on x86_32 with debug code
    (x86_32 doesn't use the default first chunk initialization). Compile
    tested on sparc(32), powerpc(32), arm and alpha.

    Kyle McMartin reported that this change breaks parisc. The problem is
    still under investigation and he is okay with pushing this patch
    forward and fixing parisc later.

    [ Impact: use dynamic allocator for most archs w/o custom percpu setup ]

    Signed-off-by: Tejun Heo
    Acked-by: Rusty Russell
    Acked-by: David S. Miller
    Acked-by: Benjamin Herrenschmidt
    Acked-by: Martin Schwidefsky
    Reviewed-by: Christoph Lameter
    Cc: Paul Mundt
    Cc: Russell King
    Cc: Mikael Starvik
    Cc: Ralf Baechle
    Cc: Bryan Wu
    Cc: Kyle McMartin
    Cc: Matthew Wilcox
    Cc: Grant Grundler
    Cc: Hirokazu Takata
    Cc: Richard Henderson
    Cc: Ivan Kokshaysky
    Cc: Heiko Carstens
    Cc: Ingo Molnar

    Tejun Heo
     

07 Apr, 2009

1 commit


30 Mar, 2009

1 commit


11 Mar, 2009

1 commit

  • Impact: remove spurious WARN on legacy SMP percpu allocator

    Commit f2a8205c4ef1af917d175c36a4097ae5587791c8 added an overly tight
    WARN_ON_ONCE() on alignments for the UP and legacy SMP percpu
    allocators. Commit e317603694bfd17b28a40de9d65e1a4ec12f816e fixed it
    for UP, but the legacy SMP allocator was forgotten. Fix it.

    Signed-off-by: Tejun Heo
    Reported-by: Sachin P. Sant

    Tejun Heo
     

20 Feb, 2009

1 commit

  • Impact: kill unused functions

    percpu_alloc() and its friends never saw much action. It was supposed
    to replace the cpu-mask-unaware __alloc_percpu(), but that never
    happened; in fact, __percpu_alloc_mask() itself never really grew a
    proper up/down handling interface either (no exported interface for
    populate/depopulate).

    percpu allocation is about to go through a major reimplementation and
    there's no reason to carry this unused interface around. Replace it
    with __alloc_percpu() and free_percpu().
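
    For reference, a hedged sketch of how callers use the surviving
    interface (the allocator calls and iterators are the real ones; the
    example structure and function are purely illustrative):

    #include <linux/percpu.h>
    #include <linux/kernel.h>
    #include <linux/errno.h>

    struct my_stats {                  /* hypothetical example structure */
        unsigned long packets;
        unsigned long bytes;
    };

    static int my_stats_sum(void)
    {
        struct my_stats *stats;
        unsigned long total = 0;
        int cpu;

        /* alloc_percpu() wraps __alloc_percpu(size, align) */
        stats = alloc_percpu(struct my_stats);
        if (!stats)
            return -ENOMEM;

        /* each cpu's copy is reached through per_cpu_ptr() */
        for_each_possible_cpu(cpu)
            total += per_cpu_ptr(stats, cpu)->packets;

        pr_info("total packets: %lu\n", total);
        free_percpu(stats);
        return 0;
    }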

    Signed-off-by: Tejun Heo

    Tejun Heo
     

27 Jul, 2008

1 commit

  • This patch makes the following needlessly global functions static:
    - percpu_depopulate()
    - __percpu_depopulate_mask()
    - percpu_populate()
    - __percpu_populate_mask()

    Signed-off-by: Adrian Bunk
    Acked-by: Christoph Lameter
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Adrian Bunk
     

06 Jul, 2008

1 commit


05 Jul, 2008

1 commit

  • Remove all clameter@sgi.com addresses from the kernel tree since they will
    become invalid on June 27th. Change my maintainer email address for the
    slab allocators to cl@linux-foundation.org (which will be the new email
    address for the future).

    Signed-off-by: Christoph Lameter
    Signed-off-by: Christoph Lameter
    Cc: Pekka Enberg
    Cc: Stephen Rothwell
    Cc: Matt Mackall
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Christoph Lameter
     

24 May, 2008

1 commit


20 Apr, 2008

1 commit

  • * Replace usages of CPU_MASK_NONE, CPU_MASK_ALL, NODE_MASK_NONE,
    NODE_MASK_ALL to reduce stack requirements for large NR_CPUS
    and MAXNODES counts.

    * In some cases, the cpumask variable was initialized but then overwritten
    with another value. This is the case for changes like this:

    - cpumask_t oldmask = CPU_MASK_ALL;
    + cpumask_t oldmask;
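
    The stack saving is easy to quantify: a cpumask_t is one bit per
    possible cpu, so every on-stack cpumask costs NR_CPUS/8 bytes. A
    stand-alone sketch (the NR_CPUS value is chosen only for illustration):

    #include <stdio.h>

    #define NR_CPUS 4096

    /* same layout idea as the kernel's cpumask_t: a fixed-size bitmap */
    typedef struct {
        unsigned long bits[NR_CPUS / (8 * sizeof(unsigned long))];
    } cpumask_t;

    int main(void)
    {
        /* 4096 bits -> 512 bytes for every such local variable */
        printf("sizeof(cpumask_t) = %zu bytes\n", sizeof(cpumask_t));
        return 0;
    }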

    Signed-off-by: Mike Travis
    Signed-off-by: Ingo Molnar

    Mike Travis
     

05 Mar, 2008

1 commit

  • Some oprofile results obtained while using tbench on a 2x2 cpu machine were
    very surprising.

    For example, the loopback_xmit() function was using a high number of
    cpu cycles to perform its statistics updates, which are supposed to be
    really cheap since they use percpu data:

    pcpu_lstats = netdev_priv(dev);
    lb_stats = per_cpu_ptr(pcpu_lstats, smp_processor_id());
    lb_stats->packets++; /* HERE : serious contention */
    lb_stats->bytes += skb->len;

    struct pcpu_lstats is a small structure containing two longs. It appears
    that on my 32-bit platform, alloc_percpu(8) allocates a single cache line
    instead of giving each cpu a separate cache line.

    Using the following patch gave me an impressive boost in various
    benchmarks (6% in tbench); all percpu_counters hit this bug too.

    The long-term fix (ie >= 2.6.26) would be to let each CPU allocate its
    own block of memory, so that we don't need to round up sizes to
    L1_CACHE_BYTES, or to merge the SGI stuff of course...

    Note: SLUB vs SLAB is important here to *show* the improvement, since they
    don't have the same minimum allocation sizes (8 bytes vs 32 bytes). This
    could very well explain the regressions some people reported when they
    switched to SLUB.
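
    As the message implies, the stop-gap is to round each cpu's allocation
    up to L1_CACHE_BYTES so that two cpus never share a cache line. A
    stand-alone sketch of that rounding (the 64-byte line size is assumed
    for illustration; the kernel takes it from the architecture):

    #include <stdio.h>

    #define L1_CACHE_BYTES 64          /* assumed line size, illustration only */

    /* round a size up to the next multiple of the cache line size */
    static size_t l1_cache_align(size_t size)
    {
        return (size + L1_CACHE_BYTES - 1) & ~((size_t)L1_CACHE_BYTES - 1);
    }

    int main(void)
    {
        /* struct pcpu_lstats is two longs: 8 bytes on 32-bit, 16 on 64-bit */
        printf("8 bytes  -> %zu\n", l1_cache_align(8));
        printf("16 bytes -> %zu\n", l1_cache_align(16));
        return 0;
    }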

    Signed-off-by: Eric Dumazet
    Acked-by: Peter Zijlstra
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Eric Dumazet
     

07 Feb, 2008

1 commit


18 Jul, 2007

1 commit

  • kmalloc_node() and kmem_cache_alloc_node() were not available in a zeroing
    variant in the past. But with __GFP_ZERO it is now possible to do the
    zeroing while allocating.

    Use __GFP_ZERO to remove the explicit clearing of memory via memset
    wherever we can.
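
    The conversion is mechanical; an illustrative hunk (not taken from
    this patch) looks like:

    -       ptr = kmalloc_node(size, GFP_KERNEL, node);
    -       if (ptr)
    -               memset(ptr, 0, size);
    +       ptr = kmalloc_node(size, GFP_KERNEL | __GFP_ZERO, node);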

    Signed-off-by: Christoph Lameter
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Christoph Lameter
     

08 Dec, 2006

1 commit

  • The patch (as824b) makes percpu_free() ignore NULL arguments, as one would
    expect for a deallocation routine. (Note that free_percpu is #defined as
    percpu_free in include/linux/percpu.h.) A few callers are updated to remove
    now-unneeded tests for NULL. A few other callers already seem to assume
    that passing a NULL pointer to percpu_free() is okay!

    The patch also removes an unnecessary NULL check in percpu_depopulate().
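
    This mirrors the long-standing kfree(NULL) convention; an illustrative
    caller cleanup (not from the patch) looks like:

    -       if (priv->stats)
    -               percpu_free(priv->stats);
    +       percpu_free(priv->stats);       /* NULL is simply ignored */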

    Signed-off-by: Alan Stern
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Alan Stern
     

26 Sep, 2006

1 commit