Eric Lee / smarc-fsl-linux-kernel

19 Apr, 2009

2 commits

a489f0b55 lguest: fix guest crash on non-linear addresses in gdt pvops

Fixes guest crash 'lguest: bad read address 0x4800000 len 256'

The new per-cpu allocator ends up handing a non-linear address to
write_gdt_entry. We do __pa() on it, and hand it to the host, which
kills us.

I've long wanted to make the hypercall "LOAD_GDT_ENTRY" to match the IDT
code, but had no pressing reason until now.

Signed-off-by: Rusty Russell
Cc: lguest@ozlabs.org

Rusty Russell
2009-04-19 21:44:01 +0800
88df781af lguest: fix crash on vmlinux images

Typical message: 'lguest: unhandled trap 6 at 0x418726 (0x0)'

vmlinux guests were broken by 4cd8b5e2a159f18a1507f1187b44a1acbfa6341b
'lguest: use KVM hypercalls', which rewrites guest text from kvm hypercalls
to trap 31.

The Launcher mmaps the kernel image. The Guest executes and
immediately faults in the first text page (read-only). Then it hits a
hypercall, and we rewrite that hypercall, causing a copy-on-write.
But the Guest pagetables still refer to the old page: we fault again,
but as Host we see the hypercall already rewritten, and pass the fault
back to the Guest. The Guest hasn't set up an IDT yet, so we kill it.

This doesn't happen with bzImages: they unpack themselves and so the
text pages are already read-write.

Signed-off-by: Rusty Russell
Tested-by: Patrick McHardy

Matias Zabaljauregui
2009-04-19 21:44:00 +0800

30 Mar, 2009

3 commits

df1693abc lguest: use bool instead of int

Impact: clean up

Rusty told me, some time ago, that he had become a fan of "bool".
So, here are some replacements.

Signed-off-by: Matias Zabaljauregui
Signed-off-by: Rusty Russell

Matias Zabaljauregui
2009-03-30 19:25:25 +0800
4cd8b5e2a lguest: use KVM hypercalls

Impact: cleanup

This patch allows us to use KVM hypercalls.

Signed-off-by: Matias Zabaljauregui
Signed-off-by: Rusty Russell

Matias Zabaljauregui
2009-03-30 19:25:24 +0800
6afbdd059 lguest: fix spurious BUG_ON() on invalid guest stack.

Impact: fix crash on misbehaving guest

gpte_addr() contains a BUG_ON(), insisting that the present flag is
set. We need to return before we call it if that isn't the case.

Signed-off-by: Rusty Russell
Cc: stable@kernel.org

Rusty Russell
2009-03-30 19:25:23 +0800
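
The guard described in 6afbdd059 can be illustrated outside the kernel. What follows is a minimal userspace sketch, not the lguest code: `sketch_entry_addr()` stands in for gpte_addr(), `assert()` plays the role of BUG_ON(), and every `sketch_` name, the flat 32-bit entry layout and the constants are assumptions made for illustration.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

#define SKETCH_PAGE_PRESENT 0x1u /* bit 0 of an i386 page table entry */
#define SKETCH_PAGE_SHIFT   12

/* Stand-in for gpte_addr(): only meaningful when the entry is present. */
static uint32_t sketch_entry_addr(uint32_t entry)
{
    assert(entry & SKETCH_PAGE_PRESENT); /* plays the role of BUG_ON() */
    return (entry >> SKETCH_PAGE_SHIFT) << SKETCH_PAGE_SHIFT;
}

/*
 * Caller-side guard: return early for a non-present entry instead of
 * letting a misbehaving guest trip the assertion in the helper.
 */
static bool sketch_lookup(uint32_t entry, uint32_t *addr)
{
    if (!(entry & SKETCH_PAGE_PRESENT))
        return false;
    *addr = sketch_entry_addr(entry);
    return true;
}
```

With this shape, a non-present entry such as 0x5000 makes `sketch_lookup()` return false, while 0x5001 yields the page address 0x5000.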

28 Mar, 2009

1 commit

6e15cf048 Merge branch 'core/percpu' into percpu-cpumask-x86-for-linus-2

Conflicts:
arch/parisc/kernel/irq.c
arch/x86/include/asm/fixmap_64.h
arch/x86/include/asm/setup.h
kernel/irq/handle.c

Semantic merge:
arch/x86/include/asm/fixmap.h

Signed-off-by: Ingo Molnar

Ingo Molnar
2009-03-28 00:28:43 +0800

09 Mar, 2009

1 commit

6db6a5f3a lguest: fix for CONFIG_SPARSE_IRQ=y

Impact: remove lots of lguest boot WARN_ON() when CONFIG_SPARSE_IRQ=y

We now need to call irq_to_desc_alloc_cpu() before
set_irq_chip_and_handler_name(), but we can't do that from init_IRQ (no
kmalloc available).

So do it as we use interrupts instead. Also means we only alloc for
irqs we use, which was the intent of CONFIG_SPARSE_IRQ anyway.

Signed-off-by: Rusty Russell
Cc: Ingo Molnar

Rusty Russell
2009-03-09 07:36:29 +0800

23 Feb, 2009

1 commit

965c7ecaf x86: remove the Voyager 32-bit subarch

Impact: remove unused/broken code

The Voyager subarch last built successfully on the v2.6.26 kernel
and has been stale since then and does not build on the v2.6.27,
v2.6.28 and v2.6.29-rc5 kernels.

No actual users beyond the maintainer reported this breakage.
Patches were sent and most of the fixes were accepted but the
discussion around how to do a few remaining issues cleanly
fizzled out with no resolution and the code remained broken.

In the v2.6.30 x86 tree development cycle 32-bit subarch support
has been reworked and removed - and the Voyager code, beyond the
build problems already known, needs serious and significant
changes and probably a rewrite to support it.

CONFIG_X86_VOYAGER has been marked BROKEN then. The maintainer has
been notified but no patches have been sent so far to fix it.

While all other subarchs have been converted to the new scheme,
voyager is still broken. We'd prefer to receive patches which
clean up the current situation in a constructive way, but even in
case of removal there is no obstacle to add that support back
after the issues have been sorted out in a mutually acceptable
fashion.

So remove this inactive code for now.

Signed-off-by: Ingo Molnar

Ingo Molnar
2009-02-23 07:54:01 +0800

30 Jan, 2009

2 commits

05dfdbbd6 lguest: Fix a memory leak with the lg object during launcher close

Fix a memory leak identified by Rusty Russell during LCA09 by
kfree'ing the lg object instead of just clearing it when the
launcher closes.

Signed-off-by: Mark Wallis
Signed-off-by: Rusty Russell

Mark Wallis
2009-01-30 09:04:11 +0800
72410af92 lguest: typos fix

3 points

lguest_asm.S => i386_head.S
LHCALL_BREAK => LHREQ_BREAK
perferred => preferred

Signed-off-by: Atsushi SAKAI
Signed-off-by: Rusty Russell

Atsushi SAKAI
2009-01-30 09:04:10 +0800

07 Jan, 2009

1 commit

ff8561c4a lguest: do not statically allocate root device

We shouldn't be statically allocating the root device object,
so dynamically allocate it using root_device_register()
instead.

Signed-off-by: Mark McLoughlin
Acked-by: Rusty Russell
Signed-off-by: Greg Kroah-Hartman

Mark McLoughlin
2009-01-07 02:44:34 +0800

03 Jan, 2009

1 commit

b840d7963 Merge branch 'cpus4096-for-linus-2' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/linux-2.6-tip

* 'cpus4096-for-linus-2' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/linux-2.6-tip: (66 commits)
x86: export vector_used_by_percpu_irq
x86: use logical apicid in x2apic_cluster's x2apic_cpu_mask_to_apicid_and()
sched: nominate preferred wakeup cpu, fix
x86: fix lguest used_vectors breakage, -v2
x86: fix warning in arch/x86/kernel/io_apic.c
sched: fix warning in kernel/sched.c
sched: move test_sd_parent() to an SMP section of sched.h
sched: add SD_BALANCE_NEWIDLE at MC and CPU level for sched_mc>0
sched: activate active load balancing in new idle cpus
sched: bias task wakeups to preferred semi-idle packages
sched: nominate preferred wakeup cpu
sched: favour lower logical cpu number for sched_mc balance
sched: framework for sched_mc/smt_power_savings=N
sched: convert BALANCE_FOR_xx_POWER to inline functions
x86: use possible_cpus=NUM to extend the possible cpus allowed
x86: fix cpu_mask_to_apicid_and to include cpu_online_mask
x86: update io_apic.c to the new cpumask code
x86: Introduce topology_core_cpumask()/topology_thread_cpumask()
x86: xen: use smp_call_function_many()
x86: use work_on_cpu in x86/kernel/cpu/mcheck/mce_amd_64.c
...

Fixed up trivial conflict in kernel/time/tick-sched.c manually

Linus Torvalds
2009-01-03 03:44:09 +0800

30 Dec, 2008

4 commits

bda53cd51 lguest: struct device - replace bus_id with dev_name()

bus_id is gradually being removed, so use dev_name() instead.

Signed-off-by: Mark McLoughlin
Cc: Kay Sievers
Cc: Greg Kroah-Hartman
Signed-off-by: Rusty Russell

Mark McLoughlin
2008-12-30 06:56:12 +0800
58a245664 lguest: move the initial guest page table creation code to the host

This patch moves the initial guest page table creation code to the host,
so the launcher keeps working with PAE enabled configs.

Signed-off-by: Matias Zabaljauregui
Signed-off-by: Rusty Russell

Matias Zabaljauregui
2008-12-30 06:56:11 +0800
87c7d57c1 virtio: hand virtio ring alignment as argument to vring_new_virtqueue

This allows each virtio user to hand in the alignment appropriate to
their virtio_ring structures.

Signed-off-by: Rusty Russell
Acked-by: Christian Borntraeger

Rusty Russell
2008-12-30 06:56:03 +0800
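
To see why the alignment in 87c7d57c1 matters to each virtio user, here is a hedged userspace sketch of how ring size could depend on it. It assumes a ring laid out as descriptor table, then available ring, then a used ring padded up to the alignment. The structure sizes are hard-coded assumptions, and `sketch_vring_size()` is a hypothetical helper, not the kernel function.

```c
#include <assert.h>
#include <stddef.h>

#define SKETCH_DESC_SIZE      16u /* assumed size of one descriptor */
#define SKETCH_USED_ELEM_SIZE 8u  /* assumed size of one used-ring element */
#define SKETCH_U16_SIZE       2u  /* size of the 16-bit ring fields */

/* Total bytes needed for a ring of `num` entries with the given alignment. */
static size_t sketch_vring_size(unsigned int num, size_t align)
{
    /* The rounding trick below only works for a power-of-two alignment. */
    assert(align != 0 && (align & (align - 1)) == 0);

    /* Descriptor table plus available ring (flags, idx, num entries). */
    size_t first = SKETCH_DESC_SIZE * (size_t)num
                 + SKETCH_U16_SIZE * (2 + (size_t)num);

    /* The used ring starts at the next multiple of `align`. */
    size_t used_start = (first + align - 1) & ~(align - 1);

    /* Used ring: flags, idx, then num elements. */
    return used_start + SKETCH_U16_SIZE * 2
         + SKETCH_USED_ELEM_SIZE * (size_t)num;
}
```

Under these assumptions a 256-entry ring needs 10244 bytes with a 4096-byte alignment but only 6724 bytes with a 64-byte alignment, which is the kind of choice the new argument leaves to the caller.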
2966af73e virtio: use LGUEST_VRING_ALIGN instead of relying on pagesize

This doesn't really matter, since lguest is i386 only at the moment,
but we could actually choose a different value. (lguest doesn't have
a guaranteed ABI).

Signed-off-by: Rusty Russell

Rusty Russell
2008-12-30 06:56:02 +0800

24 Dec, 2008

1 commit

b77b881f2 x86: fix lguest used_vectors breakage, -v2

Impact: fix lguest, clean up

32-bit lguest used used_vectors to record vectors, but that model of
allocating vectors changed and got broken, after we changed vector
allocation to a per_cpu array.

Try to enable that for 64-bit, so that the array is used for all vectors
that are not managed by the vector_irq per_cpu array.

Also kill system_vectors[], that is now a duplication of the
used_vectors bitmap.

[ merged in cpus4096 due to io_apic.c cpumask changes. ]
[ -v2, fix build failure ]

Signed-off-by: Yinghai Lu
Signed-off-by: Ingo Molnar
Signed-off-by: Ingo Molnar

Yinghai Lu
2008-12-24 05:37:28 +0800

25 Aug, 2008

1 commit

1dc3e3bcb lguest: update commentry

Signed-off-by: Rusty Russell

Rusty Russell
2008-08-25 22:19:28 +0800

12 Aug, 2008

1 commit

71a3f4edc lguest: use get_user_pages_fast() instead of get_user_pages()

Using a simple page table thrashing program I measure a slight
improvement. The program creates five processes. Each touches 1000
pages then schedules the next process. We repeat this 1000 times. As
lguest only caches 4 cr3 values, this rebuilds a lot of shadow page
tables requiring virt->phys mappings.

Before: 5.93 seconds
After: 5.40 seconds

(Counts of slow vs fastpath in this usage are 6092 and 2852462 respectively.)

And more importantly for lguest, the code is simpler.

Signed-off-by: Rusty Russell

Rusty Russell
2008-08-12 15:52:53 +0800
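
The figures quoted in 71a3f4edc can be turned into relative numbers with a little arithmetic. The helpers below are a sketch with hypothetical names; only the input values (5.93 s, 5.40 s, 6092 slow-path and 2852462 fast-path calls) come from the commit message.

```c
#include <assert.h>

/* Percentage by which `after` is shorter than `before`. */
static double sketch_percent_faster(double before, double after)
{
    assert(before > 0.0);
    return (before - after) / before * 100.0;
}

/* Share of `part` in the total of `part + rest`, as a percentage. */
static double sketch_percent_share(unsigned long part, unsigned long rest)
{
    assert(part + rest > 0);
    return (double)part / (double)(part + rest) * 100.0;
}
```

`sketch_percent_faster(5.93, 5.40)` comes to roughly 8.9 percent, and `sketch_percent_share(2852462UL, 6092UL)` shows that about 99.8 percent of the lookups took the fast path.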

29 Jul, 2008

3 commits

cf485e566 lguest: use cpu capability accessors

To support my little make-x86-bitops-use-proper-typechecking projectlet.

Cc: Thomas Gleixner
Cc: Andrea Arcangeli
Signed-off-by: Andrew Morton
Acked-by: Ingo Molnar
Signed-off-by: Rusty Russell

Andrew Morton
2008-07-29 07:58:34 +0800
0a707210a lguest: fix switcher_page leak on unload

map_switcher allocates the array, unmap_switcher has to free it
accordingly.

Signed-off-by: Johannes Weiner
Signed-off-by: Rusty Russell

Johannes Weiner
2008-07-29 07:58:32 +0800
0c12091d8 lguest: Guest int3 fix

Ron Minnich noticed that guest userspace gets a GPF when it tries to int3:
we need to copy the privilege level from the guest-supplied IDT to the real
IDT. int3 is the only common case where guest userspace expects to invoke
an interrupt, so that's the symptom of failing to do this.

Signed-off-by: Rusty Russell

Rusty Russell
2008-07-29 07:58:31 +0800
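
The privilege check behind 0c12091d8 can be sketched in userspace. On x86 the upper 32-bit word of an IDT gate descriptor holds the descriptor privilege level (DPL) in bits 13 and 14, and a software interrupt is refused with a general protection fault when the caller's privilege level is numerically greater than the DPL. The `sketch_` helpers below are hypothetical and only model that rule; they are not the lguest implementation.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

#define SKETCH_GATE_DPL_SHIFT 13
#define SKETCH_GATE_DPL_MASK  (0x3u << SKETCH_GATE_DPL_SHIFT)

/* Extract the DPL from the high word of a gate descriptor. */
static unsigned int sketch_gate_dpl(uint32_t hi)
{
    return (hi & SKETCH_GATE_DPL_MASK) >> SKETCH_GATE_DPL_SHIFT;
}

/* Carry the guest-supplied DPL over into the entry the CPU will use. */
static uint32_t sketch_copy_dpl(uint32_t real_hi, uint32_t guest_hi)
{
    return (real_hi & ~SKETCH_GATE_DPL_MASK) | (guest_hi & SKETCH_GATE_DPL_MASK);
}

/* A software interrupt is allowed only if CPL <= DPL. */
static bool sketch_soft_int_allowed(unsigned int cpl, uint32_t hi)
{
    assert(cpl <= 3);
    return cpl <= sketch_gate_dpl(hi);
}
```

A gate whose high word is 0x8F00 has DPL 0, so userspace at privilege level 3 cannot invoke it. After copying the DPL from a guest gate of 0xEF00 the same call is permitted, which matches the int3 symptom described above.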

25 Jul, 2008

2 commits

e34f87256 virtio: Add transport feature handling stub for virtio_ring.

To prepare for virtio_ring transport feature bits, hook in a call in
all the users to manipulate them. This currently just clears all the
bits, since it doesn't understand any features.

Signed-off-by: Rusty Russell

Rusty Russell
2008-07-25 10:06:14 +0800
c624896e4 virtio: Rename set_features to finalize_features

Rather than explicitly handing the features to the lower-level, we just
hand the virtio_device and have it set the features. This makes it clear
that it has the chance to manipulate the features of the device at this
point (and that all feature negotiation is already done).

Signed-off-by: Rusty Russell

Rusty Russell
2008-07-25 10:06:12 +0800

16 Jul, 2008

1 commit

1a781a777 Merge branch 'generic-ipi' into generic-ipi-for-linus

Conflicts:

arch/powerpc/Kconfig
arch/s390/kernel/time.c
arch/x86/kernel/apic_32.c
arch/x86/kernel/cpu/perfctr-watchdog.c
arch/x86/kernel/i8259_64.c
arch/x86/kernel/ldt.c
arch/x86/kernel/nmi_64.c
arch/x86/kernel/smpboot.c
arch/x86/xen/smp.c
include/asm-x86/hw_irq_32.h
include/asm-x86/hw_irq_64.h
include/asm-x86/mach-default/irq_vectors.h
include/asm-x86/mach-voyager/irq_vectors.h
include/asm-x86/smp.h
kernel/Makefile

Signed-off-by: Ingo Molnar

Ingo Molnar
2008-07-16 03:55:59 +0800

11 Jul, 2008

1 commit

15e551d25 x86, VisWS: turn into generic arch, eliminate Kconfig specials

remove leftover traces of various VISWS related Kconfig specials.

Signed-off-by: Ingo Molnar

Ingo Molnar
2008-07-11 00:55:47 +0800

26 Jun, 2008

1 commit

15c8b6c1a on_each_cpu(): kill unused 'retry' parameter

It's not even passed on to smp_call_function() anymore, since that
was removed. So kill it.

Acked-by: Jeremy Fitzhardinge
Reviewed-by: Paul E. McKenney
Signed-off-by: Jens Axboe

Jens Axboe
2008-06-26 17:24:38 +0800

25 Jun, 2008

1 commit

d02859ecb Merge commit 'v2.6.26-rc8' into x86/xen

Conflicts:

arch/x86/xen/enlighten.c
arch/x86/xen/mmu.c

Signed-off-by: Ingo Molnar

Ingo Molnar
2008-06-25 18:16:51 +0800

20 Jun, 2008

1 commit

54481cf88 x86: fix NULL pointer deref in __switch_to

I am able to reproduce the oops reported by Simon in __switch_to() with
lguest.

My debugging showed that there is at least one lguest-specific
issue (which should be present in 2.6.25 and before as well) and it got
exposed with a kernel oops with the recent fpu dynamic allocation patches.

In addition to the previous possible scenario (with fpu_counter), in the
presence of lguest, it is possible that the cpu's TS bit is still set and the
lguest launcher task's thread_info has TS_USEDFPU still set.

This is because of the way the lguest launcher handles the guest's TS bit
(look at lguest_set_ts() in lguest_arch_run_guest()). This can result
in a DNA fault while doing unlazy_fpu() in __switch_to(). This will
end up causing a DNA fault in the context of the new process that's
getting context switched in (as opposed to handling the DNA fault in the
context of the lguest launcher/helper process).

This is wrong in both pre and post 2.6.25 kernels. In the recent
2.6.26-rc series, this is showing up as NULL pointer dereferences or
sleeping function called from atomic context(__switch_to()), as
we free and dynamically allocate the FPU context for the newly
created threads. Older kernels might show some FPU corruption for processes
running inside of lguest.

With the appended patch, my test system has been running for more than
50 mins now. So at least some of your oopses (hopefully all!) should get fixed.
Please give it a try. I will spend more time with this fix tomorrow.

Reported-by: Simon Holm Thøgersen
Reported-by: Patrick McHardy
Signed-off-by: Suresh Siddha
Signed-off-by: Ingo Molnar

Suresh Siddha
2008-06-20 19:26:18 +0800

16 Jun, 2008

1 commit

688d22e23 Merge branch 'linus' into x86/xen

Ingo Molnar
2008-06-16 17:21:27 +0800

30 May, 2008

2 commits

b769f5790 virtio: set device index in common code.

Anthony Liguori points out that three different transports use the virtio code,
but each one keeps its own counter to set the virtio_device's index field. In
theory (though not in current practice) this means that names could be
duplicated, and that risk grows as more transports are created.

So we move the selection of the unique virtio_device.index into the common code
in virtio.c, which has the side-benefit of removing duplicate code.

The only complexity is that lguest and S/390 use the index to uniquely identify
the device in case of catastrophic failure before register_virtio_device() is
called: now we use the offset within the descriptor page as a unique identifier
for the printks.

Signed-off-by: Rusty Russell
Cc: Christian Borntraeger
Cc: Martin Schwidefsky
Cc: Carsten Otte
Cc: Heiko Carstens
Cc: Chris Lalancette
Cc: Anthony Liguori

Rusty Russell
2008-05-30 13:09:42 +0800
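
The idea in b769f5790 of a single index counter shared by every transport can be sketched as follows. This is a simplified userspace model with hypothetical `sketch_` names and no locking, not the code in virtio.c.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical, stripped-down device record. */
struct sketch_virtio_device {
    unsigned int index;
    const char *transport;
};

/* One counter in the common code instead of one per transport. */
static unsigned int sketch_next_index;

/* Common registration path: every transport gets its index here. */
static void sketch_register_device(struct sketch_virtio_device *dev)
{
    assert(dev != NULL);
    dev->index = sketch_next_index++;
}
```

Because all transports go through the same function, two devices cannot end up with the same index, no matter which transport created them.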
e27810f11 lguest: use ioremap_cache, not ioremap

Thanks to Jon Corbet & LWN. Only took me a day to join the dots.

Host->Guest netcat before (with unnecessarily large receive buffers):
1073741824 bytes (1.1 GB) copied, 24.7528 seconds, 43.4 MB/s

After:
1073741824 bytes (1.1 GB) copied, 17.6369 seconds, 60.9 MB/s

Signed-off-by: Rusty Russell

Rusty Russell
2008-05-30 13:09:41 +0800
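
The throughput figures in e27810f11 follow directly from the byte counts and times in the transcript, using 1 MB = 10^6 bytes. The helper below is a hypothetical sketch for checking that arithmetic.

```c
#include <assert.h>

/* Throughput in MB/s, with 1 MB taken as 10^6 bytes. */
static double sketch_mb_per_s(double bytes, double seconds)
{
    assert(seconds > 0.0);
    return bytes / seconds / 1e6;
}
```

`sketch_mb_per_s(1073741824.0, 24.7528)` gives about 43.4 and `sketch_mb_per_s(1073741824.0, 17.6369)` about 60.9, so the change made the transfer roughly 1.4 times faster.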

27 May, 2008

1 commit

a15af1c9e x86/paravirt: add pte_flags to just get pte flags

Add pte_flags() to extract the flags from a pte. This is a special
case of pte_val() which is only guaranteed to return the pte's flags
correctly; the page number may be corrupted or missing.

The intent is to allow paravirt implementations to return pte flags
without having to do any translation of the page number (most notably,
Xen).

Signed-off-by: Jeremy Fitzhardinge
Signed-off-by: Thomas Gleixner

Jeremy Fitzhardinge
2008-05-27 16:11:36 +0800
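
The split between flags and page number that a15af1c9e relies on can be illustrated with a simplified model. The sketch assumes a 32-bit non-PAE entry in which the low 12 bits are flags and the rest is the page frame number. The `sketch_` names and the mask are assumptions, not the kernel's definitions.

```c
#include <assert.h>
#include <stdint.h>

#define SKETCH_PAGE_SHIFT     12
#define SKETCH_PTE_FLAGS_MASK ((1u << SKETCH_PAGE_SHIFT) - 1)

/* Build an entry from a page frame number and flag bits. */
static uint32_t sketch_make_pte(uint32_t pfn, uint32_t flags)
{
    assert(pfn < (1u << 20));                     /* must fit above the flags */
    assert((flags & ~SKETCH_PTE_FLAGS_MASK) == 0); /* flags live in the low bits */
    return (pfn << SKETCH_PAGE_SHIFT) | flags;
}

/* Flags only: no need to look at, or translate, the page number. */
static uint32_t sketch_pte_flags(uint32_t pte)
{
    return pte & SKETCH_PTE_FLAGS_MASK;
}

/* Page frame number, shown for contrast. */
static uint32_t sketch_pte_pfn(uint32_t pte)
{
    return pte >> SKETCH_PAGE_SHIFT;
}
```

A caller that only needs to test a flag can use the masking helper and never touch the page-number bits, which is the saving the commit describes for paravirt backends that would otherwise translate the frame number.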

02 May, 2008

4 commits

a007a751d lguest: make Launcher see device status updates

This brings us closer to Real Life, where we'd examine the device
features once it's set the DRIVER_OK status bit.

Signed-off-by: Rusty Russell

Rusty Russell
2008-05-02 19:50:54 +0800
9f3f74674 lguest: remove bogus NULL cpu check

If lg isn't NULL, and cpu_id is sane, &lg->cpus[cpu_id] can't be NULL.

Signed-off-by: Rusty Russell

Rusty Russell
2008-05-02 19:50:52 +0800
24adf1272 lguest: avoid using NR_CPUS as a bounds check.

NR_CPUS (being a host number) is an arbitrary limit for the Guest.
Using the array size directly (which currently happens to be NR_CPUS)
is more futureproof.

Signed-off-by: Rusty Russell

Rusty Russell
2008-05-02 19:50:51 +0800
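
The two points made in 24adf1272 and 9f3f74674 fit in one small userspace sketch: bound the index by the array's own size, and note that the address of an array element inside a valid structure is never NULL. The structure, the array length of 4 and all `sketch_` names are assumptions for illustration.

```c
#include <assert.h>
#include <stddef.h>

#define SKETCH_ARRAY_SIZE(a) (sizeof(a) / sizeof((a)[0]))

struct sketch_cpu {
    unsigned int id;
};

struct sketch_guest {
    struct sketch_cpu cpus[4]; /* hypothetical size, independent of the host */
};

/* Returns NULL only for an out-of-range cpu_id. */
static struct sketch_cpu *sketch_get_cpu(struct sketch_guest *lg,
                                         unsigned int cpu_id)
{
    assert(lg != NULL);

    /* Check against the array itself, not a host-side constant. */
    if (cpu_id >= SKETCH_ARRAY_SIZE(lg->cpus))
        return NULL;

    /* lg is valid and cpu_id is in range, so this address cannot be NULL. */
    return &lg->cpus[cpu_id];
}
```

If the array is later resized, the bounds check follows automatically, and callers only ever need to test for the out-of-range case.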
c45a6816c virtio: explicit advertisement of driver features

A recent proposed feature addition to the virtio block driver revealed
some flaws in the API: in particular, we assume that feature
negotiation is complete once a driver's probe function returns.

There is nothing in the API to require this, however, and even I
didn't notice when it was violated.

So instead, we require the driver to specify what features it supports
in a table; we can then move the feature negotiation into the virtio
core. The intersection of device and driver features is presented in
a new 'features' bitmap in the struct virtio_device.

Note that this highlights the difference between Linux unsigned-long
bitmaps where each unsigned long is in native endian, and a
straight-forward little-endian array of bytes.

Drivers can still remove feature bits in their probe routine if they
really have to.

API changes:
- dev->config->feature() no longer gets and acks a feature.
- drivers should advertise their features in the 'feature_table' field
- use virtio_has_feature() for extra sanity when checking feature bits

Signed-off-by: Rusty Russell

Rusty Russell
2008-05-02 19:50:50 +0800
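
The negotiation described in c45a6816c, together with its note about bitmap layouts, can be sketched in userspace. The model assumes the device advertises features as a little-endian byte array and the driver lists supported bit numbers in a table. It keeps the result in a single 64-bit word for simplicity, whereas the commit speaks of an unsigned-long bitmap. All `sketch_` names are hypothetical and this is not the virtio core API.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/*
 * Bit n of a little-endian byte array lives in byte n/8, bit n%8,
 * whatever the host's native word endianness is.
 */
static bool sketch_le_bytes_test_bit(const uint8_t *bytes, size_t len,
                                     unsigned int bit)
{
    if (bit / 8 >= len)
        return false;
    return (bytes[bit / 8] >> (bit % 8)) & 1u;
}

/* Turn the driver's table of feature bit numbers into a mask. */
static uint64_t sketch_driver_mask(const unsigned int *table, size_t n)
{
    uint64_t mask = 0;
    for (size_t i = 0; i < n; i++) {
        assert(table[i] < 64);
        mask |= UINT64_C(1) << table[i];
    }
    return mask;
}

/* Core-side negotiation: keep bits offered by the device AND the driver. */
static uint64_t sketch_negotiate(const uint8_t *device_bytes, size_t len,
                                 const unsigned int *table, size_t n)
{
    uint64_t driver = sketch_driver_mask(table, n);
    uint64_t agreed = 0;

    for (unsigned int bit = 0; bit < 64; bit++) {
        if (((driver >> bit) & 1u) &&
            sketch_le_bytes_test_bit(device_bytes, len, bit))
            agreed |= UINT64_C(1) << bit;
    }
    return agreed;
}

/* Checked accessor, in the spirit of virtio_has_feature(). */
static bool sketch_has_feature(uint64_t agreed, unsigned int bit)
{
    assert(bit < 64);
    return (agreed >> bit) & 1u;
}
```

For a device advertising bytes {0x05, 0x02} (bits 0, 2 and 9) and a driver table of {2, 9, 11}, the agreed set is bits 2 and 9. Reading the device bytes one bit at a time avoids depending on how the host lays out its native words.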

19 Apr, 2008

1 commit

d3135846f drivers: Remove unnecessary inclusions of asm/semaphore.h

None of these files use any of the functionality promised by
asm/semaphore.h. It's possible that they rely on it dragging in some
unrelated header file, but I can't build all these files, so we'll have
to fix any build failures as they come up.

Signed-off-by: Matthew Wilcox

Matthew Wilcox
2008-04-19 10:16:32 +0800

31 Mar, 2008

1 commit

74dbf719e misc __user misannotations (pointless casts to long)

Signed-off-by: Al Viro
Signed-off-by: Linus Torvalds

Al Viro
2008-03-31 05:20:23 +0800

28 Mar, 2008

1 commit

a6bd8e130 lguest: comment documentation update.

Took some cycles to re-read the Lguest Journey end-to-end, fix some
rot and tighten some phrases.

Only comments change. No new jokes, but a couple of recycled old jokes.

Signed-off-by: Rusty Russell

Rusty Russell
2008-03-28 08:05:54 +0800