Eric Lee / smarc-fsl-linux-kernel

22 Jul, 2011

1 commit

5dea1c88e lguest: use a special 1:1 linear pagetable mode until first switch. ... Browse Code »

The Host used to create some page tables for the Guest to use at the
top of Guest memory; it would then tell the Guest where this was. In
particular, it created linear mappings for 0 and 0xC0000000 addresses
because lguest used to switch to its real page tables quite late in
boot.

However, since d50d8fe19 Linux initialized boot page tables in
head_32.S even before the "are we lguest?" boot jump. So, now we can
simplify things: the Host pagetable code assumes 1:1 linear mapping
until it first calls the LHCALL_NEW_PGTABLE hypercall, which we now do
before we reach C code.

This also means that the Host doesn't need to know anything about the
Guest's PAGE_OFFSET. (Non-Linux guests might not even have such a
thing).

Signed-off-by: Rusty Russell

Rusty Russell
2011-07-22 13:09:48 +0800

30 Mar, 2010

1 commit

5a0e3ad6a include cleanup: Update gfp.h and slab.h includes to prepare for breaking implic… ... Browse Code »

…it slab.h inclusion from percpu.h

percpu.h is included by sched.h and module.h and thus ends up being
included when building most .c files. percpu.h includes slab.h which
in turn includes gfp.h making everything defined by the two files
universally available and complicating inclusion dependencies.

percpu.h -> slab.h dependency is about to be removed. Prepare for
this change by updating users of gfp and slab facilities include those
headers directly instead of assuming availability. As this conversion
needs to touch large number of source files, the following script is
used as the basis of conversion.

http://userweb.kernel.org/~tj/misc/slabh-sweep.py

The script does the followings.

* Scan files for gfp and slab usages and update includes such that
only the necessary includes are there. ie. if only gfp is used,
gfp.h, if slab is used, slab.h.

* When the script inserts a new include, it looks at the include
blocks and try to put the new include such that its order conforms
to its surrounding. It's put in the include block which contains
core kernel includes, in the same order that the rest are ordered -
alphabetical, Christmas tree, rev-Xmas-tree or at the end if there
doesn't seem to be any matching order.

* If the script can't find a place to put a new include (mostly
because the file doesn't have fitting include block), it prints out
an error message indicating which .h file needs to be added to the
file.

The conversion was done in the following steps.

1. The initial automatic conversion of all .c files updated slightly
over 4000 files, deleting around 700 includes and adding ~480 gfp.h
and ~3000 slab.h inclusions. The script emitted errors for ~400
files.

2. Each error was manually checked. Some didn't need the inclusion,
some needed manual addition while adding it to implementation .h or
embedding .c file was more appropriate for others. This step added
inclusions to around 150 files.

3. The script was run again and the output was compared to the edits
from #2 to make sure no file was left behind.

4. Several build tests were done and a couple of problems were fixed.
e.g. lib/decompress_*.c used malloc/free() wrappers around slab
APIs requiring slab.h to be added manually.

5. The script was run on all .h files but without automatically
editing them as sprinkling gfp.h and slab.h inclusions around .h
files could easily lead to inclusion dependency hell. Most gfp.h
inclusion directives were ignored as stuff from gfp.h was usually
wildly available and often used in preprocessor macros. Each
slab.h inclusion directive was examined and added manually as
necessary.

6. percpu.h was updated not to include slab.h.

7. Build test were done on the following configurations and failures
were fixed. CONFIG_GCOV_KERNEL was turned off for all tests (as my
distributed build env didn't work with gcov compiles) and a few
more options had to be turned off depending on archs to make things
build (like ipr on powerpc/64 which failed due to missing writeq).

* x86 and x86_64 UP and SMP allmodconfig and a custom test config.
* powerpc and powerpc64 SMP allmodconfig
* sparc and sparc64 SMP allmodconfig
* ia64 SMP allmodconfig
* s390 SMP allmodconfig
* alpha SMP allmodconfig
* um on x86_64 SMP allmodconfig

8. percpu.h modifications were reverted so that it could be applied as
a separate patch and serve as bisection point.

Given the fact that I had only a couple of failures from tests on step
6, I'm fairly confident about the coverage of this conversion patch.
If there is a breakage, it's likely to be something in one of the arch
headers which should be easily discoverable easily on most builds of
the specific arch.

Signed-off-by: Tejun Heo <tj@kernel.org>
Guess-its-ok-by: Christoph Lameter <cl@linux-foundation.org>
Cc: Ingo Molnar <mingo@redhat.com>
Cc: Lee Schermerhorn <Lee.Schermerhorn@hp.com>

Tejun Heo
2010-03-30 21:02:32 +0800

30 Jul, 2009

2 commits

1842f23c0 lguest and virtio: cleanup struct definitions to Linux style. ... Browse Code »

I've been doing this for years, and akpm picked me up on it about 12
months ago. lguest partly serves as example code, so let's do it Right.

Also, remove two unused fields in struct vblk_info in the example launcher.

Signed-off-by: Rusty Russell
Cc: Ingo Molnar

Rusty Russell
2009-07-30 14:33:46 +0800
2e04ef769 lguest: fix comment style ... Browse Code »

I don't really notice it (except to begrudge the extra vertical
space), but Ingo does. And he pointed out that one excuse of lguest
is as a teaching tool, it should set a good example.

Signed-off-by: Rusty Russell
Cc: Ingo Molnar

Rusty Russell
2009-07-30 14:33:45 +0800

17 Jul, 2009

1 commit

27de22d03 lguest: remove unnecessary forward struct declaration ... Browse Code »

While fixing lg.h to drop the fwd declaration, I noticed
there's another one ;)

Signed-off-by: Rusty Russell

Davide Libenzi
2009-07-17 20:17:44 +0800

01 Jul, 2009

1 commit

133890103 eventfd: revised interface and cleanups ... Browse Code »

Change the eventfd interface to de-couple the eventfd memory context, from
the file pointer instance.

Without such change, there is no clean way to racely free handle the
POLLHUP event sent when the last instance of the file* goes away. Also,
now the internal eventfd APIs are using the eventfd context instead of the
file*.

This patch is required by KVM's IRQfd code, which is still under
development.

Signed-off-by: Davide Libenzi
Cc: Gregory Haskins
Cc: Rusty Russell
Cc: Benjamin LaHaise
Cc: Avi Kivity
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds

Davide Libenzi
2009-07-01 09:55:58 +0800

12 Jun, 2009

8 commits

5dac051bc lguest: remove obsolete LHREQ_BREAK call ... Browse Code »

We no longer need an efficient mechanism to force the Guest back into
host userspace, as each device is serviced without bothering the main
Guest process (aka. the Launcher).

Signed-off-by: Rusty Russell

Rusty Russell
2009-06-12 20:57:11 +0800
df60aeef4 lguest: use eventfds for device notification ... Browse Code »

Currently, when a Guest wants to perform I/O it calls LHCALL_NOTIFY with
an address: the main Launcher process returns with this address, and figures
out what device to run.

A far nicer model is to let processes bind an eventfd to an address: if we
find one, we simply signal the eventfd.

Signed-off-by: Rusty Russell
Cc: Davide Libenzi

Rusty Russell
2009-06-12 20:57:10 +0800
9f155a9b3 lguest: allow any process to send interrupts ... Browse Code »

We currently only allow the Launcher process to send interrupts, but it
as we already send interrupts from the hrtimer, it's a simple matter of
extracting that code into a common set_interrupt routine.

As we switch to a thread per virtqueue, this avoids a bottleneck through the
main Launcher process.

Signed-off-by: Rusty Russell

Rusty Russell
2009-06-12 20:57:09 +0800
acdd0b629 lguest: PAE support ... Browse Code »

This version requires that host and guest have the same PAE status.
NX cap is not offered to the guest, yet.

Signed-off-by: Matias Zabaljauregui
Signed-off-by: Rusty Russell

Matias Zabaljauregui
2009-06-12 20:57:08 +0800
ebe0ba84f lguest: replace hypercall name LHCALL_SET_PMD with LHCALL_SET_PGD ... Browse Code »

replace LHCALL_SET_PMD with LHCALL_SET_PGD hypercall name
(That's really what it is, and the confusion gets worse with PAE support)

Signed-off-by: Matias Zabaljauregui
Signed-off-by: Rusty Russell
Reported-by: Jeremy Fitzhardinge

Matias Zabaljauregui
2009-06-12 20:57:07 +0800
f086122bb lguest: Segment selectors are 16-bit long. Fix lg_cpu.ss1 definition. ... Browse Code »

If GDT_ENTRIES were every > 256, this could become a problem.

Signed-off-by: Matias Zabaljauregui
Signed-off-by: Rusty Russell

Matias Zabaljauregui
2009-06-12 20:57:04 +0800
a32a8813d lguest: improve interrupt handling, speed up stream networking ... Browse Code »

lguest never checked for pending interrupts when enabling interrupts, and
things still worked. However, it makes a significant difference to TCP
performance, so it's time we fixed it by introducing a pending_irq flag
and checking it on irq_restore and irq_enable.

These two routines are now too big to patch into the 8/10 bytes
patch space, so we drop that code.

Note: The high latency on interrupt delivery had a very curious
effect: once everything else was optimized, networking without GSO was
faster than networking with GSO, since more interrupts were sent and
hence a greater chance of one getting through to the Guest!

Note2: (Almost) Closing the same loophole for iret doesn't have any
measurable effect, so I'm leaving that patch for the moment.

Before:
1GB tcpblast Guest->Host: 30.7 seconds
1GB tcpblast Guest->Host (no GSO): 76.0 seconds

After:
1GB tcpblast Guest->Host: 6.8 seconds
1GB tcpblast Guest->Host (no GSO): 27.8 seconds

Signed-off-by: Rusty Russell

Rusty Russell
2009-06-12 20:57:03 +0800
abd41f037 lguest: fix race in halt code ... Browse Code »

When the Guest does the LHCALL_HALT hypercall, we go to sleep, expecting
that a timer or the Waker will wake_up_process() us.

But we do it in a stupid way, leaving a classic missing wakeup race.

So split maybe_do_interrupt() into interrupt_pending() and
try_deliver_interrupt(), and check maybe_do_interrupt() and the
"break_out" flag before calling schedule.

Signed-off-by: Rusty Russell

Rusty Russell
2009-06-12 20:57:02 +0800

19 Apr, 2009

1 commit

a489f0b55 lguest: fix guest crash on non-linear addresses in gdt pvops ... Browse Code »

Fixes guest crash 'lguest: bad read address 0x4800000 len 256'

The new per-cpu allocator ends up handing a non-linear address to
write_gdt_entry. We do __pa() on it, and hand it to the host, which
kills us.

I've long wanted to make the hypercall "LOAD_GDT_ENTRY" to match the IDT
code, but had no pressing reason until now.

Signed-off-by: Rusty Russell
Cc: lguest@ozlabs.org

Rusty Russell
2009-04-19 21:44:01 +0800

30 Mar, 2009

1 commit

df1693abc lguest: use bool instead of int ... Browse Code »

Impact: clean up

Rusty told me, some time ago, that he had become a fan of "bool".
So, here are some replacements.

Signed-off-by: Matias Zabaljauregui
Signed-off-by: Rusty Russell

Matias Zabaljauregui
2009-03-30 19:25:25 +0800

30 Dec, 2008

1 commit

58a245664 lguest: move the initial guest page table creation code to the host ... Browse Code »

This patch moves the initial guest page table creation code to the host,
so the launcher keeps working with PAE enabled configs.

Signed-off-by: Matias Zabaljauregui
Signed-off-by: Rusty Russell

Matias Zabaljauregui
2008-12-30 06:56:11 +0800

27 May, 2008

1 commit

a15af1c9e x86/paravirt: add pte_flags to just get pte flags ... Browse Code »

Add pte_flags() to extract the flags from a pte. This is a special
case of pte_val() which is only guaranteed to return the pte's flags
correctly; the page number may be corrupted or missing.

The intent is to allow paravirt implementations to return pte flags
without having to do any translation of the page number (most notably,
Xen).

Signed-off-by: Jeremy Fitzhardinge
Signed-off-by: Thomas Gleixner

Jeremy Fitzhardinge
2008-05-27 16:11:36 +0800

19 Apr, 2008

1 commit

d3135846f drivers: Remove unnecessary inclusions of asm/semaphore.h ... Browse Code »

None of these files use any of the functionality promised by
asm/semaphore.h. It's possible that they rely on it dragging in some
unrelated header file, but I can't build all these files, so we'll have
fix any build failures as they come up.

Signed-off-by: Matthew Wilcox

Matthew Wilcox
2008-04-19 10:16:32 +0800

30 Jan, 2008

16 commits

ca94f2bdd lguest: Use explicit includes rateher than indirect ... Browse Code »

explicitly use ktime.h include
explicitly use hrtimer.h include
explicitly use sched.h include

This patch adds headers explicitly to lguest sources file,
to avoid depending on them being included somewhere else.

Signed-off-by: Glauber de Oliveira Costa
Signed-off-by: Rusty Russell

Glauber de Oliveira Costa
2008-01-30 19:50:19 +0800
382ac6b3f lguest: get rid of lg variable assignments ... Browse Code »

We can save some lines of code by getting rid of
*lg = cpu... lines of code spread everywhere by now.

Signed-off-by: Glauber de Oliveira Costa
Signed-off-by: Rusty Russell

Glauber de Oliveira Costa
2008-01-30 19:50:18 +0800
ae3749dcd lguest: move changed bitmap to lg_cpu ... Browse Code »

events represented in the 'changed' bitmap are per-cpu, not per-guest.
move it to the lg_cpu structure

Signed-off-by: Glauber de Oliveira Costa
Signed-off-by: Rusty Russell

Glauber de Oliveira Costa
2008-01-30 19:50:17 +0800
f34f8c5fe lguest: move last_pages to lg_cpu ... Browse Code »

in our new model, pages are assigned to a virtual cpu, not to a guest.
We move it to the lg_cpu structure.

Signed-off-by: Glauber de Oliveira Costa
Signed-off-by: Rusty Russell

Glauber de Oliveira Costa
2008-01-30 19:50:16 +0800
1713608f2 lguest: per-vcpu lguest pgdir management ... Browse Code »

this patch makes the pgdir management per-vcpu. The pgdirs pool
is still guest-wide (although it'll probably need to grow when we
are really executing more vcpus), but the pgdidx index is gone,
since it makes no sense anymore. Instead, we use a per-vcpu
index.

Signed-off-by: Glauber de Oliveira Costa
Signed-off-by: Rusty Russell

Glauber de Oliveira Costa
2008-01-30 19:50:14 +0800
5e232f4f4 lguest: make pending notifications per-vcpu ... Browse Code »

this patch makes the pending_notify field, used to control
pending notifications, per-vcpu, instead of per-guest

Signed-off-by: Glauber de Oliveira Costa
Signed-off-by: Rusty Russell

Glauber de Oliveira Costa
2008-01-30 19:50:13 +0800
4665ac8e2 lguest: makes special fields be per-vcpu ... Browse Code »

lguest struct have room for some fields, namely, cr2, ts, esp1
and ss1, that are not really guest-wide, but rather, vcpu-wide.

This patch puts it in the vcpu struct

Signed-off-by: Glauber de Oliveira Costa
Signed-off-by: Rusty Russell

Glauber de Oliveira Costa
2008-01-30 19:50:13 +0800
66686c2ab lguest: per-vcpu lguest task management ... Browse Code »

lguest uses tasks to control its running behaviour (like sending
breaks, controlling halted state, etc). In a per-vcpu environment,
each vcpu will have its own underlying task. So this patch
makes the infrastructure for that possible

Signed-off-by: Glauber de Oliveira Costa
Signed-off-by: Rusty Russell

Glauber de Oliveira Costa
2008-01-30 19:50:12 +0800
fc708b3e4 lguest: replace lguest_arch with lg_cpu_arch. ... Browse Code »

The fields found in lguest_arch are not really per-guest,
but per-cpu (gdt, idt, etc). So this patch turns lguest_arch
into lg_cpu_arch.

It makes sense to have a per-guest per-arch struct, but this
can be addressed later, when the need arrives.

Signed-off-by: Glauber de Oliveira Costa
Signed-off-by: Rusty Russell

Glauber de Oliveira Costa
2008-01-30 19:50:11 +0800
a53a35a8b lguest: make registers per-vcpu ... Browse Code »

This is the most obvious per-vcpu field: registers.

So this patch moves it from struct lguest to struct vcpu,
and patch the places in which they are used, accordingly

Signed-off-by: Glauber de Oliveira Costa
Signed-off-by: Rusty Russell

Glauber de Oliveira Costa
2008-01-30 19:50:11 +0800
0c78441cf lguest: map_switcher_in_guest() per-vcpu ... Browse Code »

The switcher needs to be mapped per-vcpu, because different vcpus
will potentially have different page tables (they don't have to,
because threads will share the same).

So our first step is the make the function receive a vcpu struct

Signed-off-by: Glauber de Oliveira Costa
Signed-off-by: Rusty Russell

Glauber de Oliveira Costa
2008-01-30 19:50:09 +0800
177e449dc lguest: per-vcpu interrupt processing. ... Browse Code »

This patch adapts interrupt processing for using the vcpu struct.

Signed-off-by: Glauber de Oliveira Costa
Signed-off-by: Rusty Russell

Glauber de Oliveira Costa
2008-01-30 19:50:09 +0800
ad8d8f3bc lguest: per-vcpu lguest timers ... Browse Code »

Here, I introduce per-vcpu timers. With this, we can have
local expiries, needed for accounting time in smp guests

Signed-off-by: Glauber de Oliveira Costa
Signed-off-by: Rusty Russell

Glauber de Oliveira Costa
2008-01-30 19:50:08 +0800
73044f05a lguest: make hypercalls use the vcpu struct ... Browse Code »

this patch changes do_hcall() and do_async_hcall() interfaces (and obviously their
callers) to get a vcpu struct. Again, a vcpu services the hypercall, not the whole
guest

Signed-off-by: Glauber de Oliveira Costa
Signed-off-by: Rusty Russell

Glauber de Oliveira Costa
2008-01-30 19:50:08 +0800
d0953d42c lguest: per-cpu run guest ... Browse Code »

This patch makes the run_guest() routine use the lg_cpu struct.
This is required since in a smp guest environment, there's no
more the notion of "running the guest", but rather, it is "running the vcpu"

Signed-off-by: Glauber de Oliveira Costa
Signed-off-by: Rusty Russell

Glauber de Oliveira Costa
2008-01-30 19:50:06 +0800
badb1e040 lguest: introduce vcpu struct ... Browse Code »

this patch introduces a vcpu struct for lguest. In upcoming patches,
more and more fields will be moved from the lguest struct to the vcpu

Signed-off-by: Glauber de Oliveira Costa
Signed-off-by: Rusty Russell

Glauber de Oliveira Costa
2008-01-30 19:50:04 +0800

25 Oct, 2007

2 commits

e1e72965e lguest: documentation update ... Browse Code »

Went through the documentation doing typo and content fixes. This
patch contains only comment and whitespace changes.

Signed-off-by: Rusty Russell

Rusty Russell
2007-10-25 13:02:50 +0800
197bff630 lguest: remove unused "wake" element from struct lguest ... Browse Code »

Signed-off-by: Rusty Russell

Rusty Russell
2007-10-25 12:10:30 +0800

23 Oct, 2007

3 commits

2d37f94a2 generalize lgread_u32/lgwrite_u32. ... Browse Code »

Jes complains that page table code still uses lgread_u32 even though
it now uses general kernel pte types. The best thing to do is to
generalize lgread_u32 and lgwrite_u32.

This means we lose the efficiency of getuser(). We could potentially
regain it if we used __copy_from_user instead of copy_from_user, but
I'm not certain that our range check is equivalent to access_ok() on
all platforms.

Signed-off-by: Rusty Russell
Acked-by: Jes Sorensen

Rusty Russell
2007-10-23 13:49:56 +0800
15045275c Remove old lguest I/O infrrasructure. ... Browse Code »

This patch gets rid of the old lguest host I/O infrastructure and
replaces it with a single hypercall "LHCALL_NOTIFY" which takes an
address.

The main change is the removal of io.c: that mainly did inter-guest
I/O, which virtio doesn't yet support.

Signed-off-by: Rusty Russell

Rusty Russell
2007-10-23 13:49:55 +0800
47436aa4a Boot with virtual == physical to get closer to native Linux. ... Browse Code »

1) This allows us to get alot closer to booting bzImages.

2) It means we don't have to know page_offset.

3) The Guest needs to modify the boot pagetables to create the
PAGE_OFFSET mapping before jumping to C code.

4) guest_pa() walks the page tables rather than using page_offset.

5) We don't use page_offset to figure out whether to emulate: it was
always kinda quesationable, and won't work for instructions done
before remapping (bzImage unpacking in particular).

6) We still want the kernel address for tlb flushing: have the initial
hypercall give us that, too.

Signed-off-by: Rusty Russell

Rusty Russell
2007-10-23 13:49:54 +0800