12 Jan, 2011

13 commits

  • Make it available for all archs.

    Signed-off-by: Avi Kivity

    Avi Kivity
     
  • Large page information has two elements, but only one of them,
    write_count, is accessed through a helper function.

    This patch replaces that helper with a more generic one which returns the
    newly named kvm_lpage_info structure, and uses it to access the other
    element, rmap_pde.

    Signed-off-by: Takuya Yoshikawa
    Signed-off-by: Avi Kivity

    Takuya Yoshikawa
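
A minimal sketch of the idea, with simplified field types and a toy lookup (the real kernel code indexes per-memslot arrays per large-page level; `toy_slot` and `lpage_info_slot` here are illustrative stand-ins):

```c
#include <assert.h>
#include <stddef.h>

/* Sketch: the two per-large-page fields are bundled into one struct, and
 * a single generic lookup helper returns a pointer to it, so callers can
 * reach either field instead of going through a write_count-only helper.
 * Types and the lookup are simplified placeholders, not the kernel's code. */
struct kvm_lpage_info {
    int write_count;
    unsigned long rmap_pde;
};

/* Toy "memslot" holding one info record per large page. */
struct toy_slot {
    struct kvm_lpage_info lpage_info[4];
};

/* Generic accessor replacing the write_count-only helper. */
static struct kvm_lpage_info *lpage_info_slot(struct toy_slot *slot, int idx)
{
    return &slot->lpage_info[idx];
}
```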
     
  • Quote from Avi:
    | I don't think we need to flush immediately; set a "tlb dirty" bit somewhere
    | that is cleared when we flush the tlb. kvm_mmu_notifier_invalidate_page()
    | can consult the bit and force a flush if set.

    Signed-off-by: Xiao Guangrong
    Signed-off-by: Marcelo Tosatti

    Xiao Guangrong
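
The quoted scheme can be sketched with a toy flag and invented function names (the real code works on per-vcpu state under appropriate locking; this only shows the deferred-flush logic):

```c
#include <assert.h>
#include <stdbool.h>

/* Sketch of the "tlb dirty" idea quoted above: a flag records that TLB
 * state is stale; flushing clears it; the invalidate path forces a flush
 * only if the flag is set. All names here are illustrative. */
static bool tlb_dirty;
static int flush_count;

static void mark_tlb_dirty(void) { tlb_dirty = true; }

static void flush_tlb(void)
{
    flush_count++;
    tlb_dirty = false;
}

/* Analogue of kvm_mmu_notifier_invalidate_page() consulting the bit. */
static void invalidate_page(void)
{
    if (tlb_dirty)
        flush_tlb();
}
```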
     
  • KVM compilation fails with the following warning:

    include/linux/kvm_host.h: In function 'kvm_irq_routing_update':
    include/linux/kvm_host.h:679:2: error: 'struct kvm' has no member named 'irq_routing'

    That function is used, and only makes sense, on systems that implement
    an in-kernel interrupt chip. PPC doesn't.

    Fix by #ifdef'ing it out when no irqchip is available.

    Signed-off-by: Alexander Graf
    Signed-off-by: Avi Kivity

    Alexander Graf
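
The shape of the fix can be sketched as follows, with a stand-in config macro and toy struct (the real guard is `__KVM_HAVE_IOAPIC` / per-arch Kconfig; names here are simplified):

```c
#include <assert.h>

/* Sketch: the member and the helper body are only compiled when an
 * in-kernel irqchip exists, so architectures without one (e.g. PPC)
 * never reference the missing irq_routing member. */
#define HAVE_KVM_IRQCHIP 1   /* would come from per-arch config */

struct toy_kvm {
#ifdef HAVE_KVM_IRQCHIP
    void *irq_routing;       /* only exists with an in-kernel irqchip */
#endif
    int dummy;
};

static int irq_routing_present(void)
{
#ifdef HAVE_KVM_IRQCHIP
    return 1;
#else
    return 0;
#endif
}
```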
     
  • Store irq routing table pointer in the irqfd object,
    and use that to inject MSI directly without bouncing out to
    a kernel thread.

    While we touch this structure, rearrange the irqfd fields so that the
    fast path is better packed, for better cache utilization.

    This also adds some comments about locking rules and rcu usage in the
    code.

    Some notes on the design:
    - Use a pointer into the rt instead of copying an entry,
    to make it possible to use rcu, thus side-stepping
    locking complexities. We also save some memory this way.
    - Old workqueue code is still used for level irqs.
    I don't think we DTRT with level anyway; however,
    it seems easier to keep the code around, as
    it has been thought through and debugged, and to fix level later
    than rip it out and reinstate it later.

    Signed-off-by: Michael S. Tsirkin
    Acked-by: Marcelo Tosatti
    Acked-by: Gregory Haskins
    Signed-off-by: Avi Kivity

    Michael S. Tsirkin
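
The first design note can be sketched in plain userspace C (no real RCU here; in the kernel the write side would use rcu_assign_pointer() and the read side rcu_dereference(), as the comments note; all struct and function names are invented):

```c
#include <assert.h>
#include <stddef.h>

/* Sketch: the irqfd keeps a pointer into the current routing table
 * rather than a private copy, so publishing a new table entry is a
 * pointer-visible update and no entry is duplicated per irqfd. */
struct msi_entry { int gsi; int data; };

struct routing_table { struct msi_entry entries[8]; };

struct toy_irqfd {
    const struct msi_entry *entry;  /* points into the live table */
};

static void irqfd_update(struct toy_irqfd *fd,
                         const struct routing_table *rt, int gsi)
{
    /* In the kernel this assignment would use rcu_assign_pointer(). */
    fd->entry = &rt->entries[gsi];
}

static int irqfd_inject(const struct toy_irqfd *fd)
{
    /* In the kernel the read side would use rcu_dereference(). */
    return fd->entry->data;
}
```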
     
  • Cosmetic change, but it helps to correlate IRQs with PCI devices.

    Acked-by: Alex Williamson
    Acked-by: Michael S. Tsirkin
    Signed-off-by: Jan Kiszka
    Signed-off-by: Marcelo Tosatti

    Jan Kiszka
     
  • This improves the IRQ forwarding for assigned devices: By using the
    kernel's threaded IRQ scheme, we can get rid of the latency-prone work
    queue and simplify the code in the same run.

    Moreover, we no longer have to hold assigned_dev_lock while raising the
    guest IRQ, which can be a lengthy operation as we may have to iterate
    over all VCPUs. The lock is now only used for synchronizing masking vs.
    unmasking of INTx-type IRQs, so it is renamed to intx_lock.

    Acked-by: Alex Williamson
    Acked-by: Michael S. Tsirkin
    Signed-off-by: Jan Kiszka
    Signed-off-by: Marcelo Tosatti

    Jan Kiszka
     
  • IA64 support forces us to abstract the allocation of the kvm structure.
    But instead of mixing this up with arch-specific initialization and
    doing the same on destruction, split both steps. This allows moving
    generic destruction calls into generic code.

    It also fixes error clean-up on failures of kvm_create_vm for IA64.

    Signed-off-by: Jan Kiszka
    Signed-off-by: Avi Kivity

    Jan Kiszka
     
  • Currently x86's kvm_vm_ioctl_get_dirty_log() needs to allocate, via
    vmalloc(), a bitmap which will be used in the next logging round, and this
    has been hurting VGA and live migration: vmalloc() consumes extra system
    time, triggers TLB flushes, etc.

    This patch resolves this issue by pre-allocating one more bitmap and switching
    between two bitmaps during dirty logging.

    Performance improvement:
    I measured performance for the case of VGA update by trace-cmd.
    The result was 1.5 times faster than the original one.

    In the case of live migration, the improvement ratio depends on the workload
    and the guest memory size. In general, the larger the memory size is the more
    benefits we get.

    Note:
    This does not change other architectures' logic, but the allocation size
    doubles. This will increase the actual memory consumption only when
    the new size changes the number of pages allocated by vmalloc().

    Signed-off-by: Takuya Yoshikawa
    Signed-off-by: Fernando Luis Vazquez Cao
    Signed-off-by: Marcelo Tosatti

    Takuya Yoshikawa
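
The double-bitmap switch can be sketched like this; the layout and function name are invented simplifications (the kernel allocates the bitmaps dynamically and copies the retired one to userspace):

```c
#include <assert.h>
#include <string.h>

/* Sketch of the pre-allocated double-bitmap scheme: instead of
 * vmalloc()ing a fresh bitmap on every GET_DIRTY_LOG, two bitmaps are
 * allocated once and swapped; the retired one is handed to userspace
 * while the other collects new dirty bits. */
#define BITMAP_WORDS 4

struct toy_memslot {
    unsigned long dirty_a[BITMAP_WORDS];
    unsigned long dirty_b[BITMAP_WORDS];
    unsigned long *active;   /* bitmap currently collecting dirty bits */
};

static unsigned long *get_dirty_log(struct toy_memslot *slot)
{
    unsigned long *full = slot->active;

    /* Switch logging to the other bitmap and clear it for reuse. */
    slot->active = (full == slot->dirty_a) ? slot->dirty_b : slot->dirty_a;
    memset(slot->active, 0, sizeof(slot->dirty_a));
    return full;             /* caller copies this out to userspace */
}
```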
     
  • As suggested by Andrea, pass the r/w error code to gup(), upgrading a read
    fault to writable if the host pte allows it.

    Signed-off-by: Marcelo Tosatti
    Signed-off-by: Avi Kivity

    Marcelo Tosatti
     
  • The guest enables async PF vcpu functionality using this MSR.

    Reviewed-by: Rik van Riel
    Signed-off-by: Gleb Natapov
    Signed-off-by: Marcelo Tosatti

    Gleb Natapov
     
  • Keep track of memslot changes by keeping a generation number in the
    memslots structure. Provide a kvm_write_guest_cached() function that skips
    the gfn_to_hva() translation if memslots have not changed since the
    previous invocation.

    Acked-by: Rik van Riel
    Signed-off-by: Gleb Natapov
    Signed-off-by: Marcelo Tosatti

    Gleb Natapov
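
A toy sketch of the generation-number cache (the translation itself is faked; `toy_ghc` stands in for the kernel's gfn_to_hva_cache, and the counter only exists to make the fast path observable):

```c
#include <assert.h>

/* Sketch: a cached translation is reused while the memslots generation
 * is unchanged, and redone after any memslot update bumps it. */
struct toy_slots { unsigned generation; unsigned long base_hva; };

struct toy_ghc {          /* analogue of a gfn_to_hva cache */
    unsigned generation;
    unsigned long hva;
};

static int translations;  /* counts slow-path lookups, for illustration */

static unsigned long cached_gfn_to_hva(struct toy_ghc *ghc,
                                       const struct toy_slots *slots,
                                       unsigned long gfn)
{
    if (ghc->generation != slots->generation) {
        translations++;                       /* slow path */
        ghc->hva = slots->base_hva + gfn;     /* toy translation */
        ghc->generation = slots->generation;
    }
    return ghc->hva;                          /* fast path otherwise */
}
```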
     
  • If a guest accesses swapped-out memory, do not swap it in from vcpu thread
    context. Schedule work to do the swapping and put the vcpu into a halted
    state instead.

    Interrupts will still be delivered to the guest, and if an interrupt
    causes a reschedule, the guest will continue running another task.

    [avi: remove call to get_user_pages_noio(), nacked by Linus; this
    makes everything synchronous again]

    Acked-by: Rik van Riel
    Signed-off-by: Gleb Natapov
    Signed-off-by: Marcelo Tosatti

    Gleb Natapov
     

25 Oct, 2010

1 commit

  • * 'kvm-updates/2.6.37' of git://git.kernel.org/pub/scm/virt/kvm/kvm: (321 commits)
    KVM: Drop CONFIG_DMAR dependency around kvm_iommu_map_pages
    KVM: Fix signature of kvm_iommu_map_pages stub
    KVM: MCE: Send SRAR SIGBUS directly
    KVM: MCE: Add MCG_SER_P into KVM_MCE_CAP_SUPPORTED
    KVM: fix typo in copyright notice
    KVM: Disable interrupts around get_kernel_ns()
    KVM: MMU: Avoid sign extension in mmu_alloc_direct_roots() pae root address
    KVM: MMU: move access code parsing to FNAME(walk_addr) function
    KVM: MMU: audit: check whether have unsync sps after root sync
    KVM: MMU: audit: introduce audit_printk to cleanup audit code
    KVM: MMU: audit: unregister audit tracepoints before module unloaded
    KVM: MMU: audit: fix vcpu's spte walking
    KVM: MMU: set access bit for direct mapping
    KVM: MMU: cleanup for error mask set while walk guest page table
    KVM: MMU: update 'root_hpa' out of loop in PAE shadow path
    KVM: x86 emulator: Eliminate compilation warning in x86_decode_insn()
    KVM: x86: Fix constant type in kvm_get_time_scale
    KVM: VMX: Add AX to list of registers clobbered by guest switch
    KVM guest: Move a printk that's using the clock before it's ready
    KVM: x86: TSC catchup mode
    ...

    Linus Torvalds
     

24 Oct, 2010

7 commits


20 Aug, 2010

1 commit


02 Aug, 2010

2 commits


01 Aug, 2010

7 commits

  • May be used for distinguishing between internal and user slots, or for sorting
    slots in size order.

    Signed-off-by: Avi Kivity

    Avi Kivity
     
  • Usually the vcpu->requests bitmap is sparse, so a test_and_clear_bit() for
    each request generates a large number of unneeded atomics if a bit is set.

    Replace with a separate test/clear sequence. This is safe since there is
    no clear_bit() outside the vcpu thread.

    Signed-off-by: Avi Kivity

    Avi Kivity
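
The test/clear split can be sketched as follows; the non-atomic toy below only models the control flow (the kernel uses real atomic bitops, and the counter is added here purely to make the saved atomics visible):

```c
#include <assert.h>

/* Sketch: instead of an unconditional test_and_clear_bit() per request,
 * first do a plain read and only perform the (modeled) atomic clear when
 * the bit is actually set. This is safe under the same condition as in
 * the commit: no clear happens outside the vcpu thread. */
static unsigned long requests;
static int atomic_ops;

static int check_request(int bit)
{
    if (!(requests & (1UL << bit)))   /* cheap non-atomic test */
        return 0;
    atomic_ops++;                     /* models the atomic clear */
    requests &= ~(1UL << bit);
    return 1;
}
```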
     
  • Makes it a little more readable and hackable.

    Signed-off-by: Avi Kivity

    Avi Kivity
     
  • As advertised in feature-removal-schedule.txt. Equivalent support is provided
    by overlapping memory regions.

    Signed-off-by: Avi Kivity

    Avi Kivity
     
  • This patch enables the guest to use the XSAVE/XRSTOR instructions.

    We assume that host_xcr0 uses all possible bits that the OS supports.

    We load xcr0 the same way we handle the fpu - as late as we can.

    Signed-off-by: Dexuan Cui
    Signed-off-by: Sheng Yang
    Reviewed-by: Marcelo Tosatti
    Signed-off-by: Avi Kivity

    Dexuan Cui
     
  • KVM_REQ_KICK poisons vcpu->requests by having a bit set during normal
    operation. This causes the fast path check for a clear vcpu->requests
    to fail all the time, triggering tons of atomic operations.

    Fix by replacing KVM_REQ_KICK with a vcpu->guest_mode atomic.

    Signed-off-by: Avi Kivity

    Avi Kivity
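
The effect of the fix can be sketched with a toy vcpu (non-atomic; in the kernel `guest_mode` is an atomic and `requests` uses atomic bitops):

```c
#include <assert.h>

/* Sketch: KVM_REQ_KICK kept a bit permanently set in vcpu->requests
 * while in guest mode, defeating the "no requests pending" fast path.
 * Moving the in-guest indication to a separate guest_mode flag lets the
 * fast path see a genuinely empty requests word. */
struct toy_vcpu {
    unsigned long requests;
    int guest_mode;          /* replaces the KVM_REQ_KICK bit */
};

static int fast_path_ok(const struct toy_vcpu *v)
{
    return v->requests == 0; /* no longer poisoned by a kick bit */
}
```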
     
  • In common cases, a guest SRAO MCE causes the corresponding poisoned page
    to be unmapped and a SIGBUS to be sent to QEMU-KVM, which then relays
    the MCE to the guest OS.

    But it is reported that if the poisoned page is accessed in the guest
    after unmapping and before the MCE is relayed to the guest OS, userspace
    will be killed.

    The reason is as follows. Because the poisoned page has been unmapped,
    a guest access causes a guest exit, and kvm_mmu_page_fault is
    called. kvm_mmu_page_fault cannot get the poisoned page for the fault
    address, so kernel and user space MMIO processing are tried in turn. In
    user MMIO processing, the poisoned page is accessed again, and userspace
    is killed by force_sig_info.

    To fix the bug, have kvm_mmu_page_fault send a HWPOISON signal to
    QEMU-KVM and not try kernel and user space MMIO processing for poisoned
    pages.

    [xiao: fix warning introduced by avi]

    Reported-by: Max Asbock
    Signed-off-by: Huang Ying
    Signed-off-by: Xiao Guangrong
    Signed-off-by: Marcelo Tosatti
    Signed-off-by: Avi Kivity

    Huang Ying
     

19 May, 2010

1 commit


17 May, 2010

3 commits

  • Nobody uses gva_to_page() anymore, so get rid of it.

    Signed-off-by: Gui Jianfeng
    Signed-off-by: Avi Kivity

    Gui Jianfeng
     
  • The RCU/SRCU APIs have already changed to support proving RCU usage.

    I got the following dmesg with PROVE_RCU=y because we used the incorrect
    API. This patch converts rcu_dereference() to srcu_dereference() or a
    family API.

    ===================================================
    [ INFO: suspicious rcu_dereference_check() usage. ]
    ---------------------------------------------------
    arch/x86/kvm/mmu.c:3020 invoked rcu_dereference_check() without protection!

    other info that might help us debug this:

    rcu_scheduler_active = 1, debug_locks = 0
    2 locks held by qemu-system-x86/8550:
    #0: (&kvm->slots_lock){+.+.+.}, at: [] kvm_set_memory_region+0x29/0x50 [kvm]
    #1: (&(&kvm->mmu_lock)->rlock){+.+...}, at: [] kvm_arch_commit_memory_region+0xa6/0xe2 [kvm]

    stack backtrace:
    Pid: 8550, comm: qemu-system-x86 Not tainted 2.6.34-rc4-tip-01028-g939eab1 #27
    Call Trace:
    [] lockdep_rcu_dereference+0xaa/0xb3
    [] kvm_mmu_calculate_mmu_pages+0x44/0x7d [kvm]
    [] kvm_arch_commit_memory_region+0xb7/0xe2 [kvm]
    [] __kvm_set_memory_region+0x636/0x6e2 [kvm]
    [] kvm_set_memory_region+0x37/0x50 [kvm]
    [] vmx_set_tss_addr+0x46/0x5a [kvm_intel]
    [] kvm_arch_vm_ioctl+0x17a/0xcf8 [kvm]
    [] ? unlock_page+0x27/0x2c
    [] ? __do_fault+0x3a9/0x3e1
    [] kvm_vm_ioctl+0x364/0x38d [kvm]
    [] ? up_read+0x23/0x3d
    [] vfs_ioctl+0x32/0xa6
    [] do_vfs_ioctl+0x495/0x4db
    [] ? fget_light+0xc2/0x241
    [] ? do_sys_open+0x104/0x116
    [] ? retint_swapgs+0xe/0x13
    [] sys_ioctl+0x47/0x6a
    [] system_call_fastpath+0x16/0x1b

    Signed-off-by: Lai Jiangshan
    Signed-off-by: Avi Kivity

    Lai Jiangshan
     
  • This patch limits the number of pages per memory slot so that we no
    longer need to take extra care over type issues.

    Signed-off-by: Takuya Yoshikawa
    Signed-off-by: Marcelo Tosatti

    Takuya Yoshikawa
     

20 Apr, 2010

2 commits

  • This patch increases the current hardcoded limit of NR_IOBUS_DEVS
    from 6 to 200. We are hitting this limit when creating a guest with more
    than one virtio-net device using the vhost-net backend. Each virtio-net
    device requires two such devices to service notifications from its rx/tx
    queues.

    Signed-off-by: Sridhar Samudrala
    Signed-off-by: Avi Kivity

    Sridhar Samudrala
     
  • Int is not long enough to store the size of a dirty bitmap.

    This patch fixes this problem with the introduction of a wrapper
    function to calculate the sizes of dirty bitmaps.

    Note: in mark_page_dirty(), we have to consider the fact that
    __set_bit() takes the offset as int, not long.

    Signed-off-by: Takuya Yoshikawa
    Signed-off-by: Marcelo Tosatti

    Takuya Yoshikawa
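
The wrapper's arithmetic can be sketched like this; the function name and macro are illustrative (the kernel's version lives in x86 code and rounds via its own bitmap helpers):

```c
#include <assert.h>

/* Sketch: computing the dirty-bitmap size in an unsigned long avoids
 * overflowing int for very large memslots. Rounds page count up to
 * whole unsigned longs of bits, then converts to bytes. */
#define BITS_PER_LONG_TOY ((int)(8 * sizeof(unsigned long)))

static unsigned long dirty_bitmap_bytes(unsigned long npages)
{
    unsigned long longs =
        (npages + BITS_PER_LONG_TOY - 1) / BITS_PER_LONG_TOY;
    return longs * sizeof(unsigned long);
}
```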
     

01 Mar, 2010

3 commits