Eric Lee / smarc-fsl-linux-kernel

11 Jun, 2010

1 commit

07dc7263b KVM: read apic->irr with ioapic lock held ... Browse Code »

Read ioapic->irr inside ioapic->lock protected section.

KVM-Stable-Tag
Signed-off-by: Marcelo Tosatti

Marcelo Tosatti
2010-06-11 01:29:03 +0800

09 Jun, 2010

1 commit

05b782ab9 KVM: Fix order passed to iommu_unmap ... Browse Code »

This is obviously a left-over from the the old interface taking the
size. Apparently a mostly harmless issue with the current iommu_unmap
implementation.

Signed-off-by: Jan Kiszka
Acked-by: Joerg Roedel
Signed-off-by: Avi Kivity

Jan Kiszka
2010-06-09 23:48:38 +0800

22 May, 2010

1 commit

98edb6ca4 Merge branch 'kvm-updates/2.6.35' of git://git.kernel.org/pub/scm/virt/kvm/kvm ... Browse Code »

* 'kvm-updates/2.6.35' of git://git.kernel.org/pub/scm/virt/kvm/kvm: (269 commits)
KVM: x86: Add missing locking to arch specific vcpu ioctls
KVM: PPC: Add missing vcpu_load()/vcpu_put() in vcpu ioctls
KVM: MMU: Segregate shadow pages with different cr0.wp
KVM: x86: Check LMA bit before set_efer
KVM: Don't allow lmsw to clear cr0.pe
KVM: Add cpuid.txt file
KVM: x86: Tell the guest we'll warn it about tsc stability
x86, paravirt: don't compute pvclock adjustments if we trust the tsc
x86: KVM guest: Try using new kvm clock msrs
KVM: x86: export paravirtual cpuid flags in KVM_GET_SUPPORTED_CPUID
KVM: x86: add new KVMCLOCK cpuid feature
KVM: x86: change msr numbers for kvmclock
x86, paravirt: Add a global synchronization point for pvclock
x86, paravirt: Enable pvclock flags in vcpu_time_info structure
KVM: x86: Inject #GP with the right rip on efer writes
KVM: SVM: Don't allow nested guest to VMMCALL into host
KVM: x86: Fix exception reinjection forced to true
KVM: Fix wallclock version writing race
KVM: MMU: Don't read pdptrs with mmu spinlock held in mmu_alloc_roots
KVM: VMX: enable VMXON check with SMX enabled (Intel TXT)
...

Linus Torvalds
2010-05-22 08:16:21 +0800

19 May, 2010

1 commit

0ee75bead KVM: Let vcpu structure alignment be determined at runtime ... Browse Code »

vmx and svm vcpus have different contents and therefore may have different
alignmment requirements. Let each specify its required alignment.

Signed-off-by: Avi Kivity

Avi Kivity
2010-05-19 16:36:29 +0800

18 May, 2010

1 commit

8123d8f17 Merge branch 'core-iommu-for-linus' of git://git.kernel.org/pub/scm/linux/kernel… ... Browse Code »

…/git/tip/linux-2.6-tip

* 'core-iommu-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/linux-2.6-tip:
x86/amd-iommu: Add amd_iommu=off command line option
iommu-api: Remove iommu_{un}map_range functions
x86/amd-iommu: Implement ->{un}map callbacks for iommu-api
x86/amd-iommu: Make amd_iommu_iova_to_phys aware of multiple page sizes
x86/amd-iommu: Make iommu_unmap_page and fetch_pte aware of page sizes
x86/amd-iommu: Make iommu_map_page and alloc_pte aware of page sizes
kvm: Change kvm_iommu_map_pages to map large pages
VT-d: Change {un}map_range functions to implement {un}map interface
iommu-api: Add ->{un}map callbacks to iommu_ops
iommu-api: Add iommu_map and iommu_unmap functions
iommu-api: Rename ->{un}map function pointers to ->{un}map_range

Linus Torvalds
2010-05-18 22:22:37 +0800

17 May, 2010

8 commits

d14769377 KVM: Remove test-before-set optimization for dirty bits ... Browse Code »

As Avi pointed out, testing bit part in mark_page_dirty() was important
in the days of shadow paging, but currently EPT and NPT has already become
common and the chance of faulting a page more that once per iteration is
small. So let's remove the test bit to avoid extra access.

Signed-off-by: Takuya Yoshikawa
Signed-off-by: Avi Kivity

Takuya Yoshikawa
2010-05-17 17:19:13 +0800
66cbff59a KVM: do not call hardware_disable() on CPU_UP_CANCELED ... Browse Code »

When CPU_UP_CANCELED, hardware_enable() has not been called at the CPU
which is going up because raw_notifier_call_chain(CPU_ONLINE)
has not been called for this cpu.

Drop the handling for CPU_UP_CANCELED.

Signed-off-by: Lai Jiangshan
Signed-off-by: Avi Kivity

Lai Jiangshan
2010-05-17 17:18:04 +0800
90d83dc3d KVM: use the correct RCU API for PROVE_RCU=y ... Browse Code »

The RCU/SRCU API have already changed for proving RCU usage.

I got the following dmesg when PROVE_RCU=y because we used incorrect API.
This patch coverts rcu_deference() to srcu_dereference() or family API.

===================================================
[ INFO: suspicious rcu_dereference_check() usage. ]
---------------------------------------------------
arch/x86/kvm/mmu.c:3020 invoked rcu_dereference_check() without protection!

other info that might help us debug this:

rcu_scheduler_active = 1, debug_locks = 0
2 locks held by qemu-system-x86/8550:
#0: (&kvm->slots_lock){+.+.+.}, at: [] kvm_set_memory_region+0x29/0x50 [kvm]
#1: (&(&kvm->mmu_lock)->rlock){+.+...}, at: [] kvm_arch_commit_memory_region+0xa6/0xe2 [kvm]

stack backtrace:
Pid: 8550, comm: qemu-system-x86 Not tainted 2.6.34-rc4-tip-01028-g939eab1 #27
Call Trace:
[] lockdep_rcu_dereference+0xaa/0xb3
[] kvm_mmu_calculate_mmu_pages+0x44/0x7d [kvm]
[] kvm_arch_commit_memory_region+0xb7/0xe2 [kvm]
[] __kvm_set_memory_region+0x636/0x6e2 [kvm]
[] kvm_set_memory_region+0x37/0x50 [kvm]
[] vmx_set_tss_addr+0x46/0x5a [kvm_intel]
[] kvm_arch_vm_ioctl+0x17a/0xcf8 [kvm]
[] ? unlock_page+0x27/0x2c
[] ? __do_fault+0x3a9/0x3e1
[] kvm_vm_ioctl+0x364/0x38d [kvm]
[] ? up_read+0x23/0x3d
[] vfs_ioctl+0x32/0xa6
[] do_vfs_ioctl+0x495/0x4db
[] ? fget_light+0xc2/0x241
[] ? do_sys_open+0x104/0x116
[] ? retint_swapgs+0xe/0x13
[] sys_ioctl+0x47/0x6a
[] system_call_fastpath+0x16/0x1b

Signed-off-by: Lai Jiangshan
Signed-off-by: Avi Kivity

Lai Jiangshan
2010-05-17 17:18:01 +0800
660c22c42 KVM: limit the number of pages per memory slot ... Browse Code »

This patch limits the number of pages per memory slot to make
us free from extra care about type issues.

Signed-off-by: Takuya Yoshikawa
Signed-off-by: Marcelo Tosatti

Takuya Yoshikawa
2010-05-17 17:17:41 +0800
6ce5a090a KVM: coalesced_mmio: fix kvm_coalesced_mmio_init()'s error handling ... Browse Code »

kvm_coalesced_mmio_init() keeps to hold the addresses of a coalesced
mmio ring page and dev even after it has freed them.

Also, if this function fails, though it might be rare, it seems to be
suggesting the system's serious state: so we'd better stop the works
following the kvm_creat_vm().

This patch clears these problems.

We move the coalesced mmio's initialization out of kvm_create_vm().
This seems to be natural because it includes a registration which
can be done only when vm is successfully created.

Signed-off-by: Takuya Yoshikawa
Signed-off-by: Marcelo Tosatti

Takuya Yoshikawa
2010-05-17 17:15:53 +0800
d57e2c074 KVM: fix assigned_device_enable_host_msix error handling ... Browse Code »

Free IRQ's and disable MSIX upon failure.

Cc: Avi Kivity
Signed-off-by: Jing Zhang
Signed-off-by: Marcelo Tosatti

jing zhang
2010-05-17 17:15:36 +0800
a87fa3551 KVM: fix the errno of ioctl KVM_[UN]REGISTER_COALESCED_MMIO failure ... Browse Code »

This patch change the errno of ioctl KVM_[UN]REGISTER_COALESCED_MMIO
from -EINVAL to -ENXIO if no coalesced mmio dev exists.

Signed-off-by: Wei Yongjun
Signed-off-by: Marcelo Tosatti

Wei Yongjun
2010-05-17 17:15:34 +0800
2ed152afc KVM: cleanup kvm trace ... Browse Code »

This patch does:

- no need call tracepoint_synchronize_unregister() when kvm module
is unloaded since ftrace can handle it

- cleanup ftrace's macro

Signed-off-by: Xiao Guangrong
Signed-off-by: Avi Kivity

Xiao Guangrong
2010-05-17 17:15:22 +0800

13 May, 2010

1 commit

46a47b1ed KVM: convert ioapic lock to spinlock ... Browse Code »

kvm_set_irq is used from non sleepable contexes, so convert ioapic from
mutex to spinlock.

KVM-Stable-Tag.
Tested-by: Ralf Bonenkamp
Signed-off-by: Marcelo Tosatti

Marcelo Tosatti
2010-05-13 12:23:55 +0800

11 May, 2010

1 commit

795e74f7a Merge branch 'iommu/largepages' into amd-iommu/2.6.35 ... Browse Code »

Conflicts:
arch/x86/kernel/amd_iommu.c

Joerg Roedel
2010-05-11 23:40:57 +0800

25 Apr, 2010

1 commit

f5c980317 KVM: update gfn_to_hva() to use gfn_to_hva_memslot() ... Browse Code »

Marcelo introduced gfn_to_hva_memslot() when he implemented
gfn_to_pfn_memslot(). Let's use this for gfn_to_hva() too.

Note: also remove parentheses next to return as checkpatch said to do.

Signed-off-by: Takuya Yoshikawa
Signed-off-by: Avi Kivity

Takuya Yoshikawa
2010-04-25 18:53:29 +0800

21 Apr, 2010

1 commit

eda2beda8 KVM: Add missing srcu_read_lock() for kvm_mmu_notifier_release() ... Browse Code »

I got this dmesg due to srcu_read_lock() is missing in
kvm_mmu_notifier_release().

===================================================
[ INFO: suspicious rcu_dereference_check() usage. ]
---------------------------------------------------
arch/x86/kvm/x86.h:72 invoked rcu_dereference_check() without protection!

other info that might help us debug this:

rcu_scheduler_active = 1, debug_locks = 0
2 locks held by qemu-system-x86/3100:
#0: (rcu_read_lock){.+.+..}, at: [] __mmu_notifier_release+0x38/0xdf
#1: (&(&kvm->mmu_lock)->rlock){+.+...}, at: [] kvm_mmu_zap_all+0x21/0x5e [kvm]

stack backtrace:
Pid: 3100, comm: qemu-system-x86 Not tainted 2.6.34-rc3-22949-gbc8a97a-dirty #2
Call Trace:
[] lockdep_rcu_dereference+0xaa/0xb3
[] unalias_gfn+0x56/0xab [kvm]
[] gfn_to_memslot+0x16/0x25 [kvm]
[] gfn_to_rmap+0x17/0x6e [kvm]
[] rmap_remove+0xa0/0x19d [kvm]
[] kvm_mmu_zap_page+0x109/0x34d [kvm]
[] kvm_mmu_zap_all+0x35/0x5e [kvm]
[] kvm_arch_flush_shadow+0x16/0x22 [kvm]
[] kvm_mmu_notifier_release+0x15/0x17 [kvm]
[] __mmu_notifier_release+0x88/0xdf
[] ? __mmu_notifier_release+0x38/0xdf
[] ? exit_mm+0xe0/0x115
[] exit_mmap+0x2c/0x17e
[] mmput+0x2d/0xd4
[] exit_mm+0x108/0x115
[...]

Signed-off-by: Lai Jiangshan
Signed-off-by: Avi Kivity

Lai Jiangshan
2010-04-21 16:17:43 +0800

20 Apr, 2010

1 commit

87bf6e7de KVM: fix the handling of dirty bitmaps to avoid overflows ... Browse Code »

Int is not long enough to store the size of a dirty bitmap.

This patch fixes this problem with the introduction of a wrapper
function to calculate the sizes of dirty bitmaps.

Note: in mark_page_dirty(), we have to consider the fact that
__set_bit() takes the offset as int, not long.

Signed-off-by: Takuya Yoshikawa
Signed-off-by: Marcelo Tosatti

Takuya Yoshikawa
2010-04-20 18:06:55 +0800

30 Mar, 2010

1 commit

5a0e3ad6a include cleanup: Update gfp.h and slab.h includes to prepare for breaking implic… ... Browse Code »

…it slab.h inclusion from percpu.h

percpu.h is included by sched.h and module.h and thus ends up being
included when building most .c files. percpu.h includes slab.h which
in turn includes gfp.h making everything defined by the two files
universally available and complicating inclusion dependencies.

percpu.h -> slab.h dependency is about to be removed. Prepare for
this change by updating users of gfp and slab facilities include those
headers directly instead of assuming availability. As this conversion
needs to touch large number of source files, the following script is
used as the basis of conversion.

http://userweb.kernel.org/~tj/misc/slabh-sweep.py

The script does the followings.

* Scan files for gfp and slab usages and update includes such that
only the necessary includes are there. ie. if only gfp is used,
gfp.h, if slab is used, slab.h.

* When the script inserts a new include, it looks at the include
blocks and try to put the new include such that its order conforms
to its surrounding. It's put in the include block which contains
core kernel includes, in the same order that the rest are ordered -
alphabetical, Christmas tree, rev-Xmas-tree or at the end if there
doesn't seem to be any matching order.

* If the script can't find a place to put a new include (mostly
because the file doesn't have fitting include block), it prints out
an error message indicating which .h file needs to be added to the
file.

The conversion was done in the following steps.

1. The initial automatic conversion of all .c files updated slightly
over 4000 files, deleting around 700 includes and adding ~480 gfp.h
and ~3000 slab.h inclusions. The script emitted errors for ~400
files.

2. Each error was manually checked. Some didn't need the inclusion,
some needed manual addition while adding it to implementation .h or
embedding .c file was more appropriate for others. This step added
inclusions to around 150 files.

3. The script was run again and the output was compared to the edits
from #2 to make sure no file was left behind.

4. Several build tests were done and a couple of problems were fixed.
e.g. lib/decompress_*.c used malloc/free() wrappers around slab
APIs requiring slab.h to be added manually.

5. The script was run on all .h files but without automatically
editing them as sprinkling gfp.h and slab.h inclusions around .h
files could easily lead to inclusion dependency hell. Most gfp.h
inclusion directives were ignored as stuff from gfp.h was usually
wildly available and often used in preprocessor macros. Each
slab.h inclusion directive was examined and added manually as
necessary.

6. percpu.h was updated not to include slab.h.

7. Build test were done on the following configurations and failures
were fixed. CONFIG_GCOV_KERNEL was turned off for all tests (as my
distributed build env didn't work with gcov compiles) and a few
more options had to be turned off depending on archs to make things
build (like ipr on powerpc/64 which failed due to missing writeq).

* x86 and x86_64 UP and SMP allmodconfig and a custom test config.
* powerpc and powerpc64 SMP allmodconfig
* sparc and sparc64 SMP allmodconfig
* ia64 SMP allmodconfig
* s390 SMP allmodconfig
* alpha SMP allmodconfig
* um on x86_64 SMP allmodconfig

8. percpu.h modifications were reverted so that it could be applied as
a separate patch and serve as bisection point.

Given the fact that I had only a couple of failures from tests on step
6, I'm fairly confident about the coverage of this conversion patch.
If there is a breakage, it's likely to be something in one of the arch
headers which should be easily discoverable easily on most builds of
the specific arch.

Signed-off-by: Tejun Heo <tj@kernel.org>
Guess-its-ok-by: Christoph Lameter <cl@linux-foundation.org>
Cc: Ingo Molnar <mingo@redhat.com>
Cc: Lee Schermerhorn <Lee.Schermerhorn@hp.com>

Tejun Heo
2010-03-30 21:02:32 +0800

08 Mar, 2010

1 commit

fcd95807f kvm: Change kvm_iommu_map_pages to map large pages ... Browse Code »

This patch changes the implementation of of
kvm_iommu_map_pages to map the pages with the host page size
into the io virtual address space.

Signed-off-by: Joerg Roedel
Acked-By: Avi Kivity

Joerg Roedel
2010-03-08 01:01:11 +0800

01 Mar, 2010

20 commits

70e335e16 KVM: Convert kvm->requests_lock to raw_spinlock_t ... Browse Code »

The code relies on kvm->requests_lock inhibiting preemption.

Noted by Jan Kiszka.

Signed-off-by: Avi Kivity

Avi Kivity
2010-03-01 23:36:13 +0800
8b97fb0fc KVM: do not store wqh in irqfd ... Browse Code »

wqh is unused, so we do not need to store it in irqfd anymore

Signed-off-by: Michael S. Tsirkin
Signed-off-by: Avi Kivity

Michael S. Tsirkin
2010-03-01 23:36:10 +0800
72bb2fcd2 KVM: cleanup the failure path of KVM_CREATE_IRQCHIP ioctrl ... Browse Code »

If we fail to init ioapic device or the fail to setup the default irq
routing, the device register by kvm_create_pic() and kvm_ioapic_init()
remain unregister. This patch fixed to do this.

Signed-off-by: Wei Yongjun
Signed-off-by: Avi Kivity

Wei Yongjun
2010-03-01 23:36:10 +0800
1ae77badc KVM: kvm->arch.vioapic should be NULL if kvm_ioapic_init() failure ... Browse Code »

kvm->arch.vioapic should be NULL in case of kvm_ioapic_init() failure
due to cannot register io dev.

Signed-off-by: Wei Yongjun
Signed-off-by: Avi Kivity

Wei Yongjun
2010-03-01 23:36:09 +0800
43db66973 KVM: Fix Codestyle in virt/kvm/coalesced_mmio.c ... Browse Code »

Fixed 2 codestyle issues in virt/kvm/coalesced_mmio.c

Signed-off-by: Jochen Maes
Signed-off-by: Avi Kivity

Jochen Maes
2010-03-01 23:36:09 +0800
8f0b1ab6f KVM: Introduce kvm_host_page_size ... Browse Code »

This patch introduces a generic function to find out the
host page size for a given gfn. This function is needed by
the kvm iommu code. This patch also simplifies the x86
host_mapping_level function.

Signed-off-by: Joerg Roedel
Signed-off-by: Avi Kivity

Joerg Roedel
2010-03-01 23:36:08 +0800
ab9f4ecbb KVM: enable PCI multiple-segments for pass-through device ... Browse Code »

Enable optional parameter (default 0) - PCI segment (or domain) besides
BDF, when assigning PCI device to guest.

Signed-off-by: Zhai Edwin
Acked-by: Chris Wright
Signed-off-by: Marcelo Tosatti

Zhai, Edwin
2010-03-01 23:36:06 +0800
f0f4b9309 KVM: Fix kvm_coalesced_mmio_ring duplicate allocation ... Browse Code »

The commit 0953ca73 "KVM: Simplify coalesced mmio initialization"
allocate kvm_coalesced_mmio_ring in the kvm_coalesced_mmio_init(), but
didn't discard the original allocation...

Signed-off-by: Sheng Yang
Signed-off-by: Marcelo Tosatti

Sheng Yang
2010-03-01 23:36:03 +0800
647492047 KVM: fix cleanup_srcu_struct on vm destruction ... Browse Code »

cleanup_srcu_struct on VM destruction remains broken:

BUG: unable to handle kernel paging request at ffffffffffffffff
IP: [] srcu_read_lock+0x16/0x21
RIP: 0010:[] [] srcu_read_lock+0x16/0x21
Call Trace:
[] kvm_arch_vcpu_uninit+0x1b/0x48 [kvm]
[] kvm_vcpu_uninit+0x9/0x15 [kvm]
[] vmx_free_vcpu+0x7f/0x8f [kvm_intel]
[] kvm_arch_destroy_vm+0x78/0x111 [kvm]
[] kvm_put_kvm+0xd4/0xfe [kvm]

Move it to kvm_arch_destroy_vm.

Signed-off-by: Marcelo Tosatti
Reported-by: Jan Kiszka

Marcelo Tosatti
2010-03-01 23:36:01 +0800
46a929bc1 KVM: avoid taking ioapic mutex for non-ioapic EOIs ... Browse Code »

When the guest acknowledges an interrupt, it sends an EOI message to the local
apic, which broadcasts it to the ioapic. To handle the EOI, we need to take
the ioapic mutex.

On large guests, this causes a lot of contention on this mutex. Since large
guests usually don't route interrupts via the ioapic (they use msi instead),
this is completely unnecessary.

Avoid taking the mutex by introducing a handled_vectors bitmap. Before taking
the mutex, check if the ioapic was actually responsible for the acked vector.
If not, we can return early.

Signed-off-by: Avi Kivity
Signed-off-by: Marcelo Tosatti

Avi Kivity
2010-03-01 23:35:46 +0800
79fac95ec KVM: convert slots_lock to a mutex ... Browse Code »

Signed-off-by: Marcelo Tosatti

Marcelo Tosatti
2010-03-01 23:35:45 +0800
e93f8a0f8 KVM: convert io_bus to SRCU ... Browse Code »

Signed-off-by: Marcelo Tosatti

Marcelo Tosatti
2010-03-01 23:35:45 +0800
a983fb238 KVM: x86: switch kvm_set_memory_alias to SRCU update ... Browse Code »

Using a similar two-step procedure as for memslots.

Signed-off-by: Marcelo Tosatti

Marcelo Tosatti
2010-03-01 23:35:45 +0800
bc6678a33 KVM: introduce kvm->srcu and convert kvm_set_memory_region to SRCU update ... Browse Code »

Use two steps for memslot deletion: mark the slot invalid (which stops
instantiation of new shadow pages for that slot, but allows destruction),
then instantiate the new empty slot.

Also simplifies kvm_handle_hva locking.

Signed-off-by: Marcelo Tosatti

Marcelo Tosatti
2010-03-01 23:35:44 +0800
3ad26d813 KVM: use gfn_to_pfn_memslot in kvm_iommu_map_pages ... Browse Code »

So its possible to iommu map a memslot before making it visible to
kvm.

Signed-off-by: Marcelo Tosatti

Marcelo Tosatti
2010-03-01 23:35:44 +0800
506f0d6f9 KVM: introduce gfn_to_pfn_memslot ... Browse Code »

Which takes a memslot pointer instead of using kvm->memslots.

To be used by SRCU convertion later.

Signed-off-by: Marcelo Tosatti

Marcelo Tosatti
2010-03-01 23:35:44 +0800
f7784b8ec KVM: split kvm_arch_set_memory_region into prepare and commit ... Browse Code »

Required for SRCU convertion later.

Signed-off-by: Marcelo Tosatti

Marcelo Tosatti
2010-03-01 23:35:44 +0800
46a26bf55 KVM: modify memslots layout in struct kvm ... Browse Code »

Have a pointer to an allocated region inside struct kvm.

[alex: fix ppc book 3s]

Signed-off-by: Alexander Graf
Signed-off-by: Marcelo Tosatti

Marcelo Tosatti
2010-03-01 23:35:43 +0800
980da6ce5 KVM: Simplify coalesced mmio initialization ... Browse Code »

- add destructor function
- move related allocation into constructor
- add stubs for !CONFIG_KVM_MMIO

Signed-off-by: Avi Kivity

Avi Kivity
2010-03-01 23:35:41 +0800
50eb2a3cd KVM: Add KVM_MMIO kconfig item ... Browse Code »

s390 doesn't have mmio, this will simplify ifdefing it out.

Signed-off-by: Avi Kivity

Avi Kivity
2010-03-01 23:35:41 +0800