linux

mirror of https://github.com/torvalds/linux.git synced 2024-11-10 14:11:52 +00:00

History

David Matlack ee3d1570b5 kvm: fix potentially corrupt mmio cache vcpu exits and memslot mutations can run concurrently as long as the vcpu does not aquire the slots mutex. Thus it is theoretically possible for memslots to change underneath a vcpu that is handling an exit. If we increment the memslot generation number again after synchronize_srcu_expedited(), vcpus can safely cache memslot generation without maintaining a single rcu_dereference through an entire vm exit. And much of the x86/kvm code does not maintain a single rcu_dereference of the current memslots during each exit. We can prevent the following case: vcpu (CPU 0) \| thread (CPU 1) --------------------------------------------+-------------------------- 1 vm exit \| 2 srcu_read_unlock(&kvm->srcu) \| 3 decide to cache something based on \| old memslots \| 4 \| change memslots \| (increments generation) 5 \| synchronize_srcu(&kvm->srcu); 6 retrieve generation # from new memslots \| 7 tag cache with new memslot generation \| 8 srcu_read_unlock(&kvm->srcu) \| ... \| <action based on cache occurs even \| though the caching decision was based \| on the old memslots> \| ... \| <action continues to occur until next \| memslot generation change, which may \| be never> \| \| By incrementing the generation after synchronizing with kvm->srcu readers, we ensure that the generation retrieved in (6) will become invalid soon after (8). Keeping the existing increment is not strictly necessary, but we do keep it and just move it for consistency from update_memslots to install_new_memslots. It invalidates old cached MMIOs immediately, instead of having to wait for the end of synchronize_srcu_expedited, which makes the code more clearly correct in case CPU 1 is preempted right after synchronize_srcu() returns. To avoid halving the generation space in SPTEs, always presume that the low bit of the generation is zero when reconstructing a generation number out of an SPTE. This effectively disables MMIO caching in SPTEs during the call to synchronize_srcu_expedited. Using the low bit this way is somewhat like a seqcount---where the protected thing is a cache, and instead of retrying we can simply punt if we observe the low bit to be 1. Cc: stable@vger.kernel.org Signed-off-by: David Matlack <dmatlack@google.com> Reviewed-by: Xiao Guangrong <xiaoguangrong@linux.vnet.ibm.com> Reviewed-by: David Matlack <dmatlack@google.com> Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>		2014-09-03 10:03:41 +02:00
..
arm	KVM/ARM New features for 3.17 include:	2014-08-05 09:47:45 +02:00
assigned-dev.c	virt/kvm/assigned-dev.c: Set 'dev->irq_source_id' to '-1' after free it	2014-08-19 15:12:28 +02:00
async_pf.c	At over 200 commits, covering almost all supported architectures, this	2014-06-04 08:47:12 -07:00
async_pf.h	KVM: Halt vcpu if page it tries to access is swapped out	2011-01-12 11:21:39 +02:00
coalesced_mmio.c	KVM: return an error code in kvm_vm_ioctl_register_coalesced_mmio()	2014-01-30 11:56:09 +01:00
coalesced_mmio.h	KVM: Make coalesced mmio use a device per zone	2011-09-25 19:17:57 +03:00
eventfd.c	KVM: Move more code under CONFIG_HAVE_KVM_IRQFD	2014-08-06 14:24:47 +02:00
ioapic.c	KVM: x86: always exit on EOIs for interrupts listed in the IOAPIC redir table	2014-07-30 20:22:30 +02:00
ioapic.h	kvm: make local functions static	2014-01-08 19:02:58 -02:00
iodev.h	KVM: remove in_range from io devices	2009-09-10 08:33:05 +03:00
iommu.c	kvm: iommu: fix the third parameter of kvm_iommu_put_pages (CVE-2014-3601)	2014-08-19 15:04:45 +02:00
irq_comm.c	KVM: Move all accesses to kvm::irq_routing into irqchip.c	2014-08-05 14:26:20 +02:00
irqchip.c	KVM: Move irq notifier implementation into eventfd.c	2014-08-05 14:26:24 +02:00
Kconfig	KVM: Give IRQFD its own separate enabling Kconfig option	2014-08-05 14:26:28 +02:00
kvm_main.c	kvm: fix potentially corrupt mmio cache	2014-09-03 10:03:41 +02:00
vfio.c	kvm/vfio: Support for DMA coherent IOMMUs	2014-02-26 11:38:40 -07:00