linux

mirror of https://github.com/torvalds/linux.git synced 2024-11-01 17:51:43 +00:00

Jens Axboe f35546e072 Merge branch 'stable/for-jens-3.10' of git://git.kernel.org/pub/scm/linux/kernel/git/konrad/xen into for-3.11/drivers

Konrad writes:

It has the 'feature-max-indirect-segments' implemented in both backend and frontend.

The current problem with the backend and frontend is that the segment size is limited to 11 pages. It means we can at most squeeze in 44kB per request. The ring can hold 32 (next power of two below 36) requests, meaning we can do 1.4M of outstanding requests. Nowadays that is not enough.

The problem in the past was addressed in two ways - but neither one went upstream. The first solution to this, proposed by Justin from Spectralogic, was to negotiate the segment size. This means that the ‘struct blkif_sring_entry’ is now a variable size. It can expand from 112 bytes (covering 11 pages of data - 44kB) to 1580 bytes (256 pages of data - so 1MB). It is a simple extension that just makes the array in the request expand from 11 to a negotiated variable size. But it had limits: this extension still limits the number of segments per request to 255 (as the total number must be specified in the request, which only has an 8-bit field for that purpose).

The other solution (from Intel - Ronghui) was to create one extra ring that only has the ‘struct blkif_request_segment’ in them. The ‘struct blkif_request’ would be changed to have an index in said ‘segment ring’. There is only one segment ring. This means that the size of the initial ring is still the same. The requests would point to the segment and enumerate out how many of the indexes it wants to use. The limit is of course the size of the segment. If one assumes a one-page segment this means we can in one request cover ~4MB. Those patches were posted as RFC and the author never followed up on the ideas on changing it to be a bit more flexible.

There is yet another mechanism that could be employed (which these patches implement) - and it borrows from the VirtIO protocol. And that is the ‘indirect descriptors’. This is very similar to what Intel suggests, but with a twist. The twist is to negotiate how many of these 'segment' pages (aka indirect descriptor pages) we want to support (in reality we negotiate how many entries in the segment we want to cover, and we take that number modulo the segment size if it is bigger). This means that with the existing ring (single page, 36 slots of which 32 are usable) we can cover: 32 slots * 512 segments per blkif_request_indirect * 4096 bytes = 64M. Since we have ample space in the blkif_request_indirect to span more than one indirect page, that number (64M) can also be multiplied by eight = 512MB.

Roger Pau Monne took the idea and implemented it in these patches. They work great and the corner cases (migration between backends with and without this extension) work nicely. The backend has a limit right now on how many indirect entries it can handle: one indirect page, and at maximum 256 entries (out of 512 - so 50% of the page is used). That comes out to 32 slots * 256 entries in an indirect page * 1 indirect page per request * 4096 = 32MB.

This is a conservative number that can change in the future. Right now it strikes a good balance between giving excellent performance, memory usage in the backend, and balancing the needs of many guests.

In the patchset there is also the split of the blkback structure to be per-VBD. This means that the spinlock contention we had with many guests trying to do I/O and all the blkback threads hitting the same lock has been eliminated.

Also there are bug-fixes to deal with oddly sized sectors, insane amounts on the ring, and also a security fix (posted earlier).		2013-06-28 16:01:14 +02:00
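
The capacity figures quoted in the commit message above can be re-derived with a few lines of arithmetic. The following is only an illustrative sketch, not code from the kernel tree: the helper names `ring_slots` and `outstanding_bytes` are hypothetical, and the 4096-byte page size is taken from the factor used in the message itself.

```python
PAGE_SIZE = 4096  # bytes per page, matching the 4096 factor in the commit message


def ring_slots(raw_slots: int) -> int:
    """Hypothetical helper: largest power of two not exceeding raw_slots."""
    return 1 << (raw_slots.bit_length() - 1)


def outstanding_bytes(slots: int, segments_per_request: int, indirect_pages: int = 1) -> int:
    """Hypothetical helper: bytes in flight if every slot carries a full request."""
    return slots * segments_per_request * indirect_pages * PAGE_SIZE


slots = ring_slots(36)                              # 36 raw slots round down to 32

direct = outstanding_bytes(slots, 11)               # 11 segments per classic request
indirect_one_page = outstanding_bytes(slots, 512)   # one full indirect page
indirect_eight = outstanding_bytes(slots, 512, 8)   # eight indirect pages per request
backend_limit = outstanding_bytes(slots, 256)       # current backend cap: 256 entries

MIB = 1 << 20
for label, value in [
    ("direct segments", direct),
    ("one indirect page", indirect_one_page),
    ("eight indirect pages", indirect_eight),
    ("backend limit", backend_limit),
]:
    print(f"{label}: {value} bytes ({value / MIB:g} MiB)")
```

The direct case comes to 1,441,792 bytes, which is the "1.4M" in the message; the indirect cases are exact powers of two (64, 512 and 32 MiB), so the "M" and "MB" figures in the message correspond to binary megabytes.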
interface	Merge branch 'stable/for-jens-3.10' of git://git.kernel.org/pub/scm/linux/kernel/git/konrad/xen into for-3.11/drivers	2013-06-28 16:01:14 +02:00
acpi.h	xen/acpi: move xen_acpi_get_pxm under CONFIG_XEN_DOM0	2013-02-19 22:02:30 -05:00
balloon.h	xen-balloon: convert sysdev_class to a regular subsystem	2011-12-14 15:32:50 -08:00
events.h	xen: drop tracking of IRQ vector	2013-04-16 15:05:45 -04:00
features.h
gntalloc.h	xen/gntalloc,gntdev: Add unmap notify ioctl	2011-02-14 14:16:17 -05:00
gntdev.h	xen/gntalloc,gntdev: Add unmap notify ioctl	2011-02-14 14:16:17 -05:00
grant_table.h	Merge commit 'v3.7-rc1' into stable/for-linus-3.7	2012-10-19 15:19:19 -04:00
hvc-console.h	treewide: use __printf not __attribute__((format(printf,...)))	2011-10-31 17:30:54 -07:00
hvm.h	xen/hvm: If we fail to fetch an HVM parameter print out which flag it is.	2012-11-07 10:40:33 -05:00
page.h	xen: allow balloon driver to use more than one memory region	2011-09-29 11:12:10 -04:00
platform_pci.h	xen: Remove hanging references to CONFIG_XEN_PLATFORM_PCI	2011-11-16 12:13:42 -05:00
swiotlb-xen.h	xen/swiotlb: Remove functions not needed anymore.	2012-09-17 13:00:43 -04:00
tmem.h	xen: tmem: enable Xen tmem shim to be built/loaded as a module	2013-04-30 17:04:01 -07:00
xen-ops.h	Merge branch 'arm-privcmd-for-3.8' of git://xenbits.xen.org/people/ianc/linux into stable/for-linus-3.8	2012-11-30 17:07:59 -05:00
xen.h	xen/xen_initial_domain: check that xen_start_info is initialized	2012-10-03 13:03:32 -04:00
xenbus_dev.h	xenbus: Add support for xenbus backend in stub domain	2012-05-21 09:53:18 -04:00
xenbus.h	include/ and checkpatch: prefer __scanf to __attribute__((format(scanf,...)	2012-03-23 16:58:36 -07:00
xencomm.h	xen: import arch generic part of xencomm	2008-04-24 23:57:32 +02:00