linux

Author	SHA1	Message	Date
Ben Skeggs	1246f1dc22	drm/nouveau/gr/gf100-: virtualise init_gpc_mmu + apply fixes from traces Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:22 +10:00
Ben Skeggs	334cc26d4d	drm/nouveau/fifo/gp100-: force individual channels into a channel group RM does this for some reason, and is enforced in HW on Volta. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:22 +10:00
Ben Skeggs	eda12417d3	drm/nouveau/fifo/gm107-: write instance address in channel runlist entry RM does this for some reason. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:22 +10:00
Ben Skeggs	79bb4b617f	drm/nouveau/fifo/gk208-: write pbdma timeout regs during initialisation Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:22 +10:00
Ben Skeggs	8c4e9f9dff	drm/nouveau/fifo/gk110-: support writing channel group runlist entries Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:22 +10:00
Ben Skeggs	4f2fc25c0f	drm/nouveau/fifo/gk104-: poll for runlist update completion Newer HW doesn't appear to send this event, which will cause long delays in runlist updates if they don't complete immediately. RM doesn't use these events anywhere, and an NVGPU commit message notes that polling is the preferred method even on HW that supports the event. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:21 +10:00
Ben Skeggs	665870837a	drm/nouveau/fifo/gk104-: add interfaces to support different runlist layouts This will be required to support features on newer hardware. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:21 +10:00
Ben Skeggs	f9360c3aa6	drm/nouveau/fifo/gk104-: simplify definition of channel classes Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:21 +10:00
Ben Skeggs	a7cf01809b	drm/nouveau/fifo/gk104-: require explicit runlist selection for channel allocation We didn't used to be aware that runlist/engine IDs weren't the same thing, or that there was such variability in configuration between GPUs. By exposing this information to a client, and giving it explicit control of which runlist it's allocating a channel on, we're able to make better choices. The immediate effect of this is that on GPUs where CE0 is the "GRCE", we will now be allocating a copy engine running asynchronously to GR for BO migrations - as intended. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:21 +10:00
Ben Skeggs	cc36205085	drm/nouveau/fifo/gk104-: support querying engines available on each runlist Will be used to improve channel runlist selection. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:21 +10:00
Ben Skeggs	ddc669e256	drm/nouveau/fifo/gk104-: allow fault recovery code to be called by other subdevs This will be required to support Volta. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:21 +10:00
Ben Skeggs	55b8e85b0b	drm/nouveau/fifo/gk104-: accept engine contexts for CE3 and up These can exist on GP100 and newer. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:21 +10:00
Ben Skeggs	eb47db4f3b	drm/nouveau/fifo: support channel count query Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:21 +10:00
Ben Skeggs	6eb01aa898	drm/nouveau/device: support querying available engines of a specific type Will be used for fifo runlist selection. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:21 +10:00
Ben Skeggs	c5c9127b25	drm/nouveau/device: implement a generic method to query device-specific properties We have a need to fetch data from GPU-specific sub-devices that is not tied to any particular engine object. This commit provides the framework to support such queries. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:21 +10:00
Ben Skeggs	f5650478ab	drm/nouveau/disp/nv50-: pass nvkm_memory objects for channel push buffers Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:21 +10:00
Ben Skeggs	a9c44a88ca	drm/nouveau/disp/nv50-: add channel interfaces to control error interrupts This will be required to support Volta, but also allows us to remove code that's duplicated for each channel type already. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:21 +10:00
Ben Skeggs	4a8621a24a	drm/nouveau/disp/nv50-: add channel interfaces to determine the user area This will be required to support Volta. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:21 +10:00
Ben Skeggs	8531f57027	drm/nouveau/disp/nv50-: merge handling of pio and dma channels Unnecessarily complicated, and a barrier to cleanly supporting Volta. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:21 +10:00
Ben Skeggs	9b096283bf	drm/nouveau/disp/nv50-: simplify definiton of core channels Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:21 +10:00
Ben Skeggs	6d41a7536f	drm/nouveau/disp/nv50-: simplify definition of cursor channels Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:21 +10:00
Ben Skeggs	3ceeef9c03	drm/nouveau/disp/nv50-: simplify definition of base channels Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:21 +10:00
Ben Skeggs	c2c3a00310	drm/nouveau/disp/nv50-: simplify definition of overlay immediate channels Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:20 +10:00
Ben Skeggs	46f74a8ad7	drm/nouveau/disp/nv50-: simplify definition of overlay channels Introduces a new method of defining channels available from the display, common to all channel types, allowing for more flexibility in available channel types/counts, and reducing the amount of boiler-plate required. This will be required to support Volta. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:20 +10:00
Ben Skeggs	abc1d4379b	drm/nouveau/disp/nv50-: replace user object with engine pointer in channels More simplification. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:20 +10:00
Ben Skeggs	bb3b0a4220	drm/nouveau/disp/nv50-: initialise from the engine, rather than the user object Engines are initialised on an as-needed basis, so this results in the same behaviour, whilst allowing us to simplify things a bit. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:20 +10:00
Ben Skeggs	f5e088d6f0	drm/nouveau/disp/nv50-: fetch mask of available piors during oneinit Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:20 +10:00
Ben Skeggs	9fe4e17704	drm/nouveau/disp/nv50-: fetch mask of available sors during oneinit Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:20 +10:00
Ben Skeggs	bf5d1a6b6a	drm/nouveau/disp/nv50-: fetch mask of available dacs during oneinit Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:20 +10:00
Ben Skeggs	f7b2ece37f	drm/nouveau/disp/nv50-: fetch mask of available heads during oneinit Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:20 +10:00
Ben Skeggs	3b9ba66ab0	drm/nouveau/disp/nv50-: delay subunit construction until oneinit We should be reading registers to determine which subunits are really present on a given board, and this needs to be done after DEVINIT. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:20 +10:00
Ben Skeggs	85a3b9c839	drm/nouveau/fb/gm200-: fix overwriting of big page setting Likely a rebase bug. Should have no impact in default configuration due to using per-instance setting by default. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:20 +10:00
Ben Skeggs	d1ea77ab5f	drm/nouveau/fb/gf100-: bump size of mmu debug buffers to match big page size Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:20 +10:00
Ben Skeggs	d0e9351e42	drm/nouveau/fault/gp100: implement replayable fault buffer initialisation Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:20 +10:00
Ben Skeggs	36780d7eee	drm/nouveau/fault: add infrastructure to support fault buffers GPU-specific support will be added separately. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:20 +10:00
Ben Skeggs	2f68234fb3	drm/nouveau/mc/gp100-: route fault buffer interrupts to FAULT Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:20 +10:00
Ben Skeggs	1ce466894b	drm/nouveau/core: define FAULT subdev This will be responsible for the handling of MMU fault buffers on GPUs that support them. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:20 +10:00
Gustavo A. R. Silva	7bf5b70bef	drm/nouveau/secboot: remove VLA usage In preparation to enabling -Wvla, remove VLA. In this particular case directly use macro NVKM_MSGQUEUE_CMDLINE_SIZE instead of local variable cmdline_size. Also, remove cmdline_size as it is not actually useful anymore. The use of stack Variable Length Arrays needs to be avoided, as they can be a vector for stack exhaustion, which can be both a runtime bug or a security flaw. Also, in general, as code evolves it is easy to lose track of how big a VLA can get. Thus, we can end up having runtime failures that are hard to debug. Also, fixed as part of the directive to remove all VLAs from the kernel: https://lkml.org/lkml/2018/3/7/621 Signed-off-by: Gustavo A. R. Silva <gustavo@embeddedor.com> Reviewed-by: Thierry Reding <treding@nvidia.com> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:20 +10:00
Arnd Bergmann	9dfbd73199	drm/nouveau: nouveau: use larger buffer in nvif_vmm_map gcc points out a buffer that is clearly too small to be used in a meaningful way, as the 'sizeof(*args) + argc > sizeof(stack)' will always fail: In function 'memcpy', inlined from 'nvif_vmm_map' at drivers/gpu/drm/nouveau/nvif/vmm.c:55:2: include/linux/string.h:353:9: error: '__builtin_memcpy' offset 40 is out of the bounds [0, 16] of object 'stack' with type 'u8[16]' {aka 'unsigned char[16]'} [-Werror=array-bounds] return __builtin_memcpy(p, q, size); ^~~~~~~~~~~~~~~~~~~~~~~~~~~~ drivers/gpu/drm/nouveau/nvif/vmm.c: In function 'nvif_vmm_map': drivers/gpu/drm/nouveau/nvif/vmm.c:40:5: note: 'stack' declared here This makes the buffer large enough so it should serve the purpose that the author presumably had in mind. Alternatively we could just get rid of it completely and simplify the code at the cost of always doing the kmalloc (as we do in the current version). Fixes: `920d2b5ef2` ("drm/nouveau/mmu: define user interfaces to mmu vmm opertaions") Signed-off-by: Arnd Bergmann <arnd@arndb.de> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2018-05-18 15:01:19 +10:00
Colin Xu	9a512e23f1	drm/i915/gvt: Use sched_lock to protect gvt scheduler logic. The scheduler lock(gvt->sched_lock) is used to protect gvt scheduler logic, including the gvt scheduler structure(gvt->scheduler and per vgpu schedule data(vgpu->sched_data, vgpu->sched_ctl). v9: - Change commit author since the patches are improved a lot compared with original version. Original author: Pei Zhang <pei.zhang@intel.com> - Rebase to latest gvt-staging. v8: - Correct coding wqstyle. - Rebase to latest gvt-staging. v7: - Remove gtt_lock since already proteced by gvt_lock and vgpu_lock. v6: - Rebase to latest gvt-staging. v5: - Rebase to latest gvt-staging. v4: - Rebase to latest gvt-staging. v3: update to latest code base Signed-off-by: Pei Zhang <pei.zhang@intel.com> Signed-off-by: Colin Xu <colin.xu@intel.com> Signed-off-by: Zhenyu Wang <zhenyuw@linux.intel.com>	2018-05-18 12:39:26 +08:00
Colin Xu	f25a49ab8a	drm/i915/gvt: Use vgpu_lock to protect per vgpu access The patch set splits out 2 small locks from the original big gvt lock: - vgpu_lock protects per-vGPU data and logic, especially the vGPU trap emulation path. - sched_lock protects gvt scheudler structure, context schedule logic and vGPU's schedule data. Use vgpu_lock to replace the gvt big lock. By doing this, the mmio read/write trap path, vgpu virtual event emulation and other vgpu related process, would be protected under per vgpu_lock. v9: - Change commit author since the patches are improved a lot compared with original version. Original author: Pei Zhang <pei.zhang@intel.com> - Rebase to latest gvt-staging. v8: - Correct coding and comment style. - Rebase to latest gvt-staging. v7: - Remove gtt_lock since already proteced by gvt_lock and vgpu_lock. - Fix a typo in intel_gvt_deactivate_vgpu, unlock the wrong lock. v6: - Rebase to latest gvt-staging. v5: - Rebase to latest gvt-staging. - intel_vgpu_page_track_handler should use vgpu_lock. v4: - Rebase to latest gvt-staging. - Protect vgpu->active access with vgpu_lock. - Do not wait gpu idle in vgpu_lock. v3: update to latest code base v2: add gvt->lock in function gvt_check_vblank_emulation Performance comparison on Kabylake platform. - Configuration: Host: Ubuntu 16.04. Guest 1 & 2: Ubuntu 16.04. glmark2 score comparison: - Configuration: Host: glxgears. Guests: glmark2. +--------------------------------+-----------------+ \| Setup \| glmark2 score \| +--------------------------------+-----------------+ \| unified lock, iommu=on \| 58~62 (avg. 60) \| +--------------------------------+-----------------+ \| unified lock, iommu=igfx_off \| 57~61 (avg. 59) \| +--------------------------------+-----------------+ \| per-logic lock, iommu=on \| 60~68 (avg. 64) \| +--------------------------------+-----------------+ \| per-logic lock, iommu=igfx_off \| 61~67 (avg. 64) \| +--------------------------------+-----------------+ lock_stat comparison: - Configuration: Stop lock stat immediately after boot up. Boot 2 VM Guests. Run glmark2 in guests. Start perf lock_stat for 20 seconds and stop again. - Legend: c - contentions; w - waittime-avg +------------+-----------------+-----------+---------------+------------+ \| \| gvt_lock \|sched_lock \| vgpu_lock \| gtt_lock \| + lock type; +-----------------+-----------+---------------+------------+ \| iommu set \| c \| w \| c \| w \| c \| w \| c \| w \| +------------+-------+---------+----+------+------+--------+-----+------+ \| unified; \| 20697 \| 839 \|N/A \| N/A \| N/A \| N/A \| N/A \| N/A \| \| on \| \| \| \| \| \| \| \| \| +------------+-------+---------+----+------+------+--------+-----+------+ \| unified; \| 21838 \| 658.15 \|N/A \| N/A \| N/A \| N/A \| N/A \| N/A \| \| igfx_off \| \| \| \| \| \| \| \| \| +------------+-------+---------+----+------+------+--------+-----+------+ \| per-logic; \| 1553 \| 1599.96 \|9458\|429.97\| 5846 \| 274.33 \| 0 \| 0.00 \| \| on \| \| \| \| \| \| \| \| \| +------------+-------+---------+----+------+------+--------+-----+------+ \| per-logic; \| 1911 \| 1678.32 \|8335\|445.16\| 5451 \| 244.80 \| 0 \| 0.00 \| \| igfx_off \| \| \| \| \| \| \| \| \| +------------+-------+---------+----+------+------+--------+-----+------+ Signed-off-by: Pei Zhang <pei.zhang@intel.com> Signed-off-by: Colin Xu <colin.xu@intel.com> Signed-off-by: Zhenyu Wang <zhenyuw@linux.intel.com>	2018-05-18 12:39:02 +08:00
Dave Airlie	1fafef9dfe	Merge drm-fixes-for-v4.17-rc6-urgent into drm-next Need to backmerge some nouveau fixes to reduce the nouveau -next conflicts a lot. Signed-off-by: Dave Airlie <airlied@redhat.com>	2018-05-18 14:08:53 +10:00
Dave Airlie	1827cad96d	Merge tag 'drm-intel-fixes-2018-05-17' of git://anongit.freedesktop.org/drm/drm-intel into drm-fixes - Userptr IOCTL zero size check (Matt) - Two hardware quirk fixes (Michel & Chris) * tag 'drm-intel-fixes-2018-05-17' of git://anongit.freedesktop.org/drm/drm-intel: drm/i915/gen9: Add WaClearHIZ_WM_CHICKEN3 for bxt and glk drm/i915/execlists: Use rmb() to order CSB reads drm/i915/userptr: reject zero user_size	2018-05-18 12:01:49 +10:00
Paulo Zanoni	c8af5274c3	drm/i915: enable the pipe/transcoder/planes later on HSW+ For all platforms that run haswell_crtc_enable, our spec tells us to configure the transcoder clocks and do link training before it tells us to set pipeconf and the other pipe/transcoder/plane registers. Starting from Icelake, we get machine hangs if we try to touch the pipe/transcoder registers without having the clocks configured and not having some chicken bits set. So this patch changes haswell_crtc_enable() to issue the calls at the appropriate order mandated by the spec. While setting the appropriate chicken bits would also work here, it's better if we actually program the hardware the way it is intended to be programmed. And the chicken bit also has some theoretical downsides that may or may not affect us. Also, correctly programming the hardware does not prevent us from setting the chicken bits in a later patch in case we decide to. v2: Don't forget link training (Ville). Cc: Arthur J Runyan <arthur.j.runyan@intel.com> Cc: James Ausmus <james.ausmus@intel.com> Cc: Ville Syrjälä <ville.syrjala@linux.intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Reviewed-by: Rodrigo Vivi <rodrigo.vivi@intel.com> Reviewed-by: Manasi Navare <manasi.d.navare@intel.com> Link: https://patchwork.freedesktop.org/patch/msgid/20180502215851.30736-1-paulo.r.zanoni@intel.com	2018-05-17 15:35:10 -07:00
Oscar Mateo	6b7a6a7b4b	drm/i915/icl: Read the correct Gen11 interrupt registers Stop reading some now deprecated interrupt registers in both debugfs and error state. Instead, read the new equivalents in the Gen11 interrupt repartitioning scheme. Note that the equivalent to the PM ISR & IIR cannot be read without affecting the current state of the system, so I've opted for leaving them out. See gen11_reset_one_iir() for more info. v2: else if !!! (Paulo) v3: another else if (Vinay) v4: - Rebased - Renamed patch - Improved the ordering of GENs - Improved the printing of per-GEN info v5: Avoid maybe-unitialized & add comment explaining the lack of PM ISR & IIR Suggested-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Signed-off-by: Oscar Mateo <oscar.mateo@intel.com> Cc: Tvrtko Ursulin <tvrtko.ursulin@intel.com> Cc: Daniele Ceraolo Spurio <daniele.ceraolospurio@intel.com> Cc: Sagar Arun Kamble <sagar.a.kamble@intel.com> Cc: Vinay Belgaumkar <vinay.belgaumkar@intel.com> Reviewed-by: Vinay Belgaumkar <vinay.belgaumkar@intel.com> [Paulo: fix commit message and coding style.] Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Link: https://patchwork.freedesktop.org/patch/msgid/1525989595-18220-1-git-send-email-oscar.mateo@intel.com	2018-05-17 15:35:08 -07:00
Chris Wilson	560f6ad8ed	drm/i915: Remove unused enable_cmd_parser modparam The command parser is feature complete, stable and required by userspace. In commit `41736a8e33` ("drm/i915: Use the precomputed value for whether to enable command parsing") I accidentally removed control from the modparam, and as no one has complained, remove the left over modparam completely! References: `41736a8e33` ("drm/i915: Use the precomputed value for whether to enable command parsing") Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk> Cc: Joonas Lahtinen <joonas.lahtinen@linux.intel.com> Cc: Jani Nikula <jani.nikula@linux.intel.com> Acked-by: Jani Nikula <jani.nikula@intel.com> Link: https://patchwork.freedesktop.org/patch/msgid/20180517150727.10431-1-chris@chris-wilson.co.uk	2018-05-17 20:52:39 +01:00
Chris Wilson	96d4f03c20	drm/i915: Nul-terminate legacy debug string Make sure that when we don't have any scheduler attributes for the request, the string is terminated. Fixes: `247870ac8e` ("drm/i915: Build request info on stack before printk") Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk> Cc: Joonas Lahtinen <joonas.lahtinen@linux.intel.com> Reviewed-by: Ville Syrjälä <ville.syrjala@linux.intel.com> Link: https://patchwork.freedesktop.org/patch/msgid/20180517152824.11619-1-chris@chris-wilson.co.uk	2018-05-17 20:50:28 +01:00
Chris Wilson	57877b7073	drm/i915/execlists: HWACK checking superseded checking port[0].count The HWACK bit more generically solves the problem of resubmitting ESLP while the hardware is still processing the current ELSP write. We no longer need to check port[0].count itself. References: `ba74cb10c7` ("drm/i915/execlists: Delay writing to ELSP until HW has processed the previous write") Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk> Cc: Michel Thierry <michel.thierry@intel.com> Reviewed-by: Tvrtko Ursulin <tvrtko.ursulin@intel.com> Link: https://patchwork.freedesktop.org/patch/msgid/20180517115647.17205-1-chris@chris-wilson.co.uk	2018-05-17 18:02:02 +01:00
Ville Syrjälä	b45a258897	drm/i915: Clean up DVO pipe select bits Parametrize the DVO pipe select bits. For consistency with the new way of doing things, let's read out the pipe select bits even when the port is disable, even though we don't need that behaviour for asserts in this case. v2: Order the defines shift,mask,value (Jani) Reviewed-by: Jani Nikula <jani.nikula@intel.com> Signed-off-by: Ville Syrjälä <ville.syrjala@linux.intel.com> Link: https://patchwork.freedesktop.org/patch/msgid/20180514172423.9302-5-ville.syrjala@linux.intel.com	2018-05-17 19:39:01 +03:00
Ville Syrjälä	4add0f6bde	drm/i915: Clean up TV pipe select bits Parametrize the TV pipe select bits. For consistency with the new way of doing things, let's read out the pipe select bits even when the port is disable, even though we don't need that behaviour for asserts in this case. v2: Order the defines shift,mask,value (Jani) Clear the stale pipe select bit in load detection (Jani) Reviewed-by: Jani Nikula <jani.nikula@intel.com> Signed-off-by: Ville Syrjälä <ville.syrjala@linux.intel.com> Link: https://patchwork.freedesktop.org/patch/msgid/20180514172423.9302-4-ville.syrjala@linux.intel.com	2018-05-17 19:38:12 +03:00

... 40 41 42 43 44 ...

47944 Commits