linux

mirror of https://github.com/torvalds/linux.git synced 2024-12-04 01:51:34 +00:00

History

Lukas Wunner ea401499e9 PCI: pciehp: Ignore Link Down/Up caused by error-induced Hot Reset Stuart Hayes reports that an error handled by DPC at a Root Port results in pciehp gratuitously bringing down a subordinate hotplug port: RP -- UP -- DP -- UP -- DP (hotplug) -- EP pciehp brings the slot down because the Link to the Endpoint goes down. That is caused by a Hot Reset being propagated as a result of DPC. Per PCIe Base Spec 5.0, section 6.6.1 "Conventional Reset": For a Switch, the following must cause a hot reset to be sent on all Downstream Ports: [...] * The Data Link Layer of the Upstream Port reporting DL_Down status. In Switches that support Link speeds greater than 5.0 GT/s, the Upstream Port must direct the LTSSM of each Downstream Port to the Hot Reset state, but not hold the LTSSMs in that state. This permits each Downstream Port to begin Link training immediately after its hot reset completes. This behavior is recommended for all Switches. * Receiving a hot reset on the Upstream Port. Once DPC recovers, pcie_do_recovery() walks down the hierarchy and invokes pcie_portdrv_slot_reset() to restore each port's config space. At that point, a hotplug interrupt is signaled per PCIe Base Spec r5.0, section 6.7.3.4 "Software Notification of Hot-Plug Events": If the Port is enabled for edge-triggered interrupt signaling using MSI or MSI-X, an interrupt message must be sent every time the logical AND of the following conditions transitions from FALSE to TRUE: [...] * The Hot-Plug Interrupt Enable bit in the Slot Control register is set to 1b. * At least one hot-plug event status bit in the Slot Status register and its associated enable bit in the Slot Control register are both set to 1b. Prevent pciehp from gratuitously bringing down the slot by clearing the error-induced Data Link Layer State Changed event before restoring config space. Afterwards, check whether the link has unexpectedly failed to retrain and synthesize a DLLSC event if so. Allow each pcie_port_service_driver (one of them being pciehp) to define a slot_reset callback and re-use the existing pm_iter() function to iterate over the callbacks. Thereby, the Endpoint driver remains bound throughout error recovery and may restore the device to working state. Surprise removal during error recovery is detected through a Presence Detect Changed event. The hotplug port is expected to not signal that event as a result of a Hot Reset. The issue isn't DPC-specific, it also occurs when an error is handled by AER through aer_root_reset(). So while the issue was noticed only now, it's been around since 2006 when AER support was first introduced. [bhelgaas: drop PCI_ERROR_RECOVERY Kconfig, split pm_iter() rename to preparatory patch] Link: https://lore.kernel.org/linux-pci/08c046b0-c9f2-3489-eeef-7e7aca435bb9@gmail.com/ Fixes: `6c2b374d74` ("PCI-Express AER implemetation: AER core and aerdriver") Link: https://lore.kernel.org/r/251f4edcc04c14f873ff1c967bc686169cd07d2d.1627638184.git.lukas@wunner.de Reported-by: Stuart Hayes <stuart.w.hayes@gmail.com> Tested-by: Stuart Hayes <stuart.w.hayes@gmail.com> Signed-off-by: Lukas Wunner <lukas@wunner.de> Signed-off-by: Bjorn Helgaas <bhelgaas@google.com> Cc: stable@vger.kernel.org # v2.6.19+: `ba952824e6`: PCI/portdrv: Report reset for frozen channel Cc: Keith Busch <kbusch@kernel.org>		2021-10-15 14:23:46 -05:00
..
acpi_pcihp.c	PCI: Fix kernel-doc errors	2021-03-11 17:37:20 -06:00
acpiphp_core.c
acpiphp_glue.c	ACPI / hotplug / PCI: Fix reference count leak in enable_slot()	2021-04-08 11:04:18 -05:00
acpiphp_ibm.c
acpiphp.h	PCI: acpiphp: Fix whitespace issue	2021-04-16 14:32:18 -05:00
cpci_hotplug_core.c
cpci_hotplug_pci.c	PCI: cpcihp: Declare cpci_debug in header file	2021-07-01 15:32:45 -05:00
cpci_hotplug.h	PCI: cpcihp: Declare cpci_debug in header file	2021-07-01 15:32:45 -05:00
cpcihp_generic.c
cpcihp_zt5550.c
cpcihp_zt5550.h
cpqphp_core.c	PCI: Fix kernel-doc formatting	2021-07-06 10:37:46 -05:00
cpqphp_ctrl.c	PCI: Fix kernel-doc formatting	2021-07-06 10:37:46 -05:00
cpqphp_nvram.c	PCI: cpqphp: Use DEFINE_SPINLOCK() for int15_lock	2021-04-14 15:24:10 -05:00
cpqphp_nvram.h
cpqphp_pci.c
cpqphp_sysfs.c
cpqphp.h
ibmphp_core.c
ibmphp_ebda.c	PCI: ibmphp: Fix double unmap of io_mem	2021-09-02 12:02:50 -05:00
ibmphp_hpc.c
ibmphp_pci.c	PCI: ibmphp: Remove unneeded break	2020-11-20 11:17:55 -06:00
ibmphp_res.c	treewide: Use fallthrough pseudo-keyword	2020-08-23 17:36:59 -05:00
ibmphp.h
Kconfig	treewide: replace '---help---' in Kconfig files with 'help'	2020-06-14 01:57:21 +09:00
Makefile
pci_hotplug_core.c	PCI/sysfs: Use sysfs_emit() and sysfs_emit_at() in "show" functions	2021-06-03 22:14:47 -05:00
pciehp_core.c	PCI: pciehp: Ignore Link Down/Up caused by error-induced Hot Reset	2021-10-15 14:23:46 -05:00
pciehp_ctrl.c	pci-v5.10-changes	2020-10-22 12:41:00 -07:00
pciehp_hpc.c	PCI: pciehp: Ignore Link Down/Up caused by error-induced Hot Reset	2021-10-15 14:23:46 -05:00
pciehp_pci.c
pciehp.h	PCI: pciehp: Ignore Link Down/Up caused by error-induced Hot Reset	2021-10-15 14:23:46 -05:00
pnv_php.c	PCI: Change the type of probe argument in reset functions	2021-08-18 17:32:42 -05:00
rpadlpar_core.c	PCI: rpadlpar: Use for_each_child_of_node() and for_each_node_by_name()	2020-09-17 16:22:36 -05:00
rpadlpar_sysfs.c	PCI/sysfs: Use sysfs_emit() and sysfs_emit_at() in "show" functions	2021-06-03 22:14:47 -05:00
rpadlpar.h
rpaphp_core.c	PCI: Use of_node_name_eq() for node name comparisons	2020-04-24 18:02:17 -05:00
rpaphp_pci.c	powerpc/eeh: Make early EEH init pseries specific	2020-03-25 12:09:39 +11:00
rpaphp_slot.c
rpaphp.h
s390_pci_hpc.c	s390/pci: rename zpci_configure_device()	2021-04-30 17:17:00 +02:00
shpchp_core.c
shpchp_ctrl.c	pci-v5.10-changes	2020-10-22 12:41:00 -07:00
shpchp_hpc.c	PCI: shpchp: Remove unused shpc_writeb()	2021-04-16 11:22:24 -05:00
shpchp_pci.c	PCI: shpchp: Make shpchp_unconfigure_device() void	2020-05-21 15:23:20 -05:00
shpchp_sysfs.c	PCI/sysfs: Use sysfs_emit() and sysfs_emit_at() in "show" functions	2021-06-03 22:14:47 -05:00
shpchp.h	PCI: shpchp: Make shpchp_unconfigure_device() void	2020-05-21 15:23:20 -05:00
TODO	PCI: ibmphp: Fix double unmap of io_mem	2021-09-02 12:02:50 -05:00