linux

History

Neal Cardwell 686dc2db2a tcp: fix early ETIMEDOUT after spurious non-SACK RTO Fix a bug reported and analyzed by Nagaraj Arankal, where the handling of a spurious non-SACK RTO could cause a connection to fail to clear retrans_stamp, causing a later RTO to very prematurely time out the connection with ETIMEDOUT. Here is the buggy scenario, expanding upon Nagaraj Arankal's excellent report: (1) Send one data packet on a non-SACK connection (2) Because no ACK packet is received, the packet is retransmitted and we enter CA_Loss; but this retransmission is spurious. (3) The ACK for the original data is received. The transmitted packet is acknowledged. The TCP timestamp is before the retrans_stamp, so tcp_may_undo() returns true, and tcp_try_undo_loss() returns true without changing state to Open (because tcp_is_sack() is false), and tcp_process_loss() returns without calling tcp_try_undo_recovery(). Normally after undoing a CA_Loss episode, tcp_fastretrans_alert() would see that the connection has returned to CA_Open and fall through and call tcp_try_to_open(), which would set retrans_stamp to 0. However, for non-SACK connections we hold the connection in CA_Loss, so do not fall through to call tcp_try_to_open() and do not set retrans_stamp to 0. So retrans_stamp is (erroneously) still non-zero. At this point the first "retransmission event" has passed and been recovered from. Any future retransmission is a completely new "event". However, retrans_stamp is erroneously still set. (And we are still in CA_Loss, which is correct.) (4) After 16 minutes (to correspond with tcp_retries2=15), a new data packet is sent. Note: No data is transmitted between (3) and (4) and we disabled keep alives. The socket's timeout SHOULD be calculated from this point in time, but instead it's calculated from the prior "event" 16 minutes ago (step (2)). (5) Because no ACK packet is received, the packet is retransmitted. (6) At the time of the 2nd retransmission, the socket returns ETIMEDOUT, prematurely, because retrans_stamp is (erroneously) too far in the past (set at the time of (2)). This commit fixes this bug by ensuring that we reuse in tcp_try_undo_loss() the same careful logic for non-SACK connections that we have in tcp_try_undo_recovery(). To avoid duplicating logic, we factor out that logic into a new tcp_is_non_sack_preventing_reopen() helper and call that helper from both undo functions. Fixes: `da34ac7626` ("tcp: only undo on partial ACKs in CA_Loss") Reported-by: Nagaraj Arankal <nagaraj.p.arankal@hpe.com> Link: https://lore.kernel.org/all/SJ0PR84MB1847BE6C24D274C46A1B9B0EB27A9@SJ0PR84MB1847.NAMPRD84.PROD.OUTLOOK.COM/ Signed-off-by: Neal Cardwell <ncardwell@google.com> Signed-off-by: Yuchung Cheng <ycheng@google.com> Reviewed-by: Eric Dumazet <edumazet@google.com> Link: https://lore.kernel.org/r/20220903121023.866900-1-ncardwell.kernel@gmail.com Signed-off-by: Paolo Abeni <pabeni@redhat.com>		2022-09-06 11:06:31 +02:00
..
6lowpan	net: 6lowpan: constify lowpan_nhc structures	2022-06-09 21:53:28 +02:00
9p	iov_iter stuff, part 2, rebased	2022-08-08 20:04:35 -07:00
802
8021q	Merge git://git.kernel.org/pub/scm/linux/kernel/git/netdev/net	2022-07-14 15:27:35 -07:00
appletalk	net: remove noblock parameter from skb_recv_datagram()	2022-04-06 13:45:26 +01:00
atm	net: SO_RCVMARK socket option for SO_MARK with recvmsg()	2022-04-28 13:08:15 -07:00
ax25	net: avoid overflow when rose /proc displays timer information.	2022-08-05 19:00:02 -07:00
batman-adv	batman-adv: tracing: Use the new __vstring() helper	2022-07-30 13:52:47 -04:00
bluetooth	Bluetooth: hci_sync: Fix hci_read_buffer_size_sync	2022-09-02 14:01:28 -07:00
bpf	bpf: Allow calling bpf_prog_test kfuncs in tracing programs	2022-08-09 18:46:11 -07:00
bpfilter	uaccess: remove CONFIG_SET_FS	2022-02-25 09:36:06 +01:00
bridge	netfilter: br_netfilter: Drop dst references before setting.	2022-08-31 12:12:32 +02:00
caif	caif: Fix bitmap data type in "struct caifsock"	2022-07-22 12:51:45 +01:00
can	can: j1939: j1939_session_destroy(): fix memory leak of skbs	2022-08-09 09:05:06 +02:00
ceph	libceph: clean up ceph_osdc_start_request prototype	2022-08-03 14:05:39 +02:00
core	tcp: TX zerocopy should not sense pfmemalloc status	2022-09-02 12:29:02 +01:00
dcb	net: dcb: disable softirqs in dcbnl_flush_dev()	2022-03-03 08:01:55 -08:00
dccp	dccp: put dccp_qpolicy_full() and dccp_qpolicy_push() in the same lock	2022-08-01 12:11:56 -07:00
decnet	dn_route: replace "jiffies-now>0" with "jiffies!=now"	2022-07-29 20:12:49 -07:00
dns_resolver
dsa	net: dsa: hellcreek: Print warning only once	2022-08-31 19:54:04 -07:00
ethernet	net: ethernet: set default assignment identifier to NET_NAME_ENUM	2022-04-07 21:04:03 -07:00
ethtool	net: delete extra space and tab in blank line	2022-07-25 19:38:31 -07:00
hsr	treewide: Replace GPLv2 boilerplate/reference with SPDX - gpl-2.0_30.RULE (part 2)	2022-06-10 14:51:35 +02:00
ieee802154	net: SO_RCVMARK socket option for SO_MARK with recvmsg()	2022-04-28 13:08:15 -07:00
ife
ipv4	tcp: fix early ETIMEDOUT after spurious non-SACK RTO	2022-09-06 11:06:31 +02:00
ipv6	ipv6: sr: fix out-of-bounds read when setting HMAC data.	2022-09-05 10:33:34 +01:00
iucv	net: keep sk->sk_forward_alloc as small as possible	2022-06-10 16:21:27 -07:00
kcm	kcm: fix strp_init() order and cleanup	2022-08-31 12:16:44 -07:00
key	Merge branch 'master' of git://git.kernel.org/pub/scm/linux/kernel/git/klassert/ipsec	2022-08-24 12:51:50 +01:00
l2tp	l2tp: l2tp_debugfs: fix Clang -Wformat warnings	2022-07-08 12:14:36 +01:00
l3mdev	l3mdev: l3mdev_master_upper_ifindex_by_index_rcu should be using netdev_master_upper_dev_get_rcu	2022-04-15 14:27:24 -07:00
lapb
llc	net: rename reference+tracking helpers	2022-06-09 21:52:55 -07:00
mac80211	We have a handful of fixes:	2022-09-04 11:23:11 +01:00
mac802154	net: mac802154: Fix a condition in the receive path	2022-08-29 11:10:22 +02:00
mctp	Networking changes for 5.19.	2022-05-25 12:22:58 -07:00
mpls	net: Use u64_stats_fetch_begin_irq() for stats fetch.	2022-08-29 13:02:27 +01:00
mptcp	net: Fix data-races around sysctl_max_skb_frags.	2022-08-24 13:46:58 +01:00
ncsi	net/ncsi: use proper "mellanox" DT vendor prefix	2022-06-23 20:51:06 -07:00
netfilter	netfilter: nf_conntrack_irc: Fix forged IP logic	2022-09-01 02:01:56 +02:00
netlabel	netlabel: fix typo in comment	2022-08-10 09:24:41 +01:00
netlink	net: genl: fix error path memory leak in policy dumping	2022-08-18 10:20:48 -07:00
netrom	net: remove noblock parameter from skb_recv_datagram()	2022-04-06 13:45:26 +01:00
nfc	net: nfc: Directly use ida_alloc()/free()	2022-05-28 15:28:47 +01:00
nsh
openvswitch	openvswitch: fix memory leak at failed datapath creation	2022-08-26 19:26:30 -07:00
packet	net/af_packet: check len when min_header_len equals to 0	2022-07-29 12:09:27 +01:00
phonet	net: remove noblock parameter from recvmsg() entities	2022-04-12 15:00:25 +02:00
psample
qrtr	net: qrtr: start MHI channel after endpoit creation	2022-08-15 11:21:42 +01:00
rds	rds: add missing barrier to release_refill	2022-08-12 10:46:01 +01:00
rfkill	rfkill: make new event layout opt-in	2022-03-18 13:09:17 +02:00
rose	rose: check NULL rose_loopback_neigh->loopback	2022-08-22 14:24:54 +01:00
rxrpc	rxrpc: Remove rxrpc_get_reply_time() which is no longer used	2022-09-01 11:44:13 +01:00
sched	sch_sfb: Don't assume the skb is still around after enqueueing to child	2022-09-02 12:23:26 +01:00
sctp	Merge git://git.kernel.org/pub/scm/linux/kernel/git/netdev/net	2022-07-28 18:21:16 -07:00
smc	net/smc: Remove redundant refcount increase	2022-09-01 10:04:45 +02:00
strparser	strparser: pad sk_skb_cb to avoid straddling cachelines	2022-07-08 18:38:44 -07:00
sunrpc	NFS client bugfixes for Linux 6.0	2022-08-22 11:40:01 -07:00
switchdev	net: rename reference+tracking helpers	2022-06-09 21:52:55 -07:00
tipc	tipc: fix shift wrapping bug in map_get()	2022-09-02 12:26:29 +01:00
tls	tls: rx: react to strparser initialization errors	2022-08-17 10:24:00 +01:00
unix	Merge https://git.kernel.org/pub/scm/linux/kernel/git/bpf/bpf-next	2022-07-09 12:24:16 -07:00
vmw_vsock	vsock: Set socket state back to SS_UNCONNECTED in vsock_connect_timeout()	2022-08-10 09:50:18 +01:00
wireless	wifi: use struct_group to copy addresses	2022-09-03 16:40:06 +02:00
x25	net/x25: fix call timeouts in blocking connects	2022-08-08 20:48:51 -07:00
xdp	xsk: Fix corrupted packets for XDP_SHARED_UMEM	2022-08-15 17:26:07 +02:00
xfrm	net: Fix data-races around netdev_max_backlog.	2022-08-24 13:46:57 +01:00
compat.c	Merge branch 'for-5.20/io_uring' into for-5.20/io_uring-zerocopy-send	2022-07-24 18:41:03 -06:00
devres.c
Kconfig	page_pool: Add allocation stats	2022-03-03 09:55:28 +00:00
Kconfig.debug	net: CONFIG_DEBUG_NET depends on CONFIG_NET	2022-06-02 10:15:05 -07:00
Makefile
socket.c	net: Fix a data-race around sysctl_somaxconn.	2022-08-24 13:46:58 +01:00
sysctl_net.c