linux/fs/ocfs2/cluster
Tao Ma 5cc3bf2786 ocfs2: Reconnect after idle time out.
Currently, o2net connects to a node on hb_up and disconnects on
hb_down and net timeout.

It disconnects on net timeout is ok, but it should attempt to
reconnect back. This is because sometimes nodes get overloaded
enough that the network connection breaks but the disk hb does not.
And if we get into that situation, we either fence (unnecessarily)
or wait for its disk hb to die (and sometimes hang in the process).

So in this updated scheme, when the network disconnects, we keep
attempting to reconnect till we succeed or we get a disk hb down
event.

If the other node is really dead, then we will eventually get a
node down event. If not, we should be able to connect again and
continue.

Signed-off-by: Tao Ma <tao.ma@oracle.com>
Signed-off-by: Mark Fasheh <mfasheh@suse.com>
2008-04-18 08:56:10 -07:00
..
heartbeat.c [PATCH] Fix possibly too long write in o2hb_setup_one_bio() 2007-11-06 15:35:35 -08:00
heartbeat.h ocfs2: Update default cluster timeouts 2008-01-25 15:05:45 -08:00
Makefile [PATCH] OCFS2: The Second Oracle Cluster Filesystem 2006-01-03 11:45:46 -08:00
masklog.c kset: convert ocfs2 to use kset_create 2008-01-24 20:40:23 -08:00
masklog.h ocfs2: Filter -ENOSPC in mlog_errno() 2007-11-27 16:47:01 -08:00
nodemanager.c byteorder: move le32_add_cpu & friends from OCFS2 to core 2008-02-08 09:22:32 -08:00
nodemanager.h ocfs2: live heartbeat depends on the local node configuration 2007-07-10 17:19:43 -07:00
ocfs2_heartbeat.h ocfs2: warn the user on a dead timeout mismatch 2006-06-29 15:45:35 -07:00
ocfs2_nodemanager.h [PATCH] OCFS2: The Second Oracle Cluster Filesystem 2006-01-03 11:45:46 -08:00
quorum.c ocfs2: Replace panic() with emergency_restart() when fencing 2007-04-26 13:39:02 -07:00
quorum.h [PATCH] OCFS2: The Second Oracle Cluster Filesystem 2006-01-03 11:45:46 -08:00
sys.c ocfs2: Fix userspace ABI breakage in sysfs 2008-01-28 19:10:23 -08:00
sys.h [PATCH] OCFS2: The Second Oracle Cluster Filesystem 2006-01-03 11:45:46 -08:00
tcp_internal.h ocfs2: Reconnect after idle time out. 2008-04-18 08:56:10 -07:00
tcp.c ocfs2: Reconnect after idle time out. 2008-04-18 08:56:10 -07:00
tcp.h ocfs2: Update default cluster timeouts 2008-01-25 15:05:45 -08:00
ver.c ocfs2: bump version number 2008-01-25 15:05:46 -08:00
ver.h [PATCH] OCFS2: The Second Oracle Cluster Filesystem 2006-01-03 11:45:46 -08:00