Skip to content

Commit 88dbe98

Browse files
biger410torvalds
authored andcommitted
ocfs2: o2hb: don't negotiate if last hb fail
Sometimes io error is returned when storage is down for a while. Like for iscsi device, stroage is made offline when session timeout, and this will make all io return -EIO. For this case, nodes shouldn't do negotiate timeout but should fence self. So let nodes fence self when o2hb_do_disk_heartbeat return an error, this is the same behavior with o2hb without negotiate timer. Signed-off-by: Junxiao Bi <[email protected]> Reviewed-by: Ryan Ding <[email protected]> Reviewed-by: Mark Fasheh <[email protected]> Cc: Gang He <[email protected]> Cc: rwxybh <[email protected]> Cc: Joel Becker <[email protected]> Cc: Joseph Qi <[email protected]> Signed-off-by: Andrew Morton <[email protected]> Signed-off-by: Linus Torvalds <[email protected]>
1 parent 1bd1290 commit 88dbe98

File tree

1 file changed

+10
-0
lines changed

1 file changed

+10
-0
lines changed

fs/ocfs2/cluster/heartbeat.c

Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -284,6 +284,9 @@ struct o2hb_region {
284284
/* Message key for negotiate timeout message. */
285285
unsigned int hr_key;
286286
struct list_head hr_handler_list;
287+
288+
/* last hb status, 0 for success, other value for error. */
289+
int hr_last_hb_status;
287290
};
288291

289292
struct o2hb_bio_wait_ctxt {
@@ -395,6 +398,12 @@ static void o2hb_nego_timeout(struct work_struct *work)
395398
struct o2hb_region *reg;
396399

397400
reg = container_of(work, struct o2hb_region, hr_nego_timeout_work.work);
401+
/* don't negotiate timeout if last hb failed since it is very
402+
* possible io failed. Should let write timeout fence self.
403+
*/
404+
if (reg->hr_last_hb_status)
405+
return;
406+
398407
o2hb_fill_node_map(live_node_bitmap, sizeof(live_node_bitmap));
399408
/* lowest node as master node to make negotiate decision. */
400409
master_node = find_next_bit(live_node_bitmap, O2NM_MAX_NODES, 0);
@@ -1228,6 +1237,7 @@ static int o2hb_thread(void *data)
12281237
before_hb = ktime_get_real();
12291238

12301239
ret = o2hb_do_disk_heartbeat(reg);
1240+
reg->hr_last_hb_status = ret;
12311241

12321242
after_hb = ktime_get_real();
12331243

0 commit comments

Comments
 (0)