nvme_fc: fix error recovery on link down. Currently, the fc transport invokes nvme_fc_error_recovery() on every io in which the transport detects an error. Which means: a) it's really noisy on large io loads that all get hit by a link down. b) we repeatively call nvme_stop_queues() even though queues are stopped upon the first error or as first steps of reset_work. Correct by: Errors are only meaningful if the controller is in the LIVE state. Thus, enact the reset_work only if LIVE. If called repeatively, state will have already transitioned. There's no need to stop the queues here. Let the first steps of reset_work do the queue stopping. Signed-off-by: James Smart <james.smart@broadcom.com> Signed-off-by: Sagi Grimberg <sagi@grimberg.me> Signed-off-by: Jens Axboe <axboe@kernel.dk>

commit: 69fa964632fe18a11a80ead0d09ef3399b08144a [log] [tgz]
author: James Smart <jsmart2021@gmail.com> Wed Jun 21 17:43:21 2017 -0700
committer: Jens Axboe <axboe@kernel.dk> Wed Jun 28 08:14:13 2017 -0600
tree: 51a5bcf047795d11fe60fb7977a2037599fc5f94
parent: 188f7e8a3789cb2e37b68903e0a40f406091fa97 [diff] [blame]
diff --git a/drivers/nvme/host/fc.c b/drivers/nvme/host/fc.c
index 9444495..ed87214 100644
--- a/drivers/nvme/host/fc.c
+++ b/drivers/nvme/host/fc.c

@@ -1749,16 +1749,16 @@
 static void
 nvme_fc_error_recovery(struct nvme_fc_ctrl *ctrl, char *errmsg)
 {
+	/* only proceed if in LIVE state - e.g. on first error */
+	if (ctrl->ctrl.state != NVME_CTRL_LIVE)
+		return;
+
 	dev_warn(ctrl->ctrl.device,
 		"NVME-FC{%d}: transport association error detected: %s\n",
 		ctrl->cnum, errmsg);
 	dev_warn(ctrl->ctrl.device,
 		"NVME-FC{%d}: resetting controller\n", ctrl->cnum);
 
-	/* stop the queues on error, cleanup is in reset thread */
-	if (ctrl->queue_count > 1)
-		nvme_stop_queues(&ctrl->ctrl);
-
 	if (!nvme_change_ctrl_state(&ctrl->ctrl, NVME_CTRL_RECONNECTING)) {
 		dev_err(ctrl->ctrl.device,
 			"NVME-FC{%d}: error_recovery: Couldn't change state "
commit	69fa964632fe18a11a80ead0d09ef3399b08144a	[log] [tgz]
author	James Smart <jsmart2021@gmail.com>	Wed Jun 21 17:43:21 2017 -0700
committer	Jens Axboe <axboe@kernel.dk>	Wed Jun 28 08:14:13 2017 -0600
tree	51a5bcf047795d11fe60fb7977a2037599fc5f94
parent	188f7e8a3789cb2e37b68903e0a40f406091fa97 [diff] [blame]