dm snapshot: fix race during exception creation

Fix a race condition that returns incorrect data when a write causes an
exception to be allocated whilst a read is still in flight.

The race condition happens as follows:
* A read to non-reallocated sector in the snapshot is submitted so that the
  read is routed to the original device.
* A write to the original device is submitted. The write causes an exception
  that reallocates the block. The write proceeds.
* The original read is dequeued and reads the wrong data.

This race can be triggered with CFQ scheduler and one thread writing and
multiple threads reading simultaneously.

(This patch relies upon the earlier dm-kcopyd-per-device.patch to avoid a
deadlock.)

Signed-off-by: Mikulas Patocka <mpatocka@redhat.com>
Signed-off-by: Alasdair G Kergon <agk@redhat.com>

commit: a8d41b59f3f5a7ac19452ef442a7fc1b5fa17366
author: Mikulas Patocka <mpatocka@redhat.com> Mon Jul 21 12:00:34 2008 +0100
committer: Alasdair G Kergon <agk@redhat.com> Mon Jul 21 12:00:34 2008 +0100
tree: f9435bed2d582e4cd3e91e4d6fb18a18f62aa019
parent: cd45daffd1f7b53aac0835b23e97f814ec3f10dc
diff --git a/drivers/md/dm-snap.c b/drivers/md/dm-snap.c
index de30270..f4fd0ce 100644
--- a/drivers/md/dm-snap.c
+++ b/drivers/md/dm-snap.c

@@ -134,6 +134,27 @@
 	mempool_free(c, s->tracked_chunk_pool);
 }
 
+static int __chunk_is_tracked(struct dm_snapshot *s, chunk_t chunk)
+{
+	struct dm_snap_tracked_chunk *c;
+	struct hlist_node *hn;
+	int found = 0;
+
+	spin_lock_irq(&s->tracked_chunk_lock);
+
+	hlist_for_each_entry(c, hn,
+	    &s->tracked_chunk_hash[DM_TRACKED_CHUNK_HASH(chunk)], node) {
+		if (c->chunk == chunk) {
+			found = 1;
+			break;
+		}
+	}
+
+	spin_unlock_irq(&s->tracked_chunk_lock);
+
+	return found;
+}
+
 /*
  * One of these per registered origin, held in the snapshot_origins hash
  */
@@ -840,6 +861,13 @@
 	}
 
 	/*
+	 * Check for conflicting reads. This is extremely improbable,
+	 * so yield() is sufficient and there is no need for a wait queue.
+	 */
+	while (__chunk_is_tracked(s, pe->e.old_chunk))
+		yield();
+
+	/*
 	 * Add a proper exception, and remove the
 	 * in-flight exception from the list.
 	 */
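
The wait-for-readers pattern in the hunk above can be illustrated outside the kernel. The following is a minimal user-space sketch, not the kernel code: it assumes a pthread mutex in place of spin_lock_irq(), a flat array in place of the tracked-chunk hash, and sched_yield() in place of yield(). The names struct tracker, tracker_init, track_chunk, untrack_chunk, chunk_is_tracked, wait_for_readers and MAX_TRACKED are all hypothetical.

```c
#include <assert.h>
#include <pthread.h>
#include <sched.h>
#include <stdbool.h>
#include <stddef.h>

#define MAX_TRACKED 64 /* hypothetical capacity, not taken from the patch */

/* Hypothetical user-space stand-in for the snapshot's tracked-chunk hash. */
struct tracker {
	pthread_mutex_t lock;
	unsigned long chunk[MAX_TRACKED];
	bool used[MAX_TRACKED];
};

static void tracker_init(struct tracker *t)
{
	pthread_mutex_init(&t->lock, NULL);
	for (size_t i = 0; i < MAX_TRACKED; i++)
		t->used[i] = false;
}

/* Reader side: record the chunk before issuing the read.
 * Returns the slot used, or -1 if the table is full. */
static int track_chunk(struct tracker *t, unsigned long chunk)
{
	int slot = -1;

	pthread_mutex_lock(&t->lock);
	for (int i = 0; i < MAX_TRACKED; i++) {
		if (!t->used[i]) {
			t->used[i] = true;
			t->chunk[i] = chunk;
			slot = i;
			break;
		}
	}
	pthread_mutex_unlock(&t->lock);

	return slot;
}

/* Reader side: drop the record once the read has completed. */
static void untrack_chunk(struct tracker *t, int slot)
{
	assert(slot >= 0 && slot < MAX_TRACKED);

	pthread_mutex_lock(&t->lock);
	t->used[slot] = false;
	pthread_mutex_unlock(&t->lock);
}

/* Same question __chunk_is_tracked() answers: is any read of this
 * chunk still in flight? */
static bool chunk_is_tracked(struct tracker *t, unsigned long chunk)
{
	bool found = false;

	pthread_mutex_lock(&t->lock);
	for (int i = 0; i < MAX_TRACKED; i++) {
		if (t->used[i] && t->chunk[i] == chunk) {
			found = true;
			break;
		}
	}
	pthread_mutex_unlock(&t->lock);

	return found;
}

/* Writer side: before publishing the remapping for a chunk, give up
 * the CPU repeatedly until no read of that chunk remains in flight. */
static void wait_for_readers(struct tracker *t, unsigned long chunk)
{
	while (chunk_is_tracked(t, chunk))
		sched_yield();
}
```

In this sketch a reader would call track_chunk() before submitting its read and untrack_chunk() when the read completes, while the writer would call wait_for_readers() just before making the new mapping visible. Polling with a yield keeps the code small, which matches the patch comment's reasoning that the conflict is improbable enough not to need a wait queue.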