Diff - b7219ccb33aa0df9949a60c68b5e9f712615e56f^! - kernel/msm-4.19

commit	b7219ccb33aa0df9949a60c68b5e9f712615e56f
author	NeilBrown <neilb@suse.de>	Tue Jul 31 10:05:34 2012 +1000
committer	NeilBrown <neilb@suse.de>	Tue Jul 31 10:05:34 2012 +1000
tree	64f89f9a0270134c1895ddfe230bc6e862213ab0
parent	90cf195d9bcb4bf70e8b6df5073b05164b279ba0

md/raid1: don't abort a resync on the first badblock.

If a resync of a RAID1 array with 2 devices finds a known bad block on
one device, it will neither read from nor write to that device for
this block offset.
So there will be one read target (the other device) and zero write
targets.
This condition causes md/raid1 to abort the resync, assuming that it
has finished - without known bad blocks this would be true.

When there are no write targets because of the presence of bad blocks
we should only skip over the area covered by the bad block.
RAID10 already gets this right, raid1 doesn't.  Or didn't.
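
As a rough illustration of the difference, here is a minimal standalone
sketch of the skip-length calculation before and after the change. The
helpers skip_old() and skip_new() and the sector_t typedef are
hypothetical stand-ins, not functions in raid1.c, and min_bad is assumed
to hold the length in sectors of the known-bad range at sector_nr (0
when there is none).

```c
#include <stdint.h>

/* Stand-in for the kernel's sector_t; an assumption for this sketch. */
typedef uint64_t sector_t;

/*
 * Hypothetical helper: old behaviour when there are no write targets.
 * Everything from sector_nr to max_sector is reported as skipped,
 * which ends the resync regardless of why nothing could be written.
 */
static sector_t skip_old(sector_t sector_nr, sector_t max_sector, int min_bad)
{
	(void)min_bad;	/* bad-block length was not consulted */
	return max_sector - sector_nr;
}

/*
 * Hypothetical helper: new behaviour. If the lack of write targets is
 * due to a bad block (min_bad > 0), only the bad range is skipped and
 * the resync continues after it.
 */
static sector_t skip_new(sector_t sector_nr, sector_t max_sector, int min_bad)
{
	if (min_bad > 0)
		max_sector = sector_nr + min_bad;
	return max_sector - sector_nr;
}
```

With sector_nr = 1000, max_sector = 1000000 and min_bad = 8, skip_old()
returns 999000 (the rest of the device is treated as done) while
skip_new() returns 8. With min_bad = 0 both return 999000.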

As this can cause a 'sync' to abort early and appear to have succeeded
it could lead to some data corruption, so it is suitable for -stable.

Cc: stable@vger.kernel.org
Reported-by: Alexander Lyakas <alex.bolshoy@gmail.com>
Signed-off-by: NeilBrown <neilb@suse.de>

diff --git a/drivers/md/raid1.c b/drivers/md/raid1.c
index 7aa958e..d2361b1 100644
--- a/drivers/md/raid1.c
+++ b/drivers/md/raid1.c

@@ -2502,7 +2502,10 @@
 		/* There is nowhere to write, so all non-sync
 		 * drives must be failed - so we are finished
 		 */
-		sector_t rv = max_sector - sector_nr;
+		sector_t rv;
+		if (min_bad > 0)
+			max_sector = sector_nr + min_bad;
+		rv = max_sector - sector_nr;
 		*skipped = 1;
 		put_buf(r1_bio);
 		return rv;