[XFS] Fix memory corruption with small buffer reads When we have multiple buffers in a single page for a blocksize == pagesize filesystem we might overwrite the page contents if two callers hit it shortly after each other. To prevent that we need to keep the page locked until I/O is completed and the page marked uptodate. Thanks to Eric Sandeen for triaging this bug and finding a reproducible testcase and Dave Chinner for additional advice. This should fix kernel.org bz #10421. Tested-by: Eric Sandeen <sandeen@sandeen.net> SGI-PV: 981813 SGI-Modid: xfs-linux-melb:xfs-kern:31173a Signed-off-by: Christoph Hellwig <hch@infradead.org> Signed-off-by: David Chinner <dgc@sgi.com> Signed-off-by: Lachlan McIlroy <lachlan@sgi.com>

commit: 6ab455eeaff6893cd06da33843e840d888cdc04a [log] [tgz]
author: Christoph Hellwig <hch@infradead.org> Mon May 19 16:34:42 2008 +1000
committer: Lachlan McIlroy <lachlan@redback.melbourne.sgi.com> Fri May 23 18:12:49 2008 +1000
tree: e7744d1580647ca3b08e829bcf976f2f60c49986
parent: c8f5f12e46f079a954d4f7163ba59dadee08ca26 [diff] [blame]
diff --git a/fs/xfs/linux-2.6/xfs_buf.h b/fs/xfs/linux-2.6/xfs_buf.h
index 841d788..f948ec7 100644
--- a/fs/xfs/linux-2.6/xfs_buf.h
+++ b/fs/xfs/linux-2.6/xfs_buf.h

@@ -66,6 +66,25 @@
 	_XBF_PAGES = (1 << 18),	    /* backed by refcounted pages	   */
 	_XBF_RUN_QUEUES = (1 << 19),/* run block device task queue	   */
 	_XBF_DELWRI_Q = (1 << 21),   /* buffer on delwri queue		   */
+
+	/*
+	 * Special flag for supporting metadata blocks smaller than a FSB.
+	 *
+	 * In this case we can have multiple xfs_buf_t on a single page and
+	 * need to lock out concurrent xfs_buf_t readers as they only
+	 * serialise access to the buffer.
+	 *
+	 * If the FSB size >= PAGE_CACHE_SIZE case, we have no serialisation
+	 * between reads of the page. Hence we can have one thread read the
+	 * page and modify it, but then race with another thread that thinks
+	 * the page is not up-to-date and hence reads it again.
+	 *
+	 * The result is that the first modifcation to the page is lost.
+	 * This sort of AGF/AGI reading race can happen when unlinking inodes
+	 * that require truncation and results in the AGI unlinked list
+	 * modifications being lost.
+	 */
+	_XBF_PAGE_LOCKED = (1 << 22),
 } xfs_buf_flags_t;
 
 typedef enum {
commit	6ab455eeaff6893cd06da33843e840d888cdc04a	[log] [tgz]
author	Christoph Hellwig <hch@infradead.org>	Mon May 19 16:34:42 2008 +1000
committer	Lachlan McIlroy <lachlan@redback.melbourne.sgi.com>	Fri May 23 18:12:49 2008 +1000
tree	e7744d1580647ca3b08e829bcf976f2f60c49986
parent	c8f5f12e46f079a954d4f7163ba59dadee08ca26 [diff] [blame]