Diff - ea3d7209ca01da209cda6f0dea8be9cc4b7a933b^! - kernel/msm-4.9

commit	ea3d7209ca01da209cda6f0dea8be9cc4b7a933b
author	Jan Kara <jack@suse.com>	Mon Dec 07 14:28:03 2015 -0500
committer	Theodore Ts'o <tytso@mit.edu>	Mon Dec 07 14:28:03 2015 -0500
tree	809b37322befdf8dda2d12b991d1c832241bc8bc
parent	f41683a204ea61568f0fd0804d47c19561f2ee39

ext4: fix races between page faults and hole punching

Currently, page faults and hole punching are completely unsynchronized.
This can result in a page fault faulting a page into a range that we
are punching after truncate_pagecache_range() has been called, and thus
we can end up with a page mapped to disk blocks that will shortly be
freed. Filesystem corruption will follow. Note that the same race is
avoided for truncate by checking the page fault offset against i_size,
but there isn't a similar mechanism available for punching holes.

Fix the problem by creating a new rw semaphore, i_mmap_sem, in the inode
and grabbing it for writing over truncate, hole punching, and other
functions removing blocks from the extent tree, and for reading over page
faults. We cannot easily use i_data_sem for this since that ranks below
transaction start, and we need something ranking above it so that it can
be held over the whole truncate / hole punching operation. Also remove
various workarounds we had in the code to reduce the race window in which
a page fault could have created pages with stale mapping information.

Signed-off-by: Jan Kara <jack@suse.com>
Signed-off-by: Theodore Ts'o <tytso@mit.edu>

diff --git a/fs/ext4/super.c b/fs/ext4/super.c
index c9ab67d..493370e 100644
--- a/fs/ext4/super.c
+++ b/fs/ext4/super.c

@@ -958,6 +958,7 @@
 	INIT_LIST_HEAD(&ei->i_orphan);
 	init_rwsem(&ei->xattr_sem);
 	init_rwsem(&ei->i_data_sem);
+	init_rwsem(&ei->i_mmap_sem);
 	inode_init_once(&ei->vfs_inode);
 }