linux

History

Zhao Lei 20b2e3029e btrfs: Fix lockdep warning of wr_ctx->wr_lock in scrub_free_wr_ctx() lockdep report following warning in test: [25176.843958] ================================= [25176.844519] [ INFO: inconsistent lock state ] [25176.845047] 4.1.0-rc3 #22 Tainted: G W [25176.845591] --------------------------------- [25176.846153] inconsistent {SOFTIRQ-ON-W} -> {IN-SOFTIRQ-W} usage. [25176.846713] fsstress/26661 [HC0[0]:SC1[1]:HE1:SE0] takes: [25176.847246] (&wr_ctx->wr_lock){+.?...}, at: [<ffffffffa04cdc6d>] scrub_free_ctx+0x2d/0xf0 [btrfs] [25176.847838] {SOFTIRQ-ON-W} state was registered at: [25176.848396] [<ffffffff810bf460>] __lock_acquire+0x6a0/0xe10 [25176.848955] [<ffffffff810bfd1e>] lock_acquire+0xce/0x2c0 [25176.849491] [<ffffffff816489af>] mutex_lock_nested+0x7f/0x410 [25176.850029] [<ffffffffa04d04ff>] scrub_stripe+0x4df/0x1080 [btrfs] [25176.850575] [<ffffffffa04d11b1>] scrub_chunk.isra.19+0x111/0x130 [btrfs] [25176.851110] [<ffffffffa04d144c>] scrub_enumerate_chunks+0x27c/0x510 [btrfs] [25176.851660] [<ffffffffa04d3b87>] btrfs_scrub_dev+0x1c7/0x6c0 [btrfs] [25176.852189] [<ffffffffa04e918e>] btrfs_dev_replace_start+0x36e/0x450 [btrfs] [25176.852771] [<ffffffffa04a98e0>] btrfs_ioctl+0x1e10/0x2d20 [btrfs] [25176.853315] [<ffffffff8121c5b8>] do_vfs_ioctl+0x318/0x570 [25176.853868] [<ffffffff8121c851>] SyS_ioctl+0x41/0x80 [25176.854406] [<ffffffff8164da17>] system_call_fastpath+0x12/0x6f [25176.854935] irq event stamp: 51506 [25176.855511] hardirqs last enabled at (51506): [<ffffffff810d4ce5>] vprintk_emit+0x225/0x5e0 [25176.856059] hardirqs last disabled at (51505): [<ffffffff810d4b77>] vprintk_emit+0xb7/0x5e0 [25176.856642] softirqs last enabled at (50886): [<ffffffff81067a23>] __do_softirq+0x363/0x640 [25176.857184] softirqs last disabled at (50949): [<ffffffff8106804d>] irq_exit+0x10d/0x120 [25176.857746] other info that might help us debug this: [25176.858845] Possible unsafe locking scenario: [25176.859981] CPU0 [25176.860537] ---- [25176.861059] lock(&wr_ctx->wr_lock); [25176.861705] <Interrupt> [25176.862272] lock(&wr_ctx->wr_lock); [25176.862881] * DEADLOCK * Reason: Above warning is caused by: Interrupt -> bio_endio() -> ... -> scrub_put_ctx() -> scrub_free_ctx() 1 -> ... -> mutex_lock(&wr_ctx->wr_lock); scrub_put_ctx() is allowed to be called in end_bio interrupt, but in code design, it will never call scrub_free_ctx(sctx) in interrupe context(above 1), because btrfs_scrub_dev() get one additional reference of sctx->refs, which makes scrub_free_ctx() only called withine btrfs_scrub_dev(). Now the code runs out of our wish, because free sequence in scrub_pending_bio_dec() have a gap. Current code: -----------------------------------+----------------------------------- scrub_pending_bio_dec() \| btrfs_scrub_dev -----------------------------------+----------------------------------- atomic_dec(&sctx->bios_in_flight); \| wake_up(&sctx->list_wait); \| \| scrub_put_ctx() \| -> atomic_dec_and_test(&sctx->refs) scrub_put_ctx(sctx); \| -> atomic_dec_and_test(&sctx->refs)\| -> scrub_free_ctx() \| -----------------------------------+----------------------------------- We expected: -----------------------------------+----------------------------------- scrub_pending_bio_dec() \| btrfs_scrub_dev -----------------------------------+----------------------------------- atomic_dec(&sctx->bios_in_flight); \| wake_up(&sctx->list_wait); \| scrub_put_ctx(sctx); \| -> atomic_dec_and_test(&sctx->refs)\| \| scrub_put_ctx() \| -> atomic_dec_and_test(&sctx->refs) \| -> scrub_free_ctx() -----------------------------------+----------------------------------- Fix: Move scrub_pending_bio_dec() to a workqueue, to avoid this function run in interrupt context. Tested by check tracelog in debug. Changelog v1->v2: Use workqueue instead of adjust function call sequence in v1, because v1 will introduce a bug pointed out by: Filipe David Manana <fdmanana@gmail.com> Reported-by: Qu Wenruo <quwenruo@cn.fujitsu.com> Signed-off-by: Zhao Lei <zhaolei@cn.fujitsu.com> Reviewed-by: Filipe Manana <fdmanana@suse.com> Signed-off-by: Chris Mason <clm@fb.com>		2015-06-10 07:04:52 -07:00
..
9p	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
adfs	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
affs	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
afs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
autofs4	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
befs	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
bfs	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
btrfs	btrfs: Fix lockdep warning of wr_ctx->wr_lock in scrub_free_wr_ctx()	2015-06-10 07:04:52 -07:00
cachefiles	VFS: fs/cachefiles: d_backing_inode() annotations	2015-04-15 15:06:59 -04:00
ceph	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
cifs	CIFS: Fix race condition on RFC1002_NEGATIVE_SESSION_RESPONSE	2015-05-20 13:25:55 -05:00
coda	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
configfs	configfs: init configfs module earlier at boot time	2015-05-05 17:10:11 -07:00
cramfs
debugfs	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
devpts	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
dlm	netlink: make nlmsg_end() and genlmsg_end() void	2015-01-18 01:03:45 -05:00
ecryptfs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
efivarfs	Merge branch 'x86-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip	2015-05-06 10:57:37 -07:00
efs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
exofs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
exportfs	VFS: (Scripted) Convert S_ISLNK/DIR/REG(dentry->d_inode) to d_is_*(dentry)	2015-02-22 11:38:41 -05:00
ext2	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
ext3	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
ext4	ext4: fix an ext3 collapse range regression in xfstests	2015-05-15 00:24:10 -04:00
f2fs	f2fs: fix wrong error hanlder in f2fs_follow_link	2015-05-04 14:15:16 -07:00
fat	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
freevxfs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
fscache
fuse	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
gfs2	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
hfs	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
hfsplus	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
hostfs	hostfs: Use correct mask for file mode	2015-05-04 14:50:29 +02:00
hpfs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
hppfs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
hugetlbfs	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
isofs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
jbd
jbd2	jbd2: fix r_count overflows leading to buffer overflow in journal recovery	2015-05-14 19:11:50 -04:00
jffs2	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
jfs	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
kernfs	kernfs: do not account ino_ida allocations to memcg	2015-05-14 17:55:51 -07:00
lockd	nfsd: eliminate NFSD_DEBUG	2015-04-21 16:16:02 -04:00
logfs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
minix	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
ncpfs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
nfs	nfs: take extra reference to fl->fl_file when running a setlk	2015-05-13 14:56:06 -04:00
nfs_common
nfsd	nfsd: skip CB_NULL probes for 4.1 or later	2015-05-04 12:02:42 -04:00
nilfs2	nilfs2: fix sanity check of btree level in nilfs_btree_root_broken()	2015-05-05 17:10:11 -07:00
nls
notify	fanotify: fix event filtering with FAN_ONDIR set	2015-03-12 18:46:08 -07:00
ntfs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
ocfs2	ocfs2: dlm: fix race between purge and get lock resource	2015-05-05 17:10:11 -07:00
omfs	omfs: fix potential integer overflow in allocator	2015-05-28 18:25:19 -07:00
openpromfs
overlayfs	ovl: mount read-only if workdir can't be created	2015-05-19 14:30:12 +02:00
proc	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
pstore	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
qnx4
qnx6	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
quota	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
ramfs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
reiserfs	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
romfs	make new_sync_{read,write}() static	2015-04-11 22:29:40 -04:00
squashfs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
sysfs	sysfs: Only accept read/write permissions for file attributes	2015-03-25 13:27:57 +01:00
sysv	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
tracefs	tracing: Have mkdir and rmdir be part of tracefs	2015-02-03 12:48:43 -05:00
ubifs	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
udf	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
ufs	VFS: normal filesystems (and lustre): d_inode() annotations	2015-04-15 15:06:57 -04:00
xfs	xfs: fix broken i_nlink accounting for whiteout tmpfile inode	2015-05-29 08:14:55 +10:00
aio.c	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-16 23:27:56 -04:00
anon_inodes.c
attr.c
bad_inode.c	don't bother with most of the bad_file_ops methods	2015-02-20 04:03:58 -05:00
binfmt_aout.c
binfmt_elf_fdpic.c
binfmt_elf.c	fs/binfmt_elf.c:load_elf_binary(): return -EINVAL on zero-length mappings	2015-05-28 18:25:18 -07:00
binfmt_em86.c
binfmt_flat.c
binfmt_misc.c	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
binfmt_script.c
block_dev.c	direct-io: only inc/dec inode->i_dio_count for file systems	2015-04-24 15:45:28 -04:00
buffer.c	page_writeback: clean up mess around cancel_dirty_page()	2015-04-14 16:49:01 -07:00
char_dev.c	fs: introduce f_op->mmap_capabilities for nommu mmap support	2015-01-20 14:02:58 -07:00
compat_binfmt_elf.c
compat_ioctl.c	Bluetooth: bnep: Add support for get bnep features via ioctl	2015-04-03 23:21:34 +02:00
compat.c
coredump.c	coredump: accept any write method	2015-04-11 22:29:39 -04:00
dax.c	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs	2015-04-26 17:22:07 -07:00
dcache.c	d_walk() might skip too much	2015-05-28 23:45:30 -04:00
dcookies.c
direct-io.c	direct-io: only inc/dec inode->i_dio_count for file systems	2015-04-24 15:45:28 -04:00
drop_caches.c	vmscan: per memory cgroup slab shrinkers	2015-02-12 18:54:09 -08:00
eventfd.c	eventfd: don't take the spinlock in eventfd_poll	2015-02-17 14:34:52 -08:00
eventpoll.c	epoll: optimize setting task running after blocking	2015-02-13 21:21:40 -08:00
exec.c	parisc,metag: Fix crashes due to stack randomization on stack-grows-upwards architectures	2015-05-12 22:03:44 +02:00
fcntl.c
fhandle.c
file_table.c	->aio_read and ->aio_write removed	2015-04-11 22:29:43 -04:00
file.c	mm: rcu-protected get_mm_exe_file()	2015-04-17 09:04:07 -04:00
filesystems.c
fs_pin.c	fs_pin: Allow for the possibility that m_list or s_list go unused.	2015-04-09 11:39:55 -05:00
fs_struct.c
fs-writeback.c	fs: add dirtytime_expire_seconds sysctl	2015-03-17 12:23:32 -04:00
inode.c	direct-io: only inc/dec inode->i_dio_count for file systems	2015-04-24 15:45:28 -04:00
internal.h	trylock_super(): replacement for grab_super_passive()	2015-02-22 11:38:42 -05:00
ioctl.c	fsioctl.c: make generic_block_fiemap() signal-tolerant	2015-02-10 14:30:30 -08:00
Kconfig	f2fs: relocate Kconfig from misc filesystems	2015-04-10 15:08:35 -07:00
Kconfig.binfmt	mm: split ET_DYN ASLR from mmap ASLR	2015-04-14 16:49:05 -07:00
libfs.c	VFS: fs library helpers: d_inode() annotations	2015-04-15 15:06:58 -04:00
locks.c	proc: show locks in /proc/pid/fdinfo/X	2015-04-17 09:04:12 -04:00
Makefile	This adds the new tracefs file system. This has been in linux-next for	2015-04-14 10:22:29 -07:00
mbcache.c
mount.h	switch the IO-triggering parts of umount to fs_pin	2015-01-25 23:17:29 -05:00
mpage.c
namei.c	path_openat(): fix double fput()	2015-05-09 00:12:48 -04:00
namespace.c	mnt: Fix fs_fully_visible to verify the root directory is visible	2015-05-09 11:55:50 -05:00
no-block.c
nsfs.c	VFS: assorted weird filesystems: d_inode() annotations	2015-04-15 15:06:58 -04:00
open.c	xfs: update for 4.1-rc1	2015-04-24 07:08:41 -07:00
pipe.c	VFS: assorted weird filesystems: d_inode() annotations	2015-04-15 15:06:58 -04:00
pnode.c	mnt: Don't propagate unmounts to locked mounts	2015-04-02 20:34:20 -05:00
pnode.h	mnt: Honor MNT_LOCKED when detaching mounts	2015-04-09 11:39:55 -05:00
posix_acl.c	VFS: assorted d_backing_inode() annotations	2015-04-15 15:06:59 -04:00
proc_namespace.c	vfs: add support for a lazytime mount option	2015-02-05 02:45:00 -05:00
read_write.c	new_sync_write(): discard ->ki_pos unless the return value is positive	2015-04-11 22:29:46 -04:00
readdir.c
select.c	all arches, signal: move restart_block to struct task_struct	2015-02-12 18:54:12 -08:00
seq_file.c	Btrfs: show subvol= and subvolid= in /proc/mounts	2015-06-03 04:03:02 -07:00
signalfd.c
splice.c	splice: sendfile() at once fails for big files	2015-05-06 09:27:41 -06:00
stack.c
stat.c	VFS: assorted d_backing_inode() annotations	2015-04-15 15:06:59 -04:00
statfs.c
super.c	cleancache: remove limit on the number of cleancache enabled filesystems	2015-04-14 16:49:03 -07:00
sync.c	vfs: add support for a lazytime mount option	2015-02-05 02:45:00 -05:00
timerfd.c
utimes.c
xattr.c