linux

mirror of https://github.com/torvalds/linux.git synced 2024-11-23 12:42:02 +00:00

History

Filipe Manana d7cd4dd907 Btrfs: fix sysfs warning and missing raid sysfs directories In the 5.3 merge window, commit `7c7e301406` ("btrfs: sysfs: Replace default_attrs in ktypes with groups"), we started using the member "defaults_groups" for the kobject type "btrfs_raid_ktype". That leads to a series of warnings when running some test cases of fstests, such as btrfs/027, btrfs/124 and btrfs/176. The traces produced by those warnings are like the following: [116648.059212] kernfs: can not remove 'total_bytes', no directory [116648.060112] WARNING: CPU: 3 PID: 28500 at fs/kernfs/dir.c:1504 kernfs_remove_by_name_ns+0x75/0x80 (...) [116648.066482] CPU: 3 PID: 28500 Comm: umount Tainted: G W 5.3.0-rc3-btrfs-next-54 #1 (...) [116648.069376] RIP: 0010:kernfs_remove_by_name_ns+0x75/0x80 (...) [116648.072385] RSP: 0018:ffffabfd0090bd08 EFLAGS: 00010282 [116648.073437] RAX: 0000000000000000 RBX: ffffffffc0c11998 RCX: 0000000000000000 [116648.074201] RDX: ffff9fff603a7a00 RSI: ffff9fff603978a8 RDI: ffff9fff603978a8 [116648.074956] RBP: ffffffffc0b9ca2f R08: 0000000000000000 R09: 0000000000000001 [116648.075708] R10: ffff9ffe1f72e1c0 R11: 0000000000000000 R12: ffffffffc0b94120 [116648.076434] R13: ffffffffb3d9b4e0 R14: 0000000000000000 R15: dead000000000100 [116648.077143] FS: 00007f9cdc78a2c0(0000) GS:ffff9fff60380000(0000) knlGS:0000000000000000 [116648.077852] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [116648.078546] CR2: 00007f9fc4747ab4 CR3: 00000005c7832003 CR4: 00000000003606e0 [116648.079235] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 [116648.079907] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 [116648.080585] Call Trace: [116648.081262] remove_files+0x31/0x70 [116648.081929] sysfs_remove_group+0x38/0x80 [116648.082596] sysfs_remove_groups+0x34/0x70 [116648.083258] kobject_del+0x20/0x60 [116648.083933] btrfs_free_block_groups+0x405/0x430 [btrfs] [116648.084608] close_ctree+0x19a/0x380 [btrfs] [116648.085278] generic_shutdown_super+0x6c/0x110 [116648.085951] kill_anon_super+0xe/0x30 [116648.086621] btrfs_kill_super+0x12/0xa0 [btrfs] [116648.087289] deactivate_locked_super+0x3a/0x70 [116648.087956] cleanup_mnt+0xb4/0x160 [116648.088620] task_work_run+0x7e/0xc0 [116648.089285] exit_to_usermode_loop+0xfa/0x100 [116648.089933] do_syscall_64+0x1cb/0x220 [116648.090567] entry_SYSCALL_64_after_hwframe+0x49/0xbe [116648.091197] RIP: 0033:0x7f9cdc073b37 (...) [116648.100046] ---[ end trace 22e24db328ccadf8 ]--- [116648.100618] ------------[ cut here ]------------ [116648.101175] kernfs: can not remove 'used_bytes', no directory [116648.101731] WARNING: CPU: 3 PID: 28500 at fs/kernfs/dir.c:1504 kernfs_remove_by_name_ns+0x75/0x80 (...) [116648.105649] CPU: 3 PID: 28500 Comm: umount Tainted: G W 5.3.0-rc3-btrfs-next-54 #1 (...) [116648.107461] RIP: 0010:kernfs_remove_by_name_ns+0x75/0x80 (...) [116648.109336] RSP: 0018:ffffabfd0090bd08 EFLAGS: 00010282 [116648.109979] RAX: 0000000000000000 RBX: ffffffffc0c119a0 RCX: 0000000000000000 [116648.110625] RDX: ffff9fff603a7a00 RSI: ffff9fff603978a8 RDI: ffff9fff603978a8 [116648.111283] RBP: ffffffffc0b9ca41 R08: 0000000000000000 R09: 0000000000000001 [116648.111940] R10: ffff9ffe1f72e1c0 R11: 0000000000000000 R12: ffffffffc0b94120 [116648.112603] R13: ffffffffb3d9b4e0 R14: 0000000000000000 R15: dead000000000100 [116648.113268] FS: 00007f9cdc78a2c0(0000) GS:ffff9fff60380000(0000) knlGS:0000000000000000 [116648.113939] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [116648.114607] CR2: 00007f9fc4747ab4 CR3: 00000005c7832003 CR4: 00000000003606e0 [116648.115286] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 [116648.115966] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 [116648.116649] Call Trace: [116648.117326] remove_files+0x31/0x70 [116648.117997] sysfs_remove_group+0x38/0x80 [116648.118671] sysfs_remove_groups+0x34/0x70 [116648.119342] kobject_del+0x20/0x60 [116648.120022] btrfs_free_block_groups+0x405/0x430 [btrfs] [116648.120707] close_ctree+0x19a/0x380 [btrfs] [116648.121396] generic_shutdown_super+0x6c/0x110 [116648.122057] kill_anon_super+0xe/0x30 [116648.122702] btrfs_kill_super+0x12/0xa0 [btrfs] [116648.123335] deactivate_locked_super+0x3a/0x70 [116648.123961] cleanup_mnt+0xb4/0x160 [116648.124586] task_work_run+0x7e/0xc0 [116648.125210] exit_to_usermode_loop+0xfa/0x100 [116648.125830] do_syscall_64+0x1cb/0x220 [116648.126463] entry_SYSCALL_64_after_hwframe+0x49/0xbe [116648.127080] RIP: 0033:0x7f9cdc073b37 (...) [116648.135923] ---[ end trace 22e24db328ccadf9 ]--- These happen because, during the unmount path, we call kobject_del() for raid kobjects that are not fully initialized, meaning that we set their ktype (as btrfs_raid_ktype) through link_block_group() but we didn't set their parent kobject, which is done through btrfs_add_raid_kobjects(). We have this split raid kobject setup since commit `75cb379d26` ("btrfs: defer adding raid type kobject until after chunk relocation") in order to avoid triggering reclaim during contextes where we can not (either we are holding a transaction handle or some lock required by the transaction commit path), so that we do the calls to kobject_add(), which triggers GFP_KERNEL allocations, through btrfs_add_raid_kobjects() in contextes where it is safe to trigger reclaim. That change expected that a new raid kobject can only be created either when mounting the filesystem or after raid profile conversion through the relocation path. However, we can have new raid kobject created in other two cases at least: 1) During device replace (or scrub) after adding a device a to the filesystem. The replace procedure (and scrub) do calls to btrfs_inc_block_group_ro() which can allocate a new block group with a new raid profile (because we now have more devices). This can be triggered by test cases btrfs/027 and btrfs/176. 2) During a degraded mount trough any write path. This can be triggered by test case btrfs/124. Fixing this by adding extra calls to btrfs_add_raid_kobjects(), not only makes things more complex and fragile, can also introduce deadlocks with reclaim the following way: 1) Calling btrfs_add_raid_kobjects() at btrfs_inc_block_group_ro() or anywhere in the replace/scrub path will cause a deadlock with reclaim because if reclaim happens and a transaction commit is triggered, the transaction commit path will block at btrfs_scrub_pause(). 2) During degraded mounts it is essentially impossible to figure out where to add extra calls to btrfs_add_raid_kobjects(), because allocation of a block group with a new raid profile can happen anywhere, which means we can't safely figure out which contextes are safe for reclaim, as we can either hold a transaction handle or some lock needed by the transaction commit path. So it is too complex and error prone to have this split setup of raid kobjects. So fix the issue by consolidating the setup of the kobjects in a single place, at link_block_group(), and setup a nofs context there in order to prevent reclaim being triggered by the memory allocations done through the call chain of kobject_add(). Besides fixing the sysfs warnings during kobject_del(), this also ensures the sysfs directories for the new raid profiles end up created and visible to users (a bug that existed before the 5.3 commit `7c7e301406` ("btrfs: sysfs: Replace default_attrs in ktypes with groups")). Fixes: `75cb379d26` ("btrfs: defer adding raid type kobject until after chunk relocation") Fixes: `7c7e301406` ("btrfs: sysfs: Replace default_attrs in ktypes with groups") Signed-off-by: Filipe Manana <fdmanana@suse.com> Reviewed-by: David Sterba <dsterba@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>		2019-08-07 16:25:44 +02:00
..
tests	btrfs: Evaluate io_tree in find_lock_delalloc_range()	2019-07-04 17:26:17 +02:00
acl.c	btrfs: cleanup btrfs_setxattr_trans and drop transaction parameter	2019-04-29 19:02:44 +02:00
async-thread.c	btrfs: simplify workqueue name when allocating	2019-02-25 14:13:24 +01:00
async-thread.h	btrfs: replace GPL boilerplate by SPDX -- headers	2018-04-12 16:29:46 +02:00
backref.c	Btrfs: fix deadlock between fiemap and transaction commits	2019-07-30 18:25:12 +02:00
backref.h	btrfs: fiemap: preallocate ulists for btrfs_check_shared	2019-07-01 13:34:53 +02:00
block-rsv.c	btrfs: migrate the global_block_rsv helpers to block-rsv.c	2019-07-02 12:30:55 +02:00
block-rsv.h	btrfs: migrate the global_block_rsv helpers to block-rsv.c	2019-07-02 12:30:55 +02:00
btrfs_inode.h	btrfs: remove assumption about csum type form btrfs_print_data_csum_error()	2019-07-01 13:35:02 +02:00
check-integrity.c	btrfs: directly call into crypto framework for checksumming	2019-07-01 13:35:02 +02:00
check-integrity.h	btrfs: replace GPL boilerplate by SPDX -- headers	2018-04-12 16:29:46 +02:00
compression.c	btrfs: lift bio_set_dev from bio allocation helpers	2019-07-02 12:30:51 +02:00
compression.h	btrfs: correctly validate compression type	2019-07-02 12:30:48 +02:00
ctree.c	btrfs: ctree: Dump the leaf before BUG_ON in btrfs_set_item_key_safe	2019-04-29 19:02:52 +02:00
ctree.h	Btrfs: fix sysfs warning and missing raid sysfs directories	2019-08-07 16:25:44 +02:00
dedupe.h	btrfs: replace GPL boilerplate by SPDX -- headers	2018-04-12 16:29:46 +02:00
delalloc-space.c	btrfs: migrate the delalloc space stuff to it's own home	2019-07-04 17:26:17 +02:00
delalloc-space.h	btrfs: migrate the delalloc space stuff to it's own home	2019-07-04 17:26:17 +02:00
delayed-inode.c	btrfs: get fs_info from eb in btrfs_leaf_free_space	2019-04-29 19:02:30 +02:00
delayed-inode.h	Btrfs: delayed-inode: use rb_first_cached for ins_root and del_root	2018-10-15 17:23:33 +02:00
delayed-ref.c	btrfs: migrate the delayed refs rsv code	2019-07-04 17:26:17 +02:00
delayed-ref.h	btrfs: migrate the delayed refs rsv code	2019-07-04 17:26:17 +02:00
dev-replace.c	btrfs: remove mapping tree structures indirection	2019-07-01 13:34:56 +02:00
dev-replace.h	btrfs: get fs_info from trans in btrfs_run_dev_replace	2019-04-29 19:02:43 +02:00
dir-item.c	btrfs: remove unused parameter fs_info from btrfs_extend_item	2019-04-29 19:02:50 +02:00
disk-io.c	Btrfs: fix sysfs warning and missing raid sysfs directories	2019-08-07 16:25:44 +02:00
disk-io.h	btrfs: directly call into crypto framework for checksumming	2019-07-01 13:35:02 +02:00
export.c	btrfs: Remove 'objectid' member from struct btrfs_root	2018-10-15 17:23:25 +02:00
export.h	btrfs: replace GPL boilerplate by SPDX -- headers	2018-04-12 16:29:46 +02:00
extent_io.c	btrfs: fix memory leak of path on error return path	2019-07-05 18:47:57 +02:00
extent_io.h	btrfs: Evaluate io_tree in find_lock_delalloc_range()	2019-07-04 17:26:17 +02:00
extent_map.c	btrfs: Optimize unallocated chunks discard	2019-04-29 19:02:38 +02:00
extent_map.h	btrfs: Remove impossible condition from mergable_maps	2019-02-25 14:13:21 +01:00
extent-tree.c	Btrfs: fix sysfs warning and missing raid sysfs directories	2019-08-07 16:25:44 +02:00
file-item.c	btrfs: directly call into crypto framework for checksumming	2019-07-01 13:35:02 +02:00
file.c	btrfs: migrate the delalloc space stuff to it's own home	2019-07-04 17:26:17 +02:00
free-space-cache.c	btrfs: migrate the delalloc space stuff to it's own home	2019-07-04 17:26:17 +02:00
free-space-cache.h	btrfs: get fs_info from block group in btrfs_find_space_cluster	2019-04-29 19:02:46 +02:00
free-space-tree.c	btrfs: get fs_info from block group in search_free_space_info	2019-04-29 19:02:46 +02:00
free-space-tree.h	btrfs: get fs_info from block group in search_free_space_info	2019-04-29 19:02:46 +02:00
inode-item.c	btrfs: remove unused parameter fs_info from btrfs_extend_item	2019-04-29 19:02:50 +02:00
inode-map.c	btrfs: migrate the delalloc space stuff to it's own home	2019-07-04 17:26:17 +02:00
inode-map.h	btrfs: replace GPL boilerplate by SPDX -- headers	2018-04-12 16:29:46 +02:00
inode.c	btrfs: inode: Don't compress if NODATASUM or NODATACOW set	2019-07-17 17:03:28 +02:00
ioctl.c	btrfs: migrate the delalloc space stuff to it's own home	2019-07-04 17:26:17 +02:00
Kconfig	btrfs: Fix build error while LIBCRC32C is module	2019-07-17 17:03:30 +02:00
locking.c	btrfs: Fix deadlock caused by missing memory barrier	2019-07-25 17:34:08 +02:00
locking.h	btrfs: merge btrfs_set_lock_blocking_rw with it's caller	2019-02-25 14:13:28 +01:00
lzo.c	btrfs: change set_level() to bound the level passed in	2019-02-25 14:13:32 +01:00
Makefile	btrfs: migrate the delalloc space stuff to it's own home	2019-07-04 17:26:17 +02:00
math.h	btrfs: replace GPL boilerplate by SPDX -- headers	2018-04-12 16:29:46 +02:00
ordered-data.c	btrfs: fix extent_state leak in btrfs_lock_and_flush_ordered_range	2019-07-26 12:21:22 +02:00
ordered-data.h	btrfs: don't assume ordered sums to be 4 bytes	2019-07-01 13:35:00 +02:00
orphan.c	btrfs: replace GPL boilerplate by SPDX -- sources	2018-04-12 16:29:51 +02:00
print-tree.c	btrfs: switch extent_buffer write_locks from atomic to int	2019-07-02 12:30:47 +02:00
print-tree.h	btrfs: print-tree: debugging output enhancement	2018-04-20 19:18:16 +02:00
props.c	btrfs: shut up bogus -Wmaybe-uninitialized warning	2019-07-02 12:30:49 +02:00
props.h	btrfs: delete unused function btrfs_set_prop_trans	2019-04-29 19:02:54 +02:00
qgroup.c	btrfs: qgroup: Don't hold qgroup_ioctl_lock in btrfs_qgroup_inherit()	2019-07-02 12:30:48 +02:00
qgroup.h	btrfs: qgroup: Move reserved data accounting from btrfs_delayed_ref_head to btrfs_qgroup_extent_record	2019-02-25 14:13:39 +01:00
raid56.c	block: remove the i argument to bio_for_each_segment_all	2019-04-30 09:26:13 -06:00
raid56.h	btrfs: constify map parameter for nr_parity_stripes and nr_data_stripes	2019-07-01 13:34:58 +02:00
rcu-string.h	btrfs: replace GPL boilerplate by SPDX -- headers	2018-04-12 16:29:46 +02:00
reada.c	btrfs: start readahead also in seed devices	2019-06-14 17:33:46 +02:00
ref-verify.c	Wimplicit-fallthrough patches for 5.2-rc1	2019-05-07 12:48:10 -07:00
ref-verify.h	btrfs: ref-verify: Use btrfs_ref to refactor btrfs_ref_tree_mod()	2019-04-29 19:02:49 +02:00
relocation.c	btrfs: migrate the delalloc space stuff to it's own home	2019-07-04 17:26:17 +02:00
root-tree.c	btrfs: move the subvolume reservation stuff out of extent-tree.c	2019-07-04 17:26:18 +02:00
scrub.c	btrfs: add mask for all RAID1 types	2019-07-02 12:30:48 +02:00
send.c	Btrfs: fix incremental send failure after deduplication	2019-07-30 18:25:11 +02:00
send.h	btrfs: replace GPL boilerplate by SPDX -- headers	2018-04-12 16:29:46 +02:00
space-info.c	btrfs: Simplify update of space_info in __reserve_metadata_bytes()	2019-07-02 12:30:53 +02:00
space-info.h	btrfs: unexport can_overcommit	2019-07-02 12:30:53 +02:00
struct-funcs.c	btrfs: prune unused includes	2018-08-06 13:12:43 +02:00
super.c	btrfs: move space_info to space-info.h	2019-07-02 12:30:51 +02:00
sysfs.c	btrfs: move space_info to space-info.h	2019-07-02 12:30:51 +02:00
sysfs.h	btrfs: drop extra enum initialization where using defaults	2018-12-17 14:51:43 +01:00
transaction.c	Btrfs: fix deadlock between fiemap and transaction commits	2019-07-30 18:25:12 +02:00
transaction.h	Btrfs: fix deadlock between fiemap and transaction commits	2019-07-30 18:25:12 +02:00
tree-checker.c	btrfs: tree-checker: Check if the file extent end overflows	2019-07-01 13:34:55 +02:00
tree-checker.h	btrfs: get fs_info from eb in btrfs_check_chunk_valid	2019-04-29 19:02:39 +02:00
tree-defrag.c	btrfs: open code now trivial btrfs_set_lock_blocking	2019-02-25 14:13:27 +01:00
tree-log.c	Btrfs: fix fsync not persisting dentry deletions due to inode evictions	2019-07-02 12:30:50 +02:00
tree-log.h	btrfs: get fs_info from trans in btrfs_set_log_full_commit	2019-04-29 19:02:41 +02:00
ulist.c	btrfs: replace GPL boilerplate by SPDX -- sources	2018-04-12 16:29:51 +02:00
ulist.h	btrfs: replace GPL boilerplate by SPDX -- headers	2018-04-12 16:29:46 +02:00
uuid-tree.c	btrfs: remove unused parameter fs_info from btrfs_extend_item	2019-04-29 19:02:50 +02:00
volumes.c	Btrfs: fix sysfs warning and missing raid sysfs directories	2019-08-07 16:25:44 +02:00
volumes.h	btrfs: Use btrfs_get_io_geometry appropriately	2019-07-02 12:30:50 +02:00
xattr.c	Btrfs: fix failure to persist compression property xattr deletion on fsync	2019-06-17 16:37:17 +02:00
xattr.h	btrfs: cleanup btrfs_setxattr_trans and drop transaction parameter	2019-04-29 19:02:44 +02:00
zlib.c	btrfs: change set_level() to bound the level passed in	2019-02-25 14:13:32 +01:00
zstd.c	btrfs: correct zstd workspace manager lock to use spin_lock_bh()	2019-05-28 18:54:09 +02:00