linux

mirror of https://github.com/torvalds/linux.git synced 2024-12-03 17:41:22 +00:00

Gerald Schaefer bfe8cc1db0 mm/userfaultfd: do not access vma->vm_mm after calling handle_userfault()

Alexander reported a syzkaller / KASAN finding on s390, see below for complete output.

In do_huge_pmd_anonymous_page(), the pre-allocated pagetable will be freed in some cases. In the case of userfaultfd_missing(), this will happen after calling handle_userfault(), which might have released the mmap_lock. Therefore, the following pte_free(vma->vm_mm, pgtable) will access an unstable vma->vm_mm, which could have been freed or re-used already.

For all architectures other than s390 this will go w/o any negative impact, because pte_free() simply frees the page and ignores the passed-in mm. The implementation for SPARC32 would also access mm->page_table_lock for pte_free(), but there is no THP support in SPARC32, so the buggy code path will not be used there.

For s390, the mm->context.pgtable_list is being used to maintain the 2K pagetable fragments, and operating on an already freed or even re-used mm could result in various more or less subtle bugs due to list / pagetable corruption.

Fix this by calling pte_free() before handle_userfault(), similar to how it is already done in __do_huge_pmd_anonymous_page() for the WRITE / non-huge_zero_page case.

Commit `6b251fc96c` ("userfaultfd: call handle_userfault() for userfaultfd_missing() faults") actually introduced both, the do_huge_pmd_anonymous_page() and also __do_huge_pmd_anonymous_page() changes wrt to calling handle_userfault(), but only in the latter case it put the pte_free() before calling handle_userfault().

BUG: KASAN: use-after-free in do_huge_pmd_anonymous_page+0xcda/0xd90 mm/huge_memory.c:744
Read of size 8 at addr 00000000962d6988 by task syz-executor.0/9334

CPU: 1 PID: 9334 Comm: syz-executor.0 Not tainted 5.10.0-rc1-syzkaller-07083-g4c9720875573 #0
Hardware name: IBM 3906 M04 701 (KVM/Linux)
Call Trace:
 do_huge_pmd_anonymous_page+0xcda/0xd90 mm/huge_memory.c:744
 create_huge_pmd mm/memory.c:4256 [inline]
 __handle_mm_fault+0xe6e/0x1068 mm/memory.c:4480
 handle_mm_fault+0x288/0x748 mm/memory.c:4607
 do_exception+0x394/0xae0 arch/s390/mm/fault.c:479
 do_dat_exception+0x34/0x80 arch/s390/mm/fault.c:567
 pgm_check_handler+0x1da/0x22c arch/s390/kernel/entry.S:706
 copy_from_user_mvcos arch/s390/lib/uaccess.c:111 [inline]
 raw_copy_from_user+0x3a/0x88 arch/s390/lib/uaccess.c:174
 _copy_from_user+0x48/0xa8 lib/usercopy.c:16
 copy_from_user include/linux/uaccess.h:192 [inline]
 __do_sys_sigaltstack kernel/signal.c:4064 [inline]
 __s390x_sys_sigaltstack+0xc8/0x240 kernel/signal.c:4060
 system_call+0xe0/0x28c arch/s390/kernel/entry.S:415

Allocated by task 9334:
 slab_alloc_node mm/slub.c:2891 [inline]
 slab_alloc mm/slub.c:2899 [inline]
 kmem_cache_alloc+0x118/0x348 mm/slub.c:2904
 vm_area_dup+0x9c/0x2b8 kernel/fork.c:356
 __split_vma+0xba/0x560 mm/mmap.c:2742
 split_vma+0xca/0x108 mm/mmap.c:2800
 mlock_fixup+0x4ae/0x600 mm/mlock.c:550
 apply_vma_lock_flags+0x2c6/0x398 mm/mlock.c:619
 do_mlock+0x1aa/0x718 mm/mlock.c:711
 __do_sys_mlock2 mm/mlock.c:738 [inline]
 __s390x_sys_mlock2+0x86/0xa8 mm/mlock.c:728
 system_call+0xe0/0x28c arch/s390/kernel/entry.S:415

Freed by task 9333:
 slab_free mm/slub.c:3142 [inline]
 kmem_cache_free+0x7c/0x4b8 mm/slub.c:3158
 __vma_adjust+0x7b2/0x2508 mm/mmap.c:960
 vma_merge+0x87e/0xce0 mm/mmap.c:1209
 userfaultfd_release+0x412/0x6b8 fs/userfaultfd.c:868
 __fput+0x22c/0x7a8 fs/file_table.c:281
 task_work_run+0x200/0x320 kernel/task_work.c:151
 tracehook_notify_resume include/linux/tracehook.h:188 [inline]
 do_notify_resume+0x100/0x148 arch/s390/kernel/signal.c:538
 system_call+0xe6/0x28c arch/s390/kernel/entry.S:416

The buggy address belongs to the object at 00000000962d6948
 which belongs to the cache vm_area_struct of size 200
The buggy address is located 64 bytes inside of
 200-byte region [00000000962d6948, 00000000962d6a10)
The buggy address belongs to the page:
page:00000000313a09fe refcount:1 mapcount:0 mapping:0000000000000000 index:0x0 pfn:0x962d6
flags: 0x3ffff00000000200(slab)
raw: 3ffff00000000200 000040000257e080 0000000c0000000c 000000008020ba00
raw: 0000000000000000 000f001e00000000 ffffffff00000001 0000000096959501
page dumped because: kasan: bad access detected
page->mem_cgroup:0000000096959501

Memory state around the buggy address:
 00000000962d6880: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
 00000000962d6900: 00 fc fc fc fc fc fc fc fc fa fb fb fb fb fb fb
>00000000962d6980: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
                      ^
 00000000962d6a00: fb fb fc fc fc fc fc fc fc fc 00 00 00 00 00 00
 00000000962d6a80: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
==================================================================

Fixes: `6b251fc96c` ("userfaultfd: call handle_userfault() for userfaultfd_missing() faults")
Reported-by: Alexander Egorenkov <egorenar@linux.ibm.com>
Signed-off-by: Gerald Schaefer <gerald.schaefer@linux.ibm.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Cc: Andrea Arcangeli <aarcange@redhat.com>
Cc: Heiko Carstens <hca@linux.ibm.com>
Cc: <stable@vger.kernel.org> [4.3+]
Link: https://lkml.kernel.org/r/20201110190329.11920-1-gerald.schaefer@linux.ibm.com
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>

2020-11-22 10:48:22 -08:00
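
The ordering issue described in the commit message above can be illustrated with a small userspace model. This is only a sketch, not the kernel patch: `mock_mm`, `mock_vma`, `mock_handle_userfault()`, `mock_pte_free()`, `buggy_order()` and `fixed_order()` are hypothetical stand-ins invented for illustration, and the `valid` flag is an assumed way of modelling "the vma may have been freed or re-used once the lock was dropped".

```c
#include <stdbool.h>

/* Hypothetical stand-ins for kernel objects; none of these are real kernel APIs. */
struct mock_mm {
    int pgtable_frees;      /* counts successful page-table frees */
};

struct mock_vma {
    struct mock_mm *mm;     /* models vma->vm_mm */
    bool valid;             /* false once the vma may have been freed or re-used */
};

/*
 * Models handle_userfault(): it may release the lock, after which the
 * caller must treat the vma (and anything reached through it) as unstable.
 */
static void mock_handle_userfault(struct mock_vma *vma)
{
    vma->valid = false;
}

/*
 * Models a pte_free() implementation that really uses the mm, as the
 * commit message describes for s390. Returns false where the real
 * kernel would perform a use-after-free.
 */
static bool mock_pte_free(struct mock_vma *vma)
{
    if (!vma->valid)
        return false;       /* vma->vm_mm is no longer safe to touch */
    vma->mm->pgtable_frees++;
    return true;
}

/* Ordering before the fix: the free happens after the lock may be dropped. */
static bool buggy_order(struct mock_vma *vma)
{
    mock_handle_userfault(vma);
    return mock_pte_free(vma);
}

/* Ordering after the fix: free first, while the vma is still stable. */
static bool fixed_order(struct mock_vma *vma)
{
    bool ok = mock_pte_free(vma);

    mock_handle_userfault(vma);
    return ok;
}
```

In this model, `buggy_order()` reports the unsafe access and leaves the free counter untouched, while `fixed_order()` completes the free before the vma becomes unstable. The real fix is the same reordering applied in do_huge_pmd_anonymous_page().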
..
kasan	mm: kasan: do not panic if both panic_on_warn and kasan_multishot set	2020-10-13 18:38:32 -07:00
backing-dev.c	bdi: replace BDI_CAP_NO_{WRITEBACK,ACCT_DIRTY} with a single flag	2020-09-24 13:43:39 -06:00
balloon_compaction.c	mm/balloon_compaction: suppress allocation warnings	2019-09-04 07:42:01 -04:00
cleancache.c	Driver Core and debugfs changes for 5.3-rc1	2019-07-12 12:24:03 -07:00
cma_debug.c	debugfs: make sure we can remove u32_array files cleanly	2020-07-10 13:54:00 -07:00
cma.c	cma: don't quit at first error when activating reserved areas	2020-08-12 10:57:57 -07:00
cma.h	mm: cma: use CMA_MAX_NAME to define the length of cma name array	2020-09-01 09:19:43 +02:00
compaction.c	mm/compaction: stop isolation if too many pages are isolated and we have pages to migrate	2020-11-14 11:26:03 -08:00
debug_page_ref.c
debug_vm_pgtable.c	mm/debug_vm_pgtable: avoid doing memory allocation with pgtable_t mapped.	2020-10-16 11:11:14 -07:00
debug.c	mm, dump_page: rename head_mapcount() --> head_compound_mapcount()	2020-10-13 18:38:29 -07:00
dmapool.c	mm/dmapool.c: replace hard coded function name with __func__	2020-10-13 18:38:32 -07:00
early_ioremap.c	mm/early_ioremap.c: use %pa to print resource_size_t variables	2020-01-31 10:30:38 -08:00
fadvise.c	mm, fadvise: improve the expensive remote LRU cache draining after FADV_DONTNEED	2020-10-13 18:38:29 -07:00
failslab.c
filemap.c	mm: never attempt async page lock if we've transferred data already	2020-11-16 13:39:34 -07:00
frame_vector.c	mmap locking API: convert mmap_sem comments	2020-06-09 09:39:14 -07:00
frontswap.c	mm/frontswap: mark various intentional data races	2020-08-14 19:56:56 -07:00
gup_benchmark.c	mm/gup_benchmark: take the mmap lock around GUP	2020-10-18 09:27:09 -07:00
gup.c	mm/gup: use unpin_user_pages() in __gup_longterm_locked()	2020-11-14 11:26:03 -08:00
highmem.c	mm/highmem.c: clean up endif comments	2020-10-16 11:11:18 -07:00
hmm.c	mm: do page fault accounting in handle_mm_fault	2020-08-12 10:58:02 -07:00
huge_memory.c	mm/userfaultfd: do not access vma->vm_mm after calling handle_userfault()	2020-11-22 10:48:22 -08:00
hugetlb_cgroup.c	hugetlb_cgroup: convert comma to semicolon	2020-08-21 09:52:52 -07:00
hugetlb.c	hugetlbfs: fix anon huge page migration race	2020-11-14 11:26:04 -08:00
hwpoison-inject.c	mm,hwpoison-inject: don't pin for hwpoison_filter	2020-10-16 11:11:16 -07:00
init-mm.c	mmap locking API: add MMAP_LOCK_INITIALIZER	2020-06-09 09:39:14 -07:00
internal.h	mm: rename page_order() to buddy_order()	2020-10-16 11:11:19 -07:00
interval_tree.c
ioremap.c	mm: move p?d_alloc_track to separate header file	2020-08-07 11:33:26 -07:00
Kconfig	mm: add a vmap_pfn function	2020-10-18 09:27:10 -07:00
Kconfig.debug	treewide: replace '---help---' in Kconfig files with 'help'	2020-06-14 01:57:21 +09:00
khugepaged.c	mm: remove the now-unnecessary mmget_still_valid() hack	2020-10-16 11:11:22 -07:00
kmemleak.c	mm/kmemleak: rely on rcu for task stack scanning	2020-10-13 18:38:27 -07:00
ksm.c	docs: get rid of :c:type explicit declarations for structs	2020-10-15 07:49:40 +02:00
list_lru.c	mm/list_lru: fix a data race in list_lru_count_one	2020-08-14 19:56:57 -07:00
maccess.c	uaccess: add force_uaccess_{begin,end} helpers	2020-08-12 10:57:59 -07:00
madvise.c	mm/madvise: fix memory leak from process_madvise	2020-11-22 10:48:22 -08:00
Makefile	mm,kmemleak-test.c: move kmemleak-test.c to samples dir	2020-10-13 18:38:27 -07:00
mapping_dirty_helpers.c	mm/mapping_dirty_helpers: update huge page-table entry callbacks	2020-04-02 09:35:29 -07:00
memblock.c	memblock: get rid of a :c:type leftover	2020-10-15 07:49:46 +02:00
memcontrol.c	mm: memcg/slab: fix root memcg vmstats	2020-11-22 10:48:22 -08:00
memfd.c	mm: page cache: store only head pages in i_pages	2019-09-24 15:54:08 -07:00
memory_hotplug.c	mm: fix phys_to_target_node() and memory_add_physaddr_to_nid() exports	2020-11-22 10:48:22 -08:00
memory-failure.c	hugetlbfs: fix anon huge page migration race	2020-11-14 11:26:04 -08:00
memory.c	mm: allow a NULL fn callback in apply_to_page_range	2020-10-18 09:27:10 -07:00
mempolicy.c	mm: mempolicy: fix potential pte_unmap_unlock pte error	2020-11-02 12:14:19 -08:00
mempool.c	mm/mempool: add 'else' to split mutually exclusive case	2020-10-13 18:38:34 -07:00
memremap.c	mm/mremap_pages: fix static key devmap_managed_key updates	2020-11-02 12:14:18 -08:00
memtest.c
migrate.c	hugetlbfs: fix anon huge page migration race	2020-11-14 11:26:04 -08:00
mincore.c	mm: factor find_get_incore_page out of mincore_page	2020-10-13 18:38:29 -07:00
mlock.c	mlock: fix unevictable_pgs event counts on THP	2020-09-19 13:13:38 -07:00
mm_init.c	mm: adjust vm_committed_as_batch according to vm overcommit policy	2020-08-07 11:33:26 -07:00
mmap.c	mm/mmap: add inline munmap_vma_range() for code readability	2020-10-18 09:27:09 -07:00
mmu_gather.c	mmap locking API: convert mmap_sem comments	2020-06-09 09:39:14 -07:00
mmu_notifier.c	mm/mmu_notifier: fix mmget() assert in __mmu_interval_notifier_insert	2020-10-16 11:11:17 -07:00
mmzone.c
mprotect.c	mm: Introduce arch_validate_flags()	2020-09-04 12:46:07 +01:00
mremap.c	mm/mremap: start addresses are properly aligned	2020-08-07 11:33:27 -07:00
msync.c	mmap locking API: use coccinelle to convert mmap_sem rwsem call sites	2020-06-09 09:39:14 -07:00
nommu.c	mm: remove alloc_vm_area	2020-10-18 09:27:10 -07:00
oom_kill.c	mm, oom_adj: don't loop through tasks in __set_oom_adj when not necessary	2020-10-13 18:38:35 -07:00
page_alloc.c	page_frag: Recover from memory pressure	2020-11-18 15:21:56 -08:00
page_counter.c	mm/page_counter: correct the obsolete func name in the comment of page_counter_try_charge()	2020-10-13 18:38:30 -07:00
page_ext.c	mm/page_ext.c: drop pfn_present() check when onlining	2020-04-07 10:43:40 -07:00
page_idle.c	mm/page_idle.c: skip offline pages	2020-06-08 11:05:55 -07:00
page_io.c	mm/page_io.c: remove useless out label in __swap_writepage()	2020-10-13 18:38:30 -07:00
page_isolation.c	mm: rename page_order() to buddy_order()	2020-10-16 11:11:19 -07:00
page_owner.c	mm: rename page_order() to buddy_order()	2020-10-16 11:11:19 -07:00
page_poison.c	mm/page_poison.c: replace bool variable with static key	2020-10-16 11:11:17 -07:00
page_reporting.c	mm: rename page_order() to buddy_order()	2020-10-16 11:11:19 -07:00
page_reporting.h	mm: introduce include/linux/pgtable.h	2020-06-09 09:39:13 -07:00
page_vma_mapped.c	mm: replace hpage_nr_pages with thp_nr_pages	2020-08-14 19:56:56 -07:00
page-writeback.c	mm/page-writeback: support tail pages in wait_for_stable_page	2020-10-16 11:11:15 -07:00
pagewalk.c	mmap locking API: convert mmap_sem comments	2020-06-09 09:39:14 -07:00
percpu-internal.h	mm: memcg/percpu: account percpu memory to memory cgroups	2020-08-12 10:57:55 -07:00
percpu-km.c	mm: memcg/percpu: account percpu memory to memory cgroups	2020-08-12 10:57:55 -07:00
percpu-stats.c	mm: memcg/percpu: account percpu memory to memory cgroups	2020-08-12 10:57:55 -07:00
percpu-vm.c	mm: memcg/percpu: account percpu memory to memory cgroups	2020-08-12 10:57:55 -07:00
percpu.c	percpu: convert flexible array initializers to use struct_size()	2020-10-30 23:02:28 +00:00
pgalloc-track.h	mm: move p?d_alloc_track to separate header file	2020-08-07 11:33:26 -07:00
pgtable-generic.c	mm: introduce include/linux/pgtable.h	2020-06-09 09:39:13 -07:00
process_vm_access.c	mm/process_vm_access: Add missing #include <linux/compat.h>	2020-10-27 12:41:29 -07:00
ptdump.c	mmap locking API: use coccinelle to convert mmap_sem rwsem call sites	2020-06-09 09:39:14 -07:00
readahead.c	mm: use limited read-ahead to satisfy read	2020-10-17 13:49:08 -06:00
rmap.c	hugetlbfs: fix anon huge page migration race	2020-11-14 11:26:04 -08:00
rodata_test.c	mm/rodata_test.c: fix missing function declaration	2020-08-21 09:52:53 -07:00
shmem.c	fs: add a filesystem flag for THPs	2020-10-16 11:11:15 -07:00
shuffle.c	mm: rename page_order() to buddy_order()	2020-10-16 11:11:19 -07:00
shuffle.h	mm/shuffle: remove dynamic reconfiguration	2020-08-07 11:33:29 -07:00
slab_common.c	mm/slab_common.c: delete duplicated word	2020-08-12 10:57:58 -07:00
slab.c	mm: fix some comments formatting	2020-10-16 11:11:19 -07:00
slab.h	mm: kmem: move memcg_kmem_bypass() calls to get_mem/obj_cgroup_from_current()	2020-10-18 09:27:09 -07:00
slob.c	mm: memcg: convert vmstat slab counters to bytes	2020-08-07 11:33:24 -07:00
slub.c	mm/slub: fix panic in slab_alloc_node()	2020-11-14 11:26:03 -08:00
sparse-vmemmap.c	mm/sparse: only sub-section aligned range would be populated	2020-08-07 11:33:27 -07:00
sparse.c	mm/memory_hotplug: guard more declarations by CONFIG_MEMORY_HOTPLUG	2020-10-16 11:11:18 -07:00
swap_cgroup.c	mm: memcontrol: make swap tracking an integral part of memory control	2020-06-03 20:09:48 -07:00
swap_slots.c	mm/swap_slots.c: remove always zero and unused return value of enable_swap_slots_cache()	2020-10-13 18:38:30 -07:00
swap_state.c	mm: fix some broken comments	2020-10-16 11:11:19 -07:00
swap.c	mm: move call to compound_head() in release_pages()	2020-10-13 18:38:33 -07:00
swapfile.c	mm/swapfile.c: fix potential memory leak in sys_swapon	2020-10-13 18:38:30 -07:00
truncate.c	mm/truncate.c: make __invalidate_mapping_pages() static	2020-11-02 12:14:19 -08:00
usercopy.c	mm/usercopy.c: delete duplicated word	2020-08-12 10:57:58 -07:00
userfaultfd.c	mm/vmscan: protect the workingset on anonymous LRU	2020-08-12 10:57:55 -07:00
util.c	mm/util.c: update the kerneldoc for kstrdup_const()	2020-10-16 11:11:17 -07:00
vmacache.c	kernel: better document the use_mm/unuse_mm API contract	2020-06-10 19:14:18 -07:00
vmalloc.c	mm: remove the filename in the top of file comment in vmalloc.c	2020-10-18 09:27:10 -07:00
vmpressure.c	mm: vmpressure: use mem_cgroup_is_root API	2020-04-02 09:35:31 -07:00
vmscan.c	mm/vmscan: fix NR_ISOLATED_FILE corruption on 64-bit	2020-11-14 11:26:03 -08:00
vmstat.c	mm/vmstat.c: use helper macro abs()	2020-10-16 11:11:17 -07:00
workingset.c	XArray updates for 5.9	2020-10-20 14:39:37 -07:00
z3fold.c	mm/z3fold.c: use xx_zalloc instead xx_alloc and memset	2020-10-13 18:38:34 -07:00
zbud.c	mm/zbud: remove redundant initialization	2020-10-13 18:38:34 -07:00
zpool.c	mm/zpool.c: delete duplicated word and fix grammar	2020-08-12 10:57:58 -07:00
zsmalloc.c	zsmalloc: switch from alloc_vm_area to get_vm_area	2020-10-18 09:27:10 -07:00
zswap.c	mm/zswap: allow setting default status, compressor and allocator in Kconfig	2020-04-07 10:43:41 -07:00