syzbot


possible deadlock in xfs_ilock (3)

Status: upstream: reported on 2024/11/18 06:10
Subsystems: xfs
[Documentation on labels]
Reported-by: syzbot+b143b25b374fbc5c3a04@syzkaller.appspotmail.com
First crash: 7d00h, last: 7d00h
Discussions (1)
Title Replies (including bot) Last reply
[syzbot] [xfs?] possible deadlock in xfs_ilock (3) 0 (1) 2024/11/18 06:10
Similar bugs (2)
Kernel Title Repro Cause bisect Fix bisect Count Last Reported Patched Status
upstream possible deadlock in xfs_ilock (2) xfs 1422 147d 220d 0/28 auto-obsoleted due to no activity on 2024/09/04 06:27
upstream possible deadlock in xfs_ilock xfs 30 225d 251d 25/28 fixed on 2024/04/10 03:59

Sample crash report:
XFS (loop8): Mounting V5 Filesystem bfdc47fc-10d8-4eed-a562-11a831b3f791
XFS (loop8): Ending clean mount
======================================================
WARNING: possible circular locking dependency detected
6.12.0-rc7-syzkaller-00042-gf1b785f4c787 #0 Not tainted
------------------------------------------------------
syz.8.662/10149 is trying to acquire lock:
ffffffff8e347080 (fs_reclaim){+.+.}-{0:0}, at: might_alloc include/linux/sched/mm.h:318 [inline]
ffffffff8e347080 (fs_reclaim){+.+.}-{0:0}, at: slab_pre_alloc_hook mm/slub.c:4036 [inline]
ffffffff8e347080 (fs_reclaim){+.+.}-{0:0}, at: slab_alloc_node mm/slub.c:4114 [inline]
ffffffff8e347080 (fs_reclaim){+.+.}-{0:0}, at: __do_kmalloc_node mm/slub.c:4263 [inline]
ffffffff8e347080 (fs_reclaim){+.+.}-{0:0}, at: __kmalloc_noprof+0xb1/0x400 mm/slub.c:4276

but task is already holding lock:
ffff88805a939858 (&xfs_dir_ilock_class){++++}-{3:3}, at: xfs_ilock+0xf5/0x210 fs/xfs/xfs_inode.c:166

which lock already depends on the new lock.


the existing dependency chain (in reverse order) is:

-> #1 (&xfs_dir_ilock_class){++++}-{3:3}:
       down_write_nested+0x97/0x210 kernel/locking/rwsem.c:1693
       xfs_ilock+0x198/0x210 fs/xfs/xfs_inode.c:164
       xfs_reclaim_inode fs/xfs/xfs_icache.c:981 [inline]
       xfs_icwalk_process_inode fs/xfs/xfs_icache.c:1675 [inline]
       xfs_icwalk_ag+0xca6/0x1780 fs/xfs/xfs_icache.c:1757
       xfs_icwalk fs/xfs/xfs_icache.c:1805 [inline]
       xfs_reclaim_inodes_nr+0x1bc/0x300 fs/xfs/xfs_icache.c:1047
       super_cache_scan+0x40c/0x550 fs/super.c:227
       do_shrink_slab+0x452/0x11c0 mm/shrinker.c:437
       shrink_slab+0x32b/0x12a0 mm/shrinker.c:664
       shrink_one+0x47e/0x7b0 mm/vmscan.c:4824
       shrink_many mm/vmscan.c:4885 [inline]
       lru_gen_shrink_node mm/vmscan.c:4963 [inline]
       shrink_node+0xb23/0x3a90 mm/vmscan.c:5943
       kswapd_shrink_node mm/vmscan.c:6771 [inline]
       balance_pgdat+0xc1f/0x18f0 mm/vmscan.c:6963
       kswapd+0x5ea/0xbf0 mm/vmscan.c:7232
       kthread+0x2c4/0x3a0 kernel/kthread.c:389
       ret_from_fork+0x48/0x80 arch/x86/kernel/process.c:147
       ret_from_fork_asm+0x1a/0x30 arch/x86/entry/entry_64.S:244

-> #0 (fs_reclaim){+.+.}-{0:0}:
       check_prev_add kernel/locking/lockdep.c:3161 [inline]
       check_prevs_add kernel/locking/lockdep.c:3280 [inline]
       validate_chain kernel/locking/lockdep.c:3904 [inline]
       __lock_acquire+0x250b/0x3ce0 kernel/locking/lockdep.c:5202
       lock_acquire.part.0+0x11b/0x380 kernel/locking/lockdep.c:5825
       __fs_reclaim_acquire mm/page_alloc.c:3836 [inline]
       fs_reclaim_acquire+0x102/0x150 mm/page_alloc.c:3850
       might_alloc include/linux/sched/mm.h:318 [inline]
       slab_pre_alloc_hook mm/slub.c:4036 [inline]
       slab_alloc_node mm/slub.c:4114 [inline]
       __do_kmalloc_node mm/slub.c:4263 [inline]
       __kmalloc_noprof+0xb1/0x400 mm/slub.c:4276
       kmalloc_noprof include/linux/slab.h:882 [inline]
       xfs_attr_shortform_list fs/xfs/xfs_attr_list.c:117 [inline]
       xfs_attr_list_ilocked+0x9a0/0x1b00 fs/xfs/xfs_attr_list.c:569
       xfs_attr_list+0x1f9/0x2b0 fs/xfs/xfs_attr_list.c:595
       xfs_vn_listxattr+0x11f/0x1c0 fs/xfs/xfs_xattr.c:341
       vfs_listxattr+0xba/0x140 fs/xattr.c:493
       listxattr+0x69/0x190 fs/xattr.c:841
       path_listxattr+0xc0/0x160 fs/xattr.c:865
       do_syscall_x64 arch/x86/entry/common.c:52 [inline]
       do_syscall_64+0xcd/0x250 arch/x86/entry/common.c:83
       entry_SYSCALL_64_after_hwframe+0x77/0x7f

other info that might help us debug this:

 Possible unsafe locking scenario:

       CPU0                    CPU1
       ----                    ----
  rlock(&xfs_dir_ilock_class);
                               lock(fs_reclaim);
                               lock(&xfs_dir_ilock_class);
  lock(fs_reclaim);

 *** DEADLOCK ***

1 lock held by syz.8.662/10149:
 #0: ffff88805a939858 (&xfs_dir_ilock_class){++++}-{3:3}, at: xfs_ilock+0xf5/0x210 fs/xfs/xfs_inode.c:166

stack backtrace:
CPU: 1 UID: 0 PID: 10149 Comm: syz.8.662 Not tainted 6.12.0-rc7-syzkaller-00042-gf1b785f4c787 #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 10/30/2024
Call Trace:
 <TASK>
 __dump_stack lib/dump_stack.c:94 [inline]
 dump_stack_lvl+0x116/0x1f0 lib/dump_stack.c:120
 print_circular_bug+0x41c/0x610 kernel/locking/lockdep.c:2074
 check_noncircular+0x31a/0x400 kernel/locking/lockdep.c:2206
 check_prev_add kernel/locking/lockdep.c:3161 [inline]
 check_prevs_add kernel/locking/lockdep.c:3280 [inline]
 validate_chain kernel/locking/lockdep.c:3904 [inline]
 __lock_acquire+0x250b/0x3ce0 kernel/locking/lockdep.c:5202
 lock_acquire.part.0+0x11b/0x380 kernel/locking/lockdep.c:5825
 __fs_reclaim_acquire mm/page_alloc.c:3836 [inline]
 fs_reclaim_acquire+0x102/0x150 mm/page_alloc.c:3850
 might_alloc include/linux/sched/mm.h:318 [inline]
 slab_pre_alloc_hook mm/slub.c:4036 [inline]
 slab_alloc_node mm/slub.c:4114 [inline]
 __do_kmalloc_node mm/slub.c:4263 [inline]
 __kmalloc_noprof+0xb1/0x400 mm/slub.c:4276
 kmalloc_noprof include/linux/slab.h:882 [inline]
 xfs_attr_shortform_list fs/xfs/xfs_attr_list.c:117 [inline]
 xfs_attr_list_ilocked+0x9a0/0x1b00 fs/xfs/xfs_attr_list.c:569
 xfs_attr_list+0x1f9/0x2b0 fs/xfs/xfs_attr_list.c:595
 xfs_vn_listxattr+0x11f/0x1c0 fs/xfs/xfs_xattr.c:341
 vfs_listxattr+0xba/0x140 fs/xattr.c:493
 listxattr+0x69/0x190 fs/xattr.c:841
 path_listxattr+0xc0/0x160 fs/xattr.c:865
 do_syscall_x64 arch/x86/entry/common.c:52 [inline]
 do_syscall_64+0xcd/0x250 arch/x86/entry/common.c:83
 entry_SYSCALL_64_after_hwframe+0x77/0x7f
RIP: 0033:0x7f81fa37e719
Code: ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 a8 ff ff ff f7 d8 64 89 01 48
RSP: 002b:00007f81fb1cf038 EFLAGS: 00000246 ORIG_RAX: 00000000000000c2
RAX: ffffffffffffffda RBX: 00007f81fa535f80 RCX: 00007f81fa37e719
RDX: 0000000000000014 RSI: 0000000020000480 RDI: 0000000020000440
RBP: 00007f81fa3f175e R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000000000
R13: 0000000000000000 R14: 00007f81fa535f80 R15: 00007ffe5f9dc7d8
 </TASK>

Crashes (1):
Time Kernel Commit Syzkaller Config Log Report Syz repro C repro VM info Assets (help?) Manager Title
2024/11/14 06:00 upstream f1b785f4c787 a8c99394 .config console log report info [disk image] [vmlinux] [kernel image] ci-upstream-kasan-badwrites-root possible deadlock in xfs_ilock
* Struck through repros no longer work on HEAD.