syzbot


possible deadlock in smc_pnet_find_ism_resource

Status: auto-obsoleted due to no activity on 2025/07/24 22:02
Subsystems: smc
[Documentation on labels]
Reported-by: syzbot+f160105b2817964a0886@syzkaller.appspotmail.com
First crash: 231d, last: 144d
Cause bisection: failed (error log, bisect log)
  
Fix bisection: failed (error log, bisect log)
  
Discussions (3)
Title Replies (including bot) Last reply
[syzbot] Monthly smc report (Apr 2025) 0 (1) 2025/04/15 10:40
[syzbot] Monthly s390 report (Feb 2025) 0 (1) 2025/02/05 12:43
[syzbot] [s390?] [net?] possible deadlock in smc_pnet_find_ism_resource 0 (1) 2025/01/19 00:44
Last patch testing requests (4)
Created Duration User Patch Repo Result
2025/07/24 21:35 25m retest repro net OK log
2025/04/25 15:49 6h42m retest repro net OK log
2025/02/08 18:36 18m retest repro net report log
2025/02/08 18:36 19m retest repro net report log
Fix bisection attempts (3)
Created Duration User Patch Repo Result
2025/05/15 13:56 17m bisect fix net error job log
2025/04/11 06:42 4h40m bisect fix net OK (0) job log log
2025/03/10 19:12 5h21m bisect fix net OK (0) job log log

Sample crash report:
======================================================
WARNING: possible circular locking dependency detected
6.13.0-syzkaller-07048-gae8b53aac327 #0 Not tainted
------------------------------------------------------
syz.1.205/6717 is trying to acquire lock:
ffffffff8fef5028
 (
rtnl_mutex){+.+.}-{4:4}
, at: pnet_find_base_ndev net/smc/smc_pnet.c:945 [inline]
, at: smc_pnet_find_ism_by_pnetid net/smc/smc_pnet.c:1101 [inline]
, at: smc_pnet_find_ism_resource+0xfa/0x3c0 net/smc/smc_pnet.c:1152

but task is already holding lock:
ffff888025380258
 (sk_lock-AF_INET
){+.+.}-{0:0}
, at: lock_sock include/net/sock.h:1624 [inline]
, at: smc_connect+0xd5/0x760 net/smc/af_smc.c:1644

which lock already depends on the new lock.


the existing dependency chain (in reverse order) is:

-> #1 (
sk_lock-AF_INET){+.+.}-{0:0}
:
       lock_sock_nested+0x3a/0xf0 net/core/sock.c:3645
       lock_sock include/net/sock.h:1624 [inline]
       sockopt_lock_sock net/core/sock.c:1133 [inline]
       sockopt_lock_sock+0x54/0x70 net/core/sock.c:1124
       do_ip_setsockopt+0x101/0x3680 net/ipv4/ip_sockglue.c:1078
       ip_setsockopt+0x59/0xf0 net/ipv4/ip_sockglue.c:1417
       udp_setsockopt+0x7d/0xd0 net/ipv4/udp.c:3053
       do_sock_setsockopt+0x222/0x480 net/socket.c:2298
       __sys_setsockopt+0x1a0/0x230 net/socket.c:2323
       __do_sys_setsockopt net/socket.c:2329 [inline]
       __se_sys_setsockopt net/socket.c:2326 [inline]
       __x64_sys_setsockopt+0xbd/0x160 net/socket.c:2326
       do_syscall_x64 arch/x86/entry/common.c:52 [inline]
       do_syscall_64+0xcd/0x250 arch/x86/entry/common.c:83
       entry_SYSCALL_64_after_hwframe+0x77/0x7f

-> #0 (rtnl_mutex){+.+.}-{4:4}:
       check_prev_add kernel/locking/lockdep.c:3163 [inline]
       check_prevs_add kernel/locking/lockdep.c:3282 [inline]
       validate_chain kernel/locking/lockdep.c:3906 [inline]
       __lock_acquire+0x249e/0x3c40 kernel/locking/lockdep.c:5228
       lock_acquire.part.0+0x11b/0x380 kernel/locking/lockdep.c:5851
       __mutex_lock_common kernel/locking/mutex.c:585 [inline]
       __mutex_lock+0x19b/0xb10 kernel/locking/mutex.c:730
       pnet_find_base_ndev net/smc/smc_pnet.c:945 [inline]
       smc_pnet_find_ism_by_pnetid net/smc/smc_pnet.c:1101 [inline]
       smc_pnet_find_ism_resource+0xfa/0x3c0 net/smc/smc_pnet.c:1152
       smc_find_ism_device net/smc/af_smc.c:1011 [inline]
       smc_find_proposal_devices net/smc/af_smc.c:1096 [inline]
       __smc_connect+0x50e/0x4890 net/smc/af_smc.c:1526
       smc_connect+0x2fc/0x760 net/smc/af_smc.c:1696
       __sys_connect_file+0x13e/0x1a0 net/socket.c:2040
       __sys_connect+0x14f/0x170 net/socket.c:2059
       __do_sys_connect net/socket.c:2065 [inline]
       __se_sys_connect net/socket.c:2062 [inline]
       __x64_sys_connect+0x72/0xb0 net/socket.c:2062
       do_syscall_x64 arch/x86/entry/common.c:52 [inline]
       do_syscall_64+0xcd/0x250 arch/x86/entry/common.c:83
       entry_SYSCALL_64_after_hwframe+0x77/0x7f

other info that might help us debug this:

 Possible unsafe locking scenario:

       CPU0                    CPU1
       ----                    ----
  lock(sk_lock-AF_INET);
                               lock(rtnl_mutex);
                               lock(sk_lock-AF_INET);
  lock(rtnl_mutex);

 *** DEADLOCK ***

1 lock held by syz.1.205/6717:
 #0: ffff888025380258 (sk_lock-AF_INET){+.+.}-{0:0}, at: lock_sock include/net/sock.h:1624 [inline]
 #0: ffff888025380258 (sk_lock-AF_INET){+.+.}-{0:0}, at: smc_connect+0xd5/0x760 net/smc/af_smc.c:1644

stack backtrace:
CPU: 0 UID: 0 PID: 6717 Comm: syz.1.205 Not tainted 6.13.0-syzkaller-07048-gae8b53aac327 #0
Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 1.16.3-debian-1.16.3-2~bpo12+1 04/01/2014
Call Trace:
 <TASK>
 __dump_stack lib/dump_stack.c:94 [inline]
 dump_stack_lvl+0x116/0x1f0 lib/dump_stack.c:120
 print_circular_bug+0x490/0x760 kernel/locking/lockdep.c:2076
 check_noncircular+0x31a/0x400 kernel/locking/lockdep.c:2208
 check_prev_add kernel/locking/lockdep.c:3163 [inline]
 check_prevs_add kernel/locking/lockdep.c:3282 [inline]
 validate_chain kernel/locking/lockdep.c:3906 [inline]
 __lock_acquire+0x249e/0x3c40 kernel/locking/lockdep.c:5228
 lock_acquire.part.0+0x11b/0x380 kernel/locking/lockdep.c:5851
 __mutex_lock_common kernel/locking/mutex.c:585 [inline]
 __mutex_lock+0x19b/0xb10 kernel/locking/mutex.c:730
 pnet_find_base_ndev net/smc/smc_pnet.c:945 [inline]
 smc_pnet_find_ism_by_pnetid net/smc/smc_pnet.c:1101 [inline]
 smc_pnet_find_ism_resource+0xfa/0x3c0 net/smc/smc_pnet.c:1152
 smc_find_ism_device net/smc/af_smc.c:1011 [inline]
 smc_find_proposal_devices net/smc/af_smc.c:1096 [inline]
 __smc_connect+0x50e/0x4890 net/smc/af_smc.c:1526
 smc_connect+0x2fc/0x760 net/smc/af_smc.c:1696
 __sys_connect_file+0x13e/0x1a0 net/socket.c:2040
 __sys_connect+0x14f/0x170 net/socket.c:2059
 __do_sys_connect net/socket.c:2065 [inline]
 __se_sys_connect net/socket.c:2062 [inline]
 __x64_sys_connect+0x72/0xb0 net/socket.c:2062
 do_syscall_x64 arch/x86/entry/common.c:52 [inline]
 do_syscall_64+0xcd/0x250 arch/x86/entry/common.c:83
 entry_SYSCALL_64_after_hwframe+0x77/0x7f
RIP: 0033:0x7f933df8cd29
Code: ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 a8 ff ff ff f7 d8 64 89 01 48
RSP: 002b:00007f933bdf6038 EFLAGS: 00000246 ORIG_RAX: 000000000000002a
RAX: ffffffffffffffda RBX: 00007f933e1a5fa0 RCX: 00007f933df8cd29
RDX: 0000000000000010 RSI: 0000000020000080 RDI: 0000000000000005
RBP: 00007f933e00e2a0 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000000000
R13: 0000000000000000 R14: 00007f933e1a5fa0 R15: 00007ffc08a200e8
 </TASK>

Crashes (5):
Time Kernel Commit Syzkaller Config Log Report Syz repro C repro VM info Assets (help?) Manager Title
2025/01/25 12:40 upstream ae8b53aac327 9fbd772e .config console log report info [disk image (non-bootable)] [vmlinux] [kernel image] ci-qemu-upstream possible deadlock in smc_pnet_find_ism_resource
2025/01/25 12:40 upstream ae8b53aac327 9fbd772e .config console log report info [disk image (non-bootable)] [vmlinux] [kernel image] ci-qemu-upstream possible deadlock in smc_pnet_find_ism_resource
2025/01/15 03:26 net 665bcfc982de 7315a7cf .config strace log report syz / log C [disk image] [vmlinux] [kernel image] ci-upstream-net-this-kasan-gce possible deadlock in smc_pnet_find_ism_resource
2025/01/15 01:48 net 665bcfc982de 7315a7cf .config strace log report syz / log C [disk image] [vmlinux] [kernel image] ci-upstream-net-this-kasan-gce possible deadlock in smc_pnet_find_ism_resource
2025/01/15 00:40 net 665bcfc982de 7315a7cf .config console log report info [disk image] [vmlinux] [kernel image] ci-upstream-net-this-kasan-gce possible deadlock in smc_pnet_find_ism_resource
* Struck through repros no longer work on HEAD.