syzbot


INFO: rcu detected stall in sys_sendmsg

Status: upstream: reported C repro on 2024/03/19 21:37
Bug presence: origin:upstream
[Documentation on labels]
Reported-by: syzbot+fd4fc65c579eec307cfd@syzkaller.appspotmail.com
First crash: 123d, last: 23d
Bug presence (1)
Date Name Commit Repro Result
2024/05/01 upstream (ToT) 18daea77cca6 C [report] INFO: rcu detected stall in corrupted
Similar bugs (10)
Kernel Title Repro Cause bisect Fix bisect Count Last Reported Patched Status
upstream INFO: rcu detected stall in sys_sendmsg (2) cgroups mm 5 1691d 1692d 0/27 closed as invalid on 2019/12/04 14:14
upstream INFO: rcu detected stall in sys_sendmsg (3) kernel 1 1656d 1656d 0/27 closed as invalid on 2020/01/08 05:33
linux-6.1 INFO: rcu detected stall in sys_sendmsg 2 2d13h 17d 0/3 upstream: reported on 2024/07/04 07:16
upstream INFO: rcu detected stall in sys_sendmsg net C done 2 1772d 1772d 13/27 fixed on 2019/10/09 10:54
android-6-1 BUG: soft lockup in sys_sendmsg origin:upstream C 3 76d 103d 0/2 upstream: reported C repro on 2024/04/09 06:46
linux-6.1 BUG: soft lockup in sys_sendmsg 2 405d 411d 0/3 auto-obsoleted due to no activity on 2023/09/20 17:26
android-5-10 BUG: soft lockup in sys_sendmsg C 32 7d14h 121d 0/2 upstream: reported C repro on 2024/03/22 10:41
upstream BUG: soft lockup in sys_sendmsg tipc batman C 3 119d 160d 26/27 fixed on 2024/05/22 23:36
android-5-15 BUG: soft lockup in sys_sendmsg origin:upstream C 11 5d02h 121d 0/2 upstream: reported C repro on 2024/03/22 10:44
linux-6.1 BUG: soft lockup in sys_sendmsg (2) origin:upstream C done 1 105d 105d 3/3 fixed on 2024/05/15 09:17
Fix bisection attempts (2)
Created Duration User Patch Repo Result
2024/06/27 20:16 1h14m bisect fix linux-5.15.y OK (0) job log log
2024/05/21 13:57 1h56m bisect fix linux-5.15.y OK (0) job log log

Sample crash report:
rcu: INFO: rcu_preempt self-detected stall on CPU
rcu: 	0-...!: (1 GPs behind) idle=64b/1/0x4000000000000000 softirq=5607/5611 fqs=0 
	(t=10500 jiffies g=5437 q=187)
rcu: rcu_preempt kthread timer wakeup didn't happen for 10499 jiffies! g5437 f0x0 RCU_GP_WAIT_FQS(5) ->state=0x402
rcu: 	Possible timer handling issue on cpu=1 timer-softirq=4132
rcu: rcu_preempt kthread starved for 10500 jiffies! g5437 f0x0 RCU_GP_WAIT_FQS(5) ->state=0x402 ->cpu=1
rcu: 	Unless rcu_preempt kthread gets sufficient CPU time, OOM is now expected behavior.
rcu: RCU grace-period kthread stack dump:
task:rcu_preempt     state:I stack:27000 pid:   15 ppid:     2 flags:0x00004000
Call Trace:
 <TASK>
 context_switch kernel/sched/core.c:5030 [inline]
 __schedule+0x12c4/0x45b0 kernel/sched/core.c:6376
 schedule+0x11b/0x1f0 kernel/sched/core.c:6459
 schedule_timeout+0x1b9/0x300 kernel/time/timer.c:1884
 rcu_gp_fqs_loop+0x2bf/0x1080 kernel/rcu/tree.c:1972
 rcu_gp_kthread+0xa4/0x360 kernel/rcu/tree.c:2145
 kthread+0x3f6/0x4f0 kernel/kthread.c:319
 ret_from_fork+0x1f/0x30 arch/x86/entry/entry_64.S:298
 </TASK>
rcu: Stack dump where RCU GP kthread last ran:
Sending NMI from CPU 0 to CPUs 1:
NMI backtrace for cpu 1
CPU: 1 PID: 5062 Comm: syz-executor299 Not tainted 5.15.152-syzkaller #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 02/29/2024
RIP: 0010:mark_lock+0x3/0x340 kernel/locking/lockdep.c:4552
Code: 44 89 f9 80 e1 07 80 c1 03 38 c1 0f 8c 73 ff ff ff 4c 89 ff e8 ae fd 66 00 e9 66 ff ff ff e8 64 d0 b8 08 0f 1f 40 00 55 41 57 <41> 56 41 55 41 54 53 48 83 ec 10 49 89 f7 48 89 3c 24 49 bd 00 00
RSP: 0018:ffffc90003606308 EFLAGS: 00000002
RAX: 0000000000048656 RBX: ffff88807587c698 RCX: ffffffff8162f928
RDX: 0000000000000002 RSI: ffff88807587c698 RDI: ffff88807587bb80
RBP: ffffc900036063e0 R08: dffffc0000000000 R09: fffffbfff1f79e32
R10: 0000000000000000 R11: dffffc0000000001 R12: ffff88807587c6b8
R13: 0000000000000001 R14: ffff88807587c668 R15: 1ffff1100eb0f8cd
FS:  00007fb00dc0b6c0(0000) GS:ffff8880b9b00000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00007fb00dc0bd58 CR3: 000000007bd08000 CR4: 00000000003506e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Call Trace:
 <NMI>
 </NMI>
 <TASK>
 mark_held_locks kernel/locking/lockdep.c:4193 [inline]
 __trace_hardirqs_on_caller kernel/locking/lockdep.c:4211 [inline]
 lockdep_hardirqs_on_prepare+0x27d/0x7a0 kernel/locking/lockdep.c:4278
 trace_hardirqs_on+0x67/0x80 kernel/trace/trace_preemptirq.c:49
 __local_bh_enable_ip+0x164/0x1f0 kernel/softirq.c:388
 __raw_spin_trylock_bh include/linux/spinlock_api_smp.h:186 [inline]
 _raw_spin_trylock_bh+0x5d/0x70 kernel/locking/spinlock.c:146
 spin_trylock_bh include/linux/spinlock.h:423 [inline]
 tipc_sk_rcv+0x454/0x1d40 net/tipc/socket.c:2496
 tipc_node_xmit+0x1b7/0xf20 net/tipc/node.c:1703
 tipc_node_xmit_skb net/tipc/node.c:1768 [inline]
 tipc_node_distr_xmit+0x309/0x440 net/tipc/node.c:1783
 tipc_sk_rcv+0x1629/0x1d40 net/tipc/socket.c:2501
 tipc_node_xmit+0x1b7/0xf20 net/tipc/node.c:1703
 tipc_sk_push_backlog+0x507/0x920 net/tipc/socket.c:1316
 tipc_sk_conn_proto_rcv net/tipc/socket.c:1370 [inline]
 tipc_sk_proto_rcv+0xa8e/0x1820 net/tipc/socket.c:2158
 tipc_sk_filter_rcv+0x315b/0x33d0 net/tipc/socket.c:2352
 tipc_sk_enqueue net/tipc/socket.c:2445 [inline]
 tipc_sk_rcv+0x8a7/0x1d40 net/tipc/socket.c:2497
 tipc_node_xmit+0x1b7/0xf20 net/tipc/node.c:1703
 tipc_node_xmit_skb net/tipc/node.c:1768 [inline]
 tipc_node_distr_xmit+0x309/0x440 net/tipc/node.c:1783
 tipc_sk_backlog_rcv+0x199/0x220 net/tipc/socket.c:2412
 sk_backlog_rcv include/net/sock.h:1059 [inline]
 __release_sock+0x198/0x4b0 net/core/sock.c:2713
 release_sock+0x5d/0x1c0 net/core/sock.c:3254
 sock_setsockopt+0x155d/0x2f10 net/core/sock.c:1378
 __sys_setsockopt+0x5dd/0x990 net/socket.c:2194
 __do_sys_setsockopt net/socket.c:2209 [inline]
 __se_sys_setsockopt net/socket.c:2206 [inline]
 __x64_sys_setsockopt+0xb1/0xc0 net/socket.c:2206
 do_syscall_x64 arch/x86/entry/common.c:50 [inline]
 do_syscall_64+0x3d/0xb0 arch/x86/entry/common.c:80
 entry_SYSCALL_64_after_hwframe+0x61/0xcb
RIP: 0033:0x7fb00dc6b3e9
Code: 28 00 00 00 75 05 48 83 c4 28 c3 e8 51 18 00 00 90 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 b0 ff ff ff f7 d8 64 89 01 48
RSP: 002b:00007fb00dc0b228 EFLAGS: 00000246 ORIG_RAX: 0000000000000036
RAX: ffffffffffffffda RBX: 00007fb00dc0b6c0 RCX: 00007fb00dc6b3e9
RDX: 0000000000000021 RSI: 0000000000000001 RDI: 0000000000000003
RBP: 00007fb00dcf5338 R08: 0000000000000004 R09: 0000000000000000
R10: 0000000020000540 R11: 0000000000000246 R12: 00007fb00dcf5330
R13: 00007fb00dcf533c R14: 00007ffd462419a0 R15: 00007ffd46241a88
 </TASK>
INFO: NMI handler (nmi_cpu_backtrace_handler) took too long to run: 2.105 msecs
NMI backtrace for cpu 0
CPU: 0 PID: 5061 Comm: syz-executor299 Not tainted 5.15.152-syzkaller #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 02/29/2024
Call Trace:
 <IRQ>
 __dump_stack lib/dump_stack.c:88 [inline]
 dump_stack_lvl+0x1e3/0x2cb lib/dump_stack.c:106
 nmi_cpu_backtrace+0x46a/0x4a0 lib/nmi_backtrace.c:111
 nmi_trigger_cpumask_backtrace+0x181/0x2a0 lib/nmi_backtrace.c:62
 trigger_single_cpu_backtrace include/linux/nmi.h:166 [inline]
 rcu_dump_cpu_stacks+0x223/0x390 kernel/rcu/tree_stall.h:349
 print_cpu_stall+0x320/0x600 kernel/rcu/tree_stall.h:633
 check_cpu_stall kernel/rcu/tree_stall.h:727 [inline]
 rcu_pending kernel/rcu/tree.c:3932 [inline]
 rcu_sched_clock_irq+0x8d9/0x1150 kernel/rcu/tree.c:2619
 update_process_times+0x196/0x200 kernel/time/timer.c:1788
 tick_sched_handle kernel/time/tick-sched.c:254 [inline]
 tick_sched_timer+0x386/0x550 kernel/time/tick-sched.c:1473
 __run_hrtimer kernel/time/hrtimer.c:1686 [inline]
 __hrtimer_run_queues+0x55b/0xcf0 kernel/time/hrtimer.c:1750
 hrtimer_interrupt+0x392/0x980 kernel/time/hrtimer.c:1812
 local_apic_timer_interrupt arch/x86/kernel/apic/apic.c:1085 [inline]
 __sysvec_apic_timer_interrupt+0x139/0x470 arch/x86/kernel/apic/apic.c:1102
 sysvec_apic_timer_interrupt+0x8c/0xb0 arch/x86/kernel/apic/apic.c:1096
 </IRQ>
 <TASK>
 asm_sysvec_apic_timer_interrupt+0x16/0x20 arch/x86/include/asm/idtentry.h:638
RIP: 0010:native_safe_halt arch/x86/include/asm/irqflags.h:51 [inline]
RIP: 0010:arch_safe_halt arch/x86/include/asm/irqflags.h:89 [inline]
RIP: 0010:kvm_wait+0x1b4/0x200 arch/x86/kernel/kvm.c:918
Code: e0 48 c1 e8 03 42 0f b6 04 28 84 c0 75 42 45 0f b6 34 24 e8 7e d4 4e 00 44 3a 74 24 1c 75 10 66 90 0f 00 2d fe 87 50 09 fb f4 <e9> c8 fe ff ff fb e9 c2 fe ff ff 44 89 e1 80 e1 07 38 c1 0f 8c 54
RSP: 0018:ffffc900035a7700 EFLAGS: 00000246
RAX: e088d78fa910eb00 RBX: 1ffff920006b4ee4 RCX: ffffffff8162f928
RDX: dffffc0000000000 RSI: ffffffff8a8b1500 RDI: ffffffff8ad88f00
RBP: ffffc900035a77d0 R08: dffffc0000000000 R09: fffffbfff1f79e32
R10: 0000000000000000 R11: dffffc0000000001 R12: ffff88801994d488
R13: dffffc0000000000 R14: 0000000000000003 R15: ffffc900035a7740
 pv_wait arch/x86/include/asm/paravirt.h:597 [inline]
 pv_wait_head_or_lock kernel/locking/qspinlock_paravirt.h:470 [inline]
 __pv_queued_spin_lock_slowpath+0x6bc/0xc40 kernel/locking/qspinlock.c:508
 pv_queued_spin_lock_slowpath arch/x86/include/asm/paravirt.h:585 [inline]
 queued_spin_lock_slowpath+0x42/0x50 arch/x86/include/asm/qspinlock.h:51
 queued_spin_lock include/asm-generic/qspinlock.h:85 [inline]
 do_raw_spin_lock+0x269/0x370 kernel/locking/spinlock_debug.c:115
 spin_lock_bh include/linux/spinlock.h:368 [inline]
 lock_sock_nested+0x68/0x100 net/core/sock.c:3242
 lock_sock include/net/sock.h:1668 [inline]
 tipc_sendstream+0x43/0x70 net/tipc/socket.c:1549
 sock_sendmsg_nosec net/socket.c:704 [inline]
 __sock_sendmsg net/socket.c:716 [inline]
 ____sys_sendmsg+0x59e/0x8f0 net/socket.c:2431
 ___sys_sendmsg+0x252/0x2e0 net/socket.c:2485
 __sys_sendmsg net/socket.c:2514 [inline]
 __do_sys_sendmsg net/socket.c:2523 [inline]
 __se_sys_sendmsg+0x19a/0x260 net/socket.c:2521
 do_syscall_x64 arch/x86/entry/common.c:50 [inline]
 do_syscall_64+0x3d/0xb0 arch/x86/entry/common.c:80
 entry_SYSCALL_64_after_hwframe+0x61/0xcb
RIP: 0033:0x7fb00dc6b3e9
Code: 28 00 00 00 75 05 48 83 c4 28 c3 e8 51 18 00 00 90 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 b0 ff ff ff f7 d8 64 89 01 48
RSP: 002b:00007fb00dc2c228 EFLAGS: 00000246 ORIG_RAX: 000000000000002e
RAX: ffffffffffffffda RBX: 0000000000000018 RCX: 00007fb00dc6b3e9
RDX: 0000000000000000 RSI: 00000000200003c0 RDI: 0000000000000004
RBP: 00007fb00dcf5328 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000246 R12: 00007fb00dcf5320
R13: 00007fb00dcf532c R14: 00007ffd462419a0 R15: 00007ffd46241a88
 </TASK>
Sending NMI from CPU 0 to CPUs 1:
NMI backtrace for cpu 1
CPU: 1 PID: 5062 Comm: syz-executor299 Not tainted 5.15.152-syzkaller #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 02/29/2024
RIP: 0010:rhashtable_lookup include/linux/rhashtable.h:638 [inline]
RIP: 0010:tipc_sk_lookup+0x388/0x920 net/tipc/socket.c:2999
Code: 31 ff e8 4b 09 b2 f7 4c 89 f8 48 83 e0 01 0f 85 bb 00 00 00 48 8b 5c 24 18 48 89 d8 48 c1 e8 03 48 89 44 24 50 42 0f b6 04 20 <84> c0 0f 85 36 01 00 00 44 0f b7 2b 48 8b 74 24 20 48 89 f0 48 c1
RSP: 0018:ffffc900036063e0 EFLAGS: 00000a03
RAX: 0000000000000000 RBX: ffff88814ad2514e RCX: ffff88807587bb80
RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
RBP: ffffc900036064d0 R08: ffffffff89ce3465 R09: fffffbfff1f79e19
R10: 0000000000000000 R11: dffffc0000000001 R12: dffffc0000000000
R13: ffff88814b16c000 R14: 0000000000000025 R15: ffff88801994d9a8
FS:  00007fb00dc0b6c0(0000) GS:ffff8880b9b00000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00007fb00dc0bd58 CR3: 000000007bd08000 CR4: 00000000003506e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Call Trace:
 <NMI>
 </NMI>
 <TASK>
 tipc_sk_rcv+0x428/0x1d40 net/tipc/socket.c:2492
 tipc_node_xmit+0x1b7/0xf20 net/tipc/node.c:1703
 tipc_node_xmit_skb net/tipc/node.c:1768 [inline]
 tipc_node_distr_xmit+0x309/0x440 net/tipc/node.c:1783
 tipc_sk_rcv+0x1629/0x1d40 net/tipc/socket.c:2501
 tipc_node_xmit+0x1b7/0xf20 net/tipc/node.c:1703
 tipc_sk_push_backlog+0x507/0x920 net/tipc/socket.c:1316
 tipc_sk_conn_proto_rcv net/tipc/socket.c:1370 [inline]
 tipc_sk_proto_rcv+0xa8e/0x1820 net/tipc/socket.c:2158
 tipc_sk_filter_rcv+0x315b/0x33d0 net/tipc/socket.c:2352
 tipc_sk_enqueue net/tipc/socket.c:2445 [inline]
 tipc_sk_rcv+0x8a7/0x1d40 net/tipc/socket.c:2497
 tipc_node_xmit+0x1b7/0xf20 net/tipc/node.c:1703
 tipc_node_xmit_skb net/tipc/node.c:1768 [inline]
 tipc_node_distr_xmit+0x309/0x440 net/tipc/node.c:1783
 tipc_sk_backlog_rcv+0x199/0x220 net/tipc/socket.c:2412
 sk_backlog_rcv include/net/sock.h:1059 [inline]
 __release_sock+0x198/0x4b0 net/core/sock.c:2713
 release_sock+0x5d/0x1c0 net/core/sock.c:3254
 sock_setsockopt+0x155d/0x2f10 net/core/sock.c:1378
 __sys_setsockopt+0x5dd/0x990 net/socket.c:2194
 __do_sys_setsockopt net/socket.c:2209 [inline]
 __se_sys_setsockopt net/socket.c:2206 [inline]
 __x64_sys_setsockopt+0xb1/0xc0 net/socket.c:2206
 do_syscall_x64 arch/x86/entry/common.c:50 [inline]
 do_syscall_64+0x3d/0xb0 arch/x86/entry/common.c:80
 entry_SYSCALL_64_after_hwframe+0x61/0xcb
RIP: 0033:0x7fb00dc6b3e9
Code: 28 00 00 00 75 05 48 83 c4 28 c3 e8 51 18 00 00 90 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 b0 ff ff ff f7 d8 64 89 01 48
RSP: 002b:00007fb00dc0b228 EFLAGS: 00000246 ORIG_RAX: 0000000000000036
RAX: ffffffffffffffda RBX: 00007fb00dc0b6c0 RCX: 00007fb00dc6b3e9
RDX: 0000000000000021 RSI: 0000000000000001 RDI: 0000000000000003
RBP: 00007fb00dcf5338 R08: 0000000000000004 R09: 0000000000000000
R10: 0000000020000540 R11: 0000000000000246 R12: 00007fb00dcf5330
R13: 00007fb00dcf533c R14: 00007ffd462419a0 R15: 00007ffd46241a88
 </TASK>
INFO: NMI handler (nmi_cpu_backtrace_handler) took too long to run: 2.105 msecs

Crashes (3):
Time Kernel Commit Syzkaller Config Log Report Syz repro C repro VM info Assets (help?) Manager Title
2024/03/22 10:17 linux-5.15.y b95c01af2113 7a239ce7 .config console log report syz C [disk image] [vmlinux] [kernel image] ci2-linux-5-15-kasan-perf INFO: rcu detected stall in sys_sendmsg
2024/03/19 21:37 linux-5.15.y b95c01af2113 e104824c .config console log report syz C [disk image] [vmlinux] [kernel image] ci2-linux-5-15-kasan-perf INFO: rcu detected stall in sys_sendmsg
2024/04/17 13:01 linux-5.15.y c52b9710c83d 18f6e127 .config console log report info [disk image] [vmlinux] [kernel image] ci2-linux-5-15-kasan INFO: rcu detected stall in sys_sendmsg
* Struck through repros no longer work on HEAD.