syzbot


INFO: rcu detected stall in sys_sendmsg

Status: upstream: reported C repro on 2024/03/19 21:37
Bug presence: origin:upstream
[Documentation on labels]
Reported-by: syzbot+fd4fc65c579eec307cfd@syzkaller.appspotmail.com
First crash: 325d, last: 13d
Fix bisection: failed (error log, bisect log)
  
Bug presence (1)
Date Name Commit Repro Result
2024/05/01 upstream (ToT) 18daea77cca6 C [report] INFO: rcu detected stall in corrupted
Similar bugs (10)
Kernel Title Repro Cause bisect Fix bisect Count Last Reported Patched Status
upstream INFO: rcu detected stall in sys_sendmsg (2) cgroups mm 5 1893d 1894d 0/28 closed as invalid on 2019/12/04 14:14
upstream INFO: rcu detected stall in sys_sendmsg (3) kernel 1 1858d 1858d 0/28 closed as invalid on 2020/01/08 05:33
linux-6.1 INFO: rcu detected stall in sys_sendmsg 6 133d 219d 0/3 auto-obsoleted due to no activity on 2025/01/06 11:24
upstream INFO: rcu detected stall in sys_sendmsg net C done 2 1974d 1974d 13/28 fixed on 2019/10/09 10:54
android-6-1 BUG: soft lockup in sys_sendmsg origin:upstream C 3 277d 305d 0/2 upstream: reported C repro on 2024/04/09 06:46
linux-6.1 BUG: soft lockup in sys_sendmsg 2 606d 613d 0/3 auto-obsoleted due to no activity on 2023/09/20 17:26
android-5-10 BUG: soft lockup in sys_sendmsg C 39 3d02h 323d 0/2 upstream: reported C repro on 2024/03/22 10:41
upstream BUG: soft lockup in sys_sendmsg tipc batman C 3 320d 362d 25/28 fixed on 2024/05/22 23:36
android-5-15 BUG: soft lockup in sys_sendmsg origin:upstream C error 13 149d 323d 0/2 upstream: reported C repro on 2024/03/22 10:44
linux-6.1 BUG: soft lockup in sys_sendmsg (2) origin:upstream C done 1 307d 307d 3/3 fixed on 2024/05/15 09:17
Last patch testing requests (4)
Created Duration User Patch Repo Result
2025/01/04 11:17 12m retest repro linux-5.15.y report log
2024/12/20 17:38 11m retest repro linux-5.15.y report log
2024/12/15 19:39 17m retest repro linux-5.15.y report log
2024/10/10 08:10 55m retest repro linux-5.15.y report log
Fix bisection attempts (5)
Created Duration User Patch Repo Result
2024/10/15 00:29 0m bisect fix linux-5.15.y error job log
2024/09/09 00:56 2h21m bisect fix linux-5.15.y OK (0) job log log
2024/08/03 10:55 1h36m bisect fix linux-5.15.y OK (0) job log log
2024/06/27 20:16 1h14m bisect fix linux-5.15.y OK (0) job log log
2024/05/21 13:57 1h56m bisect fix linux-5.15.y OK (0) job log log

Sample crash report:
rcu: INFO: rcu_preempt self-detected stall on CPU
rcu: 	1-...!: (1 GPs behind) idle=409/1/0x4000000000000000 softirq=6179/6180 fqs=0 
	(t=10500 jiffies g=6725 q=74)
rcu: rcu_preempt kthread timer wakeup didn't happen for 10499 jiffies! g6725 f0x0 RCU_GP_WAIT_FQS(5) ->state=0x402
rcu: 	Possible timer handling issue on cpu=0 timer-softirq=2086
rcu: rcu_preempt kthread starved for 10500 jiffies! g6725 f0x0 RCU_GP_WAIT_FQS(5) ->state=0x402 ->cpu=0
rcu: 	Unless rcu_preempt kthread gets sufficient CPU time, OOM is now expected behavior.
rcu: RCU grace-period kthread stack dump:
task:rcu_preempt     state:I stack:26784 pid:   15 ppid:     2 flags:0x00004000
Call Trace:
 <TASK>
 context_switch kernel/sched/core.c:5027 [inline]
 __schedule+0x12c4/0x45b0 kernel/sched/core.c:6373
 schedule+0x11b/0x1f0 kernel/sched/core.c:6456
 schedule_timeout+0x1b9/0x300 kernel/time/timer.c:1914
 rcu_gp_fqs_loop+0x2bf/0x1080 kernel/rcu/tree.c:1972
 rcu_gp_kthread+0xa4/0x360 kernel/rcu/tree.c:2145
 kthread+0x3f6/0x4f0 kernel/kthread.c:334
 ret_from_fork+0x1f/0x30 arch/x86/entry/entry_64.S:287
 </TASK>
rcu: Stack dump where RCU GP kthread last ran:
Sending NMI from CPU 1 to CPUs 0:
NMI backtrace for cpu 0
CPU: 0 PID: 4176 Comm: syz-executor255 Not tainted 5.15.173-syzkaller #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 09/13/2024
RIP: 0010:lock_acquire+0xa/0x4f0 kernel/locking/lockdep.c:5591
Code: 8e e8 3a 8f 67 00 e9 a2 fd ff ff 0f 1f 44 00 00 65 8b 05 21 b3 9f 7e a9 00 ff ff 00 0f 95 c0 c3 55 48 89 e5 41 57 41 56 41 55 <41> 54 53 48 83 e4 e0 48 81 ec 20 01 00 00 4c 89 4c 24 28 4c 89 44
RSP: 0018:ffffc900025c63a8 EFLAGS: 00000246
RAX: ffffffff89f1666a RBX: 0000000090f48f50 RCX: 0000000000000002
RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffffffff8cb1fc60
RBP: ffffc900025c63c0 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: dffffc0000000001 R12: dffffc0000000000
R13: 0000000090f48f50 R14: ffffffff96e4d240 R15: 1ffff920004b8c8c
FS:  00007fcd85b6e6c0(0000) GS:ffff8880b8e00000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00007fcd85b6ed58 CR3: 000000007248f000 CR4: 00000000003506f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Call Trace:
 <NMI>
 </NMI>
 <TASK>
 rcu_lock_acquire+0x2a/0x30 include/linux/rcupdate.h:312
 rcu_read_lock include/linux/rcupdate.h:739 [inline]
 tipc_sk_lookup+0xc9/0x920 net/tipc/socket.c:2998
 tipc_sk_rcv+0x428/0x1d40 net/tipc/socket.c:2492
 tipc_node_xmit+0x1b7/0xf20 net/tipc/node.c:1703
 tipc_node_xmit_skb net/tipc/node.c:1768 [inline]
 tipc_node_distr_xmit+0x309/0x440 net/tipc/node.c:1783
 tipc_sk_rcv+0x1629/0x1d40 net/tipc/socket.c:2501
 tipc_node_xmit+0x1b7/0xf20 net/tipc/node.c:1703
 tipc_sk_push_backlog+0x507/0x920 net/tipc/socket.c:1316
 tipc_sk_conn_proto_rcv net/tipc/socket.c:1370 [inline]
 tipc_sk_proto_rcv+0xa8e/0x1820 net/tipc/socket.c:2158
 tipc_sk_filter_rcv+0x315b/0x33d0 net/tipc/socket.c:2352
 tipc_sk_enqueue net/tipc/socket.c:2445 [inline]
 tipc_sk_rcv+0x8a7/0x1d40 net/tipc/socket.c:2497
 tipc_node_xmit+0x1b7/0xf20 net/tipc/node.c:1703
 tipc_node_xmit_skb net/tipc/node.c:1768 [inline]
 tipc_node_distr_xmit+0x309/0x440 net/tipc/node.c:1783
 tipc_sk_backlog_rcv+0x199/0x220 net/tipc/socket.c:2412
 sk_backlog_rcv include/net/sock.h:1061 [inline]
 __release_sock+0x198/0x4b0 net/core/sock.c:2724
 release_sock+0x5d/0x1c0 net/core/sock.c:3265
 sock_setsockopt+0x155d/0x2f10 net/core/sock.c:1378
 __sys_setsockopt+0x5dd/0x990 net/socket.c:2199
 __do_sys_setsockopt net/socket.c:2214 [inline]
 __se_sys_setsockopt net/socket.c:2211 [inline]
 __x64_sys_setsockopt+0xb1/0xc0 net/socket.c:2211
 do_syscall_x64 arch/x86/entry/common.c:50 [inline]
 do_syscall_64+0x3b/0xb0 arch/x86/entry/common.c:80
 entry_SYSCALL_64_after_hwframe+0x66/0xd0
RIP: 0033:0x7fcd85bce429
Code: 28 00 00 00 75 05 48 83 c4 28 c3 e8 51 18 00 00 90 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 b0 ff ff ff f7 d8 64 89 01 48
RSP: 002b:00007fcd85b6e228 EFLAGS: 00000246 ORIG_RAX: 0000000000000036
RAX: ffffffffffffffda RBX: 00007fcd85c58338 RCX: 00007fcd85bce429
RDX: 0000000000000021 RSI: 0000000000000001 RDI: 0000000000000003
RBP: 0000000000000000 R08: 0000000000000004 R09: 0000000000000000
R10: 0000000020000540 R11: 0000000000000246 R12: 00007fcd85c58330
R13: 00007fcd85c25074 R14: 00007fffdcdfcc60 R15: 00007fffdcdfcd48
 </TASK>
INFO: NMI handler (nmi_cpu_backtrace_handler) took too long to run: 2.099 msecs
Sending NMI from CPU 1 to CPUs 0:
NMI backtrace for cpu 0
CPU: 0 PID: 4176 Comm: syz-executor255 Not tainted 5.15.173-syzkaller #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 09/13/2024
RIP: 0010:__lock_acquire+0xf1d/0x1ff0
Code: c1 c1 c1 04 31 e9 44 01 f8 41 29 cf 89 ca c1 c2 06 44 31 fa 01 c1 29 d0 89 d6 c1 c6 08 31 c6 01 ca 29 f1 89 f3 c1 c3 10 31 cb <01> d6 29 da 89 dd c1 c5 13 31 d5 01 f3 29 ee 01 eb c1 c5 04 31 f5
RSP: 0018:ffffc900025c6180 EFLAGS: 00000082
RAX: 00000000227a32e1 RBX: 00000000e885009b RCX: 000000008775631b
RDX: 00000000fc37cd68 RSI: 0000000063806ff0 RDI: dffffc0000000000
RBP: 0000000062a641d4 R08: dffffc0000000000 R09: fffffbfff2131021
R10: 0000000000000000 R11: dffffc0000000001 R12: 0000000000000002
R13: ffff888023cd8ae8 R14: 1ffff1100479b15c R15: 00000000df288930
FS:  00007fcd85b6e6c0(0000) GS:ffff8880b8e00000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00007fcd85b6ed58 CR3: 000000007248f000 CR4: 00000000003506f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Call Trace:
 <NMI>
 </NMI>
 <TASK>
 lock_acquire+0x1db/0x4f0 kernel/locking/lockdep.c:5623
 rcu_lock_acquire+0x2a/0x30 include/linux/rcupdate.h:312
 rcu_read_lock include/linux/rcupdate.h:739 [inline]
 tipc_sk_lookup+0xc9/0x920 net/tipc/socket.c:2998
 tipc_sk_rcv+0x428/0x1d40 net/tipc/socket.c:2492
 tipc_node_xmit+0x1b7/0xf20 net/tipc/node.c:1703
 tipc_node_xmit_skb net/tipc/node.c:1768 [inline]
 tipc_node_distr_xmit+0x309/0x440 net/tipc/node.c:1783
 tipc_sk_rcv+0x1629/0x1d40 net/tipc/socket.c:2501
 tipc_node_xmit+0x1b7/0xf20 net/tipc/node.c:1703
 tipc_sk_push_backlog+0x507/0x920 net/tipc/socket.c:1316
 tipc_sk_conn_proto_rcv net/tipc/socket.c:1370 [inline]
 tipc_sk_proto_rcv+0xa8e/0x1820 net/tipc/socket.c:2158
 tipc_sk_filter_rcv+0x315b/0x33d0 net/tipc/socket.c:2352
 tipc_sk_enqueue net/tipc/socket.c:2445 [inline]
 tipc_sk_rcv+0x8a7/0x1d40 net/tipc/socket.c:2497
 tipc_node_xmit+0x1b7/0xf20 net/tipc/node.c:1703
 tipc_node_xmit_skb net/tipc/node.c:1768 [inline]
 tipc_node_distr_xmit+0x309/0x440 net/tipc/node.c:1783
 tipc_sk_backlog_rcv+0x199/0x220 net/tipc/socket.c:2412
 sk_backlog_rcv include/net/sock.h:1061 [inline]
 __release_sock+0x198/0x4b0 net/core/sock.c:2724
 release_sock+0x5d/0x1c0 net/core/sock.c:3265
 sock_setsockopt+0x155d/0x2f10 net/core/sock.c:1378
 __sys_setsockopt+0x5dd/0x990 net/socket.c:2199
 __do_sys_setsockopt net/socket.c:2214 [inline]
 __se_sys_setsockopt net/socket.c:2211 [inline]
 __x64_sys_setsockopt+0xb1/0xc0 net/socket.c:2211
 do_syscall_x64 arch/x86/entry/common.c:50 [inline]
 do_syscall_64+0x3b/0xb0 arch/x86/entry/common.c:80
 entry_SYSCALL_64_after_hwframe+0x66/0xd0
RIP: 0033:0x7fcd85bce429
Code: 28 00 00 00 75 05 48 83 c4 28 c3 e8 51 18 00 00 90 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 b0 ff ff ff f7 d8 64 89 01 48
RSP: 002b:00007fcd85b6e228 EFLAGS: 00000246 ORIG_RAX: 0000000000000036
RAX: ffffffffffffffda RBX: 00007fcd85c58338 RCX: 00007fcd85bce429
RDX: 0000000000000021 RSI: 0000000000000001 RDI: 0000000000000003
RBP: 0000000000000000 R08: 0000000000000004 R09: 0000000000000000
R10: 0000000020000540 R11: 0000000000000246 R12: 00007fcd85c58330
R13: 00007fcd85c25074 R14: 00007fffdcdfcc60 R15: 00007fffdcdfcd48
 </TASK>
NMI backtrace for cpu 1
CPU: 1 PID: 4175 Comm: syz-executor255 Not tainted 5.15.173-syzkaller #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 09/13/2024
Call Trace:
 <IRQ>
 __dump_stack lib/dump_stack.c:88 [inline]
 dump_stack_lvl+0x1e3/0x2d0 lib/dump_stack.c:106
 nmi_cpu_backtrace+0x46a/0x4a0 lib/nmi_backtrace.c:111
 nmi_trigger_cpumask_backtrace+0x181/0x2a0 lib/nmi_backtrace.c:62
 trigger_single_cpu_backtrace include/linux/nmi.h:166 [inline]
 rcu_dump_cpu_stacks+0x223/0x390 kernel/rcu/tree_stall.h:349
 print_cpu_stall+0x320/0x600 kernel/rcu/tree_stall.h:633
 check_cpu_stall kernel/rcu/tree_stall.h:727 [inline]
 rcu_pending kernel/rcu/tree.c:3932 [inline]
 rcu_sched_clock_irq+0x8d9/0x1150 kernel/rcu/tree.c:2619
 update_process_times+0x196/0x200 kernel/time/timer.c:1818
 tick_sched_handle kernel/time/tick-sched.c:254 [inline]
 tick_sched_timer+0x386/0x550 kernel/time/tick-sched.c:1473
 __run_hrtimer kernel/time/hrtimer.c:1688 [inline]
 __hrtimer_run_queues+0x55b/0xcf0 kernel/time/hrtimer.c:1752
 hrtimer_interrupt+0x392/0x980 kernel/time/hrtimer.c:1814
 local_apic_timer_interrupt arch/x86/kernel/apic/apic.c:1097 [inline]
 __sysvec_apic_timer_interrupt+0x13b/0x4b0 arch/x86/kernel/apic/apic.c:1114
 instr_sysvec_apic_timer_interrupt arch/x86/kernel/apic/apic.c:1108 [inline]
 sysvec_apic_timer_interrupt+0x9b/0xc0 arch/x86/kernel/apic/apic.c:1108
 </IRQ>
 <TASK>
 asm_sysvec_apic_timer_interrupt+0x16/0x20 arch/x86/include/asm/idtentry.h:676
RIP: 0010:native_safe_halt arch/x86/include/asm/irqflags.h:51 [inline]
RIP: 0010:arch_safe_halt arch/x86/include/asm/irqflags.h:89 [inline]
RIP: 0010:kvm_wait+0x1b4/0x200 arch/x86/kernel/kvm.c:918
Code: e0 48 c1 e8 03 42 0f b6 04 28 84 c0 75 42 45 0f b6 34 24 e8 ce de 4e 00 44 3a 74 24 1c 75 10 66 90 0f 00 2d 0e 6d 70 09 fb f4 <e9> c8 fe ff ff fb e9 c2 fe ff ff 44 89 e1 80 e1 07 38 c1 0f 8c 54
RSP: 0018:ffffc9000322f700 EFLAGS: 00000246
RAX: a8416fc43ef98100 RBX: 1ffff92000645ee4 RCX: ffffffff81632d68
RDX: dffffc0000000000 RSI: ffffffff8aab2a80 RDI: ffffffff8af9ed00
RBP: ffffc9000322f7d0 R08: dffffc0000000000 R09: fffffbfff213103b
R10: 0000000000000000 R11: dffffc0000000001 R12: ffff88802c1a9c88
R13: dffffc0000000000 R14: 0000000000000003 R15: ffffc9000322f740
 pv_wait arch/x86/include/asm/paravirt.h:597 [inline]
 pv_wait_head_or_lock kernel/locking/qspinlock_paravirt.h:470 [inline]
 __pv_queued_spin_lock_slowpath+0x6bc/0xc40 kernel/locking/qspinlock.c:508
 pv_queued_spin_lock_slowpath arch/x86/include/asm/paravirt.h:585 [inline]
 queued_spin_lock_slowpath+0x42/0x50 arch/x86/include/asm/qspinlock.h:51
 queued_spin_lock include/asm-generic/qspinlock.h:85 [inline]
 do_raw_spin_lock+0x269/0x370 kernel/locking/spinlock_debug.c:115
 spin_lock_bh include/linux/spinlock.h:368 [inline]
 lock_sock_nested+0x68/0x100 net/core/sock.c:3253
 lock_sock include/net/sock.h:1678 [inline]
 tipc_sendstream+0x43/0x70 net/tipc/socket.c:1549
 sock_sendmsg_nosec net/socket.c:704 [inline]
 __sock_sendmsg net/socket.c:716 [inline]
 ____sys_sendmsg+0x59e/0x8f0 net/socket.c:2436
 ___sys_sendmsg+0x252/0x2e0 net/socket.c:2490
 __sys_sendmsg net/socket.c:2519 [inline]
 __do_sys_sendmsg net/socket.c:2528 [inline]
 __se_sys_sendmsg+0x19a/0x260 net/socket.c:2526
 do_syscall_x64 arch/x86/entry/common.c:50 [inline]
 do_syscall_64+0x3b/0xb0 arch/x86/entry/common.c:80
 entry_SYSCALL_64_after_hwframe+0x66/0xd0
RIP: 0033:0x7fcd85bce429
Code: 28 00 00 00 75 05 48 83 c4 28 c3 e8 51 18 00 00 90 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 b0 ff ff ff f7 d8 64 89 01 48
RSP: 002b:00007fcd85b8f228 EFLAGS: 00000246 ORIG_RAX: 000000000000002e
RAX: ffffffffffffffda RBX: 00007fcd85c58328 RCX: 00007fcd85bce429
RDX: 0000000000000000 RSI: 0000000020000500 RDI: 0000000000000004
RBP: 0000000000000010 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000246 R12: 00007fcd85c58320
R13: 00007fcd85c25074 R14: 00007fffdcdfcc60 R15: 00007fffdcdfcd48
 </TASK>

Crashes (8):
Time Kernel Commit Syzkaller Config Log Report Syz repro C repro VM info Assets (help?) Manager Title
2024/11/29 14:57 linux-5.15.y 0a51d2d4527b 5df23865 .config console log report syz / log C [disk image] [vmlinux] [kernel image] ci2-linux-5-15-kasan-perf INFO: rcu detected stall in sys_sendmsg
2024/03/22 10:17 linux-5.15.y b95c01af2113 7a239ce7 .config console log report syz C [disk image] [vmlinux] [kernel image] ci2-linux-5-15-kasan-perf INFO: rcu detected stall in sys_sendmsg
2024/03/19 21:37 linux-5.15.y b95c01af2113 e104824c .config console log report syz C [disk image] [vmlinux] [kernel image] ci2-linux-5-15-kasan-perf INFO: rcu detected stall in sys_sendmsg
2025/01/26 08:46 linux-5.15.y 003148680b79 9fbd772e .config console log report syz / log [disk image] [vmlinux] [kernel image] ci2-linux-5-15-kasan-arm64 BUG: soft lockup in sys_sendmsg
2024/12/01 00:08 linux-5.15.y 0a51d2d4527b 68914665 .config console log report info [disk image] [vmlinux] [kernel image] ci2-linux-5-15-kasan-perf INFO: rcu detected stall in sys_sendmsg
2024/08/06 04:18 linux-5.15.y 7e89efd3ae1c e1bdb00a .config console log report info [disk image] [vmlinux] [kernel image] ci2-linux-5-15-kasan INFO: rcu detected stall in sys_sendmsg
2024/08/05 14:55 linux-5.15.y 7e89efd3ae1c e35c337f .config console log report info [disk image] [vmlinux] [kernel image] ci2-linux-5-15-kasan-perf INFO: rcu detected stall in sys_sendmsg
2024/04/17 13:01 linux-5.15.y c52b9710c83d 18f6e127 .config console log report info [disk image] [vmlinux] [kernel image] ci2-linux-5-15-kasan INFO: rcu detected stall in sys_sendmsg
* Struck through repros no longer work on HEAD.