syzbot


INFO: rcu detected stall in tcp_write_timer

Status: upstream: reported on 2024/08/22 21:30
Reported-by: syzbot+06b635a512eb322a9bb9@syzkaller.appspotmail.com
First crash: 54d, last: 1d21h
Similar bugs (16)
Kernel Title Repro Cause bisect Fix bisect Count Last Reported Patched Status
upstream INFO: rcu detected stall in tcp_write_timer (2) bpf 2 1351d 1407d 0/28 auto-closed as invalid on 2021/05/03 11:59
upstream INFO: rcu detected stall in tcp_write_timer (4) net 1 92d 92d 0/28 auto-obsoleted due to no activity on 2024/10/13 08:46
upstream INFO: rcu detected stall in tcp_write_timer net 3 1900d 1895d 0/28 auto-closed as invalid on 2019/10/25 14:11
upstream INFO: rcu detected stall in tcp_write_timer (3) net 1 1218d 1218d 0/28 auto-closed as invalid on 2021/09/13 13:17
linux-6.1 INFO: rcu detected stall in tcp_write_timer 11 7d18h 117d 0/3 upstream: reported on 2024/06/21 02:50
upstream BUG: soft lockup in tcp_write_timer (4) kasan mm 4 102d 126d 26/28 fixed on 2024/07/09 19:14
linux-4.14 INFO: rcu detected stall in tcp_write_timer 4 1502d 1745d 0/1 auto-closed as invalid on 2021/01/02 05:45
linux-4.19 INFO: rcu detected stall in tcp_write_timer 2 1631d 1685d 0/1 auto-closed as invalid on 2020/08/26 06:46
linux-4.19 BUG: soft lockup in tcp_write_timer (3) 2 655d 668d 0/1 upstream: reported on 2022/12/17 21:41
linux-4.19 BUG: soft lockup in tcp_write_timer (2) 2 1064d 1128d 0/1 auto-closed as invalid on 2022/03/16 10:56
linux-4.19 BUG: soft lockup in tcp_write_timer 1 1293d 1293d 0/1 auto-closed as invalid on 2021/07/30 14:52
upstream BUG: soft lockup in tcp_write_timer (2) kvm 1 904d 904d 0/28 auto-closed as invalid on 2022/06/24 22:31
upstream BUG: soft lockup in tcp_write_timer (3) net 6 281d 389d 0/28 closed as invalid on 2024/03/18 17:07
android-5-15 BUG: soft lockup in tcp_write_timer 11 65d 177d 0/2 premoderation: reported on 2024/04/21 15:01
linux-4.14 BUG: soft lockup in tcp_write_timer 2 1621d 1700d 0/1 auto-closed as invalid on 2020/09/05 12:42
upstream BUG: soft lockup in tcp_write_timer net 11 1900d 1908d 0/28 auto-closed as invalid on 2019/10/25 14:11

Sample crash report:
rcu: INFO: rcu_preempt self-detected stall on CPU
rcu: 	0-....: (1 GPs behind) idle=e5f/1/0x4000000000000000 softirq=13986/13988 fqs=5236 
	(t=10502 jiffies g=16249 q=2285)
NMI backtrace for cpu 0
CPU: 0 PID: 5849 Comm: syz.1.783 Not tainted 5.15.167-syzkaller #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 09/13/2024
Call Trace:
 <IRQ>
 __dump_stack lib/dump_stack.c:88 [inline]
 dump_stack_lvl+0x1e3/0x2d0 lib/dump_stack.c:106
 nmi_cpu_backtrace+0x46a/0x4a0 lib/nmi_backtrace.c:111
 nmi_trigger_cpumask_backtrace+0x181/0x2a0 lib/nmi_backtrace.c:62
 trigger_single_cpu_backtrace include/linux/nmi.h:166 [inline]
 rcu_dump_cpu_stacks+0x223/0x390 kernel/rcu/tree_stall.h:349
 print_cpu_stall+0x320/0x600 kernel/rcu/tree_stall.h:633
 check_cpu_stall kernel/rcu/tree_stall.h:727 [inline]
 rcu_pending kernel/rcu/tree.c:3932 [inline]
 rcu_sched_clock_irq+0x8d9/0x1150 kernel/rcu/tree.c:2619
 update_process_times+0x196/0x200 kernel/time/timer.c:1818
 tick_sched_handle kernel/time/tick-sched.c:254 [inline]
 tick_sched_timer+0x386/0x550 kernel/time/tick-sched.c:1473
 __run_hrtimer kernel/time/hrtimer.c:1688 [inline]
 __hrtimer_run_queues+0x55b/0xcf0 kernel/time/hrtimer.c:1752
 hrtimer_interrupt+0x392/0x980 kernel/time/hrtimer.c:1814
 local_apic_timer_interrupt arch/x86/kernel/apic/apic.c:1085 [inline]
 __sysvec_apic_timer_interrupt+0x139/0x470 arch/x86/kernel/apic/apic.c:1102
 sysvec_apic_timer_interrupt+0x3e/0xb0 arch/x86/kernel/apic/apic.c:1096
 asm_sysvec_apic_timer_interrupt+0x16/0x20 arch/x86/include/asm/idtentry.h:638
RIP: 0010:sk_has_account include/net/sock.h:1557 [inline]
RIP: 0010:sk_mem_uncharge include/net/sock.h:1607 [inline]
RIP: 0010:sk_wmem_free_skb+0x88/0x5a0 include/net/sock.h:1626
Code: 00 41 29 9d 78 02 00 00 4d 8d 65 28 4c 89 e0 48 c1 e8 03 80 3c 28 00 74 08 4c 89 e7 e8 81 e0 f5 f8 bb f8 00 00 00 49 03 1c 24 <48> 89 d8 48 c1 e8 03 80 3c 28 00 74 08 48 89 df e8 63 e0 f5 f8 48
RSP: 0018:ffffc90000007950 EFLAGS: 00000286
RAX: 1ffff1100a73fd85 RBX: ffffffff8daeff18 RCX: ffff88801e52d940
RDX: 0000000000000100 RSI: ffff888074755400 RDI: ffff8880539fee78
RBP: dffffc0000000000 R08: dffffc0000000000 R09: ffff888074755458
R10: 0000000000000000 R11: dffffc0000000001 R12: ffff8880539fec28
R13: ffff8880539fec00 R14: ffff8880747554d8 R15: 1ffff1100e8eaa9b
 tcp_write_queue_purge+0x132/0x3e0 net/ipv4/tcp.c:2963
 tcp_done_with_error+0x3d/0xc0 net/ipv4/tcp_input.c:4358
 tcp_write_err net/ipv4/tcp_timer.c:70 [inline]
 tcp_write_timeout net/ipv4/tcp_timer.c:273 [inline]
 tcp_retransmit_timer+0x112b/0x2680 net/ipv4/tcp_timer.c:543
 tcp_write_timer_handler+0x1f7/0x970 net/ipv4/tcp_timer.c:655
 tcp_write_timer+0x12e/0x280 net/ipv4/tcp_timer.c:675
 call_timer_fn+0x16d/0x560 kernel/time/timer.c:1451
 expire_timers kernel/time/timer.c:1496 [inline]
 __run_timers+0x67c/0x890 kernel/time/timer.c:1767
 run_timer_softirq+0x63/0xf0 kernel/time/timer.c:1780
 handle_softirqs+0x3a7/0x930 kernel/softirq.c:558
 __do_softirq kernel/softirq.c:592 [inline]
 invoke_softirq kernel/softirq.c:432 [inline]
 __irq_exit_rcu+0x157/0x240 kernel/softirq.c:641
 irq_exit_rcu+0x5/0x20 kernel/softirq.c:653
 sysvec_apic_timer_interrupt+0x91/0xb0 arch/x86/kernel/apic/apic.c:1096
 </IRQ>
 <TASK>
 asm_sysvec_apic_timer_interrupt+0x16/0x20 arch/x86/include/asm/idtentry.h:638
RIP: 0010:trace_event_get_offsets_lock include/trace/events/lock.h:39 [inline]
RIP: 0010:perf_trace_lock+0xe7/0x440 include/trace/events/lock.h:39
Code: 38 84 c0 0f 85 02 03 00 00 c7 84 24 a0 00 00 00 00 00 00 00 49 8d 5c 24 18 48 89 d8 48 c1 e8 03 48 89 44 24 38 42 80 3c 38 00 <74> 08 48 89 df e8 cf cd 67 00 48 89 5c 24 40 48 8b 03 48 85 c0 48
RSP: 0018:ffffc900031f6ee0 EFLAGS: 00000246
RAX: 1ffffffff1923f8f RBX: ffffffff8c91fc78 RCX: 00000000031f6f03
RDX: ffffffff818aabc5 RSI: ffffffff8c91fc60 RDI: ffffc900031f6f60
RBP: ffffc900031f6ff0 R08: dffffc0000000000 R09: fffffbfff1bd2c16
R10: 0000000000000000 R11: dffffc0000000001 R12: ffffffff8c91fc60
R13: ffffffff818aabc5 R14: ffffffff8c7eec40 R15: dffffc0000000000
 trace_lock_release include/trace/events/lock.h:58 [inline]
 lock_release+0x93c/0x9a0 kernel/locking/lockdep.c:5634
 rcu_read_unlock include/linux/rcupdate.h:772 [inline]
 BPF_PROG_RUN_ARRAY include/linux/bpf.h:1346 [inline]
 trace_call_bpf+0x5a6/0x660 kernel/trace/bpf_trace.c:127
 perf_trace_run_bpf_submit+0x7b/0x1d0 kernel/events/core.c:9987
 perf_trace_lock+0x37f/0x440 include/trace/events/lock.h:39
 trace_lock_release include/trace/events/lock.h:58 [inline]
 lock_release+0x93c/0x9a0 kernel/locking/lockdep.c:5634
 rcu_read_unlock include/linux/rcupdate.h:772 [inline]
 is_bpf_text_address+0x24f/0x260 kernel/bpf/core.c:723
 kernel_text_address kernel/extable.c:151 [inline]
 __kernel_text_address+0x94/0x100 kernel/extable.c:105
 unwind_get_return_address+0x49/0x80 arch/x86/kernel/unwind_orc.c:323
 arch_stack_walk+0xf3/0x140 arch/x86/kernel/stacktrace.c:26
 stack_trace_save+0x113/0x1c0 kernel/stacktrace.c:122
 kasan_save_stack mm/kasan/common.c:38 [inline]
 kasan_set_track mm/kasan/common.c:46 [inline]
 set_alloc_info mm/kasan/common.c:434 [inline]
 ____kasan_kmalloc+0xba/0xf0 mm/kasan/common.c:513
 kasan_kmalloc include/linux/kasan.h:264 [inline]
 __kmalloc+0x168/0x300 mm/slub.c:4407
 kmalloc include/linux/slab.h:596 [inline]
 allocate_probes kernel/tracepoint.c:109 [inline]
 func_add kernel/tracepoint.c:205 [inline]
 tracepoint_add_func+0x2de/0x9d0 kernel/tracepoint.c:338
 tracepoint_probe_register_prio kernel/tracepoint.c:511 [inline]
 tracepoint_probe_register+0x101/0x160 kernel/tracepoint.c:531
 perf_trace_event_reg kernel/trace/trace_event_perf.c:129 [inline]
 perf_trace_event_init+0x494/0x940 kernel/trace/trace_event_perf.c:202
 perf_trace_init+0x23b/0x2d0 kernel/trace/trace_event_perf.c:226
 perf_tp_event_init+0x89/0x110 kernel/events/core.c:10073
 perf_try_init_event+0x135/0x3e0 kernel/events/core.c:11549
 perf_init_event kernel/events/core.c:11613 [inline]
 perf_event_alloc+0x1175/0x2180 kernel/events/core.c:11905
 __do_sys_perf_event_open kernel/events/core.c:12443 [inline]
 __se_sys_perf_event_open+0xb27/0x4510 kernel/events/core.c:12335
 do_syscall_x64 arch/x86/entry/common.c:50 [inline]
 do_syscall_64+0x3b/0xb0 arch/x86/entry/common.c:80
 entry_SYSCALL_64_after_hwframe+0x66/0xd0
RIP: 0033:0x7f86261e6ff9
Code: ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 a8 ff ff ff f7 d8 64 89 01 48
RSP: 002b:00007f862465f038 EFLAGS: 00000246 ORIG_RAX: 000000000000012a
RAX: ffffffffffffffda RBX: 00007f862639ef80 RCX: 00007f86261e6ff9
RDX: ffffffffffffffff RSI: 0000000000000000 RDI: 0000000020000140
RBP: 00007f8626259296 R08: 0000000000000000 R09: 0000000000000000
R10: ffffffffffffffff R11: 0000000000000246 R12: 0000000000000000
R13: 0000000000000000 R14: 00007f862639ef80 R15: 00007ffef9837f48
 </TASK>

Crashes (4):
Time Kernel Commit Syzkaller Config Log Report Syz repro C repro VM info Assets (help?) Manager Title
2024/10/14 08:12 linux-5.15.y 3a5928702e71 084d8178 .config console log report info [disk image] [vmlinux] [kernel image] ci2-linux-5-15-kasan-perf INFO: rcu detected stall in tcp_write_timer
2024/10/11 10:03 linux-5.15.y 3a5928702e71 cd942402 .config console log report info [disk image] [vmlinux] [kernel image] ci2-linux-5-15-kasan-perf INFO: rcu detected stall in tcp_write_timer
2024/08/22 21:29 linux-5.15.y fa93fa65db6e ca02180f .config console log report info [disk image] [vmlinux] [kernel image] ci2-linux-5-15-kasan-perf INFO: rcu detected stall in tcp_write_timer
2024/09/14 04:39 linux-5.15.y 3a5928702e71 b58f933c .config console log report info [disk image] [vmlinux] [kernel image] ci2-linux-5-15-kasan-perf BUG: soft lockup in tcp_write_timer
* Struck through repros no longer work on HEAD.