syzbot


INFO: rcu detected stall in neigh_timer_handler

Status: upstream: reported on 2024/06/06 22:02
Reported-by: syzbot+cc0a32d4a33855c0f7c7@syzkaller.appspotmail.com
First crash: 11d, last: 11d
Similar bugs (14)
Kernel Title Repro Cause bisect Fix bisect Count Last Reported Patched Status
upstream INFO: rcu detected stall in neigh_timer_handler (6) netfilter 1 1287d 1287d 0/27 auto-closed as invalid on 2021/03/09 03:20
upstream INFO: rcu detected stall in neigh_timer_handler (8) net C done 5 1d19h 5d14h 0/27 upstream: reported C repro on 2024/06/12 22:26
upstream INFO: rcu detected stall in neigh_timer_handler net 1 2033d 2033d 0/27 auto-closed as invalid on 2019/05/23 06:47
upstream INFO: rcu detected stall in neigh_timer_handler (2) net 2 1742d 1744d 13/27 fixed on 2019/10/09 10:54
upstream INFO: rcu detected stall in neigh_timer_handler (4) net 1 1658d 1658d 0/27 closed as invalid on 2019/12/04 14:04
upstream INFO: rcu detected stall in neigh_timer_handler (5) net 1 1390d 1390d 0/27 auto-closed as invalid on 2020/11/26 10:25
upstream INFO: rcu detected stall in neigh_timer_handler (3) net 2 1663d 1662d 0/27 closed as invalid on 2019/11/29 14:24
upstream INFO: rcu detected stall in neigh_timer_handler (7) net 1 1162d 1162d 0/27 auto-closed as invalid on 2021/07/11 19:14
linux-4.14 INFO: rcu detected stall in neigh_timer_handler 5 1507d 1627d 0/1 auto-closed as invalid on 2020/08/30 16:21
android-5-10 BUG: soft lockup in neigh_timer_handler 2 299d 307d 0/2 auto-obsoleted due to no activity on 2023/11/22 05:43
linux-4.19 BUG: soft lockup in neigh_timer_handler 4 584d 842d 0/1 auto-obsoleted due to no activity on 2023/03/12 06:57
linux-4.14 BUG: soft lockup in neigh_timer_handler 3 852d 895d 0/1 auto-closed as invalid on 2022/06/16 21:59
android-6-1 BUG: soft lockup in neigh_timer_handler 2 213d 235d 0/2 auto-obsoleted due to no activity on 2024/02/16 02:11
upstream BUG: soft lockup in neigh_timer_handler net 1 331d 306d 0/27 auto-obsoleted due to no activity on 2023/10/20 21:15

Sample crash report:
rcu: INFO: rcu_preempt detected expedited stalls on CPUs/tasks: { 1-.... } 2788 jiffies s: 7217 root: 0x2/.
rcu: blocking rcu_node structures (internal RCU debug):
Sending NMI from CPU 0 to CPUs 1:
NMI backtrace for cpu 1
CPU: 1 PID: 5087 Comm: syz-executor.3 Not tainted 6.1.92-syzkaller #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 04/02/2024
RIP: 0010:kasan_check_range+0x3/0x290 mm/kasan/generic.c:188
Code: 48 89 c7 e8 8f b9 a9 08 31 c0 c3 0f 0b b8 ea ff ff ff c3 0f 0b b8 ea ff ff ff c3 cc cc cc cc cc cc cc cc cc cc cc cc 55 41 57 <41> 56 53 b0 01 48 85 f6 0f 84 9a 01 00 00 48 89 fd 48 01 f5 0f 82
RSP: 0018:ffffc900001dfbe0 EFLAGS: 00000046
RAX: 0000000000000000 RBX: 0000000000000001 RCX: ffffffff8179eb32
RDX: 0000000000000000 RSI: 0000000000000008 RDI: ffffffff8e738968
RBP: 0000000000000001 R08: ffffffff8179eb0e R09: fffffbfff23e06dd
R10: 0000000000000000 R11: dffffc0000000001 R12: 0000003a0ebcd60b
R13: ffff8880b992a540 R14: ffff8880b992a4c0 R15: dffffc0000000000
FS:  00007fcc8d1dd6c0(0000) GS:ffff8880b9900000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000004 CR3: 0000000021bbe000 CR4: 00000000003506e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Call Trace:
 <NMI>
 </NMI>
 <IRQ>
 instrument_atomic_read include/linux/instrumented.h:72 [inline]
 _test_bit include/asm-generic/bitops/instrumented-non-atomic.h:141 [inline]
 cpumask_test_cpu include/linux/cpumask.h:444 [inline]
 cpu_online include/linux/cpumask.h:1030 [inline]
 trace_hrtimer_cancel include/trace/events/timer.h:297 [inline]
 debug_deactivate+0x82/0x280 kernel/time/hrtimer.c:482
 __run_hrtimer kernel/time/hrtimer.c:1654 [inline]
 __hrtimer_run_queues+0x334/0xe50 kernel/time/hrtimer.c:1750
 hrtimer_interrupt+0x392/0x980 kernel/time/hrtimer.c:1812
 local_apic_timer_interrupt arch/x86/kernel/apic/apic.c:1095 [inline]
 __sysvec_apic_timer_interrupt+0x156/0x580 arch/x86/kernel/apic/apic.c:1112
 sysvec_apic_timer_interrupt+0x3e/0xb0 arch/x86/kernel/apic/apic.c:1106
 asm_sysvec_apic_timer_interrupt+0x16/0x20 arch/x86/include/asm/idtentry.h:653
RIP: 0010:seqcount_lockdep_reader_access+0x1dc/0x220 include/linux/seqlock.h:105
Code: 00 4d 85 ed 75 16 e8 43 88 10 00 eb 15 e8 3c 88 10 00 e8 07 03 11 09 4d 85 ed 74 ea e8 2d 88 10 00 fb 48 c7 04 24 0e 36 e0 45 <4b> c7 04 3c 00 00 00 00 66 43 c7 44 3c 09 00 00 43 c6 44 3c 0b 00
RSP: 0018:ffffc900001dff80 EFLAGS: 00000246
RAX: ffffffff817a0523 RBX: 0000000000000000 RCX: ffff88801f8cd940
RDX: 0000000000000502 RSI: 0000000000000000 RDI: 0000000000000000
RBP: ffffc900001e0038 R08: ffffffff817a04f9 R09: fffffbfff2093857
R10: 0000000000000000 R11: dffffc0000000001 R12: dffffc0000000000
R13: 0000000000000200 R14: 0000000000000046 R15: 1ffff9200003bff0
 ktime_get_with_offset+0x8c/0x320 kernel/time/timekeeping.c:889
 ktime_get_real include/linux/timekeeping.h:79 [inline]
 netif_rx_internal+0x42a/0x600 net/core/dev.c:4970
 __netif_rx+0x74/0xb0 net/core/dev.c:5013
 loopback_xmit+0x48a/0x710 drivers/net/loopback.c:89
 __netdev_start_xmit include/linux/netdevice.h:4853 [inline]
 netdev_start_xmit include/linux/netdevice.h:4867 [inline]
 xmit_one net/core/dev.c:3627 [inline]
 dev_hard_start_xmit+0x261/0x8c0 net/core/dev.c:3643
 __dev_queue_xmit+0x1bb1/0x3cf0 net/core/dev.c:4293
 dev_queue_xmit include/linux/netdevice.h:3021 [inline]
 neigh_hh_output include/net/neighbour.h:528 [inline]
 neigh_output include/net/neighbour.h:542 [inline]
 ip_finish_output2+0xcfc/0x1330 net/ipv4/ip_output.c:228
 dst_output include/net/dst.h:444 [inline]
 ip_local_out net/ipv4/ip_output.c:126 [inline]
 ip_send_skb+0x113/0x1a0 net/ipv4/ip_output.c:1596
 __icmp_send+0xffa/0x1530 net/ipv4/icmp.c:768
 ipv4_send_dest_unreach net/ipv4/route.c:1239 [inline]
 ipv4_link_failure+0x638/0xa10 net/ipv4/route.c:1246
 dst_link_failure include/net/dst.h:423 [inline]
 arp_error_report+0x112/0x160 net/ipv4/arp.c:296
 neigh_invalidate+0x240/0x460 net/core/neighbour.c:1059
 neigh_timer_handler+0x8a1/0x1010 net/core/neighbour.c:1146
 call_timer_fn+0x1ad/0x6b0 kernel/time/timer.c:1504
 expire_timers kernel/time/timer.c:1549 [inline]
 __run_timers+0x67c/0x890 kernel/time/timer.c:1820
 run_timer_softirq+0x63/0xf0 kernel/time/timer.c:1833
 __do_softirq+0x2e9/0xa4c kernel/softirq.c:571
 invoke_softirq kernel/softirq.c:445 [inline]
 __irq_exit_rcu+0x155/0x240 kernel/softirq.c:650
 irq_exit_rcu+0x5/0x20 kernel/softirq.c:662
 sysvec_apic_timer_interrupt+0x91/0xb0 arch/x86/kernel/apic/apic.c:1106
 </IRQ>
 <TASK>
 asm_sysvec_apic_timer_interrupt+0x16/0x20 arch/x86/include/asm/idtentry.h:653
RIP: 0010:finish_task_switch+0x1d3/0x810 kernel/sched/core.c:5124
Code: 37 0b 00 48 83 c4 08 4c 89 f7 e8 98 30 00 00 0f 1f 44 00 00 4c 89 f7 e8 7b 20 36 09 e8 36 26 32 00 fb 49 8d bc 24 f8 15 00 00 <48> 89 f8 48 c1 e8 03 49 bd 00 00 00 00 00 fc ff df 42 0f b6 04 28
RSP: 0018:ffffc90004c078a8 EFLAGS: 00000286
RAX: fccf99339da29b00 RBX: ffff888015b08034 RCX: ffffffff816acf0a
RDX: dffffc0000000000 RSI: ffffffff8aec01c0 RDI: ffff88801f8cef38
RBP: ffffc90004c078f0 R08: dffffc0000000000 R09: fffffbfff209384b
R10: 0000000000000000 R11: dffffc0000000001 R12: ffff88801f8cd940
R13: 1ffff110173276eb R14: ffff8880b993aa00 R15: ffff8880b993b758
 context_switch kernel/sched/core.c:5248 [inline]
 __schedule+0x1435/0x4550 kernel/sched/core.c:6558
 preempt_schedule_irq+0xf7/0x1c0 kernel/sched/core.c:6870
 irqentry_exit+0x53/0x80 kernel/entry/common.c:439
 asm_sysvec_apic_timer_interrupt+0x16/0x20 arch/x86/include/asm/idtentry.h:653
RIP: 0010:lockdep_enabled kernel/locking/lockdep.c:116 [inline]
RIP: 0010:lock_release+0xf6/0xa20 kernel/locking/lockdep.c:5675
Code: 01 09 0d 73 0d e8 0a bd 08 00 84 c0 0f 84 c5 05 00 00 48 c7 c0 24 be 73 8e 48 c1 e8 03 42 0f b6 04 38 84 c0 0f 85 cd 05 00 00 <83> 3d c7 35 09 0d 00 0f 84 2b 05 00 00 65 8b 05 56 f0 97 7e 85 c0
RSP: 0018:ffffc90004c07c40 EFLAGS: 00000246
RAX: 0000000000000000 RBX: 0000000000000001 RCX: ffffffff816a8827
RDX: 0000000000000000 RSI: ffffffff8b3d45c0 RDI: ffffffff8b3d4580
RBP: ffffc90004c07d70 R08: dffffc0000000000 R09: fffffbfff1ce712e
R10: 0000000000000000 R11: dffffc0000000001 R12: 1ffff92000980f94
R13: ffffffff81cdca31 R14: 1ffff92000980ffb R15: dffffc0000000000
 rseq_get_rseq_cs kernel/rseq.c:132 [inline]
 rseq_ip_fixup kernel/rseq.c:252 [inline]
 __rseq_handle_notify_resume+0x11e/0x1500 kernel/rseq.c:300
 rseq_handle_notify_resume include/linux/sched.h:2329 [inline]
 resume_user_mode_work include/linux/resume_user_mode.h:61 [inline]
 exit_to_user_mode_loop+0xb4/0x100 kernel/entry/common.c:177
 exit_to_user_mode_prepare+0xb1/0x140 kernel/entry/common.c:210
 __syscall_exit_to_user_mode_work kernel/entry/common.c:292 [inline]
 syscall_exit_to_user_mode+0x60/0x270 kernel/entry/common.c:303
 do_syscall_64+0x47/0xb0 arch/x86/entry/common.c:87
 entry_SYSCALL_64_after_hwframe+0x68/0xd2
RIP: 0033:0x7fcc8c47cf69
Code: 28 00 00 00 75 05 48 83 c4 28 c3 e8 e1 20 00 00 90 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 b0 ff ff ff f7 d8 64 89 01 48
RSP: 002b:00007fcc8d1dd178 EFLAGS: 00000246 ORIG_RAX: 00000000000000ca
RAX: 0000000000000000 RBX: 00007fcc8c5b4128 RCX: 00007fcc8c47cf69
RDX: 0000000000000000 RSI: 0000000000000080 RDI: 00007fcc8c5b4128
RBP: 00007fcc8c5b4120 R08: 00007fcc8d1dd6c0 R09: 00007fcc8d1dd6c0
R10: 0000000000000000 R11: 0000000000000246 R12: 00007fcc8c5b412c
R13: 000000000000006e R14: 00007ffce77b2380 R15: 00007ffce77b2468
 </TASK>

Crashes (1):
Time Kernel Commit Syzkaller Config Log Report Syz repro C repro VM info Assets (help?) Manager Title
2024/06/06 22:02 linux-6.1.y 88690811da69 121701b6 .config console log report info [disk image] [vmlinux] [kernel image] ci2-linux-6-1-kasan-perf INFO: rcu detected stall in neigh_timer_handler
* Struck through repros no longer work on HEAD.