INFO: rcu detected stall in xfrm_timer_handler (2)

Status: auto-closed as invalid on 2022/06/25 09:12
Reported-by: syzbot+@syzkaller.appspotmail.com
First crash: 215d, last: 214d
similar bugs (1):
Kernel | Title | Repro | Cause bisect | Fix bisect | Count | Last | Reported | Patched | Status
upstream | INFO: rcu detected stall in xfrm_timer_handler | - | - | - | 3 | 643d | 715d | 0/24 | auto-closed as invalid on 2021/05/22 19:16

Sample crash report:
rcu: INFO: rcu_preempt self-detected stall on CPU
rcu: 	0-...!: (10501 ticks this GP) idle=301/1/0x4000000000000000 softirq=49258/49258 fqs=0 
	(t=10502 jiffies g=62961 q=1695 ncpus=2)
rcu: rcu_preempt kthread timer wakeup didn't happen for 10502 jiffies! g62961 f0x0 RCU_GP_WAIT_FQS(5) ->state=0x402
rcu: 	Possible timer handling issue on cpu=0 timer-softirq=28146
rcu: rcu_preempt kthread starved for 10505 jiffies! g62961 f0x0 RCU_GP_WAIT_FQS(5) ->state=0x402 ->cpu=0
rcu: 	Unless rcu_preempt kthread gets sufficient CPU time, OOM is now expected behavior.
rcu: RCU grace-period kthread stack dump:
task:rcu_preempt     state:I stack:29096 pid:   16 ppid:     2 flags:0x00004000
Call Trace:
 <TASK>
 context_switch kernel/sched/core.c:5106 [inline]
 __schedule+0xa9a/0x4cc0 kernel/sched/core.c:6421
 schedule+0xd2/0x1f0 kernel/sched/core.c:6493
 schedule_timeout+0x14a/0x2a0 kernel/time/timer.c:1907
 rcu_gp_fqs_loop+0x1c0/0x840 kernel/rcu/tree.c:2078
 rcu_gp_kthread+0x1de/0x320 kernel/rcu/tree.c:2267
 kthread+0x2e9/0x3a0 kernel/kthread.c:376
 ret_from_fork+0x1f/0x30 arch/x86/entry/entry_64.S:298
 </TASK>
rcu: Stack dump where RCU GP kthread last ran:
NMI backtrace for cpu 0
CPU: 0 PID: 17543 Comm: btrfs-qgroup-re Not tainted 5.18.0-rc3-next-20220422-syzkaller #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 01/01/2011
Call Trace:
 <IRQ>
 __dump_stack lib/dump_stack.c:88 [inline]
 dump_stack_lvl+0xcd/0x134 lib/dump_stack.c:106
 nmi_cpu_backtrace.cold+0x47/0x144 lib/nmi_backtrace.c:111
 nmi_trigger_cpumask_backtrace+0x1e6/0x230 lib/nmi_backtrace.c:62
 trigger_single_cpu_backtrace include/linux/nmi.h:164 [inline]
 rcu_check_gp_kthread_starvation.cold+0x1fb/0x200 kernel/rcu/tree_stall.h:516
 print_cpu_stall kernel/rcu/tree_stall.h:660 [inline]
 check_cpu_stall kernel/rcu/tree_stall.h:749 [inline]
 rcu_pending kernel/rcu/tree.c:4068 [inline]
 rcu_sched_clock_irq.cold+0xfa/0x8fc kernel/rcu/tree.c:2755
 update_process_times+0x16d/0x200 kernel/time/timer.c:1811
 tick_sched_handle+0x9b/0x180 kernel/time/tick-sched.c:243
 tick_sched_timer+0xee/0x120 kernel/time/tick-sched.c:1473
 __run_hrtimer kernel/time/hrtimer.c:1685 [inline]
 __hrtimer_run_queues+0x1c0/0xe50 kernel/time/hrtimer.c:1749
 hrtimer_interrupt+0x31c/0x790 kernel/time/hrtimer.c:1811
 local_apic_timer_interrupt arch/x86/kernel/apic/apic.c:1095 [inline]
 __sysvec_apic_timer_interrupt+0x146/0x530 arch/x86/kernel/apic/apic.c:1112
 sysvec_apic_timer_interrupt+0x40/0xc0 arch/x86/kernel/apic/apic.c:1106
 asm_sysvec_apic_timer_interrupt+0x12/0x20 arch/x86/include/asm/idtentry.h:649
RIP: 0010:kvm_wait+0x98/0x100 arch/x86/kernel/kvm.c:1058
Code: fa 83 e2 07 38 d0 7f 04 84 c0 75 63 0f b6 07 40 38 c6 74 35 48 83 c4 10 c3 c3 e8 23 91 4b 00 eb 07 0f 00 2d da b1 94 08 fb f4 <48> 83 c4 10 c3 89 74 24 0c 48 89 3c 24 e8 56 8f 4b 00 8b 74 24 0c
RSP: 0018:ffffc90000007588 EFLAGS: 00000246
RAX: 0000000000000007 RBX: 0000000000000000 RCX: 1ffffffff1b73199
RDX: 0000000000000000 RSI: ffffffff81807171 RDI: ffffffff8134dffd
RBP: ffff88807e7261c8 R08: 0000000000000000 R09: 0000000000000000
R10: ffffffff81807158 R11: 0000000000000001 R12: 0000000000000000
R13: ffffed100fce4c39 R14: 0000000000000001 R15: ffff8880b9c3ae40
 pv_wait arch/x86/include/asm/paravirt.h:603 [inline]
 pv_wait_head_or_lock kernel/locking/qspinlock_paravirt.h:470 [inline]
 __pv_queued_spin_lock_slowpath+0x8c7/0xb50 kernel/locking/qspinlock.c:511
 pv_queued_spin_lock_slowpath arch/x86/include/asm/paravirt.h:591 [inline]
 queued_spin_lock_slowpath arch/x86/include/asm/qspinlock.h:51 [inline]
 queued_spin_lock include/asm-generic/qspinlock.h:85 [inline]
 do_raw_spin_lock+0x200/0x2a0 kernel/locking/spinlock_debug.c:115
 spin_lock include/linux/spinlock.h:354 [inline]
 task_lock include/linux/sched/task.h:170 [inline]
 __get_task_comm+0x23/0x50 fs/exec.c:1219
 __set_page_owner_handle mm/page_owner.c:174 [inline]
 __set_page_owner+0x253/0x380 mm/page_owner.c:192
 prep_new_page mm/page_alloc.c:2394 [inline]
 get_page_from_freelist+0xba2/0x3e00 mm/page_alloc.c:4135
 __alloc_pages+0x1b2/0x500 mm/page_alloc.c:5356
 alloc_pages+0x1aa/0x310 mm/mempolicy.c:2273
 alloc_slab_page mm/slub.c:1797 [inline]
 allocate_slab+0x26c/0x3c0 mm/slub.c:1942
 new_slab mm/slub.c:2002 [inline]
 ___slab_alloc+0x985/0xd90 mm/slub.c:3002
 __slab_alloc.constprop.0+0x4d/0xa0 mm/slub.c:3089
 slab_alloc_node mm/slub.c:3180 [inline]
 __kmalloc_node_track_caller+0x2cb/0x360 mm/slub.c:4952
 kmalloc_reserve net/core/skbuff.c:354 [inline]
 __alloc_skb+0xde/0x340 net/core/skbuff.c:426
 alloc_skb include/linux/skbuff.h:1337 [inline]
 nlmsg_new include/net/netlink.h:953 [inline]
 xfrm_exp_state_notify net/xfrm/xfrm_user.c:2988 [inline]
 xfrm_send_state_notify+0xd89/0x1d00 net/xfrm/xfrm_user.c:3153
 km_state_notify+0xa6/0x1f0 net/xfrm/xfrm_state.c:2218
 km_state_expired net/xfrm/xfrm_state.c:2232 [inline]
 xfrm_timer_handler+0x856/0xc10 net/xfrm/xfrm_state.c:621
 __run_hrtimer kernel/time/hrtimer.c:1685 [inline]
 __hrtimer_run_queues+0x609/0xe50 kernel/time/hrtimer.c:1749
 hrtimer_run_softirq+0x17b/0x360 kernel/time/hrtimer.c:1766
 __do_softirq+0x29b/0x9c2 kernel/softirq.c:558
 invoke_softirq kernel/softirq.c:432 [inline]
 __irq_exit_rcu+0x123/0x180 kernel/softirq.c:637
 irq_exit_rcu+0x5/0x20 kernel/softirq.c:649
 sysvec_apic_timer_interrupt+0x93/0xc0 arch/x86/kernel/apic/apic.c:1106
 </IRQ>
 <TASK>
 asm_sysvec_apic_timer_interrupt+0x12/0x20 arch/x86/include/asm/idtentry.h:649
RIP: 0010:__raw_spin_unlock_irqrestore include/linux/spinlock_api_smp.h:152 [inline]
RIP: 0010:_raw_spin_unlock_irqrestore+0x38/0x70 kernel/locking/spinlock.c:194
Code: 74 24 10 e8 0a 06 ed f7 48 89 ef e8 12 87 ed f7 81 e3 00 02 00 00 75 25 9c 58 f6 c4 02 75 2d 48 85 db 74 01 fb bf 01 00 00 00 <e8> e3 93 e0 f7 65 8b 05 2c 52 91 76 85 c0 74 0a 5b 5d c3 e8 80 53
RSP: 0018:ffffc9000e02fdd0 EFLAGS: 00000206
RAX: 0000000000000046 RBX: 0000000000000200 RCX: 1ffffffff1b73199
RDX: 0000000000000000 RSI: ffffffff81807171 RDI: 0000000000000001
RBP: ffff88804cc53480 R08: 0000000000000000 R09: 0000000000000000
R10: ffffffff81807158 R11: 0000000000000000 R12: ffff88804cc53478
R13: ffff88807e7261c8 R14: ffff88807e725880 R15: 0000000000000000
 complete_vfork_done kernel/fork.c:1410 [inline]
 mm_release+0x1c0/0x2c0 kernel/fork.c:1479
 exit_mm kernel/exit.c:482 [inline]
 do_exit+0x8c1/0x2a00 kernel/exit.c:782
 kthread_exit+0xbd/0xe0 kernel/kthread.c:306
 kthread+0x2a4/0x3a0 kernel/kthread.c:378
 ret_from_fork+0x1f/0x30 arch/x86/entry/entry_64.S:298
 </TASK>
NMI backtrace for cpu 0
CPU: 0 PID: 17543 Comm: btrfs-qgroup-re Not tainted 5.18.0-rc3-next-20220422-syzkaller #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 01/01/2011
Call Trace:
 <IRQ>
 __dump_stack lib/dump_stack.c:88 [inline]
 dump_stack_lvl+0xcd/0x134 lib/dump_stack.c:106
 nmi_cpu_backtrace.cold+0x47/0x144 lib/nmi_backtrace.c:111
 nmi_trigger_cpumask_backtrace+0x1e6/0x230 lib/nmi_backtrace.c:62
 trigger_single_cpu_backtrace include/linux/nmi.h:164 [inline]
 rcu_dump_cpu_stacks+0x262/0x3f0 kernel/rcu/tree_stall.h:369
 print_cpu_stall kernel/rcu/tree_stall.h:665 [inline]
 check_cpu_stall kernel/rcu/tree_stall.h:749 [inline]
 rcu_pending kernel/rcu/tree.c:4068 [inline]
 rcu_sched_clock_irq.cold+0x144/0x8fc kernel/rcu/tree.c:2755
 update_process_times+0x16d/0x200 kernel/time/timer.c:1811
 tick_sched_handle+0x9b/0x180 kernel/time/tick-sched.c:243
 tick_sched_timer+0xee/0x120 kernel/time/tick-sched.c:1473
 __run_hrtimer kernel/time/hrtimer.c:1685 [inline]
 __hrtimer_run_queues+0x1c0/0xe50 kernel/time/hrtimer.c:1749
 hrtimer_interrupt+0x31c/0x790 kernel/time/hrtimer.c:1811
 local_apic_timer_interrupt arch/x86/kernel/apic/apic.c:1095 [inline]
 __sysvec_apic_timer_interrupt+0x146/0x530 arch/x86/kernel/apic/apic.c:1112
 sysvec_apic_timer_interrupt+0x40/0xc0 arch/x86/kernel/apic/apic.c:1106
 asm_sysvec_apic_timer_interrupt+0x12/0x20 arch/x86/include/asm/idtentry.h:649
RIP: 0010:kvm_wait+0x98/0x100 arch/x86/kernel/kvm.c:1058
Code: fa 83 e2 07 38 d0 7f 04 84 c0 75 63 0f b6 07 40 38 c6 74 35 48 83 c4 10 c3 c3 e8 23 91 4b 00 eb 07 0f 00 2d da b1 94 08 fb f4 <48> 83 c4 10 c3 89 74 24 0c 48 89 3c 24 e8 56 8f 4b 00 8b 74 24 0c
RSP: 0018:ffffc90000007588 EFLAGS: 00000246
RAX: 0000000000000007 RBX: 0000000000000000 RCX: 1ffffffff1b73199
RDX: 0000000000000000 RSI: ffffffff81807171 RDI: ffffffff8134dffd
RBP: ffff88807e7261c8 R08: 0000000000000000 R09: 0000000000000000
R10: ffffffff81807158 R11: 0000000000000001 R12: 0000000000000000
R13: ffffed100fce4c39 R14: 0000000000000001 R15: ffff8880b9c3ae40
 pv_wait arch/x86/include/asm/paravirt.h:603 [inline]
 pv_wait_head_or_lock kernel/locking/qspinlock_paravirt.h:470 [inline]
 __pv_queued_spin_lock_slowpath+0x8c7/0xb50 kernel/locking/qspinlock.c:511
 pv_queued_spin_lock_slowpath arch/x86/include/asm/paravirt.h:591 [inline]
 queued_spin_lock_slowpath arch/x86/include/asm/qspinlock.h:51 [inline]
 queued_spin_lock include/asm-generic/qspinlock.h:85 [inline]
 do_raw_spin_lock+0x200/0x2a0 kernel/locking/spinlock_debug.c:115
 spin_lock include/linux/spinlock.h:354 [inline]
 task_lock include/linux/sched/task.h:170 [inline]
 __get_task_comm+0x23/0x50 fs/exec.c:1219
 __set_page_owner_handle mm/page_owner.c:174 [inline]
 __set_page_owner+0x253/0x380 mm/page_owner.c:192
 prep_new_page mm/page_alloc.c:2394 [inline]
 get_page_from_freelist+0xba2/0x3e00 mm/page_alloc.c:4135
 __alloc_pages+0x1b2/0x500 mm/page_alloc.c:5356
 alloc_pages+0x1aa/0x310 mm/mempolicy.c:2273
 alloc_slab_page mm/slub.c:1797 [inline]
 allocate_slab+0x26c/0x3c0 mm/slub.c:1942
 new_slab mm/slub.c:2002 [inline]
 ___slab_alloc+0x985/0xd90 mm/slub.c:3002
 __slab_alloc.constprop.0+0x4d/0xa0 mm/slub.c:3089
 slab_alloc_node mm/slub.c:3180 [inline]
 __kmalloc_node_track_caller+0x2cb/0x360 mm/slub.c:4952
 kmalloc_reserve net/core/skbuff.c:354 [inline]
 __alloc_skb+0xde/0x340 net/core/skbuff.c:426
 alloc_skb include/linux/skbuff.h:1337 [inline]
 nlmsg_new include/net/netlink.h:953 [inline]
 xfrm_exp_state_notify net/xfrm/xfrm_user.c:2988 [inline]
 xfrm_send_state_notify+0xd89/0x1d00 net/xfrm/xfrm_user.c:3153
 km_state_notify+0xa6/0x1f0 net/xfrm/xfrm_state.c:2218
 km_state_expired net/xfrm/xfrm_state.c:2232 [inline]
 xfrm_timer_handler+0x856/0xc10 net/xfrm/xfrm_state.c:621
 __run_hrtimer kernel/time/hrtimer.c:1685 [inline]
 __hrtimer_run_queues+0x609/0xe50 kernel/time/hrtimer.c:1749
 hrtimer_run_softirq+0x17b/0x360 kernel/time/hrtimer.c:1766
 __do_softirq+0x29b/0x9c2 kernel/softirq.c:558
 invoke_softirq kernel/softirq.c:432 [inline]
 __irq_exit_rcu+0x123/0x180 kernel/softirq.c:637
 irq_exit_rcu+0x5/0x20 kernel/softirq.c:649
 sysvec_apic_timer_interrupt+0x93/0xc0 arch/x86/kernel/apic/apic.c:1106
 </IRQ>
 <TASK>
 asm_sysvec_apic_timer_interrupt+0x12/0x20 arch/x86/include/asm/idtentry.h:649
RIP: 0010:__raw_spin_unlock_irqrestore include/linux/spinlock_api_smp.h:152 [inline]
RIP: 0010:_raw_spin_unlock_irqrestore+0x38/0x70 kernel/locking/spinlock.c:194
Code: 74 24 10 e8 0a 06 ed f7 48 89 ef e8 12 87 ed f7 81 e3 00 02 00 00 75 25 9c 58 f6 c4 02 75 2d 48 85 db 74 01 fb bf 01 00 00 00 <e8> e3 93 e0 f7 65 8b 05 2c 52 91 76 85 c0 74 0a 5b 5d c3 e8 80 53
RSP: 0018:ffffc9000e02fdd0 EFLAGS: 00000206
RAX: 0000000000000046 RBX: 0000000000000200 RCX: 1ffffffff1b73199
RDX: 0000000000000000 RSI: ffffffff81807171 RDI: 0000000000000001
RBP: ffff88804cc53480 R08: 0000000000000000 R09: 0000000000000000
R10: ffffffff81807158 R11: 0000000000000000 R12: ffff88804cc53478
R13: ffff88807e7261c8 R14: ffff88807e725880 R15: 0000000000000000
 complete_vfork_done kernel/fork.c:1410 [inline]
 mm_release+0x1c0/0x2c0 kernel/fork.c:1479
 exit_mm kernel/exit.c:482 [inline]
 do_exit+0x8c1/0x2a00 kernel/exit.c:782
 kthread_exit+0xbd/0xe0 kernel/kthread.c:306
 kthread+0x2a4/0x3a0 kernel/kthread.c:378
 ret_from_fork+0x1f/0x30 arch/x86/entry/entry_64.S:298
 </TASK>
----------------
Code disassembly (best guess):
   0:	fa                   	cli
   1:	83 e2 07             	and    $0x7,%edx
   4:	38 d0                	cmp    %dl,%al
   6:	7f 04                	jg     0xc
   8:	84 c0                	test   %al,%al
   a:	75 63                	jne    0x6f
   c:	0f b6 07             	movzbl (%rdi),%eax
   f:	40 38 c6             	cmp    %al,%sil
  12:	74 35                	je     0x49
  14:	48 83 c4 10          	add    $0x10,%rsp
  18:	c3                   	retq
  19:	c3                   	retq
  1a:	e8 23 91 4b 00       	callq  0x4b9142
  1f:	eb 07                	jmp    0x28
  21:	0f 00 2d da b1 94 08 	verw   0x894b1da(%rip)        # 0x894b202
  28:	fb                   	sti
  29:	f4                   	hlt
* 2a:	48 83 c4 10          	add    $0x10,%rsp <-- trapping instruction
  2e:	c3                   	retq
  2f:	89 74 24 0c          	mov    %esi,0xc(%rsp)
  33:	48 89 3c 24          	mov    %rdi,(%rsp)
  37:	e8 56 8f 4b 00       	callq  0x4b8f92
  3c:	8b 74 24 0c          	mov    0xc(%rsp),%esi

Crashes (2):
Manager | Time | Kernel | Commit | Syzkaller | Config | Log | Report | Syz repro | C repro | VM info | Title
ci-upstream-linux-next-kasan-gce-root | 2022/04/26 09:10 | linux-next | e7d6987e09a3 | 1fa34c1b | .config | log | report | - | - | info | INFO: rcu detected stall in xfrm_timer_handler
ci-upstream-linux-next-kasan-gce-root | 2022/04/25 19:57 | linux-next | e7d6987e09a3 | 152baedd | .config | log | report | - | - | info | INFO: rcu detected stall in xfrm_timer_handler