syzbot


BUG: soft lockup in tcp_write_timer (2)

Status: auto-closed as invalid on 2022/06/24 22:31
Subsystems: kvm
First crash: 724d, last: 724d
Similar bugs (8)
Kernel | Title | Repro | Cause bisect | Fix bisect | Count | Last | Reported | Patched | Status
linux-4.19 | BUG: soft lockup in tcp_write_timer (3) | | | | 2 | 475d | 488d | 0/1 | upstream: reported on 2022/12/17 21:41
linux-4.14 | BUG: soft lockup in tcp_write_timer | | | | 2 | 1442d | 1521d | 0/1 | auto-closed as invalid on 2020/09/05 12:42
linux-4.19 | BUG: soft lockup in tcp_write_timer (2) | | | | 2 | 885d | 948d | 0/1 | auto-closed as invalid on 2022/03/16 10:56
linux-4.19 | BUG: soft lockup in tcp_write_timer | | | | 1 | 1114d | 1114d | 0/1 | auto-closed as invalid on 2021/07/30 14:52
upstream | BUG: soft lockup in tcp_write_timer [net] | | | | 11 | 1721d | 1729d | 0/26 | auto-closed as invalid on 2019/10/25 14:11
upstream | BUG: soft lockup in tcp_write_timer (3) [net] | | | | 6 | 101d | 209d | 0/26 | closed as invalid on 2024/03/18 17:07
upstream | INFO: rcu detected stall in tcp_write_timer (2) [bpf] | | | | 2 | 1172d | 1227d | 0/26 | auto-closed as invalid on 2021/05/03 11:59
upstream | INFO: rcu detected stall in tcp_write_timer (3) [net] | | | | 1 | 1039d | 1039d | 0/26 | auto-closed as invalid on 2021/09/13 13:17

Sample crash report:
watchdog: BUG: soft lockup - CPU#0 stuck for 143s! [swapper/0:0]
Modules linked in:
irq event stamp: 10968
hardirqs last  enabled at (10967): [<ffffffff816e53b1>] tick_nohz_idle_exit+0x161/0x4a0 kernel/time/tick-sched.c:1350
hardirqs last disabled at (10968): [<ffffffff896f9da4>] __schedule+0x1974/0x4cc0 kernel/sched/core.c:6317
softirqs last  enabled at (10938): [<ffffffff81483453>] invoke_softirq kernel/softirq.c:432 [inline]
softirqs last  enabled at (10938): [<ffffffff81483453>] __irq_exit_rcu+0x123/0x180 kernel/softirq.c:637
softirqs last disabled at (10895): [<ffffffff81483453>] invoke_softirq kernel/softirq.c:432 [inline]
softirqs last disabled at (10895): [<ffffffff81483453>] __irq_exit_rcu+0x123/0x180 kernel/softirq.c:637
CPU: 0 PID: 0 Comm: swapper/0 Not tainted 5.18.0-rc3-next-20220422-syzkaller #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 01/01/2011
RIP: 0010:kvm_wait+0x98/0x100 arch/x86/kernel/kvm.c:1058
Code: fa 83 e2 07 38 d0 7f 04 84 c0 75 63 0f b6 07 40 38 c6 74 35 48 83 c4 10 c3 c3 e8 23 91 4b 00 eb 07 0f 00 2d da b1 94 08 fb f4 <48> 83 c4 10 c3 89 74 24 0c 48 89 3c 24 e8 56 8f 4b 00 8b 74 24 0c
RSP: 0018:ffffc90000007b80 EFLAGS: 00000246
RAX: 0000000000000007 RBX: 0000000000000000 RCX: 1ffffffff1b73199
RDX: 0000000000000000 RSI: ffffffff81807171 RDI: ffffffff8134dffd
RBP: ffff888076ba9898 R08: 0000000000000000 R09: 0000000000000000
R10: ffffffff81807158 R11: 0000000000000000 R12: 0000000000000000
R13: ffffed100ed75313 R14: 0000000000000001 R15: ffff8880b9c3ae40
FS:  0000000000000000(0000) GS:ffff8880b9c00000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 000000c002b68000 CR3: 000000007a8f5000 CR4: 00000000003506f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Call Trace:
 <IRQ>
 pv_wait arch/x86/include/asm/paravirt.h:603 [inline]
 pv_wait_head_or_lock kernel/locking/qspinlock_paravirt.h:470 [inline]
 __pv_queued_spin_lock_slowpath+0x8c7/0xb50 kernel/locking/qspinlock.c:511
 pv_queued_spin_lock_slowpath arch/x86/include/asm/paravirt.h:591 [inline]
 queued_spin_lock_slowpath arch/x86/include/asm/qspinlock.h:51 [inline]
 queued_spin_lock include/asm-generic/qspinlock.h:85 [inline]
 do_raw_spin_lock+0x200/0x2a0 kernel/locking/spinlock_debug.c:115
 spin_lock include/linux/spinlock.h:354 [inline]
 tcp_write_timer+0x26/0x2b0 net/ipv4/tcp_timer.c:640
 call_timer_fn+0x1a5/0x6b0 kernel/time/timer.c:1444
 expire_timers kernel/time/timer.c:1489 [inline]
 __run_timers.part.0+0x679/0xa80 kernel/time/timer.c:1760
 __run_timers kernel/time/timer.c:1738 [inline]
 run_timer_softirq+0xb3/0x1d0 kernel/time/timer.c:1773
 __do_softirq+0x29b/0x9c2 kernel/softirq.c:558
 invoke_softirq kernel/softirq.c:432 [inline]
 __irq_exit_rcu+0x123/0x180 kernel/softirq.c:637
 irq_exit_rcu+0x5/0x20 kernel/softirq.c:649
 sysvec_apic_timer_interrupt+0x93/0xc0 arch/x86/kernel/apic/apic.c:1106
 </IRQ>
 <TASK>
 asm_sysvec_apic_timer_interrupt+0x12/0x20 arch/x86/include/asm/idtentry.h:649
RIP: 0010:native_save_fl arch/x86/include/asm/irqflags.h:29 [inline]
RIP: 0010:arch_local_save_flags arch/x86/include/asm/irqflags.h:70 [inline]
RIP: 0010:arch_irqs_disabled arch/x86/include/asm/irqflags.h:130 [inline]
RIP: 0010:acpi_safe_halt drivers/acpi/processor_idle.c:117 [inline]
RIP: 0010:acpi_idle_do_entry+0x1c6/0x250 drivers/acpi/processor_idle.c:559
Code: 89 de e8 5d 0c 09 f8 84 db 75 ac e8 74 08 09 f8 e8 6f 5a 0f f8 eb 0c e8 68 08 09 f8 0f 00 2d a1 0d c2 00 e8 5c 08 09 f8 fb f4 <9c> 5b 81 e3 00 02 00 00 fa 31 ff 48 89 de e8 d7 0a 09 f8 48 85 db
RSP: 0018:ffffffff8ba07d38 EFLAGS: 00000293
RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
RDX: ffffffff8babc880 RSI: ffffffff897116c4 RDI: ffffffff897116b1
RBP: ffff8880162f7864 R08: 0000000000000000 R09: 0000000000000000
R10: ffffffff81807158 R11: 0000000000000000 R12: 0000000000000001
R13: ffff8880162f7800 R14: ffff8880162f7864 R15: ffff8881471e1004
 acpi_idle_enter+0x369/0x510 drivers/acpi/processor_idle.c:696
 cpuidle_enter_state+0x1b1/0xc80 drivers/cpuidle/cpuidle.c:237
 cpuidle_enter+0x4a/0xa0 drivers/cpuidle/cpuidle.c:351
 call_cpuidle kernel/sched/idle.c:155 [inline]
 cpuidle_idle_call kernel/sched/idle.c:236 [inline]
 do_idle+0x3e8/0x590 kernel/sched/idle.c:303
 cpu_startup_entry+0x14/0x20 kernel/sched/idle.c:400
 rest_init+0x169/0x270 init/main.c:728
 arch_call_rest_init+0xf/0x14 init/main.c:885
 start_kernel+0x47f/0x4a0 init/main.c:1143
 secondary_startup_64_no_verify+0xce/0xdb
 </TASK>
Sending NMI from CPU 0 to CPUs 1:
NMI backtrace for cpu 1
CPU: 1 PID: 3541 Comm: syz-fuzzer Not tainted 5.18.0-rc3-next-20220422-syzkaller #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 01/01/2011
RIP: 0010:kvm_wait+0x98/0x100 arch/x86/kernel/kvm.c:1058
Code: fa 83 e2 07 38 d0 7f 04 84 c0 75 63 0f b6 07 40 38 c6 74 35 48 83 c4 10 c3 c3 e8 23 91 4b 00 eb 07 0f 00 2d da b1 94 08 fb f4 <48> 83 c4 10 c3 89 74 24 0c 48 89 3c 24 e8 56 8f 4b 00 8b 74 24 0c
RSP: 0000:ffffc90000ddfd20 EFLAGS: 00000246
RAX: 0000000000000007 RBX: 0000000000000000 RCX: 1ffffffff1b73199
RDX: 0000000000000000 RSI: ffffffff81807171 RDI: ffffffff8134dffd
RBP: ffff88802431a6c8 R08: 0000000000000000 R09: 0000000000000000
R10: ffffffff81807158 R11: 0000000000000001 R12: 0000000000000000
R13: ffffed10048634d9 R14: 0000000000000001 R15: ffff8880b9d3ae40
FS:  000000c000032890(0000) GS:ffff8880b9d00000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 000000c002b80000 CR3: 000000007a8f5000 CR4: 00000000003506e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Call Trace:
 <IRQ>
 pv_wait arch/x86/include/asm/paravirt.h:603 [inline]
 pv_wait_head_or_lock kernel/locking/qspinlock_paravirt.h:470 [inline]
 __pv_queued_spin_lock_slowpath+0x8c7/0xb50 kernel/locking/qspinlock.c:511
 pv_queued_spin_lock_slowpath arch/x86/include/asm/paravirt.h:591 [inline]
 queued_spin_lock_slowpath arch/x86/include/asm/qspinlock.h:51 [inline]
 queued_spin_lock include/asm-generic/qspinlock.h:85 [inline]
 do_raw_spin_lock+0x200/0x2a0 kernel/locking/spinlock_debug.c:115
 spin_lock include/linux/spinlock.h:354 [inline]
 task_lock include/linux/sched/task.h:170 [inline]
 __get_task_comm+0x23/0x50 fs/exec.c:1219
 __set_page_owner_handle mm/page_owner.c:174 [inline]
 __set_page_owner+0x253/0x380 mm/page_owner.c:192
 prep_new_page mm/page_alloc.c:2394 [inline]
 get_page_from_freelist+0xba2/0x3e00 mm/page_alloc.c:4135
 __alloc_pages+0x1b2/0x500 mm/page_alloc.c:5356
 alloc_pages+0x1aa/0x310 mm/mempolicy.c:2273
 alloc_slab_page mm/slub.c:1797 [inline]
 allocate_slab+0x26c/0x3c0 mm/slub.c:1942
 new_slab mm/slub.c:2002 [inline]
 ___slab_alloc+0x985/0xd90 mm/slub.c:3002
 __slab_alloc.constprop.0+0x4d/0xa0 mm/slub.c:3089
 slab_alloc_node mm/slub.c:3180 [inline]
 kmem_cache_alloc_node+0x122/0x3f0 mm/slub.c:3264
 __alloc_skb+0x215/0x340 net/core/skbuff.c:414
 alloc_skb include/linux/skbuff.h:1337 [inline]
 __tcp_send_ack.part.0+0x67/0x760 net/ipv4/tcp_output.c:3944
 __tcp_send_ack net/ipv4/tcp_output.c:3976 [inline]
 tcp_send_ack+0x7d/0xa0 net/ipv4/tcp_output.c:3976
 __tcp_ack_snd_check+0x156/0x9c0 net/ipv4/tcp_input.c:5493
 tcp_rcv_established+0x18cc/0x2030 net/ipv4/tcp_input.c:5935
 tcp_v4_do_rcv+0x66c/0x980 net/ipv4/tcp_ipv4.c:1660
 tcp_v4_rcv+0x380f/0x3bc0 net/ipv4/tcp_ipv4.c:2074
 ip_protocol_deliver_rcu+0xa3/0xb30 net/ipv4/ip_input.c:205
 ip_local_deliver_finish+0x2ee/0x4c0 net/ipv4/ip_input.c:233
 NF_HOOK include/linux/netfilter.h:307 [inline]
 NF_HOOK include/linux/netfilter.h:301 [inline]
 ip_local_deliver+0x1b3/0x200 net/ipv4/ip_input.c:254
 dst_input include/net/dst.h:461 [inline]
 ip_sublist_rcv_finish+0x9a/0x2c0 net/ipv4/ip_input.c:568
 ip_list_rcv_finish net/ipv4/ip_input.c:618 [inline]
 ip_sublist_rcv+0x538/0x990 net/ipv4/ip_input.c:626
 ip_list_rcv+0x31a/0x470 net/ipv4/ip_input.c:661
 __netif_receive_skb_list_ptype net/core/dev.c:5478 [inline]
 __netif_receive_skb_list_core+0x548/0x8f0 net/core/dev.c:5526
 __netif_receive_skb_list net/core/dev.c:5578 [inline]
 netif_receive_skb_list_internal+0x75e/0xd80 net/core/dev.c:5669
 gro_normal_list include/net/gro.h:430 [inline]
 gro_normal_list include/net/gro.h:426 [inline]
 napi_complete_done+0x1f1/0x880 net/core/dev.c:6010
 virtqueue_napi_complete drivers/net/virtio_net.c:366 [inline]
 virtnet_poll+0xc9b/0x11a0 drivers/net/virtio_net.c:1613
 __napi_poll+0xb3/0x6e0 net/core/dev.c:6443
 napi_poll net/core/dev.c:6510 [inline]
 net_rx_action+0x8ec/0xc60 net/core/dev.c:6597
 __do_softirq+0x29b/0x9c2 kernel/softirq.c:558
 invoke_softirq kernel/softirq.c:432 [inline]
 __irq_exit_rcu+0x123/0x180 kernel/softirq.c:637
 irq_exit_rcu+0x5/0x20 kernel/softirq.c:649
 common_interrupt+0xa9/0xc0 arch/x86/kernel/irq.c:240
 </IRQ>
 <TASK>
 asm_common_interrupt+0x1e/0x40 arch/x86/include/asm/idtentry.h:640
RIP: 0010:arch_atomic_read arch/x86/include/asm/atomic.h:29 [inline]
RIP: 0010:atomic_read include/linux/atomic/atomic-instrumented.h:28 [inline]
RIP: 0010:queued_spin_is_locked include/asm-generic/qspinlock.h:28 [inline]
RIP: 0010:debug_spin_unlock kernel/locking/spinlock_debug.c:100 [inline]
RIP: 0010:do_raw_spin_unlock+0x61/0x230 kernel/locking/spinlock_debug.c:140
Code: 7d 04 ad 4e ad de 0f 85 73 01 00 00 be 04 00 00 00 48 89 ef e8 a0 08 68 00 48 89 ea 48 b8 00 00 00 00 00 fc ff df 48 c1 ea 03 <0f> b6 14 02 48 89 e8 83 e0 07 83 c0 03 38 d0 7c 08 84 d2 0f 85 6a
RSP: 0000:ffffc90002e1f940 EFLAGS: 00000a02
RAX: dffffc0000000000 RBX: ffff888024319d80 RCX: ffffffff815ea4d0
RDX: 1ffff110048634d9 RSI: 0000000000000004 RDI: ffff88802431a6c8
RBP: ffff88802431a6c8 R08: 0000000000000000 R09: ffff88802431a6cb
R10: ffffed10048634d9 R11: 0000000000000001 R12: ffff8880137362a8
R13: 0000000000000010 R14: ffff888013736288 R15: 0000000000000007
 __raw_spin_unlock include/linux/spinlock_api_smp.h:142 [inline]
 _raw_spin_unlock+0x1a/0x40 kernel/locking/spinlock.c:186
 spin_unlock include/linux/spinlock.h:394 [inline]
 task_unlock include/linux/sched/task.h:175 [inline]
 __get_task_comm+0x3d/0x50 fs/exec.c:1222
 __set_page_owner_handle mm/page_owner.c:174 [inline]
 __set_page_owner+0x253/0x380 mm/page_owner.c:192
 prep_new_page mm/page_alloc.c:2394 [inline]
 get_page_from_freelist+0xba2/0x3e00 mm/page_alloc.c:4135
 __alloc_pages+0x1b2/0x500 mm/page_alloc.c:5356
 alloc_pages_vma+0xf9/0x770 mm/mempolicy.c:2221
 do_anonymous_page mm/memory.c:4077 [inline]
 handle_pte_fault mm/memory.c:4902 [inline]
 __handle_mm_fault+0x1a24/0x33d0 mm/memory.c:5043
 handle_mm_fault+0x1c8/0x790 mm/memory.c:5141
 do_user_addr_fault+0x489/0x11c0 arch/x86/mm/fault.c:1397
 handle_page_fault arch/x86/mm/fault.c:1484 [inline]
 exc_page_fault+0x9e/0x180 arch/x86/mm/fault.c:1540
 asm_exc_page_fault+0x1e/0x30 arch/x86/include/asm/idtentry.h:570
RIP: 0033:0x465d5c
Code: 4c 01 de 48 29 c3 c5 fe 6f 06 c5 fe 6f 4e 20 c5 fe 6f 56 40 c5 fe 6f 5e 60 48 01 c6 c5 fd 7f 07 c5 fd 7f 4f 20 c5 fd 7f 57 40 <c5> fd 7f 5f 60 48 01 c7 48 29 c3 77 cf 48 01 c3 48 01 fb c4 c1 7e
RSP: 002b:000000c00059bc08 EFLAGS: 00010202
RAX: 0000000000000080 RBX: 000000000000007b RCX: 000000c000548000
RDX: 000000c000540000 RSI: 000000c000547f85 RDI: 000000c002b7ffa0
RBP: 000000c00059bc38 R08: 000000c000540000 R09: 0000000000007b89
R10: 000000c002b7809b R11: 0000000000000005 R12: 0000000000007ffd
R13: 0000000000007ff3 R14: 000000c000262ea0 R15: 0000000000000002
 </TASK>
----------------
Code disassembly (best guess):
   0:	fa                   	cli
   1:	83 e2 07             	and    $0x7,%edx
   4:	38 d0                	cmp    %dl,%al
   6:	7f 04                	jg     0xc
   8:	84 c0                	test   %al,%al
   a:	75 63                	jne    0x6f
   c:	0f b6 07             	movzbl (%rdi),%eax
   f:	40 38 c6             	cmp    %al,%sil
  12:	74 35                	je     0x49
  14:	48 83 c4 10          	add    $0x10,%rsp
  18:	c3                   	retq
  19:	c3                   	retq
  1a:	e8 23 91 4b 00       	callq  0x4b9142
  1f:	eb 07                	jmp    0x28
  21:	0f 00 2d da b1 94 08 	verw   0x894b1da(%rip)        # 0x894b202
  28:	fb                   	sti
  29:	f4                   	hlt
* 2a:	48 83 c4 10          	add    $0x10,%rsp <-- trapping instruction
  2e:	c3                   	retq
  2f:	89 74 24 0c          	mov    %esi,0xc(%rsp)
  33:	48 89 3c 24          	mov    %rdi,(%rsp)
  37:	e8 56 8f 4b 00       	callq  0x4b8f92
  3c:	8b 74 24 0c          	mov    0xc(%rsp),%esi

Crashes (1):
Time | Kernel | Commit | Syzkaller | Config | Log | Report | Syz repro | C repro | VM info | Assets | Manager | Title
2022/04/25 22:26 | linux-next | e7d6987e09a3 | 152baedd | .config | console log | report | | | info | | ci-upstream-linux-next-kasan-gce-root | BUG: soft lockup in tcp_write_timer