syzbot


INFO: rcu detected stall in update_balloon_stats_func

Status: auto-obsoleted due to no activity on 2023/12/19 05:20
Subsystems: virt
First crash: 676d, last: 676d
Similar bugs (1)
Kernel: upstream
Title: INFO: rcu detected stall in update_balloon_stats_func (2)
Subsystems: virt
Rank: 1
Repro: -
Cause bisect: -
Fix bisect: -
Count: 1
Last: 393d
Reported: 393d
Patched: 0/29
Status: auto-obsoleted due to no activity on 2024/10/27 14:54

Sample crash report:
rcu: INFO: rcu_preempt detected stalls on CPUs/tasks:
rcu: 	1-...!: (0 ticks this GP) idle=2a2c/1/0x4000000000000000 softirq=99422/99422 fqs=0
rcu: 	(detected by 0, t=10506 jiffies, g=189545, q=146 ncpus=2)
Sending NMI from CPU 0 to CPUs 1:
NMI backtrace for cpu 1
CPU: 1 PID: 15426 Comm: kworker/1:7 Not tainted 6.6.0-rc6-next-20231019-syzkaller #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 09/06/2023
Workqueue: events_freezable update_balloon_stats_func
RIP: 0010:lockdep_recursion_finish kernel/locking/lockdep.c:467 [inline]
RIP: 0010:lock_is_held_type+0xe8/0x140 kernel/locking/lockdep.c:5825
Code: f6 43 22 03 0f 95 c0 45 31 ed 44 39 f0 41 0f 94 c5 48 c7 c7 a0 c2 8c 8a e8 c5 0f 00 00 b8 ff ff ff ff 65 0f c1 05 b8 ff b9 75 <83> f8 01 75 29 9c 58 f6 c4 02 75 3d 48 f7 04 24 00 02 00 00 74 01
RSP: 0018:ffffc900001f0d40 EFLAGS: 00000057
RAX: 0000000000000001 RBX: ffff88801dc64690 RCX: 0000000000000001
RDX: 0000000000000000 RSI: ffffffff8a8cc2a0 RDI: ffffffff8ae98ee0
RBP: ffff888143ac3300 R08: 0000000000000005 R09: 0000000000000000
R10: 0000000000000001 R11: ffffc900001f0ff8 R12: ffff88801dc63b80
R13: 0000000000000001 R14: 00000000ffffffff R15: 0000000000000002
FS:  0000000000000000(0000) GS:ffff8880b9900000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000020c86030 CR3: 0000000051707000 CR4: 00000000003506f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Call Trace:
 <NMI>
 </NMI>
 <IRQ>
 lock_is_held include/linux/lockdep.h:288 [inline]
 advance_sched+0x700/0xc60 net/sched/sch_taprio.c:940
 __run_hrtimer kernel/time/hrtimer.c:1688 [inline]
 __hrtimer_run_queues+0x20c/0xc00 kernel/time/hrtimer.c:1752
 hrtimer_interrupt+0x31b/0x800 kernel/time/hrtimer.c:1814
 local_apic_timer_interrupt arch/x86/kernel/apic/apic.c:1065 [inline]
 __sysvec_apic_timer_interrupt+0x10c/0x400 arch/x86/kernel/apic/apic.c:1082
 sysvec_apic_timer_interrupt+0x8e/0xc0 arch/x86/kernel/apic/apic.c:1076
 </IRQ>
 <TASK>
 asm_sysvec_apic_timer_interrupt+0x1a/0x20 arch/x86/include/asm/idtentry.h:645
RIP: 0010:iowrite16+0x5b/0xb0 lib/iomap.c:214
Code: bb f4 58 fd e8 b6 f4 58 fd 48 89 de bf 00 00 01 00 e8 39 f0 58 fd 48 81 fb 00 00 01 00 76 12 e8 9b f4 58 fd 89 e8 89 da 66 ef <5b> 5d e9 8e f4 58 fd e8 89 f4 58 fd 8b 2d 13 fe 27 09 31 ff 89 ee
RSP: 0018:ffffc90003857bc0 EFLAGS: 00000293
RAX: 0000000000000002 RBX: 000000000001c090 RCX: ffffffff842fefa7
RDX: 000000000001c090 RSI: ffffffff842fefb5 RDI: 0000000000000007
RBP: 0000000000000002 R08: 0000000000000007 R09: 0000000000010000
R10: 000000000001c090 R11: 0000000000000001 R12: ffff888147a7e243
R13: ffff888017338000 R14: 000000000000000a R15: ffff8880b993bec0
 vp_notify+0x5a/0x80 drivers/virtio/virtio_pci_common.c:45
 virtqueue_notify drivers/virtio/virtio_ring.c:2370 [inline]
 virtqueue_kick+0xa7/0x110 drivers/virtio/virtio_ring.c:2393
 stats_handle_request drivers/virtio/virtio_balloon.c:386 [inline]
 update_balloon_stats_func+0x119/0x170 drivers/virtio/virtio_balloon.c:469
 process_one_work+0x8a2/0x15e0 kernel/workqueue.c:2630
 process_scheduled_works kernel/workqueue.c:2703 [inline]
 worker_thread+0x8b6/0x1280 kernel/workqueue.c:2784
 kthread+0x337/0x440 kernel/kthread.c:388
 ret_from_fork+0x45/0x80 arch/x86/kernel/process.c:147
 ret_from_fork_asm+0x11/0x20 arch/x86/entry/entry_64.S:242
 </TASK>
rcu: rcu_preempt kthread starved for 10506 jiffies! g189545 f0x0 RCU_GP_WAIT_FQS(5) ->state=0x0 ->cpu=0
rcu: 	Unless rcu_preempt kthread gets sufficient CPU time, OOM is now expected behavior.
rcu: RCU grace-period kthread stack dump:
task:rcu_preempt     state:R  running task     stack:28128 pid:17    tgid:17    ppid:2      flags:0x00004000
Call Trace:
 <TASK>
 context_switch kernel/sched/core.c:5369 [inline]
 __schedule+0xee3/0x5a10 kernel/sched/core.c:6680
 __schedule_loop kernel/sched/core.c:6757 [inline]
 schedule+0xe5/0x270 kernel/sched/core.c:6772
 schedule_timeout+0x156/0x2b0 kernel/time/timer.c:2167
 rcu_gp_fqs_loop+0x1eb/0xb00 kernel/rcu/tree.c:1626
 rcu_gp_kthread+0x243/0x380 kernel/rcu/tree.c:1825
 kthread+0x337/0x440 kernel/kthread.c:388
 ret_from_fork+0x45/0x80 arch/x86/kernel/process.c:147
 ret_from_fork_asm+0x11/0x20 arch/x86/entry/entry_64.S:242
 </TASK>
rcu: Stack dump where RCU GP kthread last ran:
CPU: 0 PID: 28048 Comm: kworker/u4:9 Not tainted 6.6.0-rc6-next-20231019-syzkaller #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 09/06/2023
Workqueue: events_unbound toggle_allocation_gate
RIP: 0010:csd_lock_wait kernel/smp.c:311 [inline]
RIP: 0010:smp_call_function_many_cond+0x4c1/0x1560 kernel/smp.c:855
Code: 0b 00 85 ed 74 4d 48 b8 00 00 00 00 00 fc ff df 4d 89 f4 4c 89 f5 49 c1 ec 03 83 e5 07 49 01 c4 83 c5 03 e8 51 c4 0b 00 f3 90 <41> 0f b6 04 24 40 38 c5 7c 08 84 c0 0f 85 4b 0e 00 00 8b 43 08 31
RSP: 0018:ffffc90016ae7910 EFLAGS: 00000293
RAX: 0000000000000000 RBX: ffff8880b9941aa0 RCX: ffffffff817d2025
RDX: ffff888051b23b80 RSI: ffffffff817d1fff RDI: 0000000000000005
RBP: 0000000000000003 R08: 0000000000000005 R09: 0000000000000000
R10: 0000000000000001 R11: 0000000000000000 R12: ffffed1017328355
R13: 0000000000000001 R14: ffff8880b9941aa8 R15: ffff8880b983d900
FS:  0000000000000000(0000) GS:ffff8880b9800000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000020094030 CR3: 000000000c977000 CR4: 00000000003506f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Call Trace:
 <IRQ>
 </IRQ>
 <TASK>
 on_each_cpu_cond_mask+0x40/0x90 kernel/smp.c:1023
 on_each_cpu include/linux/smp.h:71 [inline]
 text_poke_sync arch/x86/kernel/alternative.c:2006 [inline]
 text_poke_bp_batch+0x2ce/0x960 arch/x86/kernel/alternative.c:2216
 text_poke_flush arch/x86/kernel/alternative.c:2407 [inline]
 text_poke_flush arch/x86/kernel/alternative.c:2404 [inline]
 text_poke_finish+0x30/0x40 arch/x86/kernel/alternative.c:2414
 arch_jump_label_transform_apply+0x1c/0x30 arch/x86/kernel/jump_label.c:146
 jump_label_update+0x32e/0x410 kernel/jump_label.c:829
 static_key_enable_cpuslocked+0x1b5/0x270 kernel/jump_label.c:205
 static_key_enable+0x1a/0x20 kernel/jump_label.c:218
 toggle_allocation_gate mm/kfence/core.c:830 [inline]
 toggle_allocation_gate+0xf4/0x250 mm/kfence/core.c:822
 process_one_work+0x8a2/0x15e0 kernel/workqueue.c:2630
 process_scheduled_works kernel/workqueue.c:2703 [inline]
 worker_thread+0x8b6/0x1280 kernel/workqueue.c:2784
 kthread+0x337/0x440 kernel/kthread.c:388
 ret_from_fork+0x45/0x80 arch/x86/kernel/process.c:147
 ret_from_fork_asm+0x11/0x20 arch/x86/entry/entry_64.S:242
 </TASK>

Crashes (1):
Time: 2023/10/20 05:11
Kernel: linux-next
Commit: 4230ea146b1e
Syzkaller: 42e1d524
Syz repro: -
C repro: -
Manager: ci-upstream-linux-next-kasan-gce-root
Title: INFO: rcu detected stall in update_balloon_stats_func