syzbot


INFO: rcu detected stall in netlink_release (5)

Status: upstream: reported syz repro on 2025/01/17 06:48
Subsystems: net mm
Reported-by: syzbot+9a69946171ff3136f79f@syzkaller.appspotmail.com
First crash: 33d, last: 25d
Cause bisection: failed (error log, bisect log)
  
Discussions (1)
Title | Replies (including bot) | Last reply
[syzbot] [mm?] INFO: rcu detected stall in netlink_release (5) | 0 (1) | 2025/01/17 06:48
Similar bugs (5)
Kernel | Title | Repro | Cause bisect | Fix bisect | Count | Last | Reported | Patched | Status
upstream | INFO: rcu detected stall in netlink_release wireless |  |  |  | 1 | 1310d | 1310d | 0/28 | auto-closed as invalid on 2021/10/14 21:29
upstream | INFO: rcu detected stall in netlink_release (3) wireless |  |  |  | 1 | 940d | 940d | 0/28 | auto-closed as invalid on 2022/09/20 00:08
upstream | INFO: rcu detected stall in netlink_release (4) netfilter |  |  |  | 1 | 213d | 213d | 0/28 | auto-obsoleted due to no activity on 2024/10/15 17:18
upstream | INFO: rcu detected stall in netlink_release (2) wireless |  |  |  | 1 | 1129d | 1129d | 0/28 | auto-closed as invalid on 2022/04/13 11:22
upstream | BUG: soft lockup in netlink_release kvm |  |  |  | 1 | 1027d | 1027d | 0/28 | auto-closed as invalid on 2022/06/25 01:57
Last patch testing requests (1)
Created | Duration | User | Patch | Repo | Result
2025/02/05 02:40 | 28m | retest repro |  | net | OK log

Sample crash report:
rcu: INFO: rcu_preempt detected stalls on CPUs/tasks:
rcu: 	1-...!: (1 GPs behind) idle=788c/1/0x4000000000000000 softirq=140480/140481 fqs=0
rcu: 	(detected by 0, t=10505 jiffies, g=146717, q=330 ncpus=2)
Sending NMI from CPU 0 to CPUs 1:
NMI backtrace for cpu 1
CPU: 1 UID: 0 PID: 1136 Comm: syz.2.10990 Not tainted 6.13.0-syzkaller-00603-g3d3a9c8b89d4 #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 12/27/2024
RIP: 0010:check_wait_context kernel/locking/lockdep.c:4864 [inline]
RIP: 0010:__lock_acquire+0x4e0/0x3c40 kernel/locking/lockdep.c:5176
Code: 45 0f b6 77 21 45 84 f6 0f 88 10 0c 00 00 48 8b 74 24 30 84 c0 48 ba 00 00 00 00 00 fc ff df 0f 44 c5 48 c1 ee 03 0f b6 14 16 <84> d2 74 09 80 fa 03 0f 8e 04 10 00 00 45 8b 85 d8 0a 00 00 44 89
RSP: 0018:ffffc90000a18b38 EFLAGS: 00000802
RAX: 0000000000000003 RBX: 0000000000080000 RCX: 0000000000000001
RDX: 0000000000000000 RSI: 1ffff1100f2a215b RDI: ffff888079510b29
RBP: 0000000000000003 R08: 0000000000000000 R09: fffffbfff2dce9b8
R10: ffffffff96e74dc7 R11: 0000000000000002 R12: 0000000000000002
R13: ffff888079510000 R14: 0000000000000048 R15: ffff888079510b08
FS:  000055558d467500(0000) GS:ffff8880b8700000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000001b2d31eff8 CR3: 000000007f436000 CR4: 00000000003526f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Call Trace:
 <NMI>
 </NMI>
 <IRQ>
 lock_acquire.part.0+0x11b/0x380 kernel/locking/lockdep.c:5849
 __raw_spin_lock include/linux/spinlock_api_smp.h:133 [inline]
 _raw_spin_lock+0x2e/0x40 kernel/locking/spinlock.c:154
 spin_lock include/linux/spinlock.h:351 [inline]
 advance_sched+0xd8/0xc60 net/sched/sch_taprio.c:924
 __run_hrtimer kernel/time/hrtimer.c:1739 [inline]
 __hrtimer_run_queues+0x20a/0xae0 kernel/time/hrtimer.c:1803
 hrtimer_interrupt+0x392/0x8e0 kernel/time/hrtimer.c:1865
 local_apic_timer_interrupt arch/x86/kernel/apic/apic.c:1038 [inline]
 __sysvec_apic_timer_interrupt+0x10f/0x400 arch/x86/kernel/apic/apic.c:1055
 instr_sysvec_apic_timer_interrupt arch/x86/kernel/apic/apic.c:1049 [inline]
 sysvec_apic_timer_interrupt+0x9f/0xc0 arch/x86/kernel/apic/apic.c:1049
 </IRQ>
 <TASK>
 asm_sysvec_apic_timer_interrupt+0x1a/0x20 arch/x86/include/asm/idtentry.h:702
RIP: 0010:netlink_release+0x0/0x2130 net/netlink/af_netlink.c:721
Code: ff e8 f4 0d b1 f8 e9 83 fe ff ff 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 <f3> 0f 1e fa 41 57 49 89 ff 41 56 41 55 41 54 55 53 48 bb 00 00 00
RSP: 0018:ffffc900111ffda8 EFLAGS: 00000246
RAX: ffffffff894bd4e0 RBX: ffff88807811ec00 RCX: 0000000000000000
RDX: 1ffffffff19465ea RSI: 0000000000000008 RDI: ffff88807811ec00
RBP: ffff88807811ed98 R08: 0000000000000001 R09: ffffed100f023db4
R10: ffff88807811eda7 R11: 0000000000000000 R12: ffffffff8ca32f40
R13: ffff88807811ec20 R14: 0000000000000000 R15: ffff88807811ecc0
 __sock_release+0xb0/0x270 net/socket.c:640
 sock_close+0x1c/0x30 net/socket.c:1408
 __fput+0x3f8/0xb60 fs/file_table.c:450
 task_work_run+0x14e/0x250 kernel/task_work.c:239
 resume_user_mode_work include/linux/resume_user_mode.h:50 [inline]
 exit_to_user_mode_loop kernel/entry/common.c:114 [inline]
 exit_to_user_mode_prepare include/linux/entry-common.h:329 [inline]
 __syscall_exit_to_user_mode_work kernel/entry/common.c:207 [inline]
 syscall_exit_to_user_mode+0x27b/0x2a0 kernel/entry/common.c:218
 do_syscall_64+0xda/0x250 arch/x86/entry/common.c:89
 entry_SYSCALL_64_after_hwframe+0x77/0x7f
RIP: 0033:0x7fca5bf85d29
Code: ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 a8 ff ff ff f7 d8 64 89 01 48
RSP: 002b:00007fffc03395f8 EFLAGS: 00000246 ORIG_RAX: 00000000000001b4
RAX: 0000000000000000 RBX: 00000000000c878f RCX: 00007fca5bf85d29
RDX: 0000000000000000 RSI: 000000000000001e RDI: 0000000000000003
RBP: 00007fca5c177ba0 R08: 0000000000000001 R09: 00007fffc03398ef
R10: 00007fca5be00000 R11: 0000000000000246 R12: 00000000000c87d2
R13: 00007fca5c175fa0 R14: 0000000000000032 R15: ffffffffffffffff
 </TASK>
rcu: rcu_preempt kthread starved for 10505 jiffies! g146717 f0x0 RCU_GP_WAIT_FQS(5) ->state=0x0 ->cpu=0
rcu: 	Unless rcu_preempt kthread gets sufficient CPU time, OOM is now expected behavior.
rcu: RCU grace-period kthread stack dump:
task:rcu_preempt     state:R  running task     stack:28472 pid:17    tgid:17    ppid:2      flags:0x00004000
Call Trace:
 <TASK>
 context_switch kernel/sched/core.c:5369 [inline]
 __schedule+0xe58/0x5ad0 kernel/sched/core.c:6756
 __schedule_loop kernel/sched/core.c:6833 [inline]
 schedule+0xe7/0x350 kernel/sched/core.c:6848
 schedule_timeout+0x124/0x280 kernel/time/sleep_timeout.c:99
 rcu_gp_fqs_loop+0x1eb/0xb00 kernel/rcu/tree.c:2045
 rcu_gp_kthread+0x271/0x380 kernel/rcu/tree.c:2247
 kthread+0x2c1/0x3a0 kernel/kthread.c:389
 ret_from_fork+0x45/0x80 arch/x86/kernel/process.c:147
 ret_from_fork_asm+0x1a/0x30 arch/x86/entry/entry_64.S:244
 </TASK>
rcu: Stack dump where RCU GP kthread last ran:
CPU: 0 UID: 0 PID: 2983 Comm: kworker/u8:7 Not tainted 6.13.0-syzkaller-00603-g3d3a9c8b89d4 #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 12/27/2024
Workqueue: events_unbound toggle_allocation_gate
RIP: 0010:csd_lock_wait kernel/smp.c:340 [inline]
RIP: 0010:smp_call_function_many_cond+0x46d/0x1300 kernel/smp.c:884
Code: f5 49 c1 ec 03 83 e5 07 49 01 c4 83 c5 03 e8 0a 11 0c 00 f3 90 41 0f b6 04 24 40 38 c5 7c 08 84 c0 0f 85 a7 0c 00 00 8b 43 08 <31> ff 83 e0 01 41 89 c5 89 c6 e8 e4 0b 0c 00 45 85 ed 75 d0 e8 da
RSP: 0018:ffffc9000c2c7998 EFLAGS: 00000246
RAX: 0000000000000011 RBX: ffff8880b8744a80 RCX: ffffffff818e25cc
RDX: ffff888030a30000 RSI: ffffffff818e25a6 RDI: 0000000000000005
RBP: 0000000000000003 R08: 0000000000000005 R09: 0000000000000000
R10: 0000000000000001 R11: 0000000000000006 R12: ffffed10170e8951
R13: 0000000000000001 R14: ffff8880b8744a88 R15: ffff8880b863fe80
FS:  0000000000000000(0000) GS:ffff8880b8600000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00007fb26d777bac CR3: 000000000df7e000 CR4: 00000000003526f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Call Trace:
 <IRQ>
 </IRQ>
 <TASK>
 on_each_cpu_cond_mask+0x40/0x90 kernel/smp.c:1051
 on_each_cpu include/linux/smp.h:71 [inline]
 text_poke_sync arch/x86/kernel/alternative.c:2114 [inline]
 text_poke_bp_batch+0x22b/0x760 arch/x86/kernel/alternative.c:2324
 text_poke_flush arch/x86/kernel/alternative.c:2515 [inline]
 text_poke_flush arch/x86/kernel/alternative.c:2512 [inline]
 text_poke_finish+0x30/0x40 arch/x86/kernel/alternative.c:2522
 arch_jump_label_transform_apply+0x1c/0x30 arch/x86/kernel/jump_label.c:146
 jump_label_update+0x1d7/0x400 kernel/jump_label.c:920
 static_key_enable_cpuslocked+0x1b7/0x270 kernel/jump_label.c:210
 static_key_enable+0x1a/0x20 kernel/jump_label.c:223
 toggle_allocation_gate mm/kfence/core.c:849 [inline]
 toggle_allocation_gate+0xfc/0x260 mm/kfence/core.c:841
 process_one_work+0x9c5/0x1ba0 kernel/workqueue.c:3236
 process_scheduled_works kernel/workqueue.c:3317 [inline]
 worker_thread+0x6c8/0xf00 kernel/workqueue.c:3398
 kthread+0x2c1/0x3a0 kernel/kthread.c:389
 ret_from_fork+0x45/0x80 arch/x86/kernel/process.c:147
 ret_from_fork_asm+0x1a/0x30 arch/x86/entry/entry_64.S:244
 </TASK>

Crashes (2):
Time | Kernel | Commit | Syzkaller | Config | Log | Report | Syz repro | C repro | VM info | Assets | Manager | Title
2025/01/21 17:47 | upstream | 3d3a9c8b89d4 | 6e87cfa2 | .config | console log | report |  |  | info | [disk image] [vmlinux] [kernel image] | ci-upstream-kasan-gce-selinux-root | INFO: rcu detected stall in netlink_release
2025/01/13 06:39 | net | 47e55e4b410f | 6dbc6a9b | .config | console log | report | syz / log |  |  | [disk image] [vmlinux] [kernel image] | ci-upstream-net-this-kasan-gce | INFO: rcu detected stall in netlink_release
* Struck through repros no longer work on HEAD.