syzbot


INFO: rcu detected stall in sys_clone3 (3)

Status: upstream: reported on 2024/09/24 16:11
Reported-by: syzbot+f9057903a3564e358b29@syzkaller.appspotmail.com
First crash: 139d, last: 80d
Similar bugs (5)
Kernel Title Repro Cause bisect Fix bisect Count Last Reported Patched Status
upstream INFO: rcu detected stall in sys_clone3 kernfs 1 995d 995d 0/28 auto-closed as invalid on 2022/08/20 13:01
linux-5.15 INFO: rcu detected stall in sys_clone3 1 129d 129d 0/3 auto-obsoleted due to no activity on 2025/01/12 11:21
upstream INFO: rcu detected stall in sys_clone3 (2) cgroups mm 3 259d 363d 0/28 auto-obsoleted due to no activity on 2024/08/25 02:20
linux-6.1 INFO: rcu detected stall in sys_clone3 1 401d 398d 0/3 auto-obsoleted due to no activity on 2024/04/15 06:45
linux-6.1 INFO: rcu detected stall in sys_clone3 (2) 1 261d 261d 0/3 auto-obsoleted due to no activity on 2024/09/01 21:33

Sample crash report:
rcu: INFO: rcu_preempt detected stalls on CPUs/tasks:
rcu: 	1-...!: (1 GPs behind) idle=d474/1/0x4000000000000000 softirq=34815/34816 fqs=4
	(detected by 0, t=10502 jiffies, g=46317, q=452 ncpus=2)
Sending NMI from CPU 0 to CPUs 1:
NMI backtrace for cpu 1
CPU: 1 PID: 9472 Comm: syz.0.1760 Not tainted 6.1.118-syzkaller #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 10/30/2024
RIP: 0010:lock_release+0x5ad/0xa20
Code: 4c 89 fa 49 bf 00 00 00 00 00 fc ff df 4d 85 e4 0f 85 e4 fc ff ff 48 8b 7c 24 10 48 8b 74 24 18 48 8b 54 24 48 e8 33 87 00 00 <4c> 8b 64 24 08 4c 8b 6c 24 20 4c 8d b4 24 90 00 00 00 48 c7 c7 40
RSP: 0018:ffffc900001e0b80 EFLAGS: 00000046
RAX: 1ffff11004d77c82 RBX: ffff888026bbe410 RCX: ffffc900001e0c03
RDX: 0000000000000005 RSI: ffff888026bbe418 RDI: ffff888026bbe4e8
RBP: ffffc900001e0cb0 R08: dffffc0000000000 R09: fffffbfff1d3415e
R10: 0000000000000000 R11: dffffc0000000001 R12: ffff888026bbe4e8
R13: 0000000000000005 R14: 0ded7def27459471 R15: dffffc0000000000
FS:  00007f9c8333d6c0(0000) GS:ffff8880b8f00000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00007f488f41a55a CR3: 0000000057d8d000 CR4: 00000000003506e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Call Trace:
 <NMI>
 </NMI>
 <IRQ>
 __raw_spin_unlock_irqrestore include/linux/spinlock_api_smp.h:149 [inline]
 _raw_spin_unlock_irqrestore+0x75/0x130 kernel/locking/spinlock.c:194
 __run_hrtimer kernel/time/hrtimer.c:1685 [inline]
 __hrtimer_run_queues+0x4b0/0xe50 kernel/time/hrtimer.c:1753
 hrtimer_interrupt+0x392/0x980 kernel/time/hrtimer.c:1815
 local_apic_timer_interrupt arch/x86/kernel/apic/apic.c:1107 [inline]
 __sysvec_apic_timer_interrupt+0x158/0x5b0 arch/x86/kernel/apic/apic.c:1124
 instr_sysvec_apic_timer_interrupt arch/x86/kernel/apic/apic.c:1118 [inline]
 sysvec_apic_timer_interrupt+0x9b/0xc0 arch/x86/kernel/apic/apic.c:1118
 </IRQ>
 <TASK>
 asm_sysvec_apic_timer_interrupt+0x16/0x20 arch/x86/include/asm/idtentry.h:691
RIP: 0010:__page_table_check_pte_set+0x2/0x110 mm/page_table_check.c:202
Code: ff 5b 41 5c 41 5e 41 5f e9 eb f7 ff ff e8 86 ad 9d ff 5b 41 5c 41 5e 41 5f c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 41 57 <41> 56 41 55 41 54 53 49 89 ce 49 89 d7 48 89 fb e8 59 ad 9d ff 48
RSP: 0018:ffffc9000f2a7410 EFLAGS: 00000283
RAX: ffffffff81cc0eb8 RBX: ffff888020f11a30 RCX: 800000005e33b007
RDX: ffff888049c48878 RSI: 00007f9c8170f000 RDI: ffff888030ff0000
RBP: ffffc9000f2a77d0 R08: ffffffff81cbfcc2 R09: fffff940002f19d9
R10: 0000000000000000 R11: dffffc0000000001 R12: ffff88801fbd9240
R13: 0000000000000000 R14: ffff888030ff0000 R15: 800000005e33b007
 page_table_check_pte_set include/linux/page_table_check.h:83 [inline]
 set_pte_at arch/x86/include/asm/pgtable.h:1009 [inline]
 copy_present_pte mm/memory.c:1000 [inline]
 copy_pte_range mm/memory.c:1091 [inline]
 copy_pmd_range mm/memory.c:1177 [inline]
 copy_pud_range mm/memory.c:1214 [inline]
 copy_p4d_range mm/memory.c:1238 [inline]
 copy_page_range+0x2d50/0x4660 mm/memory.c:1336
 dup_mmap kernel/fork.c:697 [inline]
 dup_mm kernel/fork.c:1541 [inline]
 copy_mm+0xf42/0x1990 kernel/fork.c:1590
 copy_process+0x19d5/0x4060 kernel/fork.c:2266
 kernel_clone+0x222/0x920 kernel/fork.c:2681
 __do_sys_clone3 kernel/fork.c:2980 [inline]
 __se_sys_clone3+0x373/0x410 kernel/fork.c:2964
 do_syscall_x64 arch/x86/entry/common.c:51 [inline]
 do_syscall_64+0x3b/0xb0 arch/x86/entry/common.c:81
 entry_SYSCALL_64_after_hwframe+0x68/0xd2
RIP: 0033:0x7f9c8257e819
Code: ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 a8 ff ff ff f7 d8 64 89 01 48
RSP: 002b:00007f9c8333cf08 EFLAGS: 00000246 ORIG_RAX: 00000000000001b3
RAX: ffffffffffffffda RBX: 0000000000000058 RCX: 00007f9c8257e819
RDX: 00007f9c8333cf20 RSI: 0000000000000058 RDI: 00007f9c8333cf20
RBP: 00007f9c825f175e R08: 0000000000000000 R09: 0000000000000058
R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000000000
R13: 0000000000000000 R14: 00007f9c82735fa0 R15: 00007ffe6e2698d8
 </TASK>
rcu: rcu_preempt kthread starved for 10494 jiffies! g46317 f0x0 RCU_GP_WAIT_FQS(5) ->state=0x0 ->cpu=0
rcu: 	Unless rcu_preempt kthread gets sufficient CPU time, OOM is now expected behavior.
rcu: RCU grace-period kthread stack dump:
task:rcu_preempt     state:R  running task     stack:26040 pid:16    ppid:2      flags:0x00004000
Call Trace:
 <TASK>
 context_switch kernel/sched/core.c:5241 [inline]
 __schedule+0x143f/0x4570 kernel/sched/core.c:6558
 schedule+0xbf/0x180 kernel/sched/core.c:6634
 schedule_timeout+0x1b9/0x300 kernel/time/timer.c:1965
 rcu_gp_fqs_loop+0x2d2/0x1150 kernel/rcu/tree.c:1706
 rcu_gp_kthread+0xa3/0x3b0 kernel/rcu/tree.c:1905
 kthread+0x28d/0x320 kernel/kthread.c:376
 ret_from_fork+0x1f/0x30 arch/x86/entry/entry_64.S:295
 </TASK>
rcu: Stack dump where RCU GP kthread last ran:
CPU: 0 PID: 9484 Comm: syz.1.1766 Not tainted 6.1.118-syzkaller #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 10/30/2024
RIP: 0010:csd_lock_wait kernel/smp.c:424 [inline]
RIP: 0010:smp_call_function_many_cond+0x1fae/0x3460 kernel/smp.c:998
Code: 2f 44 89 ee 83 e6 01 31 ff e8 ae 43 0b 00 41 83 e5 01 49 bd 00 00 00 00 00 fc ff df 75 0a e8 39 40 0b 00 e9 1b ff ff ff f3 90 <42> 0f b6 04 2b 84 c0 75 14 41 f7 07 01 00 00 00 0f 84 fe fe ff ff
RSP: 0018:ffffc90015157240 EFLAGS: 00000246
RAX: ffffffff817f4bd9 RBX: 1ffff110171e81b1 RCX: 0000000000080000
RDX: ffffc900041e1000 RSI: 000000000007ffff RDI: 0000000000080000
RBP: ffffc90015157620 R08: ffffffff817f4ba2 R09: fffffbfff224624d
R10: 0000000000000000 R11: dffffc0000000001 R12: 0000000800000000
R13: dffffc0000000000 R14: 0000000000000001 R15: ffff8880b8f40d88
FS:  00007f1dfa1d36c0(0000) GS:ffff8880b8e00000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 000000110c37fa40 CR3: 000000005c2c9000 CR4: 00000000003506f0
Call Trace:
 <IRQ>
 </IRQ>
 <TASK>
 on_each_cpu_cond_mask+0x3b/0x80 kernel/smp.c:1166
 on_each_cpu include/linux/smp.h:71 [inline]
 text_poke_sync arch/x86/kernel/alternative.c:1334 [inline]
 text_poke_bp_batch+0x2bb/0x940 arch/x86/kernel/alternative.c:1534
 text_poke_bp+0xc8/0x140 arch/x86/kernel/alternative.c:1771
 __static_call_transform+0x333/0x560 arch/x86/kernel/static_call.c:109
 arch_static_call_transform+0xcc/0x270 arch/x86/kernel/static_call.c:161
 __static_call_update+0xd4/0x5c0 kernel/static_call_inline.c:136
 tracepoint_update_call kernel/tracepoint.c:317 [inline]
 tracepoint_add_func+0x90c/0x9d0 kernel/tracepoint.c:358
 tracepoint_probe_register_prio_may_exist+0x11e/0x190 kernel/tracepoint.c:482
 bpf_raw_tp_link_attach+0x456/0x6b0 kernel/bpf/syscall.c:3387
 bpf_raw_tracepoint_open+0x196/0x210 kernel/bpf/syscall.c:3414
 __sys_bpf+0x4a7/0x6c0 kernel/bpf/syscall.c:5062
 __do_sys_bpf kernel/bpf/syscall.c:5124 [inline]
 __se_sys_bpf kernel/bpf/syscall.c:5122 [inline]
 __x64_sys_bpf+0x78/0x90 kernel/bpf/syscall.c:5122
 do_syscall_x64 arch/x86/entry/common.c:51 [inline]
 do_syscall_64+0x3b/0xb0 arch/x86/entry/common.c:81
 entry_SYSCALL_64_after_hwframe+0x68/0xd2
RIP: 0033:0x7f1df937e819
Code: ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 a8 ff ff ff f7 d8 64 89 01 48
RSP: 002b:00007f1dfa1d3038 EFLAGS: 00000246 ORIG_RAX: 0000000000000141
RAX: ffffffffffffffda RBX: 00007f1df9535fa0 RCX: 00007f1df937e819
RDX: 0000000000000010 RSI: 0000000020000f40 RDI: 0000000000000011
RBP: 00007f1df93f175e R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000000000
R13: 0000000000000000 R14: 00007f1df9535fa0 R15: 00007ffe38322a98
 </TASK>

Crashes (3):
Time Kernel Commit Syzkaller Config Log Report Syz repro C repro VM info Assets (help?) Manager Title
2024/11/22 13:34 linux-6.1.y b67dc5c9ade9 4b25d554 .config console log report info [disk image] [vmlinux] [kernel image] ci2-linux-6-1-kasan INFO: rcu detected stall in sys_clone3
2024/09/27 14:11 linux-6.1.y e526b12bf916 9314348a .config console log report info [disk image] [vmlinux] [kernel image] ci2-linux-6-1-kasan-perf INFO: rcu detected stall in sys_clone3
2024/09/24 16:10 linux-6.1.y e526b12bf916 5643e0e9 .config console log report info [disk image] [vmlinux] [kernel image] ci2-linux-6-1-kasan-arm64 INFO: rcu detected stall in sys_clone3
* Struck through repros no longer work on HEAD.