syzbot


INFO: rcu detected stall in validate_mm

Status: upstream: reported on 2024/08/03 11:55
Reported-by: syzbot+41a838498d4af3843c2d@syzkaller.appspotmail.com
First crash: 56d, last: 19d
Similar bugs (2)
Kernel Title Repro Cause bisect Fix bisect Count Last Reported Patched Status
upstream INFO: rcu detected stall in validate_mm (3) mm C error 27 2d00h 139d 0/28 upstream: reported C repro on 2024/05/12 09:19
upstream INFO: rcu detected stall in validate_mm (2) mm 2 326d 337d 0/28 auto-obsoleted due to no activity on 2024/02/04 15:00

Sample crash report:
rcu: INFO: rcu_preempt detected stalls on CPUs/tasks:
rcu: 	1-...!: (0 ticks this GP) idle=17b4/1/0x4000000000000000 softirq=22997/22997 fqs=0
	(detected by 0, t=10503 jiffies, g=29917, q=67 ncpus=2)
Sending NMI from CPU 0 to CPUs 1:
NMI backtrace for cpu 1
CPU: 1 PID: 6396 Comm: modprobe Not tainted 6.1.109-syzkaller #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 08/06/2024
RIP: 0010:_raw_spin_unlock_irqrestore+0x29/0x130 kernel/locking/spinlock.c:193
Code: 00 55 48 89 e5 41 57 41 56 41 55 41 54 53 48 83 e4 e0 48 83 ec 60 49 89 f7 48 89 fb 65 48 8b 04 25 28 00 00 00 48 89 44 24 40 <49> bc 00 00 00 00 00 fc ff df 4c 8d 74 24 20 48 c7 04 24 b3 8a b5
RSP: 0018:ffffc900001e0cc0 EFLAGS: 00000086
RAX: ea63a40bf4f65100 RBX: ffff8880b8f2a4c0 RCX: 0000000000000001
RDX: 0000000000010000 RSI: 0000000000000046 RDI: ffff8880b8f2a4c0
RBP: ffffc900001e0d50 R08: ffffffff8179dd30 R09: fffffbfff1d33b6e
R10: 0000000000000000 R11: dffffc0000000001 R12: ffffffff88cf4d90
R13: ffff8880b8f2a504 R14: ffff8880b8f2a4c0 R15: 0000000000000046
FS:  0000000000000000(0000) GS:ffff8880b8f00000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 000000002000f000 CR3: 000000005bc6e000 CR4: 00000000003506e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Call Trace:
 <NMI>
 </NMI>
 <IRQ>
 __run_hrtimer kernel/time/hrtimer.c:1685 [inline]
 __hrtimer_run_queues+0x4b0/0xe50 kernel/time/hrtimer.c:1753
 hrtimer_interrupt+0x392/0x980 kernel/time/hrtimer.c:1815
 local_apic_timer_interrupt arch/x86/kernel/apic/apic.c:1095 [inline]
 __sysvec_apic_timer_interrupt+0x156/0x580 arch/x86/kernel/apic/apic.c:1112
 sysvec_apic_timer_interrupt+0x8c/0xb0 arch/x86/kernel/apic/apic.c:1106
 </IRQ>
 <TASK>
 asm_sysvec_apic_timer_interrupt+0x16/0x20 arch/x86/include/asm/idtentry.h:653
RIP: 0010:validate_mm_mt+0x1b2/0x670 mm/mmap.c:297
Code: 24 10 48 8b 4c 24 08 48 c1 e9 03 48 89 4c 24 10 48 89 44 24 28 48 c1 e8 03 48 89 44 24 30 4d 89 ec 49 c1 ec 03 43 80 3c 3c 00 <74> 08 4c 89 ef e8 44 52 11 00 4d 8b 75 00 48 8b 44 24 10 42 80 3c
RSP: 0018:ffffc9000a937a40 EFLAGS: 00000246
RAX: ffffffff81d0dd3d RBX: ffffc9000a937ac0 RCX: ffff88802d009dc0
RDX: 0000000000000000 RSI: 0000000000000001 RDI: 0000000000000000
RBP: ffffc9000a937b70 R08: ffffffff8a9cd3e3 R09: ffffffff8a9ccf12
R10: 0000000000000003 R11: ffff88802d009dc0 R12: 1ffff11005dc32d8
R13: ffff88802ee196c0 R14: 00007fd8d2256fff R15: dffffc0000000000
 validate_mm+0x16e/0x380 mm/mmap.c:332
 do_mmap+0xc7/0xf60 mm/mmap.c:1262
 vm_mmap_pgoff+0x1ca/0x2d0 mm/util.c:520
 ksys_mmap_pgoff+0x4f5/0x6d0 mm/mmap.c:1471
 do_syscall_x64 arch/x86/entry/common.c:51 [inline]
 do_syscall_64+0x3b/0xb0 arch/x86/entry/common.c:81
 entry_SYSCALL_64_after_hwframe+0x68/0xd2
RIP: 0033:0x7fd8d24f8b74
Code: 63 08 44 89 e8 5b 41 5c 41 5d c3 41 89 ca 41 f7 c1 ff 0f 00 00 74 0c c7 05 f5 46 01 00 16 00 00 00 eb 17 b8 09 00 00 00 0f 05 <48> 3d 00 f0 ff ff 76 0c f7 d8 89 05 dc 46 01 00 48 83 c8 ff c3 0f
RSP: 002b:00007ffd93d65038 EFLAGS: 00000246 ORIG_RAX: 0000000000000009
RAX: ffffffffffffffda RBX: 00007ffd93d650b0 RCX: 00007fd8d24f8b74
RDX: 0000000000000001 RSI: 0000000000007000 RDI: 00007fd8d224b000
RBP: 00007ffd93d65410 R08: 0000000000000000 R09: 000000000001b000
R10: 0000000000000812 R11: 0000000000000246 R12: 00007fd8d24dbfc0
R13: 00007ffd93d65498 R14: 000000000001a43e R15: 0000000000000000
 </TASK>
rcu: rcu_preempt kthread timer wakeup didn't happen for 10502 jiffies! g29917 f0x0 RCU_GP_WAIT_FQS(5) ->state=0x402
rcu: 	Possible timer handling issue on cpu=0 timer-softirq=13031
rcu: rcu_preempt kthread starved for 10503 jiffies! g29917 f0x0 RCU_GP_WAIT_FQS(5) ->state=0x402 ->cpu=0
rcu: 	Unless rcu_preempt kthread gets sufficient CPU time, OOM is now expected behavior.
rcu: RCU grace-period kthread stack dump:
task:rcu_preempt     state:I stack:26144 pid:16    ppid:2      flags:0x00004000
Call Trace:
 <TASK>
 context_switch kernel/sched/core.c:5241 [inline]
 __schedule+0x143f/0x4570 kernel/sched/core.c:6558
 schedule+0xbf/0x180 kernel/sched/core.c:6634
 schedule_timeout+0x1b9/0x300 kernel/time/timer.c:1965
 rcu_gp_fqs_loop+0x2d2/0x1150 kernel/rcu/tree.c:1706
 rcu_gp_kthread+0xa3/0x3b0 kernel/rcu/tree.c:1905
 kthread+0x28d/0x320 kernel/kthread.c:376
 ret_from_fork+0x1f/0x30 arch/x86/entry/entry_64.S:295
 </TASK>
rcu: Stack dump where RCU GP kthread last ran:
CPU: 0 PID: 6394 Comm: syz.2.685 Not tainted 6.1.109-syzkaller #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 08/06/2024
RIP: 0010:csd_lock_wait kernel/smp.c:424 [inline]
RIP: 0010:smp_call_function_many_cond+0x1fae/0x3460 kernel/smp.c:998
Code: 2f 44 89 ee 83 e6 01 31 ff e8 be 43 0b 00 41 83 e5 01 49 bd 00 00 00 00 00 fc ff df 75 0a e8 49 40 0b 00 e9 1b ff ff ff f3 90 <42> 0f b6 04 2b 84 c0 75 14 41 f7 07 01 00 00 00 0f 84 fe fe ff ff
RSP: 0018:ffffc9000a667440 EFLAGS: 00000293
RAX: ffffffff817f6e49 RBX: 1ffff110171e81b1 RCX: ffff888029c9bb80
RDX: 0000000000000000 RSI: 0000000000000001 RDI: 0000000000000000
RBP: ffffc9000a667820 R08: ffffffff817f6e12 R09: fffffbfff223b645
R10: 0000000000000000 R11: dffffc0000000001 R12: 0000000800000000
R13: dffffc0000000000 R14: 0000000000000001 R15: ffff8880b8f40d88
FS:  0000555569685500(0000) GS:ffff8880b8e00000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000001b3220fff8 CR3: 0000000060ad1000 CR4: 00000000003506f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Call Trace:
 <IRQ>
 </IRQ>
 <TASK>
 on_each_cpu_cond_mask+0x3b/0x80 kernel/smp.c:1166
 on_each_cpu include/linux/smp.h:71 [inline]
 text_poke_sync arch/x86/kernel/alternative.c:1334 [inline]
 text_poke_bp_batch+0x2bb/0x940 arch/x86/kernel/alternative.c:1534
 text_poke_bp+0xc8/0x140 arch/x86/kernel/alternative.c:1771
 __static_call_transform+0x333/0x560 arch/x86/kernel/static_call.c:109
 arch_static_call_transform+0xcc/0x270 arch/x86/kernel/static_call.c:161
 __static_call_update+0xd4/0x5c0 kernel/static_call_inline.c:136
 tracepoint_update_call kernel/tracepoint.c:317 [inline]
 tracepoint_remove_func kernel/tracepoint.c:441 [inline]
 tracepoint_probe_unregister+0x8df/0x980 kernel/tracepoint.c:551
 bpf_raw_tp_link_release+0x5f/0x80 kernel/bpf/syscall.c:3190
 bpf_link_free kernel/bpf/syscall.c:2764 [inline]
 bpf_link_put+0x234/0x2c0 kernel/bpf/syscall.c:2790
 bpf_link_release+0x37/0x40 kernel/bpf/syscall.c:2799
 __fput+0x3f6/0x8d0 fs/file_table.c:320
 task_work_run+0x246/0x300 kernel/task_work.c:203
 resume_user_mode_work include/linux/resume_user_mode.h:49 [inline]
 exit_to_user_mode_loop+0xde/0x100 kernel/entry/common.c:177
 exit_to_user_mode_prepare+0xb1/0x140 kernel/entry/common.c:210
 __syscall_exit_to_user_mode_work kernel/entry/common.c:292 [inline]
 syscall_exit_to_user_mode+0x60/0x270 kernel/entry/common.c:303
 do_syscall_64+0x47/0xb0 arch/x86/entry/common.c:87
 entry_SYSCALL_64_after_hwframe+0x68/0xd2
RIP: 0033:0x7f274d17cef9
Code: ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 a8 ff ff ff f7 d8 64 89 01 48
RSP: 002b:00007ffe3a7aeee8 EFLAGS: 00000246 ORIG_RAX: 00000000000001b4
RAX: 0000000000000000 RBX: 000000000003e07d RCX: 00007f274d17cef9
RDX: 0000000000000000 RSI: 000000000000001e RDI: 0000000000000003
RBP: 00007f274d337a80 R08: 0000000000000001 R09: 00007ffe3a7af1df
R10: 00007f274ce00000 R11: 0000000000000246 R12: 000000000003e0f1
R13: 00007ffe3a7aeff0 R14: 0000000000000032 R15: ffffffffffffffff
 </TASK>

Crashes (5):
Time Kernel Commit Syzkaller Config Log Report Syz repro C repro VM info Assets (help?) Manager Title
2024/09/08 21:34 linux-6.1.y 5ca5b389fddf 9750182a .config console log report info [disk image] [vmlinux] [kernel image] ci2-linux-6-1-kasan INFO: rcu detected stall in validate_mm
2024/08/31 10:41 linux-6.1.y 311d8503ef9f 1eda0d14 .config console log report info [disk image] [vmlinux] [kernel image] ci2-linux-6-1-kasan INFO: rcu detected stall in validate_mm
2024/08/14 20:35 linux-6.1.y 117ac406ba90 e6b88e20 .config console log report info [disk image] [vmlinux] [kernel image] ci2-linux-6-1-kasan INFO: rcu detected stall in validate_mm
2024/08/03 22:31 linux-6.1.y 48d525b0e463 1786a2a8 .config console log report info [disk image] [vmlinux] [kernel image] ci2-linux-6-1-kasan INFO: rcu detected stall in validate_mm
2024/08/03 11:54 linux-6.1.y 48d525b0e463 1786a2a8 .config console log report info [disk image] [vmlinux] [kernel image] ci2-linux-6-1-kasan INFO: rcu detected stall in validate_mm
* Struck through repros no longer work on HEAD.