syzbot


INFO: rcu detected stall in batadv_tt_purge

Status: auto-obsoleted due to no activity on 2024/09/12 14:42
Reported-by: syzbot+c87c26f133881b2b1756@syzkaller.appspotmail.com
First crash: 296d, last: 224d
Similar bugs (5)
Kernel Title Repro Cause bisect Fix bisect Count Last Reported Patched Status
linux-5.15 INFO: rcu detected stall in batadv_tt_purge 1 136d 136d 0/3 auto-obsoleted due to no activity on 2024/12/09 22:28
upstream INFO: rcu detected stall in batadv_tt_purge (2) batman C done inconclusive 3 700d 1269d 0/28 auto-obsoleted due to no activity on 2023/06/14 13:09
upstream INFO: rcu detected stall in batadv_tt_purge cgroups mm 1 1834d 1833d 0/28 closed as invalid on 2020/01/09 08:13
upstream INFO: rcu detected stall in batadv_tt_purge (3) batman 1 422d 406d 0/28 auto-obsoleted due to no activity on 2024/02/17 16:44
upstream BUG: soft lockup in batadv_tt_purge batman 1 206d 204d 0/28 auto-obsoleted due to no activity on 2024/09/20 21:14

Sample crash report:
rcu: INFO: rcu_preempt detected stalls on CPUs/tasks:
rcu: 	0-...!: (1 GPs behind) idle=b06c/1/0x4000000000000000 softirq=9375/9376 fqs=27
	(detected by 1, t=10502 jiffies, g=8585, q=944 ncpus=2)
Sending NMI from CPU 1 to CPUs 0:
NMI backtrace for cpu 0
CPU: 0 PID: 3632 Comm: kworker/u4:5 Not tainted 6.1.92-syzkaller #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 04/02/2024
Workqueue: bat_events batadv_tt_purge
RIP: 0010:__lock_acquire+0xcd1/0x1f80 kernel/locking/lockdep.c:5019
Code: 89 d8 48 c1 e8 06 48 8d 3c c5 20 c2 49 90 be 08 00 00 00 e8 c1 99 77 00 48 bf 00 00 00 00 00 fc ff df 48 0f a3 1d 6f 4d df 0e <0f> 83 40 08 00 00 49 8d 9d d0 0a 00 00 48 89 d8 48 c1 e8 03 48 89
RSP: 0018:ffffc90000007a80 EFLAGS: 00000057
RAX: 0000000000000001 RBX: 000000000000005e RCX: ffffffff816a749f
RDX: 0000000000000000 RSI: 0000000000000008 RDI: dffffc0000000000
RBP: ffff88802278a8f0 R08: dffffc0000000000 R09: fffffbfff2093846
R10: 0000000000000000 R11: dffffc0000000001 R12: ffff88802278a898
R13: ffff888022789dc0 R14: ffff88802278a910 R15: 1ffff110044f1522
FS:  0000000000000000(0000) GS:ffff8880b9800000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000555b832f0968 CR3: 000000000ce8e000 CR4: 00000000003506f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Call Trace:
 <NMI>
 </NMI>
 <IRQ>
 lock_acquire+0x1f8/0x5a0 kernel/locking/lockdep.c:5662
 __raw_spin_lock_irq include/linux/spinlock_api_smp.h:119 [inline]
 _raw_spin_lock_irq+0xcf/0x110 kernel/locking/spinlock.c:170
 __run_hrtimer kernel/time/hrtimer.c:1690 [inline]
 __hrtimer_run_queues+0x6d3/0xe50 kernel/time/hrtimer.c:1750
 hrtimer_interrupt+0x392/0x980 kernel/time/hrtimer.c:1812
 local_apic_timer_interrupt arch/x86/kernel/apic/apic.c:1095 [inline]
 __sysvec_apic_timer_interrupt+0x156/0x580 arch/x86/kernel/apic/apic.c:1112
 sysvec_apic_timer_interrupt+0x8c/0xb0 arch/x86/kernel/apic/apic.c:1106
 </IRQ>
 <TASK>
 asm_sysvec_apic_timer_interrupt+0x16/0x20 arch/x86/include/asm/idtentry.h:653
RIP: 0010:should_resched arch/x86/include/asm/preempt.h:103 [inline]
RIP: 0010:__local_bh_enable_ip+0x16c/0x1f0 kernel/softirq.c:403
Code: 8a e8 68 06 37 09 65 66 8b 05 60 84 af 7e 66 85 c0 75 57 bf 01 00 00 00 e8 a1 38 0a 00 e8 6c 34 3d 00 fb 65 8b 05 0c 54 ae 7e <85> c0 75 05 e8 3b 79 ac ff 48 c7 44 24 20 0e 36 e0 45 49 c7 04 1c
RSP: 0018:ffffc90003e4fb20 EFLAGS: 00000282
RAX: 0000000080000000 RBX: 1ffff920007c9f68 RCX: ffffffff816acf0a
RDX: dffffc0000000000 RSI: ffffffff8aec01c0 RDI: ffffffff8b3d45e0
RBP: ffffc90003e4fbc8 R08: dffffc0000000000 R09: fffffbfff2093861
R10: 0000000000000000 R11: dffffc0000000001 R12: dffffc0000000000
R13: 1ffff920007c9f6c R14: ffffc90003e4fb60 R15: 0000000000000201
 spin_unlock_bh include/linux/spinlock.h:396 [inline]
 batadv_tt_global_purge net/batman-adv/translation-table.c:2299 [inline]
 batadv_tt_purge+0x4dc/0xa40 net/batman-adv/translation-table.c:3561
 process_one_work+0x8a9/0x11d0 kernel/workqueue.c:2292
 worker_thread+0xa47/0x1200 kernel/workqueue.c:2439
 kthread+0x28d/0x320 kernel/kthread.c:376
 ret_from_fork+0x1f/0x30 arch/x86/entry/entry_64.S:308
 </TASK>
rcu: rcu_preempt kthread starved for 10448 jiffies! g8585 f0x0 RCU_GP_WAIT_FQS(5) ->state=0x0 ->cpu=1
rcu: 	Unless rcu_preempt kthread gets sufficient CPU time, OOM is now expected behavior.
rcu: RCU grace-period kthread stack dump:
task:rcu_preempt     state:R  running task     stack:25016 pid:16    ppid:2      flags:0x00004000
Call Trace:
 <TASK>
 context_switch kernel/sched/core.c:5245 [inline]
 __schedule+0x142d/0x4550 kernel/sched/core.c:6558
 schedule+0xbf/0x180 kernel/sched/core.c:6634
 schedule_timeout+0x1b9/0x300 kernel/time/timer.c:1965
 rcu_gp_fqs_loop+0x2d2/0x1150 kernel/rcu/tree.c:1706
 rcu_gp_kthread+0xa3/0x3b0 kernel/rcu/tree.c:1905
 kthread+0x28d/0x320 kernel/kthread.c:376
 ret_from_fork+0x1f/0x30 arch/x86/entry/entry_64.S:308
 </TASK>
rcu: Stack dump where RCU GP kthread last ran:
CPU: 1 PID: 4072 Comm: kworker/1:18 Not tainted 6.1.92-syzkaller #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 04/02/2024
Workqueue: events drain_vmap_area_work
RIP: 0010:csd_lock_wait kernel/smp.c:424 [inline]
RIP: 0010:smp_call_function_many_cond+0x1fb5/0x3460 kernel/smp.c:998
Code: e6 01 31 ff e8 ec 42 0b 00 41 83 e5 01 49 bd 00 00 00 00 00 fc ff df 75 0a e8 77 3f 0b 00 e9 1b ff ff ff f3 90 42 0f b6 04 2b <84> c0 75 14 41 f7 07 01 00 00 00 0f 84 fe fe ff ff e8 55 3f 0b 00
RSP: 0018:ffffc900053176e0 EFLAGS: 00000293
RAX: 0000000000000000 RBX: 1ffff1101730859d RCX: ffff888073113b80
RDX: 0000000000000000 RSI: 0000000000000001 RDI: 0000000000000000
RBP: ffffc90005317ac0 R08: ffffffff817f4dc4 R09: fffffbfff2093846
R10: 0000000000000000 R11: dffffc0000000001 R12: 0000000800000000
R13: dffffc0000000000 R14: 0000000000000000 R15: ffff8880b9842ce8
FS:  0000000000000000(0000) GS:ffff8880b9900000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00007f7567ac9140 CR3: 000000007b8f4000 CR4: 00000000003506e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Call Trace:
 <IRQ>
 </IRQ>
 <TASK>
 on_each_cpu_cond_mask+0x3b/0x80 kernel/smp.c:1166
 __purge_vmap_area_lazy+0x29c/0x1720 mm/vmalloc.c:1753
 drain_vmap_area_work+0x3c/0xd0 mm/vmalloc.c:1803
 process_one_work+0x8a9/0x11d0 kernel/workqueue.c:2292
 worker_thread+0xa47/0x1200 kernel/workqueue.c:2439
 kthread+0x28d/0x320 kernel/kthread.c:376
 ret_from_fork+0x1f/0x30 arch/x86/entry/entry_64.S:308
 </TASK>

Crashes (6):
Time Kernel Commit Syzkaller Config Log Report Syz repro C repro VM info Assets (help?) Manager Title
2024/06/04 14:41 linux-6.1.y 88690811da69 11f2afa5 .config console log report info [disk image] [vmlinux] [kernel image] ci2-linux-6-1-kasan INFO: rcu detected stall in batadv_tt_purge
2024/05/28 21:25 linux-6.1.y 88690811da69 34889ee3 .config console log report info [disk image] [vmlinux] [kernel image] ci2-linux-6-1-kasan INFO: rcu detected stall in batadv_tt_purge
2024/05/28 05:26 linux-6.1.y 88690811da69 f550015e .config console log report info [disk image] [vmlinux] [kernel image] ci2-linux-6-1-kasan INFO: rcu detected stall in batadv_tt_purge
2024/05/26 09:21 linux-6.1.y 88690811da69 a10a183e .config console log report info [disk image] [vmlinux] [kernel image] ci2-linux-6-1-kasan INFO: rcu detected stall in batadv_tt_purge
2024/05/26 09:15 linux-6.1.y 88690811da69 a10a183e .config console log report info [disk image] [vmlinux] [kernel image] ci2-linux-6-1-kasan INFO: rcu detected stall in batadv_tt_purge
2024/03/24 12:46 linux-6.1.y d7543167affd 0ea90952 .config console log report info [disk image] [vmlinux] [kernel image] ci2-linux-6-1-kasan INFO: rcu detected stall in batadv_tt_purge
* Struck through repros no longer work on HEAD.