syzbot


INFO: rcu detected stall in wg_ratelimiter_gc_entries (2)

Status: auto-obsoleted due to no activity on 2023/11/23 21:08
Subsystems: wireguard
[Documentation on labels]
Reported-by: syzbot+c1cc0083f159b67cb192@syzkaller.appspotmail.com
First crash: 268d, last: 267d
Cause bisection: introduced by (bisect log) :
commit c2368b19807affd7621f7c4638cd2e17fec13021
Author: Jiri Pirko <jiri@nvidia.com>
Date: Fri Jul 29 07:10:35 2022 +0000

  net: devlink: introduce "unregistering" mark and use it during devlinks iteration

Crash: INFO: rcu detected stall in corrupted (log)
Repro: C syz .config
  
Fix bisection: fixed by (bisect log) :
commit da71714e359b64bd7aab3bd56ec53f307f058133
Author: Jamal Hadi Salim <jhs@mojatatu.com>
Date: Tue Aug 22 10:12:31 2023 +0000

  net/sched: fix a qdisc modification with ambiguous command request

  
Discussions (2)
Title Replies (including bot) Last reply
[syzbot] [wireguard?] INFO: rcu detected stall in wg_ratelimiter_gc_entries (2) 0 (2) 2023/09/29 19:45
[syzbot] Monthly wireguard report (Aug 2023) 0 (1) 2023/08/21 20:40
Similar bugs (2)
Kernel Title Repro Cause bisect Fix bisect Count Last Reported Patched Status
linux-6.1 INFO: rcu detected stall in wg_ratelimiter_gc_entries 1 384d 384d 0/3 auto-obsoleted due to no activity on 2023/08/18 11:48
upstream INFO: rcu detected stall in wg_ratelimiter_gc_entries wireguard 1 361d 361d 0/26 auto-obsoleted due to no activity on 2023/08/11 08:26
Last patch testing requests (4)
Created Duration User Patch Repo Result
2023/08/30 01:53 24m retest repro net OK log
2023/08/30 01:53 25m retest repro net OK log
2023/08/18 11:26 44m hdanton@sina.com patch https://git.kernel.org/pub/scm/linux/kernel/git/netdev/net.git ace0ab3a4b54 OK log
2023/08/17 10:28 47m hdanton@sina.com patch https://git.kernel.org/pub/scm/linux/kernel/git/netdev/net.git ace0ab3a4b54 report log

Sample crash report:
rcu: INFO: rcu_preempt self-detected stall on CPU
rcu: 	1-....: (1 GPs behind) idle=a14c/1/0x4000000000000000 softirq=9048/9050 fqs=4461
rcu: 	         hardirqs   softirqs   csw/system
rcu: 	 number:        1          0            0
rcu: 	cputime:    26363      26111           22   ==> 52480(ms)
rcu: 	(t=10500 jiffies g=6745 q=571 ncpus=2)
CPU: 1 PID: 26 Comm: kworker/1:1 Not tainted 6.5.0-rc5-syzkaller-00202-g8a519a572598 #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 07/26/2023
Workqueue: events_power_efficient wg_ratelimiter_gc_entries
RIP: 0010:taprio_dequeue_tc_priority+0x266/0x4b0 net/sched/sch_taprio.c:798
Code: 10 89 ef 44 89 f6 e8 39 b6 2c f9 44 39 f5 0f 84 40 ff ff ff e8 3b bb 2c f9 49 83 ff 0f 0f 87 e1 01 00 00 48 8b 04 24 0f b6 00 <38> 44 24 36 7c 08 84 c0 0f 85 bf 01 00 00 8b 33 8b 4c 24 30 48 8b
RSP: 0018:ffffc900001e0d60 EFLAGS: 00000293
RAX: 0000000000000000 RBX: ffff88806f214394 RCX: 0000000000000100
RDX: ffff888018a61dc0 RSI: ffffffff88594d65 RDI: 0000000000000004
RBP: 000000000000000a R08: 0000000000000004 R09: 000000000000000a
R10: 0000000000000000 R11: 000000000000004e R12: 0000000000000010
R13: ffff88802cbfab60 R14: 0000000000000000 R15: 0000000000000001
FS:  0000000000000000(0000) GS:ffff8880b9900000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00007fff88e4d3d8 CR3: 0000000018ed8000 CR4: 00000000003506e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Call Trace:
 <IRQ>
 taprio_dequeue+0x12e/0x5f0 net/sched/sch_taprio.c:868
 dequeue_skb net/sched/sch_generic.c:292 [inline]
 qdisc_restart net/sched/sch_generic.c:397 [inline]
 __qdisc_run+0x1c4/0x19d0 net/sched/sch_generic.c:415
 qdisc_run include/net/pkt_sched.h:125 [inline]
 qdisc_run include/net/pkt_sched.h:122 [inline]
 net_tx_action+0x71e/0xc80 net/core/dev.c:5049
 __do_softirq+0x218/0x965 kernel/softirq.c:553
 invoke_softirq kernel/softirq.c:427 [inline]
 __irq_exit_rcu kernel/softirq.c:632 [inline]
 irq_exit_rcu+0xb7/0x120 kernel/softirq.c:644
 sysvec_apic_timer_interrupt+0x93/0xc0 arch/x86/kernel/apic/apic.c:1109
 </IRQ>
 <TASK>
 asm_sysvec_apic_timer_interrupt+0x1a/0x20 arch/x86/include/asm/idtentry.h:645
RIP: 0010:lock_acquire+0x1ef/0x510 kernel/locking/lockdep.c:5729
Code: c1 05 d5 6e 9b 7e 83 f8 01 0f 85 b0 02 00 00 9c 58 f6 c4 02 0f 85 9b 02 00 00 48 85 ed 74 01 fb 48 b8 00 00 00 00 00 fc ff df <48> 01 c3 48 c7 03 00 00 00 00 48 c7 43 08 00 00 00 00 48 8b 84 24
RSP: 0018:ffffc90000a1fb98 EFLAGS: 00000206
RAX: dffffc0000000000 RBX: 1ffff92000143f75 RCX: 0000000000000001
RDX: 1ffff1100314c510 RSI: ffffffff8a6c83a0 RDI: ffffffff8ac811a0
RBP: 0000000000000200 R08: 0000000000000000 R09: fffffbfff2309dea
R10: ffffffff9184ef57 R11: 0000000000000000 R12: 0000000000000001
R13: 0000000000000000 R14: ffffffff8d89afb8 R15: 0000000000000000
 __raw_spin_lock include/linux/spinlock_api_smp.h:133 [inline]
 _raw_spin_lock+0x2e/0x40 kernel/locking/spinlock.c:154
 spin_lock include/linux/spinlock.h:351 [inline]
 wg_ratelimiter_gc_entries+0xc6/0x520 drivers/net/wireguard/ratelimiter.c:63
 process_one_work+0xaa2/0x16f0 kernel/workqueue.c:2600
 worker_thread+0x687/0x1110 kernel/workqueue.c:2751
 kthread+0x33a/0x430 kernel/kthread.c:389
 ret_from_fork+0x2c/0x70 arch/x86/kernel/process.c:145
 ret_from_fork_asm+0x11/0x20 arch/x86/entry/entry_64.S:304
 </TASK>

Crashes (3):
Time Kernel Commit Syzkaller Config Log Report Syz repro C repro VM info Assets (help?) Manager Title
2023/08/15 21:08 net 8a519a572598 39990d51 .config console log report syz C [disk image] [vmlinux] [kernel image] ci-upstream-net-this-kasan-gce INFO: rcu detected stall in wg_ratelimiter_gc_entries
2023/08/14 18:27 net ace0ab3a4b54 39990d51 .config console log report syz C [disk image] [vmlinux] [kernel image] ci-upstream-net-this-kasan-gce INFO: rcu detected stall in wg_ratelimiter_gc_entries
2023/08/15 06:21 upstream 2ccdd1b13c59 39990d51 .config console log report info [disk image] [vmlinux] [kernel image] ci-upstream-kasan-gce-386 INFO: rcu detected stall in wg_ratelimiter_gc_entries
* Struck through repros no longer work on HEAD.