syzbot


INFO: rcu detected stall in do_idle (2)

Status: auto-closed as invalid on 2022/08/24 07:48
Reported-by: syzbot+60c6d3385e4c30e81e1b@syzkaller.appspotmail.com
First crash: 729d, last: 729d
Similar bugs (7)
Kernel Title Repro Cause bisect Fix bisect Count Last Reported Patched Status
linux-5.15 BUG: soft lockup in do_idle origin:upstream C error 28 4d23h 331d 0/3 upstream: reported C repro on 2023/05/29 19:14
android-414 INFO: rcu detected stall in do_idle 1 2049d 2049d 0/1 auto-closed as invalid on 2019/03/13 10:31
linux-4.19 INFO: rcu detected stall in do_idle 1 1587d 1587d 0/1 auto-closed as invalid on 2020/04/18 00:25
linux-4.14 INFO: rcu detected stall in do_idle 4 1682d 1688d 0/1 auto-closed as invalid on 2020/01/14 08:59
upstream INFO: rcu detected stall in do_idle acpi C done error 1834 20h49m 2020d 0/26 upstream: reported C repro on 2018/10/13 07:31
linux-4.14 INFO: rcu detected stall in do_idle (2) C error 4 604d 733d 0/1 upstream: reported C repro on 2022/04/22 10:09
linux-6.1 BUG: soft lockup in do_idle origin:upstream C 15 18d 319d 0/3 upstream: reported C repro on 2023/06/10 08:51

Sample crash report:
IPv6: ADDRCONF(NETDEV_CHANGE): wlan0: link becomes ready
wlan1: Created IBSS using preconfigured BSSID 50:50:50:50:50:50
wlan1: Creating new IBSS network, BSSID 50:50:50:50:50:50
IPv6: ADDRCONF(NETDEV_CHANGE): wlan1: link becomes ready
[Firmware Bug]: TSC ADJUST differs: CPU0 0 --> -513626785106. Restoring
rcu: INFO: rcu_preempt self-detected stall on CPU
rcu: 	0-...!: (1 ticks this GP) idle=822/1/0x4000000000000002 softirq=22331/22331 fqs=0 
rcu: 	 (t=23349 jiffies g=22513 q=115)
rcu: rcu_preempt kthread starved for 23349 jiffies! g22513 f0x0 RCU_GP_WAIT_FQS(5) ->state=0x402 ->cpu=0
rcu: RCU grace-period kthread stack dump:
systemd[1]: systemd-udevd.service: Watchdog timeout (limit 3min)!
rcu_preempt     I29208    10      2 0x80000000
systemd[1]: systemd-udevd.service: Killing process 4699 (systemd-udevd) with signal SIGABRT.
Call Trace:
systemd[1]: systemd-timesyncd.service: Watchdog timeout (limit 3min)!
 context_switch kernel/sched/core.c:2828 [inline]
 __schedule+0x887/0x2040 kernel/sched/core.c:3517
systemd[1]: systemd-timesyncd.service: Killing process 6163 (systemd-timesyn) with signal SIGABRT.
 schedule+0x8d/0x1b0 kernel/sched/core.c:3561
 schedule_timeout+0x4cf/0xfe0 kernel/time/timer.c:1818
 rcu_gp_kthread+0xdad/0x21c0 kernel/rcu/tree.c:2202
 kthread+0x33f/0x460 kernel/kthread.c:259
 ret_from_fork+0x24/0x30 arch/x86/entry/entry_64.S:415
NMI backtrace for cpu 0
CPU: 0 PID: 0 Comm: swapper/0 Not tainted 4.19.211-syzkaller #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 01/01/2011
Call Trace:
 <IRQ>
 __dump_stack lib/dump_stack.c:77 [inline]
 dump_stack+0x1fc/0x2ef lib/dump_stack.c:118
 nmi_cpu_backtrace.cold+0x63/0xa2 lib/nmi_backtrace.c:101
 nmi_trigger_cpumask_backtrace+0x1a6/0x1f0 lib/nmi_backtrace.c:62
 trigger_single_cpu_backtrace include/linux/nmi.h:164 [inline]
 rcu_dump_cpu_stacks+0x15f/0x19c kernel/rcu/tree.c:1340
 print_cpu_stall kernel/rcu/tree.c:1478 [inline]
 check_cpu_stall kernel/rcu/tree.c:1550 [inline]
 __rcu_pending kernel/rcu/tree.c:3293 [inline]
 rcu_pending kernel/rcu/tree.c:3336 [inline]
 rcu_check_callbacks.cold+0x62d/0xe19 kernel/rcu/tree.c:2682
 update_process_times+0x2a/0x70 kernel/time/timer.c:1650
 tick_sched_handle+0x9b/0x180 kernel/time/tick-sched.c:168
 tick_sched_timer+0xfc/0x290 kernel/time/tick-sched.c:1278
 __run_hrtimer kernel/time/hrtimer.c:1465 [inline]
 __hrtimer_run_queues+0x3f6/0xe60 kernel/time/hrtimer.c:1527
 hrtimer_interrupt+0x326/0x9e0 kernel/time/hrtimer.c:1585
 local_apic_timer_interrupt arch/x86/kernel/apic/apic.c:1071 [inline]
 smp_apic_timer_interrupt+0x10c/0x550 arch/x86/kernel/apic/apic.c:1096
 apic_timer_interrupt+0xf/0x20 arch/x86/entry/entry_64.S:894
 </IRQ>
RIP: 0010:cpuidle_idle_call kernel/sched/idle.c:140 [inline]
RIP: 0010:do_idle+0x372/0x4b0 kernel/sched/idle.c:263
Code: 48 c7 c0 98 82 f1 89 48 c1 e8 03 80 3c 18 00 0f 85 0c 01 00 00 48 83 3d 53 59 ae 08 00 0f 84 9a 00 00 00 fb 66 0f 1f 44 00 00 <e9> b5 fd ff ff 0f 0b 0f 0b e8 e0 7f 24 00 48 c7 c0 98 82 f1 89 48
RSP: 0018:ffffffff89e07d78 EFLAGS: 00000286 ORIG_RAX: ffffffffffffff13
RAX: 1ffffffff13e3053 RBX: dffffc0000000000 RCX: 0000000000000000
RDX: 0000000000000000 RSI: 0000000000000001 RDI: ffffffff89e78904
RBP: 0000000000000000 R08: 0000000000000047 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000000 R12: ffffffff89f18290
R13: 1ffffffff13c0fb2 R14: 0000000000000000 R15: 0000000000000000
 cpu_startup_entry+0xc5/0xe0 kernel/sched/idle.c:369
 start_kernel+0x8d6/0x911 init/main.c:736
 secondary_startup_64+0xa4/0xb0 arch/x86/kernel/head_64.S:243
clocksource: timekeeping watchdog on CPU0: Marking clocksource 'tsc' as unstable because the skew is too large:
clocksource:                       'kvm-clock' wd_now: 6cce2a4b14 wd_last: 366a43cc48 mask: ffffffffffffffff
clocksource:                       'tsc' cs_now: 30a43fee cs_last: 77a91814bc mask: ffffffffffffffff
tsc: Marking TSC unstable due to clocksource watchdog
ieee802154 phy0 wpan0: encryption failed: -22
ieee802154 phy1 wpan1: encryption failed: -22
Bluetooth: hci3: command 0x040f tx timeout
systemd[1]: systemd-journald.service: Main process exited, code=killed, status=6/ABRT
systemd[1]: systemd-journald.service: Unit entered failed state.
systemd[1]: systemd-journald.service: Failed with result 'watchdog'.
systemd[1]: systemd-journald.service: Service has no hold-off time, scheduling restart.
systemd[1]: Stopped Flush Journal to Persistent Storage.
systemd[1]: Stopping Flush Journal to Persistent Storage...
----------------
Code disassembly (best guess):
   0:	48 c7 c0 98 82 f1 89 	mov    $0xffffffff89f18298,%rax
   7:	48 c1 e8 03          	shr    $0x3,%rax
   b:	80 3c 18 00          	cmpb   $0x0,(%rax,%rbx,1)
   f:	0f 85 0c 01 00 00    	jne    0x121
  15:	48 83 3d 53 59 ae 08 	cmpq   $0x0,0x8ae5953(%rip)        # 0x8ae5970
  1c:	00
  1d:	0f 84 9a 00 00 00    	je     0xbd
  23:	fb                   	sti
  24:	66 0f 1f 44 00 00    	nopw   0x0(%rax,%rax,1)
* 2a:	e9 b5 fd ff ff       	jmpq   0xfffffde4 <-- trapping instruction
  2f:	0f 0b                	ud2
  31:	0f 0b                	ud2
  33:	e8 e0 7f 24 00       	callq  0x248018
  38:	48 c7 c0 98 82 f1 89 	mov    $0xffffffff89f18298,%rax
  3f:	48                   	rex.W

Crashes (2):
Time Kernel Commit Syzkaller Config Log Report Syz repro C repro VM info Assets (help?) Manager Title
2022/04/26 07:48 linux-4.19.y 3f8a27f9e27b 1fa34c1b .config console log report info ci2-linux-4-19 INFO: rcu detected stall in do_idle
2022/04/26 06:46 linux-4.19.y 3f8a27f9e27b 1fa34c1b .config console log report info ci2-linux-4-19 INFO: rcu detected stall in do_idle
* Struck through repros no longer work on HEAD.