syzbot


BUG: soft lockup in br_multicast_group_expired

Status: premoderation: reported on 2024/08/19 00:45
Reported-by: syzbot+6eb826009b7bfd5769ba@syzkaller.appspotmail.com
First crash: 28d, last: 28d
Similar bugs (2)
Kernel Title Repro Cause bisect Fix bisect Count Last Reported Patched Status
upstream BUG: soft lockup in br_multicast_group_expired net 1 1752d 1752d 0/28 closed as invalid on 2019/11/30 16:54
android-5-10 BUG: soft lockup in br_multicast_group_expired 1 40d 40d 0/2 premoderation: reported on 2024/08/07 04:04

Sample crash report:
watchdog: BUG: soft lockup - CPU#1 stuck for 143s! [syz.4.454:2823]
Modules linked in:
CPU: 1 PID: 2823 Comm: syz.4.454 Not tainted 6.1.90-syzkaller-00020-gd6a513a78492 #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 06/27/2024
RIP: 0010:update_stack_state+0x293/0x460 arch/x86/kernel/unwind_frame.c:261
Code: 00 49 83 3e 00 48 8b 45 a8 4c 8b ad 78 ff ff ff 74 22 44 3b 7d a4 75 1c 48 3b 45 98 73 16 31 db 89 d8 48 81 c4 90 00 00 00 5b <41> 5c 41 5d 41 5e 41 5f 5d c3 4d 85 ed 4c 89 75 b0 74 77 49 bf 00
RSP: 0018:ffffc900001b0688 EFLAGS: 00000282
RAX: 00000000001b0801 RBX: ffffc900001b07b0 RCX: 1ffff92000036103
RDX: 1ffff920000360fa RSI: ffffc900001b0850 RDI: ffffc900001b0818
RBP: ffffc900001b06a8 R08: dffffc0000000000 R09: ffffc900001b07c0
R10: 0000000000000000 R11: dffffc0000000001 R12: 1ffff92000036100
R13: 0000000000000000 R14: ffffc900001b0800 R15: dffffc0000000000
FS:  0000000000000000(0000) GS:ffff8881f7100000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000555555a85808 CR3: 0000000006e0f000 CR4: 00000000003506a0
DR0: 0000000020000300 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000600
Call Trace:
 <IRQ>
 unwind_next_frame+0x3cb/0x700 arch/x86/kernel/unwind_frame.c:315
 __unwind_start+0x318/0x3a0 arch/x86/kernel/unwind_frame.c:417
 unwind_start arch/x86/include/asm/unwind.h:64 [inline]
 arch_stack_walk+0xdb/0x140 arch/x86/kernel/stacktrace.c:24
 stack_trace_save+0x113/0x1c0 kernel/stacktrace.c:122
 kasan_save_stack mm/kasan/common.c:45 [inline]
 kasan_set_track+0x4b/0x70 mm/kasan/common.c:52
 kasan_save_alloc_info+0x1f/0x30 mm/kasan/generic.c:505
 ____kasan_kmalloc mm/kasan/common.c:379 [inline]
 __kasan_kmalloc+0x9c/0xb0 mm/kasan/common.c:388
 kasan_kmalloc include/linux/kasan.h:212 [inline]
 __do_kmalloc_node mm/slab_common.c:957 [inline]
 __kmalloc_node_track_caller+0xb3/0x1e0 mm/slab_common.c:977
 kmalloc_reserve net/core/skbuff.c:446 [inline]
 __alloc_skb+0x125/0x2d0 net/core/skbuff.c:515
 alloc_skb include/linux/skbuff.h:1290 [inline]
 nlmsg_new include/net/netlink.h:991 [inline]
 br_mdb_notify+0x2d8/0x9b0 net/bridge/br_mdb.c:568
 br_multicast_host_leave net/bridge/br_multicast.c:1345 [inline]
 br_multicast_group_expired+0x30a/0x5e0 net/bridge/br_multicast.c:633
 call_timer_fn+0x3b/0x2d0 kernel/time/timer.c:1510
 expire_timers kernel/time/timer.c:1555 [inline]
 __run_timers+0x72a/0xa10 kernel/time/timer.c:1826
 run_timer_softirq+0x69/0xf0 kernel/time/timer.c:1839
 __do_softirq+0x1d8/0x661 kernel/softirq.c:617
 invoke_softirq kernel/softirq.c:472 [inline]
 __irq_exit_rcu+0x50/0xf0 kernel/softirq.c:700
 irq_exit_rcu+0x9/0x10 kernel/softirq.c:712
 sysvec_apic_timer_interrupt+0x9a/0xc0 arch/x86/kernel/apic/apic.c:1106
 </IRQ>
 <TASK>
 asm_sysvec_apic_timer_interrupt+0x1b/0x20 arch/x86/include/asm/idtentry.h:653
RIP: 0010:finish_task_switch+0x16f/0x7b0 kernel/sched/core.c:5295
Code: 74 08 4c 89 ff e8 31 63 6c 00 49 8b 1f 48 85 db 4c 8b 6d c0 0f 85 ce 00 00 00 4c 89 e7 e8 b9 0e cf 03 fb 49 8d 9d 48 0b 00 00 <48> 89 d8 48 c1 e8 03 49 be 00 00 00 00 00 fc ff df 42 0f b6 04 30
RSP: 0018:ffffc900010a6ee0 EFLAGS: 00000282
RAX: 0000000080000001 RBX: ffff88810d3e9f88 RCX: 0000000000000002
RDX: 0000000040000000 RSI: 0000000000000000 RDI: 0000000000000001
RBP: ffffc900010a6f30 R08: ffffffff819b54a0 R09: fffffbfff0ee5127
R10: 0000000000000000 R11: dffffc0000000001 R12: ffff8881f7137c80
R13: ffff88810d3e9440 R14: 1ffff1103ee270fc R15: ffff8881f71387e0
 context_switch kernel/sched/core.c:5421 [inline]
 __schedule+0xcaf/0x1550 kernel/sched/core.c:6744
 preempt_schedule_irq+0xc7/0x140 kernel/sched/core.c:7056
 raw_irqentry_exit_cond_resched+0x2a/0x30 kernel/entry/common.c:396
 irqentry_exit+0x30/0x40 kernel/entry/common.c:439
 sysvec_apic_timer_interrupt+0x55/0xc0 arch/x86/kernel/apic/apic.c:1106
 asm_sysvec_apic_timer_interrupt+0x1b/0x20 arch/x86/include/asm/idtentry.h:653
RIP: 0010:__kernel_text_address+0x0/0x40 kernel/extable.c:78
Code: 89 f0 5b 41 5e 5d c3 48 c7 c1 80 55 72 87 80 e1 07 80 c1 03 38 c1 7c c3 48 c7 c7 80 55 72 87 e8 56 07 70 00 eb b5 0f 1f 40 00 <55> 48 89 e5 53 48 89 fb e8 33 00 00 00 85 c0 0f 95 c0 48 c7 c1 00
RSP: 0018:ffffc900010a71f0 EFLAGS: 00000246
RAX: 0000000000000000 RBX: ffffc900010a7268 RCX: 00000000010a7201
RDX: 1ffff92000214e46 RSI: ffffc900010a7ef8 RDI: ffffffff85191146
RBP: ffffc900010a7210 R08: ffffc900010a7308 R09: 000000000000000e
R10: ffffc900010a7310 R11: dffffc0000000001 R12: ffff88810d3e9440
R13: ffffffff8165c3e0 R14: dffffc0000000000 R15: 1ffff92000214e4d
 arch_stack_walk+0xf3/0x140 arch/x86/kernel/stacktrace.c:26
 stack_trace_save+0x113/0x1c0 kernel/stacktrace.c:122
 save_stack+0xf6/0x1e0 mm/page_owner.c:147
 __reset_page_owner+0x54/0x190 mm/page_owner.c:168
 reset_page_owner include/linux/page_owner.h:26 [inline]
 free_pages_prepare mm/page_alloc.c:1498 [inline]
 free_pcp_prepare mm/page_alloc.c:1572 [inline]
 free_unref_page_prepare+0x83d/0x850 mm/page_alloc.c:3498
 free_unref_page+0xb2/0x5c0 mm/page_alloc.c:3594
 free_the_page mm/page_alloc.c:798 [inline]
 __free_pages+0x61/0xf0 mm/page_alloc.c:5803
 __vunmap+0x9f3/0xb60 mm/vmalloc.c:2728
 __vfree mm/vmalloc.c:2776 [inline]
 vfree+0x5c/0x80 mm/vmalloc.c:2807
 kcov_put kernel/kcov.c:428 [inline]
 kcov_close+0x2b/0x50 kernel/kcov.c:524
 __fput+0x3ab/0x870 fs/file_table.c:320
 ____fput+0x15/0x20 fs/file_table.c:348
 task_work_run+0x24d/0x2e0 kernel/task_work.c:179
 exit_task_work include/linux/task_work.h:38 [inline]
 do_exit+0xbd5/0x2b80 kernel/exit.c:875
 do_group_exit+0x21a/0x2d0 kernel/exit.c:1025
 get_signal+0x169d/0x1820 kernel/signal.c:2880
 arch_do_signal_or_restart+0xb0/0x16f0 arch/x86/kernel/signal.c:871
 exit_to_user_mode_loop+0x74/0xa0 kernel/entry/common.c:174
 exit_to_user_mode_prepare+0x5a/0xa0 kernel/entry/common.c:210
 __syscall_exit_to_user_mode_work kernel/entry/common.c:292 [inline]
 syscall_exit_to_user_mode+0x26/0x130 kernel/entry/common.c:303
 do_syscall_64+0x47/0xb0 arch/x86/entry/common.c:87
 entry_SYSCALL_64_after_hwframe+0x68/0xd2
RIP: 0033:0x7f71a1579e79
Code: Unable to access opcode bytes at 0x7f71a1579e4f.
RSP: 002b:00007f71a24410e8 EFLAGS: 00000246 ORIG_RAX: 00000000000000ca
RAX: 0000000000000001 RBX: 00007f71a1715f88 RCX: 00007f71a1579e79
RDX: 00000000000f4240 RSI: 0000000000000081 RDI: 00007f71a1715f8c
RBP: 00007f71a1715f80 R08: 00007ffdc75f30b0 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000246 R12: 00007f71a1715f8c
R13: 0000000000000000 R14: 00007ffdc75c15e0 R15: 00007ffdc75c16c8
 </TASK>
Sending NMI from CPU 1 to CPUs 0:
NMI backtrace for cpu 0
CPU: 0 PID: 326 Comm: kworker/u4:3 Not tainted 6.1.90-syzkaller-00020-gd6a513a78492 #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 06/27/2024
Workqueue: events_unbound toggle_allocation_gate
RIP: 0010:__sanitizer_cov_trace_pc+0x5d/0x60 kernel/kcov.c:225
Code: 0b 00 00 83 fa 02 75 21 48 8b 91 50 0b 00 00 48 8b 32 48 8d 7e 01 8b 89 4c 0b 00 00 48 39 cf 73 08 48 89 3a 48 89 44 f2 08 5d <c3> 66 90 55 48 89 e5 4c 8b 45 08 65 48 8b 15 a0 ca 8c 7e 65 8b 05
RSP: 0018:ffffc9000b2e7838 EFLAGS: 00000293
RAX: ffffffff816c074c RBX: 1ffff1103ee27885 RCX: ffff88810f7cd100
RDX: 0000000000000000 RSI: 0000000000000001 RDI: 0000000000000000
RBP: ffffc9000b2e7958 R08: ffffffff816c0715 R09: ffffed103ee071fb
R10: 0000000000000000 R11: dffffc0000000001 R12: 0000000000000001
R13: 0000000800000000 R14: ffff8881f713c428 R15: dffffc0000000000
FS:  0000000000000000(0000) GS:ffff8881f7000000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00007f3c36f21fa0 CR3: 0000000006e0f000 CR4: 00000000003506b0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000600
Call Trace:
 <NMI>
 </NMI>
 <TASK>
 on_each_cpu_cond_mask+0x40/0x80 kernel/smp.c:1166
 on_each_cpu include/linux/smp.h:71 [inline]
 text_poke_sync arch/x86/kernel/alternative.c:1334 [inline]
 text_poke_bp_batch+0x1e9/0x730 arch/x86/kernel/alternative.c:1534
 text_poke_flush arch/x86/kernel/alternative.c:1725 [inline]
 text_poke_finish+0x1a/0x30 arch/x86/kernel/alternative.c:1732
 arch_jump_label_transform_apply+0x15/0x30 arch/x86/kernel/jump_label.c:146
 __jump_label_update+0x36a/0x380 kernel/jump_label.c:455
 jump_label_update+0x3af/0x450 kernel/jump_label.c:801
 static_key_enable_cpuslocked+0x12f/0x250 kernel/jump_label.c:177
 static_key_enable+0x1a/0x30 kernel/jump_label.c:190
 toggle_allocation_gate+0xbf/0x450 mm/kfence/core.c:804
 process_one_work+0x73d/0xcb0 kernel/workqueue.c:2299
 worker_thread+0xa60/0x1260 kernel/workqueue.c:2446
 kthread+0x26d/0x300 kernel/kthread.c:386
 ret_from_fork+0x1f/0x30 arch/x86/entry/entry_64.S:308
 </TASK>

Crashes (1):
Time Kernel Commit Syzkaller Config Log Report Syz repro C repro VM info Assets (help?) Manager Title
2024/08/19 00:44 android14-6.1 d6a513a78492 dbc93b08 .config console log report info [disk image] [vmlinux] [kernel image] ci2-android-6-1-perf BUG: soft lockup in br_multicast_group_expired
* Struck through repros no longer work on HEAD.