syzbot


INFO: task hung in rdma_dev_exit_net (2)

Status: auto-closed as invalid on 2021/05/10 22:49
Subsystems: rdma
[Documentation on labels]
First crash: 1276d, last: 1142d
Similar bugs (3)
Kernel Title Repro Cause bisect Fix bisect Count Last Reported Patched Status
upstream INFO: task hung in rdma_dev_exit_net (3) rdma 5 890d 925d 0/26 auto-closed as invalid on 2022/01/17 19:05
upstream INFO: task hung in rdma_dev_exit_net rdma 1 1436d 1436d 0/26 auto-closed as invalid on 2020/07/21 12:21
upstream INFO: task hung in rdma_dev_exit_net (4) rdma 7 264d 427d 0/26 auto-obsoleted due to no activity on 2023/10/05 19:47

Sample crash report:
INFO: task kworker/u4:6:313 blocked for more than 143 seconds.
      Not tainted 5.11.0-rc6-syzkaller #0
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
task:kworker/u4:6    state:D stack:24352 pid:  313 ppid:     2 flags:0x00004000
Workqueue: netns cleanup_net
Call Trace:
 context_switch kernel/sched/core.c:4327 [inline]
 __schedule+0x90c/0x21a0 kernel/sched/core.c:5078
 schedule+0xcf/0x270 kernel/sched/core.c:5157
 rwsem_down_write_slowpath+0x7e5/0x1200 kernel/locking/rwsem.c:1106
 __down_write_common kernel/locking/rwsem.c:1261 [inline]
 __down_write_common kernel/locking/rwsem.c:1258 [inline]
 __down_write kernel/locking/rwsem.c:1270 [inline]
 down_write+0x132/0x150 kernel/locking/rwsem.c:1407
 rdma_dev_exit_net+0x1b7/0x540 drivers/infiniband/core/device.c:1108
 ops_exit_list+0xb0/0x160 net/core/net_namespace.c:187
 cleanup_net+0x4ea/0xb10 net/core/net_namespace.c:604
 process_one_work+0x98d/0x15f0 kernel/workqueue.c:2275
 worker_thread+0x64c/0x1120 kernel/workqueue.c:2421
 kthread+0x3b1/0x4a0 kernel/kthread.c:292
 ret_from_fork+0x1f/0x30 arch/x86/entry/entry_64.S:296
INFO: task kworker/1:19:16109 blocked for more than 143 seconds.
      Not tainted 5.11.0-rc6-syzkaller #0
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
task:kworker/1:19    state:D stack:25696 pid:16109 ppid:     2 flags:0x00004000
Workqueue: events switchdev_deferred_process_work
Call Trace:
 context_switch kernel/sched/core.c:4327 [inline]
 __schedule+0x90c/0x21a0 kernel/sched/core.c:5078
 schedule+0xcf/0x270 kernel/sched/core.c:5157
 schedule_preempt_disabled+0xf/0x20 kernel/sched/core.c:5216
 __mutex_lock_common kernel/locking/mutex.c:1033 [inline]
 __mutex_lock+0x81a/0x1110 kernel/locking/mutex.c:1103
 switchdev_deferred_process_work+0xa/0x20 net/switchdev/switchdev.c:74
 process_one_work+0x98d/0x15f0 kernel/workqueue.c:2275
 worker_thread+0x64c/0x1120 kernel/workqueue.c:2421
 kthread+0x3b1/0x4a0 kernel/kthread.c:292
 ret_from_fork+0x1f/0x30 arch/x86/entry/entry_64.S:296
INFO: task syz-executor.1:18405 blocked for more than 144 seconds.
      Not tainted 5.11.0-rc6-syzkaller #0
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
task:syz-executor.1  state:D stack:27416 pid:18405 ppid:  8454 flags:0x00004004
Call Trace:
 context_switch kernel/sched/core.c:4327 [inline]
 __schedule+0x90c/0x21a0 kernel/sched/core.c:5078
 schedule+0xcf/0x270 kernel/sched/core.c:5157
 schedule_preempt_disabled+0xf/0x20 kernel/sched/core.c:5216
 __mutex_lock_common kernel/locking/mutex.c:1033 [inline]
 __mutex_lock+0x81a/0x1110 kernel/locking/mutex.c:1103
 ib_get_eth_speed+0xeb/0x5c0 drivers/infiniband/core/verbs.c:1818
 rxe_query_port+0x13d/0x2d0 drivers/infiniband/sw/rxe/rxe_verbs.c:41
 __ib_query_port drivers/infiniband/core/device.c:2041 [inline]
 ib_query_port drivers/infiniband/core/device.c:2076 [inline]
 ib_query_port+0x4d9/0x9e0 drivers/infiniband/core/device.c:2066
 add_port drivers/infiniband/core/sysfs.c:1064 [inline]
 ib_setup_port_attrs+0x1ee/0x14f0 drivers/infiniband/core/sysfs.c:1395
 add_one_compat_dev+0x4ab/0x800 drivers/infiniband/core/device.c:954
 rdma_dev_init_net+0x2dc/0x480 drivers/infiniband/core/device.c:1170
 ops_init+0xaf/0x470 net/core/net_namespace.c:152
 setup_net+0x2de/0x850 net/core/net_namespace.c:342
 copy_net_ns+0x31e/0x760 net/core/net_namespace.c:483
 create_new_namespaces+0x3f6/0xb20 kernel/nsproxy.c:110
 unshare_nsproxy_namespaces+0xbd/0x1f0 kernel/nsproxy.c:226
 ksys_unshare+0x445/0x8e0 kernel/fork.c:2955
 __do_sys_unshare kernel/fork.c:3023 [inline]
 __se_sys_unshare kernel/fork.c:3021 [inline]
 __x64_sys_unshare+0x2d/0x40 kernel/fork.c:3021
 do_syscall_64+0x2d/0x70 arch/x86/entry/common.c:46
 entry_SYSCALL_64_after_hwframe+0x44/0xa9
RIP: 0033:0x465b09
RSP: 002b:00007f8af90cb188 EFLAGS: 00000246 ORIG_RAX: 0000000000000110
RAX: ffffffffffffffda RBX: 000000000056bf60 RCX: 0000000000465b09
RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000040060000
RBP: 00000000004b069f R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000246 R12: 000000000056bf60
R13: 00007ffd6a71f29f R14: 00007f8af90cb300 R15: 0000000000022000

Showing all locks held in the system:
3 locks held by kworker/u4:5/195:
4 locks held by kworker/u4:6/313:
 #0: ffff888011aaf138 ((wq_completion)netns){+.+.}-{0:0}, at: arch_atomic64_set arch/x86/include/asm/atomic64_64.h:34 [inline]
 #0: ffff888011aaf138 ((wq_completion)netns){+.+.}-{0:0}, at: atomic64_set include/asm-generic/atomic-instrumented.h:856 [inline]
 #0: ffff888011aaf138 ((wq_completion)netns){+.+.}-{0:0}, at: atomic_long_set include/asm-generic/atomic-long.h:41 [inline]
 #0: ffff888011aaf138 ((wq_completion)netns){+.+.}-{0:0}, at: set_work_data kernel/workqueue.c:616 [inline]
 #0: ffff888011aaf138 ((wq_completion)netns){+.+.}-{0:0}, at: set_work_pool_and_clear_pending kernel/workqueue.c:643 [inline]
 #0: ffff888011aaf138 ((wq_completion)netns){+.+.}-{0:0}, at: process_one_work+0x871/0x15f0 kernel/workqueue.c:2246
 #1: ffffc9000198fda8 (net_cleanup_work){+.+.}-{0:0}, at: process_one_work+0x8a5/0x15f0 kernel/workqueue.c:2250
 #2: ffffffff8d445c90 (pernet_ops_rwsem){++++}-{3:3}, at: cleanup_net+0x9b/0xb10 net/core/net_namespace.c:566
 #3: ffffffff8d174f50 (rdma_nets_rwsem){++++}-{3:3}, at: rdma_dev_exit_net+0x1b7/0x540 drivers/infiniband/core/device.c:1108
1 lock held by khungtaskd/1649:
 #0: ffffffff8bd73da0 (rcu_read_lock){....}-{1:2}, at: debug_show_all_locks+0x53/0x260 kernel/locking/lockdep.c:6259
1 lock held by in:imklog/8162:
 #0: ffff888012de00f0 (&f->f_pos_lock){+.+.}-{3:3}, at: __fdget_pos+0xe9/0x100 fs/file.c:947
3 locks held by kworker/0:7/12971:
 #0: ffff888147a34d38 ((wq_completion)ipv6_addrconf){+.+.}-{0:0}, at: arch_atomic64_set arch/x86/include/asm/atomic64_64.h:34 [inline]
 #0: ffff888147a34d38 ((wq_completion)ipv6_addrconf){+.+.}-{0:0}, at: atomic64_set include/asm-generic/atomic-instrumented.h:856 [inline]
 #0: ffff888147a34d38 ((wq_completion)ipv6_addrconf){+.+.}-{0:0}, at: atomic_long_set include/asm-generic/atomic-long.h:41 [inline]
 #0: ffff888147a34d38 ((wq_completion)ipv6_addrconf){+.+.}-{0:0}, at: set_work_data kernel/workqueue.c:616 [inline]
 #0: ffff888147a34d38 ((wq_completion)ipv6_addrconf){+.+.}-{0:0}, at: set_work_pool_and_clear_pending kernel/workqueue.c:643 [inline]
 #0: ffff888147a34d38 ((wq_completion)ipv6_addrconf){+.+.}-{0:0}, at: process_one_work+0x871/0x15f0 kernel/workqueue.c:2246
 #1: ffffc9000b937da8 ((addr_chk_work).work){+.+.}-{0:0}, at: process_one_work+0x8a5/0x15f0 kernel/workqueue.c:2250
 #2: ffffffff8d4590e8 (rtnl_mutex){+.+.}-{3:3}, at: addrconf_verify_work+0xa/0x20 net/ipv6/addrconf.c:4570
3 locks held by kworker/1:19/16109:
 #0: ffff888010c62d38 ((wq_completion)events){+.+.}-{0:0}, at: arch_atomic64_set arch/x86/include/asm/atomic64_64.h:34 [inline]
 #0: ffff888010c62d38 ((wq_completion)events){+.+.}-{0:0}, at: atomic64_set include/asm-generic/atomic-instrumented.h:856 [inline]
 #0: ffff888010c62d38 ((wq_completion)events){+.+.}-{0:0}, at: atomic_long_set include/asm-generic/atomic-long.h:41 [inline]
 #0: ffff888010c62d38 ((wq_completion)events){+.+.}-{0:0}, at: set_work_data kernel/workqueue.c:616 [inline]
 #0: ffff888010c62d38 ((wq_completion)events){+.+.}-{0:0}, at: set_work_pool_and_clear_pending kernel/workqueue.c:643 [inline]
 #0: ffff888010c62d38 ((wq_completion)events){+.+.}-{0:0}, at: process_one_work+0x871/0x15f0 kernel/workqueue.c:2246
 #1: ffffc90002dc7da8 (deferred_process_work){+.+.}-{0:0}, at: process_one_work+0x8a5/0x15f0 kernel/workqueue.c:2250
 #2: ffffffff8d4590e8 (rtnl_mutex){+.+.}-{3:3}, at: switchdev_deferred_process_work+0xa/0x20 net/switchdev/switchdev.c:74
1 lock held by syz-executor.2/18418:
6 locks held by syz-executor.1/18405:
 #0: ffffffff8d445c90 (pernet_ops_rwsem){++++}-{3:3}, at: copy_net_ns+0x2fa/0x760 net/core/net_namespace.c:479
 #1: ffffffff8d1751d0 (devices_rwsem){++++}-{3:3}, at: rdma_dev_init_net+0x25e/0x480 drivers/infiniband/core/device.c:1164
 #2: ffffffff8d174f50 (rdma_nets_rwsem){++++}-{3:3}, at: rdma_dev_init_net+0x2d1/0x480 drivers/infiniband/core/device.c:1169
 #3: ffff88805d9e8f70 (&device->compat_devs_mutex){+.+.}-{3:3}, at: add_one_compat_dev+0xea/0x800 drivers/infiniband/core/device.c:928
 #4: ffff88805d9e9198 (&rxe->usdev_lock){+.+.}-{3:3}, at: rxe_query_port+0x129/0x2d0 drivers/infiniband/sw/rxe/rxe_verbs.c:40
 #5: ffffffff8d4590e8 (rtnl_mutex){+.+.}-{3:3}, at: ib_get_eth_speed+0xeb/0x5c0 drivers/infiniband/core/verbs.c:1818

=============================================

NMI backtrace for cpu 1
CPU: 1 PID: 1649 Comm: khungtaskd Not tainted 5.11.0-rc6-syzkaller #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 01/01/2011
Call Trace:
 __dump_stack lib/dump_stack.c:79 [inline]
 dump_stack+0x107/0x163 lib/dump_stack.c:120
 nmi_cpu_backtrace.cold+0x44/0xd7 lib/nmi_backtrace.c:105
 nmi_trigger_cpumask_backtrace+0x1b3/0x230 lib/nmi_backtrace.c:62
 trigger_all_cpu_backtrace include/linux/nmi.h:146 [inline]
 check_hung_uninterruptible_tasks kernel/hung_task.c:209 [inline]
 watchdog+0xd43/0xfa0 kernel/hung_task.c:294
 kthread+0x3b1/0x4a0 kernel/kthread.c:292
 ret_from_fork+0x1f/0x30 arch/x86/entry/entry_64.S:296
Sending NMI from CPU 1 to CPUs 0:
NMI backtrace for cpu 0
CPU: 0 PID: 18418 Comm: syz-executor.2 Not tainted 5.11.0-rc6-syzkaller #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 01/01/2011
RIP: 0010:iterate_chain_key kernel/locking/lockdep.c:418 [inline]
RIP: 0010:__lock_acquire+0xc50/0x54f0 kernel/locking/lockdep.c:4820
Code: 24 38 8b 4c 24 50 8b 5c 24 48 c1 e0 0d 66 0b 44 24 08 98 2b 44 24 58 33 44 24 60 89 c2 29 c1 01 d8 c1 c2 06 31 ca 89 d1 29 d3 <01> c2 c1 c1 08 31 d9 89 ce 29 c8 01 d1 c1 c6 10 89 cf 31 f0 89 c6
RSP: 0018:ffffc9000205ea58 EFLAGS: 00000806
RAX: 0000000090a4bd1b RBX: 0000000067d07111 RCX: 000000005e3ba6a4
RDX: 000000005e3ba6a4 RSI: 0000000000000008 RDI: ffffffff8f8e0f40
RBP: ffff888024da9bc0 R08: 0000000000000000 R09: ffffffff8f8668e7
R10: fffffbfff1f0cd1c R11: 0000000000000001 R12: ffff888024daa570
R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
FS:  00007f35cb8bd700(0000) GS:ffff8880b9c00000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00007f3e87359000 CR3: 000000005fd8d000 CR4: 00000000001506f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Call Trace:
 lock_acquire kernel/locking/lockdep.c:5442 [inline]
 lock_acquire+0x1a8/0x720 kernel/locking/lockdep.c:5407
 __mutex_lock_common kernel/locking/mutex.c:956 [inline]
 __mutex_lock+0x134/0x1110 kernel/locking/mutex.c:1103
 tcf_idr_check_alloc+0x78/0x3b0 net/sched/act_api.c:549
 tcf_police_init+0x34f/0x1460 net/sched/act_police.c:81
 tcf_action_init_1+0x1a3/0x990 net/sched/act_api.c:1013
 tcf_exts_validate+0x138/0x420 net/sched/cls_api.c:3046
 route4_set_parms net/sched/cls_route.c:393 [inline]
 route4_change+0x6d8/0x2380 net/sched/cls_route.c:512
 tc_new_tfilter+0x1394/0x2120 net/sched/cls_api.c:2127
 rtnetlink_rcv_msg+0x80e/0xad0 net/core/rtnetlink.c:5544
 netlink_rcv_skb+0x153/0x420 net/netlink/af_netlink.c:2494
 netlink_unicast_kernel net/netlink/af_netlink.c:1304 [inline]
 netlink_unicast+0x533/0x7d0 net/netlink/af_netlink.c:1330
 netlink_sendmsg+0x856/0xd90 net/netlink/af_netlink.c:1919
 sock_sendmsg_nosec net/socket.c:652 [inline]
 sock_sendmsg+0xcf/0x120 net/socket.c:672
 ____sys_sendmsg+0x331/0x810 net/socket.c:2345
 ___sys_sendmsg+0xf3/0x170 net/socket.c:2399
 __sys_sendmmsg+0x195/0x470 net/socket.c:2489
 __do_sys_sendmmsg net/socket.c:2518 [inline]
 __se_sys_sendmmsg net/socket.c:2515 [inline]
 __x64_sys_sendmmsg+0x99/0x100 net/socket.c:2515
 do_syscall_64+0x2d/0x70 arch/x86/entry/common.c:46
 entry_SYSCALL_64_after_hwframe+0x44/0xa9
RIP: 0033:0x465b09
Code: ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 bc ff ff ff f7 d8 64 89 01 48
RSP: 002b:00007f35cb8bd188 EFLAGS: 00000246 ORIG_RAX: 0000000000000133
RAX: ffffffffffffffda RBX: 000000000056c0b0 RCX: 0000000000465b09
RDX: 010efe10675dec16 RSI: 0000000020000200 RDI: 0000000000000003
RBP: 00000000004b069f R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000246 R12: 000000000056c0b0
R13: 00007ffef099ef5f R14: 00007f35cb8bd300 R15: 0000000000022000

Crashes (6):
Time Kernel Commit Syzkaller Config Log Report Syz repro C repro VM info Assets (help?) Manager Title
2021/02/09 22:48 net-old 6bbc088d6ebf 2bd9619f .config console log report info ci-upstream-net-this-kasan-gce INFO: task hung in rdma_dev_exit_net
2020/09/28 17:52 net-old 709a16be0593 6bfdbe89 .config console log report info ci-upstream-net-this-kasan-gce
2021/01/04 19:36 net-next-old 3db1a3fa9880 79264ae3 .config console log report info ci-upstream-net-kasan-gce
2020/12/27 02:01 net-next-old 3db1a3fa9880 821e0b09 .config console log report info ci-upstream-net-kasan-gce
2020/12/22 11:46 net-next-old 3db1a3fa9880 04201c06 .config console log report info ci-upstream-net-kasan-gce
2020/11/15 22:01 net-next-old 0064c5c1b3bf 1bf9a662 .config console log report info ci-upstream-net-kasan-gce
* Struck through repros no longer work on HEAD.