Test log:
srp/002 (File I/O on top of multipath concurrently with logout and login (mq))
[ 15.265517] run blktests srp/002 at 2022-06-26 08:29:47
[ 15.700998] lo speed is unknown, defaulting to 1000
[ 15.702315] lo speed is unknown, defaulting to 1000
[ 15.703550] lo speed is unknown, defaulting to 1000
[ 15.711004] lo speed is unknown, defaulting to 1000
[ 15.760552] scsi_debug:scsi_debug_init: dif_storep 524288 bytes @ 00000000abb7527b
[ 15.765837] sd 3:0:0:0: Power-on or device reset occurred
[ 15.908308] lo speed is unknown, defaulting to 1000
[ 16.655328] sd 4:0:0:0: Warning! Received an indication that the LUN assignments on this target have changed. The Linux SCSI layer does not automatical
[ 16.658615] sd 4:0:0:1: Warning! Received an indication that the LUN assignments on this target have changed. The Linux SCSI layer does not automatical
[ 16.666159] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.
[ 16.667834] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.
[ 16.669108] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.
[ 16.718968] ib_srp: QP creation failed for dev rxe1: -22
[ 16.738059] ib_srp: QP creation failed for dev rxe1: -22
[ 22.071292] device-mapper: multipath: Failing path 8:32.
[ 27.135556] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.
[ 27.142852] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.
[ 27.151142] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.
[ 27.232340] ib_srp: QP creation failed for dev rxe1: -22
[ 27.256096] ib_srp: QP creation failed for dev rxe1: -22
[ 32.373096] srpt_recv_done: 502 callbacks suppressed
[ 34.349054] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.
[ 34.362998] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.
[ 34.367792] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.
[ 34.435274] ib_srp: QP creation failed for dev rxe1: -22
[ 34.453051] ib_srp: QP creation failed for dev rxe1: -22
[ 39.568037] srpt_recv_done: 502 callbacks suppressed
[ 41.540706] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.
[ 41.544740] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.
[ 41.549333] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.
[ 41.659859] ib_srp: QP creation failed for dev rxe1: -22
[ 41.671370] ib_srp: QP creation failed for dev rxe1: -22
[ 46.782981] srpt_recv_done: 502 callbacks suppressed
[ 47.748390] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.
[ 47.752500] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.
[ 47.756368] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.
[ 47.819097] ib_srp: QP creation failed for dev rxe1: -22
[ 47.830452] ib_srp: QP creation failed for dev rxe1: -22
[ 47.924356] ib_srp: QP creation failed for dev rxe1: -22
[ 47.941580] ib_srp: QP creation failed for dev rxe1: -22
[ 246.913304] INFO: task kworker/2:0:22 blocked for more than 120 seconds.
[ 246.915434] Not tainted 4.19.90-00008-g8fbfe2335d9 #36
[ 246.916312] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 246.917547] Call Trace:
[ 246.917972] ? __schedule+0x5c9/0xc60
[ 246.918550] schedule+0x5f/0x1c0
[ 246.919040] __rwsem_down_write_failed_common+0x2a2/0x840
[ 246.919878] ? generic_make_request+0xbb/0x690
[ 246.920565] ? rwsem_down_write_failed+0x17/0x20
[ 246.921279] rwsem_down_write_failed+0x17/0x20
[ 246.921938] call_rwsem_down_write_failed+0x13/0x20
[ 246.922677] down_write+0x39/0x50
[ 246.923190] __generic_file_fsync+0x57/0x130
[ 246.923854] ext4_sync_file+0x3cd/0x670
[ 246.924457] vfs_fsync_range+0x5c/0xb0
[ 246.925014] dio_complete+0x328/0x350
[ 246.925575] dio_aio_complete_work+0x1d/0x30
[ 246.926208] process_one_work+0x2cf/0x6f0
[ 246.926827] worker_thread+0x252/0x760
[ 246.927415] kthread+0x178/0x1e0
[ 246.927898] ? rescuer_thread+0x5c0/0x5c0
[ 246.928504] ? kthread_cancel_delayed_work_timer+0x70/0x70
[ 246.929354] ret_from_fork+0x35/0x40
According to https://syzkaller.appspot.com/bug?extid=831661966588c802aae9:
kworker/0:7/17139 is trying to acquire lock:
ffff888077a89938 ((wq_completion)loop1){+.+.}-{0:0}, at: flush_workqueue+0xe1/0x13a0 kernel/workqueue.c:2824
but task is already holding lock:
ffffc9000fa07db8 ((work_completion)(&lo->rundown_work)){+.+.}-{0:0}, at: process_one_work+0x8c4/0x1650 kernel/workqueue.c:2282
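According to the syzbot report, the deadlock involves flush_workqueue and work_completion: a work item that is still being processed calls flush_workqueue. This is consistent with the stack trace above.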
The fix commit 99eb8d694174c777558dc902d575d1997d5ca650 ("RDMA/ib_srp: Fix a deadlock") has already been merged.
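To illustrate the kind of lock-order cycle the report describes, here is a minimal Python sketch of a toy dependency graph. It is not the kernel's lockdep implementation. The `LockGraph` class and its methods are hypothetical, the node names are simplified from the lock classes in the report, and the first recorded edge (flushing a workqueue waits for its work items) is an assumption about a dependency recorded earlier; the excerpt above only shows the second edge.

```python
from collections import defaultdict


class LockGraph:
    """Toy lock-order graph: an edge A -> B means 'B was waited on while A was held'."""

    def __init__(self):
        self.edges = defaultdict(set)

    def acquire(self, held, wanted):
        """Record held -> wanted; return True if this dependency closes a cycle."""
        closes_cycle = self._reachable(wanted, held)
        self.edges[held].add(wanted)
        return closes_cycle

    def _reachable(self, src, dst):
        """Depth-first search: is dst reachable from src along recorded edges?"""
        stack, seen = [src], set()
        while stack:
            node = stack.pop()
            if node == dst:
                return True
            if node in seen:
                continue
            seen.add(node)
            stack.extend(self.edges[node])
        return False


graph = LockGraph()

# Assumed earlier dependency: flushing the workqueue waits for its work items.
first = graph.acquire("wq_completion", "work_completion")

# Dependency from the report: a running work item flushes a workqueue.
second = graph.acquire("work_completion", "wq_completion")

print(first, second)
```

In this sketch the second call reports a cycle because the work item ends up waiting on a flush that in turn waits for the work item itself.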