openEuler / kernel

[openEuler-1.0-LTS] dm_mpath test suite case srp/002 fails

Status: Completed
Type: Task
Created: 2022-08-11 22:01

Test log:

srp/002 (File I/O on top of multipath concurrently with logout and login (mq))

[ 15.265517] run blktests srp/002 at 2022-06-26 08:29:47

[ 15.700998] lo speed is unknown, defaulting to 1000

[ 15.702315] lo speed is unknown, defaulting to 1000

[ 15.703550] lo speed is unknown, defaulting to 1000

[ 15.711004] lo speed is unknown, defaulting to 1000

[ 15.760552] scsi_debug:scsi_debug_init: dif_storep 524288 bytes @ 00000000abb7527b

[ 15.765837] sd 3:0:0:0: Power-on or device reset occurred

[ 15.908308] lo speed is unknown, defaulting to 1000

[ 16.655328] sd 4:0:0:0: Warning! Received an indication that the LUN assignments on this target have changed. The Linux SCSI layer does not automatical

[ 16.658615] sd 4:0:0:1: Warning! Received an indication that the LUN assignments on this target have changed. The Linux SCSI layer does not automatical

[ 16.666159] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.

[ 16.667834] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.

[ 16.669108] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.

[ 16.718968] ib_srp: QP creation failed for dev rxe1: -22

[ 16.738059] ib_srp: QP creation failed for dev rxe1: -22

[ 22.071292] device-mapper: multipath: Failing path 8:32.

[ 27.135556] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.

[ 27.142852] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.

[ 27.151142] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.

[ 27.232340] ib_srp: QP creation failed for dev rxe1: -22

[ 27.256096] ib_srp: QP creation failed for dev rxe1: -22

[ 32.373096] srpt_recv_done: 502 callbacks suppressed

[ 34.349054] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.

[ 34.362998] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.

[ 34.367792] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.

[ 34.435274] ib_srp: QP creation failed for dev rxe1: -22

[ 34.453051] ib_srp: QP creation failed for dev rxe1: -22

[ 39.568037] srpt_recv_done: 502 callbacks suppressed

[ 41.540706] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.

[ 41.544740] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.

[ 41.549333] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.

[ 41.659859] ib_srp: QP creation failed for dev rxe1: -22

[ 41.671370] ib_srp: QP creation failed for dev rxe1: -22

[ 46.782981] srpt_recv_done: 502 callbacks suppressed

[ 47.748390] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.

[ 47.752500] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.

[ 47.756368] srpt/192.168.250.40: Unsupported SCSI Opcode 0xa3, sending CHECK_CONDITION.

[ 47.819097] ib_srp: QP creation failed for dev rxe1: -22

[ 47.830452] ib_srp: QP creation failed for dev rxe1: -22

[ 47.924356] ib_srp: QP creation failed for dev rxe1: -22

[ 47.941580] ib_srp: QP creation failed for dev rxe1: -22

[ 246.913304] INFO: task kworker/2:0:22 blocked for more than 120 seconds.

[ 246.915434] Not tainted 4.19.90-00008-g8fbfe2335d9 #36

[ 246.916312] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.

[ 246.917547] Call Trace:

[ 246.917972] ? __schedule+0x5c9/0xc60

[ 246.918550] schedule+0x5f/0x1c0

[ 246.919040] __rwsem_down_write_failed_common+0x2a2/0x840

[ 246.919878] ? generic_make_request+0xbb/0x690

[ 246.920565] ? rwsem_down_write_failed+0x17/0x20

[ 246.921279] rwsem_down_write_failed+0x17/0x20

[ 246.921938] call_rwsem_down_write_failed+0x13/0x20

[ 246.922677] down_write+0x39/0x50

[ 246.923190] __generic_file_fsync+0x57/0x130

[ 246.923854] ext4_sync_file+0x3cd/0x670

[ 246.924457] vfs_fsync_range+0x5c/0xb0

[ 246.925014] dio_complete+0x328/0x350

[ 246.925575] dio_aio_complete_work+0x1d/0x30

[ 246.926208] process_one_work+0x2cf/0x6f0

[ 246.926827] worker_thread+0x252/0x760

[ 246.927415] kthread+0x178/0x1e0

[ 246.927898] ? rescuer_thread+0x5c0/0x5c0

[ 246.928504] ? kthread_cancel_delayed_work_timer+0x70/0x70

[ 246.929354] ret_from_fork+0x35/0x40
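
For triage, the hung-task banner can be pulled out of a saved copy of a log like the one above with a short script. The following is a minimal sketch, assuming lines in the `[ seconds] message` format shown here; the function name `find_hung_tasks` and the `sample` text are illustrative and not part of blktests or the kernel.

```python
import re

# Matches the kernel hung-task banner, for example:
# "[ 246.913304] INFO: task kworker/2:0:22 blocked for more than 120 seconds."
HUNG_RE = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\]\s+INFO: task (?P<task>\S+) "
    r"blocked for more than (?P<secs>\d+) seconds"
)


def find_hung_tasks(log_text):
    """Return (timestamp, comm, pid, seconds) for each hung-task banner found."""
    hits = []
    for line in log_text.splitlines():
        m = HUNG_RE.search(line)
        if not m:
            continue
        # The banner prints "<comm>:<pid>"; comm itself may contain ':' (kworker/2:0),
        # so split on the last colon only.
        comm, pid = m.group("task").rsplit(":", 1)
        hits.append((float(m.group("ts")), comm, int(pid), int(m.group("secs"))))
    return hits


# Hypothetical excerpt assembled from lines of the log above.
sample = """\
[ 47.941580] ib_srp: QP creation failed for dev rxe1: -22
[ 246.913304] INFO: task kworker/2:0:22 blocked for more than 120 seconds.
[ 246.917547] Call Trace:
"""

print(find_hung_tasks(sample))
```

Run against the full log, this reports the single blocked worker `kworker/2:0` (PID 22), whose call trace follows the banner.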

Comments (3)

LuoMeng created the task

Hi Luo_meng_meng, welcome to the openEuler Community.
I'm the Bot here serving you. You can find the instructions on how to interact with me here.
If you have any questions, please contact the SIG: Kernel, and any of the maintainers: @YangYingliang , @成坚 (CHENG Jian) , @jiaoff , @zhengzengkai , @刘勇强 , @wangxiongfeng , @朱科潜 , @WangShaoBo , @lujialin , @wuxu_buque , @Xu Kuohai , @冷嘲啊 , @Lingmingqiang , @yuzenghui , @岳海兵 , @juntian , @OSSIM , @陈结松 , @whoisxxx , @koulihong , @刘恺 , @hanjun-guo , @woqidaideshi , @Chiqijun , @Kefeng , @ThunderTown , @AlexGuo , @kylin-mayukun , @Zheng Zucheng , @柳歆 , @Jackie Liu , @zhujianwei001 , @郑振鹏 , @SuperSix173 , @colyli , @Zhang Yi , @htforge , @Xie XiuQi

openeuler-ci-bot added the sig/Kernel label

According to https://syzkaller.appspot.com/bug?extid=831661966588c802aae9:

kworker/0:7/17139 is trying to acquire lock:
ffff888077a89938 ((wq_completion)loop1){+.+.}-{0:0}, at: flush_workqueue+0xe1/0x13a0 kernel/workqueue.c:2824

but task is already holding lock:
ffffc9000fa07db8 ((work_completion)(&lo->rundown_work)){+.+.}-{0:0}, at: process_one_work+0x8c4/0x1650 kernel/workqueue.c:2282

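According to the syzbot report, a deadlock arises between flush_workqueue and work_completion: a work item that already holds its work_completion lock tries to flush a workqueue. This matches the stack trace in the test log.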
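
The general shape of this kind of problem, a work item waiting on a queue that cannot make progress until the work item itself returns, can be illustrated in user space. The sketch below is only a loose analogy using Python's `concurrent.futures`; it is not kernel code and does not model the actual ib_srp or loop driver paths. The function name `demo_self_wait` is hypothetical, and a timeout is used so the demonstration ends instead of hanging.

```python
from concurrent.futures import ThreadPoolExecutor, TimeoutError


def demo_self_wait(timeout=0.5):
    """Return True if a job waiting on its own single-worker pool stalls."""
    with ThreadPoolExecutor(max_workers=1) as pool:

        def inner():
            return "done"

        def outer():
            # inner is queued behind outer on the pool's only worker,
            # so it cannot start until outer returns.
            fut = pool.submit(inner)
            try:
                fut.result(timeout=timeout)  # analogous to flushing from inside a work item
                return False
            except TimeoutError:
                # Without the timeout this wait would never finish.
                return True

        return pool.submit(outer).result()


print(demo_self_wait(0.2))
```

In the kernel there is no such timeout, so the waiting worker stays blocked until the hung-task detector reports it.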

The fix commit 99eb8d694174c777558dc902d575d1997d5ca650 ("RDMA/ib_srp: Fix a deadlock") has already been merged.

sanglipeng set the assignee to sanglipeng
sanglipeng changed the task status from To Do to Completed
sanglipeng added the issue_resolved label
