428 Star 1.6K Fork 1.6K

GVPopenEuler/kernel

 / 详情

组bond,添加slaves设备时对其中slave设备执行pf复位,roce bond组建失败,环境异常

已完成
任务
创建于  
2023-10-12 16:10

In the concurrency process between setting bond and reset, when the
reset process is finished, the driver detects that bond resource has
already been allocated, thus entering the bond recover process, where
the bond state is set to HNS_ROCE_BOND_IS_BONDED. But at this point
the set bond process hasn't been executed yet(i.e. slaves haven't been
uninited). This wrong bond state leads to the abnormal reset result
that 2 slaves are both registered as bond device.

Thus delete the bond state setting in bond recover process. Besides,
to fix other potential concurrency errors between bond and reset,
some improvements are also added:

  1. For the situation that reset occurs before bond work, add a reset
    check at the beginning of bond work. If there is an ongoing reset
    process, re-queue the bond work until the reset is finished.

  2. For the situation that reset occurs during bond work, add reset
    checks to bond init/uninit process, treating this situation as an
    abnormal case.

评论 (2)

Hi stinft, welcome to the openEuler Community.
I'm the Bot here serving you. You can find the instructions on how to interact with me at Here.
If you have any questions, please contact the SIG: Kernel, and any of the maintainers.

zhouj 创建了任务 2年前
openeuler-ci-bot 添加了
 
sig/Kernel
标签
2年前
展开全部操作日志

登录 后才可以发表评论

状态
负责人
项目
Pull Requests
关联的 Pull Requests 被合并后可能会关闭此 issue
预计工期 (小时)
开始日期   -   截止日期
-
置顶选项
优先级
分支
参与者(2)
5329419 openeuler ci bot 1632792936 zhouj-stinft
C
1
https://gitee.com/openeuler/kernel.git
git@gitee.com:openeuler/kernel.git
openeuler
kernel
kernel

搜索帮助