7 Star 126 Fork 123

GVPAscend/DrivingSDK

 / 详情

PointPillar运行模型训练脚本报错HCCL function error: HcclCommInitRootInfo(numRanks, &rootInfo, rank, &(comm->hcclComm_)), error code is 7

DONE
Bug
Opened this issue  
2024-11-19 11:24

Comments (1)

Big_Fish112 created缺陷 6 months ago

问题已经解决。原因不是端口被占用。

而是代码中初始化分布式的代码写错了

原来的代码中
https://gitee.com/ascend/mxDriving/blob/branch_v6.0.0-RC3/model_examples/OpenPCDet/tools/train.py

输入图片说明

这一行很明显应该是hccl,修改后就可以解决此问题

刘哲续 changed issue state from TODO to WIP 5 months ago
刘哲续 changed issue state from WIP to DONE 5 months ago

Sign in to comment

Status
Assignees
Projects
Milestones
Pull Requests
Successfully merging a pull request will close this issue.
Branches
Priority
Duration (hours)
Planed to start   -   Planed to end
-
Top level
参与者(1)
Big_Fish112-bigfish2000
1
https://gitee.com/ascend/DrivingSDK.git
git@gitee.com:ascend/DrivingSDK.git
ascend
DrivingSDK
DrivingSDK

Search