diff --git "a/AscendPytorch\346\250\241\345\236\213\344\274\227\346\231\272FAQ.md" "b/AscendPytorch\346\250\241\345\236\213\344\274\227\346\231\272FAQ.md"
index 10d3ec407998cd21b724628f14e78a5764524222..960052b00e6a851cb5be8651e4f8c8d1c9eeb6a8 100644
--- "a/AscendPytorch\346\250\241\345\236\213\344\274\227\346\231\272FAQ.md"
+++ "b/AscendPytorch\346\250\241\345\236\213\344\274\227\346\231\272FAQ.md"
@@ -366,8 +366,8 @@ StopIteration
 
 ### FAQ18. A load state_dict error occurs when loading weights.
 
-* Symptom
-![](https://gitee.com/wangjiangben_hw/ascend-pytorch-crowdintelligence-doc/raw/master/figures/model_faq18_01.png)
+* Symptomwangjiangben_hw
+![](https://gitee.com//ascend-pytorch-crowdintelligence-doc/raw/master/figures/model_faq18_01.png)
 ![](https://gitee.com/wangjiangben_hw/ascend-pytorch-crowdintelligence-doc/raw/master/figures/model_faq18_02.png)
 
 * Cause analysis
@@ -403,6 +403,22 @@ StopIteration
 
     The dataset cannot be found. Check that the dataset path is correct and that the dataset is valid.
 
+### FAQ22. The broadcast_to_d operator reports an error on the NPU while the model is running.
+
+* Symptom
+![](https://gitee.com/wangjiangben_hw/ascend-pytorch-crowdintelligence-doc/raw/master/figures/model_faq22_01.png)
+
+
+* Cause analysis
+
+    Printing the inputs shows that the data passed to the broadcast_to_d operator is of type int64. In the directory /usr/local/Ascend/ascend-toolkit/20.2.rc1/x86_64-linux/opp/op_impl/built-in/ai_core/tbe/config/ascend910 (the directory differs between environments), the operator information library aic-ascend910-ops-info.json shows that the operator does not support int64 input, which causes the error on the NPU.
+![](https://gitee.com/wangjiangben_hw/ascend-pytorch-crowdintelligence-doc/raw/master/figures/model_faq22_02.png)
+
+* Solution
+
+    Before calling the broadcast_to_d operator, cast the input to int32, a data type the operator supports. This resolves the problem.
+
+
 ## [2.2 FAQ for distributed NPU model runs](#22-NPU模型分布式运行常见问题FAQ)
 
 ### FAQ1. During distributed model training, the error "host not found" is reported.
diff --git a/figures/model_faq22_01.png b/figures/model_faq22_01.png
new file mode 100644
index 0000000000000000000000000000000000000000..9ebf23b2085e467f1195c2957f43242d57a814f4
Binary files /dev/null and b/figures/model_faq22_01.png differ
diff --git a/figures/model_faq22_02.png b/figures/model_faq22_02.png
new file mode 100644
index 0000000000000000000000000000000000000000..23d7817638315b1b7d603781dc148adb19f19e72
Binary files /dev/null and b/figures/model_faq22_02.png differ
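
The FAQ22 entry added in this diff finds the root cause by looking up the operator in aic-ascend910-ops-info.json and checking which input data types it lists. That lookup can be sketched roughly as follows. This is a minimal illustration only: the embedded JSON excerpt, the key names (`BroadcastToD`, `input0`, `dtype`), and the listed dtypes are assumptions made for this example, not the confirmed contents of the real file, and the helper names `supported_dtypes` and `needs_cast` are hypothetical.

```python
import json

# Hypothetical excerpt standing in for aic-ascend910-ops-info.json.
# ASSUMPTION: the real file may use different keys and list other dtypes.
SAMPLE_OPS_INFO = """
{
  "BroadcastToD": {
    "input0": {"name": "x", "dtype": "float16,float,int32"}
  }
}
"""


def supported_dtypes(ops_info, op_name, input_key="input0"):
    """Return the set of dtypes listed for one input of an operator.

    An operator or input that is not present yields an empty set.
    """
    entry = ops_info.get(op_name, {}).get(input_key, {})
    return {d.strip() for d in entry.get("dtype", "").split(",") if d.strip()}


def needs_cast(ops_info, op_name, dtype):
    """True when `dtype` is not listed as supported for the operator input."""
    return dtype not in supported_dtypes(ops_info, op_name)


ops_info = json.loads(SAMPLE_OPS_INFO)
print(needs_cast(ops_info, "BroadcastToD", "int64"))  # int64 is not listed in the sample
print(needs_cast(ops_info, "BroadcastToD", "int32"))  # int32 is listed in the sample
```

When the check reports that a cast is needed, the fix described in FAQ22 is to convert the input before the operator is reached, which in PyTorch is typically written as `tensor.to(torch.int32)`. Values outside the int32 range would not survive that conversion, so it is worth confirming that the data fits first.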