[Jilin University] After the ModelArts version upgrade, running the Bilm model fails because the NPU cannot be found

DONE
Bug-Report
Created on 2021-09-13 23:42

1. Symptom (with the error log context):

INFO:root:CUDA_VISIBLE_DEVICES is not used in NPU device.
Traceback (most recent call last):
  File "/home/ma-user/miniconda3/envs/TensorFlow-1.15-arm/lib/python3.7/site-packages/moxing/tensorflow/practice/nas/eval.py", line 20, in <module>
    from moxing.tensorflow.nas.eval import eval_nas
  File "/home/ma-user/anaconda/lib/python3.7/site-packages/moxing/tensorflow/__init__.py", line 48, in <module>
    from moxing.tensorflow.utils.tf_util import print_version_info as _print_version_info
  File "/home/ma-user/anaconda/lib/python3.7/site-packages/moxing/tensorflow/utils/__init__.py", line 22, in <module>
    from moxing.tensorflow.utils.cache_helper import cache
  File "/home/ma-user/anaconda/lib/python3.7/site-packages/moxing/tensorflow/utils/cache_helper.py", line 53, in <module>
    _MOX_TF_CACHE = cloud_utils.get_cache_dir()
AttributeError: module 'moxing.framework.cloud_utils' has no attribute 'get_cache_dir'
===>>>Training finished:
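
The traceback above mixes two install prefixes: `eval.py` is loaded from `/home/ma-user/miniconda3/envs/TensorFlow-1.15-arm/...`, while the remaining `moxing` modules come from `/home/ma-user/anaconda/...`. A minimal diagnostic sketch, using only the Python standard library, for checking where a module is actually loaded from and whether it exposes a given attribute. The helper name `describe_attribute` is hypothetical and not part of moxing; the call at the end simply reuses the names from the traceback.

```python
import importlib


def describe_attribute(module_name, attr_name):
    """Return (origin, has_attr) for a module, or (None, False) if it cannot be imported."""
    try:
        module = importlib.import_module(module_name)
    except Exception:
        # Importing can fail for many reasons (missing package, errors raised
        # during import, as in the traceback), so treat any failure the same way.
        return None, False
    origin = getattr(module, "__file__", None)  # path the module was loaded from
    return origin, hasattr(module, attr_name)


if __name__ == "__main__":
    # Names taken from the traceback; on a machine without moxing
    # this prints (None, False).
    print(describe_attribute("moxing.framework.cloud_utils", "get_cache_dir"))
```

If the printed origin points at a different environment than the one the job is supposed to use, that would suggest a path or environment mix-up rather than a problem in the model code.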

4. Log information:
The log is at:
https://sstfine-data-bucker.obs.cn-north-4.myhuaweicloud.com:443/logs/MA-new-AleatoricSent-master-npu-09-13-16-10/log/modelarts-job-e2784e0f-cfc9-4aa0-8474-c4e9c8bf7fc7-proc-rank-0-device-0.txt?AccessKeyId=VV2WJNJQJYMORFHNWZBY&Expires=1634139612&Signature=dIwG3eq60rSW8ORPgSBFPg7cyHQ%3D
The code is at:
URL:
https://e-share.obs-website.cn-north-1.myhuaweicloud.com?token=hJmts6ST6ZbzkxbMbQ6RHQq+wXXD1IkDj5SzwBPhTXPuMk6MuKIU7Od64pcaZrLPbviPX1V6vTOi55K9bQnRtFCEN/TnAHwF9quWgNnrcvRRhxCq1bbJxG3L/aePvghWD6UARQAieNGKqpyA+zviWJLVia8qCR47E8CbXylxwQXWIf9Xj5XpgHK7o2KKK5faYTFKP7EiIlR0mk/MmXi6CdLly2h0WVIoFlIiMXPRIsCZWmHfKEkwn5MCxULR1Y3Bfx7IxLS825ybNGIqO1gX6AFS/uIr7i1zwwRLX60dGoWVQYQGtenG6DE3Tvy1WSSVSsTvc4WmB7nY4JwcbA2Tomg5pGtm6IurAV9C04DQyag1Mc8XgXgRUXKRXgCiwIhnPwv+CU+aUWcp8RfsUio6wpNFkhQktXPSUQ/yKR/W7sGTUhcUmyOQKrl2vw/JG1erV+B989KnaLnjFKquD/ApDrXdA55KpDz9Or7LE+iIaYeK+GoLv5uiAxgH/F2i5rup+W7JAV+7udhuG/gyXYN3GYeBEK5U9QaUf17MkyUwQdXBfAsaCFbGONZU1ymQwW7EF52Dn15AFcfNbM/5MivHze2cmopA/PYiuHGswLk4aNdgx/Ff+8ifN6HlHCyQFigLkmPFf9fwmrXwPxz5s8F9aoHfSyTWZYNdbuBUBw9MnaLCZAVgt5ZjQJ0/DNp3J5XDWdZubhTzctiAiyyzhHP8baasxfzVjnKqKwEfDrAS72ImTYfW/uKeeQv6hVmaQPewv6Eq2vKFPUKtVLd4rVeNZQ==

Extraction code:
123456

*Valid until: 2021/10/13 23:43:07 GMT+08:00

Comments (1)

YongZheng created the Bug-Report
YongZheng set the linked repository to Ascend/modelzoo
YongZheng edited the description
zhujianpeng set the assignee to 张晓龙
zhujianpeng changed the task status from TODO to Analysing

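This eval file belongs to ModelArts; it is not your training entry file. Please set up the ModelArts training path by some other means and then try running again.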
/home/ma-user/miniconda3/envs/TensorFlow-1.15-arm/lib/python3.7/site-packages/moxing/tensorflow/practice/nas/eval.py
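
One way to check the point made in this reply is to look at the outermost frame of the traceback and see whether it lives under `site-packages`. The following is a rough sketch using only the standard library; the function names are hypothetical, and the sample string is just the first frame copied from the log in the issue description.

```python
import re
from pathlib import PurePosixPath

# Matches the 'File "<path>", line <n>' part of each traceback frame.
FRAME_RE = re.compile(r'File "([^"]+)", line \d+')


def frame_paths(traceback_text):
    """Extract the file paths of all frames from a Python traceback string."""
    return [m.group(1) for m in FRAME_RE.finditer(traceback_text)]


def is_installed_package_file(path):
    """True if the path lies under a site-packages directory."""
    return "site-packages" in PurePosixPath(path).parts


def entry_file_is_library(traceback_text):
    """True if the outermost frame (the script being run) is library code."""
    paths = frame_paths(traceback_text)
    return bool(paths) and is_installed_package_file(paths[0])


if __name__ == "__main__":
    sample = (
        "Traceback (most recent call last):\n"
        '  File "/home/ma-user/miniconda3/envs/TensorFlow-1.15-arm/lib/python3.7/'
        'site-packages/moxing/tensorflow/practice/nas/eval.py", line 20, in <module>\n'
    )
    print(entry_file_is_library(sample))
```

When the outermost frame is library code like this, the job was started with a packaged script rather than the user's own training entry file, which matches the diagnosis above.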

YongZheng changed the task status from Analysing to DONE
吴定远 changed the linked repository from Ascend/modelzoo-his to Ascend/modelzoo
