2.4K Star 8.2K Fork 4.4K

GVPMindSpore / mindspore

 / 详情

[ST][MS][ascend]Ascend环境下resnet50/deeplabv3、deeptext、inceptionv3等网络执行训练功能异常,日志提示报错runtime_instance为空

DONE
Bug-Report
创建于  
2023-04-10 10:28
name about labels
Bug Report Use this template for reporting a bug kind/bug

Describe the current behavior / 问题描述 (Mandatory / 必填)

Ascend环境下用例批量报错runtime_instance为空

Environment / 环境信息 (Mandatory / 必填)

  • Hardware Environment(Ascend/GPU/CPU) / 硬件环境:

Please delete the backend not involved / 请删除不涉及的后端:
/device ascend

  • Software Environment / 软件环境 (Mandatory / 必填):
    -- MindSpore version (e.g., 1.7.0.Bxxx) :commit_id = '[sha1]:36f1cbe5,[branch]:(HEAD,origin/r2.0,r2.0)'
    -- Python version (e.g., Python 3.7.5) :
    -- OS platform and distribution (e.g., Linux Ubuntu 16.04):
    -- GCC/Compiler version (if compiled from source):

  • Excute Mode / 执行模式 (Mandatory / 必填)(PyNative/Graph):

Please delete the mode not involved / 请删除不涉及的模式:
/mode pynative
/mode graph

Related testcase / 关联用例 (Mandatory / 必填)

test_ms_alexnet_cifar10_load_ckpt_export_mindir_to_310machine.py

Steps to reproduce the issue / 重现步骤 (Mandatory / 必填)

1、 安装mindspore

2、 设置相应的场景参数

3、 运行用例

cd ~ solution_test/

nosetests -s remaining/test_scripts/mindspore/features/exportmodel/mindir_load_export_and_infer/test_ms_alexnet_cifar10_load_ckpt_export_mindir_to_310machine.py

Describe the expected behavior / 预期结果 (Mandatory / 必填)

用例执行功能正常,无报错

Related log / screenshot / 日志 / 截图 (Mandatory / 必填)

输入图片说明

Special notes for this issue/备注 (Optional / 选填)

开发责任人:胡彬

评论 (9)

陶青 创建了Bug-Report
陶青 添加了
 
kind/bug
标签
陶青 添加了
 
attr/function
标签
陶青 添加了
 
stage/func-debug
标签
陶青 添加了
 
v2.0.0.rc1
标签
陶青 添加了
 
sig/ascend
标签
展开全部操作日志

Please assign maintainer to check this issue.
请为此issue分配处理人。
@陶青

Please add labels (comp or sig), also you can visit https://gitee.com/mindspore/community/blob/master/sigs/dx/docs/labels.md to find more.
为了让代码尽快被审核,请您为Pull Request打上 组件(comp)或兴趣组(sig) 标签,打上标签的PR可直接推送给责任人进行审核。
更多的标签可以查看https://gitee.com/mindspore/community/blob/master/sigs/dx/docs/labels.md
以组件相关代码提交为例,如果你提交的是data组件代码,你可以这样评论:
//comp/data
当然你也可以邀请data SIG组来审核代码,可以这样写:
//sig/data
另外你还可以给这个PR标记类型,例如是bugfix或者是特性需求:
//kind/bug or //kind/feature
恭喜你,你已经学会了使用命令来打标签,接下来就在下面的评论里打上标签吧!

陶青 修改了描述
陶青 优先级主要 修改为严重

网络测试中多个网络均报错该问题

陶青 修改了描述

deeplabv3、deeptext、inceptionv3、resnext50、vgg16、shufflenetv1、ctpn网络都有这个报错

crnn,unet3d,fasterrcnn,facerecognition,maskrcnn,textcnn,yolov4,gru,unet-plus,mobilenetv2等网络,共计几十个用例均有此问题

mudongrui 添加协作者mudongrui
mudongrui 负责人mudongrui 修改为胡彬

通过二分代码,2.0分支上f1792523a23无此错误,47cd61b5c80引入此错误

胡彬 里程碑B-SIG-ASCEND 修改为B-SIG-Data
胡彬 添加协作者胡彬
胡彬 负责人胡彬 修改为shenwei41
胡彬 负责人shenwei41 修改为caifubi
胡彬 添加协作者shenwei41
胡彬 里程碑B-SIG-Data 修改为B-SIG-PYNATIVE

#Appearance & Root Cause
个别场景下(例如 export mindir),不需要开启Ascend设备,runtime_instance没有创建,此时调用Bind context报错。
#Fix Solution
判断runtime_instance为空,不需要Bind context 和 SyncStream。

caifubi 添加了
 
rct/bugfix
标签
caifubi 添加了
 
rca/codelogic
标签
caifubi 添加了
 
ctl/componenttest
标签
caifubi 任务状态TODO 修改为VALIDATION
caifubi 添加协作者caifubi
caifubi 负责人caifubi 修改为陶青
caifubi 里程碑B-SIG-PYNATIVE 修改为B-SolutionTest

基于commit id 550ff02762d9de172bfec5f99ad6fdffe18a35e9 自验:
执行:nosetests -s remaining/test_scripts/mindspore/features/exportmodel/mindir_load_export_and_infer/test_ms_alexnet_cifar10_load_ckpt_export_mindir_to_310machine.py

[图片上传中…(image-TN2f8FSXTPgYgYVLgIXh)]

(ci) [jenkins0@10-155-112-38 solution_test]$ nosetests -s remaining/test_scripts/mindspore/features/exportmodel/mindir_load                                                                                                    _export_and_infer/test_ms_alexnet_cifar10_load_ckpt_export_mindir_to_310machine.py
/home/miniconda3/envs/ci/lib/python3.7/site-packages/nose/plugins/manager.py:395: RuntimeWarning: Unable to load plugin bea                                                                                                    m_test_plugin = test_config:BeamTestPlugin: (typing-extensions 4.3.0 (/data/miniconda3/envs/ci/lib/python3.7/site-packages)                                                                                                    , Requirement.parse('typing-extensions<3.8.0,>=3.7.0'))
  RuntimeWarning)
INFO 2023-04-11 09:30:59 - root - base.py:__init__:29 - This is Ascend910 environment.
INFO 2023-04-11 09:30:59 - root - base.py:print_hostname:221 - Host name is [10-155-112-38], Using device0
INFO 2023-04-11 09:30:59 - root - case_base.py:__init__:39 - env_dev is 0: 0代表A+X,1代表GPU(V100-PCIE),2代表CPU,3代表A+                                                                                                    K,4代表GPU(V100-SXM2)
INFO 2023-04-11 09:30:59 - root - case_base.py:__init__:93 - RuntimeMode is GRAPH_MODE
INFO 2023-04-11 09:30:59 - root - test_ms_alexnet_cifar10_load_ckpt_export_mindir_to_310machine.py:setup:38 - setup cases
INFO 2023-04-11 09:30:59 - root - test_ms_alexnet_cifar10_load_ckpt_export_mindir_to_310machine.py:testrun:45 - test run
INFO 2023-04-11 09:30:59 - root - base.py:get_status_cmd:256 - Exec [cp -r /home/jenkins/workspace/TDT_deployment/models/of                                                                                                    ficial/cv/Alexnet /data2/jenkins0/solution_test/remaining/test_scripts/mindspore/features/exportmodel/mindir_load_export_an                                                                                                    d_infer/test_ms_alexnet_cifar10_load_ckpt_export_mindir_to_310machine] success
INFO 2023-04-11 09:30:59 - root - base.py:get_status_cmd:256 - Exec [cp -r /home/jenkins/workspace/TDT_deployment/models/ut                                                                                                    ils/cpp_infer/example/common_inc/ /data2/jenkins0/solution_test/remaining/test_scripts/mindspore/features/exportmodel/mindi                                                                                                    r_load_export_and_infer/test_ms_alexnet_cifar10_load_ckpt_export_mindir_to_310machine/cpp_infer] success
INFO 2023-04-11 09:31:07 - root - base.py:exec_shell_cmd:239 - Exec [cd /data2/jenkins0/solution_test/remaining/test_script                                                                                                    s/mindspore/features/exportmodel/mindir_load_export_and_infer/test_ms_alexnet_cifar10_load_ckpt_export_mindir_to_310machine                                                                                                    ;python export.py --dataset_name=cifar10 --device_target=Ascend --ckpt_file=/home/workspace/mindspore_ckpt/alexnet/checkpoi                                                                                                    nt_alexnet-30_1562.ckpt --file_name=alexnet_cifar10 --file_format=MINDIR 2>&1|tee export.log] success
INFO 2023-04-11 09:31:07 - root - test_ms_alexnet_cifar10_load_ckpt_export_mindir_to_310machine.py:testrun:70 - Export MIND                                                                                                    IR[/data2/jenkins0/solution_test/remaining/test_scripts/mindspore/features/exportmodel/mindir_load_export_and_infer/test_ms                                                                                                    _alexnet_cifar10_load_ckpt_export_mindir_to_310machine/alexnet_cifar10.mindir] success.
INFO 2023-04-11 09:31:07 - root - test_ms_alexnet_cifar10_load_ckpt_export_mindir_to_310machine.py:teardown:80 - case teard                                                                                                    own
.
----------------------------------------------------------------------
Ran 1 test in 8.191s

OK
陶青 修改了标题
陶青 修改了标题
陶青 关联分支r2.0 修改为未关联

Regression Time :2023.4.12

Regression Version :

commit_id = '[sha1]:353066e3,[branch]:(HEAD,origin/r2.0,r2.0)'

1、 安装mindspore

2、 设置相应的场景参数

3、 运行用例

cd ~ solution_test/

nosetests -s remaining/test_scripts/mindspore/features/exportmodel/mindir_load_export_and_infer/test_ms_alexnet_cifar10_load_ckpt_export_mindir_to_310machine.py

输入图片说明

Regression Conclusion :PASS

陶青 任务状态VALIDATION 修改为DONE

登录 后才可以发表评论

状态
负责人
项目
里程碑
Pull Requests
关联的 Pull Requests 被合并后可能会关闭此 issue
分支
开始日期   -   截止日期
-
置顶选项
优先级
预计工期 (小时)
参与者(8)
6574868 jojohw 1584546516
Python
1
https://gitee.com/mindspore/mindspore.git
git@gitee.com:mindspore/mindspore.git
mindspore
mindspore
mindspore

搜索帮助

344bd9b3 5694891 D2dac590 5694891