2.3K Star 8.1K Fork 4.3K

GVPMindSpore / mindspore

[ST][MS][CI] test case failed in gate with < test_multifieldembeddinglookup_p...

kind/ci
v2.0.0.rc1
attr/function
sig/parallel
#I63NEL zengfanyu 1
负责人: huangxinjing

The result of Pangu3.0 in MindSpore 2.1 B060 forward process with model paral...

kind/bug
v2.2.0
ctl/componenttest
rca/algorithm
rct/newfeature
foruda
#I7LIWZ 刘崇鸣 4
负责人: 6579380 liuchongming74 1593503138刘崇鸣

[CT][MS][reinforcement]test_reinforcement_mcts_001在gpu graph下执行core

kind/bug
attr/function
sig/parallel
v2.3.0
foruda
rct/bugfix
rca/others
ctl/componenttest
#I923OM 杨凯璐 4
负责人: 杨凯璐

[ST][MS][mindrlhf][llama2_reward_model][910B3 8P]评估指标达不到转测指标,avg acc:0.64<0.9

kind/bug
attr/accuracy
stage/prec-tuning
sig/parallel
device/ascend
v2.3.0
#I945KA zhangjie18 3
负责人: jiahongqian

[ST][MS][master][Bert_large][ascend][多机]网络训练失败

attr/function
kind/bug
stage/func-debug
v2.1.0
sig/parallel
rca/others
ctl/solutiontest
rct/bugfix
foruda
#I7BFIP sunjiawei999 4
负责人: sunjiawei999

[ST][MS][mindrlhf][llama2/gpt2][910b3]网络训练失败,ModuleNotFoundError: No module n...

kind/bug
attr/function
stage/func-debug
sig/mba
github
device/ascend
v2.3.0
#I92N9T zhangjie18 4
负责人: YijieChen

[ST][MS] llama2-175B加载编译缓存后报错

kind/bug
v2.2.14
sig/parallel
attr/function
rct/cann
rca/others
ctl/solutiontest
foruda
#I9EYXX duanjiali 6
负责人: xiaoyao

[ST][MS][mindrlhf][gpt2/llama2]当前已转测的mindrlhf网络,暂不支持msrun

kind/bug
attr/function
stage/func-debug
sig/mba
device/ascend
v2.3.0
#I92EED zhangjie18 3
负责人: YijieChen

HCCL ERROR

kind/bug
mindspore-assistant
#I4E3C8 harasuki 5
负责人: lichen

[ST][MS][NET][GPU 8p][wide&deep ps]网络全量训练失败,报错:Cuda create Event failed

kind/bug
v2.2.0
sig/modelzoo
stage/func-debug
attr/function
#I8CUWV 魏鑫 3
负责人: 魏鑫

[ST][MS][NET][pangu-alpha][910A 32p]FPS[289] can not reach 400

sig/parallel
attr/function
kind/bug
v2.3.0
v2.3.0.alpha
stage/func-debug
master
#I9BIPS zhongjicheng 5
负责人: 6574048 hulktang 1584443870tanghuikang

【ST】【ms】【2.2】910A环境,开启ge流程,pynative模式,异常dump设置为1,Resnet50网络训练失败

kind/bug
sig/parallel
attr/function
stage/func-debug
dts-szv
v2.2.13
rct/cann
v2.2.14
#I96RGH wenli 3
负责人: 6574048 hulktang 1584443870tanghuikang

[ST][MS][NET][pangu c3][910B3 8p]Accuracy[50.9%] can not reach 53.56%

device/ascend
foruda
sig/parallel
stage/func-debug
kind/bug
attr/function
ctl/solutiontest
v2.3.0
v2.3.0.alpha
rct/cann
#I9BHQI zhongjicheng 5
负责人: 6574048 hulktang 1584443870tanghuikang

[ST][MS][大集群专项]mixtral网络在910B上单卡模拟编译报错

kind/bug
attr/function
stage/coding
master
sig/parallel
#I9Q26Q baimz 2
负责人: zhouyaqiang0

[ST][MS][分布式并行][pangu-alpha 2.6B][910B3 8p]dp=1,mp=2,pp=4,将中间某一个stage的参数全部冻结,...

sig/parallel
device/ascend
attr/function
stage/func-debug
kind/bug
v2.3.0
#I9DYN8 zhongjicheng 2
负责人: lichen

[ST][MS][2.2][910B][wide_deep&ps模型]网络训练失败,EmbeddingLookup算子报错

kind/bug
v2.2.0
attr/function
stage/func-debug
sig/modelzoo
rct/cann
v2.2.10
foruda
#I842I5 zhangjie18 9
负责人: zhongjicheng
Python
1
https://gitee.com/mindspore/mindspore.git
git@gitee.com:mindspore/mindspore.git
mindspore
mindspore
mindspore

搜索帮助