xue-yuqing

@xue-yuqing

xue-yuqing 暂无简介

组织

所有 个人的 我参与的
Forks 暂停/关闭的

    qianggee/FaultInjection

    fault injection for pytorch by torch_distpatch

    xiangsen2/KJ600

    TensorProbe (code name: kj600) is a LLM pretrain debugger with model's torch module , optimizer status, collective communication tensor collection and aggregation. It also supports rule-based alerts.

    xue-yuqing/KJ600 forked from xiangsen2/KJ600

    TensorProbe (code name: kj600) is a LLM pretrain debugger with model's torch module , optimizer status, collective communication tensor collection and aggregation. It also supports rule-based alerts.

    xue-yuqing/image1

搜索帮助