xue-yuqing

@xue-yuqing

xue-yuqing has no bio yet

Organizations

Watched repositories (4)

    qianggee/FaultInjection

    Fault injection for PyTorch via torch_dispatch

    Last updated: over 1 year ago

    xiangsen2/KJ600

    TensorProbe (code name: kj600) is an LLM pretraining debugger that collects and aggregates tensors from the model's torch modules, optimizer status, and collective communication. It also supports rule-based alerts.

    Last updated: almost 2 years ago

    xue-yuqing/KJ600 (forked from xiangsen2/KJ600)

    TensorProbe (code name: kj600) is an LLM pretraining debugger that collects and aggregates tensors from the model's torch modules, optimizer status, and collective communication. It also supports rule-based alerts.

    Last updated: almost 2 years ago

    xue-yuqing/image1

    Last updated: over 4 years ago
