A vLLM out-of-tree platform plugin that enables running vLLM on NPU (Ascend/torch_npu).
Last updated: 37 minutes ago

Omni_Infer is a suite of inference accelerators designed for the Ascend NPU platform, offering native support and an expanding feature set.
Last updated: 38 minutes ago

A vLLM out-of-tree platform plugin that enables running vLLM on NPU (Ascend/torch_npu).
Last updated: 6 days ago

A vLLM out-of-tree platform plugin that enables running vLLM on NPU (Ascend/torch_npu).
Last updated: 8 days ago

A vLLM out-of-tree platform plugin that enables running vLLM on NPU (Ascend/torch_npu).
Last updated: 19 days ago

A vLLM (0.12.0) out-of-tree platform plugin that enables running vLLM on NPU (Ascend/torch_npu).
Last updated: 1 month ago

Omni_Infer is a suite of inference accelerators designed for the Ascend NPU platform, offering native support and an expanding feature set.
Last updated: 1 month ago

A vLLM (0.12.0) out-of-tree platform plugin that enables running vLLM on NPU (Ascend/torch_npu).
Last updated: 2 months ago

A vLLM (0.12.0) out-of-tree platform plugin that enables running vLLM on NPU (Ascend/torch_npu).
Last updated: 2 months ago

Omni_Infer is a suite of inference accelerators designed for the Ascend NPU platform, offering native support and an expanding feature set.
Last updated: 4 months ago