prima.cpp
C++
forked from Zonghang Li/prima.cpp

prima.cpp: Fast 30-70B LLM Inference on Heterogeneous and Low-Resource Home Clusters

Last updated: 28 days ago
