1 Star 0 Fork 0

triton-inference-server/server

加入 Gitee
与超过 1200万 开发者一起发现、参与优秀开源项目,私有仓库也完全免费 :)
免费加入
文件
克隆/下载
model_analyzer.md 2.37 KB
一键复制 编辑 原始数据 按行查看 历史

Model Analyzer

The Triton Model Analyzer is a tool that uses Performance Analyzer to send requests to your model while measuring GPU memory and compute utilization. The Model Analyzer is specifically useful for characterizing the GPU memory requirements for your model under different batching and model instance configurations. Once you have this GPU memory usage information you can more intelligently decide on how to combine multiple models on the same GPU while remaining within the memory capacity of the GPU.

For more information see the Model Analyzer repository and the detailed explanation provided in Maximizing Deep Learning Inference Performance with NVIDIA Model Analyzer.

Loading...
马建仓 AI 助手
尝试更多
代码解读
代码找茬
代码优化
1
https://gitee.com/triton-inference-server/server.git
git@gitee.com:triton-inference-server/server.git
triton-inference-server
server
server
main

搜索帮助