FlashInfer: Kernel Library for LLM Serving
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.
Virtual Python Environment builder
Build and publish crates with pyo3, cffi and uniffi bindings, as well as Rust binaries, as Python packages