A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind.
torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters in a single C++ process.
A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.
最近更新: 7天前Collective communications library with various primitives for multi-machine training.
最近更新: 7天前This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic to track d...
最近更新: 7天前A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to facilitate me...
最近更新: 7天前A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.
最近更新: 7天前HUD for CI activity on `pytorch/pytorch`, provides a top level view for jobs to easily discern regressions
最近更新: 7天前