This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic to track disabled tests and slow tests, as well as our continuous integration jobs HUD/dashboard.
A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.
Alerting system for the PyTorch organization.
This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic to track d...
Last updated: 3 days ago
A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.
Last updated: 3 days ago
Collective communications library with various primitives for multi-machine training.
Last updated: 3 days ago
Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)
Last updated: 3 days ago
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
Last updated: 3 days ago
A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to facilitate me...
Last updated: 3 days ago
A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.
Last updated: 3 days ago
HUD for CI activity on `pytorch/pytorch`, provides a top level view for jobs to easily discern regressions
Last updated: 3 days ago
Testing downstream libraries using pytorch release candidates
Last updated: 3 days ago