@shaofeng09999
shaofeng09999 has not provided an introduction.
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.
Bazel build rules for using Boost in Bazel projects
Rules for building and handling Docker images with Bazel