AITemplate
AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. It specializes in FP16 inference, targeting TensorCore on NVIDIA GPUs and MatrixCore on AMD GPUs, and is optimized for inference serving.
At a glance