FasterTransformer
Visit ToolFasterTransformer provides optimized scripts for running transformer-based encoders and decoders. It focuses on accelerating transformer models like BERT and GPT for improved performance.
At a glance
Trending
FasterTransformer provides optimized scripts for running transformer-based encoders and decoders. It focuses on accelerating transformer models like BERT and GPT for improved performance.
Trending
About
FasterTransformer is a repository designed to provide optimized scripts for running transformer-based encoders and decoders. Its primary focus is on transformer-related optimization, specifically targeting models such as BERT and GPT. While the NVIDIA/FasterTransformer repository remains available, development has officially transitioned to TensorRT-LLM, meaning no further updates or new features will be added to FasterTransformer itself. This tool is ideal for developers and researchers looking to enhance the performance of their transformer models.
Capabilities
Pricing & Plans
unknown
Free
FAQs
Trending