Long-Context-Attention
Visit Toollong-context-attention is an open-source research tool that provides a unified sequence parallel approach for long context LLM model training and inference. It combines DeepSpeed-Ulysses-Attention and Ring-Attention for improved performance and versatility.
At a glance
Trending