Attention_with_linear_biases
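attention_with_linear_biases is a tool in the Open Source & Models category that provides code for ALiBi (Attention with Linear Biases), a position method for transformer language models. ALiBi enables input length extrapolation: a model trained on short sequences can be run on longer sequences at inference time. The method is detailed in the ICLR 2022 paper 'Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation'.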
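As a rough illustration of the idea behind ALiBi, the sketch below builds the linear bias that is added to attention scores before the softmax. It is not taken from the repository: the function names `alibi_slopes` and `alibi_bias` are hypothetical, NumPy is used for brevity, and the slope formula assumes the number of heads is a power of two (the geometric sequence described in the paper).

```python
import numpy as np

def alibi_slopes(num_heads):
    # Hypothetical helper: per-head slopes 2^(-8/n), 2^(-16/n), ..., 2^(-8),
    # assuming num_heads (n) is a power of two.
    return np.array([2.0 ** (-8.0 * (h + 1) / num_heads) for h in range(num_heads)])

def alibi_bias(num_heads, seq_len):
    # distance[i, j] = i - j for query position i and key position j.
    positions = np.arange(seq_len)
    distance = positions[:, None] - positions[None, :]
    # Future keys (j > i) are left at zero here; a causal mask is assumed
    # to be applied separately.
    distance = np.maximum(distance, 0)
    slopes = alibi_slopes(num_heads)
    # Result has shape (num_heads, seq_len, seq_len): a penalty that grows
    # linearly with how far back the key is from the query.
    return -slopes[:, None, None] * distance[None, :, :]

bias = alibi_bias(num_heads=8, seq_len=4)
```

Because the bias depends only on the distance between query and key, not on absolute position, the same function can produce a bias for any `seq_len`, which is what makes evaluation on longer inputs possible in principle.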