Awesome-LLM-Inference
Awesome-LLM-Inference is an open-source research and education resource: a curated list of papers and code for Large Language Model (LLM) inference. It covers techniques such as Flash-Attention and Paged-Attention, along with other inference optimization methods.