Back to Flash Attention

Example of LLM inference using FlashAttention

examples/inference/README.md

2.8.3114 B
Original Source

Example of LLM inference using FlashAttention

Example script of using FlashAttention for inference coming soon.