Low-rank attention - a KeiraYC Collection

Low-rank attention

updated 3 days ago

SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference

Paper • 2502.18137 • Published Feb 25 • 58

XAttention: Block Sparse Attention with Antidiagonal Scoring

Paper • 2503.16428 • Published Mar 20 • 15

On the Benefits of Rank in Attention Layers

Paper • 2407.16153 • Published Jul 23, 2024

Towards Understanding the Nature of Attention with Low-Rank Sparse Decomposition

Paper • 2504.20938 • Published Apr 29

Loki: Low-Rank Keys for Efficient Sparse Attention

Paper • 2406.02542 • Published Jun 4, 2024

Learning to Compress: Local Rank and Information Compression in Deep Neural Networks

Paper • 2410.07687 • Published Oct 10, 2024 • 1

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15, 2024 • 89