Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
KeiraYC 's Collections
TDL project
Low-rank attention

Low-rank attention

updated 3 days ago
Upvote
-

  • SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference

    Paper • 2502.18137 • Published Feb 25 • 58

  • XAttention: Block Sparse Attention with Antidiagonal Scoring

    Paper • 2503.16428 • Published Mar 20 • 15

  • On the Benefits of Rank in Attention Layers

    Paper • 2407.16153 • Published Jul 23, 2024

  • Towards Understanding the Nature of Attention with Low-Rank Sparse Decomposition

    Paper • 2504.20938 • Published Apr 29

  • Loki: Low-Rank Keys for Efficient Sparse Attention

    Paper • 2406.02542 • Published Jun 4, 2024

  • Learning to Compress: Local Rank and Information Compression in Deep Neural Networks

    Paper • 2410.07687 • Published Oct 10, 2024 • 1

  • LoRA Learns Less and Forgets Less

    Paper • 2405.09673 • Published May 15, 2024 • 89
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs