PAPERS
updated
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for
Fast, Memory Efficient, and Long Context Finetuning and Inference
Paper
•
2412.13663
•
Published
•
158
A Survey of Small Language Models
Paper
•
2410.20011
•
Published
•
46
No More Adam: Learning Rate Scaling at Initialization is All You Need
Paper
•
2412.11768
•
Published
•
43
Chain of Draft: Thinking Faster by Writing Less
Paper
•
2502.18600
•
Published
•
50
How far can we go with ImageNet for Text-to-Image generation?
Paper
•
2502.21318
•
Published
•
26