Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2409.00729

TRAMS: Training-free Memory Selection for Long-range Language Modeling

Paper • 2310.15494 • Published Oct 24, 2023 • 2
A Long Way to Go: Investigating Length Correlations in RLHF

Paper • 2310.03716 • Published Oct 5, 2023 • 10
YaRN: Efficient Context Window Extension of Large Language Models

Paper • 2309.00071 • Published Aug 31, 2023 • 77
Giraffe: Adventures in Expanding Context Lengths in LLMs

Paper • 2308.10882 • Published Aug 21, 2023 • 1

community-datasets/doqa

Updated Jan 18, 2024 • 150 • 2
metaeval/reclor

Viewer • Updated May 31, 2023 • 5.14k • 300 • 14
community-datasets/so_stacksample

Updated Jan 18, 2024 • 67 • 4
community-datasets/yahoo_answers_topics

Viewer • Updated Jun 24, 2024 • 1.46M • 4.91k • 58

🔍 Interpretability & Analysis of LMs

Outstanding research in LM interpretability and evaluation, summarized

Latent Reasoning in LLMs as a Vocabulary-Space Superposition

Paper • 2510.15522 • Published Oct 17 • 1
Language Models are Injective and Hence Invertible

Paper • 2510.15511 • Published Oct 17 • 69
Eliciting Secret Knowledge from Language Models

Paper • 2510.01070 • Published Oct 1 • 4
Interpreting Language Models Through Concept Descriptions: A Survey

Paper • 2510.01048 • Published Oct 1 • 2

A Bi-Step Grounding Paradigm for Large Language Models in Recommendation Systems

Paper • 2308.08434 • Published Aug 16, 2023 • 1
Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning

Paper • 2302.02662 • Published Feb 6, 2023 • 1
Self-driven Grounding: Large Language Model Agents with Automatical Language-aligned Skill Learning

Paper • 2309.01352 • Published Sep 4, 2023 • 1
Automatically Correcting Large Language Models: Surveying the landscape of diverse self-correction strategies

Paper • 2308.03188 • Published Aug 6, 2023 • 2

SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding

Paper • 2408.15545 • Published Aug 28, 2024 • 38
Controllable Text Generation for Large Language Models: A Survey

Paper • 2408.12599 • Published Aug 22, 2024 • 65
To Code, or Not To Code? Exploring Impact of Code in Pre-training

Paper • 2408.10914 • Published Aug 20, 2024 • 44
Automated Design of Agentic Systems

Paper • 2408.08435 • Published Aug 15, 2024 • 40

Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset

Paper • 2403.09029 • Published Mar 14, 2024 • 55
LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression

Paper • 2403.12968 • Published Mar 19, 2024 • 25
RAFT: Adapting Language Model to Domain Specific RAG

Paper • 2403.10131 • Published Mar 15, 2024 • 72
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking

Paper • 2403.09629 • Published Mar 14, 2024 • 78

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 151
Orion-14B: Open-source Multilingual Large Language Models

Paper • 2401.12246 • Published Jan 20, 2024 • 14
MambaByte: Token-free Selective State Space Model

Paper • 2401.13660 • Published Jan 24, 2024 • 60
MM-LLMs: Recent Advances in MultiModal Large Language Models

Paper • 2401.13601 • Published Jan 24, 2024 • 48

TRAMS: Training-free Memory Selection for Long-range Language Modeling

Paper • 2310.15494 • Published Oct 24, 2023 • 2
A Long Way to Go: Investigating Length Correlations in RLHF

Paper • 2310.03716 • Published Oct 5, 2023 • 10
YaRN: Efficient Context Window Extension of Large Language Models

Paper • 2309.00071 • Published Aug 31, 2023 • 77
Giraffe: Adventures in Expanding Context Lengths in LLMs

Paper • 2308.10882 • Published Aug 21, 2023 • 1

SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding

Paper • 2408.15545 • Published Aug 28, 2024 • 38
Controllable Text Generation for Large Language Models: A Survey

Paper • 2408.12599 • Published Aug 22, 2024 • 65
To Code, or Not To Code? Exploring Impact of Code in Pre-training

Paper • 2408.10914 • Published Aug 20, 2024 • 44
Automated Design of Agentic Systems

Paper • 2408.08435 • Published Aug 15, 2024 • 40

community-datasets/doqa

Updated Jan 18, 2024 • 150 • 2
metaeval/reclor

Viewer • Updated May 31, 2023 • 5.14k • 300 • 14
community-datasets/so_stacksample

Updated Jan 18, 2024 • 67 • 4
community-datasets/yahoo_answers_topics

Viewer • Updated Jun 24, 2024 • 1.46M • 4.91k • 58

Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset

Paper • 2403.09029 • Published Mar 14, 2024 • 55
LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression

Paper • 2403.12968 • Published Mar 19, 2024 • 25
RAFT: Adapting Language Model to Domain Specific RAG

Paper • 2403.10131 • Published Mar 15, 2024 • 72
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking

Paper • 2403.09629 • Published Mar 14, 2024 • 78

🔍 Interpretability & Analysis of LMs

Outstanding research in LM interpretability and evaluation, summarized

Latent Reasoning in LLMs as a Vocabulary-Space Superposition

Paper • 2510.15522 • Published Oct 17 • 1
Language Models are Injective and Hence Invertible

Paper • 2510.15511 • Published Oct 17 • 69
Eliciting Secret Knowledge from Language Models

Paper • 2510.01070 • Published Oct 1 • 4
Interpreting Language Models Through Concept Descriptions: A Survey

Paper • 2510.01048 • Published Oct 1 • 2

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 151
Orion-14B: Open-source Multilingual Large Language Models

Paper • 2401.12246 • Published Jan 20, 2024 • 14
MambaByte: Token-free Selective State Space Model

Paper • 2401.13660 • Published Jan 24, 2024 • 60
MM-LLMs: Recent Advances in MultiModal Large Language Models

Paper • 2401.13601 • Published Jan 24, 2024 • 48

A Bi-Step Grounding Paradigm for Large Language Models in Recommendation Systems

Paper • 2308.08434 • Published Aug 16, 2023 • 1
Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning

Paper • 2302.02662 • Published Feb 6, 2023 • 1
Self-driven Grounding: Large Language Model Agents with Automatical Language-aligned Skill Learning

Paper • 2309.01352 • Published Sep 4, 2023 • 1
Automatically Correcting Large Language Models: Surveying the landscape of diverse self-correction strategies

Paper • 2308.03188 • Published Aug 6, 2023 • 2

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs