Agentic - a galois77 Collection

galois77 's Collections

Thousand brains theory

energy based models

Image generation

Training optimization

RL

Benchmarks and challenges

Agentic

updated Jan 24

Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning

Paper • 2505.01441 • Published Apr 28, 2025 • 39
LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities

Paper • 2504.16078 • Published Apr 22, 2025 • 21
Emergent Agentic Transformer from Chain of Hindsight Experience

Paper • 2305.16554 • Published May 26, 2023
DiaTool-DPO: Multi-Turn Direct Preference Optimization for Tool-Augmented Large Language Models

Paper • 2504.02882 • Published Apr 2, 2025 • 7
ATLAS: Learning to Optimally Memorize the Context at Test Time

Paper • 2505.23735 • Published May 29, 2025 • 23
Self-Challenging Language Model Agents

Paper • 2506.01716 • Published Jun 2, 2025 • 10
Matrix-Game: Interactive World Foundation Model

Paper • 2506.18701 • Published Jun 23, 2025 • 72
Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory

Paper • 2508.09736 • Published Aug 13, 2025 • 58
WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

Paper • 2509.06501 • Published Sep 8, 2025 • 82
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use

Paper • 2510.05592 • Published Oct 7, 2025 • 107
AgentFold: Long-Horizon Web Agents with Proactive Context Management

Paper • 2510.24699 • Published Oct 28, 2025 • 71
AlphaResearch: Accelerating New Algorithm Discovery with Language Models

Paper • 2511.08522 • Published Nov 11, 2025 • 18
ResearchRubrics: A Benchmark of Prompts and Rubrics For Evaluating Deep Research Agents

Paper • 2511.07685 • Published Nov 10, 2025 • 10
LLM-in-Sandbox Elicits General Agentic Intelligence

Paper • 2601.16206 • Published Jan 22 • 84