hal

community

https://hal.cs.princeton.edu/

AI & ML interests

None defined yet.

Recent Activity

ronch99

submitted a paper to Daily Papers about 11 hours ago

Bridging Online and Offline RL: Contextual Bandit Learning for Multi-Turn Code Generation

Paper • 2602.03806 • Published about 21 hours ago • 3

kanghengliu

updated a dataset 3 days ago

agent-evals/hal_traces

Updated 3 days ago • 1.05k • 4

xuetianci99

updated a dataset about 1 month ago

agent-evals/hal_traces

Updated 3 days ago • 1.05k • 4

boyiwei

updated a dataset 3 months ago

agent-evals/hal_traces

Updated 3 days ago • 1.05k • 4

fsndzomga

updated a dataset 3 months ago

agent-evals/hal_traces

Updated 3 days ago • 1.05k • 4

boyiwei

authored a paper 3 months ago

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29, 2025 • 223

ronch99

updated a dataset 3 months ago

agent-evals/hal_traces

Updated 3 days ago • 1.05k • 4

Peterkirgis

updated a dataset 4 months ago

agent-evals/hal_traces

Updated 3 days ago • 1.05k • 4

boyiwei

authored a paper 4 months ago

Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents

Paper • 2509.26354 • Published Sep 30, 2025 • 18

sayashk

updated a dataset 6 months ago

agent-evals/hal_traces

Updated 3 days ago • 1.05k • 4

siegelz

updated a dataset 6 months ago

agent-evals/hal_traces

Updated 3 days ago • 1.05k • 4

boyiwei

authored a paper 8 months ago

On Evaluating the Durability of Safeguards for Open-Weight LLMs

Paper • 2412.07097 • Published Dec 10, 2024 • 1

benediktstroebl

authored a paper 8 months ago

Dynamic Risk Assessments for Offensive Cybersecurity Agents

Paper • 2505.18384 • Published May 23, 2025 • 8

boyiwei

authored a paper 8 months ago

Dynamic Risk Assessments for Offensive Cybersecurity Agents

Paper • 2505.18384 • Published May 23, 2025 • 8

sayashk

authored a paper 9 months ago

The Leaderboard Illusion

Paper • 2504.20879 • Published Apr 29, 2025 • 72

yifeizhou

authored a paper 10 months ago

Learning Adaptive Parallel Reasoning with Language Models

Paper • 2504.15466 • Published Apr 21, 2025 • 44

yifeizhou

authored a paper 11 months ago

SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks

Paper • 2503.15478 • Published Mar 19, 2025 • 13

yifeizhou

authored 3 papers about 1 year ago

Team members 14