Collection of the models for our paper "Intrinsic Credit Assignment for Long Horizon Interaction"
Joschka Strüber
Klingspor
AI & ML interests
None yet
Recent Activity
upvoted
an
article
4 days ago
DenseR: Dense Rewards For Free in LLM Reasoning
updated
a model
9 days ago
Klingspor/StarPO-4B
updated
a model
9 days ago
Klingspor/StarPO-1.7B