Low Horng Jiun
NickolasLow1
AI & ML interests
None yet
Recent Activity
reacted to sergiopaniego's post 13 days ago
Interested in RL training environments?
We just released a beginner-friendly walkthrough notebook!
Train a model to play Wordle using TRL + OpenEnv (TextArena) + GRPO + vLLM.
Happy learning!
Notebook: https://github.com/huggingface/trl/blob/main/examples/notebooks/openenv_wordle_grpo.ipynb
OpenEnv guide in TRL: https://huggingface.co/docs/trl/main/en/openenv
updated a model 17 days ago
NickolasLow1/Qwen2.5-7B-Instruct
updated a Space 17 days ago
NickolasLow1/Qwen2.5-7B-Instruct
Organizations
None yet