Quentin Gallouédec's picture

Hiring 💼

Quentin Gallouédec PRO

qgallouedec

·

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

openbmb/MiniCPM4.1-8B

updated a dataset 1 day ago

hf-doc-build/doc-build-dev

updated a dataset 1 day ago

trl-lib/documentation-images

View all activity

Organizations

liked a model 1 day ago

openbmb/MiniCPM4.1-8B

Text Generation • 8B • Updated Oct 24 • 18.9k • 380

updated 2 datasets 1 day ago

hf-doc-build/doc-build-dev

Updated about 3 hours ago • 177k • 6

trl-lib/documentation-images

Viewer • Updated 1 day ago • 9 • 75.5k

New activity in trl-lib/documentation-images 1 day ago

Add logos as assets

#3 opened 2 days ago by

upvoted a paper 2 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published 6 days ago • 77

published an article 3 days ago

Article

Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand

3 days ago

•

41

liked a dataset 5 days ago

a2aj/canadian-laws

Viewer • Updated 7 days ago • 5.77k • 173 • 4

updated a dataset 9 days ago

qgallouedec/Dolci-Think-DPO-7B

Viewer • Updated 9 days ago • 150k • 29

published a dataset 9 days ago

qgallouedec/Dolci-Think-DPO-7B

Viewer • Updated 9 days ago • 150k • 29

liked a dataset 9 days ago

allenai/Dolci-Think-DPO-7B

Viewer • Updated 17 days ago • 150k • 560 • 7

updated a dataset 11 days ago

hf-doc-build/doc-build

Updated about 3 hours ago • 1.16M • 13

New activity in HuggingFaceTB/SmolLM3-3B 12 days ago

Tool calls aren't rendered by the chat template

#44 opened 12 days ago by

upvoted a paper 12 days ago

Go-Explore: a New Approach for Hard-Exploration Problems

Paper • 1901.10995 • Published Jan 30, 2019 • 1

upvoted a paper 14 days ago

KTO: Model Alignment as Prospect Theoretic Optimization

Paper • 2402.01306 • Published Feb 2, 2024 • 20

upvoted an article 16 days ago

Article

20x Faster TRL Fine-tuning with RapidFire AI

+1

16 days ago

•

20

updated a dataset 16 days ago

huggingface/documentation-images

Viewer • Updated 3 days ago • 55 • 1.99M • 93

liked a Space 16 days ago

RapidFire AI — LLM Fine-Tuning Engine

Hyperparallel LLM fine-tuning with live ops.

published an article 16 days ago

Article

20x Faster TRL Fine-tuning with RapidFire AI

+1

16 days ago

•

20

updated a dataset 19 days ago

qgallouedec/biogrid_qa

Viewer • Updated 19 days ago • 59.4k • 263

published a dataset 19 days ago

qgallouedec/biogrid_qa

Viewer • Updated 19 days ago • 59.4k • 263