Elie Bakouch PRO

eliebak

AI & ML interests

Training LLMs @ 🤗

Recent Activity

liked a model 2 days ago

EssentialAI/rnj-1-instruct

liked a model 2 days ago

EssentialAI/rnj-1

upvoted a paper 3 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published 7 days ago • 78

upvoted a paper 6 days ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published 14 days ago • 240

upvoted an article 6 days ago

Transformers v5: Simple model definitions powering the AI ecosystem

Article • 7 days ago • 224

upvoted a collection 9 days ago

INTELLECT-3

INTELLECT-3: A 100B+ MoE trained with large-scale RL • 4 items • Updated 9 days ago • 11

upvoted an article 17 days ago

Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models

Article • 19 days ago • 26

upvoted a collection 22 days ago

NeMo Gym

Collection of RL verifiable data for NeMo Gym • 8 items • Updated 4 days ago • 8

upvoted a paper 24 days ago

Motif 2 12.7B technical report

Paper • 2511.07464 • Published about 1 month ago • 38

upvoted 2 collections about 1 month ago

gpt-oss

Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7 • 390

gpt-oss-safeguard

gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are safety reasoning models built upon gpt-oss • 2 items • Updated Oct 29 • 58

upvoted a collection about 2 months ago

Reproducing-TRM

3 items • Updated Oct 22 • 4

upvoted an article about 2 months ago

Building the Open Agent Ecosystem Together: Introducing OpenEnv

Article • Oct 23 • 134

upvoted a paper 2 months ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6 • 493

upvoted a paper 3 months ago

M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models

Paper • 2504.10449 • Published Apr 14 • 15

upvoted a collection 3 months ago

Tiny Language Model Datasets

Collection of synthetic datasets that can be used in pretraining of any Tiny Language Model • 14 items • Updated Sep 21 • 29

upvoted 4 papers 3 months ago

Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks

Paper • 2508.18672 • Published Aug 26 • 10

Fantastic Pretraining Optimizers and Where to Find Them

Paper • 2509.02046 • Published Sep 2 • 13

AWorld: Orchestrating the Training Recipe for Agentic AI

Paper • 2508.20404 • Published Aug 28 • 38

Motif 2.6B Technical Report

Paper • 2508.09148 • Published Aug 2 • 5

upvoted a paper 4 months ago

Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 105

upvoted an article 4 months ago

Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨

Article • Jul 25 • 83