t.d.a.g.'s picture

t.d.a.g. PRO

sequelbox

·

sequelbox.bsky.social

AI & ML interests

open source, infinite games. (they/them)

Recent Activity

new activity 3 days ago

mradermacher/model_requests:https://huggingface.co/ValiantLabs/Ministral-3-14B-Reasoning-2512-Esper3.1

new activity 3 days ago

mradermacher/model_requests:request: ValiantLabs/Ministral-3-8B-Reasoning-2512-Esper3.1

upvoted a collection 3 days ago

View all activity

Organizations

upvoted a collection 3 days ago

Esper 3.1

Esper 3.1 is a DevOps, architecture, code, and general reasoning finetune for Qwen, Ministral and gpt-oss! • 5 items • Updated 3 days ago • 1

upvoted a changelog 3 days ago

Changelog

Duplicate Datasets

4 days ago

• 55

upvoted a collection 6 days ago

Qwen3

84 items • Updated Aug 6 • 1.47k

upvoted a paper about 1 month ago

The Massive Legal Embedding Benchmark (MLEB)

Paper • 2510.19365 • Published Oct 22 • 17

upvoted a paper 3 months ago

Hermes 4 Technical Report

Paper • 2508.18255 • Published Aug 25 • 41

upvoted 2 articles 5 months ago

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Jul 9

•

722

Article

SmolLM3: smol, multilingual, long-context reasoner

+21

Jul 8

•

734

upvoted 2 papers 5 months ago

CodeContests+: High-Quality Test Case Generation for Competitive Programming

Paper • 2506.05817 • Published Jun 6 • 9

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

Paper • 2506.20512 • Published Jun 25 • 47

upvoted a collection 5 months ago

🐙 OctoThinker

Mid-training Incentivizes Reinforcement Learning Scaling • 18 items • Updated Jun 26 • 2

upvoted a changelog 6 months ago

Changelog

Organization and User profiles now include repository listing pages

Jun 20

• 131

upvoted a collection 6 months ago

Esper 3

Esper 3 is a DevOps, architecture, code, and general reasoning finetune for Qwen 3! • 4 items • Updated 3 days ago • 3

upvoted a paper 11 months ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published Jan 2 • 52

upvoted a collection 12 months ago

Llama-3.1-Nemotron-70B

SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated 3 days ago • 155

upvoted a paper about 1 year ago

HelpSteer2-Preference: Complementing Ratings with Preferences

Paper • 2410.01257 • Published Oct 2, 2024 • 24

upvoted an article about 1 year ago

Article

Introducing Community Tools on HuggingChat

Sep 16, 2024

•

37

upvoted an article over 1 year ago

Article

Synthetic dataset generation techniques: Self-Instruct

May 15, 2024

•

21

upvoted a collection over 1 year ago

Llama 3.x Models

Our models built with Llama 3, 3.1, and 3.2 • 10 items • Updated 3 days ago • 3

upvoted 2 collections about 2 years ago

Llamafied Yi

Yi base models converted to Llama architecture. • 4 items • Updated Nov 14, 2023 • 9

Open LLM Leaderboard best models ❤️‍🔥

A daily uploaded list of models with best evaluations on the LLM leaderboard: • 65 items • Updated Mar 20 • 652