Edward Neuhaus's picture

Edward Neuhaus

Pretergeek

·

https://ko-fi.com/pretergeek

pretergeek

AI & ML interests

NLP, ML, LLMs, AI Ethics, AI Welfare, Privacy in AI, FOSS.

Recent Activity

upvoted a collection 1 day ago

liked a dataset 6 days ago

AlistairPullen/gsm8k-grpo-format

liked a dataset 6 days ago

thesven/gsm8k-reasoning

View all activity

Organizations

None yet

upvoted a collection 1 day ago

Ministral 3

A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated Dec 2, 2025 • 139

upvoted an article 6 days ago

Article

The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU

6 days ago

•

9

upvoted 6 collections 2 months ago

LLaVA-Video

Models focus on video understanding (previously known as LLaVA-NeXT-Video). • 8 items • Updated Feb 21, 2025 • 64

SmolLM3 evaluation datasets

Datasets to decontaminate the post-training mixtures against. Use the subset and column values described per entry • 13 items • Updated Jul 8, 2025 • 7

SmolLM3 pretraining datasets

datasets used in SmolLM3 pretraining • 15 items • Updated Aug 12, 2025 • 42

INTELLECT-2

INTELLECT-2 is a 32 billion parameter language model with globally distributed reinforcement learning. • 3 items • Updated Oct 7, 2025 • 26

INTELLECT-1 Dataset

INTELLECT-1 Training dataset • 5 items • Updated Oct 7, 2025 • 25

INTELLECT-1

13 items • Updated Oct 7, 2025 • 12

upvoted an article 3 months ago

Article

Preserving Agency: Why AI Safety Needs Community, Not Corporate Control

Sep 29, 2025

•

10

upvoted a paper 4 months ago

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

Paper • 2406.08464 • Published Jun 12, 2024 • 71

upvoted a collection 5 months ago

Cosmos-Reason1

⚠️ The latest version of Cosmos Reason is now live! 👉 https://huggingface.co/collections/nvidia/cosmos-reason2 • 8 items • Updated about 15 hours ago • 38

upvoted an article 6 months ago

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Jul 9, 2025

•

755

upvoted a paper 7 months ago

RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics

Paper • 2506.04308 • Published Jun 4, 2025 • 43

upvoted an article 8 months ago

Article

Interactive Tools for machine learning, deep learning, and math

May 26, 2025

•

47

upvoted 3 papers 8 months ago

Thinkless: LLM Learns When to Think

Paper • 2505.13379 • Published May 19, 2025 • 50

AdaptThink: Reasoning Models Can Learn When to Think

Paper • 2505.13417 • Published May 19, 2025 • 83

AdaCoT: Pareto-Optimal Adaptive Chain-of-Thought Triggering via Reinforcement Learning

Paper • 2505.11896 • Published May 17, 2025 • 58

upvoted a collection 8 months ago

Physical AI

Collection of open, commercial-grade datasets for physical AI developers • 23 items • Updated 16 days ago • 104

upvoted a paper 8 months ago

Think on your Feet: Adaptive Thinking via Reinforcement Learning for Social Agents

Paper • 2505.02156 • Published May 4, 2025 • 18

upvoted a collection 11 months ago

The Big Benchmarks Collection

Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) • 13 items • Updated Nov 18, 2024 • 256