21 57 88

Asaf Yehudai

Asaf-Yehudai

AI & ML interests

None yet

Recent Activity

liked a dataset 17 days ago

ibm-research/Auto-BenchmarkCard

liked a model about 1 month ago

open-thoughts/OpenThinker-Agent-v1

liked a dataset about 1 month ago

open-thoughts/OpenThoughts-Agent-v1-SFT

View all activity

Organizations

New activity in lmarena-ai/arena-human-preference-140k 4 months ago

Missing models compared to the Arena-Hard-v2.0-Preview

#2 opened 4 months ago by

Asaf-Yehudai

New activity in gaia-benchmark/leaderboard 4 months ago

Access to the submission and evaluation data

#69 opened 4 months ago by

Asaf-Yehudai

commented a paper 5 months ago

CLEAR: Error Analysis via LLM-as-a-Judge Made Easy

Paper • 2507.18392 • Published Jul 24, 2025 • 19 •

commented 2 papers 9 months ago

Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving

Paper • 2504.02605 • Published Apr 3, 2025 • 48 •

LiveVQA: Live Visual Knowledge Seeking

Paper • 2504.05288 • Published Apr 7, 2025 • 15 •

commented 3 papers 10 months ago

commented a paper 11 months ago

Selective Self-to-Supervised Fine-Tuning for Generalization in Large Language Models

Paper • 2502.08130 • Published Feb 12, 2025 • 9 •

New activity in RLHFlow/ArmoRM-Llama3-8B-v0.1 over 1 year ago

Problem running the model

#1 opened over 1 year ago by

Asaf-Yehudai

New activity in mistralai/Mixtral-8x22B-Instruct-v0.1 over 1 year ago

MT-Bench Results

👍 5

#8 opened over 1 year ago by

0-hero

New activity in Nexusflow/Starling-RM-34B almost 2 years ago

Bug with example code:

#1 opened almost 2 years ago by

Asaf-Yehudai

New activity in microsoft/phi-2 about 2 years ago

How to Train model with AutoModelForSequenceClassification?

👍 2

#20 opened about 2 years ago by

jerife

New activity in dfurman/Falcon-40B-Chat-v0.1 over 2 years ago

qlora - need to be applied and few more places

#4 opened over 2 years ago by

Asaf-Yehudai

New activity in timdettmers/openassistant-guanaco over 2 years ago

Guanaco?

#1 opened over 2 years ago by

edensn

New activity in eachadea/vicuna-13b-1.1 over 2 years ago

running the model in Python

#3 opened over 2 years ago by

Asaf-Yehudai

Asaf Yehudai

AI & ML interests

Recent Activity

Organizations

Asaf-Yehudai's activity

Missing models compared to the Arena-Hard-v2.0-Preview

Access to the submission and evaluation data

Problem running the model

MT-Bench Results

Bug with example code:

How to Train model with AutoModelForSequenceClassification?

qlora - need to be applied and few more places

Guanaco?

running the model in Python