Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Mukayese
community
https://mukayese.tdd.ai/
Activity Feed
Request to join this org
Follow
18
AI & ML interests
Turkish NLP, Benchmarking
Recent Activity
emrecanacikgoz
authored
a paper
about 2 months ago
TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons
emrecanacikgoz
authored
a paper
about 2 months ago
PIPA: A Unified Evaluation Protocol for Diagnosing Interactive Planning Agents
emrecanacikgoz
authored
a paper
about 2 months ago
Self-Improving LLM Agents at Test-Time
View all activity
Team members
3
mukayese
's datasets
3
Sort: Recently updated
mukayese/gsm8k-tr
Viewer
•
Updated
Jul 4, 2024
•
1.32k
•
58
•
2
mukayese/arc-tr
Viewer
•
Updated
Mar 24, 2024
•
1.17k
•
77
•
4
mukayese/truthful_qa-tr
Viewer
•
Updated
Mar 17, 2024
•
817
•
29
•
3