Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
sdiazlor 's Collections
Leaderboards
Instruction Models
Computer Vision Models
Audio Models
Data Related Tools
Utilities
Favorite Demos

Leaderboards

updated Jul 14

Meaningful leaderboards showcasing LLM evaluation results across various tasks and dimensions

Upvote
-

  • Running
    17

    InferBench

    🥇
    17

    A cost/quality/speed Leaderboard for Inference Providers!


  • Running on CPU Upgrade
    6.78k

    MTEB Leaderboard

    🥇
    6.78k

    Embedding Leaderboard


  • Running on CPU Upgrade
    13.7k

    Open LLM Leaderboard

    🏆
    13.7k

    Track, rank and evaluate open LLMs and chatbots


  • Running
    4.68k

    LMArena Leaderboard

    🏆
    4.68k

    Display LMArena Leaderboard


  • Running on CPU Upgrade
    74

    La Leaderboard

    🌸
    74

    Evaluate open LLMs in the languages of LATAM and Spain.


  • Running
    109

    Judge Arena

    💻
    109

    Vote on AI responses to rank models


  • Running
    Featured
    576

    LLM-Perf Leaderboard

    🏆
    576

    Explore hardware performance for LLMs


  • Running
    188

    Vidore Leaderboard

    🥇
    188

    Explore visual document retrieval model rankings


  • Running on CPU Upgrade
    940

    Open VLM Leaderboard

    🌎
    940

    VLMEvalKit Evaluation Results Collection


  • Running
    Featured
    85

    SEED-Bench Leaderboard

    🏆
    85

    Submit model evaluation results to leaderboard


  • Running
    23

    MM-UPD Leaderboard

    🥇
    23

    Submit and evaluate model results on MM-UPD benchmarks


  • Paused
    24

    MMBench Leaderboard

    🚀
    24

    Explore MMBench Leaderboard data

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs