Skip to content
@lmarena

LMArena

An open platform to evaluate, benchmark, compare, and test frontier AI models

Popular repositories Loading

  1. arena-hard-auto arena-hard-auto Public

    Arena-Hard-Auto: An automatic LLM benchmark.

    Python 966 136

  2. copilot-arena copilot-arena Public

    TypeScript 340 26

  3. p2l p2l Public

    Prompt-to-Leaderboard

    Python 265 24

  4. PPE PPE Public

    Jupyter Notebook 59 12

  5. search-arena search-arena Public

    ⚔️ Official code of "Search Arena: Analyzing Search-Augmented LLMs".

    Jupyter Notebook 45 6

  6. lmarena.github.io lmarena.github.io Public

    HTML 19 14

Repositories

Showing 10 of 10 repositories

Most used topics

Loading…