demobook

3 Real Product Demos of Lmsys Chatbot Arena

Lmsys Chatbot Arena

Open-source platform for crowdsourced LLM benchmarking via blind pairwise comparisons.

Lmsys Chatbot Arena provides a live, community-driven evaluation environment where users rate Large Language Models through side-by-side blind testing. The platform uses an Elo rating system to turn those pairwise votes into a transparent, real-world leaderboard covering both open-source and proprietary AI models.
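As a rough illustration of how pairwise votes can become a ranking, here is a minimal sketch of a textbook online Elo update. It is not the platform's actual pipeline: the starting rating of 1000, the K-factor of 32, the 400-point scale, and the function names `expected_score`, `update_elo`, and `rate_battles` are all assumptions chosen for this example, and the live leaderboard may use a different estimator.

```python
from typing import Dict, Iterable, Tuple


def expected_score(rating_a: float, rating_b: float) -> float:
    """Probability that A beats B under the standard Elo logistic curve."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_elo(rating_a: float, rating_b: float, outcome_a: float,
               k: float = 32.0) -> Tuple[float, float]:
    """Return new ratings after one battle.

    outcome_a is 1.0 if A won, 0.0 if B won, 0.5 for a tie.
    k (assumed value) controls how far a single vote moves the ratings.
    """
    exp_a = expected_score(rating_a, rating_b)
    new_a = rating_a + k * (outcome_a - exp_a)
    new_b = rating_b + k * ((1.0 - outcome_a) - (1.0 - exp_a))
    return new_a, new_b


def rate_battles(battles: Iterable[Tuple[str, str, str]],
                 initial: float = 1000.0,
                 k: float = 32.0) -> Dict[str, float]:
    """Replay (model_a, model_b, winner) votes in order; winner is 'a', 'b', or 'tie'."""
    outcomes = {"a": 1.0, "b": 0.0, "tie": 0.5}
    ratings: Dict[str, float] = {}
    for model_a, model_b, winner in battles:
        ra = ratings.setdefault(model_a, initial)
        rb = ratings.setdefault(model_b, initial)
        ratings[model_a], ratings[model_b] = update_elo(ra, rb, outcomes[winner], k)
    return ratings


if __name__ == "__main__":
    # Hypothetical votes between placeholder model names.
    votes = [("model-x", "model-y", "a"), ("model-y", "model-z", "tie")]
    for name, rating in sorted(rate_battles(votes).items(), key=lambda kv: -kv[1]):
        print(f"{name}: {rating:.1f}")
```

Because each update adds to one model exactly what it subtracts from the other, the total rating mass stays constant; the result does depend on the order in which votes are replayed, which is one reason a production leaderboard might prefer a batch-fitted model instead.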

by Lmsys

beginner
  • Blind pairwise model comparisons
  • Crowdsourced Elo rating leaderboard
  • Real-world user prompt evaluation
  • Support for open-weight and commercial APIs
  • Preview testing for unreleased models
  • Open-source FastChat infrastructure
  • Transparent evaluation and ranking pipelines
  • Publicly shared user preference datasets
3 clips
  1. 8:22 · 0:45 · Compare video models in LMSYS Arena
     The video demonstrates using "Battle Mode" in Arena to generate two videos from random high-end models simultaneously for comparison and download.
     Lmsys Chatbot Arena · beginner
  2. 8:23 · 1:21 · Compare AI image models on LMSYS Chatbot Arena
     The user demonstrates generating images on the LMSYS platform to compare outputs from different models, such as Qwen Image and Flux Pro, side by side.
     Lmsys Chatbot Arena · Flux · beginner
  3. 9:55 · 1:16 · Generate videos with Seedance and Happy Horse
     The creator uses the LMSYS video generation interface to produce and compare 8-second video clips using the Seedance 1.5 and Happy Horse models.
     Lmsys Chatbot Arena · Seedance · beginner