Posts

Showing posts from May, 2023

Open Source LLM Gaining

The following is reportedly a leaked internal Google memo. It seems interesting.

Chatbot Arena and Leaderboard

The people at lmsys.org run Chatbot Arena, where you ask a question, see answers from two different bots, and rate them. Only afterwards are you told which bots they were. Using these human-rated matchups between randomly chosen bots, they can assign ratings to the different bots.

As of May 3rd, 2023, this is what their leaderboard looked like:

I do think that Vicuna is the best of these, so I think the results are right. And I like the method. I hope they keep doing this. It will make it easier to tell quickly which open source bot is the best. Also interesting is their open source leaderboard.