The people at lmsys.org run Chatbot Arena, where you ask a question, see answers from two different bots, and rate them. Only after you vote are you told which bots they were. Using these human-rated matchups between randomly paired bots, they can assign a rating to each bot. As of May 3rd, 2023, this is what their leaderboard looked like:

I do think that Vicuna is the best of these, so the results look right to me. I also like the method, and I hope they keep doing this. It will make it much quicker to tell which open source bot is the best. Their open source leaderboard is also interesting.
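
To make the rating idea concrete, here is a minimal sketch of how pairwise votes can be turned into ratings with an Elo-style update, the same family of method used for chess ratings. This is my own illustration, not LMSYS's code: the function name `compute_elo`, the bot names, the starting rating of 1000, and the K-factor of 32 are all assumptions picked for the example.

```python
from collections import defaultdict


def compute_elo(battles, k=32, scale=400, base=10, init_rating=1000):
    """Return Elo-style ratings from a sequence of pairwise battles.

    battles: iterable of (bot_a, bot_b, winner), where winner is
             "a", "b", or "tie". All parameter defaults are assumptions.
    """
    ratings = defaultdict(lambda: float(init_rating))
    score_for_a = {"a": 1.0, "b": 0.0, "tie": 0.5}

    for bot_a, bot_b, winner in battles:
        ra, rb = ratings[bot_a], ratings[bot_b]
        # Probability that bot_a wins, given the current rating gap.
        expected_a = 1 / (1 + base ** ((rb - ra) / scale))
        actual_a = score_for_a[winner]
        # Winner gains what the loser gives up, scaled by how surprising it was.
        delta = k * (actual_a - expected_a)
        ratings[bot_a] = ra + delta
        ratings[bot_b] = rb - delta

    return dict(ratings)


if __name__ == "__main__":
    # Hypothetical votes between made-up bots, not real Arena data.
    votes = [
        ("bot_x", "bot_y", "a"),
        ("bot_y", "bot_z", "tie"),
        ("bot_z", "bot_x", "b"),
    ]
    leaderboard = sorted(compute_elo(votes).items(), key=lambda kv: -kv[1])
    for name, rating in leaderboard:
        print(f"{name}: {rating:.1f}")
```

A nice property of this kind of update is that an upset (a low-rated bot beating a high-rated one) moves the ratings more than an expected win does, so a modest number of random matchups is enough to start sorting the bots. One known caveat is that the result depends on the order in which the votes are processed.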