|
|
1. | | Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena (twitter.com/lmsysorg) | |
20 points by weichiang on June 16, 2023 | past
|
2. | | Google PaLM 2 ranked 6th on the LLM benchmark in the wild (twitter.com/lmsysorg) | |
1 point by weichiang on May 25, 2023 | past
|
3. | | Chatbot Arena: a crowd-sourced LLM leaderboard (twitter.com/lmsysorg) | |
1 point by weichiang on May 12, 2023 | past | 1 comment
|
4. | | State-of-the-Art Chatbot, Vicuna-7B, now runs on MacBook with GPU acceleration (twitter.com/lmsysorg) | |
126 points by weichiang on April 6, 2023 | past | 84 comments
|
5. | | State-of-the-art open-source chatbot, Vicuna-13B, just released model weights (twitter.com/lmsysorg) | |
271 points by weichiang on April 3, 2023 | past | 139 comments
|
6. | | Who's GPT-4's favorite? Battles between state-of-the-art chatbots. (lmsys.org) | |
6 points by weichiang on March 30, 2023 | past | 4 comments
|
|
Consider applying for YC's Spring batch! Applications are open till Feb 11.
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact
|