Hacker News | weichiang's submissions
1. Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena (twitter.com/lmsysorg)
20 points by weichiang on June 16, 2023 | past
2. Google PaLM 2 ranked 6th on the LLM benchmark in the wild (twitter.com/lmsysorg)
1 point by weichiang on May 25, 2023 | past
3. Chatbot Arena: a crowd-sourced LLM leaderboard (twitter.com/lmsysorg)
1 point by weichiang on May 12, 2023 | past | 1 comment
4. State-of-the-Art Chatbot, Vicuna-7B, now runs on MacBook with GPU acceleration (twitter.com/lmsysorg)
126 points by weichiang on April 6, 2023 | past | 84 comments
5. State-of-the-art open-source chatbot, Vicuna-13B, just released model weights (twitter.com/lmsysorg)
271 points by weichiang on April 3, 2023 | past | 139 comments
6. Who's GPT-4's favorite? Battles between state-of-the-art chatbots. (lmsys.org)
6 points by weichiang on March 30, 2023 | past | 4 comments
