Plenty of websites already let you compare how language models handle various types of prompts. But have you ever wanted to know which ones perform best at coding? With WebDev Arena, you can compare these AIs and vote on them. For example, I used this tool to clone Hacker News:
Once there, you can just vote on which model does the job better. In this test, Claude did better than Gemini. You can test this out for other coding tasks here.