I was about to state that there is nothing here that 4o or Sonnet couldn’t do with very limited prompting, then I noticed that the hamburger menu on mobile doesn’t even work and had to retract that statement. Both wouldn’t have made such a mistake.
Thanks, this only cements where Devin lies in comparison and explains the lack of benchmarks and independent testing…
I'm impressed with Devin's capabilities. Its good at building standard web applications and implementing common patterns. I is particularly effective for enterprises needing basic web pages or solutions that follow established development patterns.
While Devin handles routine development tasks well, it still requires oversight and guidance when dealing with complex integrations or custom business logic. It was helpful in reducing the time spent on boilerplate code and basic setup tasks.