If those teams got together, hacked up a program that ran two or more independent systems (entries) at the same time and averaged the results, they'd probably get to 10 percent.
They actually go one better than your suggestion: instead of a plain average, they use machine learning to figure out how much weight to give one team's results versus the other's.
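To make the difference concrete, here's a rough sketch of a learned blend versus a plain average. Everything in it is an assumption for illustration: the data is synthetic, the names (`fit_blend_weights`, `team_a`, `team_b`) are made up, and ordinary least squares stands in for whatever learning method the teams actually use, which isn't specified here.

```python
import numpy as np

def fit_blend_weights(preds, targets):
    """Least-squares weights for combining the columns of preds to match targets."""
    weights, *_ = np.linalg.lstsq(preds, targets, rcond=None)
    return weights

def rmse(pred, target):
    """Root-mean-square error between predictions and true values."""
    return float(np.sqrt(np.mean((pred - target) ** 2)))

# Synthetic stand-ins (assumption): two entries that predict the same
# ratings with different amounts of noise.
rng = np.random.default_rng(0)
true_ratings = rng.uniform(1, 5, size=2000)
team_a = true_ratings + rng.normal(0, 0.9, size=2000)  # the better entry
team_b = true_ratings + rng.normal(0, 1.1, size=2000)  # the noisier entry
preds = np.column_stack([team_a, team_b])

# Learn the weights on one half, score on the other half,
# so the blend isn't graded on the data it was fit to.
fit, test = slice(0, 1000), slice(1000, 2000)
weights = fit_blend_weights(preds[fit], true_ratings[fit])

simple_avg = preds[test].mean(axis=1)  # the "just average them" approach
blended = preds[test] @ weights        # the learned weighting

print("weights:", weights)
print("RMSE team A:", rmse(team_a[test], true_ratings[test]))
print("RMSE team B:", rmse(team_b[test], true_ratings[test]))
print("RMSE simple average:", rmse(simple_avg, true_ratings[test]))
print("RMSE learned blend:", rmse(blended, true_ratings[test]))
```

The point of the sketch is that the fit gives the less noisy entry the larger weight, where a straight average treats both entries as equally trustworthy.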