Hitting a non-blocking C stack is much cheaper than hitting a blocking Python stack, especially when you have tons of clients hammering the servers with polling requests. So the optimization was to take load off our Python servers by handling most of the polling requests in a C-based server.
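To make the split concrete, here is a minimal sketch of the routing idea only. It is written in Python purely for brevity and is not the actual C server; the `PollFrontend` class, the `/poll` path, and the `backend` callable are all hypothetical names chosen for illustration. The point it shows is that polls get answered from the front tier's own memory, and only the remaining requests ever reach the Python application servers.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class PollFrontend:
    """Hypothetical front tier: answers polls itself, forwards everything else."""

    # Stand-in for the Python application servers (assumption: path, client -> body).
    backend: Callable[[str, str], str]
    # Pending updates per client, held in the front tier's memory.
    updates: Dict[str, List[str]] = field(default_factory=dict)
    # Counts how many requests actually reached the backend.
    backend_hits: int = 0

    def publish(self, client_id: str, message: str) -> None:
        """Queue an update so the next poll from this client can pick it up."""
        self.updates.setdefault(client_id, []).append(message)

    def handle(self, path: str, client_id: str) -> str:
        """Serve polls locally; pass any other request through to the backend."""
        if path == "/poll":
            pending = self.updates.pop(client_id, [])
            return ",".join(pending) if pending else "no-updates"
        self.backend_hits += 1
        return self.backend(path, client_id)


if __name__ == "__main__":
    frontend = PollFrontend(backend=lambda path, client_id: f"backend:{path}")
    for _ in range(1000):
        frontend.handle("/poll", "alice")      # none of these touch the backend
    frontend.handle("/save", "alice")          # only this one does
    print(frontend.backend_hits)
```

In this sketch a thousand empty polls cost the backend nothing; only the one non-polling request is forwarded. A real front tier would also need non-blocking I/O and some way for the Python side to push updates into it, which this example leaves out.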