(Setting aside the fact that Logstash is itself a JRuby/Java app that easily eats said gigabytes of heap.)
Do JSON logs take gigabytes?
If they do for you, then yes, gigabytes of disk and memory are pretty much guaranteed. Things also tend to pile up over time (even on a horizon of a few weeks).
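For a rough sense of scale (purely illustrative numbers, not from anyone's actual setup), even a modest service emitting structured logs adds up quickly:

    500 bytes/event × 2,000 events/s × 86,400 s/day ≈ 86 GB/day

And that is raw JSON, before index structures, replicas, or the pipeline's own overhead.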
In all honesty, I believe a finely crafted native-code solution to this problem could achieve 3x lower RAM usage and 2-3x the indexing/search performance. Going beyond that is also possible, but would take remarkable engineering.
In the near future I can easily see someone producing a great open-source distributed search project that is at least on par with today's ES core feature set.
ES is good, but it takes a small army to stay on top of the performance and management. And then there are the upgrades, which fix a few bugs and introduce new ones too.
> I believe a finely crafted native-code solution to this problem could achieve 3x lower RAM usage and 2-3x the indexing/search performance
Many people believe that about everything written in Java. The story often ends up a lot more complicated, though. In the domains where it shines, Java is surprisingly hard to beat without very significant effort. And then you must also consider what the same significant effort would achieve if directed at the Java solution instead, or at cherry-picking particular performance hotspots out of it.
I’m basing this on my limited experience of rewriting things from optimized but messy C++/D to the JVM. Yes, the end result is much simpler, but it is memory-hungry and ~2x slower (after optimization and tuning). Sometimes you can fit Java into less than ~2x of the original footprint, but at the cost of burning CPU on frequent GC cycles.
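For what it's worth, that tradeoff is easy to observe directly. A minimal illustration (the heap size and jar name here are made up; the flags are standard HotSpot options): cap the heap near your target footprint, enable GC logging, and watch the collection frequency climb:

    java -Xms2g -Xmx2g -Xlog:gc -jar app.jar    # -Xlog:gc is JDK 9+; JDK 8 used -XX:+PrintGCDetails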
Not every application is the same, but the moment one involves manipulating a lot of data in Java, you fight the platform to get back the control you need. And at the end of the day there is a limit at which reasonable people stop and give up on dodging memory allocations, creatively reusing objects, and wrangling off-heap memory without the help of the type system.
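To make the off-heap part concrete, here is a minimal sketch (hypothetical code, not from any real project) of the style this forces on you: log records packed into a direct ByteBuffer by hand, with field offsets as bare integers the type system knows nothing about:

    import java.nio.ByteBuffer;

    // Sketch: fixed-layout "log records" stored off-heap, invisible to the GC.
    public class OffHeapRecords {
        private static final int REC_SIZE  = 16; // 8-byte timestamp + 4-byte level + 4-byte msgId
        private static final int TIMESTAMP = 0;  // field offsets tracked by hand
        private static final int LEVEL     = 8;
        private static final int MSG_ID    = 12;

        private final ByteBuffer buf;

        public OffHeapRecords(int capacity) {
            // allocateDirect places the data outside the Java heap
            this.buf = ByteBuffer.allocateDirect(capacity * REC_SIZE);
        }

        public void put(int i, long timestamp, int level, int msgId) {
            int base = i * REC_SIZE;             // zero allocations per record
            buf.putLong(base + TIMESTAMP, timestamp);
            buf.putInt(base + LEVEL, level);
            buf.putInt(base + MSG_ID, msgId);
        }

        public long timestamp(int i) { return buf.getLong(i * REC_SIZE + TIMESTAMP); }
        public int  level(int i)     { return buf.getInt(i * REC_SIZE + LEVEL); }

        public static void main(String[] args) {
            OffHeapRecords recs = new OffHeapRecords(1_000_000);
            recs.put(0, System.currentTimeMillis(), 3, 42);
            System.out.println(recs.timestamp(0) + " level=" + recs.level(0));
        }
    }

The records never touch the GC, but every offset is an unchecked integer: get one wrong and you silently read garbage. That is exactly the control-versus-safety trade described above.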
Update: to expand on the native-code point above: C++ solutions are typically closed source, but Rust and Go both have interesting open-source full-text engines: https://github.com/tantivy-search/tantivy and https://github.com/blevesearch/bleve