From the announcement “As of now, we have mined 1,580 PySpark tests from the Spa...

Kydlaw · 2024-09-10T12:00:15 1725969615

The next paragraph explains that: "When looking at the test coverage numbers alone, Sail’s capability may seem limited. But we have found that there is a long tail of failed tests due to formatting discrepancies, edge cases, and less-used SQL functions, which we will continue tackling in future releases."

I am with you that it is still very very early. I'll personally keep an eye on the project.

SpicyLemonZest · 2024-09-10T12:35:46 1725971746

I'll keep an eye on it too, but for a query engine formatting compliance and edge cases tend to be almost all of the work. It's easy to implement SELECT x FROM y WHERE z.

bburnett44 · 2024-09-10T12:40:27 1725972027

Yeah but the website literally says “zero code changes”. It’s the long tail that’s dangerous since most people don’t understand it as well as a the core functions