Test out early stop sort merge join to handle AS OF join? #360
Labels
future
This issue is in a backlog of ideas to possibly be done in the future
performance
Issues with the time to execute operations or other performance issues
I ran into some online benchmarks about AS OF join where in certain cases, "early stop sort merge join" can outperform UNION based AS OF join.
https://www.hopsworks.ai/post/a-spark-join-operator-for-point-in-time-correct-joins (fwiw, it mentioned tempo as the inspiration for the UNION based AS OF join)
open sourced implementations
https://github.com/Ackuq/spark-pit/blob/main/scala/src/main/scala/execution/Patterns.scala
Would be interested to see what the community / maintainers think.
The text was updated successfully, but these errors were encountered: