Can the Spark SQL optimizer eliminate unnecessary joins?
It looks like Spark SQL is unable to do this kind of optimization. Here’s my sample code:
How does Spark’s Catalyst optimizer optimize a query without executing the code?
I’m trying to find detailed documentation on how Spark’s Catalyst optimizer works, because I plan to customize it to reduce resource usage rather than computation time.
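
For what it's worth, the short version of "optimizing without executing" is that a rule-based optimizer like Catalyst works on a tree that describes the query, not on the data. Each rule pattern-matches a piece of the tree and returns an equivalent, cheaper tree, and rules are re-applied until nothing changes. Below is a toy sketch of that idea in plain Python. It is not Spark's API: the `Scan`, `LeftJoin`, and `Project` classes, the `unique_key` field, and the `eliminate_unused_left_join` rule are all hypothetical names made up for illustration, and I'm not claiming Spark ships this exact rule.

```python
from dataclasses import dataclass
from typing import Tuple


# Toy logical plan nodes (hypothetical, not Spark classes).
@dataclass(frozen=True)
class Scan:
    table: str
    columns: Tuple[str, ...]
    unique_key: str = ""  # assumption: a column known to be unique, "" if none


@dataclass(frozen=True)
class LeftJoin:
    left: object
    right: object
    left_key: str
    right_key: str


@dataclass(frozen=True)
class Project:
    columns: Tuple[str, ...]
    child: object


def output_columns(plan):
    """Columns a plan node produces (names assumed unique across tables)."""
    if isinstance(plan, (Scan, Project)):
        return set(plan.columns)
    if isinstance(plan, LeftJoin):
        return output_columns(plan.left) | output_columns(plan.right)
    raise TypeError(f"unknown plan node: {plan!r}")


def eliminate_unused_left_join(plan):
    """Rule: drop a LEFT JOIN under a Project when only left-side columns
    are selected and the right side is unique on its join key, so the join
    can neither add nor remove rows."""
    if isinstance(plan, Project) and isinstance(plan.child, LeftJoin):
        join = plan.child
        right_is_unique = (
            isinstance(join.right, Scan)
            and join.right.unique_key != ""
            and join.right.unique_key == join.right_key
        )
        if right_is_unique and set(plan.columns) <= output_columns(join.left):
            return Project(plan.columns, join.left)
    return plan


def transform_up(plan, rule):
    """Rewrite children first, then apply the rule to the node itself."""
    if isinstance(plan, Project):
        plan = Project(plan.columns, transform_up(plan.child, rule))
    elif isinstance(plan, LeftJoin):
        plan = LeftJoin(
            transform_up(plan.left, rule),
            transform_up(plan.right, rule),
            plan.left_key,
            plan.right_key,
        )
    return rule(plan)


def optimize(plan, rules, max_iterations=10):
    """Apply every rule repeatedly until the plan stops changing."""
    for _ in range(max_iterations):
        new_plan = plan
        for rule in rules:
            new_plan = transform_up(new_plan, rule)
        if new_plan == plan:  # fixed point reached
            break
        plan = new_plan
    return plan


# No data is read anywhere: only the description of the query is rewritten.
orders = Scan("orders", ("order_id", "customer_id", "amount"))
customers = Scan("customers", ("id", "name"), unique_key="id")
query = Project(("order_id", "amount"),
                LeftJoin(orders, customers, "customer_id", "id"))
optimized = optimize(query, [eliminate_unused_left_join])
print(optimized)
```

In this sketch the join disappears only because the plan itself says `customers.id` is unique; without that kind of metadata a rule cannot prove the join is unnecessary, which may be one reason an optimizer leaves such a join in place. To see what Spark actually did with a real query, `df.explain(True)` in PySpark prints the parsed, analyzed, optimized, and physical plans.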