最佳答案
What's the difference between selecting with a where clause and filtering in Spark?
Are there any use cases in which one is more appropriate than the other one?
When do I use
DataFrame newdf = df.select(df.col("*")).where(df.col("somecol").leq(10))
and when is
DataFrame newdf = df.select(df.col("*")).filter("somecol <= 10")
more appropriate?