What is the difference between cache and persist in Spark?
Cache () | Persist () |
---|---|
While using this, the default storage level is MEMORY_ONLY for RDD and MEMORY_AND_DISK for Dataset. | While using this, the user can use various storage levels for both RDD and Dataset. |
BY Best Interview Question ON 10 Jun 2020