Run Spark with Ignite Shared RDD on Large Volume of Data
In recent I'm running Spark MLLIb KMeans with Apach Ignite 2.6.0 shared RDD
on ten AWS r4.2xlarge workers.
It works and runs to finish on 1 billion points (within memory), but failed
with 2 billion points (exceeding available memory)
My code for loading data to Ignite Shared RDD is here: