Closed
Description
A job that defines the spark()
method isn't going to be able to serialize self
because self.stdin
, self.stdout
etc. are un-serializable, which we won't be able to serialize methods (no rdd.flatMap(self.some_method)
).
We should sandbox the job prior to running its spark()
method, just like we do in the Spark harness.