Practice More Questions From: Spark Lesson 3
Q:
Check all true statements about the Directed Acyclic Graph Scheduler
Q:
Why is building a DAG necessary in Spark but not in MapReduce?
Q:
What are the differences between an action and a transformation? Mark all that apply
Q:
Generally, which are good stages to mark a RDD for caching in memory?
Q:
What are good cases for using a broadcast variable? Mark all that apply
Q:
We would like to count the number of invalid entries in this example dataset: invalid = sc.accumulator(0)d = sc.parallelize([“3”, “23”, “S”, “99”, “TT”]).foreach(count_invalid) What would be a good implementation of the count_invalid function?
Subscribe
0 Comments