Why is building a DAG necessary in Spark but not in MapReduce?

Practice More Questions From: Spark Lesson 3

Q:

Check all true statements about the Directed Acyclic Graph Scheduler.

Q:

Why is building a DAG necessary in Spark but not in MapReduce?

Q:

What are the differences between an action and a transformation? Mark all that apply.

Q:

Generally, which are good stages to mark an RDD for caching in memory?

Q:

What are good cases for using a broadcast variable? Mark all that apply.

Q:

We would like to count the number of invalid entries in this example dataset:

invalid = sc.accumulator(0)
d = sc.parallelize(["3", "23", "S", "99", "TT"]).foreach(count_invalid)

What would be a good implementation of the count_invalid function?
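One possible shape for such a function is sketched below. It is not taken from the quiz answers. It assumes that "invalid" means "cannot be parsed as an integer", which the question does not state. The helper names is_invalid and run_with_spark are made up for illustration. Only sc.accumulator, Accumulator.add, parallelize and foreach are actual PySpark calls.

```python
def is_invalid(entry):
    """Return True when the string cannot be parsed as an integer.

    Assumption: the quiz treats non-integer strings as invalid entries.
    """
    try:
        int(entry)
        return False
    except ValueError:
        return True


def run_with_spark():
    """Hypothetical driver: count invalid entries with an accumulator."""
    # Imported here so the helper above can be used without Spark installed.
    from pyspark import SparkContext

    sc = SparkContext.getOrCreate()
    invalid = sc.accumulator(0)

    def count_invalid(entry):
        # Runs on the executors. Tasks may only add to an accumulator;
        # its value is read back on the driver.
        if is_invalid(entry):
            invalid.add(1)

    # foreach is an action, so the accumulator is updated when it runs.
    sc.parallelize(["3", "23", "S", "99", "TT"]).foreach(count_invalid)
    return invalid.value
```

Under that assumption, calling run_with_spark() on a working Spark installation should report 2 for this dataset, since only "S" and "TT" fail to parse as integers.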
