Question about final exam solution 2019 Exercise 4
Hi, so I wonder why for question 4.1 the traffice between network is 64 bytes? Also, for question 4.2, where does the (7/8) come from, and why we can assume the memory of a Person is (3/4)? For question 4.3, why do we only consider the cost of groupByKey(), but not considering the cost of .collect()? And why in this case we assume the memory of the person is (5/4) ? Lastly, for question 4.4, why is the answer 8 * 10 * 64?
Thank you, I just really don't understand where does these numbers come from in the solution.