Technical Perspective: Data Distribution For Fast Joins
What is the most drastic way to reduce the cost of communication for parallel data processing algorithms? This is the question studied in "Reasoning on Data Partitioning for Single-Round Multi-Join Evaluation in Massively Parallel Systems."