Parallel computing
Understanding Data Engineering
Hadrien Lacroix
Content Developer at DataCamp
Parallel computing
Basis of modern data processing tools
Why it is necessary:
Mainly because of memory: the data may not fit on a single computer
Also for processing power: several units can work at the same time
How it works:
Split tasks up into several smaller subtasks
Distribute these subtasks over several computers
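The two steps above can be sketched in a few lines of Python. This is a minimal illustration, not how any particular data processing tool does it: worker threads on one machine stand in for the "several computers", and the helper names (`split_into_chunks`, `sum_of_squares`, `parallel_sum_of_squares`) are hypothetical, chosen for this example.

```python
from concurrent.futures import ThreadPoolExecutor


def split_into_chunks(items, n_chunks):
    """Split a list into at most n_chunks contiguous, roughly equal chunks."""
    chunk_size = max(1, -(-len(items) // n_chunks))  # ceiling division
    return [items[i:i + chunk_size] for i in range(0, len(items), chunk_size)]


def sum_of_squares(chunk):
    """The subtask: each worker only sees its own chunk of the data."""
    return sum(x * x for x in chunk)


def parallel_sum_of_squares(items, n_workers=4):
    # Step 1: split the task into smaller subtasks.
    chunks = split_into_chunks(items, n_workers)
    # Step 2: distribute the subtasks over the workers
    # (threads here, standing in for separate computers).
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        partial_results = list(pool.map(sum_of_squares, chunks))
    # Merge the partial results into one final result.
    return sum(partial_results)


numbers = list(range(1, 101))
total = parallel_sum_of_squares(numbers, n_workers=4)
print(total)
```

Each worker handles only its own chunk, and the partial results are merged at the end. On a real cluster, a framework would take care of shipping the chunks to other machines.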
Image credit: Emojis by Mohamed Hassan
Benefits and risks of parallel computing
Analogy: employees = processing units
Advantages
Extra processing power
Reduced memory footprint per computer
Disadvantages
Moving data incurs a cost
Communication time: splitting tasks and merging results requires communication between processes
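The trade-off between extra processing power and communication cost can be made concrete with a toy cost model. Everything here is an assumption made for illustration: the function name `estimated_time`, the idea that communication cost grows linearly with the number of units, and the numbers themselves. Real systems behave in more complicated ways.

```python
def estimated_time(total_work_s, n_units, overhead_per_unit_s):
    """Toy estimate of wall-clock time for a task split over n_units.

    Assumption: the work divides evenly, and every additional unit
    adds a fixed communication cost (moving data, merging results).
    """
    compute = total_work_s / n_units
    communication = overhead_per_unit_s * (n_units - 1)
    return compute + communication


# Hypothetical numbers: 60 s of work, 2 s of communication per extra unit.
for n in (1, 2, 4, 8, 16):
    print(n, estimated_time(60.0, n, 2.0))
```

With these made-up numbers, four units give the shortest estimate of those tried (21 seconds), while sixteen units take longer (33.75 seconds) because communication outweighs the saved compute time. Adding units helps only as long as the work saved exceeds the cost of coordinating them.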
Summary
Benefits and risks
How it's implemented at Spotflix
Let's practice!