Parallel programmeren met Dask in Python
James Fulton
Climate Informatics Researcher
# Maak 2 delayed-objecten delayed_num1 = delayed(my_square_function)(3) delayed_num2 = delayed(my_square_function)(4) # Tel ze op result = delayed_num1 + delayed_num2# Visualiseer de taakgrafiek result.visualize()

delayed_intermediate = delayed(my_square_function)(3)
# Deze twee resultaten gebruiken beide delayed_intermediate_result
delayed_result1 = delayed_intermediate - 5
delayed_result2 = delayed_intermediate + 4
delayed_result1.visualize()

delayed_result2.visualize()
# Visualiseer de taakgrafiek
dask.visualize(delayed_result1, delayed_result2)

# Som uitvoeren op twee grote arrays
sum1 = delayed(np.sum)(big_array1)
sum2 = delayed(np.sum)(big_array2)
# Berekenen met processen
dask.compute(sum1, sum2)

# Som uitvoeren op twee grote arrays
sum1 = delayed(np.sum)(big_array1)
sum2 = delayed(np.sum)(big_array2)
# Berekenen met threads
dask.compute(sum1, sum2)

Global Interpreter Lock: slechts één thread kan tegelijk de Python-code uitvoeren
def sum_to_n(n):
"""Sums numbers from 0 to n"""
total = 0
for i in range(n+1):
total += i
return total
sum1 = delayed(sum_to_n)(1000)
sum2 = delayed(sum_to_n)(1000)

pd.read_csv() laat de GIL losdf1 = delayed(pd.read_csv)('file1.csv')
df2 = delayed(pd.read_csv)('file2.csv')

Parallel programmeren met Dask in Python