Programmazione parallela con Dask in Python
James Fulton
Climate Informatics Researcher
# Crea 2 oggetti delayed delayed_num1 = delayed(my_square_function)(3) delayed_num2 = delayed(my_square_function)(4) # Somma result = delayed_num1 + delayed_num2# Visualizza il task graph result.visualize()

delayed_intermediate = delayed(my_square_function)(3)
# Entrambi i risultati usano delayed_intermediate_result
delayed_result1 = delayed_intermediate - 5
delayed_result2 = delayed_intermediate + 4
delayed_result1.visualize()

delayed_result2.visualize()
# Visualizza il task graph
dask.visualize(delayed_result1, delayed_result2)

# Somma su due array grandi
sum1 = delayed(np.sum)(big_array1)
sum2 = delayed(np.sum)(big_array2)
# Calcola con processi
dask.compute(sum1, sum2)

# Somma su due array grandi
sum1 = delayed(np.sum)(big_array1)
sum2 = delayed(np.sum)(big_array2)
# Calcola con thread
dask.compute(sum1, sum2)

Global Interpreter Lock: un solo thread può leggere lo script Python alla volta
def sum_to_n(n):
"""Somma i numeri da 0 a n"""
total = 0
for i in range(n+1):
total += i
return total
sum1 = delayed(sum_to_n)(1000)
sum2 = delayed(sum_to_n)(1000)

pd.read_csv() rilascia il GILdf1 = delayed(pd.read_csv)('file1.csv')
df2 = delayed(pd.read_csv)('file2.csv')

Programmazione parallela con Dask in Python