Window Functions in Snowflake
Jake Roach
Field Data Engineer
Qui classifichiamo tutti i record nel result set!
user_id | event_name | km_traveled | closest_attendees
------- | --------------- | ------------- | -----------------
user_81 | Lunar Drift | 0.5 | 1
user_02 | Lunar Drift | 1.1 | 2
user_33 | Crimson Arc | 8.6 | 3
user_33 | Neon Prophet | 8.6 | 3
user_15 | VibeStorm | 17 | 5
user_94 | The Dusk Owls | 41 | 6
user_47 | Lunar Drift | 61 | 7
user_56 | Crimson Arc | 116 | 8
Ora vogliamo classificare i dati per ogni finestra.

SELECT
user_id,
event_name,
distance_traveled,
RANK() OVER(
-- Crea finestra per event_name
PARTITION BY event_name
ORDER BY km_traveled
) AS closest_concert_goer
FROM CONCERTS.attendance;
PARTITION BY ci permette di creare finestre di record su cui applicare funzioni
$$
OVER(...), PARTITION BY va prima di ORDER BYGROUP BY, ma non "collassa" i recordSELECT
level,
price,
RANK() OVER(
PARTITION BY level
ORDER BY price DESC
) AS price_rank
FROM CONCERTS.attendance;
PARTITION BY crea finestre level | price | price_rank
-------- | --------- | -----------
100 | 765 | 1
100 | 617 | 2
100 | 490 | 3
100 | 490 | 3
...
200 | 212 | 1
200 | 207 | 2
...
FIRST_VALUE(<1>) OVER(
PARTITION BY <2>
ORDER BY <3>
) AS <alias>
FIRST_VALUE aiuta a trovare il primo valore in una finestra
<1>: quale colonna restituire
<2>: campo per partizionare i dati
<3>: campo per determinare il primo record
AVG(<1>) OVER(
PARTITION BY <2>
-- No need to ORDER BY!
) AS <alias>
AVG calcola la media di un campo per ogni finestra
<1>: colonna di cui fare la media
<2>: campo per partizionare i dati
$$
... non serve ORDER BY!
SELECT user_id, event_name, satisfaction_score,FIRST_VALUE(satisfaction_score) OVER( PARTITION BY event_name -- Punteggio soddisfazione del fan più vicino ORDER BY km_traveled ) AS first_score,-- Trova il punteggio medio di soddisfazione per una "finestra" di record AVG(satisfaction_score) OVER( PARTITION BY event_name ) AS average_scoreFROM CONCERTS.attendance;
user_id | event_name | satisfaction_score | first_score | average_score
--------- | -------------- | -------------------- | ------------- | ---------------
user_26 | Pulse Theory | 71 | 98 | 84.5
user_92 | Pulse Theory | 98 | 98 | 84.5
...
user_57 | Nova Sway | 4 | 22 | 29.3
user_39 | Nova Sway | 22 | 22 | 29.3
user_44 | Nova Sway | 62 | 22 | 29.3
...
Window Functions in Snowflake