Manipulasi Data di Snowflake
Jake Roach
Field Data Engineer
Subkueri memungkinkan hasil dari satu kueri dipakai oleh kueri lain.
$$
$$
FROM ( ... ) atau WHERE ... IN ( ... )

SELECT
...
-- Ambil dari kueri, bukan dari tabel
FROM (
-- Buat himpunan hasil yang akan
-- dipakai oleh kueri utama
SELECT
<fields>
FROM <table>
WHERE ...
);
Ambil data dari hasil kueri lain alih-alih langsung dari tabel.
$$
JOIN, WHERE, dll.SELECT
month_num,
-- windchill - temperature harus dipakai dua kali di sini. Bagaimana jika rumusnya berubah?
AVG(windchill - temperature) AS avg_differential
MIN(windchill - temperature) AS most_differential
FROM weather
WHERE
-- Penyaringan terjadi di kueri yang sama dengan agregasi/analisis
season = 'Winter' AND
temperature < 32
GROUP BY month_num;
-- Mulai dengan subkueri, lalu agregasiSELECT month_num, AVG(differential) AS avg_differential MIN(differential) AS most_differential FROM (SELECT month_num, windchill - temperature AS differential FROM weather WHERE season = 'Winter' AND temperature < 32) GROUP BY month_num;
| month_num | differential |
| --------- | ------------ |
| 12 | -12 |
| 1 | -3 |
| 1 | 0 |
| 2 | -7 |
| month_num | avg_differential | most_differential |
| --------- | ---------------- | ----------------- |
| 12 | -5.77 | -14 |
| 1 | -1.91 | -8 |
| 2 | -8.13 | -22 |
Analisis mudah dipahami dan diubah setelah datanya dibersihkan.
...
-- Filter berdasarkan rekaman dengan nilai
-- dalam hasil subkueri
WHERE <field> IN (
SELECT <other-field> FROM ...
);
Buat himpunan hasil kecil untuk transformasi, filter, atau manipulasi data.
$$
IN himpunan hasil non-konstanAVG, MAX, MIN, dll.SELECT
todays_date,
temperature,
status
FROM weather
WHERE todays_date IN ( -- Filter semua hari dengan laga kandang yang dimenangkan
SELECT
game_date
FROM game_schedule
WHERE stadium = 'Home' AND did_win = TRUE
);

Manipulasi Data di Snowflake