Meratakan Data Semi-Terstruktur

Tipe Data dan Fungsi di Snowflake

Jake Roach

Field Data Engineer

Data terstruktur

     school_id  |    school_name    |  street_number  |  street_name  |  suffix   |      city      |  zip_code
    ----------- | ----------------- | --------------- | ------------- | --------- | -------------- | ----------
      s_19219   |  West Aurora HS   |       879       |    Main       |    St.    |  West Aurora   |   25041
      s_77465   |  Springtown HS    |      1645       |    Cherry     |    Rd.    |  Springtown    |   14556
     school_id  |                address_info        
    ----------- | ------------------------------------------
      s_19219   |    {
                |        "school_name": "West Aurora HS",
                |        "address": {
                |            "street_number": 879,
                |            "street_name": "Main",
                |            "suffix":  "St."
                |            "city": "West Aurora",
                |            "zip_code": 25041
                |        }
                |    }
Tipe Data dan Fungsi di Snowflake

Data semi-terstruktur

Data disimpan dengan kurung kurawal dalam pasangan kunci-nilai bertipe VARIANT

{
    "school_name": "West Aurora HS",
    "address": {  -- Nested object
        "street_number": 879,
        "street_name": "Main",
        "suffix": "St.",
        "city": "West Aurora",
        "zip_code": 25041
    }
}

$$

$$

$$

  • Mirip dictionary Python atau objek JSON
  • Memungkinkan penyimpanan data mentah
  • Dapat menanam objek, seperti address
  • Ambil data dengan dua cara
Tipe Data dan Fungsi di Snowflake

Notasi titik

                  my_column  
 -------------------------------------------
        {
            "my_first_key": 2025,
            "my_second_key": {
                "a": "alpha",
                "b": "bravo"
            }
        }
SELECT

my_column:my_first_key -- Level atas
my_column:my_second_key.a -- Bertingkat my_column:my_second_key.b -- Bertingkat
...

Memudahkan mengambil nilai level atas dan bertindak dari data VARIANT

$$

  • Titik dua memisahkan <column-name>:<top-level-key>
  • Tambah . lalu field bertingkat, <column-name>:<top-level-key>.<nested-key>
  • Ambil nilai yang sangat bertingkat
Tipe Data dan Fungsi di Snowflake

Notasi titik

SELECT
    address_info:school_name,                              -- Level atas, notasi titik

    address_info:address.street_number AS street_number,   -- Bertingkat, notasi titik
    address_info:address.street_name AS street_name,
    address_info:address.suffix AS suffix

FROM SCHOOLS.school_info;
              school_name    |  street_number  |  street_name  |  suffix  
           ----------------- | --------------- | ------------- | ---------
            West Aurora HS   |       879       |    Main       |    St.   
            Springtown HS    |      1645       |    Cherry     |    Rd.
Tipe Data dan Fungsi di Snowflake

Notasi bracket

Memberikan teknik tambahan untuk mengambil nilai level atas dan bertingkat

$$

  • <column-name>['<top-level-key']['...']
  • Banyak lapisan bertingkat
  • Mirip mengambil data dari dictionary Python
  • Pastikan gunakan tanda petik tunggal (')!
                    my_column  
 ------------------------------------------------
          {
              "my_first_key": 2025,
               my_second_key": {
                  "a": "alpha",
                  "b": "bravo"
              }
          }
SELECT

my_column['my_first_key'], -- Level atas
my_column['my_second_key']['a'] -- Bertingkat my_column['my_second_key']['b'] -- Bertingkat
...
Tipe Data dan Fungsi di Snowflake

Notasi bracket

SELECT

address_info['school_name'], -- Level atas, notasi bracket
address_info['address']['city'] AS city, -- Bertingkat, notasi bracket address_info['address']['zip_code'] AS zip_code
FROM SCHOOLS.school_info;
                     school_name    |      city      |  zip_code
                  ----------------- | -------------- | ----------
                   West Aurora HS   |  West Aurora   |   25041
                   Springtown HS    |  Springtown    |   14556
Tipe Data dan Fungsi di Snowflake

Mengubah data semi-terstruktur

SELECT
    school_id,

    address_info:school_name AS school_name,               -- Level atas, notasi titik

    address_info:address.street_number AS street_number,   -- Bertingkat, notasi titik
    address_info:address.street_name AS street_name,
    address_info:address.suffix AS suffix,

    address_info['address']['city'] AS city,               -- Bertingkat, notasi bracket
    address_info['address']['zip_code'] AS zip_code

FROM SCHOOLS.school_info;
Tipe Data dan Fungsi di Snowflake

Mengubah data semi-terstruktur

     school_id  |                address_info        
    ----------- | ------------------------------------------
      s_19219   |    {
                |        "school_name": "West Aurora HS",
                |        "address": {
                |            "street_number": 879,
                |            "street_name": "Main",
                |            "suffix":  "St."
                |            "city": "West Aurora",
                |            "zip_code": 25041
                |        }
                |    }
     school_id  |    school_name    |  street_number  |  street_name  |  suffix   |      city      |  zip_code
    ----------- | ----------------- | --------------- | ------------- | --------- | -------------- | ----------
      s_19219   |  West Aurora HS   |       879       |    Main       |    St.    |  West Aurora   |   25041
      s_77465   |  Springtown HS    |      1645       |    Cherry     |    Rd.    |  Springtown    |   14556
Tipe Data dan Fungsi di Snowflake

Ayo berlatih!

Tipe Data dan Fungsi di Snowflake

Preparing Video For Download...