Exploratory Data Analysis in SQL
Christina Maimone
Data Scientist
character(n) or char(n)
character varying(n) or varchar(n)
text or varchar
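The difference between the three types can be seen by casting literals. This is a minimal sketch, assuming PostgreSQL; the column aliases are illustrative and not part of the course material.

```sql
-- Assumes PostgreSQL; aliases are hypothetical names chosen for illustration.
SELECT 'ab'::char(5)               AS fixed_len,      -- fixed length: padded with spaces to 5 characters
       octet_length('ab'::char(5)) AS fixed_bytes,    -- should be 5, since the padding is stored
       'ab'::varchar(5)            AS var_len,        -- variable length up to 5: stored as 'ab'
       'abcdefgh'::varchar(5)      AS cast_truncated, -- an explicit cast truncates to 'abcde'
       'any length'::text          AS unlimited;      -- text has no declared length limit
```

Inserting an over-length value into a char(n) or varchar(n) column, as opposed to casting it explicitly, raises an error unless the excess characters are all spaces.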
Categorical
Tues, Tuesday, Mon, TH
shirts, shoes, hats, pants
satisfied, very satisfied, unsatisfied
0349-938, 1254-001, 5477-651
red, blue, green, yellow
Unstructured text
I really like this product. I use it every day. It's my favorite color.
We've redesigned your favorite t-shirt to make it even better. You'll love...
Four score and seven years ago our fathers brought forth on this continent, a new nation, conceived in Liberty, and dedicated to the proposition that all men are created equal...
SELECT category,      -- categorical variable
       count(*)       -- count rows for each category
  FROM product        -- table
 GROUP BY category;   -- categorical variable
 category | count
----------+-------
 Banana   |     1
 Apple    |     4
 apple    |     2
  apple   |     1
 banana   |     3
(5 rows)
SELECT category,        -- categorical variable
       count(*)         -- count rows for each category
  FROM product          -- table
 GROUP BY category      -- categorical variable
 ORDER BY count DESC;   -- show most frequent values first
 category | count
----------+-------
 Apple    |     4
 banana   |     3
 apple    |     2
 Banana   |     1
  apple   |     1
(5 rows)
SELECT category,      -- categorical variable
       count(*)       -- count rows for each category
  FROM product        -- table
 GROUP BY category    -- categorical variable
 ORDER BY category;   -- order by categorical variable
 category | count
----------+-------
  apple   |     1
 Apple    |     4
 Banana   |     1
 apple    |     2
 banana   |     3
(5 rows)
-- Results
 category | count
----------+-------
  apple   |     1
 Apple    |     4
 Banana   |     1
 apple    |     2
 banana   |     3
(5 rows)
-- Alphabetical order:
' ' < 'A' < 'a'
-- From the results
' ' < 'A' < 'B' < 'a' < 'b'
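This ordering can be reproduced without the product table. The sketch below assumes PostgreSQL and forces the byte-wise "C" collation, which is an assumption added here; under other collations (many locale-aware ones, for example) the rows may sort differently.

```sql
-- Assumes PostgreSQL. The VALUES list stands in for the category column;
-- COLLATE "C" requests byte-wise comparison so space < uppercase < lowercase.
SELECT val
  FROM (VALUES ('banana'), ('apple'), ('Banana'), (' apple'), ('Apple')) AS t(val)
 ORDER BY val COLLATE "C";
-- Rows come back as: ' apple', 'Apple', 'Banana', 'apple', 'banana'
```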
Case matters
'apple' != 'Apple'
Spaces count
' apple' != 'apple'
'' != ' '
Empty strings aren't null
'' != NULL
Punctuation differences
'to-do' != 'to–do'
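Each of these pitfalls can be checked directly with literal comparisons. This is a minimal sketch, assuming PostgreSQL, where untyped string literals are compared as text; the column aliases are illustrative only.

```sql
-- Assumes PostgreSQL; every column below should come back false.
SELECT 'apple' = 'Apple'  AS same_case,         -- case differs
       ' apple' = 'apple' AS same_spacing,      -- leading space is part of the value
       '' = ' '           AS empty_vs_space,    -- empty string vs. a single space
       '' IS NULL         AS empty_is_null,     -- an empty string is a value, not NULL
       'to-do' = 'to–do'  AS same_punctuation;  -- hyphen vs. en dash
```

The NULL check uses IS NULL rather than = or !=, because comparing any value to NULL with those operators yields NULL instead of true or false.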