Data Manipulation in Julia
Katerina Zahradova
Instructor
# Companies in different countries
choc_comp_france = filter(row -> row.company_location == "France", chocolates)
choc_comp_brazil = filter(row -> row.company_location == "Brazil", chocolates)
choc_comp_peru = filter(row -> row.company_location == "Peru", chocolates)
choc_comp_belgium = filter(row -> row.company_location == "Belgium", chocolates)
...


# Group by company_location
groupby(chocolates, :company_location)
GroupedDataFrame with 60 groups based on key: company_location
First Group (156 rows): company_location = "France"
Row company bean_origin ...
String String ...
_______________________________
1 A. Morin Agua Grande ...
...
Last Group (4 rows): company_location = "Ireland"
Row company bean_origin ...
String String ...
_______________________________
1 Wilkie's Organic Amazonas ...
groupby(chocolates, [:company_location, :cocoa])
GroupedDataFrame with 373 groups based on keys: company_location, cocoa
First Group (5 rows): company_location = "France", cocoa = 63.0
Row company bean_origin ...
String String ...
_______________________________
1 A. Morin Agua Grande ...
...
Last Group (1 row): company_location = "Ireland", cocoa = 89.0
Row company bean_origin ...
String String ...
_______________________________
1 Wilkie's Organic Amazonas ...
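
The grouping calls above can be tried on a small table. The following is a minimal sketch, assuming DataFrames.jl is installed; the `toy` DataFrame and its values are hypothetical stand-ins for `chocolates`, not the course data.

```julia
using DataFrames

# Hypothetical toy data standing in for the chocolates DataFrame
toy = DataFrame(
    company = ["A", "B", "C", "D", "E"],
    company_location = ["France", "Brazil", "France", "Peru", "France"],
    cocoa = [63.0, 70.0, 70.0, 75.0, 63.0],
)

# One groupby call replaces a separate filter per country
by_location = groupby(toy, :company_location)

n_groups = length(by_location)        # number of distinct locations
france = by_location[("France",)]     # look up one group by its key value
n_france = nrow(france)               # rows belonging to that group

# Grouping by two columns gives one group per observed combination
by_location_cocoa = groupby(toy, [:company_location, :cocoa])
n_pairs = length(by_location_cocoa)
```

Indexing the grouped object with a tuple of key values returns that group as a view of the parent table, so no per-country copy is made.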

# Group by country
chocolates_by_country = groupby(chocolates, :company_location)
# Number of rows per group
combine(chocolates_by_country, nrow => :count)
60×2 DataFrame
Row company_location count
String31 Int64
______________________________
1 France 156
2 U.S.A. 764
3 Fiji 4
...
# Group by country
chocolates_by_country = groupby(chocolates, :company_location)
# Sort by number of rows
sort(combine(chocolates_by_country, nrow => :count), :count, rev = true)
60×2 DataFrame
Row company_location count
String31 Int64
______________________________
1 U.S.A. 764
2 France 156
3 Canada 125
...
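
The count-and-sort pattern can be checked end to end on a few rows. This is a sketch assuming DataFrames.jl; the `toy` table is hypothetical. It also shows how the name of the count column depends on whether `nrow` is passed bare or as `nrow => :count`.

```julia
using DataFrames

# Hypothetical toy data: one row per chocolate bar
toy = DataFrame(
    company_location = ["France", "U.S.A.", "U.S.A.", "Fiji", "U.S.A.", "France"],
)

by_country = groupby(toy, :company_location)

# nrow => :count names the result column :count
counts = combine(by_country, nrow => :count)

# Bare nrow falls back to the default column name :nrow
default_counts = combine(by_country, nrow)

# Largest groups first
sorted_counts = sort(counts, :count, rev = true)
```

Before sorting, `combine` returns the groups in the order they first appear in the data, which is why an explicit `sort` is needed to rank the countries.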
# Unique elements
unique(chocolates.company_location)
60-element Vector{String31}:
"France"
"U.S.A."
"Fiji"
"Ecuador"
"Mexico"
"Switzerland"
"Netherlands"
...
# Create a new DataFrame containing only unique rows
unique(chocolates)
1795×10 DataFrame
Row company bean_origin ...
String String ...
_________________________________
1 A. Morin Agua Grande ...
...
# Rows with unique companies
unique(chocolates, :company)
416×10 DataFrame
Row company bean_origin ...
String String ...
_________________________________
1 A. Morin Agua Grande ...
...
# Rows with unique company AND cocoa combinations
unique(chocolates, [:company, :cocoa])
968×10 DataFrame
Row company bean_origin ...
String String ...
_________________________________
1 A. Morin Agua Grande ...
...
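
The three forms of `unique` differ in which columns decide whether a row is a duplicate. The sketch below assumes DataFrames.jl and uses a hypothetical `toy` table. In each case the first matching row is the one kept, and all of its columns are returned.

```julia
using DataFrames

# Hypothetical toy data with one fully duplicated row (rows 1 and 2)
toy = DataFrame(
    company = ["A", "A", "A", "B", "B"],
    cocoa = [70.0, 70.0, 75.0, 70.0, 70.0],
    rating = [3.5, 3.5, 3.0, 4.0, 3.75],
)

# Compare whole rows: only the exact duplicate is dropped
all_unique = unique(toy)

# Compare only :company: the first row of each company is kept
by_company = unique(toy, :company)

# Compare the (:company, :cocoa) pair: the first row of each combination is kept
by_pair = unique(toy, [:company, :cocoa])
```

Because only the first occurrence survives, `unique(toy, :company)` gives one representative row per company rather than any summary of that company's rows.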