Data Manipulation in Julia
Katerina Zahradova
Instructor
# Using strings
df[:, "col 1"]
df[:, "col2"]
# Using symbols
df[:, Symbol("col 1")]
df[:, :col2]
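A minimal sketch of the two indexing styles above, assuming the DataFrames.jl package is installed. The toy table and its values are hypothetical and only mirror the column names on the slide.

```julia
using DataFrames

# Hypothetical toy table: "col 1" contains a space, "col2" does not
df = DataFrame("col 1" => [1, 2, 3], "col2" => ["a", "b", "c"])

# Indexing by string and by symbol selects the same column
by_string = df[:, "col 1"]
by_symbol = df[:, Symbol("col 1")]

# A name without spaces can use the short :name literal
col2 = df[:, :col2]
```

Names containing spaces need the Symbol("...") constructor, which is why strings are often the more convenient choice for such columns.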
# Using first()
println(first(penguins))
Row species island culmen_l_mm ...
String15 String15 String7? ...
____________________________________________
1 Adelie Torgersen 39.1
# Using describe
describe(penguins)
7×7 DataFrame
Row variable ... nmissing ...
Symbol ... Int64 ...
______________________________________
1 species ... 0 ...
2 island ... 0 ...
3 culmen_l_mm ... 10 ...
4 culmen_d_mm ... 10 ...
5 flipper_l_mm ... 10 ...
...
# Describe
describe(penguins)
Row variable mean min ...
Symbol Nothing Union ...
________________________________________________
1 species Adelie
2 island Biscoe
3 culmen_l_mm 32.1 34.7
4 culmen_d_mm 13.1 16
5 flipper_l_mm 205.4 165
...
# Describe using only some columns
describe(penguins, :nmissing, :eltype)
Row variable nmissing eltype
Symbol Int64 DataType
________________________________________________
1 species 0 String15
2 island 0 String15
3 culmen_l_mm 10 Float64
4 culmen_d_mm 10 Float64
5 flipper_l_mm 10 Float64
...
# Using sum
describe(penguins, sum => :total)
7×2 DataFrame
Row variable total
Symbol Union
________________________________________
1 species
2 island
3 culmen_l_mm 15136.6
4 culmen_d_mm 5163.4
...
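The describe calls above can be tried on a small table. This is a sketch under stated assumptions: DataFrames.jl is installed, and the table, its column names, and its values are hypothetical stand-ins for the penguins data.

```julia
using DataFrames

# Hypothetical stand-in for the penguins table, with one missing measurement
toy = DataFrame(
    species = ["Adelie", "Adelie", "Gentoo"],
    length_mm = [39.1, missing, 40.3],
)

# Built-in statistics are requested by symbol; a custom one is passed as function => name
stats = describe(toy, :nmissing, sum => :total)

# One row per column of toy, listed in the variable column
variables = stats.variable
missing_counts = stats.nmissing
```

The blank totals for species and island in the output above suggest that a statistic which cannot be computed for a column, such as the sum of strings, is left empty rather than raising an error.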