Data manipulatie in Julia
Katerina Zahradova
Instructor
# Alle kolommen uit chocolates selecteren behalve review_date
select(chocolates, :company, :bean_origin, :REF, :cocoa,
:company_location, :ratings, :bean_type, :bean_location)
ArgumentError: column name "ratings" not found in the data frame;
existing most similar names are: "rating"
...
# Alle kolommen uit chocolates selecteren behalve review_date
select(chocolates, Not(:review_date))
1795×9 DataFrame
Row company bean_origin REF cocoa company_location rating bean_type bean_location
String String Int64 Float64 String31 Float64 String31? String31?
______________________________________________________________________________________
1 A. Morin Agua Grande 1876 63.0 France 3.75 Sao Tome
...
# Dezelfde kolom twee keer droppen
select!(chocolates, Not(:review_date))
# Enkele regels later ...
select(chocolates, Not(:review_date))
ArgumentError: column name :review_date not found in the data frame
# Cols gebruiken
select(chocolates, Not(Cols(==("review_date"))))
1795×9 DataFrame
Row company bean_origin REF cocoa company_location rating bean_type bean_location
String String Int64 Float64 String31 Float64 String31? String31?
...
# Regex gebruiken
select(chocolates, Not(r"review\_date"))
1795×9 DataFrame
Row company bean_origin REF cocoa company_location rating bean_type bean_location
String String Int64 Float64 String31 Float64 String31? String31?
...
# Vooraf geselecteerde chocolades
first(chocolates)
1795x5 DataFrame
Row company review_date cocoa rating bean_location
String Int64 Float64 Float64 String31?
__________________________________________________________
1 A. Morin 2016 63.0 3.75 Sao Tome
# cocoa naar links verplaatsen
select(chocolates, :cocoa, :)
1795x5 DataFrame
Row cocoa company review_date rating bean_location
Float64 String Int64 Float64 String31?
__________________________________________________________
1 63.0 A. Morin 2016 3.75 Sao Tome
...
# cocoa en rating naar links verplaatsen
select(chocolates, :cocoa, :rating, :)
1795x5 DataFrame
Row cocoa rating company review_date bean_location
Float64 Float64 String Int64 String31?
__________________________________________________________
1 63.0 3.75 A. Morin 2016 Sao Tome
...
# company naar rechts verplaatsen
select(chocolates, Not(:company), :company)
1795x5 DataFrame
Row review_date cocoa rating bean_location company
Int64 Float64 Float64 String31? String
__________________________________________________________
1 2016 63.0 3.75 Sao Tome A. Morin
...
# De twee verplaatsingen combineren
select(chocolates, :cocoa, :rating, Not(:company), :company)
1795x5 DataFrame
Row cocoa rating bean_location review_date company
Float64 Float64 String31 Int64 String
__________________________________________________________
1 63.0 3.75 Sao Tome 2016 A. Morin
...
# Herorden en review_date droppen
# De twee verplaatsingen combineren
select(chocolates, :cocoa, :rating, Not([:company, :review_date]), :company)
1795x4 DataFrame
Row cocoa rating bean_location company
Float64 Float64 String31 String
__________________________________________________________
1 63.0 3.75 Sao Tome A. Morin
...
Kolommen droppen:
# Drop col1 en col 2
select(df, Not([:col1, "col 2"]))
Kolommen veilig droppen:
# Drop col1 en col 2 die mogelijk niet bestaan
select(df, Not(r"col1"), Not(Cols(==("col 2"))))
Kolommen naar links verplaatsen
# Verplaats col1 en col 2 naar links
select(df, :col1, "col 2", :)
Kolommen naar rechts verplaatsen
# Verplaats col1 en col 2 naar rechts
select(df, Not([:col1, "col 2"]) ,:col1, "col 2")
Data manipulatie in Julia