Exploratory Data Analysis in R
Andrew Bray
Assistant Professor, Reed College
options(scipen = 999, digits = 3) # Simplify display format
tab_cnt <- table(comics$id, comics$align)
tab_cnt
Bad Good Neutral
No Dual 474 647 390
Public 2172 2930 965
Secret 4493 2475 959
Unknown 7 0 2
prop.table(tab_cnt)
Bad Good Neutral
No Dual 0.030553 0.041704 0.025139
Public 0.140003 0.188862 0.062202
Secret 0.289609 0.159533 0.061815
Unknown 0.000451 0.000000 0.000129
sum(prop.table(tab_cnt))
1
prop.table(tab_cnt, 1)
Bad Good Neutral
No Dual 0.314 0.428 0.258
Public 0.358 0.483 0.159
Secret 0.567 0.312 0.121
Unknown 0.778 0.000 0.222
prop.table(tab_cnt, 2)
Bad Good Neutral
No Dual 0.066331 0.106907 0.168394
Public 0.303946 0.484137 0.416667
Secret 0.628743 0.408956 0.414076
Unknown 0.000980 0.000000 0.000864
ggplot(comics, aes(x = id, fill = align)) +
geom_bar(position = "fill") +
ylab("proportion")
ggplot(comics, aes(x = id, fill = align)) +
geom_bar(position = "fill") +
ylab("proportion")
ggplot(comics, aes(x = id, fill = align)) +
geom_bar(position = "fill") +
ylab("proportion")
ggplot(comics, aes(x = align, fill = id)) +
geom_bar(position = "fill") +
ylab("proportion")
ggplot(comics, aes(x = align, fill = id)) +
geom_bar(position = "fill") +
ylab("proportion")
Exploratory Data Analysis in R