Visualization with scatterplots

Analyzing Survey Data in R

Kelly McConville

Assistant Professor of Statistics

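library(dplyr)
library(NHANES)
# Keep infants aged 6 months or younger and the two variables to plot
babies <- filter(NHANESraw, AgeMonths <= 6) %>%
  select(AgeMonths, HeadCirc)
babies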
# A tibble: 484 x 2
   AgeMonths HeadCirc
       <int>    <dbl>
 1         3     42.7
 2         4     42.8
 3         2     38.8
 4         0     36.0
 5         5     42.7
 6         2     41.9
 7         6     44.3
 8         3     42.0
 9         2     41.3
10         1     38.9
# ... with 474 more rows

Scatterplots

library(ggplot2)
ggplot(data = babies, mapping = aes(x = AgeMonths, y = HeadCirc)) + 
  geom_point()

Scatterplot of age versus head circumference


Jittering

ggplot(data = babies, mapping = aes(x = AgeMonths, y = HeadCirc)) + 
  geom_jitter(width = 0.3, height = 0)

Jittered scatterplot of age versus head circumference
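
A rough sketch of what jittering does, written in base R rather than ggplot2: each x value is shifted by a small random amount so that points sharing the same age in months no longer sit on top of each other. The `age` vector and the seed are illustrative assumptions, not course data. With `width = 0.3`, points move at most 0.3 units left or right, and `height = 0` leaves head circumference unchanged.

```r
# Hypothetical ages in months; several babies share the same value
age <- c(0, 1, 2, 2, 3, 3)

set.seed(1)  # arbitrary seed so the sketch is reproducible

# Shift each age by uniform noise in [-0.3, 0.3], mimicking width = 0.3
jittered_age <- age + runif(length(age), min = -0.3, max = 0.3)

# No point moves more than 0.3 units from its true age
max(abs(jittered_age - age))
```

Keeping the width below 0.5 means each jittered point still sits closest to its true integer age.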

babies <- filter(NHANESraw, AgeMonths <= 6) %>%
  select(AgeMonths, HeadCirc, WTMEC4YR)
babies
# A tibble: 484 x 3
   AgeMonths HeadCirc WTMEC4YR
       <int>    <dbl>    <dbl>
 1         3     42.7    12915
 2         4     42.8    12791
 3         2     38.8     2359
 4         0     36.0     4306
 5         5     42.7     2922
 6         2     41.9     5561
 7         6     44.3    10416
 8         3     42.0     9957
 9         2     41.3     4503
10         1     38.9     3718
# ... with 474 more rows

Bubble plots

ggplot(data = babies, mapping = aes(x = AgeMonths, y = HeadCirc,
                                    size = WTMEC4YR)) + 
  geom_jitter(width = 0.3, height = 0) + 
  guides(size = "none")

Bubble plot of age versus head circumference


Bubble plots

ggplot(data = babies, mapping = aes(x = AgeMonths, y = HeadCirc,
                                    size = WTMEC4YR)) + 
  geom_jitter(width = 0.3, height = 0, alpha = 0.3) + 
  guides(size = "none")

Bubble plot of age versus head circumference with semi-transparent points


Survey-weighted scatterplots

ggplot(data = babies, mapping = aes(x = AgeMonths, y = HeadCirc,
                                    color = WTMEC4YR)) + 
  geom_jitter(width = 0.3, height = 0) + 
  guides(color = "none")

Scatterplot of age versus head circumference where color represents weights


Survey-weighted scatterplots

ggplot(data = babies, mapping = aes(x = AgeMonths, y = HeadCirc,
                                    alpha = WTMEC4YR)) + 
  geom_jitter(width = 0.3, height = 0) + 
  guides(alpha = "none")

Scatterplot of age versus head circumference where transparency represents weights
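
To make the transparency mapping concrete, here is a minimal base R sketch of how a continuous weight might be rescaled to an opacity value. It assumes ggplot2's default alpha range of 0.1 to 1 and a simple linear rescaling; the helper names `alpha_range`, `scaled` and `alpha` are illustrative, and the weights are the first five rows of the `babies` output shown earlier.

```r
# First five survey weights from the babies output
weights <- c(12915, 12791, 2359, 4306, 2922)

# Assumed output range for opacity (ggplot2's default for scale_alpha)
alpha_range <- c(0.1, 1)

# Linearly rescale the weights to [0, 1], then stretch to the alpha range
scaled <- (weights - min(weights)) / (max(weights) - min(weights))
alpha <- alpha_range[1] + scaled * diff(alpha_range)

round(alpha, 2)
```

Under this mapping, respondents with larger weights, who represent more people in the population, are drawn more opaquely, while low-weight respondents fade toward the background.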


Let's practice!
