Histogram nuances

Visualization Best Practices in R

Nick Strayer

Instructor

Histogram positives

  • Intuitive
  • Interpretable

Visualization Best Practices in R

Histogram negatives

  • Sensitive to bin placements
  • Iffy with small amounts of data

Visualization Best Practices in R

Adjusting number of bins

  • Exact same data
  • Varying bin-numbers (geom_histogram(bins = n))) from 10 to 55

Visualization Best Practices in R

Bin number best practices

  • If length(data$x) > 150bins = 100
  • Otherwise, play around to get a good sense of the data

Visualization Best Practices in R

Reality

  • Beware of digit preferences
  • Data from automated sources are less likely to be problematic

Visualization Best Practices in R

Let's improve some histograms!

Visualization Best Practices in R

Preparing Video For Download...