Adding more detail to summaries

Visualizing Big Data with Trelliscope in R

Ryan Hafen

Author, TrelliscopeJS

Distribution of total fare amount

Visualizing Big Data with Trelliscope in R

Adding more detail to summaries

Introduce more variables into the summary computations

  • Binning two or more continuous variables to visualize joint distribution
  • Grouping or faceting summary computations by additional variables
Visualizing Big Data with Trelliscope in R

Binning continuous variables using geom_hex()

ggplot(tx, aes(tip_amount, total_amount)) +

geom_hex(bins = 75) + scale_x_log10() + scale_y_log10() + geom_abline(slope = 1, intercept = 0)

Visualizing Big Data with Trelliscope in R

Faceting using facet_wrap()

ggplot(daily_count, aes(pickup_date, n_rides)) +
  geom_point() +
  facet_wrap(~ pickup_dow)

Visualizing Big Data with Trelliscope in R

Faceting

ggplot(daily_count, aes(pickup_date, n_rides)) +
  geom_point() +
  facet_grid(~ pickup_dow) +
  geom_smooth(method = "rlm", se = FALSE)

Visualizing Big Data with Trelliscope in R

Let's practice!

Visualizing Big Data with Trelliscope in R

Preparing Video For Download...