Joining data: a real-world necessity
Pandas Joins for Spreadsheet Users
John Miller
Principal Data Scientist
Pandas for spreadsheet users
Learn based on similarities to spreadsheets
Understand the power and flexibility of pandas
Use data from the National Football League (NFL)
Common situations
$$
Datasets split by time or other factor
Datasets with related factors
Split data
$$
Influenced by reporting cycle
Common splits
Time
Geography
Business unit
Split data example
Split data example
Split data example
Complementary data
$$
Results from collecting data for different purposes
Department-specific data
Storage in separate files or database tables
Complementary data example
$$
Complementary data example
$$
Complementary data example
$$
Let's practice!
Pandas Joins for Spreadsheet Users
Preparing Video For Download...