Pandas Joins for Spreadsheet Users
John Miller
Principal Data Scientist
$$
After viewing and understanding the data:
$$ $$ Unique values for a single-column key:
df.duplicated('GameKey').sum()
$$ A value of 0 means no duplicates. The same check works for a multi-column key:
df.duplicated(['GameKey', 'PlayId']).sum()
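As a rough illustration of the two checks above, here is a minimal sketch on made-up data. Only the column names GameKey and PlayId come from the slides; the values are hypothetical.

```python
import pandas as pd

# Hypothetical sample data; only the column names come from the slides.
df = pd.DataFrame({
    "GameKey": [1, 1, 2, 2, 2],
    "PlayId":  [10, 11, 10, 11, 11],
})

# Single-column key: counts rows whose GameKey already appeared in an earlier row.
single_dupes = df.duplicated("GameKey").sum()

# Multi-column key: counts rows whose (GameKey, PlayId) pair already appeared.
multi_dupes = df.duplicated(["GameKey", "PlayId"]).sum()

print(single_dupes, multi_dupes)
```

In this sample GameKey alone is not unique, and the pair (GameKey, PlayId) still has one repeated row, so neither would be a safe one-to-one join key without cleanup.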
$$
After viewing and understanding the data:
$$
The statement is the same!
df1.merge(df2, how='inner', on='')
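The slide leaves the on argument blank, so the sketch below assumes GameKey as the join key purely for illustration. The frames games and plays and their values are hypothetical.

```python
import pandas as pd

# Hypothetical frames; GameKey is an assumed key, since the slide leaves on='' empty.
games = pd.DataFrame({"GameKey": [1, 2, 3], "Season": [2016, 2016, 2017]})
plays = pd.DataFrame({"GameKey": [1, 1, 2, 4], "PlayId": [10, 11, 10, 10]})

# Inner join: keeps only GameKey values present in both frames.
inner = games.merge(plays, how="inner", on="GameKey")

print(inner)
```

GameKey 3 (only in games) and GameKey 4 (only in plays) drop out of the result, much like a lookup that discards unmatched rows on both sides.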
$$
Full syntax:
DataFrame.merge(right, how='inner', on=None, left_on=None, right_on=None, left_index=False, right_index=False, sort=False, suffixes=('_x', '_y'), copy=True, indicator=False, validate=None)
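A few of these parameters are easier to see in use. The sketch below, on hypothetical frames, combines left_on/right_on (keys with different names), suffixes (for overlapping non-key columns), and indicator (to show where each row came from). The column name game_id and all values are assumptions for illustration.

```python
import pandas as pd

# Hypothetical frames: the key has a different name on each side,
# and both frames carry a non-key column called Score.
left = pd.DataFrame({"GameKey": [1, 2, 3], "Score": [7, 14, 21]})
right = pd.DataFrame({"game_id": [2, 3, 4], "Score": [3, 10, 17]})

merged = left.merge(
    right,
    how="outer",                      # keep unmatched rows from both sides
    left_on="GameKey",                # key column in the left frame
    right_on="game_id",               # key column in the right frame
    suffixes=("_left", "_right"),     # disambiguate the two Score columns
    indicator=True,                   # adds a _merge column: left_only / right_only / both
)

print(merged)
```

The _merge column is a quick way to audit a join: counting its values shows how many rows matched and how many came from one side only.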
$$
Values for validate:
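For reference, pandas accepts 'one_to_one' (or '1:1'), 'one_to_many' ('1:m'), 'many_to_one' ('m:1'), and 'many_to_many' ('m:m') for validate, and raises a MergeError when the keys do not satisfy the requested relationship. The sketch below uses hypothetical frames to show one check that passes and one that fails.

```python
import pandas as pd
from pandas.errors import MergeError

# Hypothetical frames: GameKey is unique in games but repeated in plays.
games = pd.DataFrame({"GameKey": [1, 2], "Season": [2016, 2017]})
plays = pd.DataFrame({"GameKey": [1, 1, 2], "PlayId": [10, 11, 10]})

# Passes: keys are unique on the left side, which is all one_to_many requires.
ok = games.merge(plays, on="GameKey", validate="one_to_many")

# Fails: one_to_one also requires unique keys on the right side.
try:
    games.merge(plays, on="GameKey", validate="one_to_one")
    raised = False
except MergeError:
    raised = True

print(len(ok), raised)
```

Using validate turns a silent row explosion into an immediate error, which pairs naturally with the duplicated() checks shown earlier.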