Fraud Detection in Python
Charlotte Werger
Data Scientist
Types of useful text data:
Flagging suspicious words:
# Using a string operator to find words df['email_body'].str.contains('money laundering')
# Select data that matches df.loc[df['email_body'].str.contains('money laundering', na=False)]
# Create a list of words to search for list_of_words = ['police', 'money laundering'] df.loc[df['email_body'].str.contains('|'.join(list_of_words) , na=False)]
# Create a fraud flag df['flag'] = np.where((df['email_body'].str.contains('|'.join (list_of_words)) == True), 1, 0)
Fraud Detection in Python