Heatmap

Analisis Market Basket dengan Python

Isaiah Hull

Visiting Associate Professor of Finance, BI Norwegian Business School

Dataset MovieLens

import pandas as pd

# Load ratings data.
ratings = pd.read_csv('datasets/movie_ratings.csv')
print(ratings.head())
   userId  movieId  title
0    3149    54286  Bourne Ultimatum, The (2007) 
1    3149     1220  Blues Brothers, The (1980)
2    3149     4007  Wall Street (1987)
3    3149     7156  Fog of War: Eleven...
4    3149    97304  Argo (2012)
Analisis Market Basket dengan Python

Membuat "transaksi" dari rating

# Recover unique user IDs.
user_id = movies['userId'].unique()

# Create library of highly rated movies for each user.
libraries = [list(ratings[ratings['userId'] == u].title) for u in user_id]

# Print example library.
print(library[0])
['Battlestar Galactica (2003)',
 'Gorgon, The (1964)',
 'Under the Skin (2013)',
 'Upstream Color (2013)',
 'Destry Rides Again (1939)',
 'Dr. Phibes Rises Again (1972)']
Analisis Market Basket dengan Python

One-hot encoding transaksi

from mlxtend.preprocessing import TransactionEncoder

# Instantiate transaction encoder.
encoder = TransactionEncoder()

# One-hot encode libraries.
onehot = encoder.fit(libraries).transform(libraries)

# Use movie titles as column headers.
onehot = pd.DataFrame(onehot, columns = encoder.columns_)

# Print onehot header.
print(onehot.head())
Analisis Market Basket dengan Python

One-hot encoding transaksi

 

   (500) Days of Summer (2009)  .45 (2006)  10 Things I Hate About You (1999) 
0                        False       False                              False   
1                        False       False                              False   
2                        False       False                              False   
3                        False       False                              False   
4                        False       False                              False
Analisis Market Basket dengan Python

Apa itu heatmap?

Gambar menunjukkan contoh heatmap.

Analisis Market Basket dengan Python

Menyiapkan data

  1. Hasilkan aturan.
    • Gunakan algoritme Apriori dan association rules.
  2. Ubah antecedent dan consequent jadi string.
    • Secara default disimpan sebagai frozen set di mlxtend.
  3. Ubah aturan ke format matriks.
    • Cocok untuk heatmap.
Analisis Market Basket dengan Python

Menyiapkan data

from mlxtend.frequent_patterns import association_rules, apriori
import seaborn as sns
# Apply the apriori algorithm
frequent_itemsets = apriori(onehot, min_support=0.10,
                            use_colnames=True, max_len=2)

# Recover the association rules
rules = association_rules(frequent_itemsets)
Analisis Market Basket dengan Python

Membuat heatmap

# Convert antecedents and consequents into strings
rules['antecedents'] = rules['antecedents'].apply(lambda a: ','.join(list(a)))
rules['consequents'] = rules['consequents'].apply(lambda a: ','.join(list(a)))
# Print example.
print(rules[['antecedents','consequents']])
            antecedents                    consequents
0  Batman Begins (2005)  Dark Knight Rises, The (2012)
Analisis Market Basket dengan Python

Membuat heatmap

# Transform antecedent, consequent, and support columns into matrix
support_table = rules.pivot(index='consequents', columns='antecedents', 
                                values='support')
# Generate heatmap
sns.heatmap(support_table)
Analisis Market Basket dengan Python

Membuat heatmap

Gambar menunjukkan heatmap dari data MovieLens.

Analisis Market Basket dengan Python

Kustomisasi heatmap

sns.heatmap(pivot, annot=True, cbar=False, cmap='ocean')

Gambar ini menunjukkan versi heatmap yang dikustomisasi.

Analisis Market Basket dengan Python

Ayo berlatih!

Analisis Market Basket dengan Python

Preparing Video For Download...