Metrik dalam analisis keranjang belanja

Analisis Keranjang Belanja di R

Christopher Bruffaerts

Statistician

Metrik untuk ekstraksi aturan

TID Transaksi
1 {Bread, Butter, Cheese, Wine}
2 {Bread, Butter, Wine}
3 {Bread, Butter}
4 {Butter, Cheese, Wine}
5 {Butter, Cheese}
6 {Cheese, Wine}
7 {Butter, Wine}

Tujuan: Ekstrak aturan asosiasi

Contoh:

  • {Bread} $\rightarrow$ {Butter}
    • Bread = "Antecedent"
    • Butter = "Consequent"
  • {Butter, Cheese} $\rightarrow$ {Wine}

Metrik: Support, confidence, lift, ...

Analisis Keranjang Belanja di R

Metrik support

TID Transaksi
1 {Bread, Butter, Cheese, Wine}
2 {Bread, Butter, Wine}
3 {Bread, Butter}
4 {Butter, Cheese, Wine}
5 {Butter, Cheese}
6 {Cheese, Wine}
7 {Butter, Wine}

Support: "popularitas itemset"

  • supp(X) = Fraksi transaksi yang memuat itemset X.
  • supp(X $\cup$ Y) = Fraksi transaksi yang memuat X dan Y.

Contoh:

  • supp({Bread}) = 3/7 = 42%
  • supp({Bread} $\cup$ {Butter}) = 3/7 = 42%
Analisis Keranjang Belanja di R

Metrik confidence

TID Transaksi
1 {Bread, Butter, Cheese, Wine}
2 {Bread, Butter, Wine}
3 {Bread, Butter}
4 {Butter, Cheese, Wine}
5 {Butter, Cheese}
6 {Cheese, Wine}
7 {Butter, Wine}

Confidence: "seberapa sering aturan benar"

conf(X $\rightarrow$ Y) = supp(X $\cup$ Y) / supp(X)

Confidence menunjukkan persentase Y dibeli bersama X.

Contoh:

X = {Bread}

Y = {Butter}

conf(X $\rightarrow$ Y) = $\frac{3/7}{3/7}$ = 100%

Analisis Keranjang Belanja di R

Metrik lift

TID Transaksi
1 {Bread, Butter, Cheese, Wine}
2 {Bread, Butter, Wine}
3 {Bread, Butter}
4 {Butter, Cheese, Wine}
5 {Butter, Cheese}
6 {Cheese, Wine}
7 {Butter, Wine}

Lift: "seberapa kuat asosiasi"

lift(X $\rightarrow$ Y) = $\dfrac{supp(X \cup Y)}{supp(X) \times supp(Y)}$

  • Lift > 1: Y cenderung dibeli bersama X
  • Lift < 1: Y kecil kemungkinan dibeli jika X dibeli

Contoh:

X = {Bread}; Y = {Butter}

lift(X $\rightarrow$ Y) = $\frac{3/7}{(3/7)*(6/7)} = \frac{7}{6}$ ~ 1,16

Analisis Keranjang Belanja di R

Fungsi apriori untuk frequent itemsets

library(arules)
# Frequent itemsets
supp.cw = apriori(trans, # the transactional dataset
                  # Parameter list
                  parameter=list(
                    # Minimum Support
                    supp=0.2,
                    # Minimum Confidence
                    conf=0.4,
                    # Minimum length
                    minlen=2,
                    # Target
                    target="frequent itemsets"),
                  # Appearence argument
                  appearance = list(
                    items = c("Cheese","Wine"))
                 )
Analisis Keranjang Belanja di R

Fungsi apriori untuk rules

library(arules)
# Rules
rules.b.rhs = apriori(trans, # the transactional dataset
                  # Parameter list
                  parameter=list(
                    # Minimum Support
                    supp=0.2,
                    # Minimum Confidence
                    conf=0.4,
                    # Minimum length
                    minlen=2,
                    # Target
                    target="rules"),
                  # Appearence argument
                   appearance = list(
                     rhs = "Butter",
                    default = "lhs")
                 )
Analisis Keranjang Belanja di R

Frequent itemsets dengan apriori

TID Transaksi
1 {Bread, Butter, Cheese, Wine}
2 {Bread, Butter, Wine}
3 {Bread, Butter}
4 {Butter, Cheese, Wine}
5 {Butter, Cheese}
6 {Cheese, Wine}
7 {Butter, Wine}

Ambil frequent itemsets

supp.all = apriori(trans, 
           parameter=list(supp=3/7,
           target="frequent itemsets"))        
inspect(head(sort(supp.all,by="support"),3))
    items          support   count
[1] {Butter}       0.8571429 6    
[2] {Wine}         0.7142857 5    
[3] {Cheese}       0.5714286 4
Analisis Keranjang Belanja di R

Periksa metrik confidence dan lift

TID Transaksi
1 {Bread, Butter, Cheese, Wine}
2 {Bread, Butter, Wine}
3 {Bread, Butter}
4 {Butter, Cheese, Wine}
5 {Butter, Cheese}
6 {Cheese, Wine}
7 {Butter, Wine}

Ambil rules

# Rules dengan "Butter" di rhs
rules.b.rhs = apriori(trans, 
                 parameter=list(
                    minlen=2,
                    target="rules"), 
                 appearance = list(
                     rhs="Butter",
                    default = "lhs")
           )
inspect(head(sort(rules.b.rhs,by="lift")), 5)
Analisis Keranjang Belanja di R

Periksa metrik confidence dan lift

TID Transaksi
1 {Bread, Butter, Cheese, Wine}
2 {Bread, Butter, Wine}
3 {Bread, Butter}
4 {Butter, Cheese, Wine}
5 {Butter, Cheese}
6 {Cheese, Wine}
7 {Butter, Wine}

Ambil rules

    lhs                    rhs      support  confidence lift  count
[1] {Bread}             => {Butter} 0.42      1.0        1.16     3    
[2] {Bread,Cheese}      => {Butter} 0.14      1.0        1.16     1    
[3] {Bread,Wine}        => {Butter} 0.28      1.0        1.16     2    
[4] {Bread,Cheese,Wine} => {Butter} 0.14      1.0        1.16     1    
[5] {Wine}              => {Butter} 0.57      0.8        0.93     4
Analisis Keranjang Belanja di R

Ayo berlatih!

Analisis Keranjang Belanja di R

Preparing Video For Download...