Generalized Linear Models di Python
Ita Cirovic Donev
Data Science Consultant

$\color{#00A388}{\text{salary}} \sim \color{#FF6138}{\text{experience}}$
$\normalsize{\color{#00A388}{\text{salary}} = \beta_0 + \beta_1\times\color{#FF6138}{\text{experience}} + \epsilon}$
$\normalsize{\color{#00A388}y = \beta_0 + \beta_1x_1 + \epsilon}$

$\color{#00A388}{\text{salary}} \sim \color{#FF6138}{\text{experience}}$
$\color{#00A388}{\text{salary}} = \beta_0 + \beta_1\times{\text{experience}} + \epsilon$
$\color{#00A388}y = \beta_0 + \beta_1x_1 + \epsilon$
di mana:
$\color{#00A388}y$ - variabel respons (output)

$\color{#00A388}{\text{salary}} \sim \color{#FF6138}{\text{experience}}$
$\normalsize{\color{#00A388}{\text{salary}} = \beta_0 + \beta_1\times\color{#FF6138}{\text{experience}} + \epsilon}$
$\normalsize{\color{#00A388}y = \beta_0 + \beta_1\color{#FF6138}{x_1} + \epsilon}$
di mana:
$y$ - variabel respons (output)
$\color{#FF6138}x$ - variabel penjelas (input)

$\color{#00A388}{\text{salary}} \sim \color{#FF6138}{\text{experience}}$
$\normalsize{\color{#00A388}{\text{salary}} = \color{#007AFF}{\beta_0} + \color{#007AFF}{\beta_1}\times\color{#FF6138}{\text{experience}} + \epsilon}$
$\normalsize{\color{#00A388}y = \color{#007AFF}{\beta_0} + \color{#007AFF}{\beta_1}\color{#FF6138}{x_1} + \epsilon}$
di mana:
$y$ - variabel respons (output)
$x$ - variabel penjelas (input)
$\color{#007AFF}{\beta}$ - parameter model
$\color{#007AFF}{\beta_0}$ - intersep
$\color{#007AFF}{\beta_1}$ - kemiringan (slope)

$\color{#00A388}{\text{salary}} \sim \color{#FF6138}{\text{experience}}$
$\normalsize{\color{#00A388}{\text{salary}} = \color{#007AFF}{\beta_0} + \color{#007AFF}{\beta_1}\times\color{#FF6138}{\text{experience}} + \color{#B12BFF}\epsilon}$
$\normalsize{\color{#00A388}y = \color{#007AFF}{\beta_0} + \color{#007AFF}{\beta_1}\color{#FF6138}{x_1} + \color{#B12BFF}\epsilon}$
di mana:
$y$ - variabel respons (output)
$x$ - variabel penjelas (input)
$\color{#007AFF}{\beta}$ - parameter model
$\color{#007AFF}{\beta_0}$ - intersep
$\color{#007AFF}{\beta_1}$ - kemiringan (slope)
$\color{#B12BFF}{\epsilon}$ - galat acak
MODEL LINEAR - ols()
from statsmodels.formula.api import ols
model = ols(formula = 'y ~ X',
data = my_data).fit()
MODEL LINEAR TERUMUMKAN - glm()
import statsmodels.api as sm
from statsmodels.formula.api import glm
model = glm(formula = 'y ~ X',
data = my_data,
family = sm.families.____).fit()

$$ \normalsize{{\text{salary} = \color{blue}{25790} + \color{blue}{9449}\times\text{experience}}} $$
Fungsi regresi
$\normalsize{E[y] = \mu = \beta_0 + \beta_1x_1}$
Asumsi

| Nama Variabel | Deskripsi |
|---|---|
sat |
Jumlah satelit di sarang |
y |
Ada ≥1 satelit di sarang; 0/1 |
weight |
Berat kepiting betina (kg) |
width |
Lebar kepiting betina (cm) |
color |
1 - medium terang, 2 - medium, 3 - medium gelap, 4 - gelap |
spine |
1 - keduanya baik, 2 - satu aus/retak, 3 - keduanya aus/retak |
$\text{satellite crab} \sim \text{female crab weight}$
y ~ weight
$P(\text{satelit ada})=P(y=1)$






Generalized Linear Models di Python