Modelli lineari generalizzati in Python
Ita Cirovic Donev
Data Science Consultant

$\color{#00A388}{\text{salary}} \sim \color{#FF6138}{\text{experience}}$
$\normalsize{\color{#00A388}{\text{salary}} = \beta_0 + \beta_1\times\color{#FF6138}{\text{experience}} + \epsilon}$
$\normalsize{\color{#00A388}y = \beta_0 + \beta_1x_1 + \epsilon}$

$\color{#00A388}{\text{salary}} \sim \color{#FF6138}{\text{experience}}$
$\color{#00A388}{\text{salary}} = \beta_0 + \beta_1\times{\text{experience}} + \epsilon$
$\color{#00A388}y = \beta_0 + \beta_1x_1 + \epsilon$
where:
$\color{#00A388}y$ - variabile risposta (output)

$\color{#00A388}{\text{salary}} \sim \color{#FF6138}{\text{experience}}$
$\normalsize{\color{#00A388}{\text{salary}} = \beta_0 + \beta_1\times\color{#FF6138}{\text{experience}} + \epsilon}$
$\normalsize{\color{#00A388}y = \beta_0 + \beta_1\color{#FF6138}{x_1} + \epsilon}$
where:
$y$ - variabile risposta (output)
$\color{#FF6138}x$ - variabile esplicativa (input)

$\color{#00A388}{\text{salary}} \sim \color{#FF6138}{\text{experience}}$
$\normalsize{\color{#00A388}{\text{salary}} = \color{#007AFF}{\beta_0} + \color{#007AFF}{\beta_1}\times\color{#FF6138}{\text{experience}} + \epsilon}$
$\normalsize{\color{#00A388}y = \color{#007AFF}{\beta_0} + \color{#007AFF}{\beta_1}\color{#FF6138}{x_1} + \epsilon}$
where:
$y$ - variabile risposta (output)
$x$ - variabile esplicativa (input)
$\color{#007AFF}{\beta}$ - parametri del modello
$\color{#007AFF}{\beta_0}$ - intercetta
$\color{#007AFF}{\beta_1}$ - pendenza

$\color{#00A388}{\text{salary}} \sim \color{#FF6138}{\text{experience}}$
$\normalsize{\color{#00A388}{\text{salary}} = \color{#007AFF}{\beta_0} + \color{#007AFF}{\beta_1}\times\color{#FF6138}{\text{experience}} + \color{#B12BFF}\epsilon}$
$\normalsize{\color{#00A388}y = \color{#007AFF}{\beta_0} + \color{#007AFF}{\beta_1}\color{#FF6138}{x_1} + \color{#B12BFF}\epsilon}$
where:
$y$ - variabile risposta (output)
$x$ - variabile esplicativa (input)
$\color{#007AFF}{\beta}$ - parametri del modello
$\color{#007AFF}{\beta_0}$ - intercetta
$\color{#007AFF}{\beta_1}$ - pendenza
$\color{#B12BFF}{\epsilon}$ - errore casuale
MODELLO LINEARE - ols()
from statsmodels.formula.api import ols
model = ols(formula = 'y ~ X',
data = my_data).fit()
MODELLO LINEARE GENERALIZZATO - glm()
import statsmodels.api as sm
from statsmodels.formula.api import glm
model = glm(formula = 'y ~ X',
data = my_data,
family = sm.families.____).fit()

$$ \normalsize{{\text{salary} = \color{blue}{25790} + \color{blue}{9449}\times\text{experience}}} $$
Funzione di regressione
$\normalsize{E[y] = \mu = \beta_0 + \beta_1x_1}$
Assunzioni

| Nome variabile | Descrizione |
|---|---|
sat |
Numero di satelliti nel nido |
y |
C’è almeno un satellite nel nido; 0/1 |
weight |
Peso della femmina (kg) |
width |
Larghezza della femmina (cm) |
color |
1 - medio chiaro, 2 - medio, 3 - medio scuro, 4 - scuro |
spine |
1 - entrambe buone, 2 - una usurata/rotta, 3 - entrambe usurate/rotte |
$\text{satellite crab} \sim \text{female crab weight}$
y ~ weight
$P(\text{satellite crab is present})=P(y=1)$






Modelli lineari generalizzati in Python