Introduction to Statistics in R
Maggie Matsui
Content Developer, DataCamp
What's the probability of an event?
$$ P(\text{event}) = \frac{\text{\# ways event can happen}}{\text{total \# of possible outcomes}} $$
Example: a coin flip
$$ P(\text{heads}) = \frac{\text{1 way to get heads}}{\text{2 possible outcomes}} = \frac{1}{2} = 50\%$$



$$P(\text{Brian}) = \frac{1}{4} = 25\%$$
sales_counts
   name  n_sales
 1 Amir      178
 2 Brian     126
 3 Claire     75
 4 Damian     69
sales_counts %>%
  sample_n(1)
   name  n_sales
 1 Brian     126
sales_counts %>%
  sample_n(1)
   name  n_sales
 1 Claire     75
set.seed(5)sales_counts %>% sample_n(1)
   name  n_sales
 1 Brian     126
set.seed(5)sales_counts %>% sample_n(1)
   name  n_sales
 1 Brian     126
Sampling without replacement


$$P(\text{Claire}) = \frac{1}{3} = 33\%$$
sales_counts %>%
  sample_n(2)
   name  n_sales
 1 Brian     126
 2 Claire     75


$$P(\text{Claire}) = \frac{1}{4} = 25\%$$
sales_counts %>%
  sample_n(2, replace = TRUE)
   name  n_sales
 1 Brian     126
 2 Claire     75
5 meetings:
sample(sales_team, 5, replace = TRUE)
   name  n_sales
 1 Brian     126
 2 Claire     75
 3 Brian     126
 4 Brian     126
 5 Amir      178
Two events are independent if the probability of the second event isn't affected by the outcome of the first event.

Two events are independent if the probability of the second event isn't affected by the outcome of the first event.
Sampling with replacement = each pick is independent

Two events are dependent if the probability of the second event is affected by the outcome of the first event.

Two events are dependent if the probability of the second event is affected by the outcome of the first event.

Two events are dependent if the probability of the second event is affected by the outcome of the first event.
Sampling without replacement = each pick is dependent

Introduction to Statistics in R