ROLLUP

Data-driven beslissingen nemen met SQL

Bart Baesens

Professor Data Science and Analytics

Tabel renting_extended

De eerste rijen van de tabel renting_extended:

| renting_id | country  | genre  | rating |
|------------|----------|--------|--------|
| 2          | Belgium  | Drama  | 10     |
| 32         | Belgium  | Drama  | 10     |
| 203        | Austria  | Drama  | 6      |
| 292        | Austria  | Comedy | 8      |
| 363        | Belgium  | Drama  | 7      |
| .......... | ........ | ...... | ...... |
Data-driven beslissingen nemen met SQL

Query met ROLLUP

SELECT country, 
       genre, 
       COUNT(*)
FROM renting_extended
GROUP BY ROLLUP (country, genre);
  • Aggregatieniveaus
    • Aggregatie per combinatie van land en genre
    • Aggregatie per land
    • Totale aggregatie
Data-driven beslissingen nemen met SQL

Query met ROLLUP

SELECT country, 
       genre, 
       COUNT(*)
FROM renting_extended
GROUP BY ROLLUP (country, genre);
| country | genre  | count |
|---------|--------|-------|
| null    | null   | 22    |
| Austria | Comedy | 2     |
| Belgium | Drama  | 15    |
| Austria | Drama  | 4     |
| Belgium | Comedy | 1     |
| Belgium | null   | 16    |
| Austria | null   | 6     |
Data-driven beslissingen nemen met SQL

Volgorde in ROLLUP

SELECT country, 
       genre, 
       COUNT(*)
FROM renting_extended
GROUP BY ROLLUP (genre, country);
| country | genre  | count |
|---------|--------|-------|
| null    | null   | 22    |
| Austria | Comedy | 2     |
| Belgium | Drama  | 15    |
| Austria | Drama  | 4     |
| Belgium | Comedy | 1     |
| null    | Comedy | 3     |
| null    | Drama  | 19    |
Data-driven beslissingen nemen met SQL

Samenvatting ROLLUP

  • Geeft aggregaties voor een hiërarchie, bijv. ROLLUP (country, genre)
    • Verhuur per land en per genre
    • Verhuur per land
    • Totaal aantal verhuur
  • Elke stap laat één detailniveau weg
  • Volgorde van kolomnamen is belangrijk voor ROLLUP
Data-driven beslissingen nemen met SQL

Aantal verhuur en ratings

SELECT country, 
       genre, 
       COUNT(*) AS n_rentals,
       COUNT(rating) AS n_ratings
FROM renting_extended
GROUP BY ROLLUP (genre, country);
| country  | genre  | n_rentals | n_ratings |
|----------|--------|-----------|-----------|
| null     | null   | 22        | 9         |
| Belgium  | Drama  | 15        | 6         |
| Austria  | Comedy | 2         | 1         |
| Belgium  | Comedy | 1         | 0         |
| Austria  | Drama  | 4         | 2         |
| null     | Comedy | 3         | 1         |
| null     | Drama  | 19        | 8         |
Data-driven beslissingen nemen met SQL

Laten we oefenen!

Data-driven beslissingen nemen met SQL

Preparing Video For Download...