Gegroepeerde gegevens filteren

Gevorderde SQL

Jasmin Ludolf

Data Science Content Developer, DataCamp

HAVING

SELECT 
       release_year,
       COUNT(title) AS title_count
FROM films
GROUP BY release_year
WHERE COUNT(title) > 10;
syntax error at or near "WHERE"
LINE 4: WHERE COUNT(title) > 10;
        ^
SELECT 
       release_year,
       COUNT(title) AS title_count
FROM films
GROUP BY release_year
HAVING COUNT(title) > 10;
|release_year|title_count|
|------------|-----------|
|1988        |31         |
|null        |42         |
|2008        |225        |
...
Gevorderde SQL

Volgorde van uitvoering

-- Written code:

SELECT certification, COUNT(title) AS title_count FROM films WHERE certification IN ('G', 'PG', 'PG-13') GROUP BY certification HAVING COUNT(title) > 500 ORDER BY title_count DESC LIMIT 3;
-- Order of execution:

SELECT certification, COUNT(title) AS title_count
FROM films
WHERE certification IN ('G', 'PG', 'PG-13')
GROUP BY certification
HAVING COUNT(title) > 500
ORDER BY title_count DESC
LIMIT 3;
Gevorderde SQL

HAVING versus WHERE

  • WHERE filtert individuele records, HAVING filtert gegroepeerde records
  • Welke films zijn er in het jaar 2000 uitgekomen?
SELECT title
FROM films
WHERE release_year = 2000;
|title         |
|--------------|
|102 Dalmatians|
|28 Days       |
...
  • In welke jaren duurde een film gemiddeld meer dan twee uur?
Gevorderde SQL

HAVING versus WHERE

  • In welke jaren duurde een film gemiddeld meer dan twee uur?
SELECT release_year
FROM films

GROUP BY release_year
HAVING AVG(duration) > 120;
|release_year|
|------------|
|1954        |
|1959        |
...
Gevorderde SQL

Laten we oefenen!

Gevorderde SQL

Preparing Video For Download...