Filtering grouped data

Intermediate SQL

Jasmin Ludolf

Data Science Content Developer, DataCamp

HAVING

SELECT release_year, COUNT(title) AS title_count
FROM films
GROUP BY release_year
WHERE COUNT(title) > 10;

syntax error at or near "WHERE"
LINE 4: WHERE COUNT(title) > 10;
        ^

SELECT 
       release_year,
       COUNT(title) AS title_count
FROM films
GROUP BY release_year
HAVING COUNT(title) > 10;
|release_year|title_count|
|------------|-----------|
|1988        |31         |
|null        |42         |
|2008        |225        |
...
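
The contrast between the failing WHERE query and the working HAVING query can be reproduced without a database server. This is a minimal sketch, assuming Python's built-in sqlite3 module and a tiny made-up films table: the rows are hypothetical, not the course dataset, and the threshold is lowered from 10 to 1 to suit them. SQLite words its error differently from the message shown above, so the sketch only records that the query was rejected.

```python
import sqlite3

# Hypothetical sample rows; the real films table is much larger.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE films (title TEXT, release_year INTEGER)")
conn.executemany(
    "INSERT INTO films VALUES (?, ?)",
    [("A", 1988), ("B", 1988), ("C", 2008), ("D", 2008), ("E", 2008), ("F", 1999)],
)

# Same shape as the failing query: WHERE placed after GROUP BY,
# trying to filter on an aggregate. The database rejects it.
try:
    conn.execute(
        "SELECT release_year, COUNT(title) AS title_count "
        "FROM films "
        "GROUP BY release_year "
        "WHERE COUNT(title) > 1"
    )
    where_failed = False
except sqlite3.Error as exc:
    where_failed = True
    print("Rejected:", exc)

# HAVING filters the groups produced by GROUP BY.
rows = conn.execute(
    "SELECT release_year, COUNT(title) AS title_count "
    "FROM films "
    "GROUP BY release_year "
    "HAVING COUNT(title) > 1 "
    "ORDER BY release_year"
).fetchall()
print(rows)
```

The year 1999 has a single film in this sample, so it is the only group that HAVING drops.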

Order of execution

-- Written code:

SELECT certification, COUNT(title) AS title_count
FROM films
WHERE certification IN ('G', 'PG', 'PG-13')
GROUP BY certification
HAVING COUNT(title) > 500
ORDER BY title_count DESC
LIMIT 3;

-- Order of execution:
-- (clauses listed in the logical order they are evaluated; not valid syntax as written)

FROM films
WHERE certification IN ('G', 'PG', 'PG-13')
GROUP BY certification
HAVING COUNT(title) > 500
SELECT certification, COUNT(title) AS title_count
ORDER BY title_count DESC
LIMIT 3;
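
Logically, a query is evaluated FROM, WHERE, GROUP BY, HAVING, SELECT, ORDER BY, LIMIT, regardless of the order in which the clauses are written. One way to see the effect is to add the clauses one stage at a time and watch the result shrink. This is a rough sketch, assuming Python's built-in sqlite3 module and hypothetical rows, with the thresholds scaled down (HAVING > 1 and LIMIT 1 instead of 500 and 3).

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE films (title TEXT, certification TEXT)")

# Hypothetical rows: 3 x PG-13, 2 x PG, 1 x G, 4 x R.
sample = (
    [(f"pg13_{i}", "PG-13") for i in range(3)]
    + [(f"pg_{i}", "PG") for i in range(2)]
    + [("g_0", "G")]
    + [(f"r_{i}", "R") for i in range(4)]
)
conn.executemany("INSERT INTO films VALUES (?, ?)", sample)

select_from = "SELECT certification, COUNT(title) AS title_count FROM films "
where = "WHERE certification IN ('G', 'PG', 'PG-13') "
group_by = "GROUP BY certification "
having = "HAVING COUNT(title) > 1 "

# Stage 1: WHERE removes the R rows before any group is formed.
after_where = conn.execute(
    select_from + where + group_by + "ORDER BY certification"
).fetchall()

# Stage 2: HAVING then removes groups that are too small.
after_having = conn.execute(
    select_from + where + group_by + having + "ORDER BY certification"
).fetchall()

# Stage 3: ORDER BY can use the SELECT alias; LIMIT trims last.
final = conn.execute(
    select_from + where + group_by + having + "ORDER BY title_count DESC LIMIT 1"
).fetchall()

print(after_where)
print(after_having)
print(final)
```

R is the largest group in this sample, yet it never reaches HAVING because WHERE discarded its rows first.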

HAVING vs. WHERE

  • WHERE filters individual records, HAVING filters grouped records
  • Which films were released in 2000?
SELECT title
FROM films
WHERE release_year = 2000;
|title         |
|--------------|
|102 Dalmatians|
|28 Days       |
...
  • In which years was the average film duration longer than two hours?
SELECT release_year
FROM films
GROUP BY release_year
HAVING AVG(duration) > 120;
|release_year|
|------------|
|1954        |
|1959        |
...
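
The two questions above map onto the two clauses: a row-level condition goes in WHERE, a group-level condition goes in HAVING, and a single query can use both. This is a small sketch, assuming Python's built-in sqlite3 module and a hypothetical five-row films table. The third query, with its 1990 cutoff and 110-minute threshold, is an illustrative addition rather than part of the slides.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE films (title TEXT, release_year INTEGER, duration INTEGER)"
)
# Hypothetical rows chosen so the averages are easy to check by hand.
conn.executemany(
    "INSERT INTO films VALUES (?, ?, ?)",
    [
        ("A", 1954, 130), ("B", 1954, 120),  # 1954 average: 125
        ("C", 2000, 90), ("D", 2000, 150),   # 2000 average: 120
        ("E", 2001, 100),                    # 2001 average: 100
    ],
)

# Row-level question: WHERE tests each record on its own.
titles_2000 = [
    row[0]
    for row in conn.execute(
        "SELECT title FROM films WHERE release_year = 2000 ORDER BY title"
    )
]

# Group-level question: HAVING tests each year's aggregate.
long_years = [
    row[0]
    for row in conn.execute(
        "SELECT release_year FROM films "
        "GROUP BY release_year "
        "HAVING AVG(duration) > 120 "
        "ORDER BY release_year"
    )
]

# Both together: WHERE narrows the rows, then HAVING narrows the groups.
recent_long_years = [
    row[0]
    for row in conn.execute(
        "SELECT release_year FROM films "
        "WHERE release_year >= 1990 "
        "GROUP BY release_year "
        "HAVING AVG(duration) > 110 "
        "ORDER BY release_year"
    )
]

print(titles_2000, long_years, recent_long_years)
```

In this sample, 2000 averages exactly 120 minutes, so it fails the strict `> 120` test but passes the looser `> 110` one.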

Let's practice!
