Grouping data

Intermediate SQL

Jasmin Ludolf

Data Science Content Developer, DataCamp

Grouping data

Illustration showing a full funnel with different shapes being funneled into three categories: squares, circles, and triangles

Intermediate SQL

GROUP BY single fields

SELECT certification, COUNT(title) AS title_count
FROM films
GROUP BY certification;
|certification|title_count|
|-------------|-----------|
|Unrated      |62         |
|M            |5          |
|G            |112        |
|NC-17        |7          |
...
Intermediate SQL

Error handling

SELECT certification, title
FROM films
GROUP BY certification;
column "films.title" must appear in the
GROUP BY clause or be used in an 
aggregate function
LINE 1: SELECT certification, title
                              ^
SELECT 
   certification, 
   COUNT(title) AS count_title
FROM films
GROUP BY certification;
|certification|count_title|
|-------------|-----------|
|Unrated      |62         |
|M            |5          |
|G            |112        |
...
Intermediate SQL

GROUP BY multiple fields

SELECT certification, language, COUNT(title) AS title_count
FROM films
GROUP BY certification, language;
|certification|language |title_count|
|-------------|---------|-----------|
|null         |null     |5          |
|Unrated      |Japanese |2          |
|R            |Norwegian|2          |
...
Intermediate SQL

GROUP BY with ORDER BY

SELECT 
       certification, 
       COUNT(title) AS title_count
FROM films
GROUP BY certification;
|certification|title_count|
|-------------|-----------|
|Unrated      |62         |
|M            |5          |
|G            |112        |
...
SELECT 
       certification, 
       COUNT(title) AS title_count
FROM films
GROUP BY certification
ORDER BY title_count DESC;
|certification|title_count|
|-------------|-----------|
|R            |2118       |
|PG-13        |1462       |
...
Intermediate SQL

Order of execution

-- Written code:
SELECT 
       certification, 
       COUNT(title) AS title_count
FROM films
GROUP BY certification
ORDER BY title_count DESC
LIMIT 3;
-- Order of execution:

SELECT certification, COUNT(title) AS title_count
FROM films
GROUP BY certification
ORDER BY title_count DESC
LIMIT 3;
Intermediate SQL

Let's practice!

Intermediate SQL

Preparing Video For Download...