Intermediate SQL with AI
Jasmin Ludolf
Senior Data Science Content Developer
Which languages have more than 20 films?
WHERE
$$
$$
$$
HAVING
Prompt: Which languages have more than 20 films?
SELECT language, COUNT(*) AS lang_count
FROM films
GROUP BY language
HAVING COUNT(*) > 20;
|language|
|--------|
|Mandarin|
|Spanish |
|French |
|Hindi |
|English |
$$
Prompt: Film titles and budgets with a budget above 5 million
SELECT title, budget
FROM films
WHERE budget > 5000000;
|title |budget |
|---------------|-------|
|Metropolis |6000000|
|Duel in the Sun|8000000|
|Quo Vadis |7623000|
|West Side Story|6000000|
...
Prompt: Languages and budget with rounded average budget above 5 million
SELECT language, ROUND(AVG(budget)) AS avg_budget
FROM films
GROUP BY language
HAVING ROUND(AVG(budget)) > 5000000;
|language|avg_budget|
|--------|----------|
|Danish |16700000 |
|None |8250000 |
|Tamil |150000000 |
...
Prompt: Show languages with over 5 films from 2000 onwards, where average duration exceeds 80
Prompt: Show languages with over 5 films from 2000 onwards, where average duration exceeds 80
SELECT language, COUNT(*) AS film_count, ROUND(AVG(duration)) AS average_duration FROM filmsWHERE release_year >= 2000GROUP BY language
Prompt: Show languages with over 5 films from 2000 onwards, where average duration exceeds 80
SELECT language, COUNT(*) AS film_count, ROUND(AVG(duration)) AS average_duration
FROM films
WHERE release_year >= 2000
GROUP BY language
Prompt: Show languages with over 5 films from 2000 onwards, where average duration exceeds 80
SELECT language, COUNT(*) AS film_count, ROUND(AVG(duration)) AS average_duration FROM films WHERE release_year >= 2000 GROUP BY languageHAVING COUNT(*) > 5 AND ROUND(AVG(duration)) > 80;
|language|film_count|average_duration|
|--------|----------|----------------|
|Mandarin|26 |112 |
|Japanese|10 |114 |
|Spanish |37 |107 |
...
$$
FROM ➝

$$
FROM ➝ WHERE ➝

$$
FROM ➝ WHERE ➝ GROUP BY (and aggregation) ➝

$$
FROM ➝ WHERE ➝ GROUP BY (and aggregation) ➝ HAVING ➝

$$
FROM ➝ WHERE ➝ GROUP BY (and aggregation) ➝ HAVING ➝ SELECT ➝

$$
FROM ➝ WHERE ➝ GROUP BY (and aggregation) ➝ HAVING ➝ SELECT ➝ ORDER BY ➝

$$
FROM ➝ WHERE ➝ GROUP BY (and aggregation) ➝ HAVING ➝ SELECT ➝ ORDER BY ➝ LIMIT

Both: Films from 2000 onwards where the average duration is above 80
WHERE: Individual records
HAVING: Mentions aggregate

$$
1) Start small
2) Check the results
3) Prompt again to add more detail
4) Repeat!

Intermediate SQL with AI