Subqueries in SELECT

Data Manipulation in SQL

Mona Khalil

Data Scientist, Greenhouse Software

SELECTing what?

  • Returns a single value
    • Include aggregate values to compare to individual values
  • Used in mathematical calculations
    • Deviation from the average
Data Manipulation in SQL

Subqueries in SELECT

  • Calculate the total matches across all seasons
SELECT COUNT(id) FROM match;
12837
Data Manipulation in SQL

Subqueries in SELECT

SELECT
  season,
  COUNT(id) AS matches,
  12837 as total_matches
FROM match
GROUP BY season;
| season    | matches | total_matches |
|-----------|---------|---------------|
| 2011/2012 | 3220    | 12837         |
| 2012/2013 | 3260    | 12837         |
| 2013/2014 | 3032    | 12837         |
| 2014/2015 | 3325    | 12837         |
Data Manipulation in SQL

Subqueries in SELECT

SELECT
  season,
  COUNT(id) AS matches,
  (SELECT COUNT(id) FROM match) as total_matches
FROM match
GROUP BY season;
| season    | matches | total_matches |
|-----------|---------|---------------|
| 2011/2012 | 3220    | 12837         |
| 2012/2013 | 3260    | 12837         |
| 2013/2014 | 3032    | 12837         |
| 2014/2015 | 3325    | 12837         |
Data Manipulation in SQL

SELECT subqueries for mathematical calculations

SELECT AVG(home_goal + away_goal) 
FROM match
WHERE season = '2011/2012';
2.72
SELECT
  date,
  (home_goal + away_goal) AS goals,
  (home_goal + away_goal) - 2.72 AS diff
FROM match
WHERE season = '2011/2012';
Data Manipulation in SQL

Subqueries in SELECT

SELECT
  date,
  (home_goal + away_goal) AS goals,
  (home_goal + away_goal) - 
     (SELECT AVG(home_goal + away_goal) 
      FROM match
      WHERE season = '2011/2012') AS diff
FROM match
WHERE season = '2011/2012';
| date       | goals | diff              |
|------------|-------|-------------------|
| 2011-07-29 | 3     | 0.28354037267081  |
| 2011-07-30 | 2     | -0.71645962732919 |
| 2011-07-30 | 4     | 1.28354037267081  |
| 2011-07-30 | 1     | -1.71645962732919 |
Data Manipulation in SQL

SELECT subqueries -- things to keep in mind

  • Need to return a SINGLE value

    • Will generate an error otherwise
  • Make sure you have all filters in the right places

    • Properly filter both the main and the subquery!
SELECT
    date,
    (home_goal + away_goal) AS goals,
    (home_goal + away_goal) - 
       (SELECT AVG(home_goal + away_goal) 
        FROM match
        WHERE season = '2011/2012') AS diff
FROM match
WHERE season = '2011/2012';
Data Manipulation in SQL

Let's practice!

Data Manipulation in SQL

Preparing Video For Download...