Les sous-requêtes sont partout ! Et quelles sont les bonnes pratiques ?

Manipulation de données en SQL

Mona Khalil

Data Scientist, Greenhouse Software

Autant de sous-requêtes que vous le souhaitez…

  • Peut inclure plusieurs sous-requêtes dans SELECT, FROM, WHERE
SELECT 
    s.stage,
    ROUND(s.avg_goals,2) AS avg_goal,
    (SELECT AVG(home_goal + away_goal)
     FROM match WHERE season = '2013/2014') AS overall_avg
FROM 
    (SELECT
         stage,
         AVG(home_goal + away_goal) AS avg_goals
     FROM match
     WHERE season = '2013/2014'
     GROUP BY stage) AS s
WHERE 
    s.avg_goals > (SELECT AVG(home_goal + away_goal) 
                   FROM match WHERE season = '2013/2014');
Manipulation de données en SQL

Formater les requêtes

  • Aligner SELECT, FROM, WHERE et GROUP BY
SELECT
    col1,
    col2,
    col3
FROM table1
WHERE col1 = 2;
Manipulation de données en SQL

Annoter vos requêtes

/* This query filters for col1 = 2
and only selects data from table1 */
SELECT
    col1,
    col2,
    col3
FROM table1
WHERE col1 = 2;
Manipulation de données en SQL

Annoter vos requêtes

SELECT
    col1,
    col2,
    col3
FROM table1 -- this table has 10,000 rows
WHERE col1 = 2; -- Filter WHERE value 2
Manipulation de données en SQL

Indenter vos requêtes

  • Indenter vos sous-requêtes
SELECT
    col1,
    col2,
    col3
FROM table1
WHERE col1 IN
        (SELECT id
         FROM table2
         WHERE year = 1991);
Manipulation de données en SQL

Indenter vos requêtes

SELECT 
  date, 
  hometeam_id, 
  awayteam_id,
  CASE WHEN hometeam_id = 8455 AND home_goal > away_goal 
            THEN 'Chelsea home win'
       WHEN awayteam_id = 8455 AND home_goal < away_goal
            THEN 'Chelsea away win'
       WHEN hometeam_id = 8455 AND home_goal < away_goal
            THEN 'Chelsea home loss'
       WHEN awayteam_id = 8455 AND home_goal > away_goal
            THEN 'Chelsea away loss'
       WHEN (hometeam_id = 8455 OR awayteam_id = 8455) 
            AND home_goal = away_goal THEN 'Chelsea Tie'
       END AS outcome
FROM match
WHERE hometeam_id = 8455 OR awayteam_id = 8455;

Guide de style SQL de Holywell

Manipulation de données en SQL

Cette sous-requête est-elle nécessaire ?

  • Les sous-requêtes nécessitent une puissance de calcul importante

    • Quelle est la taille de votre base de données ?
    • Quelle est la taille de la table sur laquelle porte votre requête ?
  • La sous-requête est-elle réellement nécessaire ?

Manipulation de données en SQL

Filtrer correctement chaque sous-requête

  • Attention aux filtres !
SELECT 
    s.stage,
    ROUND(s.avg_goals,2) AS avg_goal,
    (SELECT AVG(home_goal + away_goal)
     FROM match WHERE season = '2013/2014') AS overall_avg
FROM 
    (SELECT
         stage,
         AVG(home_goal + away_goal) AS avg_goals
     FROM match
     WHERE season = '2013/2014'
     GROUP BY stage) AS s
WHERE 
    s.avg_goals > (SELECT AVG(home_goal + away_goal) 
                   FROM match WHERE season = '2013/2014');
Manipulation de données en SQL

Passons à la pratique !

Manipulation de données en SQL

Preparing Video For Download...