What's in the database?

Analisi esplorativa dei dati in SQL

Christina Maimone

Data Scientist

PostgreSQL

PostgreSQL Logo

Analisi esplorativa dei dati in SQL

Database client

Database client connects to database server

Analisi esplorativa dei dati in SQL

entity relationship diagram with each database table and their columns

Analisi esplorativa dei dati in SQL

Evanston 311 database table

Analisi esplorativa dei dati in SQL

erdiagram_fortune500.png

Analisi esplorativa dei dati in SQL

Stackoverflow database table

Analisi esplorativa dei dati in SQL

Tables supporting stackoverflow database table

Analisi esplorativa dei dati in SQL

Select a few rows

SELECT * 
  FROM company 
LIMIT 5;
 id | exchange | ticker |         name          | parent_id 
<hr />-+----------+--------+-----------------------+-----------
  1 | nasdaq   | PYPL   | PayPal Holdings, Inc. |          
  2 | nasdaq   | AMZN   | Amazon.com, Inc.      |          
  3 | nasdaq   | MSFT   | Microsoft Corporation |          
  4 | nasdaq   | MDB    | MongoDB Inc.          |          
  5 | nasdaq   | DBX    | Dropbox, Inc.         |          
(5 rows)
Analisi esplorativa dei dati in SQL

A few reminders

Code Note
NULL missing
Analisi esplorativa dei dati in SQL

A few reminders

Code Note
NULL missing
IS NULL, IS NOT NULL don't use = NULL
Analisi esplorativa dei dati in SQL

A few reminders

Code Note
NULL missing
IS NULL, IS NOT NULL don't use = NULL
count(*) number of rows
Analisi esplorativa dei dati in SQL

A few reminders

Code Note
NULL missing
IS NULL, IS NOT NULL don't use = NULL
count(*) number of rows
count(column_name) number of non-NULL values
Analisi esplorativa dei dati in SQL

A few reminders

Code Note
NULL missing
IS NULL, IS NOT NULL don't use = NULL
count(*) number of rows
count(column_name) number of non-NULL values
count(DISTINCT column_name) number of different non-NULL values
Analisi esplorativa dei dati in SQL

A few reminders

Code Note
NULL missing
IS NULL, IS NOT NULL don't use = NULL
count(*) number of rows
count(column_name) number of non-NULL values
count(DISTINCT column_name) number of different non-NULL values
SELECT DISTINCT column_name ... distinct values, including NULL
Analisi esplorativa dei dati in SQL

Let's start exploring

Analisi esplorativa dei dati in SQL

Preparing Video For Download...