Metropolitane segregatie

US Census-gegevens analyseren in Python

Lee Hachadoorian

Asst. Professor of Instruction, Temple University

Een kaart van de VS met grenzen van metropolitane en micropolitane statistische gebieden.

Bron: United States Census Bureau

US Census-gegevens analyseren in Python

Census API-aanvraag: metro/micro-gegevens

import requests

# Basis-URL opbouwen
HOST = "https://api.census.gov/data"
year = "2012"
dataset = "acs/acs5"
base_url = "/".join([HOST, year, dataset])

# Gewenste variabelen
# B01001_001E = Totale bevolking (schatting)
# B03002_003E = Niet-Spaanse witte bevolking (schatting)
# B03002_004E = Niet-Spaanse zwarte bevolking (schatting)
get_vars = ["NAME", "B01001_001E", "B03002_003E", "B03002_004E"]
US Census-gegevens analyseren in Python

Census API-aanvraag: metro/micro-gegevens

# Gewenste variabelen
get_vars = ["NAME", "B01001_001E", "B03002_003E", "B03002_004E"]

# Dictionary met parameters
predicates = {}
predicates["get"] = ",".join(get_vars)

# Gevraagde geografie predicates["for"] = \ "metropolitan statistical area/micropolitan statistical area:*"
US Census-gegevens analyseren in Python

Census API-aanvraag: metro/micro-gegevens

r = requests.get(base_url, params=predicates)

print(r.json()[:5])
[['NAME', 'B01001_001E', 'B03002_003E', 'B03002_004E', 'metropolitan statistical area/micropolitan statistical area'],
 ['Adjuntas, PR Micro Area', '19458', '140', '0', '10260'],
 ['Aguadilla-Isabela-San Sebastián, PR Metro Area', '305538', '5602', '231', '10380'],
 ['Coamo, PR Micro Area', '71596', '228', '53', '17620'],
 ['Fajardo, PR Metro Area', '70633', '543', '195', '21940']]
US Census-gegevens analyseren in Python

Census API-aanvraag: metro/micro-gegevens

# Gebruiksvriendelijke kolomnamen
col_names = ["name", "pop", "white", "black", "msa"]

# JSON-respons in DataFrame laden
msa = pd.DataFrame(columns=col_names, data=r.json()[1:])

# Aantalkolommen casten naar int
msa[["pop", "white", "black"]] = msa["pop", "white", "black"]].astype(int)
US Census-gegevens analyseren in Python

Definitie metropolitaan gebied

  state county   tract  white  black
0    01    001  020100   1601    217
1    01    001  020200    844   1214
2    01    001  020300   2538    647
3    01    001  020400   4030    191
4    01    001  020500   8438   1418
     msa      msa_name          county_name    state_name state county
0  10100  Aberdeen, SD         Brown County  South Dakota    46    013
1  10100  Aberdeen, SD       Edmunds County  South Dakota    46    045
2  10140  Aberdeen, WA  Grays Harbor County    Washington    53    027
3  10180   Abilene, TX      Callahan County         Texas    48    059
4  10180   Abilene, TX         Jones County         Texas    48    253
US Census-gegevens analyseren in Python

Pandas merge-methode

import pandas as pd
# DataFrames samenvoegen op overeenkomende kolommen
tracts_with_msa_id = pd.merge(...)
US Census-gegevens analyseren in Python

Pandas merge-methode

import pandas as pd
# DataFrames samenvoegen op overeenkomende kolommen
tracts_with_msa_id = pd.merge(tracts, msa_def, ...)
US Census-gegevens analyseren in Python

Pandas merge-methode

import pandas as pd
# DataFrames samenvoegen op overeenkomende kolommen
tracts_with_msa_id = pd.merge(tracts, msa_def, 
    left_on = ["state", "county"], right_on = ["state", "county"])
# Alternatief wanneer kolomnamen gelijk zijn
tracts_with_msa_id = pd.merge(tracts, msa_def, on = ["state", "county"])
US Census-gegevens analyseren in Python

Pandas merge-methode

# DataFrame met statennamen
st.head()
       state_name
state            
01        Alabama
02         Alaska
04        Arizona
05       Arkansas
06     California
US Census-gegevens analyseren in Python

Pandas merge-methode

# tracts en st samenvoegen
tracts_st = pd.merge(tracts, st, left_on = "state", right_index = True)

tracts_st.head()
  state county   tract  white  black state_name
0    01    001  020100   1601    217    Alabama
1    01    001  020200    844   1214    Alabama
2    01    001  020300   2538    647    Alabama
3    01    001  020400   4030    191    Alabama
US Census-gegevens analyseren in Python

Laten we oefenen!

US Census-gegevens analyseren in Python

Preparing Video For Download...