Efficiënte code schrijven met pandas
Leonidas Souliotis
PhD Candidate
| S1 | R1 | S2 | R2 | S3 | R3 | S4 | R4 | S5 | R5 | |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | ♦ | 10 | ♣ | Boer | ♣ | Koning | ♠ | 4 | ♥ | Aas |
| 2 | ♦ | Boer | ♦ | Koning | ♦ | 10 | ♦ | Vrouw | ♦ | Aas |
| 3 | ♣ | Vrouw | ♣ | Boer | ♣ | Koning | ♣ | 10 | ♣ | Aas |
| S1 | R1 | S2 | R2 | S3 | R3 | S4 | R4 | S5 | R5 | |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 10 | 3 | 11 | 3 | 13 | 4 | 4 | 1 | 1 |
| 2 | 2 | 11 | 2 | 13 | 2 | 10 | 2 | 12 | 2 | 1 |
| 3 | 3 | 12 | 3 | 11 | 3 | 13 | 3 | 10 | 3 | 1 |
Sn: symbool van de n-de kaart
1 (Harten), 2 (Ruiten), 3 (Klaveren), 4 (Schoppen)
Rn: rang van de n-de kaart
1 (Aas), 2-10, 11 (Boer), 12 (Vrouw), 13 (Koning)
.loc[] — indexnaam-locator
# Specify the range of rows to select
rows = range(0, 500)
# Time selecting rows using .loc[]
loc_start_time = time.time()
data.loc[rows]
loc_end_time = time.time()
print("Time using .loc[] : {} sec".format(
loc_end_time - loc_start_time))
Time using .loc[]: 0.001951932 seconds
.iloc[] — indexnummer-locator
# Specify the range of rows to select
rows = range(0, 500)
# Time selecting rows using .iloc[]
iloc_start_time = time.time()
data.iloc[rows]
iloc_end_time = time.time()
print("Time using .iloc[]: {} sec".format(
iloc_end_time - iloc_start_time)
Time using .iloc[] : 0.0007140636 sec
Verschil in snelheid: 173.355592654%
.iloc[] — indexnummer-locator
iloc_start_time = time.time()
data.iloc[:,:3]
iloc_end_time = time.time()
print("Time using .iloc[]: {} sec".format(
iloc_end_time - iloc_start_time))
Time using .iloc[]: 0.00125193595886 sec
Kolommen op naam selecteren
names_start_time = time.time()
data[['S1', 'R1', 'S2']]
names_end_time = time.time()
print("Time using selection by name: {} sec".format(
names_end_time - names_start_time))
Time using selection by name: 0.000964879989624 sec
Verschil in snelheid: 29.7504324188%
Efficiënte code schrijven met pandas