Utilisation des dates et des heures en Python
Max Shron
Data Scientist and Author
# Charger Pandas import pandas as pd# Importer les trajets de W20529 au T4 2017 rides = pd.read_csv('capital-onebike.csv')
# Voir nos données
print(rides.head(3))
Start date End date Start station \
0 2017-10-01 15:23:25 2017-10-01 15:26:26 Glebe Rd & 11th St N
1 2017-10-01 15:42:57 2017-10-01 17:49:59 George Mason Dr & Wilson Blvd
2 2017-10-02 06:37:10 2017-10-02 06:42:53 George Mason Dr & Wilson Blvd
End station Bike number Member type
0 George Mason Dr & Wilson Blvd W20529 Member
1 George Mason Dr & Wilson Blvd W20529 Casual
2 Ballston Metro / N Stuart & 9th St N W20529 Member
rides['Start date']
0 2017-10-01 15:23:25
1 2017-10-01 15:42:57
...
Name: Start date, Length: 290, dtype: object
rides.iloc[2]
Start date 2017-10-02 06:37:10
End date 2017-10-02 06:42:53
...
Name: 1, dtype: object
# Importer les trajets de W20529 au T4 2017 rides = pd.read_csv('capital-onebike.csv', parse_dates = ['Start date', 'End date'])# Ou : rides['Start date'] = pd.to_datetime(rides['Start date'], format = "%Y-%m-%d %H:%M:%S")
# Sélectionner Start date pour la ligne 2
rides['Start date'].iloc[2]
Timestamp('2017-10-02 06:37:10')
# Créer une colonne de durée rides['Duration'] = rides['End date'] - rides['Start date']# Afficher les 5 premières lignes print(rides['Duration'].head(5))
0 0 days 00:03:01
1 0 days 02:07:02
2 0 days 00:05:43
3 0 days 00:21:18
4 0 days 00:21:17
Name: Duration, dtype: timedelta64[ns]
rides['Duration']\
.dt.total_seconds()\
.head(5)
0 181.0
1 7622.0
2 343.0
3 1278.0
4 1277.0
Name: Duration, dtype: float64
Utilisation des dates et des heures en Python