Fundamentals of reinforcement learning

Reinforcement Learning with Gymnasium in Python

Fouad Trad

Machine Learning Engineer

Reinforcement learning

Agent learns through trial and error

Image showing two icons, one for an agent, and the other for the environment.

Reinforcement learning

Agent learns through trial and error

Image showing that observations are given from the environment to the agent.

Reinforcement learning

Agent learns through trial and error

Image showing that the environment provides the agent with observations, and then the agent performs actions accordingly.

Reinforcement learning

Agent learns through trial and error
Agent receives:
- Rewards for good decisions
- Penalties for bad decisions
Goal: maximize positive feedback over time

Image showing that the environment provides the agent with observations, then the agent performs actions, and receives rewards or penalties based on these actions.

RL as training a pet

Image showing an old man (the environment) training a pet (the agent).

RL vs. other ML types

The image shows a table with the title "Supervised Learning," indicating that the data type used is labeled data, the main objective is to predict outcomes based on input data, and it is suitable for classification and regression tasks.

RL vs. other ML types

RL vs. other ML types

When to use RL?

Sequential decision-making
- Decisions influence future observations
Learning through rewards and penalties
- No direct supervision

Icon for a robot

Appropriate for RL: playing video games

Player makes sequential decisions
Receives points and loses lives depending on actions

Image showing a video game scene where the agent is taking a decision.

Inappropriate for RL: in-game object recognition

No sequential decision-making
No interaction with an environment

Image showing a video game frame where the goal is to recognize different kinds of pokemons.

RL applications

Robotics

Robot walking
Object manipulation

Image showing a robot hand.

RL applications

Robotics

Robot walking
Object manipulation

Image showing a robot hand.

Finance

Optimizing trading and investment
Maximize profit

Image depicting a large sum of money flying out of an open briefcase against a blue background, conveying the concept of financial success.

RL applications

Autonomous Vehicles

Enhancing safety and efficiency
Minimizing accident risks

Image showing several autonomous vehicles driving on the road.

RL applications

Autonomous Vehicles

Enhancing safety and efficiency
Minimizing accident risks

Image showing several autonomous vehicles driving on the road.

Chatbot development

Enhancing conversational skills
Improving user experiences

Image showing a conversational chatbot.

What's next?

In this course we will:

Understand RL foundations and principles
Identify, frame, and solve RL problems
Application with Gymnasium

Image for the Gymnasium logo.

Let's practice!

Reinforcement Learning with Gymnasium in Python

Preparing Video For Download...