Intermediate ChatGPT
Alex Banks
Founder & Educator








Large quantities of low-quality internet data

Low quantities of high-quality conversational data

RLHF = Reinforcement Learning from Human Feedback


Labeling is a human-machine collaboration

Intermediate ChatGPT