Intermediate ChatGPT
Alex Banks
Founder & Educator

Large quantities of low-quality internet data

Small quantities of high-quality conversational data

RLHF = Reinforcement Learning from Human Feedback


Labeling is a human-machine collaboration
