Intermediate ChatGPT
Alex Banks
Founder & Educator
Large quantities of low-quality internet data
Low quantities of high-quality conversational data
RLHF = Reinforcement Learning from Human Feedback
Labeling is a human-machine collaboration
Intermediate ChatGPT