Key insight: The base model is an internet text simulator — it autocompletes based on patterns in training data.
The instruction-tuned model (SFT → RLHF) learns to follow instructions and be helpful.
Same underlying knowledge, fundamentally different behavior. InstructGPT (Jan 2022) demonstrated this; ChatGPT (Nov 2022) brought it to the public.
USER →
Click "Generate" to see output...
Click "Generate" to see output...