Learning techniques

Large Language Models (LLMs) Concepts

Vidhi Chugh

AI strategist and ethicist

Where are we?

Progress chart showing we have reached the fine-tuning stage


Getting beyond data constraints

  • Fine-tuning: training a pre-trained model for a specific task
  • But what if there is little to no labeled data?
  • N-shot learning: zero-shot, few-shot, and multi-shot

Transfer learning

  • Learn from one task and transfer that knowledge to a related task
  • Transferring knowledge from piano to guitar
    • Reading musical notes
    • Understanding rhythm
    • Grasping musical concepts
  • N-shot learning
    • Zero-shot - no task-specific data
    • Few-shot - little task-specific data
    • Multi-shot - relatively more training data

Illustration of a piano and a guitar
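
The slides describe N-shot learning in terms of how much task-specific data is available. One common way to apply the idea with an LLM, assumed here rather than taken from the slides, is to vary how many labeled examples appear in the prompt. The sketch below builds zero-, few-, and multi-shot prompts for a made-up sentiment task; the function name, the example reviews, and the prompt layout are all hypothetical.

```python
def build_n_shot_prompt(instruction, examples, query, n_shots):
    """Build a prompt that shows the first n_shots labeled examples before the query."""
    blocks = [instruction]
    for text, label in examples[:n_shots]:
        blocks.append(f"Review: {text}\nSentiment: {label}")
    # The query is left unlabeled so the model can complete it.
    blocks.append(f"Review: {query}\nSentiment:")
    return "\n\n".join(blocks)


# Hypothetical labeled examples, invented for illustration.
EXAMPLES = [
    ("Loved every minute of it.", "positive"),
    ("A complete waste of time.", "negative"),
    ("The cast was wonderful.", "positive"),
    ("I would not recommend it.", "negative"),
]
INSTRUCTION = "Classify the sentiment of each review as positive or negative."
QUERY = "The plot was dull."

zero_shot = build_n_shot_prompt(INSTRUCTION, EXAMPLES, QUERY, n_shots=0)
few_shot = build_n_shot_prompt(INSTRUCTION, EXAMPLES, QUERY, n_shots=2)
multi_shot = build_n_shot_prompt(INSTRUCTION, EXAMPLES, QUERY, n_shots=len(EXAMPLES))

print(few_shot)
```

The only difference between the three prompts is the value of n_shots, which mirrors the zero-shot, few-shot, and multi-shot distinction above.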


Zero-shot learning

  • No explicit training
  • Uses language understanding and context
  • Generalizes without any prior examples

An image showing how training over horse and stripes separately helps the model identify zebra.

1 Freepik
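
The horse-plus-stripes picture can be sketched as a toy attribute-based classifier. This is only an illustration of the idea, not how an LLM works internally: it assumes the attributes ("horse_shaped", "striped") were already learned elsewhere, and that "zebra" is known only through a description, with no zebra training examples. All names below are hypothetical.

```python
# Class descriptions expressed as sets of attributes.
# "zebra" has a description but, by assumption, no training examples.
CLASS_DESCRIPTIONS = {
    "horse": {"horse_shaped"},
    "tiger": {"cat_shaped", "striped"},
    "zebra": {"horse_shaped", "striped"},
}


def jaccard(a, b):
    """Jaccard similarity between two sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def zero_shot_classify(detected_attributes, class_descriptions):
    """Return the class whose description best matches the detected attributes."""
    return max(
        class_descriptions,
        key=lambda name: jaccard(detected_attributes, class_descriptions[name]),
    )


# Attributes assumed to be detected in a new image by separately trained detectors.
prediction = zero_shot_classify({"horse_shaped", "striped"}, CLASS_DESCRIPTIONS)
print(prediction)  # zebra
```

The classifier never sees a labeled zebra; it combines what it knows about horses and stripes with a description of the new class.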

Few-shot learning

  • Learn a new task with a few examples

An image showing a classroom

  • One-shot learning: fine-tuning from one example
  • Uses prior knowledge to answer a new question

An image showing a child writing an exam
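
A minimal sketch of learning from a few examples, assuming a nearest-neighbour classifier over a small labeled "support set". The slides do not prescribe this method; it is a simple stand-in. The fixed bag-of-words features play the role of prior knowledge, and the support tickets and labels are invented for illustration.

```python
from collections import Counter


def featurize(text):
    """Stand-in for prior knowledge: a fixed bag-of-words representation."""
    return Counter(text.lower().split())


def similarity(a, b):
    """Count the word occurrences shared by two bags of words."""
    return sum((a & b).values())


def few_shot_classify(query, support_set):
    """Give the query the label of its most similar support example."""
    query_features = featurize(query)
    _, best_label = max(
        support_set,
        key=lambda pair: similarity(query_features, featurize(pair[0])),
    )
    return best_label


# Hypothetical support set: one labeled example per class.
support_set = [
    ("refund my order please", "billing"),
    ("the app crashes on startup", "bug"),
]

label = few_shot_classify("app crashes when I open it", support_set)
print(label)  # bug
```

With a single example in the support set, the same function becomes the one-shot case: every query receives that example's label.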


Multi-shot learning

  • Requires more examples than few-shot
  • Uses knowledge from previous tasks, plus new examples
  • For example, a model trained on images of Golden Retrievers

Images of three Golden Retrievers

1 Freepik

Multi-shot learning

  • Model output: Labrador Retriever
  • Saves time in collecting and labeling data
  • No compromise on accuracy

An image of a Labrador

1 Freepik

Building blocks so far

  • Data preparation workflow
  • Fine-tuning
  • N-shot learning techniques
  • Next up: pre-training

Let's practice!
