Question 1

How does zero-shot learning work?

Accepted Answer

Zero-shot learning works by training a model to map input features (e.g., images) into a shared semantic space, such as attribute vectors or word embeddings. The mapping is learned on seen classes, for which both labeled examples and semantic descriptions are available. During inference, the model projects a new input into that space, compares it against the semantic descriptions of unseen classes, and selects the closest match. This allows recognition of classes that had no labeled examples during training.
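The inference step described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the three attributes, the class names, the hand-written `WEIGHTS` matrix standing in for a projection that would normally be learned on seen classes, and the use of cosine similarity as the closeness measure.

```python
import math

# Hypothetical attribute vectors for classes with no training images.
# Attribute order (assumed): [has_stripes, has_hooves, lives_in_water]
UNSEEN_CLASSES = {
    "zebra": [1.0, 1.0, 0.0],
    "dolphin": [0.0, 0.0, 1.0],
}

# Stand-in for a projection learned on seen classes: maps a 4-dimensional
# feature vector to the 3-dimensional attribute space. Hand-written here.
WEIGHTS = [
    [1.0, 0.0, 0.0, 0.0],
    [0.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.0],
]


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def project(features, weights):
    """Apply a linear map: one output value per row of `weights`."""
    return [sum(w * x for w, x in zip(row, features)) for row in weights]


def zero_shot_predict(features, weights, class_descriptions):
    """Return the class whose description is closest to the projected input."""
    projected = project(features, weights)
    return max(
        class_descriptions,
        key=lambda name: cosine(projected, class_descriptions[name]),
    )


# Made-up features for an image of a striped, hoofed animal.
print(zero_shot_predict([0.9, 0.8, 0.1, 0.3], WEIGHTS, UNSEEN_CLASSES))
```

In a real system the feature vector would come from an image encoder and the projection would be fitted on seen classes; only the final nearest-description lookup is shown here.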

Question 2

What is the difference between zero-shot and one-shot learning?

Accepted Answer

Zero-shot learning requires no labeled examples of the target class during training, relying solely on semantic descriptions. One-shot learning, in contrast, uses a single labeled example of each target class to learn a new concept. Zero-shot learning is the more data-starved setting, so it depends on high-quality semantic information, while one-shot learning often relies on metric learning or meta-learning to generalize from that single example.
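One way to see the contrast is that both settings can be framed as nearest-prototype classification, differing only in where each class prototype comes from. The sketch below is a simplification under that assumption; the vectors, class names, and the `nearest_class` helper are hypothetical, and real one-shot systems usually learn the embedding with metric learning or meta-learning rather than using fixed vectors.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def nearest_class(query, prototypes):
    """Return the class whose prototype is most similar to the query."""
    return max(prototypes, key=lambda name: cosine(query, prototypes[name]))


# One-shot: each prototype is the embedding of the single labeled example
# available for that class (made-up values).
one_shot_prototypes = {
    "zebra": [0.85, 0.9, 0.05],
    "dolphin": [0.1, 0.0, 0.9],
}

# Zero-shot: each prototype is a semantic description written without
# any example of the class (made-up attribute vectors).
zero_shot_prototypes = {
    "zebra": [1.0, 1.0, 0.0],
    "dolphin": [0.0, 0.0, 1.0],
}

# Embedding of a new, unlabeled input (made-up values).
query = [0.9, 0.8, 0.1]

print(nearest_class(query, one_shot_prototypes))
print(nearest_class(query, zero_shot_prototypes))
```

The lookup is identical in both cases; what changes is whether the prototype is derived from one labeled example or from a description alone.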

Question 3

When should zero-shot learning be used instead of traditional supervised learning?

Accepted Answer

Zero-shot learning is appropriate when labeled data for many classes is unavailable or expensive to obtain, but semantic descriptions (e.g., attributes or text) are easy to define. It is commonly used for classifying rare species, recognizing newly emerging object categories, or handling tasks with a very large number of classes. However, if sufficient labeled data exists for all classes, traditional supervised learning typically yields higher accuracy.

Zero-Shot Learning
