Interactive Quiz
Test your knowledge!
1. What is the main advantage of transfer learning in deep learning?
A. It allows freely modifying the architecture of the pre-trained model.
B. It allows training a model from scratch more quickly.
C. It speeds up training and improves performance by reusing an already trained model.
D. It always requires more data than traditional training.
2. What is the main difference between transfer learning and fine-tuning?
A. Transfer learning trains a new model without using a pre-trained model; fine-tuning uses a pre-trained model.
B. Fine-tuning involves retraining only certain layers of a pre-trained model, while transfer learning can retrain all or part of the model.
C. Fine-tuning modifies the model's architecture; transfer learning does not.
D. Transfer learning can only be used on identical tasks; fine-tuning can be used on different tasks.
3. In fine-tuning, how do you choose the number of layers to retrain?
A. Always retrain all layers for better performance.
B. The less data you have, the more layers you retrain.
C. The more similar the tasks, the fewer layers you retrain.
D. The number of retrained layers has no influence.
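Questions 1 to 3 contrast transfer learning with fine-tuning. As a reminder of what "freezing" and "retraining layers" look like in practice, here is a minimal sketch, assuming PyTorch and torchvision with an ImageNet-pretrained ResNet-18; the 5-class head and the learning rate are illustrative choices, not values from the course:

```python
import torch
import torch.nn as nn
from torchvision import models

# Load a model pre-trained on ImageNet (the transfer learning starting point).
model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)

# Transfer learning: freeze every pre-trained layer so only the new head trains.
for param in model.parameters():
    param.requires_grad = False

# Replace the classification head for the new task (e.g., 5 classes).
model.fc = nn.Linear(model.fc.in_features, 5)

# Fine-tuning: additionally unfreeze the last residual block, e.g. when the
# target task is less similar to ImageNet (the less similar the task, or the
# more data you have, the more layers you typically retrain).
for param in model.layer4.parameters():
    param.requires_grad = True

# Only the unfrozen parameters are handed to the optimizer.
optimizer = torch.optim.Adam(
    (p for p in model.parameters() if p.requires_grad), lr=1e-4
)
```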
4. Which dataset is often used to pre-train image classification models in transfer learning?
A. MNIST
B. CIFAR-10
C. ImageNet
D. COCO
5. What is the main objective of knowledge distillation?
A. Increase the model size to improve accuracy.
B. Transfer knowledge from a high-performing model (teacher) to a smaller model (student).
C. Train a model without using labels.
D. Reduce the number of layers in a deep network.
6. Why does knowledge distillation often improve the performance of the student model?
A. Because the student uses only labels and not the teacher's predictions.
B. Because the student learns a more informative probability distribution than labels alone.
C. Because the student is trained without a loss function.
D. Because the teacher is smaller than the student.
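Questions 5 and 6 describe classic knowledge distillation. A minimal sketch of the usual distillation loss, assuming PyTorch; the temperature T and the weighting alpha are illustrative defaults, not prescribed values:

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.5):
    # Soft targets: the teacher's softened distribution carries more
    # information than the hard labels alone (relative class similarities).
    soft_loss = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    # Hard targets: standard cross-entropy against the true labels.
    hard_loss = F.cross_entropy(student_logits, labels)
    return alpha * soft_loss + (1 - alpha) * hard_loss
```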
7. In knowledge distillation applied to unsupervised anomaly detection, what is the main role of the student model?
A. Directly predict the class of images.
B. Learn to reproduce the internal representations (feature maps) of the teacher model on defect-free data to detect anomalies by difference.
C. Generate synthetic data for training.
D. Remain frozen (untrained) throughout the process.
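Question 7 refers to student-teacher feature matching for anomaly detection. A minimal sketch, assuming PyTorch and hypothetical `teacher` and `student` networks that both return a feature map of shape [B, C, H, W]:

```python
import torch
import torch.nn.functional as F

def train_step(student, teacher, normal_images, optimizer):
    teacher.eval()  # the teacher stays frozen
    with torch.no_grad():
        t_feat = teacher(normal_images)
    s_feat = student(normal_images)
    # The student learns to reproduce the teacher's features on defect-free data.
    loss = F.mse_loss(s_feat, t_feat)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()

def anomaly_map(student, teacher, image):
    # At test time, regions where student and teacher disagree are flagged
    # as anomalous, since the student only learned to match on normal data.
    with torch.no_grad():
        t_feat, s_feat = teacher(image), student(image)
    return (t_feat - s_feat).pow(2).mean(dim=1)  # per-pixel anomaly score
```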
8. What distinguishes the BERT architecture from GPT?
A. BERT is a unidirectional transformer; GPT is bidirectional.
B. BERT is based on the transformer encoder block and is bidirectional; GPT uses the decoder block and is unidirectional.
C. BERT cannot be fine-tuned; GPT can.
D. BERT uses only positional embeddings.
9. Which training tasks does BERT use to learn linguistic representations?
A. Next word prediction only.
B. Masked language modeling (predicting masked words) and next sentence prediction.
C. Machine translation.
D. Image classification.
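Question 9 concerns BERT's pre-training objectives. A minimal sketch of masked language modeling in action, assuming the Hugging Face transformers library and the public bert-base-uncased checkpoint:

```python
from transformers import pipeline

# BERT fills in the masked word using context from both directions.
fill_mask = pipeline("fill-mask", model="bert-base-uncased")

for prediction in fill_mask("The capital of France is [MASK]."):
    print(prediction["token_str"], round(prediction["score"], 3))
```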
10. In token-level classification with BERT (e.g., NER), why is a [CLS] token used at the beginning of the sequence?
A. To indicate the end of the sequence.
B. To extract a global representation useful for sentence-level classification.
C. To mask tokens.
D. To replace all tokens with a single one.
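Question 10 touches on the role of the [CLS] token. A minimal sketch, assuming the Hugging Face transformers library and bert-base-uncased, contrasting the [CLS] vector (a global, sentence-level representation) with the per-token vectors used for token-level tasks such as NER:

```python
import torch
from transformers import BertTokenizer, BertModel

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertModel.from_pretrained("bert-base-uncased")

inputs = tokenizer("Paris is the capital of France.", return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

hidden = outputs.last_hidden_state      # shape: [1, seq_len, 768]
cls_vector = hidden[:, 0, :]            # [CLS]: global representation for sentence-level classification
token_vectors = hidden[:, 1:-1, :]      # per-token vectors used for NER-style tagging
print(cls_vector.shape, token_vectors.shape)
```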