Interactive Quiz
Test your knowledge!
1. What is Natural Language Processing (NLP) in the context of machine learning?
A. It is a method for processing color images
B. It is a set of text-related tasks such as translation and understanding
C. It is an algorithm for sorting numbers
D. It is only speech recognition
2. In a bigram model for predicting the next character, what is the prediction based on?
A. The three previous characters
B. A single previous character
C. All characters in the text
D. No previous characters; it is random
3. Why is a special character '.' added at the beginning and end of first names in the bigram model?
A. To increase the size of the dataset
B. To indicate punctuation
C. To model the probability of the first and last letter
D. To replace vowels
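The bigram setup behind questions 2 and 3 can be sketched by counting character pairs, with '.' wrapped around each name so the model learns which letters tend to start and end one. The tiny name list and function names below are illustrative, not from the course:

```python
from collections import Counter

# Toy list of first names; in practice this would be a large dataset.
names = ["emma", "olivia", "ava"]

# Wrap each name with the boundary marker '.' so the counts capture
# which characters begin and end a name.
counts = Counter()
for name in names:
    chars = "." + name + "."
    for ch1, ch2 in zip(chars, chars[1:]):
        counts[(ch1, ch2)] += 1

def bigram_prob(prev, nxt):
    """P(next char | previous char) via normalized counts."""
    total = sum(c for (a, _), c in counts.items() if a == prev)
    return counts[(prev, nxt)] / total if total else 0.0
```

Prediction then depends only on the single previous character: `bigram_prob(".", "a")` is the probability that a name starts with "a".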
4. What is the main problem with the counting method for N-grams when N increases?
A. The number of parameters becomes exponentially large
B. Accuracy automatically decreases
C. Characters become corrupted
D. The model becomes too fast
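The parameter explosion in question 4 is easy to make concrete: a count table over a vocabulary of size V needs one entry per N-character combination, i.e. V**N entries. A minimal sketch, assuming 26 letters plus the '.' marker:

```python
V = 27  # 26 letters plus the '.' boundary marker

def ngram_table_size(n, vocab=V):
    # One count per possible N-character combination: V**N entries.
    return vocab ** n

# The table grows exponentially with N: 27^2 = 729, 27^3 = 19683,
# 27^5 is already over 14 million entries, most of them never observed.
sizes = {n: ngram_table_size(n) for n in range(2, 6)}
```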
5. What is the main advantage of using a neural network with a fully connected layer for predicting the next character compared to the counting method?
A. It can handle a variable-sized context without an explosion in model size
B. It requires no training data
C. It always predicts the most frequent letter
D. It uses only a reduced input dimension of 1
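The contrast in question 5 can be sketched by comparing parameter counts: the count table grows exponentially with context length, while a fully connected model with an embedding grows only linearly. The vocabulary, embedding, and hidden sizes below are illustrative assumptions, not values from the course:

```python
V, d, H = 27, 10, 200  # vocabulary, embedding dim, hidden size (illustrative)

def table_params(n):
    # Counting method: one entry per N-character combination.
    return V ** n

def mlp_params(n):
    # Embedding C, hidden layer W1 + b1, output layer W2 + b2.
    context = n - 1
    return V * d + (context * d) * H + H + H * V + V

# For n = 4 the table needs 531441 entries, the MLP about 12 thousand
# parameters, and each extra context character adds only d * H more.
```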
6. What does the embedding matrix \( C \) represent in the fully connected model inspired by Bengio et al.?
A. A transformation that encodes characters into a continuous latent space
B. A dictionary of character frequencies
C. A filter to remove rare characters
D. A convolution matrix
7. What is the main advantage of the hyperbolic tangent (tanh) activation function compared to the sigmoid in hidden layers?
A. It always has a positive output
B. It facilitates learning with zero-centered output and larger gradients
C. It is faster to compute
D. It does not require gradient calculation
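The two properties named in question 7 can be checked numerically: tanh is zero-centered with outputs in (-1, 1), and its derivative at 0 is 1.0 versus 0.25 for the sigmoid, so gradients shrink less when passing through it. A small sketch:

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def dsigmoid(x):
    s = sigmoid(x)
    return s * (1.0 - s)  # peaks at 0.25 when x = 0

def dtanh(x):
    return 1.0 - math.tanh(x) ** 2  # peaks at 1.0 when x = 0

# tanh(0) = 0 (zero-centered), sigmoid(0) = 0.5 (always positive);
# at 0 the tanh gradient is four times the sigmoid gradient.
```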
8. What is the main motivation for using a Recurrent Neural Network (RNN) for character sequence prediction?
A. To avoid specifying a fixed context size and to retain memory of the entire previous context
B. To enable massive parallelization on GPUs
C. To use only the last character for prediction
D. To reduce the number of parameters to a single weight
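The motivation in question 8 comes from the RNN update itself: the same step function is applied at every position, and the hidden state summarizes everything seen so far, so no context length has to be fixed in advance. A scalar sketch with illustrative weights (a real layer uses learned matrices):

```python
import math

def rnn_step(x, h_prev, w_x=0.5, w_h=0.5, b=0.0):
    # New hidden state mixes the current input with the previous state.
    return math.tanh(w_x * x + w_h * h_prev + b)

def encode_sequence(xs):
    # The same parameters are reused at every step, so the sequence
    # can be arbitrarily long; h carries memory of all previous inputs.
    h = 0.0
    for x in xs:
        h = rnn_step(x, h)
    return h
```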
9. What is one of the main problems encountered by classical RNNs on long sequences?
A. Only the exploding gradient problem
B. Difficulty propagating information over long sequences (vanishing gradient)
C. Inability to process short sequences
D. They can only process sequences of fixed length
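The vanishing gradient of question 9 can be illustrated directly: backpropagating through T steps of a simple tanh RNN multiplies the gradient by roughly w * tanh'(h) at each step, and when that factor is below 1 the product shrinks exponentially. A rough sketch with illustrative values:

```python
import math

def gradient_scale(w, h, steps):
    # Per-step factor: recurrent weight times the tanh derivative;
    # raised to the number of steps the gradient flows back through.
    factor = w * (1.0 - math.tanh(h) ** 2)
    return factor ** steps

short = gradient_scale(0.9, 0.5, 5)    # still a usable magnitude
long_ = gradient_scale(0.9, 0.5, 100)  # many orders of magnitude smaller
```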
10. What is the main innovation introduced by the LSTM layer compared to a classical RNN?
A. It replaces the softmax function with a sigmoid function
B. It introduces gates (forget, input, output) to manage short-term and long-term memory
C. It uses only convolutions to process the sequence
D. It directly predicts an entire sequence in a single step
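The gate structure in question 10 can be sketched as a scalar LSTM step: each gate is a sigmoid in (0, 1) deciding how much old memory to keep, how much new information to write, and how much of the memory to expose. The scalar weights below are illustrative; a real layer uses learned matrices per gate:

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def lstm_step(x, h_prev, c_prev, w=1.0, b=0.0):
    f = sigmoid(w * x + w * h_prev + b)          # forget gate: keep old memory?
    i = sigmoid(w * x + w * h_prev + b)          # input gate: write new info?
    o = sigmoid(w * x + w * h_prev + b)          # output gate: expose memory?
    c_tilde = math.tanh(w * x + w * h_prev + b)  # candidate memory content
    c = f * c_prev + i * c_tilde                 # long-term memory (cell state)
    h = o * math.tanh(c)                         # short-term (hidden) state
    return h, c
```

The additive update of `c` is what lets gradients flow over long spans: when the forget gate stays near 1, old memory passes through almost unchanged.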