Within Exercise 5, it's indicated that # this should be a numpy array of length 941 of probabilities
However, if we're using train_label as means to measure our self-confidence so we can test our approach on out-of-sample data, then we're smaller than the indicated 941 length (unless we're expected to calculate the pred_probs for the full dataset). train_label has 705 observations, whereas the full dataframe has 941 observations.
Just wanted to get clarity as to what's exactly expected?