Testing Kaggle forum ideas
Not every Forum post or Kernel will actually help your model. So instead of blindly incorporating ideas into your pipeline, you should test them first.
You're given a function get_cv_score(), which takes a train dataset as an argument and returns the overall validation root mean squared error over 3-fold cross-validation. The train DataFrame is already available in your workspace.
You should try different suggestions from the Kaggle Forum and check whether they improve your validation score.
This exercise is part of the course
Winning a Kaggle Competition in Python
Interactive exercise
Try this exercise by completing this sample code.
# Drop passenger_count column
new_train_1 = train.____('____', axis=1)
# Compare validation scores
initial_score = get_cv_score(train)
new_score = get_cv_score(new_train_1)
print('Initial score is {} and the new score is {}'.format(initial_score, new_score))
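
To try the comparison outside the course workspace, here is a minimal, self-contained sketch. It is not the course's implementation: this get_cv_score() is a hypothetical stand-in that averages the per-fold RMSE of a linear regression, the target column name fare_amount and the distance_km feature are assumptions, and the data is synthetic. The blank is filled with train.drop('passenger_count', axis=1), which is one way to do what the comment in the sample code asks.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import KFold


def get_cv_score(train, target="fare_amount", n_splits=3, seed=123):
    """Hypothetical stand-in: mean validation RMSE over k folds."""
    features = [col for col in train.columns if col != target]
    kf = KFold(n_splits=n_splits, shuffle=True, random_state=seed)
    fold_rmse = []
    for train_idx, val_idx in kf.split(train):
        cv_train, cv_val = train.iloc[train_idx], train.iloc[val_idx]
        model = LinearRegression().fit(cv_train[features], cv_train[target])
        pred = model.predict(cv_val[features])
        # RMSE on the held-out fold
        fold_rmse.append(np.sqrt(mean_squared_error(cv_val[target], pred)))
    return float(np.mean(fold_rmse))


# Synthetic data (assumption): the target depends on distance only,
# so passenger_count carries no signal in this toy setup.
rng = np.random.default_rng(0)
n = 300
train = pd.DataFrame({
    "distance_km": rng.uniform(0.5, 20.0, n),
    "passenger_count": rng.integers(1, 7, n),
})
train["fare_amount"] = 2.5 + 1.8 * train["distance_km"] + rng.normal(0.0, 1.0, n)

# Drop passenger_count column (returns a new DataFrame, train is unchanged)
new_train_1 = train.drop("passenger_count", axis=1)

# Compare validation scores
initial_score = get_cv_score(train)
new_score = get_cv_score(new_train_1)
print("Initial score is {:.4f} and the new score is {:.4f}".format(initial_score, new_score))
```

A lower RMSE for new_train_1 would suggest the column was not helping. On this synthetic data the two scores should be close, because passenger_count is pure noise by construction; on the real dataset the result may differ, which is exactly why the idea is worth testing rather than assuming.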