Fit a random forest

Data scientists often use random forest models. They perform well out of the box, and have lots of settings to optimize performance. Random forests can be used for classification or regression; we'll use it for regression to predict the future price change of LNG.

We'll create and fit the random forest model similarly to the decision trees using the .fit(features, targets) method. With sklearn's RandomForestRegressor, there's a built-in .score() method we can use to evaluate performance. This takes arguments (features, targets), and returns the R\(^2\) score (the coefficient of determination).

This exercise is part of the course

Machine Learning for Finance in Python

View Course

Exercise instructions

  • Create the random forest model with the imported RandomForestRegressor class.
  • Fit (train) the random forest using train_features and train_targets.
  • Print out the R\(^2\) score on the train and test sets.

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

from sklearn.ensemble import RandomForestRegressor

# Create the random forest model and fit to the training data
rfr = ____(n_estimators=200)
rfr.fit(____, ____)

# Look at the R^2 scores on train and test
print(rfr.score(train_features, train_targets))
print(____)