Build and assess a model: movie reviews

In this problem, you will build a logistic regression model using the movies dataset. The sentiment score is stored in the label column: it is 1 when the review is positive and 0 when it is negative. The review text has already been transformed into numeric columns using a bag-of-words (BOW) representation.

You have already built a classifier, but you evaluated it on the same data that was used to train it, which tends to overstate how well the model generalizes. Now assess the model on an unseen test dataset. How does the model's performance change when it is evaluated on the test set?

Import the function required for a train/test split.
Perform the train/test split, specifying that 20% of the data should be used as a test set.
Train a logistic regression model.
Print out the accuracy of the model on the training and on the testing data.
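
The four steps above can be sketched roughly as follows. This is a minimal illustration, not the exercise's reference solution: the movies DataFrame built here is a hypothetical synthetic stand-in (random word counts plus a label column) so that the snippet runs on its own, and the names X, y, log_reg, the random_state values, and max_iter=1000 are assumptions chosen for the example. With the real dataset, only the split, fit, and score lines would be needed.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Hypothetical stand-in for the movies dataset: BOW-style count columns
# plus a binary 'label' column. Replace with the real DataFrame if you have it.
rng = np.random.default_rng(0)
n_reviews, n_words = 500, 50
counts = rng.poisson(1.0, size=(n_reviews, n_words))
# Let the label depend on a few word counts so there is some signal to learn.
signal = counts[:, :5].sum(axis=1) - counts[:, 5:10].sum(axis=1)
label = (signal + rng.normal(0, 1, n_reviews) > 0).astype(int)
movies = pd.DataFrame(counts, columns=[f"word_{i}" for i in range(n_words)])
movies["label"] = label

# Separate the features from the target.
X = movies.drop("label", axis=1)
y = movies["label"]

# Hold out 20% of the rows as an unseen test set.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

# Train the logistic regression on the training portion only.
log_reg = LogisticRegression(max_iter=1000).fit(X_train, y_train)

# score() returns the mean accuracy on the data it is given.
train_acc = log_reg.score(X_train, y_train)
test_acc = log_reg.score(X_test, y_test)
print("Accuracy on train set:", train_acc)
print("Accuracy on test set:", test_acc)
```

Comparing the two printed numbers is the point of the exercise: accuracy on the training data is typically somewhat higher than on the held-out test data, and the test figure is the more honest estimate of performance on new reviews.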
