Exercise

Data preparation

During the 2016 US election, Russian tweet bots were used to constantly distribute political rhetoric to both Democrats and Republicans. You have been given a dataset of such tweets called russian_tweets. You have decided to classify these tweets as either left-leaning (Democrat) or right-leaning (Republican). Before you can build a classification model, you need to clean and prepare the text for modeling.

Instructions

Finalize the tokenization process by stemming the tokens.
Use cast_dtm() to create a document-term matrix.
Weight the document-term matrix using TF-IDF weighting.
Print the matrix.
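
The steps above can be sketched roughly as follows. This is a minimal illustration, not the exercise's reference solution: the real russian_tweets data is not reproduced here, so a three-row toy table stands in for it, and the column names tweet_id and content are assumptions. Stemming uses wordStem() from SnowballC, and the TF-IDF weighting is passed to cast_dtm() through tm::weightTfIdf, so the tm package needs to be installed.

```r
library(dplyr)
library(tidytext)
library(SnowballC)

# Hypothetical stand-in for russian_tweets; the real column names may differ.
toy_tweets <- tibble(
  tweet_id = 1:3,
  content = c(
    "Voters are voting early in every state",
    "The candidates debated taxes and jobs",
    "Jobs and taxes matter to voters"
  )
)

# Tokenize into words, drop stop words, then stem each remaining token.
toy_tokens <- toy_tweets %>%
  unnest_tokens(output = word, input = content, token = "words") %>%
  anti_join(stop_words, by = "word") %>%
  mutate(word = wordStem(word))

# Count stems per tweet and cast the counts to a document-term matrix,
# weighting the entries by TF-IDF instead of raw term frequency.
tweet_matrix <- toy_tokens %>%
  count(tweet_id, word) %>%
  cast_dtm(document = tweet_id, term = word, value = n,
           weighting = tm::weightTfIdf)

# Print a summary of the matrix (dimensions, sparsity, weighting).
tweet_matrix
```

Printing a document-term matrix shows a summary rather than the individual cells. On a real collection of tweets the matrix is typically very sparse, because most terms appear in only a few documents.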
