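We add a feature, the number of words in each line, to the train and test data sets.
# str_count() with no pattern counts characters; "\\S+" counts whitespace-delimited words
train$len = str_count(train$text, "\\S+")
test$len = str_count(test$text, "\\S+")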
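A small sketch of how the count behaves, assuming the stringr package is available. The `toy` data frame and its column names `n_chars` and `n_words` are hypothetical stand-ins, not part of the original data. With its default pattern, `str_count()` returns the number of characters, while a pattern such as `"\\S+"` counts runs of non-whitespace, which is one reasonable definition of a word.

```r
library(stringr)

# Hypothetical toy data standing in for train$text
toy <- data.frame(
  text = c("It was a dark night", "Hello"),
  stringsAsFactors = FALSE
)

# Default pattern "" matches every character, so this is a character count
toy$n_chars <- str_count(toy$text)

# "\\S+" matches each run of non-whitespace characters, so this is a word count
toy$n_words <- str_count(toy$text, "\\S+")

print(toy)
```

Which of the two is the better feature depends on the model; the two are usually strongly correlated, so keeping both rarely adds much.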