Thoughts - Ambarish

Little Book on Text Mining

This Little Book on Text Mining provides a gentle and hands on introduction to Text Mining. If you are tired of reading through pages of text and would like to get your hands dirty and experience on how to do a quick and detailed text mining, then you are in the right place. This book does a detailed Text Mining and Modelling on the following datasets

  • Spooky Author Identification dataset from Kaggle

  • Yelp Data Reviews dataset from Kaggle

  • Simpsons dataset from Kaggle

Hope you will have fun going thorough this book as much as I had writing it.

You can find the book here

The book focuses on Three main areas

  • Exploratory Data Analysis , TF IDF concept and the application of it ,Bigrams ,Trigrams ,Relationship among various words ( Word Clouds and Bar Plots )

  • Detailed Sentiment Analysis and insights from it using different Sentiment Analysis lexicons such as AFINN , NRC

  • Modelling using feature engineering and supervised learning techniques such as XGBoost and Multinomial Logistic Regression.Modelling using unsupervised learning techniques such as Topic Modelling

Please let me know your comments and suggestions.