This white paper examines different approaches for sentiment analysis and summarizes the key benefits and drawbacks of each:
1. The data mining approach represents documents as numeric vectors and applies machine learning techniques to discover patterns for predicting sentiment. While capable of discovering complex patterns, it does not maintain important contextual information and provides little insight into model predictions.
2. The natural language processing (NLP) approach uses linguistic rules defined by domain experts to determine sentiment polarity. It can better capture context but requires more time to develop rules and annotate training data.
3. A hybrid approach combines the two by using data mining to discover patterns for rule development in NLP models or by incorporating linguistic features into machine learning models. This takes advantage