Some information about the data set
The data is a CSV with emoticons removed. The data file format has 6 fields:
the polarity of the tweet (-1= negative, 0 = neutral, 1 = positive)
the id of the tweet (2087)
the date of the tweet (Sat May 16 23:58:44 UTC 2009)
the query (lyx). If there is no query, then this value is NO_QUERY.
the user that tweeted (robotickilldozr)
the text of the tweet (Lyx is cool)