Skip to content

Latest commit

 

History

History
36 lines (23 loc) · 1.28 KB

README.md

File metadata and controls

36 lines (23 loc) · 1.28 KB

Analysis of sentiments


About the Dataset

This dataset was taken from the social network Twitter and is divided into three classes (positive, negative and neutral). These dataset have a popular language with slang and word abbreviations. It is necessary to perform some manipulations on the data to obtain a better performance of the model used.

Sentiments labels were transformed as follow:

  • Negative label: 0
  • Positive label: 1
  • Neutral label: 2

link of the dataset: https://www.kaggle.com/augustop/portuguese-tweets-for-sentiment-analysis

Columns description

  • id: String identifier directly from Twitter;
  • tweet_text: Full text from the tweet
  • tweet_date: Tweet creation date
  • sentiment: Sentiment label (classifier)
  • query_used: Query used to collect the tweet

References