Skip to content

Latest commit

 

History

History
34 lines (27 loc) · 2.51 KB

README.md

File metadata and controls

34 lines (27 loc) · 2.51 KB

🎯 Sentiment Analysis

Perform binary sentiment analysis (positive / negative) with tweets.

We benchmark 3 approaches:

  • Turn-Key Solutions with Microsoft Cognitive Services (Sentiment Analysis);
  • Low-code Solutions with Microsoft Azure Machine Learning Studio (Designer);
  • Advanced and Custom Solutions with Keras / Tensorflow (RNN/LSTM and Word Embeddings)

🗂️ Dataset

Sentiment140 dataset with 1.6 million tweets

📜 Tasks

  • ✔️ Pre-process Tweets;
  • ✔️ Use Azure Text Analytics - Sentiment Analysis;
  • ✔️ Use AMLS Designer, Logistic Regression model, and 2 text vectorizations;
  • ✔️ Use Tensorflow / Keras with RNN / LTSM and word embeddings (from scratch and pre-trained);
  • ✔️ Compare all approaches and evaluate performance (AUC, Accuracy);
  • ✔️ Deploy the best model for real-time inferencing and publish endpoint;
  • ✔️ Write a blog article.

💻 Dependencies

NLTK, Spacy, WordCloud, Azure ML/AI SDK, scikit-learn, Azure Portal, Azure Machine Learning Studio, Tensorflow/Keras, Google Colab (with GPU), Tensorboard, pretrained word embeddings (Word2Vec, GloVE, USE)

📌 References

Further readings