
Covid19-Sentiment-Analysis of Unlabelled Tweets

Introduction

Sentiment analysis is the process of determining whether a piece of writing is positive, negative, or neutral. It is a Natural Language Processing technique. Natural language processing (NLP) is a branch of linguistics, computer science, and artificial intelligence that studies how computers and human language interact, with a focus on how to train computers to process and analyse massive volumes of natural language data. The ultimate goal is a machine that can "understand" the contents of documents, including the subtleties of language used in different contexts. Once information and insights are accurately extracted from documents, the technology can classify and organise the documents themselves.
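As a toy illustration of the idea (not this project's method), sentiment can be scored by counting words from small positive and negative word lists; the lexicons below are hypothetical examples:

```python
# Minimal lexicon-based sentiment sketch (illustrative only; these
# word lists are made-up examples, not the project's actual lexicon).
POSITIVE = {"good", "great", "safe", "recovered", "hope"}
NEGATIVE = {"bad", "sick", "death", "fear", "lockdown"}

def sentiment(text: str) -> str:
    # Lowercase, split on whitespace, and strip trailing punctuation.
    words = [w.strip(".,!?") for w in text.lower().split()]
    score = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"

print(sentiment("Glad my family recovered, there is hope"))  # positive
print(sentiment("So much fear during this lockdown"))        # negative
```

Real systems replace the hand-written lists with learned models, which is what this project does below.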

Getting Started

Open a terminal in the folder where you want the project. Then execute the following commands one at a time.

  git clone https://github.com/rohit-khoiwal-30/Covid19-Sentiment-Analysis.git
  cd Covid19-Sentiment-Analysis
  virtualenv env
  env\Scripts\activate   # Windows; on macOS/Linux use: source env/bin/activate
  pip install -r requirements.txt
  cd server
  flask run

Then open the app folder, double-click index.html, and enjoy the app.

Problem-Statement:

Twitter hosts a sizable corpus of tweets about Covid-19. We wish to determine how many people hold positive and how many hold negative views about the COVID-19 pandemic.

Solution:

  1. We download Twitter's raw tweets onto our system. Hydrate Tweets
  2. We must clean and preprocess the tweets before using them.
  3. Then tweet features can be extracted using the TF-IDF vectorizer.
  4. Since the data is unlabelled, we must label it somehow in order to use supervised learning. Go to the notebook.
  5. After labelling, we extract features using CountVectorizer from scikit-learn.
  6. For classification, we employ a Naive Bayes classifier.
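The steps above can be sketched with scikit-learn. The unsupervised labelling step (step 4) is shown here as KMeans clustering over TF-IDF features, which is an assumption on our part; the repository's notebook defines the actual labelling procedure. The tweets are made-up examples.

```python
# Sketch of the pipeline: clean -> TF-IDF -> pseudo-label -> CountVectorizer -> Naive Bayes.
import re

from sklearn.cluster import KMeans
from sklearn.feature_extraction.text import CountVectorizer, TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB

tweets = [
    "Stay safe everyone, we will beat covid together!",
    "Hospitals are overwhelmed, this is terrifying",
    "Vaccines bring hope, great news today",
    "Another lockdown, so tired of this pandemic",
]

# Step 2: clean and preprocess (strip URLs, mentions, and non-letters).
def clean(t):
    t = re.sub(r"http\S+|@\w+", " ", t.lower())
    return re.sub(r"[^a-z\s]", " ", t)

cleaned = [clean(t) for t in tweets]

# Step 3: extract TF-IDF features.
tfidf = TfidfVectorizer(stop_words="english")
X_tfidf = tfidf.fit_transform(cleaned)

# Step 4: pseudo-label via clustering (an assumed stand-in for the notebook).
labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X_tfidf)

# Steps 5-6: bag-of-words features + Naive Bayes trained on the pseudo-labels.
X_counts = CountVectorizer(stop_words="english").fit_transform(cleaned)
clf = MultinomialNB().fit(X_counts, labels)
print(clf.predict(X_counts))
```

Training on TF-IDF for clustering but raw counts for Naive Bayes mirrors the two separate feature-extraction steps listed above.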

Conclusion:

Using an unsupervised learning technique, we labelled the data and then classified tweets based on those labels. Because we trained the model on 100,000 (1 lakh) tweets, we obtained quite decent accuracy, with only occasional misclassifications.

References:

  1. We use a dataset from a GitHub repository. Here
  2. Preprocessing and feature extraction. Here
