During your second Individual Project (IP), you will utilize your Python environment to derive structure from unstructured data. You will utilize the data set "Airline Sentiment" from Kaggle open data...

1 answer below »

During your second Individual Project (IP), you will utilize your Python environment to derive structure from unstructured data. You will utilize the data set "Airline Sentiment" from Kaggle open data sets located athttps://www.kaggle.com/welkin10/airline-sentiment.


Using this data set, you will create a text analytics Python application that extracts themes from each comment using term frequency–inverse document frequency (TF–IDF) or simple word counts. For the deliverable, provide your Python file and a .csv with your results added as a column to the original data set.



Answered Same DaySep 06, 2022

Answer To: During your second Individual Project (IP), you will utilize your Python environment to derive...

Aditi answered on Sep 06 2022
77 Votes
SOLUTION
import pandas as pd
from sklearn.feature_extraction.text import TfidfVectorizer
pd.set_o
ption('display.max_columns', 20)
df_full = pd.read_csv("Tweets.csv") #read csv into pandas dataframe
df_text = df_full.text #Select the tweet text column for text analysis
#Running tfidf on the...
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here