Advanced Excel - Class Project Conduct a typical big data analytics research project, evaluate the results, and write an academic research paper. Decide on two things: 1) A topic in which you would...

1 answer below »


Advanced Excel - Class Project


Conduct a typical big data analytics research project, evaluate the results, and write an academic research paper.


Decide on two things:


1) A topic in which you would like to conduct research and analyze the results


2) Where you can obtain this data


Once you have identified your problem area and gathered your data, apply at least 2 analytics models to the dataset. You may choose multiple ones if it helps for your analysis. These may be any from the course content. During the evaluation, identify key features of your dataset to help determine which parts have significance.


Paper should be in academic form in APA format. Include the following sections, have a minimum of 4pages although some conferences/workshops/journals may be longer. All analysis dataset and data analysis files should be in excel (in xlxs please)


I:Abstract(quick summary of problem and outcome)


II:Introduction/ Problem Statement(what are you analyzing and for what motivation)


III:Methodology(Your experiment set up: What is your data, what algorithms, how did you analyze the data, how did you run the tests)


IV:Results and Discussion(tables and discussion of evaluation metrics and the successes or


failures of the algorithms)


V:Conclusion




Project Proposalneeded earlier (Due 7/22 at 1:00pm)- Problem statement and where you plan on getting the data (1 page)





Ideas of analytics models? Feel free to add or use others


Linear Regression Scatter Plot, Data Analysis Regression, The Paired Two-Sample Test, Lead-Time, Sampling, Benford Law Case, Descriptive Statistics, Excel Logic Formulas, Scenario Manager Practice, Spin Button, Breakeven Decision Model, Advanced Excel Pivot Tables, Data Analysis ToolPak

Answered 1 days AfterJul 20, 2022

Answer To: Advanced Excel - Class Project Conduct a typical big data analytics research project, evaluate the...

Komalavalli answered on Jul 22 2022
86 Votes
Abstract:
Demand for analysing data among the superstore owner is increasing to get insights of product , segment wise sales in order to strength their existence in the market. This project is to gain insights about the super store data by employing analysis of pivot table, regression analysis.
It is found that both East and West region contributes more than 50% of the profit .In order to strength the super store existence in central home office product needs some changes. Regression model indicates that there is more space to add more feature variable should be included to get proper prediction on profit and sales
Problem statement:
With rising market demands and fierce competition, a Superstore Giant seeks expertise in determining what works best for them. They want to know which items, geographies, categories, and consumer groups to target or avoid.
Methodology:
About data set: Superstore data set was obtained from the website Kaggle(https://www.kaggle.com/datasets/vivek468/superstore-dataset-final). Total number of observation in this dataset is 994 obs.
Description of variables :
Row ID = Unique ID for each row.
Order ID = Unique Order ID for each Customer.
Order Date = Order Date of the product.
Ship Date = Shipping Date of the Product.
Ship Mode= Shipping Mode specified by the Customer.
Customer ID = Unique ID to identify each Customer.
Customer Name = Name of the Customer.
Segment = The segment where the Customer belongs.
Country = Country of residence of the Customer.
City = City of residence of of the Customer.
State = State of residence of the Customer.
Postal Code = Postal Code of every Customer.
Region = Region where the Customer belong.
Product ID = Unique ID of the Product.
Category = Category of the product ordered.
Sub-Category = Sub-Category of the product ordered.
Product Name = Name of the Product
Sales = Sales of the Product.
Quantity = Quantity of the Product.
Discount =Discount provided.
Profit = Profit/Loss incurred.
Process of data wrangling:
This process involves the cleaning of dataset before conducting any analysis.
Formatted the columns sales and profit from general to number format. Then deleted the columns Row ID, order ID, order date, ship date, customer ID, Customer Name, Country, city, state, postal code, region, sub category of the product .Next, I checked for blank or empty cells of each column using Count if blank function. The dataset has no blank cells. So I proceeded to Exploratory Data analysis
Pivot table creation:
Created Pivot table for Region wise total sales and profit, Bar chart was created for analysing which region has highest percentage contribution in profit
Doughnut chart was created for each region to analysis which product segment has highest contribution towards profit.
Correlation matrix
This matrix was used to find the strength of relationship between the variables sales, profit and discount.
Regression Models
Logistic Linear...
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here