
BUS210 Business Analytics Project: Data Set Creation and Processing (You Have to Use StatTools)


BUS 210 Optional Business Analytics Project Requirements
This optional project has a possible point value of 50 points. The score will be based on
completion of project, how “interesting” the data set is, and how well the report is written.
Submission to a Blackboard Assignment is due the last day of class.
You will submit 3 files:
1. CSV file containing the data set in its original form
a. Name file LastName FirstName Original Data Set
2. MS Excel file containing your processed clean data set and data dictionary
a. Name file LastName FirstName Processed Data Set
3. MS Word file of your professional report
a. Name file LastName FirstName Consulting Report
Data Set Creation and Processing:
1. Find a data set that is interesting to you.
2. Minimum 1,000 records, (observations, examples).
3. Minimum 15 variables, (attributes, features).
4. It is best to not use a data set that contains extensive missing data.
5. Save the data set as a .CSV file; name the file as described above.
6. Save the data set as an MS Excel file; name the file as described above.
7. In the MS Excel workbook, name the worksheet containing the data, “Data”.
8. In the MS Excel workbook, create a worksheet and name it “Data Dictionary”. In the Data Dictionary worksheet:
a. Create three labeled columns: Variable Name, Data Type, Explanation of Variable.
b. List your variables and complete the Data Dictionary worksheet.
9. Create a correlation matrix of your data set.
10. Create a set of scatter plots of “interesting” variables of your choice.
11. Create a set of PivotTables of “interesting” variables of your choice.
12. Create a set of bar charts of “interesting” variables of your choice.
13. Create a set of histograms of “interesting” variables of your choice.
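Before building the workbook, it can help to confirm that a candidate data set meets the size minimums and to draft the three Data Dictionary columns. The following is a minimal Python sketch, offered only as an optional cross-check outside Excel and StatTools (the project itself must be done in StatTools). The sample data, the function names, and the type labels "Integer", "Decimal", and "Text" are assumptions made for illustration.

```python
import csv
import io

MIN_RECORDS = 1000    # requirement 2: minimum records
MIN_VARIABLES = 15    # requirement 3: minimum variables


def infer_type(values):
    """Guess a Data Type label from the non-missing values of one column."""
    present = [v for v in values if v.strip() != ""]
    if not present:
        return "Unknown"
    try:
        numbers = [float(v) for v in present]
    except ValueError:
        return "Text"
    return "Integer" if all(n.is_integer() for n in numbers) else "Decimal"


def summarize(csv_text):
    """Return record count, variable count, and draft Data Dictionary rows."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    names = list(rows[0].keys()) if rows else []
    dictionary = [
        {
            "Variable Name": name,
            "Data Type": infer_type([(row[name] or "") for row in rows]),
            "Explanation of Variable": "",  # to be written by hand
        }
        for name in names
    ]
    return len(rows), len(names), dictionary


# Tiny made-up sample; a real data set would be read from the saved .CSV file.
sample = "age,income,region\n34,52000.5,West\n41,,East\n29,48000,West\n"
n_records, n_variables, data_dictionary = summarize(sample)
meets_minimums = n_records >= MIN_RECORDS and n_variables >= MIN_VARIABLES
```

The inferred types are only a starting point: a numeric code such as a ZIP code would be labeled "Integer" here even though it is better described as categorical in the Data Dictionary.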
Possible Data Set Repositories:
http://archive.ics.uci.edu/ml/index.php
https://www.kdnuggets.com/datasets/index.html
https://toolbox.google.com/datasetsearch
https://www.kaggle.com/datasets
https://datamarket.com/data/list/?q=provider:tsdl
Paper Outline:
You are to “pretend” and put yourself into the following context: you are an outside Business
Analytics consultant hired by a company whose goal is to improve its current quantitative
analysis processes. The company provides you with a basic data set that needs “cleaning up” by
preprocessing: filtering out examples with missing attributes, replacing attributes, and, when
applicable, randomly selecting 1,000 examples from a larger data set.
You are to write up a professional report discussing your findings.
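The two most mechanical cleanup steps described above, filtering out examples with missing attributes and randomly selecting 1,000 examples, could be sketched in Python as follows. This is an illustrative sketch under assumptions, not the required workflow (the project calls for Excel and StatTools): the records are made up, and the function names and the fixed seed are hypothetical choices that make the selection repeatable.

```python
import random

SAMPLE_SIZE = 1000


def drop_incomplete(rows):
    """Keep only records in which every attribute has a non-blank value."""
    return [
        row for row in rows
        if all(value is not None and str(value).strip() != ""
               for value in row.values())
    ]


def sample_records(rows, size=SAMPLE_SIZE, seed=210):
    """Randomly select `size` records without replacement.

    If fewer than `size` records are available, all of them are returned.
    """
    if len(rows) <= size:
        return list(rows)
    return random.Random(seed).sample(rows, size)


# Made-up records: every 10th one is missing its "income" value.
raw = [
    {"id": i, "income": "" if i % 10 == 0 else str(40000 + i)}
    for i in range(1500)
]
clean = drop_incomplete(raw)
selected = sample_records(clean)
```

Filtering before sampling matters: sampling first and then dropping incomplete records could leave fewer than the 1,000 records the project requires.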
Thus, you will have ten parts:
1. Title page
2. Introduction, company information, consulting firm information. Your story, your interest.
3. Data Understanding, what the data set represents and what it is used for.
4. Data Preparation, how the data set was “cleaned up,” file preparation, missing data removal.
5. Discussion of data dictionary. Include URL of data location.
6. Discussion of correlation matrix, explain any “interesting” correlations.
7. Discussion of scatter plots, and why you selected the specific variables to use.
8. Discussion of PivotTables, and why you selected the specific variables to use.
9. Discussion of bar charts, and why you selected the specific variables to use.
10. Discussion of histograms, and why you selected the specific variables to use.
Use the tasks listed above as headings for each section of your report. The final report should be
at least 1,000 words. YOU ARE TO USE 1.15 LINE SPACING FOR YOUR MS WORD
DOCUMENT.
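For the correlation matrix section of the report, one way to shortlist “interesting” correlations is to rank every pair of numeric variables by the absolute value of its Pearson correlation coefficient. The Python sketch below is an optional cross-check only, since the matrix itself is to be produced with StatTools; the column names and values are made up, and the helper names are hypothetical.

```python
import math
from itertools import combinations


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric lists.

    Assumes neither list is constant (a zero variance would divide by zero).
    """
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def ranked_pairs(columns):
    """Return (name_a, name_b, r) tuples sorted by |r|, strongest first."""
    pairs = [
        (a, b, pearson(columns[a], columns[b]))
        for a, b in combinations(columns, 2)
    ]
    return sorted(pairs, key=lambda p: abs(p[2]), reverse=True)


# Made-up numeric columns for illustration only.
columns = {
    "hours": [1, 2, 3, 4, 5],
    "output": [2, 4, 6, 8, 10],
    "defects": [5, 3, 4, 1, 2],
}
ranking = ranked_pairs(columns)
```

Strong negative correlations rank as high as strong positive ones here, which is usually what is wanted when deciding which pairs deserve a scatter plot and a sentence of explanation.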
Your final grade will depend on:
1. The quality of your writing.
2. The content quality of your report.
3. Data set quality and clarity of the data dictionary.
4. Where you obtained your data set for your analysis.
