BUS 210 Business Analytics Project: Data Set Creation and Processing (You Have to Use StatTools)
BUS 210 Optional Business Analytics Project Requirements
This optional project is worth up to 50 points. The score will be based on
completion of the project, how interesting the data set is, and how well the report is written.
Submission to the Blackboard assignment is due on the last day of class.
You will submit 3 files:
1. CSV file containing the data set in its original form
a. Name file LastName FirstName Original Data Set
2. MS Excel file containing your processed clean data set and data dictionary
a. Name file LastName FirstName Processed Data Set
3. MS Word file of your professional report
a. Name file LastName FirstName Consulting Report
Data Set Creation and Processing:
1. Find a data set that is interesting to you.
2. Minimum 1,000 records (observations, examples).
3. Minimum 15 variables (attributes, features).
4. It is best not to use a data set that contains extensive missing data.
5. Save the data set as a .CSV file; name the file as described above.
6. Save the data set as an MS Excel workbook; name the file as described above.
7. In the MS Excel workbook, name the worksheet containing the data "Data".
8. In the MS Excel workbook, create a new worksheet and name it "Data Dictionary".
In the Data Dictionary worksheet:
a. Create three labeled columns: Variable Name, Data Type, Explanation of Variable.
b. List your variables and complete the Data Dictionary worksheet.
9. Create a correlation matrix of your data set.
10. Create a set of scatter plots of interesting variables of your choice.
11. Create a set of PivotTables of interesting variables of your choice.
12. Create a set of bar charts of interesting variables of your choice.
13. Create a set of histograms of interesting variables of your choice.
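The project itself must be completed in MS Excel with StatTools, but a short script can help screen a candidate data set against requirements 2 to 4 before you commit to it, and can cross-check one cell of the correlation matrix. The following is a minimal sketch in plain Python, assuming a CSV file with a header row. The function names `profile_rows` and `pearson` and the sample values are hypothetical and serve only as an illustration; this is not a substitute for the StatTools output.

```python
import csv
import io
import math

MIN_RECORDS = 1000    # requirement 2 in the list above
MIN_VARIABLES = 15    # requirement 3 in the list above


def profile_rows(rows):
    """Summarize a list of dict rows (as produced by csv.DictReader)."""
    variables = list(rows[0].keys()) if rows else []
    total_cells = len(rows) * len(variables)
    # Only blank cells and None count as missing here; codes such as "NA"
    # would need to be handled separately.
    missing = sum(
        1
        for row in rows
        for v in variables
        if row[v] is None or row[v].strip() == ""
    )
    return {
        "records": len(rows),
        "variables": len(variables),
        "missing_share": missing / total_cells if total_cells else 0.0,
        "meets_minimums": len(rows) >= MIN_RECORDS
        and len(variables) >= MIN_VARIABLES,
    }


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric lists.

    Raises ZeroDivisionError if either list is constant.
    """
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Tiny made-up sample, far below the required size, used only to show the calls.
sample_csv = "price,units,region\n10,100,East\n12,90,West\n15,,East\n20,40,West\n"
rows = list(csv.DictReader(io.StringIO(sample_csv)))
summary = profile_rows(rows)
print(summary)

# Correlate two numeric columns, using only rows where both values are present.
complete = [r for r in rows if r["price"] and r["units"]]
r_price_units = pearson(
    [float(r["price"]) for r in complete],
    [float(r["units"]) for r in complete],
)
print(round(r_price_units, 3))
```

To screen a real file, replace `io.StringIO(sample_csv)` with an open file handle, for example `open(path, newline="")`, where `path` points to your downloaded CSV.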
Possible Data Set Repositories:
http://archive.ics.uci.edu/ml/index.php
https://www.kdnuggets.com/datasets/index.html
https://toolbox.google.com/datasetsearch
https://www.kaggle.com/datasets
https://datamarket.com/data/list/?q=provider:tsdl
Paper Outline:
Put yourself into the following context. You are an outside Business
Analytics consultant hired by a company whose goal is to improve its current quantitative
analysis processes. The company provides you with a basic data set that needs cleaning up:
preprocessing, filtering out examples with missing attributes, replacing attributes, and,
when applicable, randomly selecting 1,000 examples from a larger data set.
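Two of the cleanup steps named here, filtering out examples with missing attributes and randomly selecting 1,000 examples, can be sketched in a few lines of plain Python. This is an illustration only, assuming rows are held as dictionaries; the helper names `drop_incomplete` and `sample_examples`, the seed value, and the made-up data are hypothetical, and the graded work still has to be done in MS Excel and StatTools.

```python
import random


def drop_incomplete(rows):
    """Keep only rows in which every value is present and non-blank."""
    return [
        row
        for row in rows
        if all(v is not None and str(v).strip() != "" for v in row.values())
    ]


def sample_examples(rows, k=1000, seed=210):
    """Randomly select k rows without replacement.

    Returns all rows unchanged if there are k or fewer.
    """
    if len(rows) <= k:
        return list(rows)
    rng = random.Random(seed)  # fixed seed so the selection is reproducible
    return rng.sample(rows, k)


# Made-up data: 1,500 rows, every 10th row has a blank "income" value.
raw = [
    {"id": str(i), "income": "" if i % 10 == 0 else str(30000 + i)}
    for i in range(1500)
]
clean = drop_incomplete(raw)
selected = sample_examples(clean)
print(len(raw), len(clean), len(selected))  # prints: 1500 1350 1000
```

Filtering before sampling ensures that the 1,000 selected examples are all complete. Recording the seed in the Data Preparation section of the report lets a reader reproduce the same selection.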
You are to write up a professional report discussing your findings.
Thus, you will have ten parts:
1. Title page
2. Introduction, company information, consulting firm information. Your story, your interest.
3. Data Understanding, what the data set represents and what it is used for.
4. Data Preparation, how the data set was cleaned up, file preparation, missing data removal.
5. Discussion of data dictionary. Include URL of data location.
6. Discussion of correlation matrix, explain any interesting correlations.
7. Discussion of scatter plots, and why you selected the specific variables to use.
8. Discussion of PivotTables, and why you selected the specific variables to use.
9. Discussion of bar charts, and why you selected the specific variables to use.
10. Discussion of histograms, and why you selected the specific variables to use.
Use the tasks listed above as headings for each section of your report. The final report should be
at least 1,000 words. YOU ARE TO USE 1.15 LINE SPACING FOR YOUR MS WORD
DOCUMENT.
Your final grade will depend on:
1. The quality of your writing.
2. The content quality of your report.
3. Data set quality and clarity of the data dictionary.
4. Where you obtained your data set for your analysis.