Reference no: EM132327506
Decision Management Systems
Assignment – Exploratory Data Analysis (EDA) using Cognos Analytics
Activities
1. Select from the datasets provided (or ones designated by your instructor). Provide a brief description of the dataset to include the number of cases, description of the inputs, description of the variables that could be used to develop predictive models, etc.
2. Examine the dataset and eliminate mistakes, bad records, data entry errors, and outliers.
Using Cognos Analytics:
3. Explore the dataset, including:
a) Pose an initial set of questions to use for data exploration. Provide any insights gained from using Cognos Analytics with this dataset.
For example, are there outliers? Missing variables for some cases? Irrelevant variables? If so, what do you recommend as solutions? How does Cognos Analytics provide assistance? Develop new specific questions which provide additional insights into and answer specific questions from the dataset. Discuss how these insights could be useful. Discuss how you would improve the relevancy.
b) Develop and explain at least five different visualizations. Experiment with the available options and summarize the results. Provide insights to what the visualizations show.
c) Utilize features (e.g., filters, comparisons) with the visualizations to uncover and explain interesting aspects of the data set.
d) Create and explain at least one insightful calculation. Discuss why this would be useful.
Content (note that the document must have clearly marked sections for the items listed below)
1) Title page (1 page limit): course number and term, assignment number and project title, student name and contact information, instructor’s name. Format it so it looks pleasant and presentable. Follow formatting guidelines above.
2) Introduction. Provide a brief outline of the dataset you are using for this assignment. Briefly describe the content of the data. Include a screenshot of the data (not all, but partial as far as all relevant variables are visible).
3) Data exploration process. Explain and discuss what data exploration you performed (e.g., questions generated about the data set content). Include any specific ideas or suggestions as to how this could be used in your organization.
4) Visualizations created. Explain the visualizations created. Include the value-added aspects of the visualizations. Include creative aspects for increasing potential for higher assignment grade.
5) Calculation which adds insights or value to the data set. Include and explain the value of the calculation, i.e., insights provided by the calculation.
6) References (1 page limit): List all references in APA format used in preparing this report. It is strongly recommended to use outside knowledge in setting-up the analysis or discussing the results where possible.
7) Appendix (4 page limit):
a) Appendix A: Include any appropriate workbooks and/or screenshots (figures, tables, diagrams) used in this assignment. Make sure all tables, figures, or diagrams are properly numbered and titled. For example, “Table 1. Model Results”. Make sure all tables or figures or diagrams are easily readable and visually presentable.
a. Introduction
Is the dataset fully described and outlined? Is the dataset a robust (i.e., lots of cases and inputs) selection? Is the intent of the assignment discussed at an appropriate level of detail? Are any initial insights provided?
b. Dataset cleansing
Is the dataset fully described and cleansed of outliers (as appropriate), mistakes and erroneous entries? Is rationale provided for any cleansing to the data set?
c. Data exploration and calculations
a) Are questions about the data set discussed and their relevance analyzed? Are insights from the questions provided?
b) Was at least one calculation for the data set performed? How meaningful was the calculation? Did the calculation provide insights and/or clarity to the data set?
a. Visualizations
a) How well are at least five insightful visualizations developed? How effectively were filters used? How appropriate were the comparison visualizations? What insights were included?
a. a) How well are visualizations explained and interpreted? Are the visualizations appropriate and varied? Are insights provided that would help understand the important aspects of the data set? Are creative aspects included in the visualizations?
b) How creative were the visualizations? Did they provide interesting insights and conclusions about the data set?
b. Mechanics (spelling, grammar)
Is the paper free of grammatical errors and spelling and punctuation? Is the paper properly formatted?
c. Citations and References
Are all references and citations correctly written and presented?
Attachment:- Dataset- House Price.rar