CIS8008 Business Intelligence Assignment

Reference no: EM132483447

CIS8008 - Business Intelligence - University of Southern Queensland

Part 1. Demonstrate applied knowledge of people, markets, finances, technology and management in a global context of business intelligence practice (data warehouse design, data mining process, data visualisation and performance management) and the resulting organisational change, and show how these apply to the implementation of business intelligence in organisational systems and business processes.

Part 2. Identify and solve complex organisational problems creatively and practically through the use of business intelligence, and critically reflect on how evidence-based decision making and sustainable business performance management can effectively address real-world problems.

Part 3. Demonstrate the ability to communicate effectively, clearly and concisely, in a written report style suitable for senior management, with correct and appropriate acknowledgment of the main ideas presented and discussed.

Assignment Task 1 Data Warehouse Concepts (Worth 39 Marks)
Drawing on relevant and current literature on data warehouses, write a short essay on data warehousing that addresses three sub-tasks:

Task 1.1) Provide a concise definition of a data warehouse, then identify and describe two ways in which a data warehouse differs from a transactional database (10 marks, about 250 words).

Task 1.2) Identify and describe the three main types of data warehouse.

Task 1.3) Define the concept of a data lake and discuss two advantages and two disadvantages of a data lake compared with a data warehouse.

Assignment Task 2 Exploratory Data Analysis and Linear Regression Analysis

Carefully study the Data Dictionary for Boston Housing Data Set (See Table 1) and accompanying description of each variable. It is important to understand this data set as it is used for Task 2 and Task 3 in Assignment 2. Each record in the housing.csv data set describes a Boston suburb or town. The data was drawn from the Boston Standard Metropolitan Statistical Area (SMSA) in 1970.

Assignment Task 2.1) Conduct and report on an exploratory data analysis (EDA) of the housing.csv data set using the RapidMiner Studio data mining tool. Note that this will require the use of a number of RapidMiner operators.

Provide the following for Task 2.1:

(i) A screen capture of your final EDA process, with a brief description of that process.

(ii) Summarise the key results of your exploratory data analysis in Table 2.1, Results of Exploratory Data Analysis for housing.csv. Table 2.1 should include the key characteristics of each variable in the housing.csv data set, such as maximum and minimum values, average, standard deviation, most frequent values (mode), missing values and invalid values.

(iii) Discuss the key results of the exploratory data analysis presented in Table 2.1 and provide a rationale for selecting the top 5 variables for predicting median house value (medv). Focus in particular on the relationships of the independent variables with each other and with the dependent variable medv, drawing on the results of your EDA and relevant literature on the determinants of house prices.

Hint: The Statistics Tab and Chart Tab in RapidMiner Studio provide a lot of descriptive statistical information and the ability to create useful charts such as bar charts, scatterplots and boxplots for EDA. You might also like to run correlations and/or chi-square tests as appropriate to determine which variables contribute most to predicting median house value (medv).
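The assignment requires RapidMiner Studio for this analysis, so the following is only a minimal Python sketch of the kind of figures Table 2.1 asks for (minimum, maximum, mean, standard deviation, mode, missing values) plus a Pearson correlation of each variable with medv. The sample rows are made-up illustrative values, not real housing.csv data, and the helper names (load_columns, summarise, pearson) are hypothetical. It assumes Python 3.8 or later.

```python
import csv
import io
import math
import statistics

# Illustrative rows only (NOT real housing.csv values). Column names follow
# the variables named in the assignment text; the blank crim cell is a
# deliberately missing value.
SAMPLE_CSV = """crim,age,ptratio,medv
0.1,65.0,15.0,24.0
0.3,78.0,17.0,21.0
,61.0,17.0,34.0
3.2,95.0,20.0,13.0
0.3,45.0,18.0,28.0
8.9,100.0,21.0,10.0
"""


def load_columns(text):
    """Parse CSV text into {column: [float or None]}; None marks a missing value."""
    reader = csv.DictReader(io.StringIO(text))
    columns = {name: [] for name in reader.fieldnames}
    for row in reader:
        for name, raw in row.items():
            columns[name].append(float(raw) if raw.strip() else None)
    return columns


def summarise(values):
    """Descriptive statistics for one column, ignoring missing values."""
    present = [v for v in values if v is not None]
    return {
        "min": min(present),
        "max": max(present),
        "mean": statistics.mean(present),
        "stdev": statistics.stdev(present),  # sample standard deviation
        "mode": statistics.mode(present),    # first mode if there are ties
        "missing": len(values) - len(present),
    }


def pearson(xs, ys):
    """Pearson correlation over rows where both values are present."""
    pairs = [(x, y) for x, y in zip(xs, ys) if x is not None and y is not None]
    n = len(pairs)
    mx = sum(x for x, _ in pairs) / n
    my = sum(y for _, y in pairs) / n
    sxy = sum((x - mx) * (y - my) for x, y in pairs)
    sxx = sum((x - mx) ** 2 for x, _ in pairs)
    syy = sum((y - my) ** 2 for _, y in pairs)
    return sxy / math.sqrt(sxx * syy)


columns = load_columns(SAMPLE_CSV)
summary = {name: summarise(vals) for name, vals in columns.items()}
correlations = {
    name: pearson(vals, columns["medv"])
    for name, vals in columns.items()
    if name != "medv"
}
# Rank candidate predictors by the strength (absolute value) of correlation.
ranked = sorted(correlations, key=lambda n: abs(correlations[n]), reverse=True)

for name in ranked:
    print(f"{name}: r = {correlations[name]:+.2f}, "
          f"missing = {summary[name]['missing']}")
```

Ranking by absolute correlation is only one possible screening rule; correlations between the independent variables themselves also matter when choosing the top 5.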

Assignment Task 2.2) Build and report on a Linear Regression model for predicting medv using a RapidMiner data mining process with an appropriate set of data mining operators and a reduced set of variables from the housing.csv data set, as determined by your exploratory data analysis in Task 2.1.

Provide the following for Task 2.2:

(i) A screen capture of your Final Linear Regression Model process, with a brief description of that process.

(ii) Table 2.2, named Results of Final Linear Regression Model for Task 2.2 for the housing.csv data set.

(iii) Discuss the results of the Final Linear Regression Model for the housing.csv data set, drawing on key outputs (coefficients, standardised coefficients, t-statistics, p-values, significance levels etc.) for predicting median house value (medv) and on relevant supporting literature on the interpretation of a Linear Regression Model.
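To make the regression outputs named above more concrete, here is a minimal Python sketch of ordinary least squares with a single predictor, showing how the coefficient, its standard error, the t-statistic, the standardised coefficient and R-squared relate to each other. It is not the RapidMiner process the task requires: the function name simple_ols is hypothetical, the age and medv values are made up for illustration, and the Task 2.2 model would use several predictors rather than one.

```python
import math


def simple_ols(xs, ys):
    """Ordinary least squares for y = intercept + slope * x (one predictor)."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    syy = sum((y - my) ** 2 for y in ys)

    slope = sxy / sxx
    intercept = my - slope * mx

    # Residual sum of squares and residual variance (n - 2 degrees of freedom).
    sse = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    residual_variance = sse / (n - 2)

    se_slope = math.sqrt(residual_variance / sxx)
    return {
        "intercept": intercept,
        "slope": slope,
        "se_slope": se_slope,
        # t-statistic for H0: slope = 0; its p-value comes from a
        # t distribution with n - 2 degrees of freedom (not computed here).
        "t_slope": slope / se_slope,
        # Standardised coefficient: slope rescaled by sd(x) / sd(y).
        "std_coef": slope * math.sqrt(sxx / syy),
        "r_squared": 1.0 - sse / syy,
    }


# Made-up illustrative values, not taken from housing.csv.
age = [45.0, 61.0, 65.0, 78.0, 95.0, 100.0]
medv = [28.0, 34.0, 24.0, 21.0, 13.0, 10.0]

fit = simple_ols(age, medv)
for key, value in fit.items():
    print(f"{key}: {value:.4f}")
```

With one predictor the standardised coefficient equals the Pearson correlation, so its square equals R-squared. A larger absolute t-statistic means the coefficient is estimated more precisely relative to its size, which is what the p-value and significance level summarise.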

Include all appropriate outputs, such as RapidMiner processes, graphs and tables, that support key aspects of your exploratory data analysis and linear regression model analysis of the housing.csv data set in your Assignment 2 report.

Task 3 Tableau Desktop View of housing.csv Data Set

After connecting to the housing.csv data set in Tableau Desktop, consider binning variables such as age, crim (crime rate) and ptratio (pupil to teacher ratio) to create categorical variables.

Task 3.1) Create a Tableau Text Table or Graph view that displays median house values by age of houses and other relevant data, using the housing.csv data set. Comment on (1) the process of preparing a Text Table or Graph view using Tableau Desktop and (2) the key trends and patterns that are apparent in the Tableau view you have created (8 marks, about 50 words).

Task 3.2) Create a Tableau Text Table or Graph view that displays median house values and the potential impact of crime rate and other relevant data, using the housing.csv data set.

Comment on (1) the process of preparing a Text Table or Graph view using Tableau Desktop and (2) the key trends and patterns that are apparent in the Tableau view you have created.
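The binning step suggested for Task 3 is done inside Tableau Desktop, but the idea can be sketched in Python: a continuous variable such as age is cut into fixed-width bins, and the median house value is then computed per bin. The bin width of 25, the function names and the sample records below are all assumptions chosen for illustration, not values from housing.csv.

```python
import statistics
from collections import defaultdict


def bin_label(value, width):
    """Return a label such as '50-75' for the fixed-width bin holding value."""
    lower = int(value // width) * width
    return f"{lower}-{lower + width}"


def median_by_bin(records, key, target, width):
    """Group records into bins of `key` and take the median of `target`."""
    groups = defaultdict(list)
    for record in records:
        groups[bin_label(record[key], width)].append(record[target])
    return {label: statistics.median(vals) for label, vals in groups.items()}


# Made-up illustrative records, not taken from housing.csv.
records = [
    {"age": 45.0, "medv": 28.0},
    {"age": 61.0, "medv": 34.0},
    {"age": 65.0, "medv": 24.0},
    {"age": 78.0, "medv": 21.0},
    {"age": 95.0, "medv": 13.0},
    {"age": 100.0, "medv": 10.0},
]

medians = median_by_bin(records, key="age", target="medv", width=25)
for label, value in medians.items():
    print(f"age {label}: median medv = {value}")
```

Each bin includes its lower edge and excludes its upper edge, so an age of exactly 100 falls into a separate 100-125 bin. The same function could be applied to crim or ptratio with a width suited to that variable's range.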

Attachment:- Business Intelligence.rar
