Analyse the data set using the methods

Assignment Help Applied Statistics
Reference no: EM133552316

Predictive Analytics

For this assignment, each group has been given a different data set. Your task is to analyse the data set using the methods you've learnt in this unit, you're more than welcome to use methods you learnt in other units in this assignment.

Data Preprocessing

This is worth 6, 10, or 14 marks depending on the difficulty of the data set. This task include, but not limited to:
• Simple statistical evaluation of the data set;
• Identify the target variable(s);
• Identify if this is a regression or a classification task;
• Are there any missing data? What can you do about them?
• Which variables are relevant or irrelevant to the task?

• What type of validation/cross-validation procedure will you use?
• Data visualisation.

Methods to use
For the analysis, you need to analyse the data using at least three different methods. You'll need to use
• Neural Networks, and
• Support Vector Machine
as two of the methods. Do not use linear and logistic regression other than as a compar- ison method. The other methods taught in this subject which you can use are:
• Ridge Lasso regression,
• Lasso regression, and
• Naive Bayes
You can substitute these methods with k Nearest Neighbours, Decision Trees, Random Forest, or other machine learning methods.
For each method, write a brief description of your steps to create the model and your prediction. What did you do? Your description should include, but not limit to, answers to the following questions:
• What is the accuracy of your model?
• Is the model a good model? Why or why not?
• Any particular choices you made or had to make in creating model or prediction? Why did you make them?

Attachment:- Predictive Analytics.rar

Reference no: EM133552316

Questions Cloud

What implications does todays session have for your practice : What implications does today's session have for your instructional practices? Your responses here should be specific rather than general.
What bat-and-ball games from other countries influenced : I would use this in my classroom to have students investigate what bat-and-ball games from other countries influenced the creation of baseball.
What values of c would a monopolist that sells its output : What values of c would a monopolist that sells its output and cannot commit to prices choose the nondurable product? what values of 6 would a monopolist
Define what is keynesian economics and what is neoliberalism : Define what is Keynesian Economics and what is Neoliberalism. What is their main difference regarding the role of government in the market place/the economy
Analyse the data set using the methods : COMP 7023 Predictive Analytics, Western Sydney University - analyse the data set using the methods you've learnt in this unit
Why does gnp remain unchanged in the long run : Why does GNP remain unchanged in the long run when the central bank undertakes expansionary monetary policy under floating exchange rates?
What investment in human capital will you personally need : What are you going to produce? Why did you choose this to produce? Give a little background on why you chose this item to produce. Does it have special meaning
Analyze the net present value and internal rate of return : analyze the net present value (NPV) and internal rate of return (IRR) of different medical specialties as well as barriers to entry and how these factors
How the labor is divided up and what made division possible : how the labor is divided up and what made the division possible: isit the need for specialization or is it technological innovation?

Reviews

Write a Review

Applied Statistics Questions & Answers

  What need to do Run frequencies and regressions

Look at the codes given to use by our professor. Make sure that they are correct so we can do what we need to do e.g. Run frequencies, regressions, etc

  Create a bar chart to show the frequencies

Use these 5 categories and create a relative and cumulative frequency chart for all the GPAs in the dataset. You can do this by hand or you can use Excel. Create a bar chart to show the frequencies of each letter grade

  It lsquos has been claimed that if you drop a standard

it lsquos has been claimed that if you drop a standard thumbtack from a height of 3 feet or more onto a flat hard

  Let x be the sample mean of a random sample of size n

Let X be the sample mean of a random sample of size n from a uniform distributionon the interval [0, b], and let bb = 2X be an estimator of the upper endpoint b.(a) Demonstrate that bb is an unbiased estimator of b, and find the variance of th..

  Describe the bank customer waiting times

What does the histogram in Figure 2.16 (page 53) say about whether the Empirical Rule should be used to describe the bank customer waiting times?

  Liquid products were first obtained from coal

1). Liquid products were first obtained from coal in England during the 1700s. Lamp oil was produced from coal in the United States as early as 1850, but the domestic coal chemicals industry did not develop until World War I. A modern coal - for - re..

  Evaluation of online shopping and its branches

EVALUATION OF ONLINE SHOPPING & ITS BRANCHES - Data collection and Report on Methods of Analysis of Data - Analyse the data and write a report

  Calculate descriptive statistics for the price of old cars

MAT10251 STATISTICAL ANALYSIS PROJECT Southern Cross University-Australia-Construct a frequency histogram or polygon for the price of two and three year old car

  Calculate mortgage rates and production lot size

Calculate mortgage rates, production lot size, forecast the demand for the products, track epidemics and their spread, help solve crimes, even help estimate tax revenues.

  What was the most important attribute the client should take

What are other analytics that could be done for this client that would add insight - What was the most important attribute the client should take away from this analysis and why

  Quantitative analysis for decision making assignment

Graphical Solutions in Linear Programming have limited number of decision variables. What is the maximum number of decision variables used in graphical solutions?

  Worst performing asset

Historical data for Emfs for the past five years - Worst performing asset and Best diversification benefits

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd