Real dataset that the group members find interesting

Assignment Help Other Subject
Reference no: EM133559546

Overview

For this assignment, students will work in groups of 2 or 3. Each group needs to choose a real dataset that the group members find interesting, in the sense that they believe it contains data which can provide useful information if explored. Students then need to implement, via the R programming language,different techniques that we have covered in this unit to try to find the best way to answer their questions about the dataset and extract the useful information.

There are numerous datasets available online, and a great repository with datasets is the UCI Machine Learning repository:

You are free, however, to choose any data set you prefer, the conditions being that

1. The dataset must be freely available online so that I can download it and perform the analysis myself.
2. Students must each choose unique projects - this generally means different datasets entirely.

If you decide to work in a group of 3, you need to work on 2 datasets.

If you have another preferred source of data then you may request to use that instead and I'll have a look. I can also propose other datasets, if students need additional choices. Having decided on a dataset you should then post up your plans on the discussion forum for other students to view and comment. This discussion is assessed.

Your results, after using on the dataset the techniques you have learned in this unit, should then be described and explained to the reader. The report does not require lengthy text sections and

much of the content may contain the results, the analysis of the results and/or graphs or plots as required.

In conjunction with the submission of the report, students will also present an overview of the findings, as explained below.

Deliverables:

1. Online Discussion forum: Post your proposed topic and chosen dataset as well as a short plan for the project. Explain if it falls into the supervised or unsupervised learning category and if it is a regression or classification problem. The above is required for approval of the topic. As discussed, students must select unique topics, therefore if any assignments overlap they will not be accepted. This should be done by the end of week 10. Also any queries about the assignment deliverables should be made in the discussion forum so that other students can also benefit from the responses.

2. Oral Presentation: You will be required to present a brief (10) minute executive summary of your project in class. This is a mandatory component of the assignment.

3. Data Mining technical report:The marks for the report section are split into three areas:
a. Data understanding and preparation
b. Algorithms/techniques chosen andimplemented in the R programming language for data analysis
c. Presentation,discussion and quality of the results - explanation of interesting patterns found

Attachment:- Foundations of Data Science.rar

Reference no: EM133559546

Questions Cloud

What trend in america changing population : What trend in America's changing population do you think has had the biggest influence on the nation's politics over time?
Define sustainability in the business community : Define Sustainability in the Business community. Analyse the argument that the extraction of crude oil from oil sands in Canada is of benefit to the community,
Examine the phases of the submission process for your state : Examine the phases of the submission process for your current or home state. Assess the revenue sources for your selected state agency or program.
Describe how you or your organization incorporated : Describe how you or your organization incorporated a "value-creating" strategy at work to overcome any odds and move the organization forward.
Real dataset that the group members find interesting : ICT515 Foundations of Data Science, Murdoch University - Find interesting, in the sense that they believe it contains data
What would you propose they do moving forward : what would you propose they do moving forward? Why do you feel that taking this path moving forward serves in the best interest of the sport of pickleball
Rates undergo lognormal random walk with volatility : Using the binomial model (which assumes that one-year rates undergo a lognormal random walk with volatility s), show that if s is assumed to be 15%,
Description of organizations purpose-function and goals : Your team's first step will be to decide on a for-profit or nonprofit organization you would like to work with. What are some of the industries, products.
What ways can cancon requirements be circumvented : What does CanCon refer to? In what ways can CanCon requirements be circumvented? What was the last work of CanCon you consumed on television or radio?

Reviews

Write a Review

Other Subject Questions & Answers

  Cross-cultural opportunities and conflicts in canada

Short Paper on Cross-cultural Opportunities and Conflicts in Canada.

  Sociology theory questions

Sociology are very fundamental in nature. Role strain and role constraint speak about the duties and responsibilities of the roles of people in society or in a group. A short theory about Darwin and Moths is also answered.

  A book review on unfaithful angels

This review will help the reader understand the social work profession through different concepts giving the glimpse of why the social work profession might have drifted away from its original purpose of serving the poor.

  Disorder paper: schizophrenia

Schizophrenia does not really have just one single cause. It is a possibility that this disorder could be inherited but not all doctors are sure.

  Individual assignment: two models handout and rubric

Individual Assignment : Two Models Handout and Rubric,    This paper will allow you to understand and evaluate two vastly different organizational models and to effectively communicate their differences.

  Developing strategic intent for toyota

The following report includes the description about the organization, its strategies, industry analysis in which it operates and its position in the industry.

  Gasoline powered passenger vehicles

In this study, we examine how gasoline price volatility and income of the consumers impacts consumer's demand for gasoline.

  An aspect of poverty in canada

Economics thesis undergrad 4th year paper to write. it should be about 22 pages in length, literature review, economic analysis and then data or cost benefit analysis.

  Ngn customer satisfaction qos indicator for 3g services

The paper aims to highlight the global trends in countries and regions where 3G has already been introduced and propose an implementation plan to the telecom operators of developing countries.

  Prepare a power point presentation

Prepare the power point presentation for the case: Santa Fe Independent School District

  Information literacy is important in this environment

Information literacy is critically important in this contemporary environment

  Associative property of multiplication

Write a definition for associative property of multiplication.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd