Discuss what is the data mining

Assignment Help Computer Engineering
Reference no: EM131550150

Question: MURDOCH UNIVERSITY
ICT515 Foundations of Data Science

Semester 1, 2017
ASSIGNMENT 2

Assignment Information: For this assignment, students should work in pairs.

You should submit your assignment from the ICT515 LMS site using the Assignment unit tool.

Late submissions will be penalised at the rate of 10 marks per day late or part thereof.

You must keep a copy of the final version of your assignment as submitted and be prepared to provide it on request.

The University treats plagiarism, collusion, theft of other students' work and other forms of dishonesty in assessment seriously. Any instances of dishonesty in this assessment will be forwarded immediately to the Faculty Dean. For guidelines on honesty in assessment including avoiding plagiarism, see: https://our.murdoch.edu.au/Educational-technologies/Academic-integrity/

Overview: For this assignment, students will work in pairs. Each group needs to choose a real dataset that the group members find interesting, in the sense that they believe it contains data which can provide useful information if explored. Students then need to implement, via the R programming language, different techniques that we have covered in this unit to try to find the best way to answer their questions about the dataset and extract the useful information.

There are numerous datasets available online, and a link to a good repository will been given in LMS during the semester. You are free, however, to choose any data set you prefer, the conditions being that

1. The dataset must be freely available online so that I can download it and perform the analysis myself.

2. Students must each choose unique projects - this generally means different datasets entirely.

If you have another preferred source of data then you may request to use that instead and I'll have a look. I can also propose other datasets, if students need additional choices. Having decided on a dataset you should then post up your plans on the discussion forum for other students to view and comment. This discussion is assessed.

Your results, after using on the dataset the techniques you have learned in this unit, should then be described and explained to the reader.

The report does not require lengthy text sections and much of the content may be results of analysis and/or graphs or plots as required.

In conjunction with the submission of the report, students will also present an overview of the findings, as explained below.

Deliverables: 1. Online Discussion forum: Post your proposed topic and chosen dataset as well as a short plan for the project. Explain if it falls into the supervised or unsupervised learning category and if it is a regression or classification problem. The above is required for approval of the topic. As discussed, students must select unique topics, therefore if any assignments overlap they will not be accepted. This should be done by the end of week 10. Also any queries about the assignment deliverables should be made in the discussion forum so that other students can also benefit from the responses.

2. Oral Presentation: You will be required to present a brief (10) minute executive summary of your project in class. This is a mandatory component of the assignment.

3. Data Mining technical report: The marks for the report section are split into three areas:

a. Data understanding and preparation

b. Algorithms/techniques chosen and implemented in the R programming language for data analysis

c. Presentation,discussion and quality of the results - explanation of interesting patterns found

Notes : All work must be submitted in ONE word document

No Email submissions allowed unless specific permission has been granted

Do not explain how to perform the techniques or provide instructions in your report, this is what the books are for. Instead spend your time explaining your findings.

Reference no: EM131550150

Questions Cloud

Recovering from a left hip replacement : Mrs. Jones, to your facility. Mrs. Jones, who was just transferred from the hospital, is recovering from a left hip replacement.
Methods for searching and accessing internal database : Name and describe two methods for searching and accessing internal database.
Analyze communication techniques : This exercise involves analyzing a communication technique-presentations. Daily, it is often expected that professionals be proficient with presentations.
What are the major components of a dbms : What is a DBMS? What are the major components of a DBMS?
Discuss what is the data mining : Online Discussion forum: Post your proposed topic and chosen dataset as well as a short plan for the project. Explain if it falls into the supervised.
Discus various considerations for an initial public offering : Conduct research in the University Online Library, and write a paper discussing the various considerations for initial public offering (IPO) or bond refunding.
How quality improvement was incorporated at each site : The volunteer coordinator expressed an interest in determining how quality improvement was incorporated at each site.
Quality model selected : Discuss the quality care approach used in an organization to improve patient safety and patient outcomes. Why was the quality model selected and how does it ali
What must spot rate be to eliminate arbitrage opportunities : What must the spot rate be to eliminate arbitrage opportunities? If an astute trader finds an arbitrage, what is the arbitrage profit in one year?

Reviews

Write a Review

Computer Engineering Questions & Answers

  Mathematics in computing

Binary search tree, and postorder and preorder traversal Determine the shortest path in Graph

  Ict governance

ICT is defined as the term of Information and communication technologies, it is diverse set of technical tools and resources used by the government agencies to communicate and produce, circulate, store, and manage all information.

  Implementation of memory management

Assignment covers the following eight topics and explore the implementation of memory management, processes and threads.

  Realize business and organizational data storage

Realize business and organizational data storage and fast access times are much more important than they have ever been. Compare and contrast magnetic tapes, magnetic disks, optical discs

  What is the protocol overhead

What are the advantages of using a compiled language over an interpreted one? Under what circumstances would you select to use an interpreted language?

  Implementation of memory management

Paper describes about memory management. How memory is used in executing programs and its critical support for applications.

  Define open and closed loop control systems

Define open and closed loop cotrol systems.Explain difference between time varying and time invariant control system wth suitable example.

  Prepare a proposal to deploy windows server

Prepare a proposal to deploy Windows Server onto an existing network based on the provided scenario.

  Security policy document project

Analyze security requirements and develop a security policy

  Write a procedure that produces independent stack objects

Write a procedure (make-stack) that produces independent stack objects, using a message-passing style, e.g.

  Define a suitable functional unit

Define a suitable functional unit for a comparative study between two different types of paint.

  Calculate yield to maturity and bond prices

Calculate yield to maturity (YTM) and bond prices

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd