Analytics report assignment

Assignment Help Other Subject

Reference no: EM133077521

ITECH1103 Big Data And Analytics - Federation University

Assignment: Analytics Report

Overview

The purpose of this task is to provide students with practical experience in writing a dataanalytical report to provide useful insights, pattern and trends in a chosen dataset in the light of a set of tasks required within this document. This dataset will be chosen from the UC Irvine Machine Learning Repository . This activity willgive students the opportunity to show innovation and creativity in applying the WEKA data mining software, and designing usefulvisualization and data mining solutions presented as an analytics report.

Project Details

You will use an analytical tool (i.e. WEKA) to explore, analyse and visualise a dataset of your choosing.An important part of this work is preparing a good quality report, which details your choices, content, and analysis, and that is of an appropriate style.
The dataset should be chosen from the following repository:

The aim is to use the data set allocated to provide interesting insights, trends and patterns amongst the data. Yourintended audience is the CEO and middle management of the Company for whom you are employed, and who have tasked you with this analysis.

Assignment Task 1- Data choice. Choose any dataset from the repository that has at least five attributes, and for which the default task is classification. Transform this dataset into the ARFF format required by WEKA.

Assignment Task 2 - Background information. Write a description of the dataset and project, and its importance for the organization. Provide an overview of what the dataset is about, including from where and how it has been gathered, and for what purpose.Discuss the main benefits of using data mining to explore datasets such as this. This discussion should be suitable for a general audience. Information must come from at least two appropriate sources be appropriately referenced.

Assignment Task 3 - Data description. Describe how many instances does the dataset contain, how many attributes there are in the dataset, their names, and include which is the class attribute.Include in your description details of any missing values, and any other relevant characteristics. For at least 5 attributes, describe what is the range of possible values of the attributes, and visualise these in a graphical format.

Assignment Task 4 -Data preprocessing. Preprocess the dataset attributes using WEKA's filters. Useful techniques will includeremove certain attributes, exploring different ways of discretizing continuous attributes and replacing missing values. Discretizing is the conversion of numeric attributes into "nominal" ones by binning numeric values into intervals . Missing values in ARFF files are represented with the character "?" .If you replaced missing values explain what strategy you used to select a replacement of the missing values. Use and describe at least three different preprocessing techniques.

Assignment Task 5 - Data mining. Compare and contrast at least three different data mining algorithms on your data, for instance:.k-nearest neighbour,Apriori association rules, decision tree induction. For each experiment you ran describe: the data you used for the experiments, that is, did you use the entire dataset of just a subset of it. You must include screenshots and results from the techniques you employ.

Assignment Task 6 - Discussion of findings. Explain your results and include the usefulness of the approaches for the purpose of the analysis. Include any assumptions that you may have made about the analysis. In thisdiscussion you should explain what each algorithm provides to the overall analysis task. Summarize your main findings.

Assignment Task 7 - Report writing.Present your work in the form of an analytics report.

Your references should use the APAreferencing style

Attachment:- Analytics Report.rar

Reference no: EM133077521

Questions Cloud

Critically review the principles of modelling : What are the factors controlling energy release rates in enclosure fires and Discuss briefly: energy release rates based on free burn measurements

Why is scarcity constant in a resource rich country : 1. Why is scarcity constant in a resource rich country like Canada?

Discuss what attributes you felt were good and bad : Describe a product or service that you feel is well designed and one that is poorly designed. Discuss what attributes you felt were good and bad.

Quebecor printing is commercial printing company : Quebecor Printing is a commercial printing company that is expanding, acquiring ailing printing companies, and moving into international markets

Analytics report assignment : Analytics Report Assignment Help - Describe how many instances does the dataset contain, how many attributes there are in the dataset

Differences between process costing and job-order costing : Describe the differences between process costing and job-order costing. Or provide an example of each.

What amount of personnel costs will be allocated to A : If the number of employees is considered the cost driver, what amount of personnel costs will be allocated to Department A

International division of labour contribute to globalisation : How does the international division of labour contribute to globalisation?

Develop an effective strategy : Given this dilemma, how would you develop an effective strategy to ensure you act in the best interests of your company, but without engaging in collusion, whic

User Account

All Pages