Perform a data analysis of a data set

Reference no: EM133180774

Big Data Analytics

Repeat Assignment Worth 60% of Module Grade

1 Description and Submission Format

In this assignment you are tasked with performing a data analysis of a data set using the R language. You should submit a PDF document generated from your R Markdown file.

2 Data set

The data set to be used for the analysis is the Student Performance Data Set.

Tasks

Task 1

Your first task is to perform an exploratory analysis of the data set. This should give you a basic understanding of the data. To do so, load your data from a file, then clean the data as much as possible so that further analysis is easier. Finally, perform the exploratory analysis by visualising and summarising the data. You should also look at the relationships between variables and check the "strength" of those relationships. Your report should include some of the plots and summaries, with explanations.
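The load, clean, summarise and correlate steps could be sketched roughly as follows. This is only an illustration under stated assumptions: the data is synthetic, written to a temporary `;`-separated file so the example runs on its own, and the column names (`sex`, `studytime`, `absences`, `G1`, `G3`) are placeholders that may not match your actual file.

```r
# Synthetic stand-in for the real file; all column names are assumptions.
set.seed(42)
n <- 200
students <- data.frame(
  sex       = sample(c("F", "M"), n, replace = TRUE),
  studytime = sample(1:4, n, replace = TRUE),
  absences  = rpois(n, lambda = 5),
  G1        = round(runif(n, 0, 20))
)
students$G3 <- pmin(20, pmax(0, round(0.8 * students$G1 + students$studytime +
                                      rnorm(n, sd = 2))))
students$absences[sample(n, 5)] <- NA   # inject a few missing values

# Write to a temporary ';'-separated file and load it back, as with a real file.
path <- tempfile(fileext = ".csv")
write.table(students, path, sep = ";", row.names = FALSE, quote = FALSE)
raw <- read.csv(path, sep = ";", stringsAsFactors = FALSE)

# Cleaning: drop incomplete and duplicated rows, encode categories as factors.
clean <- raw[complete.cases(raw), ]
clean <- clean[!duplicated(clean), ]
clean$sex <- factor(clean$sex)

# Structure and per-column summaries.
str(clean)
print(summary(clean))

# Strength of linear relationships between the numeric variables.
num_cols   <- sapply(clean, is.numeric)
cor_matrix <- cor(clean[, num_cols])
print(round(cor_matrix, 2))

# A single pair can be checked more formally with a correlation test.
cor_g1_g3 <- cor.test(clean$G1, clean$G3)
print(cor_g1_g3)
```

Dropping incomplete rows is only one possible cleaning choice; imputing missing values may be preferable when few observations are available.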

Task 2

The second task is quite open. You have done a preliminary exploration of the data set. At this point you should understand the domain of your data set, and you should have seen how different attributes of the data look. Your final goal is to report some findings (or the lack of them), supported by evidence that they are statistically sound. The following points are just hints of what might be interesting to do or look at:
• Take a look at plots you have created in the first part - what conclusions can be drawn based on them? These could be your hypotheses.

• The data contains categorical variables - is there a difference between instances belonging to one category and another? Even if you do not see clear differences, you could perform a statistical test to check whether some properties change across categories.
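As a hedged illustration of testing whether a property changes across categories, the sketch below uses synthetic data with hypothetical columns (`school`, `support`, `grade`) and a deliberately built-in difference between the two groups. Which test suits your real data depends on its distribution and sample sizes.

```r
# Synthetic data; column names and group means are assumptions for illustration.
set.seed(1)
n <- 120
students <- data.frame(
  school  = factor(sample(c("A", "B"), n, replace = TRUE)),
  support = factor(sample(c("no", "yes"), n, replace = TRUE))
)
# Build in a true mean difference so the tests have something to detect.
students$grade <- rnorm(n, mean = ifelse(students$school == "A", 10, 14), sd = 3)

# Welch two-sample t-test. H0: the mean grade is equal in both schools.
tt <- t.test(grade ~ school, data = students)
print(tt)

# Non-parametric alternative when normality is doubtful.
wt <- wilcox.test(grade ~ school, data = students)
print(wt)

# Chi-squared test of independence between two categorical variables.
ct <- chisq.test(table(students$school, students$support))
print(ct)

# Compare each p-value with a significance level chosen in advance.
alpha <- 0.05
cat("Reject H0 of equal means:", tt$p.value < alpha, "\n")
```

A failure to reject the null hypothesis is also a finding worth reporting, in line with the "lack of findings" mentioned above.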

Task 3

• Perform linear regression with multiple variables to predict the student grade. Normalize the data and repeat the multiple linear regression on the normalized data to predict the student grade. Highlight the difference in prediction accuracy between the two data sets.
• Perform classification to predict an appropriate categorical variable. Normalize the data and repeat the classification on the normalized data. Highlight the difference in prediction accuracy between the two data sets.
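One possible way to set up both comparisons is sketched below, using synthetic data and hypothetical columns (`absences`, `income`, `prior`, `grade`, `pass`). Two assumptions are worth stating: normalization here means z-scoring with the training set's mean and standard deviation, and the classifier is a small hand-written k-nearest-neighbours function in base R, chosen because it is sensitive to feature scale. Ordinary least squares with an intercept gives the same predictions after such rescaling, so the two regression errors should coincide, whereas a distance-based classifier may change noticeably.

```r
set.seed(7)
n <- 300
d <- data.frame(
  absences = rpois(n, 6),
  income   = rnorm(n, 30000, 8000),  # large-scale, deliberately uninformative
  prior    = runif(n, 0, 20)
)
d$grade <- 2 + 0.7 * d$prior - 0.2 * d$absences + rnorm(n, sd = 1.5)
d$pass  <- factor(ifelse(d$grade >= median(d$grade), "pass", "fail"))

feats <- c("absences", "income", "prior")
idx   <- sample(n, round(0.7 * n))   # 70/30 train/test split
train <- d[idx, ]
test  <- d[-idx, ]

# z-score both sets using statistics from the training set only
mu      <- colMeans(train[, feats])
sdv     <- apply(train[, feats], 2, sd)
train_z <- scale(train[, feats], center = mu, scale = sdv)
test_z  <- scale(test[, feats],  center = mu, scale = sdv)

# --- Multiple linear regression, raw vs normalized ---
rmse <- function(actual, predicted) sqrt(mean((actual - predicted)^2))
fit_raw   <- lm(grade ~ absences + income + prior, data = train)
rmse_raw  <- rmse(test$grade, predict(fit_raw, newdata = test))
train_n   <- data.frame(train_z, grade = train$grade)
test_n    <- data.frame(test_z,  grade = test$grade)
fit_norm  <- lm(grade ~ absences + income + prior, data = train_n)
rmse_norm <- rmse(test_n$grade, predict(fit_norm, newdata = test_n))
cat("RMSE raw:", rmse_raw, " RMSE normalized:", rmse_norm, "\n")

# --- Classification with a minimal k-NN, raw vs normalized ---
knn_predict <- function(train_x, train_y, test_x, k = 5) {
  train_x <- as.matrix(train_x)
  test_x  <- as.matrix(test_x)
  preds <- apply(test_x, 1, function(row) {
    d2    <- colSums((t(train_x) - row)^2)     # squared Euclidean distances
    votes <- table(train_y[order(d2)[1:k]])    # labels of the k nearest rows
    names(votes)[which.max(votes)]             # majority vote
  })
  factor(preds, levels = levels(train_y))
}
acc_raw  <- mean(knn_predict(train[, feats], train$pass, test[, feats]) == test$pass)
acc_norm <- mean(knn_predict(train_z, train$pass, test_z) == test$pass)
cat("Accuracy raw:", acc_raw, " Accuracy normalized:", acc_norm, "\n")
```

If your regression accuracy does not change after normalization, that is expected for plain least squares and is itself worth explaining in the report.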

Submission

Write your code in an R Markdown document to present your preliminary data analysis in the form of a report. Do not put all of the plots in the report; decide what is useful and what is interesting to explore. Use multidimensional plots to present multiple variables.
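A multidimensional plot can be as simple as a scatter plot that also encodes a category as colour and a count as point size. The sketch below uses base R graphics on synthetic data; the columns `G1`, `G3`, `absences` and `sex` are assumptions, and the temporary PDF device is only there so the example runs outside knitr.

```r
# Synthetic data; column names are placeholders for illustration.
set.seed(3)
n <- 150
students <- data.frame(
  G1       = runif(n, 0, 20),
  absences = rpois(n, 5),
  sex      = factor(sample(c("F", "M"), n, replace = TRUE))
)
students$G3 <- 0.8 * students$G1 + rnorm(n, sd = 2)

# Inside an R Markdown chunk, omit the pdf() and dev.off() calls.
plot_file <- tempfile(fileext = ".pdf")
pdf(plot_file)

# Four variables in one plot: x, y, colour (category), point size (count).
palette_sex <- c("tomato", "steelblue")
plot(students$G1, students$G3,
     col  = palette_sex[as.integer(students$sex)],
     cex  = 0.6 + students$absences / 5,
     pch  = 19,
     xlab = "G1 (hypothetical earlier grade)",
     ylab = "G3 (hypothetical final grade)")
legend("topleft", legend = levels(students$sex), col = palette_sex, pch = 19)

# Scatter-plot matrix of the numeric variables, coloured by category.
pairs(students[, c("G1", "G3", "absences")],
      col = palette_sex[as.integer(students$sex)], pch = 19)

dev.off()
```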

• You can also get up to 10 points for clarity and quality of the report and the source code.
• Acceptable file format: knit your R Markdown document to PDF output. Use the submission link on Moodle to upload your final PDF report.
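Besides the Knit button in RStudio, the PDF can be produced from the R console. The snippet below is a sketch only: `report.Rmd` is a hypothetical file name, and PDF output additionally requires a LaTeX installation on your machine.

```r
# Hypothetical file name; replace with your own R Markdown document.
input_file <- "report.Rmd"

# Render only if the rmarkdown package and the file are both available.
if (requireNamespace("rmarkdown", quietly = TRUE) && file.exists(input_file)) {
  rmarkdown::render(input_file, output_format = "pdf_document")
} else {
  message("Skipping render: rmarkdown or ", input_file, " not found.")
}
```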





All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd