Energy prediction of domestic appliances dataset

Assignment Help Basic Computer Science
Reference no: EM132516849

Energy Prediction of Domestic Appliances Dataset

The given dataset, "Energy20.txt", can be used to create models of energy use of appliances in a energy-efficient house. The dataset provides the Energy use of appliances (denoted as Y) using 671 samples. It is a modified version of data used in the study [1]. The dataset includes 5 variables, denoted as X1, X2, X3, X4, X5, and Y, described as follows:

X1: Temperature in kitchen area, in Celsius
X2: Humidity in kitchen area, given as a percentage
X3: Temperature outside (from weather station), in Celsius
X4: Humidity outside (from weather station), given as a percentage
X5: Visibility (from weather station), in km
Y: Energy use of appliances, in Wh

Tasks

Task 1. Understand the data
(i) Download the txt file (Energy20.txt) from Future Learn and save it to your R working directory.
(ii) Assign the data to a matrix, e.g. using
the.data <- as.matrix(read.table("Energy20.txt "))
(iii) The variable of interest is Energy use of appliances (Y). To investigate Y, generate a subset of 350 data, e.g. using:
my.data <- the.data[sample(1:671,350),c(1:6)]

(iv) Using scatter plots and histograms, report on the general relationship between each of the variables X1, X2, X3, X4, X5 and the variable of interest Y. Include 5 scatter plots, 6 histograms, and 1 or 2 sentences for each of the variables, including the variable of interest Y.

Task 2. Transform the data

(i) Choose any four from the five variables (X1, X2,..,X5). Make appropriate transformations to the chosen four variables and the variable of interest Y so that the values can be aggregated in order to predict the variable of interest. Assign your transformed data along with your transformed variable of interest to an array (it should be 350 rows and 5 columns). Save it to a txt file titled "name- transformed.txt" using
write.table(your.data,"name-transformed.txt")
where "name" is replaced with your name - you can use your surname or first name.
(ii) Briefly explain the transformations applied for the selected four variables and the variable of interest. (1- 2 sentences each)

Task 3. Build models and investigate the importance of each variable
(i) Download the AggWaFit718.R file to your working directory and load into the R workspace using, source("AggWaFit718.R")
(ii) Use the fitting functions to learn the parameters for
• A weighted arithmetic mean (WAM)
• Weighted power means (WPM) with p = 0.5, and p = 5,
• An ordered weighted averaging function (OWA), and
• A Choquet integral.

(iii) Include two tables in your report - one with the error measures and correlation coefficients, and one summarising the weights/parameters and any other useful information learned for your data.

(iv) Compare and interpret the data in your tables. Comment on
a. How good the model is,
b. The importance of each of the variables (the four variables that you have selected),
c. Any interaction between any of those variables (are they complementary or redundant?) and
d. Better models favour higher or lower inputs. (1-3 paragraphs for part 3(iv))

Task 4. Use your model for prediction
(i) Choose your best fitting model.
Using your best fitting model, predict the Energy use of appliances for the following input X1=17; X2=39; X3=4; X4=77; X5=32.
(ii) Give your result and comment on whether you think it is reasonable. (1-2 sentences).
(iii) Comment on the best conditions (in terms of your chosen four variables) under which a high Energy use of appliances will occur. (1-2 sentences).

Task 5. Comparing with a linear regression model
Linear regression is used to predict the value of an outcome variable Y based on one or more input predictor variables X. The equation is Y = β0 + β1X1 + β2X2 + ? βnXn + ∈. The built- in function lm() is used to fit linear models in R.
(i) Build your linear model using the same dataset in Question 3 and describe the summary statistics for your model using the function summary().
(ii) Compare the performance of the linear model you got with your best fitting model in Question 4. Visualise the predicted Y values of both models on the 350 data and compare them with the true Y values.
(iii) Give your comment on the differences between the linear model and your best fitting model. (2-4 sentences).

All supporting information should be presented in the pdf report. It will be assessed for style and grammar, professional presentation of figures, tables and references. List and quote in the text the references used, including books, articles and web resources.

Use the Harvard style

Attachment:- data analysis.rar

Reference no: EM132516849

Questions Cloud

Consider kant critique of happiness as an ethical principle : Consider Kant's critique of happiness as an ethical principle. Why does he think that happiness (or well-being) fails to provide an adequate standard
Paraphrasing a scripture how do you cite it in text : In APA format, when you are paraphrasing a scripture how do you cite it in text?
What was the amount of direct materials charged to that job : What was the amount of direct materials charged to that job? Howard Corporation has a job order costing system. The debits (credits) appear in the firm.
Prepare monthly cash budgets for January and March : According to a credit agreement with its bank, Kayak requires a minimum cash balance of $40,000. Prepare monthly cash budgets for January and March
Energy prediction of domestic appliances dataset : Energy Prediction of Domestic Appliances Dataset - Build models and investigate the importance of each variable - Comparing with a linear regression model
How will that affect scope-costs and schedule : Your project sponsor and customer are impressed with your project schedule, How will that affect scope, costs, and schedule?
Calculate the cost of goods sold account : Assuming underapplied or overapplied overhead is transferred to cost of goods sold at the end of the period, calculate the cost of goods sold account.
Implementation of enterprise resource planning : You are in the planning stage for an implementation of an enterprise resource planning (ERP) solution for your firm. Y
Determine cost formula that expresses the behaviour of yang : Determine the cost formula that expresses the behaviour of Yang's total costs. Yang Manufacturing makes a product called Yin. The relevant range of operations

Reviews

Write a Review

Basic Computer Science Questions & Answers

  Identifies the cost of computer

identifies the cost of computer components to configure a computer system (including all peripheral devices where needed) for use in one of the following four situations:

  Input devices

Compare how the gestures data is generated and represented for interpretation in each of the following input devices. In your comparison, consider the data formats (radio waves, electrical signal, sound, etc.), device drivers, operating systems suppo..

  Cores on computer systems

Assignment : Cores on Computer Systems:  Differentiate between multiprocessor systems and many-core systems in terms of power efficiency, cost benefit analysis, instructions processing efficiency, and packaging form factors.

  Prepare an annual budget in an excel spreadsheet

Prepare working solutions in Excel that will manage the annual budget

  Write a research paper in relation to a software design

Research paper in relation to a Software Design related topic

  Describe the forest, domain, ou, and trust configuration

Describe the forest, domain, OU, and trust configuration for Bluesky. Include a chart or diagram of the current configuration. Currently Bluesky has a single domain and default OU structure.

  Construct a truth table for the boolean expression

Construct a truth table for the Boolean expressions ABC + A'B'C' ABC + AB'C' + A'B'C' A(BC' + B'C)

  Evaluate the cost of materials

Evaluate the cost of materials

  The marie simulator

Depending on how comfortable you are with using the MARIE simulator after reading

  What is the main advantage of using master pages

What is the main advantage of using master pages. Explain the purpose and advantage of using styles.

  Describe the three fundamental models of distributed systems

Explain the two approaches to packet delivery by the network layer in Distributed Systems. Describe the three fundamental models of Distributed Systems

  Distinguish between caching and buffering

Distinguish between caching and buffering The failure model defines the ways in which failure may occur in order to provide an understanding of the effects of failure. Give one type of failure with a brief description of the failure

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd