Reference no: EM132651044
BIG DATA and DATA ANALYTICS
PROJECT - DATASET DESCRIPTION & PREPARATION
This LAB project is based on a meteorological dataset about forest fires in the northeast region of Portugal.
In preparation for Lab Project 3, load the forestfires.csv dataset. Use the dataset to practice the following topics:
- Create correlation tables and correlation matrices
- Reporting of results in APA style (including tests discussed in earlier weeks, e.g., ANOVA, t-test)
- Storing the results of ggplot and other plotting tools to automatically generate png and pdf files
- Creating rtf files and include text, tables, and figures
- Running R files from the command line with "RScript"
- Simple linear regressions: coefficient estimates, predicted values, residuals, standard errors, confidence intervals, t-values, residual standard error, R2
- Analysing and interpreting the results of simple linear regressions
- Find out more about the function "describe" to explore a dataset and find out whether there are missing values.
- Create density plots with ggplot()
- Install and load the library "moments". Search online to find out more about the functions "skewness()" and "kurtosis()" of the library "moments" and the interpretation of their results. What is skewness? What is kurtosis?
Attachment:- Dataset_Description_and_Preparation.rar
Attachment:- forestfires.rar