Research australian law on collecting public data

Assignment Help Other Subject
Reference no: EM133549490

Big Data Management

Learning Outcome 1: Demonstrate advanced and integrated understanding of data modelling, storage, and retrieval methods and apply knowledge and skills to retrieve information from data storage;

Learning Outcome 2: Apply knowledge and skills to design and complete a project to coordinate and manage large data sets;

Learning Outcome 3: Analyse critically and interpret the knowledge from large data sets;

Learning Outcome 4: Interpret and transmit information and knowledge in the application discipline to specialist and non-specialist audiences;

Learning Outcome 5: Analyse critically and reflect on the issues of privacy and ethics of Big Data.

Suppose you are working for the Australian Government as a "Data Scientist" to tackle COVID-19 or any other future pandemic. Google has released a dataset on people's mobility during the pandemic. As a "Data Scientist," you have found some critical information from that dataset, which helped Australia understand COVID-19. Now, you are famous :).

So, Australia's Prime Anthony Albanese has hired you in his special Foreign Affairs team. He wants you to compare Australia's pandemic situation with any other country. Luckily, you have the dataset from Google and another new dataset regarding the COVID-19 cases in the government-secured server. Suppose the size of each dataset is 100 petabytes.
Therefore, you have chosen to use Spark to complete the analysis.

In this assignment, you will add some old information from Assignments 1 and 2.

Tasks of the Assignment:

• Explore two datasets and identify a research question.
• Now create spark distributed data frames from these datasets.
• Explore, Filter, and Analyse datasets using spark.
• Based on the analysis, answer the research question.

1. Introduction:
• Provide a brief discussion of the mobility dataset details.
• Provide a brief discussion of the covid case (cc) dataset details.
• From where did you download the mobility dataset?

2. Data Exploration:
• Discuss the size of the mobility dataset.
• Discuss the size and format of the cc dataset.
• Discuss the format of the mobility dataset.
• Discuss the features (columns) of the mobility dataset.
• Discuss the features (columns) of the cc dataset.

3. Literature Review:

• Find at least two research works from "Google Scholar (Any preprint or published work)" where the researchers have used this mobility dataset. Please provide a brief discussion of their research. How did the researchers use this dataset to answer their research question?

• Find at least two research works from "google scholar (Any preprint or published work)" where the researchers have used this cc dataset. Please provide a brief discussion of their research. How did the researchers use this dataset to answer their research question?

4. Research Question/Selection of the Problem:

• Identify a research question that you can answer after analysing both datasets. The research question must focus on countries, such as Australia and the UK.
• Justify your research question. Why is your research question important for comparing the COVID-19 situation between Australia and other countries?

5. Method:• You are using Spark as you are dealing with big data. By the way, what is Spark?• Why did you choose spark over Hadoop MapReduce?

6. Connection Between Datasets:
• How can you connect these two datasets to answer your research question?
• List the steps you have taken to find out the useful subset of the datasets.

7. Data Analysis:
• Provide a detailed analysis with appropriate visualisations to answer the research question. (15 Marks {Visualisations on the Analysis} +
{Relevant Discussions according to the Visualisations})

8. Findings
• Provide the discussion to answer your research question based on the findings from the analysis.

9. Ethics and Privacy:
• Research Australian Law on collecting public data and show the validity of this mobility dataset according to Australian Law.
• Research Australian Law on collecting public data and show the validity of this cc dataset according to Australian Law.

10. Hosting on a server

• Please create a Spark cluster in AZURE and run your analysis code in that cluster. Now, record a video with any screen capturing software. The recording should show that you are using AZURE and you are running your whole code in the AZURE server using Spark. Upload this video to Google Drive and share the link at the end of the report or in a separate file named.

12. Presentation and Viva:

• Students need to present their work and findings. Questions will be asked at the end of the presentation
10. Writing Style and Report Format:

• The report is clearly written, and sections are connected.
• The report follows the given structure.
• Proper and correct in-text citation is presented in the report.
• The report cannot exceed fifteen pages (Page count includes everything from the table of contents to references and appendix). Any front of size 12pt is accepted.

Reference no: EM133549490

Questions Cloud

What is the simple doctrine of innate ideas : What is the simple doctrine of innate ideas? Give the basic strategy of one of Locke's arguments against the simple doctrine of innate ideas.
Ideas gained more attention in twentieth century : Kierkegaard died in 1855, but his ideas gained more attention in the twentieth century than during his lifetime.
Nonetheless valid from the aristotelian standpoint : Some categorical syllogisms that are invalid from the Boolean standpoint are nonetheless valid from the Aristotelian standpoint.
Mentalistic and behavioral approaches to therapy : Identify the difference between mentalistic and behavioral approaches to therapy. Discuss the use of data collection to determine treatment effectiveness
Research australian law on collecting public data : Research Australian Law on collecting public data and show the validity of this mobility dataset according to Australian Law.
Create a purchase order for the deluxe touring bike : Reviewing the stock, you realise that stock is required, and you are required to start the procurement process. The Deluxe Touring Bike
Design of a small food retail outlet : Design of a small food retail outlet, and the associated construction detailing of the street facing wall - discuss their independent ways of approaching
Kinaxis case study - forecasting the unforeseeable : Applying the naive algorithm, moving average and exponential smoothing - Forecast 2020 monthly average demand
Different view points on why society expects nurses : What are different view points on why society expects nurses and healthcare professionals to act ethically?

Reviews

Write a Review

Other Subject Questions & Answers

  Cross-cultural opportunities and conflicts in canada

Short Paper on Cross-cultural Opportunities and Conflicts in Canada.

  Sociology theory questions

Sociology are very fundamental in nature. Role strain and role constraint speak about the duties and responsibilities of the roles of people in society or in a group. A short theory about Darwin and Moths is also answered.

  A book review on unfaithful angels

This review will help the reader understand the social work profession through different concepts giving the glimpse of why the social work profession might have drifted away from its original purpose of serving the poor.

  Disorder paper: schizophrenia

Schizophrenia does not really have just one single cause. It is a possibility that this disorder could be inherited but not all doctors are sure.

  Individual assignment: two models handout and rubric

Individual Assignment : Two Models Handout and Rubric,    This paper will allow you to understand and evaluate two vastly different organizational models and to effectively communicate their differences.

  Developing strategic intent for toyota

The following report includes the description about the organization, its strategies, industry analysis in which it operates and its position in the industry.

  Gasoline powered passenger vehicles

In this study, we examine how gasoline price volatility and income of the consumers impacts consumer's demand for gasoline.

  An aspect of poverty in canada

Economics thesis undergrad 4th year paper to write. it should be about 22 pages in length, literature review, economic analysis and then data or cost benefit analysis.

  Ngn customer satisfaction qos indicator for 3g services

The paper aims to highlight the global trends in countries and regions where 3G has already been introduced and propose an implementation plan to the telecom operators of developing countries.

  Prepare a power point presentation

Prepare the power point presentation for the case: Santa Fe Independent School District

  Information literacy is important in this environment

Information literacy is critically important in this contemporary environment

  Associative property of multiplication

Write a definition for associative property of multiplication.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd