Shared memory architecture and resilient distributed data

Reference no: EM132667786, Length: 1600 words

MITS6005 Big Data - Victorian Institute of Technology

Learning Outcome 1: Expertly apply techniques to perform big data query manipulation, and evaluate various data storage options and types of aggregated data modelling. Through a critical study, choose an appropriate storage model based on the application requirements for processing large amounts of structured and unstructured data.

Learning Outcome 2: Independently perform data manipulation and querying (including updates, transactions, and indexes) in big data applications dealing with high volumes of data using NoSQL. Organize and store the collected data, and manipulate it by crafting queries, for example using Hive, HBase and related data tools.

Learning Outcome 3: Carry out research on emerging Big Data technologies to evolve models/solutions such as configurable and executable compute jobs built on distributed and shared memory architectures and Resilient Distributed Datasets (RDDs).

Learning Outcome 4: Implement typical solution use cases in a big data context using technologies such as MapReduce and the Spark framework, and ecosystems such as Hadoop (or another similar platform).
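
As a rough illustration of the MapReduce model named in Learning Outcome 4, the following is a minimal single-process Python sketch of the map, shuffle and reduce phases. The record fields (`product`, `amount`) and all function names are hypothetical and are not taken from the Pixystems data; an actual solution would run on Hadoop or Spark rather than in plain Python.

```python
from collections import defaultdict

# Hypothetical sample records; the field names are assumptions for illustration only.
records = [
    {"product": "robot", "amount": 120.0},
    {"product": "puzzle", "amount": 35.5},
    {"product": "robot", "amount": 80.0},
]

def map_phase(record):
    """Emit (key, value) pairs for one input record."""
    yield record["product"], record["amount"]

def shuffle_phase(pairs):
    """Group all values by key, as a framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped

def reduce_phase(key, values):
    """Combine the values collected for one key into a single result."""
    return key, sum(values)

def run_job(data):
    """Run map, shuffle and reduce in sequence on an in-memory list."""
    pairs = (pair for record in data for pair in map_phase(record))
    grouped = shuffle_phase(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())

totals = run_job(records)
```

In a distributed framework the map and reduce functions would run in parallel across many machines; only the structure of the computation is shown here.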

Objective
You will work with your group and leverage the data provided by Pixystems (see the Data Overview above for a high-level explanation of the data provided) and information obtained from the CFO to identify areas for financial and operational improvement and to address issues and errors found in the processing of data. To do this, your group must create analytics that assess the validity of the data and provide the insight the CFO is looking for.
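
One possible starting point for assessing data validity is sketched below in plain Python. It flags missing values, duplicate identifiers and negative quantities. The rows and column names (`invoice_id`, `quantity`, `unit_price`) are invented for illustration and are assumptions, not the actual layout of the project spreadsheet; the checks your group needs will depend on the real data.

```python
from collections import Counter

# Hypothetical rows; the column names are assumptions, not the real spreadsheet schema.
rows = [
    {"invoice_id": "INV-001", "quantity": 10, "unit_price": 4.5},
    {"invoice_id": "INV-002", "quantity": -3, "unit_price": 4.5},
    {"invoice_id": "INV-002", "quantity": 5, "unit_price": None},
]

def find_issues(rows):
    """Return (row_index, description) tuples for each validity problem found."""
    issues = []
    id_counts = Counter(row["invoice_id"] for row in rows)
    for index, row in enumerate(rows):
        # Any empty cell makes the record incomplete.
        if any(value is None for value in row.values()):
            issues.append((index, "missing value"))
        # The same identifier appearing more than once suggests double entry.
        if id_counts[row["invoice_id"]] > 1:
            issues.append((index, "duplicate invoice_id"))
        # A negative quantity is treated as suspicious in this sketch.
        quantity = row["quantity"]
        if quantity is not None and quantity < 0:
            issues.append((index, "negative quantity"))
    return issues

issues = find_issues(rows)
for index, description in issues:
    print(f"row {index}: {description}")
```

Whether a flagged row is a genuine error (for example, a negative quantity may be a legitimate return) is an assumption to state and justify in the report.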

You are expected to produce a visualization report representing the analytical procedures you performed and the assumptions you made. The project data required for the case study is provided in "Fictional Project Data.xlsx", with the relevant records as tabs in the spreadsheet. You will have 10 minutes to present this visualization, the assumptions made, and your recommendations. Each member of your group must have an active speaking role within the presentation.

Description

Read the "Pixystems_Toys_Information.pdf" file carefully. You will perform the analytics using Python packages or a package of your choice. You can use "Tableau Server Client" and write your code in Python, or you can use other Python libraries or the libraries of your chosen package. Python is one of the most frequently used programming languages in many fields, particularly data science, and it has many libraries for tasks including big data processing and data visualization. You must research Python packages (or the packages of your choice), find suitable ones for the task below, and use them for your analysis and report. Whichever tools you choose, we expect quality work from you. For Python code, you can use Anaconda or Google Colab. Colab is a free notebook environment that requires no setup and runs entirely in the cloud. You need to log in to Google Colab to use it.

Your report should be 1000-1500 words and address the business questions, challenges, analytics and data visualization in "Pixystems_Toys_Information.pdf". It should cover what you are going to solve and how, along with plots and recommendations. The report should include 6-10 plots (screenshots) from your findings, with explanations. The program code must be added at the end of the project. The Word template is provided as "MITS6005-Report format for assignment-3.doc".

The presentation should last a maximum of 10 minutes for the whole team. Each member should speak for at least 2 minutes about the project and findings. The presentation should cover the data, business questions, research findings and visualization, and a step-by-step discussion of how you completed the project.

You will also prepare a final report outlining the following:
• Results of the analytics you performed, along with your rationale for performing them and the assumptions made
• Insight that the analytics provided to management
• Explanation of any analytics you decided not to perform
• Recommendations your team has for improving Pixystems' processes
• Overview of any other issues that Pixystems should follow up on
• Recommendations on system controls that could be put in place
• Any other data you would like to have obtained from Pixystems

General Instructions

1. Your writing should be clear and concise and be in your own words.
2. The report must be in the range of 1,500-2,500 words in length excluding references.

Attachment:- Big Data Assignment.rar

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd