Big Data and Data Warehouse

Reference no: EM133052758

Assignment: Big Data and Data Warehouse

In this assignment you will make recommendations on the collection and management of Big Data on behalf of your client, and you will explain how and where all of this data will be stored.

What database will you use? Will it store raw unstructured data or pre-formatted structured data?
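
To make the raw-versus-structured choice concrete, here is a minimal sketch of the "store raw, format on read" option. It assumes the raw data arrives as JSON lines; the field names (`customer`, `amount`, `ts`), the sample records, and the function `read_with_schema` are all hypothetical and not part of the assignment.

```python
import json

# Hypothetical raw records, stored exactly as they arrived from the sources.
RAW_EVENTS = [
    '{"customer": "A17", "amount": "19.99", "ts": "2024-01-05"}',
    '{"customer": "B02", "amount": 5, "note": "in-store"}',
    'not valid json',
]

def read_with_schema(raw_lines):
    """Apply structure at read time; the stored raw lines are never modified."""
    rows, rejected = [], []
    for line in raw_lines:
        try:
            record = json.loads(line)
            rows.append({
                "customer": str(record["customer"]),
                "amount": float(record["amount"]),  # accept "19.99" or 5
                "ts": record.get("ts"),             # optional, may be None
            })
        except (json.JSONDecodeError, KeyError, TypeError, ValueError):
            # Malformed records are set aside rather than silently dropped.
            rejected.append(line)
    return rows, rejected

rows, rejected = read_with_schema(RAW_EVENTS)
```

Storing pre-formatted structured data would instead run this kind of validation once, before the data is written, so that every stored row already conforms to the schema.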

Your choices here depend in part on your earlier choices involving the cloud, and they will in turn influence your later choices for networks.

Your client's data might be found in multiple places and in multiple formats. For the purposes of this assignment, assume you CAN get the data: by partnering with the data owner, by recommending that the client purchase the rights to the data, by screen-scraping data from the client's own website(s), or by uploading client financial data.

If your proposal also uses video data, there are further considerations. How will you store and use that data? Perhaps you will use software that interprets activity within videos, or perhaps you will plan for constituents to upload mobile phone video to your client's website.

1. For your chosen business (your client's business) and its industry, determine whether it is advisable to establish this new data analytics function and database at a cloud service provider (CSP). Explain why. Find similar cases elsewhere.

2. Where is this Big Data found?

3. What format and type of database will you use?

4. How will the data get from wherever it is into this database? Supply a data flow diagram (DFD).

5. Will you store unformatted data? If so, what application will format the data when you read it for analysis?

6. Will you store formatted data in a data warehouse? If so, supply the schema diagram.

7. Is this data going to be historical in nature?

8. Is this data going to include a real-time component? If so, this greatly complicates the scenario: you need to address the impact of outages on data loss, and you will probably need a helpdesk to support the real-time function. Any real-time component will significantly affect your future networking recommendations.

9. Will you be recommending some form of data warehouse? If so, will you use ETL (extract, transform, load) or something else?

10. Will you be recommending a Hadoop structure? If so, where will it be hosted?

11. Create a workflow diagram (WFD) showing the activities from data generation through data capture and data analysis to report generation.
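
As one illustration of questions 6, 9, and 11, the sketch below runs a tiny ETL pass into a star-schema warehouse and then produces a report. It is only an assumption-laden example, not a required design: the in-memory SQLite database stands in for whatever warehouse you recommend, and the table names (`dim_customer`, `dim_product`, `fact_sales`), the sample rows, and the helper functions are all hypothetical.

```python
import sqlite3

# Hypothetical extracted rows; in practice these come from the client's sources.
SOURCE_ROWS = [
    {"customer": " alice ", "product": "Widget", "date": "2024-01-05", "amount": "19.99"},
    {"customer": "Bob", "product": "Widget", "date": "2024-01-05", "amount": "5.00"},
    {"customer": "ALICE", "product": "Gadget", "date": "2024-01-06", "amount": "12.50"},
]

def transform(row):
    """Transform step: normalise names and convert the amount to a number."""
    return {
        "customer": row["customer"].strip().title(),
        "product": row["product"].strip(),
        "date": row["date"],
        "amount": float(row["amount"]),
    }

def dimension_key(conn, table, value):
    """Return the surrogate key for a dimension value, inserting it if new.

    `table` is only ever one of the fixed names used below, never user input.
    """
    conn.execute(f"INSERT OR IGNORE INTO {table} (name) VALUES (?)", (value,))
    return conn.execute(
        f"SELECT id FROM {table} WHERE name = ?", (value,)
    ).fetchone()[0]

def load(conn, rows):
    """Load step: create the star schema and insert one fact row per source row."""
    conn.executescript("""
        CREATE TABLE dim_customer (id INTEGER PRIMARY KEY, name TEXT UNIQUE);
        CREATE TABLE dim_product (id INTEGER PRIMARY KEY, name TEXT UNIQUE);
        CREATE TABLE fact_sales (
            customer_id INTEGER REFERENCES dim_customer(id),
            product_id INTEGER REFERENCES dim_product(id),
            sale_date TEXT,
            amount REAL
        );
    """)
    for row in rows:
        clean = transform(row)
        conn.execute(
            "INSERT INTO fact_sales VALUES (?, ?, ?, ?)",
            (dimension_key(conn, "dim_customer", clean["customer"]),
             dimension_key(conn, "dim_product", clean["product"]),
             clean["date"], clean["amount"]),
        )
    conn.commit()

conn = sqlite3.connect(":memory:")
load(conn, SOURCE_ROWS)

# Report generation: total sales per customer, read from the warehouse.
report = conn.execute("""
    SELECT c.name, SUM(f.amount)
    FROM fact_sales f JOIN dim_customer c ON c.id = f.customer_id
    GROUP BY c.name ORDER BY c.name
""").fetchall()
```

The fact table holds the measurements and the dimension tables hold the descriptive attributes, which is the shape a schema diagram for question 6 would show. An ELT approach would instead load the raw rows first and run the transform step inside the warehouse.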





All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd