Identify an application in your daily life that uses big data

Reference no: EM133276484

Big Data and Analytics

Big Data is defined as data that shares the following three characteristics (or three V's as they are called):

Volume: the amount of data is very large.

Variety: the data comes from a varied set of sources. These include standard databases, non-structured data, text, audio, images, video, sensor data, and geographic information.

Velocity: data is produced at very high speed.

Some definitions add two more V's, Veracity and Value, meaning that the data should be of good quality and relevant to its users. The instructor, however, believes these are characteristics of how the data is manipulated rather than intrinsic characteristics of the data itself.

Because of the variety of information that can be obtained from big data, it is not possible to have a model that covers all the information we can obtain from the data. This means that we cannot create an ERD powerful enough to capture the richness of the data. Therefore, systems that handle big data must use a "schema on read": the structure of the data is determined at the moment the data is read, rather than fixed in advance. There are two current approaches to obtaining this schema: JavaScript Object Notation (JSON) and Extensible Markup Language (XML). Both are formats that can be used to describe the organization of the data being received from the sources of big data.
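The schema-on-read idea can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the records, the field names, and the helper functions `infer_schema`, `read_with_schema`, and `queries` are all hypothetical and do not come from the textbook. The point is that the records are stored as they arrive, and their structure is discovered only when they are read.

```python
import json

# Hypothetical raw records, stored exactly as they arrived.
# No schema was fixed before the data was written.
RAW_RECORDS = [
    '{"user": "ana", "query": "weather", "lat": 40.4, "lon": -3.7}',
    '{"user": "bo", "video_id": 17, "seconds_watched": 212}',
    '{"user": "cy", "query": "news"}',
]


def infer_schema(record):
    """Describe one parsed record as a mapping of field name -> type name."""
    return {field: type(value).__name__ for field, value in record.items()}


def read_with_schema(raw_lines):
    """Parse each line and attach the structure discovered at read time."""
    for line in raw_lines:
        record = json.loads(line)
        yield infer_schema(record), record


def queries(raw_lines):
    """Impose only the structure this question needs: records with a 'query' field."""
    return [rec["query"] for schema, rec in read_with_schema(raw_lines) if "query" in schema]


if __name__ == "__main__":
    for schema, _ in read_with_schema(RAW_RECORDS):
        print(schema)
    print(queries(RAW_RECORDS))
```

Each record may carry different fields, so a single fixed table layout would not fit all three. The reader decides which fields matter for the question being asked and ignores the rest.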

Once we can obtain schemas from big data, we need to be able to manipulate the data to obtain information. For this we use "Not only SQL", or NoSQL. This is software that stores big data and helps to manage it. It is the equivalent of the DBMS in structured databases, but for big data. The textbook describes four kinds of NoSQL database management systems that are currently in use. Each one stores data in a different way and offers different capabilities.

Even though we can create programs to handle each piece of big data, we also need the ability to do it rapidly. It is of no use to know how to obtain some information if that information will only be available after a long period of time. The large number of items in big data requires an approach that speeds calculations up. This approach is Hadoop. Hadoop is a framework that breaks a calculation on big data into a set of smaller tasks, each performed on a section of the data by a cluster of computers. Each computer in the cluster is independent of the others and receives a set of the calculations and a section of the big data to process. All computers in the cluster work in parallel, each performing its calculations on its own section of the data. When they finish, their results are combined and a final response is created for the whole system.

An example that comes to mind is the following: how can we find which search is trending right now on Google? We could collect all queries made in the last hour and find which topic is requested most. Given the number of Google users all over the world, this could take quite a while if done by a single machine. However, if we use a cluster of computers and give each computer the data from one world region, each computer can find the trendiest topic in its region. All of them compute in parallel, and at the end another set of computers combines the results of the different regions to obtain the final result.
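The split-then-combine approach in the Google example can be sketched in Python as follows. This is a toy illustration, not Hadoop itself: the query logs, the region names, and the functions `count_region`, `merge_counts`, and `trendiest` are hypothetical, and threads on one machine stand in for the independent computers of a cluster.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor

# Hypothetical query logs, already split by world region (one section per worker).
QUERIES_BY_REGION = {
    "americas": ["football", "weather", "election", "weather"],
    "europe": ["football", "election", "election", "recipes"],
    "asia": ["cricket", "election", "cricket", "weather"],
}


def count_region(queries):
    """First step: one worker counts the queries in its own section of the data."""
    return Counter(queries)


def merge_counts(partial_counts):
    """Second step: combine the per-region counts into one global count."""
    total = Counter()
    for partial in partial_counts:
        total.update(partial)
    return total


def trendiest(queries_by_region):
    """Count every region in parallel, merge the counts, and pick the top query."""
    with ThreadPoolExecutor() as pool:
        partial_counts = list(pool.map(count_region, queries_by_region.values()))
    total = merge_counts(partial_counts)
    return total.most_common(1)[0]


if __name__ == "__main__":
    print(trendiest(QUERIES_BY_REGION))
```

One design choice is worth noting. The sketch sends the full per-region counts to the combining step rather than only each region's winner. In this sample data "election" is the top query in only one region, yet it has the highest total worldwide, so combining regional winners alone could give the wrong answer.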

Finally, this chapter presents some real computer architectures that implement some of the approaches explained above on real big data. It will be worthwhile for you to review the list of areas where these approaches have made, and will continue to make, an impact.

QUESTION:

Identify an application in your daily life that uses big data. You are surrounded by applications that appear small to you but actually involve big data: payments at grocery stores; water, electricity, phone, and cable Internet billing systems; social networks; email systems; streaming audio and/or video; navigational systems; all dealings with banks and investment companies; etc. You may be involved in only a few transactions with these systems, but they generate a lot of data from all the interactions with a network of users.

Once you identify the application you want to work with, think of a useful piece of information you may want to obtain from this big data. You must explain what this information is and how you think we can use the big data application to obtain it.



All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd