Process of sorting through large amounts of data

Assignment Help Computer Engineering
Reference no: EM13810814

Assignment Overview

The goal of this project is to gain more practice with file I/O, lists and functions.

Background

Data mining is the process of sorting through large amounts of data and picking out relevant information. Everyone from financial analysts to scientists use it to extract information from enormous data sets. These large data sets and the trend of analyzing them has come to be know as "Big Data".

In this project, we want to do some preliminary data mining of the prices of Apple stock. Your program will calculate the monthly average prices of Apple stock from 1984 to 2013. You will report facts about the monthly highs and lows for this data.

Project Specifications

1. A file of Apple's daily stock's prices will be given to you, whose name is table.csv (we pulled it off the web). This file could be opened by notepad or similar text editor, and is delimited by commas. If you open it with Excel, it will show you the data as a spreadsheet.

2. You must implement the following functions:

a) get_input_descriptor()

In this function, you are required to repeatedly prompt for the name of an input file until the user enters filename and the file can be opened for input. Return a file descriptor attached to the opened file.

b) get_data_list(file_object, column_number)

In this function, you are required to read the file of Apple's data. The function is flexible as it can read the data for any column of the data (0 through 6). If you read column 6, you are gathering the data for the "Adjusted Daily Close". If you read column 5, you are gathering data for the "Volume" that day. The function returns a list that consists of tuples. Each tuple is of the form: (date, column_data), the first value is a string, the second is a float. For example: ('2013- 02-08', 474.98) if we were collecting data from column 6.

c) average_data(list_of_tuples)

In this function the parameter is a list, the list of tuples generated by get_data_list above.

You will average the data for each month, and generate a list of tuples. A tuple here will have the form: (data_avg, date), the first is a float, the second is a string. For example: (2972945.4545454546, '07:1985'). Note the date in the returned list does not contain a day any more.

Programming Project 04
Summer 2014

Because each month has multiple entries the biggest challenge is to collect the data for each month together. One way to is to have variables "current_month" and "current_year" and update them when the month changes. That is, read lines summing data for the "current_month" until you encounter a new month. Encountering a new month means that you are done summing data for the "current_month" so you can calculate an average for the "current_month". After calculating the average, you can now set "current_month" to a new month and start summing values for the new "current_month."

d) main()

In this function, you:

• call get_input to get a file descriptor
• prompt for the column to average
• call the get_data function
• call the average_data function
• print the highest 6 averages (for the column selected) and the lowest 6 averages. Print that data with the month-year information.

Attachment:- table.csv

Reference no: EM13810814

Questions Cloud

How do you determine whether an ordered pair : How do you determine whether an ordered pair is a solution to a linear equation?  Can you demonstrate with the below examples?Determine whether the ordered pair is a solution to the given equation.
What actions should jason take to obtain a contractor to do : Jason is the project manager for a manufacturing company. After several weeks of planning sessions, a team member informs him that a part of the production work needs to be outsourced to another company since they don't have the relevant expertise in..
Northcentral university : Promoting individual development and working to maintain a productive and positive working environment are vital. However, these are not the only functions of HR. Write a paper in which you identify other important HR functions and analyze how you vi..
Economic run size : A shop that makes candles offers a scented candle, which has a monthly demand of 360 boxes. Candles can be produced at a rate of 36 boxes per day. The shop operates 20 days a month. Assume that demand is uniform throughout the month. Setup cost is..
Process of sorting through large amounts of data : Data mining is the process of sorting through large amounts of data and picking out relevant information. Everyone from financial analysts to scientists use it to extract information from enormous data sets. These large data sets and the trend of ..
Give examples of local shared and core data : Consider an organization that you are familiar with. Give examples of local, shared, and core data. Why do you categorize the examples you chose into these categories? Does the organization manage data in these three categories as suggested in this c..
Describe the way the issue is framed : What is the main idea or issue of focus in your news articles? Remember to incorporate at least 5 related to themes of the course.2. Is any background history provided to establish or inform the political, economic, social and or historical cont..
How does goleman define self-awareness : How does Goleman define self-awareness
What horizontal distance from the shipwreck : What horizontal distance from the shipwreck should the raft be dropped in order to land near the survivors?

Reviews

Write a Review

Computer Engineering Questions & Answers

  Designing the calculator program

Develop a calculator program which will subtract, add, multiply, or divide two numbers input given by a user. If user selects division, do not permit the user to divide by 0

  Describe the impact of merging the domains into a single

question 1 how would you define an item on the internet with respect to a search statement and similarity

  Advantages and disadvantages of compilers and interpreters

choose a problematic situation that you think a Java program can solve or make easier.

  Write a program that demonstrates the coin class

Write a program that demonstrates the Coin class. The program should create an instance of the class and display the side that is initially facing up.

  Explain condition is met and a program to approximate

I just want to learn and see done programs with all these things. If their is a way to keep them mostly the same so I may compare to see differences this is good.

  Classify enterprise wide system in hospitals today

Classify enterprise wide system in hospitals today. Classify all external system that interface with the facility or organization.

  Design two concrete classes circle and square that inherits

design an abstract class called shape. this class must contain a single constructor that accepts a single numeric value

  Write down four logic functions that are true

Implement the four functions using a ROM.Assume that X consists of 3 bits, x2 x1 x0

  Writing c code to determine the balance

Write down a program in C++ which determines the balance because of each month on a non-interest loan. Ask user for the loan amount and how much s/he will pay each month.

  Rules of inference to show that the hypotheses

Use rules of inference to show that the hypotheses "If it does not rain or if it is not foggy, then the sailing race will be held and the lifesaving demonstration will go on,"

  Developing system for the classification problem

client asks your company in order to develop a system for classification problem (for example: medical insurance fraud detection). Because of user requirements, he requires the final developed system to be able to offer the explanations about syst..

  Build a java program that manipulates at least two objects

Create a Java program that manipulates at least two objects instantiated from the Dog class. Your class must include at least five private attributes, three setter and three getter methods which are the public interface and two constructors. All the ..

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd