Write a map-reduce program to get the minimum salary

Reference no: EM133249898

Problem 1

Input Data File: dept_salary.txt (attached to this assignment)
The first column of the data file holds a department id and the second column holds a salary amount. Write a map-reduce program that finds the MINIMUM salary per department. Submit a Python mapper and reducer similar to the ones given in lab, along with detailed instructions on how to use them.
Expected files to be submitted:
1. Mapper in Python
2. Reducer in Python
3. A file of instructions explaining how to use the mapper, the reducer, and the data file
Grading rubric:
- Correct mapper code
- Correct reducer code
- Executable MapReduce job
- Correct output
- Complete set of instructions showing all steps that need to be run to produce a result (10 pts)
- Screenshot provided that matches the execution of the submitted mapper and reducer code
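
One possible shape for the Problem 1 mapper and reducer is sketched below. It is not the lab's reference solution. It assumes that the columns in dept_salary.txt are separated by commas or whitespace, that any line whose second column is not numeric (such as a header) can be skipped, and that the job runs in the Hadoop Streaming style, where the reducer receives tab-separated lines already sorted by key. The names `map_lines`, `reduce_lines`, and the file name `min_salary.py` are illustrative only; for submission the two functions would be split into separate mapper and reducer files.

```python
#!/usr/bin/env python3
"""Minimum salary per department: a streaming-style mapper and reducer sketch.

Local dry run without Hadoop (min_salary.py is a hypothetical file name):
    cat dept_salary.txt | python3 min_salary.py map | sort | python3 min_salary.py reduce
"""
import re
import sys


def map_lines(lines):
    """Emit 'dept<TAB>salary' for every line that has a numeric salary."""
    for line in lines:
        # Assumption: columns are separated by commas and/or whitespace.
        fields = re.split(r"[,\s]+", line.strip())
        if len(fields) < 2:
            continue
        dept, salary = fields[0], fields[1]
        try:
            float(salary)
        except ValueError:
            continue  # skip headers and malformed rows
        yield f"{dept}\t{salary}"


def reduce_lines(lines):
    """Emit 'dept<TAB>min_salary'; input must be grouped (sorted) by dept."""
    current_dept = None
    min_value = None
    min_text = None
    for line in lines:
        dept, _, text = line.rstrip("\n").partition("\t")
        try:
            value = float(text)
        except ValueError:
            continue
        if dept != current_dept:
            # A new key starts: flush the previous department first.
            if current_dept is not None:
                yield f"{current_dept}\t{min_text}"
            current_dept, min_value, min_text = dept, value, text
        elif value < min_value:
            min_value, min_text = value, text
    if current_dept is not None:
        yield f"{current_dept}\t{min_text}"


if __name__ == "__main__" and len(sys.argv) > 1:
    step = {"map": map_lines, "reduce": reduce_lines}.get(sys.argv[1])
    if step is not None:
        for out in step(sys.stdin):
            print(out)
```

The reducer keeps only one running minimum per department, so memory use stays constant regardless of how many salaries a department has. It prints the salary exactly as it appeared in the input rather than a reformatted number.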

Problem 2

Input Data File: shakespeare_100.txt (download it from this link)
In the word count example we output every word with its count. What if we only want the 20 words with the highest counts? (This is a very common problem in industry.) Think about using multiple map-reduce steps. Submit Python mapper(s) and reducer(s) similar to the ones given in lab, along with detailed instructions on how to use them. If you use multiple map-reduce steps, include an explanation of the steps in your documentation.
Expected files to be submitted:
1. Mapper(s) in Python
2. Reducer(s) in Python
3. A file of instructions explaining how to use the mapper(s), reducer(s), and the data file

Grading rubric:
- Correct mapper code. If there are multiple mappers, the total grade will be split between them.
- Correct reducer code. If there are multiple reducers, the total grade will be split between them.
- Executable MapReduce job(s). If there are multiple execution steps, the total grade will be split between them.
- Correct output(s). If there are multiple execution steps, the total grade will be split between them.
- Complete set of instructions showing all steps that need to be run to produce a result
- Screenshot provided that matches the execution(s) of the submitted mapper(s) and reducer(s)
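
The multi-step idea in Problem 2 can be sketched as two chained jobs: step 1 is an ordinary word count, and step 2 reduces the (word, count) pairs to the top 20. The code below is a minimal sketch under several assumptions, not the lab's solution: words are lowercased and tokenized with a simple regular expression (the lab's word count may split differently), ties are broken alphabetically, and step 2 runs with a single reducer so that one process sees every candidate. The function names and the file name `top_words.py` are illustrative only.

```python
#!/usr/bin/env python3
"""Top-N words in two map-reduce steps: a streaming-style sketch.

Local dry run without Hadoop (top_words.py is a hypothetical file name):
    cat shakespeare_100.txt | python3 top_words.py count_map | sort \
        | python3 top_words.py count_reduce \
        | python3 top_words.py top_map | sort | python3 top_words.py top_reduce
"""
import heapq
import re
import sys

TOP_N = 20
# Assumption: a word is a run of letters, optionally with inner apostrophes.
WORD_RE = re.compile(r"[a-z]+(?:'[a-z]+)*")


def count_map(lines):
    """Step 1 mapper: emit 'word<TAB>1' for every word."""
    for line in lines:
        for word in WORD_RE.findall(line.lower()):
            yield f"{word}\t1"


def count_reduce(lines):
    """Step 1 reducer: sum counts per word; input must be sorted by word."""
    current, total = None, 0
    for line in lines:
        word, _, value = line.rstrip("\n").partition("\t")
        if word != current:
            if current is not None:
                yield f"{current}\t{total}"
            current, total = word, 0
        total += int(value)
    if current is not None:
        yield f"{current}\t{total}"


def _parse(lines):
    """Turn 'word<TAB>count' lines into (word, count) pairs."""
    for line in lines:
        word, _, value = line.rstrip("\n").partition("\t")
        yield word, int(value)


def _top(lines, n):
    """Keep the n pairs with the highest count, ties broken alphabetically."""
    best = heapq.nsmallest(n, _parse(lines), key=lambda p: (-p[1], p[0]))
    for word, count in best:
        yield f"{word}\t{count}"


def top_map(lines, n=TOP_N):
    """Step 2 mapper: each mapper forwards only its local top n."""
    return _top(lines, n)


def top_reduce(lines, n=TOP_N):
    """Step 2 reducer: merge the local lists into the global top n."""
    return _top(lines, n)


if __name__ == "__main__" and len(sys.argv) > 1:
    steps = {"count_map": count_map, "count_reduce": count_reduce,
             "top_map": top_map, "top_reduce": top_reduce}
    step = steps.get(sys.argv[1])
    if step is not None:
        for out in step(sys.stdin):
            print(out)
```

Having each step 2 mapper forward only its local top 20 keeps the data sent to the single reducer small: any word in the global top 20 must also be in the top 20 of the mapper that holds its final count, because step 1 already produced exactly one line per word.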

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd