Improve the performance of the kernel developed

Reference no: EM131904813

Parallel Computer Architecture Programming Assignment: Equation Solver

Consider the Gauss-Seidel equation solver discussed in the lecture notes on how to write parallel programs (see the file called parallelization process.pdf on BBLearn). The source code for the reference implementation is in the zip file on BBLearn.

Recall that the order in which the grid points are updated in the sequential algorithm is not fundamental to the Gauss-Seidel solution method; it is simply one possible ordering that is convenient to program sequentially. Since the Gauss-Seidel method is not an exact solution method but rather iterates until convergence, we can update the grid points in a different order as long as we use updated values for grid points frequently enough. One such variation is the Jacobi method, in which we don't use updated values from the current iteration for any grid points but always use the values as they were at the end of the previous iteration. Using the sequential program as a starting point, develop a parallel version of the Jacobi method using an element-based decomposition strategy in which each GPU thread is responsible for processing a single grid element.
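The element-based decomposition described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the reference implementation: the five-point averaging rule with weight 0.2, the fixed boundary values, and the fixed number of sweeps (in place of a convergence test) are all assumed here, and the names jacobi_step_naive and jacobi_naive are hypothetical. The key point is that each thread reads only from the previous iteration's grid and writes to a separate output grid, which is what distinguishes Jacobi from Gauss-Seidel.

```cuda
#include <cuda_runtime.h>
#include <cassert>
#include <utility>
#include <vector>

// One Jacobi sweep using global memory only: one thread per grid element.
// ASSUMPTION: five-point averaging rule; the assignment's reference code
// defines the actual update rule.
__global__ void jacobi_step_naive(const float *in, float *out, int n)
{
    int col = blockIdx.x * blockDim.x + threadIdx.x;
    int row = blockIdx.y * blockDim.y + threadIdx.y;
    if (row >= n || col >= n) return;

    int idx = row * n + col;
    if (row == 0 || row == n - 1 || col == 0 || col == n - 1) {
        out[idx] = in[idx];  // ASSUMPTION: boundary points are held fixed
        return;
    }
    // Every operand comes from the previous iteration's grid (`in`).
    out[idx] = 0.2f * (in[idx] + in[idx - 1] + in[idx + 1] +
                       in[idx - n] + in[idx + n]);
}

// Hypothetical host helper: runs `iters` sweeps, swapping the two device
// buffers after each one so the newest values are always read next.
std::vector<float> jacobi_naive(const std::vector<float> &grid, int n, int iters)
{
    assert(grid.size() == static_cast<size_t>(n) * n);
    size_t bytes = grid.size() * sizeof(float);
    float *d_a = nullptr, *d_b = nullptr;
    cudaMalloc(&d_a, bytes);
    cudaMalloc(&d_b, bytes);
    cudaMemcpy(d_a, grid.data(), bytes, cudaMemcpyHostToDevice);

    dim3 block(16, 16);
    dim3 blocks((n + block.x - 1) / block.x, (n + block.y - 1) / block.y);
    for (int it = 0; it < iters; ++it) {
        jacobi_step_naive<<<blocks, block>>>(d_a, d_b, n);
        std::swap(d_a, d_b);  // d_a now holds the newest values
    }

    std::vector<float> result(grid.size());
    cudaMemcpy(result.data(), d_a, bytes, cudaMemcpyDeviceToHost);
    cudaFree(d_a);
    cudaFree(d_b);
    return result;
}
```

A real solver would replace the fixed sweep count with the convergence check from the reference code, which typically requires reducing a per-element difference across the whole grid after each sweep.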

The program provided to you accepts no arguments. It creates a randomly initialized grid of N × N elements and applies the update rule to each element within the grid until the specified convergence criterion is satisfied. The solution produced by the GPU is compared with the one generated by the CPU by printing out the relevant statistics.

Answer the following questions.

1. Edit the compute_on_device() function in the file solver.cu and the solver_kernel_naive() function in the file solver_kernel.cu to complete the functionality of the equation solver on the GPU using only global memory.

2. Improve the performance of the kernel developed in the previous step by using shared memory on the GPU. Edit the kernel function solver_kernel_optimized() in the file solver_kernel.cu to complete the functionality.

3. Upload all of the files needed to run your code on BBLearn as a single zip file. Submit a short report describing: (1) the design of your kernels using code or pseudocode to clarify the discussion; (2) the speedup obtained over the serial version for both the naive and optimized kernels, for grid sizes of 2048 × 2048, 4096 × 4096, and 8192 × 8192; and (3) sensitivity of your kernels to thread-block size in terms of the execution time.
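The shared-memory optimization asked for in question 2 can be sketched as a tiled kernel in which each thread block first stages its portion of the grid, plus a one-element halo, into shared memory and then computes from there. This is only one possible design under stated assumptions: the five-point averaging rule with weight 0.2, fixed boundary values, a fixed sweep count, and the names TILE, jacobi_step_tiled, and jacobi_tiled are all hypothetical and not taken from the assignment's source files.

```cuda
#include <cuda_runtime.h>
#include <cassert>
#include <utility>
#include <vector>

constexpr int TILE = 16;  // hypothetical thread-block edge length

// One Jacobi sweep; must be launched with TILE x TILE thread blocks.
// ASSUMPTION: five-point averaging rule with fixed boundary values.
__global__ void jacobi_step_tiled(const float *in, float *out, int n)
{
    // The block's tile plus a one-element halo on each side.
    __shared__ float tile[TILE + 2][TILE + 2];

    int col = blockIdx.x * TILE + threadIdx.x;
    int row = blockIdx.y * TILE + threadIdx.y;
    int tx = threadIdx.x + 1;  // position inside the padded tile
    int ty = threadIdx.y + 1;
    bool inside = (row < n && col < n);

    if (inside) {
        tile[ty][tx] = in[row * n + col];
        // Threads on a tile edge also fetch the adjacent halo element.
        if (threadIdx.x == 0 && col > 0)
            tile[ty][0] = in[row * n + col - 1];
        if (threadIdx.x == TILE - 1 && col < n - 1)
            tile[ty][tx + 1] = in[row * n + col + 1];
        if (threadIdx.y == 0 && row > 0)
            tile[0][tx] = in[(row - 1) * n + col];
        if (threadIdx.y == TILE - 1 && row < n - 1)
            tile[ty + 1][tx] = in[(row + 1) * n + col];
    }
    __syncthreads();  // reached by every thread in the block

    if (!inside) return;
    bool boundary = (row == 0 || row == n - 1 || col == 0 || col == n - 1);
    out[row * n + col] = boundary
        ? tile[ty][tx]
        : 0.2f * (tile[ty][tx] + tile[ty][tx - 1] + tile[ty][tx + 1] +
                  tile[ty - 1][tx] + tile[ty + 1][tx]);
}

// Hypothetical host helper: runs `iters` sweeps with two swapped buffers.
std::vector<float> jacobi_tiled(const std::vector<float> &grid, int n, int iters)
{
    assert(grid.size() == static_cast<size_t>(n) * n);
    size_t bytes = grid.size() * sizeof(float);
    float *d_a = nullptr, *d_b = nullptr;
    cudaMalloc(&d_a, bytes);
    cudaMalloc(&d_b, bytes);
    cudaMemcpy(d_a, grid.data(), bytes, cudaMemcpyHostToDevice);

    dim3 block(TILE, TILE);
    dim3 blocks((n + TILE - 1) / TILE, (n + TILE - 1) / TILE);
    for (int it = 0; it < iters; ++it) {
        jacobi_step_tiled<<<blocks, block>>>(d_a, d_b, n);
        std::swap(d_a, d_b);  // d_a now holds the newest values
    }

    std::vector<float> result(grid.size());
    cudaMemcpy(result.data(), d_a, bytes, cudaMemcpyDeviceToHost);
    cudaFree(d_a);
    cudaFree(d_b);
    return result;
}
```

Because TILE fixes both the thread-block shape and the shared-memory footprint, rebuilding with different values of TILE is one straightforward way to explore the thread-block-size sensitivity mentioned in item 3. Whether this tiling actually outperforms the global-memory version depends on the hardware's caching behavior and should be measured rather than assumed.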






All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd