Improve the performance of the kernel developed

Assignment Help Computer Engineering
Reference no: EM131904813

Parallel Computer Architecture Programming Assignment: Equation Solver

Consider the Gauss-Seidel equation solver discussed within the lecture notes on how to write parallel programs-see the file called parallelization process.pdf on BBLearn. You will find the source code for the reference implementation in the zip file on BBLearn.

Recall that the order in which the grid points are updated in the sequential algorithm is not funda- mental to the Gauss-Seidel solution method; it is simply one possible ordering that is convenient to program sequentially. Since the Gauss-Seidel method is not an exact solution method but rather iterates until convergence, we can update the grid points in a different order as long as we use updated values for grid points frequently enough, a technique called the Jacobi method where we don't use updated values from the current iteration for any grid points but always use the values as they were at the end of the previous iteration. Using the sequential program as a starting point, develop a parallel version of the Jacobi method using an element-based decomposition strategy where each GPU thread is responsible for processing a single grid element.

The program provided to you accepts no arguments. It creates a randomly initialized grid of N N elements and applies the update rule to each element within the grid until the specified convergence criteria is satisfied. The solution provided by the GPU is compared to that generated by the CPU by printing out the relevant statistics.

Answer the following questions.

1. Edit the compute on device() function in the file solver.cu and the solver kernel naive() function in solver kernel.cu file to complete the functi- onality of the equation solver on the GPU using only global memory.

2. Improve the performance of the kernel developed in the previous step by using shared memory on the GPU. Edit the kernel function solver kernel optimized() in the solver kernel.cu file to complete the functionality.

3. Upload all of the files needed to run your code on BBLearn as a single zip file. Submit a short report describing: (1) the design of your kernels using code or pseudocode to clarify the discussion; (2) the speedup obtained over the serial version for both the naive and optimized kernels, for grid sizes of 2048 2048, 4096 4096, and 8192 8192; and (3) sensitivity of your kernels to thread-block size in terms of the execution time.

Reference no: EM131904813

Questions Cloud

Reducing pollution well below the current industrys standard : Reducing pollution well below the current industrys standard and Eliminating pollution caused by coal mining pollution that ends up in the air
Develop a formal outline for your paper : Read assignment instructions for Paper. Develop a formal outline for your Paper writing in full sentences (12 point font, Times New Roman font, double spaced).
Explain description of the aesthetics and packaging : A description of the aesthetics, packaging, labeling, or look, of your product. For a service describe the service offering so that the reader understands.
Calculate an appropriate measure of association : A research team conducted a study of soft-drink preferences among residents in a test market prior to an advertising campaign for a new cola product.
Improve the performance of the kernel developed : Improve the performance of the kernel developed in the previous step by using shared memory on the GPU. Edit the kernel function solver kernel optimized().
Discuss the marketing communications process : MGT104 Intro to Applied Marketing - Discuss each of these objectives and their purpose in relation to the Wannaburger business
How does the globalization impact pricing : "Globalization is a very interesting topic and can get very heated," you explain. "There are some countries and companies who are fiercely loyal to their homes.
How to improve the long-term customer relationships : Every company gathers information on customer preferences and buying history in order to improve their long-term customer relationships.
Discuss what you feel is the most effective way for a leader : After watching the videos, discuss what you feel is the most effective way for a leader to implement organizational change.

Reviews

Write a Review

Computer Engineering Questions & Answers

  Write your own code to perform the k-core

Write your own code to perform the k-core and onion decompositions of the network. Plot the number of nodes of in the layer l on the y-axis vs l on the x-axis.

  You will be developing a human elements in it strategy

you will select a real or fictitious organization and apply your research to the development of a strategy that would

  What are three common problems occurring on the windows

research common problems that occur with windows and create a well organized powerpoint presentation. for your

  Describe the common web server threats and vulnerabilities

Describe the common Web server threats and vulnerabilities, Describe the common threats and vulnerabilities for Web browsers.

  Identify each form of data and signal

Identify each form of data and signal. Are they analog? Digital? What data/signal conversions are taking place? Where are these conversions?

  Suppose a computer using a set-associative cache

assume that a computer using a set-associative cache has 2^16 words of main memory and a cache of 32 blocks, where each cache block contains 8 words.

  Describe the function and purpose of each component

describe how the components are related to each other. For example, does the component you are describing depend on another to operate? Is it connected to the network physically or wirelessly.

  Explain the significance of single- and double-quoting

Explain the significance of single- and double-quoting, including when one is preferred to the other. What are the two consequences of using double quotes?

  Explain two specific products from different vendors

At this point, you have been introduced to various security tools. You are to take one of the five identified categories of tools and identify two specific products from different vendors.

  Define in detail minimum five practices

Your Web design firm has just hired a talented graphic artist with years of expertise in print media and no experience with producing image production. You have to train this person in the best practices for Web graphics. Explain in detail minimum..

  Discuss the united states hold in this security environment

the most important advantages and deficits the united states hold in this security environment

  Create a static one for one nat to a pc at toledo

Create a static one for one NAT to a PC at Toledo. Ping outside address from a PC connected to classroom network. What device is actually replying to the ping?

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd