We will compare the performance of a vector processor, Basic Computer Science

In this problem we will compare the performance of a vector processor with a system
that contains a scalar processor and a GPU-based coprocessor. In the hybrid system,
the host processor has superior scalar performance to the GPU, so in this case all scalar
code is executed on the host processor while all vector code is executed on the GPU.
We will refer to the first system as the vector computer and the second system as the
hybrid computer.
Assume your target application contains a kernel with an arithmetic intensity of 0.5
FLOPs per DRAM byte accessed. However, the application also has a scalar component
which must be performed before and after the kernel in order to prepare the input
vectors and output vectors, respectively.
For a sample dataset, the scalar portion of the code requires 400 ms of execution time
on both the vector processor and the host processor in the hybrid system. The kernel
reads 200 MB of input vectors and writes 100 MB of output data.
The vector processor has a peak memory bandwidth of 30 GB/s and the GPU has a
peak memory bandwidth of 150 GB/s. The hybrid system has an additional overhead
that requires all input vectors to be transferred between the host memory and GPU
local memory before and after the kernel is invoked. The hybrid system has a DMA
bandwidth of 10 GB/s and an average latency of 10 ms.
Assume that the vector processor and the GPU are both performance bound by memory
bandwidth. Compute the execution time for both computers for this application.
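
One way to work through the numbers is sketched below. It is not an official solution, and it rests on several assumptions that the problem statement does not spell out: decimal units (1 GB = 1000 MB), a kernel time equal to total DRAM traffic divided by peak bandwidth, one DMA transfer of the inputs to the GPU before the kernel and one DMA transfer of the outputs back afterwards, the 10 ms latency paid once per transfer, and no overlap between scalar code, transfers, and the kernel. The helper name `transfer_ms` is made up for this illustration.

```python
# Sketch of the execution-time estimate under the assumptions stated above.
MB = 1e6  # bytes, assuming decimal units
GB = 1e9  # bytes, assuming decimal units


def transfer_ms(num_bytes, bandwidth_bytes_per_s, latency_ms=0.0):
    """Time in ms to move num_bytes at the given bandwidth, plus a fixed latency."""
    return latency_ms + num_bytes / bandwidth_bytes_per_s * 1e3


SCALAR_MS = 400.0          # scalar portion, same on both systems
INPUT_BYTES = 200 * MB     # kernel input vectors
OUTPUT_BYTES = 100 * MB    # kernel output data
kernel_bytes = INPUT_BYTES + OUTPUT_BYTES  # total DRAM traffic of the kernel

# Vector computer: scalar code plus a memory-bound kernel at 30 GB/s.
vector_kernel_ms = transfer_ms(kernel_bytes, 30 * GB)
vector_total_ms = SCALAR_MS + vector_kernel_ms

# Hybrid computer: scalar code, DMA in, memory-bound kernel at 150 GB/s, DMA out.
gpu_kernel_ms = transfer_ms(kernel_bytes, 150 * GB)
dma_in_ms = transfer_ms(INPUT_BYTES, 10 * GB, latency_ms=10.0)    # assumed: one transfer
dma_out_ms = transfer_ms(OUTPUT_BYTES, 10 * GB, latency_ms=10.0)  # assumed: one transfer
hybrid_total_ms = SCALAR_MS + dma_in_ms + gpu_kernel_ms + dma_out_ms

print(f"vector computer: {vector_total_ms:.1f} ms")
print(f"hybrid computer: {hybrid_total_ms:.1f} ms")
```

Under these assumptions the vector computer takes 400 + 10 = 410 ms, and the hybrid computer takes 400 + 30 + 2 + 20 = 452 ms. The GPU kernel is five times faster, but the DMA transfers cost more than the kernel time saved. The arithmetic intensity of 0.5 FLOPs per byte does not enter the time estimate once both kernels are taken to be memory-bandwidth bound.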


All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd