We will compare the performance of a vector processor, Basic Computer Science

Assignment Help:
In this problem we will compare the performance of a vector processor with a system
that contains a scalar processor and a GPU-based coprocessor. In the hybrid system,
the host processor has superior scalar performance to the GPU, so in this case all scalar
code is executed on the host processor while all vector code is executed on the GPU.
We will refer to the rst system as the vector computer and the second system as the
hybrid computer.
Assume your target application contains a kernel with an arithmetic intensity of 0.5
FLOPs per DRAM byte accessed. However, the application also has a scalar component
which must be performed before and after the kernel in order to prepare the input
vectors and output vectors, respectively.
For a sample dataset, the scalar portion of the code requires 400 ms of execution time
on both the vector processor and the host processor in the hybrid system. The kernel
reads input vectors consisting of 200 MB and has output data consisting of 100 MB.
The vector processor has a peak memory bandwidth of 30 GB/s and the GPU has a
peak memory bandwidth of 150 GB/s. The hybrid system has an additional overhead
that requires all input vectors to be transferred between the host memory and GPU
local memory before and after the kernel is invoked. The hybrid system has a DMA
bandwidth of 10 GB/s and an average latency of 10 ms.Assume that both the vector processor and GPU are both performance bound by mem-
ory bandwidth. Compute the execution time for both computers for this application

Related Discussions:- We will compare the performance of a vector processor

Memory of computer, Memory : Computers store information internally as wel...

Memory : Computers store information internally as well as externally. Linked to CPU is an internal memory unit where all the instructions and data are stored. It contains the

Bus structure, which computer architecture use single bus structure??????? ...

which computer architecture use single bus structure??????? tells the name and little bit working

EM202073PRA65DBMS, HEY, i wanna discuss about DBMS assignment Did you find...

HEY, i wanna discuss about DBMS assignment Did you find a link of mySQL?

Elements of cpu, ELEMENTS OF CPU: The CPU is a complex IC chip having ...

ELEMENTS OF CPU: The CPU is a complex IC chip having millions of transistors. As mentioned earlier, a CPU has three major identifiable parts: Control Unit (CU), Arithmetic & l

Bidirectional search-artificial intelligence, Bidirectional Search-Artifici...

Bidirectional Search-Artificial intelligence: We've concentrated so far on searches where the point of the search is to search a solution, not the path to the solution. In anot

String problem, c program to convert S to palindromes with minimum number o...

c program to convert S to palindromes with minimum number of character replacements

Operating system problems, 1. In discussing software algorithms for mutual ...

1. In discussing software algorithms for mutual exclusion, we noted that optimizing compilers and out-of-order execution by processors could invalidate most of these algorithms bec

Simple Algorithm, How to make an assignment entitled "Decision Making: Fore...

How to make an assignment entitled "Decision Making: Forecasting" and I am required to make a pseudocode and flowchart based on the task.

Probability, Suppose that of 100 computers checked for viruses 30 where inf...

Suppose that of 100 computers checked for viruses 30 where infected and 70 where not. If we select 10 computers at random what is the probability that at most 4 of them are infecte

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd