We will compare the performance of a vector processor, Basic Computer Science

Assignment Help:
In this problem we will compare the performance of a vector processor with a system
that contains a scalar processor and a GPU-based coprocessor. In the hybrid system,
the host processor has superior scalar performance to the GPU, so in this case all scalar
code is executed on the host processor while all vector code is executed on the GPU.
We will refer to the rst system as the vector computer and the second system as the
hybrid computer.
Assume your target application contains a kernel with an arithmetic intensity of 0.5
FLOPs per DRAM byte accessed. However, the application also has a scalar component
which must be performed before and after the kernel in order to prepare the input
vectors and output vectors, respectively.
For a sample dataset, the scalar portion of the code requires 400 ms of execution time
on both the vector processor and the host processor in the hybrid system. The kernel
reads input vectors consisting of 200 MB and has output data consisting of 100 MB.
The vector processor has a peak memory bandwidth of 30 GB/s and the GPU has a
peak memory bandwidth of 150 GB/s. The hybrid system has an additional overhead
that requires all input vectors to be transferred between the host memory and GPU
local memory before and after the kernel is invoked. The hybrid system has a DMA
bandwidth of 10 GB/s and an average latency of 10 ms.Assume that both the vector processor and GPU are both performance bound by mem-
ory bandwidth. Compute the execution time for both computers for this application

Related Discussions:- We will compare the performance of a vector processor

Memory, Memory The memory unit is used for the storage of binary coded ...

Memory The memory unit is used for the storage of binary coded information. Information consists of instructions and data where: • Instructions are the coded pieces of infor

How can we create a fcb file?, Creating a new file For the formation of fil...

Creating a new file For the formation of files the 21H interruption 16H function is used. DX must identify a control structure whose necessities are that as a minimum the logic uni

Explain public code archive, Question 1 Explain the various tools used for...

Question 1 Explain the various tools used for open source software development Question 2 Discuss the OSS licensing strategies Question 3 Explain the software developmen

Democracy and ICT, Ask questio Discuss the idea of democracy and its overl...

Ask questio Discuss the idea of democracy and its overlapping meanings for IS design. How would you democratically design IS? How does participation help the practice of democrac

Write a note on file permissions on linux, Question 1 Write a note on a...

Question 1 Write a note on advantages and disadvantages of Linux 2 Write a note on "File Permissions" on linux 3 Explain the usage of following Linux commands Cd kill

Internetworking, You have been approached by Company XYZ to design and depl...

You have been approached by Company XYZ to design and deploy a new network that will span three cities in Queensland: Brisbane (12 users), Gold Coast (8 users), and Cairns (6 users

Rom, what is rom

what is rom

What is Semaphore and explain with example?, Description A semaphore is a...

Description A semaphore is a confined variable whose value can be accessed and changed only by the operations P and V and initialization operation known as ''Semaphoiinitislize''.

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd