What is speedup in throughput for each of these improvements

Assignment Help Electrical Engineering
Reference no: EM13218085

Assume a GPU architecture that contains 10 SIMD processors. Each SIMD instruction has a width of 32 and each SIMD processor contains 8 lanes for single-precision arithmetic and load/store instructions, meaning that each non- diverged SIMD instruction can produce 32 results every 4 cycles.Assume a kernel that has divergent branches that causes on average 80% of threads to be active. Assume that 70% of all SIMD instructions executed are single-precision arithmetic and 20% are load/store. Since not all memory latencies are covered, assume an average SIMD instruction issue rate of 0.85. Assume that the GPU has a clock speed of 1.5 GHz.

Questions :

(1) Compute the throughput, in GFLOP/sec, for this kernel on this GPU.

(2) Assume that you have the following choices:

(1) Increasing the number of single precision lanes to 16

(2) Increasing the number of SIMD processors to 15 (assume this change doesn't affect any other performance metrics and that the code scales to the additional processors)

(3) Adding a cache that will effectively reduce memory latency by 40%, which will increase

instruction issue rate to 0.95

What is speedup in throughput for each of these improvements?

 

Reference no: EM13218085

Questions Cloud

Involvement in the insurance process : What are the implications of simultaneous federal and state involvement in the insurance process?
What current in amperes will flow through the bulbs : A 100 W bulb takes 0.833A and a 200 W bulb takes 1.67 A from a 120V source. If the two bulb were connected across a 240 V source, what current in amperes will flow through the bulbs?
Describe the muslim presence in africa in 18th-19th century : Describe the Muslim presence in Africa in the 18th and 19th century. How did Muslims make their money? How did they spread their influence? What areas of Africa were under Muslim control?
Calculate the amout of mmf generated by a coil : calculate the amout of mmf generated by a coil of wire having 1300 turns and carrying milliamperes of current?
What is speedup in throughput for each of these improvements : Increasing the number of SIMD processors to 15 (assume this change doesn't affect any other performance metrics and that the code scales to the additional processors)
The treaty of guadalupe hidalgo : The Treaty of Guadalupe Hidalgo... George Washington's success as a general is most accurately explained by.Benjamin Franklin.
What type of cable is necessary for each connection : What type of cable is necessary for each connection: straight or crossover? You can assume that S1 and S2 do not have the ability to resolve crossovers (called Auto-MDIX).
Calculate the minimum sampling interval : calculate the minimum sampling interval (time period for counting the pulse train) to achieve a speed resolution of 5 rpm, and the number of bits that must be there in the binary counter if the pulse train is to be sampled with this interval for a..
Calculate the minimum sampling interval : calculate the minimum sampling interval (time period for counting the pulse train) to achieve a speed resolution of 5 rpm, and the number of bits that must be there in the binary counter if the pulse train is to be sampled with this interval for a..

Reviews

Write a Review

Electrical Engineering Questions & Answers

  Explain can protons every flow through current

Can protons every flow through current? Even though in the overwhelming majority of cases the current flowing through a material is dominated by the movement of electrons"

  A batch process which involves filling a tank with liquid

A batch process which involves filling a tank with liquid, mixing the liquid and draining the tank, is automated with a PLC. The specific sequence of events is as follows

  Explain why the short-circuit method for determining

Explain why the short-circuit method for determining the Thevenin resistance of a circuit cannot be applied if the circuit does not contain any independent current or voltage sources.

  Draw the impedance diagram in per-unit system

Draw the per-phase impedance diagram of the power system and Draw the impedance diagram in per-unit system

  Surface integral for jet engine

Surface integral for jet engine.  The air-gas exhaust velocity from a jet engine varies linearly from a maximum of 300 m/s at the center of the circular exhaust opening to zero at the edges. If the exhaust diameter is 1.6 m, find  thq  exhaust flow.

  Explain a first-order circuit

A first-order circuit, having a gain of 10 at dc and a gain of 1 at infinite frequency, has its pole at 20 kHz. Find its transfer function.

  Calculate the unit-pulse response

Compute the unit-pulse response h[n] for the discrete-time system y[n+2]2y[n+1]+y[n]=x[n] (for n = 0, 1, 2, 3).

  Explain explanation how a parallel rlc filter circuit

Please explanation how a Parallel RLC Filter circuit behave at Low Frequencies, Resonant Frequency and High Frequencies

  Find out the voltage required on the input

Determine the voltage required on the input of the amplifier to give the maximum output of 85 dBuV. If the signal level from the aerial is 5 dBmV and the input noise level is 20 dBuV, calculate the signal-to-noise ratio on the output of the amplifi..

  Define dynamic ram that must be given a refresh cycle

Dynamic RAM that must be given a refresh cycle 64 times per ms. Each refresh operation requires 150 ns. memory cycle requires 250 ns.

  Determine the bit error rate performance

A signal is modulated by BPSK and transmitted over an AWGN channel. At the receiver, the bit error rate performance is 0.001

  Laplace transforms enable interpretation and manipulation

Laplace transforms enable interpretation and manipulation of different signals by viewing these signals as either time domain signals/pulse or else frequency domain representations

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd