Determine the CPI load latency, Electrical Engineering


Question:

(a) Define the following terms:
i. Branch
ii. Branch Prediction
iii. Branch Predictor
iv. Branch Misprediction

(b) Assume that 15% of instructions are loads and that 20% of the instructions following a load depend on its result and stall for 1 cycle. All instruction fetches and all loads hit in their respective first-level caches. Assume further that 20% of instructions are branches, of which 60% are taken and 40% are not taken. The penalty is 2 cycles for a not-taken branch and 3 cycles for a taken branch. In summary: 1 cycle is lost for 20% of the loads, 2 cycles are lost for each not-taken branch, and 3 cycles are lost for each taken branch.

(i) Determine the CPI load latency, CPI branches, CPI, and IPC.
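
One way to set up the arithmetic for (b)(i) is sketched below. It is only a sketch under one assumption that the question does not state: a pipeline with no stalls completes one instruction per cycle, so the base CPI is 1. Each stall contribution is then (fraction of instructions affected) x (cycles lost), and the constant names are illustrative.

```python
# Parameters taken from part (b) of the question.
LOAD_FRACTION = 0.15       # fraction of instructions that are loads
LOAD_USE_FRACTION = 0.20   # fraction of loads followed by a dependent instruction
LOAD_USE_STALL = 1         # cycles lost on a load-use dependence
BRANCH_FRACTION = 0.20     # fraction of instructions that are branches
TAKEN_FRACTION = 0.60      # fraction of branches that are taken
TAKEN_PENALTY = 3          # cycles lost on a taken branch
NOT_TAKEN_PENALTY = 2      # cycles lost on a not-taken branch

# ASSUMPTION (not stated in the question): without stalls the pipeline
# completes one instruction per cycle, so the base CPI is 1.
BASE_CPI = 1.0

# Average stall cycles per instruction caused by load-use dependences.
cpi_load_latency = LOAD_FRACTION * LOAD_USE_FRACTION * LOAD_USE_STALL

# Average stall cycles per instruction caused by branches.
cpi_branches = BRANCH_FRACTION * (
    TAKEN_FRACTION * TAKEN_PENALTY
    + (1 - TAKEN_FRACTION) * NOT_TAKEN_PENALTY
)

cpi = BASE_CPI + cpi_load_latency + cpi_branches
ipc = 1 / cpi

print(f"CPI load latency = {cpi_load_latency:.2f}")
print(f"CPI branches     = {cpi_branches:.2f}")
print(f"CPI              = {cpi:.2f}")
print(f"IPC              = {ipc:.3f}")
```

Under that base-CPI assumption, the load-latency term is 0.15 x 0.20 x 1 = 0.03, the branch term is 0.20 x (0.6 x 3 + 0.4 x 2) = 0.52, the total CPI is 1.55, and the IPC is about 0.645.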

(ii) A very simple optimization implementation for branches is to consider that they are not taken. There will be no penalty if indeed the branch is not taken, and there will still be a 3 cycle penalty if it is taken. Calculate the CPI branches, CPI, and IPC.
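
The branch-not-taken case in (ii) can be sketched the same way by setting the not-taken penalty to zero. The helper name `branch_cpi` is hypothetical, and the base CPI of 1 is again an assumption that the question does not state.

```python
def branch_cpi(branch_fraction, taken_fraction, taken_penalty, not_taken_penalty):
    """Average stall cycles per instruction caused by branches."""
    return branch_fraction * (
        taken_fraction * taken_penalty
        + (1 - taken_fraction) * not_taken_penalty
    )


# ASSUMPTION (not stated in the question): base CPI of an unstalled pipeline is 1.
BASE_CPI = 1.0

# Load-use stalls are unchanged by the branch strategy: 15% loads, 20% of
# them followed by a dependent instruction, 1 cycle each.
CPI_LOAD_LATENCY = 0.15 * 0.20 * 1

# Predict not taken: not-taken branches cost nothing, taken branches cost 3 cycles.
cpi_branches = branch_cpi(0.20, 0.60, taken_penalty=3, not_taken_penalty=0)
cpi = BASE_CPI + CPI_LOAD_LATENCY + cpi_branches
ipc = 1 / cpi

print(f"CPI branches = {cpi_branches:.2f}")
print(f"CPI          = {cpi:.2f}")
print(f"IPC          = {ipc:.3f}")
```

With these inputs the branch term drops to 0.20 x 0.6 x 3 = 0.36, giving a CPI of 1.39 and an IPC of about 0.719 under the stated assumption.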

(iii) Assuming that a branch-not-taken strategy has been implemented, plot CPI vs. branch misprediction cost when the latter varies between 3 and 20 cycles.
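
For (iii), the data points behind the plot can be generated with a short sketch like the one below. It assumes a base CPI of 1 (not stated in the question) and that only taken branches pay the misprediction cost, as in the branch-not-taken strategy. The function name `cpi_predict_not_taken` is hypothetical, and the text bars are only a stand-in for a real plot.

```python
def cpi_predict_not_taken(mispredict_penalty, base_cpi=1.0):
    """CPI under a branch-not-taken strategy for a given misprediction cost.

    ASSUMPTION: base_cpi=1.0 (one instruction per cycle without stalls)
    is not stated in the question.
    """
    load_stalls = 0.15 * 0.20 * 1                      # load-use stalls
    branch_stalls = 0.20 * 0.60 * mispredict_penalty   # taken branches only
    return base_cpi + load_stalls + branch_stalls


# One (penalty, CPI) pair for each misprediction cost from 3 to 20 cycles.
points = [(p, cpi_predict_not_taken(p)) for p in range(3, 21)]

# Crude text rendering: one '#' per 0.1 of CPI.
for penalty, cpi in points:
    bar = "#" * round(cpi * 10)
    print(f"{penalty:2d}  {cpi:.2f}  {bar}")
```

Under these assumptions the relation is linear, CPI = 1.03 + 0.12 x penalty, rising from 1.39 at 3 cycles to 3.43 at 20 cycles. The `points` list can be passed to any plotting tool to draw the requested curve.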

(iv) Do your computations in (iii) argue for sophisticated branch predictors when the pipelines become "deeper"?

(c) In (b), we assumed that the cache miss penalty was 20 cycles. With modern processors running at a frequency of 1 to 3 GHz, the cache miss penalty can reach several hundred cycles.

(i) Keeping all other parameters the same as in (b), plot CPI vs. cache miss penalty cost when the latter varies between 20 and 500 cycles.

(ii) Do your computations argue for the threat of a "memory wall" whereby loading instructions and data could potentially dominate the execution time?




All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd