Determine the CPI load latency, Electrical Engineering

Question:

(a) Define the following terms:
i. Branch
ii. Branch Prediction
iii. Branch Predictor
iv. Branch Misprediction

(b) Assume that 15% of instructions are loads and that 20% of the instructions following a load depend on its result and are stalled for 1 cycle. All instruction fetches and all loads hit in their respective first-level caches. Assume further that 20% of instructions are branches, with 60% of them taken and 40% not taken. The penalty is 2 cycles if the branch is not taken and 3 cycles if it is taken. In summary: 1 cycle is lost for 20% of the loads, 2 cycles are lost when a conditional branch is not taken, and 3 cycles are lost for each taken branch.

(i) Determine the CPI load latency, CPI branches, CPI, and IPC.
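One way to set up the arithmetic for (i) is sketched below in Python. It assumes an ideal, stall-free CPI of 1.0, which the question does not state explicitly, and treats each stall source as an additive contribution. All variable names are illustrative.

```python
# Assumption (not stated in the question): the pipeline's ideal CPI is 1.0.
BASE_CPI = 1.0

load_frac = 0.15          # fraction of instructions that are loads
dependent_frac = 0.20     # fraction of loads followed by a dependent instruction
load_stall = 1            # stall cycles for each such load

branch_frac = 0.20        # fraction of instructions that are branches
taken_frac = 0.60         # fraction of branches that are taken
taken_penalty = 3         # cycles lost per taken branch
not_taken_penalty = 2     # cycles lost per not-taken branch

# Average stall cycles per instruction from each source.
cpi_load = load_frac * dependent_frac * load_stall
cpi_branch = branch_frac * (
    taken_frac * taken_penalty + (1 - taken_frac) * not_taken_penalty
)

cpi = BASE_CPI + cpi_load + cpi_branch
ipc = 1 / cpi

print(f"CPI load latency = {cpi_load:.2f}")   # 0.03
print(f"CPI branches     = {cpi_branch:.2f}") # 0.52
print(f"CPI              = {cpi:.2f}")        # 1.55
print(f"IPC              = {ipc:.3f}")        # 0.645
```

Under these assumptions the branch term dominates the load term by more than an order of magnitude.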

(ii) A very simple optimization for branches is to assume that they are not taken. There is then no penalty if the branch is indeed not taken, and there is still a 3-cycle penalty if it is taken. Recalculate the CPI branches, CPI, and IPC.
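A minimal sketch of the calculation for (ii), again assuming an ideal CPI of 1.0 (an assumption, not given in the question) and the same load-stall contribution as in (b). Only taken branches now cost cycles. The function name is hypothetical.

```python
def predict_not_taken_cpi(base_cpi=1.0, load_frac=0.15, dependent_frac=0.20,
                          load_stall=1, branch_frac=0.20, taken_frac=0.60,
                          taken_penalty=3):
    """Return (cpi_branch, cpi, ipc) for a predict-not-taken scheme.

    base_cpi=1.0 is an assumption; the other defaults come from the question.
    """
    cpi_load = load_frac * dependent_frac * load_stall
    # Not-taken branches are predicted correctly and cost nothing.
    cpi_branch = branch_frac * taken_frac * taken_penalty
    cpi = base_cpi + cpi_load + cpi_branch
    return cpi_branch, cpi, 1 / cpi


cpi_branch, cpi, ipc = predict_not_taken_cpi()
print(f"CPI branches = {cpi_branch:.2f}")  # 0.36
print(f"CPI          = {cpi:.2f}")         # 1.39
print(f"IPC          = {ipc:.3f}")         # 0.719
```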

(iii) Assuming that a branch-not-taken strategy has been implemented, plot CPI vs. branch misprediction cost when the latter varies between 3 and 20 cycles.
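The data for the plot in (iii) can be tabulated with a short script such as the one below. It assumes an ideal CPI of 1.0 (not stated in the question) and treats the misprediction cost as the penalty paid on every taken branch. The function name is hypothetical, and the printed table can be fed to any plotting tool.

```python
def cpi_for_penalty(penalty, base_cpi=1.0):
    """CPI under predict-not-taken for a given misprediction penalty (cycles).

    base_cpi=1.0 is an assumption; the fractions come from the question.
    """
    cpi_load = 0.15 * 0.20 * 1          # load-use stalls
    cpi_branch = 0.20 * 0.60 * penalty  # only taken branches are mispredicted
    return base_cpi + cpi_load + cpi_branch


# Tabulate CPI for misprediction costs from 3 to 20 cycles.
for penalty in range(3, 21):
    print(f"{penalty:2d} cycles -> CPI = {cpi_for_penalty(penalty):.2f}")
```

With these numbers CPI grows linearly, by 0.12 per extra cycle of misprediction cost, from 1.39 at 3 cycles to 3.43 at 20 cycles.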

(iv) Do your computations in (iii) argue for sophisticated branch predictors when the pipelines become "deeper"?

(c) In (b), we assumed that the cache miss penalty was 20 cycles. With modern processors running at a frequency of 1 to 3 GHz, the cache miss penalty can reach several hundred cycles.

(i) Keeping all other parameters the same as in (b), plot CPI vs. cache miss penalty cost when the latter varies between 20 and 500 cycles.

(ii) Do your computations argue for the threat of a "memory wall" whereby loading instructions and data could potentially dominate the execution time?



All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd