Determine the CPI load latency, Electrical Engineering

Question:

(a) Define the following terms:
i. Branch
ii. Branch Prediction
iii. Branch Predictor
iv. Branch Misprediction

(b) Assume that 15% of instructions are loads and that 20% of the instructions following a load depend on its result and are stalled for 1 cycle. All instructions and all loads hit in their respective first-level caches. Assume further that 20% of instructions are branches, 60% of which are taken and 40% of which are not taken. The penalty is 2 cycles if the branch is not taken and 3 cycles if it is taken. In summary, 1 cycle is lost for 20% of the loads, 2 cycles are lost for each branch that is not taken, and 3 cycles are lost for each taken branch.

(i) Determine the CPI load latency, CPI branches, CPI, and IPC.
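One way to set up the arithmetic for (i) is sketched below. It assumes an ideal pipeline with a base CPI of 1 when no stalls occur; the question does not state this, so treat it as an assumption. The function and constant names are illustrative only.

```python
# Sketch of the CPI calculation for part (b)(i).
# ASSUMPTION (not stated in the question): the ideal pipeline has a base CPI of 1.
BASE_CPI = 1.0

LOAD_FRACTION = 0.15        # 15% of instructions are loads
DEPENDENT_FRACTION = 0.20   # 20% of loads are followed by a dependent instruction
LOAD_STALL_CYCLES = 1       # each such dependence costs 1 cycle

BRANCH_FRACTION = 0.20      # 20% of instructions are branches
TAKEN_FRACTION = 0.60       # 60% of branches are taken
TAKEN_PENALTY = 3           # cycles lost per taken branch
NOT_TAKEN_PENALTY = 2       # cycles lost per not-taken branch


def cpi_load_latency():
    """Average stall cycles per instruction caused by load-use dependences."""
    return LOAD_FRACTION * DEPENDENT_FRACTION * LOAD_STALL_CYCLES


def cpi_branches():
    """Average stall cycles per instruction caused by branches."""
    per_branch = (TAKEN_FRACTION * TAKEN_PENALTY
                  + (1 - TAKEN_FRACTION) * NOT_TAKEN_PENALTY)
    return BRANCH_FRACTION * per_branch


cpi_load = cpi_load_latency()
cpi_branch = cpi_branches()
cpi = BASE_CPI + cpi_load + cpi_branch
ipc = 1 / cpi

print(f"CPI load latency = {cpi_load:.2f}")   # 0.03
print(f"CPI branches     = {cpi_branch:.2f}") # 0.52
print(f"CPI              = {cpi:.2f}")        # 1.55
print(f"IPC              = {ipc:.3f}")        # 0.645
```

Under the base-CPI-of-1 assumption, the stall contributions simply add to the ideal CPI, and IPC is the reciprocal of CPI.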

(ii) A very simple optimization implementation for branches is to consider that they are not taken. There will be no penalty if indeed the branch is not taken, and there will still be a 3 cycle penalty if it is taken. Calculate the CPI branches, CPI, and IPC.
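The same arithmetic can be repeated for the branch-not-taken strategy in (ii). The sketch below again assumes a base CPI of 1, which the question does not state; the helper name `cpi_branches` is hypothetical.

```python
# Sketch for part (b)(ii): predict every branch as not taken.
# ASSUMPTION (not stated in the question): the ideal pipeline has a base CPI of 1.
BASE_CPI = 1.0
CPI_LOAD_LATENCY = 0.15 * 0.20 * 1   # loads * dependent fraction * 1-cycle stall

BRANCH_FRACTION = 0.20
TAKEN_FRACTION = 0.60


def cpi_branches(taken_penalty, not_taken_penalty):
    """Average branch stall cycles per instruction for the given penalties."""
    per_branch = (TAKEN_FRACTION * taken_penalty
                  + (1 - TAKEN_FRACTION) * not_taken_penalty)
    return BRANCH_FRACTION * per_branch


# Not-taken branches are now free; taken branches still cost 3 cycles.
cpi_branch = cpi_branches(taken_penalty=3, not_taken_penalty=0)
cpi = BASE_CPI + CPI_LOAD_LATENCY + cpi_branch
ipc = 1 / cpi

print(f"CPI branches = {cpi_branch:.2f}")  # 0.36
print(f"CPI          = {cpi:.2f}")         # 1.39
print(f"IPC          = {ipc:.3f}")         # 0.719
```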

(iii) Assuming that a branch-not-taken strategy has been implemented, plot CPI vs. branch misprediction cost when the latter varies between 3 and 20 cycles.
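For (iii), the data points for the plot can be generated as sketched below. The sketch assumes a base CPI of 1 (not stated in the question) and treats the misprediction cost as the penalty paid on every taken branch, since those are the only mispredicted branches under a not-taken strategy. The function name is illustrative.

```python
# Sketch for part (b)(iii): CPI as a function of the misprediction cost
# under a branch-not-taken strategy.
# ASSUMPTION (not stated in the question): the ideal pipeline has a base CPI of 1.
BASE_CPI = 1.0
CPI_LOAD_LATENCY = 0.15 * 0.20 * 1   # 0.03 stall cycles per instruction
BRANCH_FRACTION = 0.20
TAKEN_FRACTION = 0.60                # taken branches are the mispredicted ones


def cpi_for_misprediction_cost(cost_cycles):
    """CPI when each taken (mispredicted) branch costs `cost_cycles` cycles."""
    cpi_branches = BRANCH_FRACTION * TAKEN_FRACTION * cost_cycles
    return BASE_CPI + CPI_LOAD_LATENCY + cpi_branches


# One (cost, CPI) pair per integer cost from 3 to 20 cycles inclusive.
points = [(cost, cpi_for_misprediction_cost(cost)) for cost in range(3, 21)]

print("cost (cycles)   CPI")
for cost, cpi in points:
    print(f"{cost:>13}   {cpi:.2f}")
```

The relationship is linear: each additional cycle of misprediction cost adds 0.20 × 0.60 = 0.12 to the CPI, so the curve rises from 1.39 at 3 cycles to 3.43 at 20 cycles. The `points` list can be handed to any plotting tool.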

(iv) Do your computations in (iii) argue for sophisticated branch predictors when the pipelines become "deeper"?

(c) In (b), we assumed that the cache miss penalty was 20 cycles. With modern processors running at a frequency of 1 to 3 GHz, the cache miss penalty can reach several hundred cycles.

(i) Keeping all other parameters the same as in (b), plot CPI vs. cache miss penalty cost when the latter varies between 20 and 500 cycles.
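The data for this plot can be generated along the lines sketched below. The text above does not give cache miss rates, so `misses_per_instruction` is a required parameter and the value 0.01 in the usage lines is a purely hypothetical placeholder; substitute the rates from the actual assignment. The default `cpi_without_misses` of 1.55 is the (b)(i) value under an assumed base CPI of 1.

```python
# Sketch for part (c)(i): CPI as a function of the cache miss penalty.
# ASSUMPTIONS: the miss rate is NOT given in the text, so it must be supplied;
# cpi_without_misses defaults to the (b)(i) result with an assumed base CPI of 1.


def cpi_with_misses(miss_penalty, misses_per_instruction, cpi_without_misses=1.55):
    """CPI when each cache miss stalls the pipeline for `miss_penalty` cycles."""
    return cpi_without_misses + misses_per_instruction * miss_penalty


def cpi_curve(misses_per_instruction, penalties=range(20, 501, 20)):
    """List of (miss penalty, CPI) pairs for penalties from 20 to 500 cycles."""
    return [(p, cpi_with_misses(p, misses_per_instruction)) for p in penalties]


# HYPOTHETICAL placeholder: 0.01 misses per instruction (not from the question).
curve = cpi_curve(misses_per_instruction=0.01)

print("miss penalty (cycles)   CPI")
for penalty, cpi in curve:
    print(f"{penalty:>21}   {cpi:.2f}")
```

Whatever miss rate is used, the CPI grows linearly with the miss penalty, with a slope equal to the number of misses per instruction.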

(ii) Do your computations argue for the threat of a "memory wall" whereby loading instructions and data could potentially dominate the execution time?


All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd