Determine the cpi load latency, Electrical Engineering

Assignment Help:

Question:

(a) Describe the following terminologies:
i. Branch
ii. Branch Prediction
iii. Branch Predictor
iv. Branch Misprediction

(b) Consider that 15% of instructions are loads and that 20% of the instructions following a load depend on its results and are stalled for 1 cycle. All instructions and all loads hit in their respective first-level caches. Consider further that 20% of instructions are branches, with 60% of them being taken and 40% being not taken. The penalty is 2 cycles if the branch is not taken, and it is 3 cycles if the branch is taken. Then, 1 cycle is lost for 20% of the loads, 2 cycles are lost when a conditional branch is not taken, and 3 cycles are lost for taken branches.

(i) Determine the CPI load latency, CPI branches, CPI, and IPC.

(ii) A very simple optimization implementation for branches is to consider that they are not taken. There will be no penalty if indeed the branch is not taken, and there will still be a 3 cycle penalty if it is taken. Calculate the CPI branches, CPI, and IPC.

(iii) Assuming that a branch-not-taken strategy has been implemented, plot CPI vs. branch misprediction cost when the latter varies between 3 and 20 cycles.

(iv) Do your computations in (iii) argue for sophisticated branch predictors when the pipelines become "deeper"?

(c) In (b), we assumed that the cache miss penalty was 20 cycles. With modern processors running at a frequency of 1 to 3 GHz, the cache miss penalty can reach several hundred cycles.

(i) Keeping all other parameters the same as in (b), plot CPI vs. cache miss penalty cost when the latter varies between 20 and 500 cycles.

(ii) Do your computations argue for the threat of a "memory wall" whereby loading instructions and data could potentially dominate the execution time?


Related Discussions:- Determine the cpi load latency

Super-heterodyne receiver, Write a short explanation of the principles of s...

Write a short explanation of the principles of super-heterodyne receiver. It may help to use sample block diagram to state the process. Why is the production of the intermediate fr

What are cu and neu in 8087, What are CU and NEU in 8087? CU-Control un...

What are CU and NEU in 8087? CU-Control unit NEU- Numeric extension unit. The numeric extension unit implements all the numeric processor instructions whereas the control un

C R O DELAY LINE, WHAT IS THE FUNCTION OF DELAY LINE IN CRO

WHAT IS THE FUNCTION OF DELAY LINE IN CRO

Power electronics, principle of 120 degree mode of operation of voltage sou...

principle of 120 degree mode of operation of voltage source intverter

Compute the maximum electromagnetic power, Q. A four-pole dc machine with 7...

Q. A four-pole dc machine with 728 active conductors and 30mWb flux per pole runs at 1800 r/min. (a) If the armature winding is lap wound, find the voltage induced in the armatu

How a nand gate decoder is used for 2716 eprom memory, With neat diagram in...

With neat diagram indicate how a simple NAND gate decoder is used to select a 2716 EPROM memory component for memory locations FF800H-FFFFFH. Simple NAND gate Decoder: As the 2

Explain function of application layer, Q. Explain function of application l...

Q. Explain function of application layer? Layers of OSI model are as follows: (1) The Physical Layer: This defines an interface in terms of connections, voltage levels and

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd