Briefly describe logic changes to the decode stage

Assignment Help Computer Engineering
Reference no: EM131438046

Homework - Clustered Pipeline Architecture?

One of the benefits of clustered pipeline architecture is that we have smaller register files in each cluster. Smaller register files translates into faster access to registers. Additionally, splitting the physical register file into multiple smaller pieces allows for more registers to be accessed in parallel for minimal additional power and area overhead. For architectures with high degree of parallelism (8 way superscalar as an example), many registers will be accessed at the same time, thus it is important to reduce RF (Register File) access time. One unwanted side effect is the extra communication overhead between clusters due to possible intercluster dependencies.

994_Figure.png

For more information about this architecture and its benefits and drawbacks you can read: https://www.hpl.hp.com/techreports/98/HPL-98-204.pdf

Review (Dispatch Bound VS Issue Bound):?

Dispatch Bound: In dispatch bound, the issue queue contains reservation stations (Memory elements that will hold the value of dependents once ready). For example, when an instruction is decoded, its dependents are read from the register file when ready, and when the instruction moves to the functional unit, the values are also sent with the instruction.

If the dependants are not ready, these dependants will be forwarded to the issue queue and stored in the reservation entities once the producing instruction completes.

Question 1 -

MOVC R1 #5

MOVC R2 #10

MOVC R3 #15

MOVC R4 #91

MOVC R5 #20

ADD R1 R1 #100

ADD R6 R1 R2

MULT R7 R6 R2

LOAD R8 R5 #500

DIV R9 R8 R4

STORE R7 #100 #200

STORE R9 #200 #300

Assumption?: You have an instruction "CLUSTER #NO" that directs all subsequent instructions to the cluster determined by the instruction operand. For example, the following instruction sequence will send the ADD to cluster 1 and MULT to cluster 2.

Cluster #1

ADD R1 R1 R1

Cluster #2

MULT R2 R2 R3

Please re-order the provided instruction sequence utilizing the cluster command to minimize intercluster communication.

Question 2 -

Please justify your answer to question 1.

Question 3 -

For the purpose of this question, you are allowed to add new data structures or modify existing structures inside the decode stage. Also assume that each cluster has limited capacity in terms of the number of instructions it can hold at a time. Propose any necessary changes so that dependent instructions, as much as possible, are executed in the same cluster. Briefly describe logic changes to the decode stage.

Note: ?Assume FRAT belongs to the decode stage.

Question 4 -

While it is true that executing all dependent instructions on one cluster will reduce intercluster dependency, this may limit parallelism and utilization of all clusters. Please suggest a smarter mechanism that would achieve an acceptable tradeoff reducing intercluster dependencies and maintaining good level of parallelism.

Question 5 -

Describe one scenario where dispatch-bound would more efficient than dispatch-bound.

Question 6 -

This question is similar to variation 2 for the architecture described in slide Lecture Slides 3, slide number 104. However, in this new proposed variation, each Rob entry holds the a copy of the instruction result. Given the following:

1. N_R: Number of physical registers

2. N_S: Number of reservation entities

3. W_Matrix -- Its entries are ready for you to use by the decode stage

4. RAT -- RAT entries will always point to physical register. It will never point to architectural register.

5. Each RoB Entry contains (Destination Physical Register Number, Destination

Architectural Register Number, A memory to hold the result of the instruction once ready). Assume Variables : Ph_NO, AR_NO, INST_VALUE respectively.

6. Free_List: A list of free physical registers

7. There is no Renamed[] vector.

At the decode stage, upon allocating a new register and performing renaming, there is a chance of freeing a physical register. Assuming the destination register number is R_Dest, write a pseudocode that would free a physical register when necessary.

Question 7 -

During instruction execution, there are other places where a physical might be freed. Briefly describe where else we need to put freeing logic and explain why.

Attachment:- Assignment.rar

Reference no: EM131438046

Questions Cloud

Describe the difference between formal and informal research : Write a paper that describes the differences between formal and informal research. What are the benefits and the drawbacks? Provide examples of when might you use each type of research.
What are the disadvantages of your proposed policy : 1. Describe how antipoverty programs can discourage the poor from working. How might you reduce this disincentive? 2. What are the disadvantages of your proposed policy?
Income share of the richest : What has happened to the income share of the richest fifth of the U.S. population over the past 40 years? Why has this happened?
What is sonya accounting or business profit : What is Sonya's accounting or business profit? Explain. What is Sonya's economic profit? Explain.
Briefly describe logic changes to the decode stage : CS 520: Computer Architecture and Organization Spring 2017 Homework. Propose any necessary changes so that dependent instructions, as much as possible, are executed in the same cluster. Briefly describe logic changes to the decode stage
Why amazons kindle cannot be made in the united states : Why Amazon's Kindle Cannot be Made in the United States? As a result of the benefit of the need for design engineers and those involved in distribution to interact.l
Letter of complaint hospitalli am victim : letter of complaint Hospitalli am victim, Paper Details please see attachment this needs to sound so convincing is going to Court please I am the victim here and I need to sound as if I have been a victim of discrimination of race and working in a ho..
Difference between the annual depreciation : If the MACRS depreciation method with a 3-year recovery period is used for tax purposes, determine the difference between the annual depreciation after 2 years calculated from both depreciation methods.
Total cost of treatment at an interest rate : How much will people who have to get every month from 1 month after treatment ends to get the same amount to the total cost of treatment at an interest rate of 12% per year compounded monthly?

Reviews

len1438046

3/23/2017 7:39:16 AM

Hi here is a version of homework again and also there are some hints included. please let me know the price quote again on this. This is the final version of the assignment. All further clarifications will appear as announcements on blackboard. A turnitin assignment will be created on my courses for this assignment. Please *do not* submit scanned copies of handwritten assignments. All assignments must be submitted as word or pdf files or any format accepted by turnitin. Also please abide by the number of lines suggested by each question.

Write a Review

Computer Engineering Questions & Answers

  Suppose that screen is an ostream iterator initialized

What is the output of the following program segment? (suppose that screen is an ostream iterator initialized to the standard output device to output elements of the type int.)

  Print all lines where the birthdays are in november

Print all lines where the birthdays are in November or December - Append three asterisks to the end of lines starting with Fred

  Implement a new instruction that has two register operands

The MIPS instruction sequence could be used to implement a new instruction that has two register operands.

  Create a work breakdown structure code for each task

This assignment consists of two (2) sections: a Work Breakdown Structure (WBS) and a summary document. You must submit the two (2) sections as separate files for the completion of this assignment.

  Describe some illustrations of functional versus

discuss some examples of functional versus nonfunctional requirements.what would exemplify these requirements in a

  Identify three ways in which your confidentiality

Identify three ways in which your confidentiality and privacy can be breached when your computer is connected to the Internet. b. Explain what firewalls are and give a type of firewall in use.

  How the resulting fft changes

Compare this result with the actual spectrum for the pulse. Try other combinations of the number of pulse samples and zero-pads to see how the resulting FFT changes.

  Describe a use case dependency for making an account deposit

Describe (in a one to two page narrative) a use case dependency for making an account deposit. Illustrate this use case with Visio or a similar product.

  What are the main tenets of information theory

Why would one assert that the type of information handled by the Internet is more MTC-bound and that the type of information handled by the Web is more semantic?

  You are given three channels a b and c with bit error rates

you are given three channels a b and c with bit error rates 10-6 10-5 and 10-9 respectively. which channel is best?

  How a first responder will identify and preserve evidence

the video South Tower Falls, Shot Front of Trinity Church, describe your concerns as a first responder. (For the purpose of your response assume that you do NOT know whether this is a chemical, biological, radiological, nuclear, or simple explosiv..

  Which input-output control model would you use

All data from the sensors will be fed into a group of sufficiently high-powered computers so that plan managers and supervisors can control and monitor the various processes taking place. at type of interface would you use between the sensors and ..

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd