Vliw processor

Assignment Help Operating System
Reference no: EM131598

1. Given the following code and the assembler equivalent to the right:

for (i=999, i>=0, i--) x[i]=x[i]+y[i];

LOOP:   LD           F0, 0(R2) ;get x[i]

             LD           F1, 0(R3) ;get y[i]

            DADD    F2, F0, F1  ;multiply

             SD           F2, 0(R2) ;store back

                DADDUI R2,R2,#-8

                DADDUI R3,R3,#-8

                BNE R2, R4, LOOP

a. Using Figure A-3, Indicate the number of stalls that would occur between the lines of code as seen in the book and our in class example.  Write them between the lines to the right.

b. Unroll the loop so that two iterations are shown and rearrange the code to minimize the number of stalls while preserving the correctness of the code.

c.  Show how the unrolled code might be executed in a VLIW processor with the units below, given the same latencies in Figure A-3.

Load/Store Unit

Load/Store Unit

FP Unit

FP Unit

Integer Unit

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

2. Given the code below



BGE        F1, F0, SKIP ;  Check for a small number (0) in F1 - F0 has a small number

LD           F1, 0(R4) ; number was too small, load a fixed value for divide

SKIP       DDIV      F2, F2, F1 ; divide by F1,  if too small a division overflow error could occur

 

 

We see that an error could occur if the incorrect branch were taken, and a number small enough to cause overflow were in F1.  (BGE means branch on greater than,  if F1>=F0 then branch).  If we have a branch target buffer that predicts the branch taken when a value is in F1 that is too small (thus causing an error), explain how both the Tomasulo Algorithm and the Tomasulo Algorithm with ReOrder Buffers would preserve exception behavior.

3 In figure A-1, complete the fill out of the issue of the 4 instructions in the Tomasulo (without ROB) algorithm.

Reference no: EM131598

Questions Cloud

Electricity and circuit theory : Assessment on Electricity and Circuit Theory,  What part of a cable are insulators and conductors? Describe, in terms of charged particles, why some objects are electrically attracted to each other, while others repel each other?
Depict von neumann-morgenstern utility index u in a diagram : Depict the von Neumann-Morgenstern utility index u in a diagram
Develop a gui based java program : Designing and developing a College Registration program
Write an assembly program : Prepare an Assembly program that reads in a number of cents.
Vliw processor : VLIW processor - assembler
Calculate the risk : Calculate the risk and expected return for each asset.
Twin peaks building supplies : Chris White was a forestry technician who had been searching for several years for a business opportunity to combine with his forestry career
Travelling salesman problem : Travelling Salesman Problem on the L1-metric plane.
Project specification and plan : The Project Specification is a description of what you are working to achieve, and a Plan is the schedule proposed to achieve it

Reviews

Write a Review

Operating System Questions & Answers

  Simulating operating systems scheduling

Simulate the long-term scheduler, the short-term scheduler and the I/O scheduler of the computer using the First-Come-First-Serve algorithm.

  Marginal and average cost curves

n a competitive market place (pure competition) is it possible to continually sell your product at a price above the average cost of production.

  Develop a user mode command interpreter

Develop a user mode command interpreter which support list-short.

  Write a multi-threaded program

Write a multi-threaded program to solve producer and consumer problem

  Discuss an application that benefits barrier synchronization

Discuss an application that would benefit from the use of barrier synchronization

  Disk scheduling algorithm

Simulation of Elevator and Shortest Seek Time First

  Virtual machines

Virtual machines supported by a host operating system

  Implementation of algorithms for process management

The Shortest Job Next (SJN) algorithm queues processes in a way that the ones that use the shortest CPU cycle will be selected for running rst.

  Memory allocation in operating system

Analysis and implementation of algorithms for memory allocation in operating system, Explain First- t and best- t methods are used in memory allocation in operating systems.

  Issues with trusted platform module

Research paper discussing the issues with Trusted Platform Module (TPM)

  Threads

Explain a complication that concurrent processing adds to an operating system.

  Stand alone child process

Forking the child process

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd