Superscalar Processor
By
Manash Kumar Mondal
M.Tech. CSE. KUCSE
Contents
• What is a superscalar processor?
• Why superscalar?
• Organization of a superscalar processor
• Instruction dispatch
• Reservation station
• Reservation station: centralized vs distributed
• Reorder buffer
• Instruction completion and retire
• Limitations of superscalar processors
• References
Superscalar processor
A superscalar processor is a CPU that implements a form of parallelism called instruction-level parallelism within a single processor. It does so by dispatching more than one instruction per clock cycle to multiple functional units.
Simple superscalar pipeline
By fetching and dispatching two instructions at a time, a maximum of two instructions per cycle can be
completed. (IF = Instruction Fetch, ID = Instruction Decode, EX = Execute, MEM = Memory access,
WB = Register write back, i = Instruction number, t = Clock cycle [i.e., time])
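The timing in this figure can be reproduced with a minimal sketch. It assumes an ideal in-order pipeline with no hazards or stalls; the function names and the `width` parameter are illustrative and not from the slides.

```python
STAGES = ["IF", "ID", "EX", "MEM", "WB"]

def pipeline_schedule(num_instructions, width=2):
    """Return {instruction: {stage: cycle}} for an ideal in-order pipeline.

    Assumption: no hazards or stalls. Instruction i is fetched in cycle
    i // width + 1 and then advances one stage per cycle.
    """
    schedule = {}
    for i in range(num_instructions):
        fetch_cycle = i // width + 1
        schedule[i] = {stage: fetch_cycle + k for k, stage in enumerate(STAGES)}
    return schedule

def total_cycles(num_instructions, width=2):
    """Cycle in which the last instruction writes back."""
    sched = pipeline_schedule(num_instructions, width)
    return max(stages["WB"] for stages in sched.values())

def print_diagram(num_instructions, width=2):
    """Print one row per instruction i and one column per clock cycle t."""
    sched = pipeline_schedule(num_instructions, width)
    last = total_cycles(num_instructions, width)
    print("     " + " ".join(f"t{t:<3}" for t in range(1, last + 1)))
    for i, stages in sched.items():
        by_cycle = {cycle: stage for stage, cycle in stages.items()}
        row = " ".join(f"{by_cycle.get(t, ''):<4}" for t in range(1, last + 1))
        print(f"i{i:<3} {row}")

if __name__ == "__main__":
    print_diagram(6, width=2)
```

Under these assumptions, six instructions finish in 7 cycles with `width=2` and in 10 cycles with `width=1`.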
Why superscalar?
• Most operations are on scalar quantities
• Improving these operations gives an overall improvement
• A superscalar processor executes multiple independent instructions in parallel
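What "independent" means can be sketched as a register dependency check. The example below is an illustration under assumptions: the `Instr` record, the conservative test for RAW, WAW and WAR hazards, and the greedy two-wide grouping are hypothetical simplifications, not the logic of any particular processor.

```python
from typing import NamedTuple, Tuple

class Instr(NamedTuple):
    text: str               # assembly text, for display only
    dest: str               # destination register
    srcs: Tuple[str, ...]   # source registers

def independent(first: Instr, second: Instr) -> bool:
    """Conservative check that `second` can issue alongside the earlier `first`."""
    raw = first.dest in second.srcs    # second reads what first writes
    waw = first.dest == second.dest    # both write the same register
    war = second.dest in first.srcs    # second overwrites a source of first
    return not (raw or waw or war)

def pair_for_issue(program):
    """Greedy in-order grouping into issue groups of at most two instructions."""
    groups, i = [], 0
    while i < len(program):
        if i + 1 < len(program) and independent(program[i], program[i + 1]):
            groups.append([program[i].text, program[i + 1].text])
            i += 2
        else:
            groups.append([program[i].text])
            i += 1
    return groups

program = [
    Instr("add r1, r2, r3", "r1", ("r2", "r3")),
    Instr("sub r4, r5, r6", "r4", ("r5", "r6")),
    Instr("mul r7, r1, r4", "r7", ("r1", "r4")),
    Instr("add r8, r7, r2", "r8", ("r7", "r2")),
]

if __name__ == "__main__":
    for cycle, group in enumerate(pair_for_issue(program), start=1):
        print(cycle, group)
```

Here the first two instructions share no registers and issue together, while the last two form a dependency chain through `r7` and issue one at a time.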
Superscalar Organization
Instruction Dispatch
[Figure: instruction fetching, instruction decoding, and instruction dispatch feeding functional units FU1, FU2, FU3, FU4, FU5, ..., FUn]
• Route decoded instructions to the appropriate functional units
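Routing by instruction type can be sketched as a table lookup. The opcode names and functional-unit names below are hypothetical and chosen only for illustration; a real dispatcher also has to check that the target unit or its buffer has room.

```python
from collections import defaultdict

# Hypothetical mapping from opcode to functional-unit class (illustration only).
UNIT_FOR_OPCODE = {
    "add": "integer",
    "sub": "integer",
    "mul": "multiply",
    "fadd": "floating_point",
    "load": "load_store",
    "store": "load_store",
    "beq": "branch",
}

def dispatch(decoded):
    """Route (opcode, operands) pairs to per-unit queues, keeping program order."""
    queues = defaultdict(list)
    for opcode, operands in decoded:
        unit = UNIT_FOR_OPCODE.get(opcode)
        if unit is None:
            raise ValueError(f"no functional unit for opcode {opcode!r}")
        queues[unit].append((opcode, operands))
    return dict(queues)

if __name__ == "__main__":
    decoded = [("add", "r1, r2, r3"), ("load", "r4, 0(r5)"), ("sub", "r6, r1, r4")]
    for unit, queue in dispatch(decoded).items():
        print(unit, queue)
```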
Reservation Station
• A reservation station decouples instruction decoding from instruction execution.
• Main tasks: dispatching, waiting, issuing
[Figure: reservation station with an allocate unit and an issuing unit. Labels: Dispatch, Issuing, Busy, Ready, entry to be allocated, entry to be issued]
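The three tasks (dispatching, waiting, issuing) can be sketched as a small class. This is a simplified model under assumptions: entries track only the set of source registers still outstanding, and the method names `dispatch`, `wakeup` and `issue` are illustrative rather than taken from any real design.

```python
class ReservationStation:
    def __init__(self, size):
        # Each slot is None (free) or a dict describing a waiting instruction.
        self.entries = [None] * size

    def dispatch(self, op, sources, ready_regs):
        """Allocate the first free entry; return its index, or None if full."""
        for idx, entry in enumerate(self.entries):
            if entry is None:
                waiting = {reg for reg in sources if reg not in ready_regs}
                self.entries[idx] = {"op": op, "waiting": waiting}
                return idx
        return None

    def wakeup(self, reg):
        """A result for `reg` was produced, so waiting entries stop waiting on it."""
        for entry in self.entries:
            if entry is not None:
                entry["waiting"].discard(reg)

    def issue(self):
        """Issue one entry whose operands are all ready, and free its slot."""
        for idx, entry in enumerate(self.entries):
            if entry is not None and not entry["waiting"]:
                self.entries[idx] = None
                return entry["op"]
        return None

if __name__ == "__main__":
    rs = ReservationStation(2)
    rs.dispatch("mul r7, r1, r4", ["r1", "r4"], ready_regs={"r4"})
    rs.dispatch("add r8, r2, r3", ["r2", "r3"], ready_regs={"r2", "r3"})
    print(rs.issue())   # the add issues first because the mul still waits on r1
    rs.wakeup("r1")
    print(rs.issue())
```

Because issue depends on operand readiness rather than dispatch order, a later instruction can leave the station before an earlier one, which is how the station decouples decode from execute.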
Reservation station: Centralized vs Distributed
[Figure: two block diagrams showing the dispatch buffer, dispatch, issue, execute, and completion buffer stages]
Fig: Centralized reservation station (Intel P6)
Fig: Distributed reservation stations (PowerPC 620)
Reorder Buffer
• Contains all in-flight instructions: instructions in the RS, instructions executing in the FUs, and instructions that have finished execution but are waiting to be completed in program order
• Only finished and non-speculative instructions can be completed
[Figure: reorder buffer with a head pointer (next instruction to complete) and a tail pointer (next entry to be allocated); busy bits 0 0 0 1 1 1 1 1 1 1 mark the in-flight instructions. Entry fields: Busy, Issued, Finished, Instruction address, Rename register, Speculative, Valid]
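The head and tail pointers and the in-order completion rule can be sketched as a circular buffer. This is a minimal model under assumptions: only a few of the entry fields are kept, and the method names (`allocate`, `mark_finished`, `resolve`, `complete`) are hypothetical.

```python
class ReorderBuffer:
    def __init__(self, size):
        self.entries = [None] * size
        self.head = 0    # next instruction to complete
        self.tail = 0    # next entry to be allocated
        self.count = 0   # number of in-flight instructions

    def allocate(self, address, speculative=False):
        """Reserve the tail entry in program order; return None if full."""
        if self.count == len(self.entries):
            return None
        idx = self.tail
        self.entries[idx] = {
            "address": address,
            "busy": True,
            "finished": False,
            "speculative": speculative,
        }
        self.tail = (self.tail + 1) % len(self.entries)
        self.count += 1
        return idx

    def mark_finished(self, idx):
        """Execution of the instruction in entry `idx` has finished."""
        self.entries[idx]["finished"] = True

    def resolve(self, idx):
        """The speculation covering entry `idx` was confirmed correct."""
        self.entries[idx]["speculative"] = False

    def complete(self):
        """Complete from the head while entries are finished and non-speculative."""
        done = []
        while self.count:
            entry = self.entries[self.head]
            if not entry["finished"] or entry["speculative"]:
                break
            done.append(entry["address"])
            self.entries[self.head] = None
            self.head = (self.head + 1) % len(self.entries)
            self.count -= 1
        return done
```

An instruction that finishes early stays in the buffer until everything ahead of it has completed, which is what restores program order.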
Instruction completion and Retire
Completion: finish execution and update the machine state
Retire: update the memory
• A store may complete by writing to the store buffer, but it retires only when the data is written into memory
• When an interrupt occurs, stop fetching new instructions and finish the execution of all in-flight instructions
• When an exception occurs, the results of the computation may no longer be valid
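The difference between completing and retiring a store can be sketched with a FIFO store buffer. This is a simplified illustration: the class and method names are hypothetical, memory is modeled as a dictionary, and load forwarding from the buffer is left out.

```python
from collections import deque

class StoreBuffer:
    def __init__(self):
        self.pending = deque()   # completed stores not yet written to memory
        self.memory = {}         # stands in for the memory system

    def complete_store(self, address, value):
        """Completion: the store leaves the pipeline but memory is unchanged."""
        self.pending.append((address, value))

    def retire_store(self):
        """Retire: write the oldest buffered store into memory."""
        if not self.pending:
            return None
        address, value = self.pending.popleft()
        self.memory[address] = value
        return address

if __name__ == "__main__":
    sb = StoreBuffer()
    sb.complete_store(0x20, 7)
    print(0x20 in sb.memory)    # completed, not yet retired
    sb.retire_store()
    print(sb.memory[0x20])
```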
Limitations of superscalar processor
• Instruction-fetch inefficiencies caused by both branch delays and instruction misalignment
• It is not worthwhile to explore highly concurrent execution hardware; rather, it is more appropriate to explore economical execution hardware
• The degree of intrinsic parallelism in the instruction stream (instructions requiring the same computational resources from the CPU)
• The complexity and time cost of the dispatcher and the associated dependency-checking logic
• Branch instruction processing
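The limit set by intrinsic parallelism can be estimated from the longest chain of true (read-after-write) dependencies. The sketch below assumes unit latency for every instruction, unlimited functional units, and perfect register renaming, so it gives an optimistic upper bound; the function name `ilp_limit` is illustrative.

```python
def ilp_limit(program):
    """Return (critical_path_length, ipc_upper_bound).

    `program` is a list of (dest, sources) tuples in program order.
    Assumptions: unit latency, unlimited functional units, and only
    read-after-write dependencies (as if registers were perfectly renamed).
    """
    if not program:
        return 0, 0.0
    depth_of_reg = {}   # length of the dependency chain that produces each register
    longest = 0
    for dest, sources in program:
        depth = 1 + max((depth_of_reg.get(src, 0) for src in sources), default=0)
        depth_of_reg[dest] = depth
        longest = max(longest, depth)
    return longest, len(program) / longest

if __name__ == "__main__":
    program = [
        ("r1", ("r2", "r3")),   # add r1, r2, r3
        ("r4", ("r5", "r6")),   # sub r4, r5, r6
        ("r7", ("r1", "r4")),   # mul r7, r1, r4
        ("r8", ("r7", "r2")),   # add r8, r7, r2
    ]
    print(ilp_limit(program))
```

For this four-instruction sequence the longest chain has three instructions, so no issue width can sustain more than 4/3 instructions per cycle on it.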
References
1. NPTEL lecture, IIT Madras (online certification course), Superscalar processor organization
2. https://en.wikipedia.org/wiki/Supercalar
3. https://www.slideshare.net/
4. John L. Hennessy and David A. Patterson, Computer Architecture: A Quantitative Approach, 4th Edition (digital copy from bookfi.net)
Thank you !
