SlideShare a Scribd company logo
Computer Architecture
Dr.S.Santhi
Professor, Department of CSE
KIT-Kalaignarkarunanidhi Institute of Technology
UNIT 1
PERFORMANCE, POWER WALL, UNI-PROCESSORS,
MULTIPROCESSORS
Performance Metrics
Purchasing perspective
◦ given a collection of machines, which has the
◦ best performance ?
◦ least cost ?
◦ best cost/performance?
Design perspective
◦ faced with design options, which has the
◦ best performance improvement ?
◦ least cost ?
◦ best cost/performance?
Both require
◦ basis for comparison
◦ metric for evaluation
Our goal is to understand what factors in the architecture contribute to overall
system performance and the relative importance (and cost) of these factors
Defining Performance
Which airplane has the best performance?
Response Time and Throughput
Response time
◦ How long it takes to do a task
Throughput
◦ Total work done per unit time
◦ e.g., tasks/transactions/… per hour
How are response time and throughput affected by
◦ Replacing the processor with a faster version?
◦ Adding more processors?
We’ll focus on response time for now…
Relative Performance
Performance = 1/Execution Time
“X is n time faster than Y”
n

 X
Y
Y
X
time
Execution
time
Execution
e
Performanc
e
Performanc
 Example: time taken to run a program
 10s on A, 15s on B
 Execution TimeB / Execution TimeA
= 15s / 10s = 1.5
 So A is 1.5 times faster than B
Measuring Execution Time
Elapsed time
◦ Total response time, including all aspects
◦ Processing, I/O, OS overhead, idle time
◦ Determines system performance
CPU time
◦ Time spent processing a given job
◦ Discounts I/O time, other jobs’ shares
◦ Comprises user CPU time and system CPU time
◦ Different programs are affected differently by CPU and system
performance
CPU Clocking
Operation of digital hardware governed by a constant-
rate clock
Clock (cycles)
Data transfer
and computation
Update state
Clock period
 Clock period: duration of a clock cycle
 e.g., 250ps = 0.25ns = 250×10–12s
 Clock frequency (rate): cycles per second
 e.g., 4.0GHz = 4000MHz = 4.0×109Hz
Review: Machine Clock Rate
Clock rate (clock cycles per second in MHz or GHz) is inverse of clock cycle
time (clock period)
one clock period
1 nsec (10-9) clock cycle => 1 GHz (109) clock rate
CPU Time
Performance improved by
◦ Reducing number of clock cycles
◦ Increasing clock rate
◦ Hardware designer must often trade off clock rate against cycle count
Rate
Clock
Cycles
Clock
CPU
Time
Cycle
Clock
Cycles
Clock
CPU
Time
CPU



CPU Time Example
Computer A: 2GHz clock, 10s CPU time
Designing Computer B
◦ Aim for 6s CPU time
◦ Can do faster clock, but causes 1.2 × clock cycles of A
How fast must Computer B clock be?
4GHz
6s
10
24
6s
10
20
1.2
Rate
Clock
10
20
2GHz
10s
Rate
Clock
Time
CPU
Cycles
Clock
6s
Cycles
Clock
1.2
Time
CPU
Cycles
Clock
Rate
Clock
9
9
B
9
A
A
A
A
B
B
B















Instruction Count and CPI
Instruction Count for a program
◦ Determined by program, ISA and compiler
Average cycles per instruction
◦ Determined by CPU hardware
◦ If different instructions have different CPI
◦ Average CPI affected by instruction mix
Rate
Clock
CPI
Count
n
Instructio
Time
Cycle
Clock
CPI
Count
n
Instructio
Time
CPU
n
Instructio
per
Cycles
Count
n
Instructio
Cycles
Clock







CPI Example
Computer A: Cycle Time = 250ps, CPI = 2.0
Computer B: Cycle Time = 500ps, CPI = 1.2
Same ISA
Which is faster, and by how much?
1.2
500ps
I
600ps
I
A
Time
CPU
B
Time
CPU
600ps
I
500ps
1.2
I
B
Time
Cycle
B
CPI
Count
n
Instructio
B
Time
CPU
500ps
I
250ps
2.0
I
A
Time
Cycle
A
CPI
Count
n
Instructio
A
Time
CPU




















A is faster…
…by this much
CPI in More Detail
If different instruction classes take different numbers of cycles




n
1
i
i
i )
Count
n
Instructio
(CPI
Cycles
Clock
 Weighted average CPI











n
1
i
i
i
Count
n
Instructio
Count
n
Instructio
CPI
Count
n
Instructio
Cycles
Clock
CPI
Relative frequency
Power Trends
In CMOS IC technology
§1.7
The
Power
Wall
Frequency
Voltage
load
Capacitive
Power 2



 )
2
/
1
(
×1000
×40 5V → 1V
Uniprocessor Performance
§1.8
The
Sea
Change:
The
Switch
to
Multiprocessors
Constrained by power, instruction-level parallelism, memory
latency
Multiprocessors
Multicore microprocessors
◦ More than one processor per chip
Requires explicitly parallel programming
◦ Compare with instruction level parallelism
◦ Hardware executes multiple instructions at once
◦ Hidden from the programmer
◦ Hard to do
◦ Programming for performance
◦ Load balancing
◦ Optimizing communication and synchronization
Thank You!

More Related Content

PPT
Chapter 1 computer abstractions and technology
BATMUNHMUNHZAYA
 
PPT
COMPUTER ARCHITECTURE BASIC CONCEPT
Azizul Mamun
 
PPTX
Book for general presentation for computer science
festustobi18
 
PDF
Computer architecture short note (version 8)
Nimmi Weeraddana
 
PPTX
L07_performance and cost in advanced hardware- computer architecture.pptx
Isaac383415
 
PDF
Computer Architecture Performance and Energy
Jason J Pulikkottil
 
PPTX
Cpu performance matrix
Rehman baig
 
PPT
L-2 (Computer Performance).ppt
ImranKhan997082
 
Chapter 1 computer abstractions and technology
BATMUNHMUNHZAYA
 
COMPUTER ARCHITECTURE BASIC CONCEPT
Azizul Mamun
 
Book for general presentation for computer science
festustobi18
 
Computer architecture short note (version 8)
Nimmi Weeraddana
 
L07_performance and cost in advanced hardware- computer architecture.pptx
Isaac383415
 
Computer Architecture Performance and Energy
Jason J Pulikkottil
 
Cpu performance matrix
Rehman baig
 
L-2 (Computer Performance).ppt
ImranKhan997082
 

Similar to performance uploading.pptx (20)

PPTX
2. Module_1_Computer Performance, Metrics, Measurement, & Evaluation (1).pptx
americanguncontrolre
 
PPT
lect1.ppt of a lot of things like computer
btlimhzjanolufattx
 
PPT
2 CPU Performance (1) by computer organization
wewiv47743
 
PDF
02 performance
marangburu42
 
PPT
Computer System Architecture Lecture Note 6: hardware performance
Budditha Hettige
 
PPTX
Basic Computer Architecture slides num.1
MdRakibulIslam697135
 
PDF
Parallel Computing - Lec 6
Shah Zaib
 
PPT
Computer Organization Design ch2Slides.ppt
rajesshs31r
 
PPTX
Unit i-introduction
akruthi k
 
PPT
Webinaron muticoreprocessors
Nagasuri Bala Venkateswarlu
 
PDF
Computer performance
Amit Kumar Rathi
 
PPTX
Trends in computer architecture
muhammedsalihabbas
 
PPTX
Section1:Course: Computer Architecture.pptx
shovonpoco
 
PPTX
Computer Organization & Architecture – Key Microprocessor Specs.pptx
Masi Karimi
 
PPT
Kiến trúc máy tính-COE 301 - Performance.ppt
TriTrang4
 
PPTX
Computer Architechture and Organization
Aiman Hafeez
 
PDF
Lecture1_Introduction_computerar (1).pdf
jacksafahi
 
PPT
CA UNIT I PPT.ppt
RAJESH S
 
PPTX
Clock and clock cycle in processor architecture
gogii6789
 
PPTX
Evaluation of computer performance
Prasenjit Dey
 
2. Module_1_Computer Performance, Metrics, Measurement, & Evaluation (1).pptx
americanguncontrolre
 
lect1.ppt of a lot of things like computer
btlimhzjanolufattx
 
2 CPU Performance (1) by computer organization
wewiv47743
 
02 performance
marangburu42
 
Computer System Architecture Lecture Note 6: hardware performance
Budditha Hettige
 
Basic Computer Architecture slides num.1
MdRakibulIslam697135
 
Parallel Computing - Lec 6
Shah Zaib
 
Computer Organization Design ch2Slides.ppt
rajesshs31r
 
Unit i-introduction
akruthi k
 
Webinaron muticoreprocessors
Nagasuri Bala Venkateswarlu
 
Computer performance
Amit Kumar Rathi
 
Trends in computer architecture
muhammedsalihabbas
 
Section1:Course: Computer Architecture.pptx
shovonpoco
 
Computer Organization & Architecture – Key Microprocessor Specs.pptx
Masi Karimi
 
Kiến trúc máy tính-COE 301 - Performance.ppt
TriTrang4
 
Computer Architechture and Organization
Aiman Hafeez
 
Lecture1_Introduction_computerar (1).pdf
jacksafahi
 
CA UNIT I PPT.ppt
RAJESH S
 
Clock and clock cycle in processor architecture
gogii6789
 
Evaluation of computer performance
Prasenjit Dey
 
Ad

Recently uploaded (20)

PPTX
Civil Engineering Practices_BY Sh.JP Mishra 23.09.pptx
bineetmishra1990
 
PPTX
Chapter_Seven_Construction_Reliability_Elective_III_Msc CM
SubashKumarBhattarai
 
PDF
67243-Cooling and Heating & Calculation.pdf
DHAKA POLYTECHNIC
 
PDF
Packaging Tips for Stainless Steel Tubes and Pipes
heavymetalsandtubes
 
PPTX
22PCOAM21 Session 2 Understanding Data Source.pptx
Guru Nanak Technical Institutions
 
PPT
SCOPE_~1- technology of green house and poyhouse
bala464780
 
PPTX
database slide on modern techniques for optimizing database queries.pptx
aky52024
 
PDF
Traditional Exams vs Continuous Assessment in Boarding Schools.pdf
The Asian School
 
PDF
20ME702-Mechatronics-UNIT-1,UNIT-2,UNIT-3,UNIT-4,UNIT-5, 2025-2026
Mohanumar S
 
PDF
2025 Laurence Sigler - Advancing Decision Support. Content Management Ecommer...
Francisco Javier Mora Serrano
 
PDF
JUAL EFIX C5 IMU GNSS GEODETIC PERFECT BASE OR ROVER
Budi Minds
 
PPT
Ppt for engineering students application on field effect
lakshmi.ec
 
PDF
July 2025: Top 10 Read Articles Advanced Information Technology
ijait
 
PDF
Software Testing Tools - names and explanation
shruti533256
 
PPTX
business incubation centre aaaaaaaaaaaaaa
hodeeesite4
 
PDF
67243-Cooling and Heating & Calculation.pdf
DHAKA POLYTECHNIC
 
PDF
dse_final_merit_2025_26 gtgfffffcjjjuuyy
rushabhjain127
 
PDF
Introduction to Ship Engine Room Systems.pdf
Mahmoud Moghtaderi
 
PDF
Natural_Language_processing_Unit_I_notes.pdf
sanguleumeshit
 
PPTX
Information Retrieval and Extraction - Module 7
premSankar19
 
Civil Engineering Practices_BY Sh.JP Mishra 23.09.pptx
bineetmishra1990
 
Chapter_Seven_Construction_Reliability_Elective_III_Msc CM
SubashKumarBhattarai
 
67243-Cooling and Heating & Calculation.pdf
DHAKA POLYTECHNIC
 
Packaging Tips for Stainless Steel Tubes and Pipes
heavymetalsandtubes
 
22PCOAM21 Session 2 Understanding Data Source.pptx
Guru Nanak Technical Institutions
 
SCOPE_~1- technology of green house and poyhouse
bala464780
 
database slide on modern techniques for optimizing database queries.pptx
aky52024
 
Traditional Exams vs Continuous Assessment in Boarding Schools.pdf
The Asian School
 
20ME702-Mechatronics-UNIT-1,UNIT-2,UNIT-3,UNIT-4,UNIT-5, 2025-2026
Mohanumar S
 
2025 Laurence Sigler - Advancing Decision Support. Content Management Ecommer...
Francisco Javier Mora Serrano
 
JUAL EFIX C5 IMU GNSS GEODETIC PERFECT BASE OR ROVER
Budi Minds
 
Ppt for engineering students application on field effect
lakshmi.ec
 
July 2025: Top 10 Read Articles Advanced Information Technology
ijait
 
Software Testing Tools - names and explanation
shruti533256
 
business incubation centre aaaaaaaaaaaaaa
hodeeesite4
 
67243-Cooling and Heating & Calculation.pdf
DHAKA POLYTECHNIC
 
dse_final_merit_2025_26 gtgfffffcjjjuuyy
rushabhjain127
 
Introduction to Ship Engine Room Systems.pdf
Mahmoud Moghtaderi
 
Natural_Language_processing_Unit_I_notes.pdf
sanguleumeshit
 
Information Retrieval and Extraction - Module 7
premSankar19
 
Ad

performance uploading.pptx

  • 1. Computer Architecture Dr.S.Santhi Professor, Department of CSE KIT-Kalaignarkarunanidhi Institute of Technology
  • 2. UNIT 1 PERFORMANCE, POWER WALL, UNI-PROCESSORS, MULTIPROCESSORS
  • 3. Performance Metrics Purchasing perspective ◦ given a collection of machines, which has the ◦ best performance ? ◦ least cost ? ◦ best cost/performance? Design perspective ◦ faced with design options, which has the ◦ best performance improvement ? ◦ least cost ? ◦ best cost/performance? Both require ◦ basis for comparison ◦ metric for evaluation Our goal is to understand what factors in the architecture contribute to overall system performance and the relative importance (and cost) of these factors
  • 4. Defining Performance Which airplane has the best performance?
  • 5. Response Time and Throughput Response time ◦ How long it takes to do a task Throughput ◦ Total work done per unit time ◦ e.g., tasks/transactions/… per hour How are response time and throughput affected by ◦ Replacing the processor with a faster version? ◦ Adding more processors? We’ll focus on response time for now…
  • 6. Relative Performance Performance = 1/Execution Time “X is n time faster than Y” n   X Y Y X time Execution time Execution e Performanc e Performanc  Example: time taken to run a program  10s on A, 15s on B  Execution TimeB / Execution TimeA = 15s / 10s = 1.5  So A is 1.5 times faster than B
  • 7. Measuring Execution Time Elapsed time ◦ Total response time, including all aspects ◦ Processing, I/O, OS overhead, idle time ◦ Determines system performance CPU time ◦ Time spent processing a given job ◦ Discounts I/O time, other jobs’ shares ◦ Comprises user CPU time and system CPU time ◦ Different programs are affected differently by CPU and system performance
  • 8. CPU Clocking Operation of digital hardware governed by a constant- rate clock Clock (cycles) Data transfer and computation Update state Clock period  Clock period: duration of a clock cycle  e.g., 250ps = 0.25ns = 250×10–12s  Clock frequency (rate): cycles per second  e.g., 4.0GHz = 4000MHz = 4.0×109Hz
  • 9. Review: Machine Clock Rate Clock rate (clock cycles per second in MHz or GHz) is inverse of clock cycle time (clock period) one clock period 1 nsec (10-9) clock cycle => 1 GHz (109) clock rate
  • 10. CPU Time Performance improved by ◦ Reducing number of clock cycles ◦ Increasing clock rate ◦ Hardware designer must often trade off clock rate against cycle count Rate Clock Cycles Clock CPU Time Cycle Clock Cycles Clock CPU Time CPU   
  • 11. CPU Time Example Computer A: 2GHz clock, 10s CPU time Designing Computer B ◦ Aim for 6s CPU time ◦ Can do faster clock, but causes 1.2 × clock cycles of A How fast must Computer B clock be? 4GHz 6s 10 24 6s 10 20 1.2 Rate Clock 10 20 2GHz 10s Rate Clock Time CPU Cycles Clock 6s Cycles Clock 1.2 Time CPU Cycles Clock Rate Clock 9 9 B 9 A A A A B B B               
  • 12. Instruction Count and CPI Instruction Count for a program ◦ Determined by program, ISA and compiler Average cycles per instruction ◦ Determined by CPU hardware ◦ If different instructions have different CPI ◦ Average CPI affected by instruction mix Rate Clock CPI Count n Instructio Time Cycle Clock CPI Count n Instructio Time CPU n Instructio per Cycles Count n Instructio Cycles Clock       
  • 13. CPI Example Computer A: Cycle Time = 250ps, CPI = 2.0 Computer B: Cycle Time = 500ps, CPI = 1.2 Same ISA Which is faster, and by how much? 1.2 500ps I 600ps I A Time CPU B Time CPU 600ps I 500ps 1.2 I B Time Cycle B CPI Count n Instructio B Time CPU 500ps I 250ps 2.0 I A Time Cycle A CPI Count n Instructio A Time CPU                     A is faster… …by this much
  • 14. CPI in More Detail If different instruction classes take different numbers of cycles     n 1 i i i ) Count n Instructio (CPI Cycles Clock  Weighted average CPI            n 1 i i i Count n Instructio Count n Instructio CPI Count n Instructio Cycles Clock CPI Relative frequency
  • 15. Power Trends In CMOS IC technology §1.7 The Power Wall Frequency Voltage load Capacitive Power 2     ) 2 / 1 ( ×1000 ×40 5V → 1V
  • 17. Multiprocessors Multicore microprocessors ◦ More than one processor per chip Requires explicitly parallel programming ◦ Compare with instruction level parallelism ◦ Hardware executes multiple instructions at once ◦ Hidden from the programmer ◦ Hard to do ◦ Programming for performance ◦ Load balancing ◦ Optimizing communication and synchronization

Editor's Notes

  • #4: Or smallest/lightest Longest battery life Most reliable/durable (in space)
  • #5: November 21, 2023
  • #6: November 21, 2023
  • #7: November 21, 2023
  • #8: November 21, 2023
  • #9: November 21, 2023
  • #10: A clock cycle is the basic unit of time to execute one operation/pipeline stage/etc.
  • #11: November 21, 2023
  • #12: November 21, 2023
  • #13: November 21, 2023
  • #14: November 21, 2023
  • #15: November 21, 2023
  • #16: November 21, 2023
  • #17: November 21, 2023
  • #18: November 21, 2023