SlideShare a Scribd company logo
1
Shahram Salamian
Ultra Mobility Group
Intel Corporation
Intel® Atom™ Processor Pre-silicon Verification Experience
2
AgendaAgenda
• Atom IntroductionAtom Introduction
• Verification challengesVerification challenges
• High level verification methodologyHigh level verification methodology
• Verification development process and indicatorsVerification development process and indicators
• Results & key learning'sResults & key learning's
• Future challengesFuture challenges
3
ONLY THE MOST POWER EFFICIENT FEATURES
WERE ADDED TO FURTHER IMPROVE PERFORMANCE
1% perf for 3%
or more power
1% perf for
2% power
1% perf for 1%
or less power
ApplicationPower
Performance
Baseline
core
Micro-architecting for Low PowerMicro-architecting for Low Power
• Started with single-issue, in order
pipeline
• Kept adding new features
• Iterated until higher performance and
performance/watt efficiency goals
were met
4
Key Architecture Level FeaturesKey Architecture Level Features
• ISA compatible with Intel® Core™ 2
 including Intel® SSE3, SSSE3 and Intel®64
• Dual issue pipeline with dual-thread support
• Full width SIMD integer and SP FP add support
• Aggressive TDP/average/idle power management
 Support for Enhanced Intel SpeedStep® Technology and ACPI
states including Intel® Deep Power down Technology (C6)
• Intel® Virtualization technology support
• Burst mode capability to enable higher frequency operation
on thermally limited designs
• 800 MHZ – 1.8 GHz, .65W – 2.4W TDP
5
Micro-architecture FeaturesMicro-architecture Features
 32KB Instruction cache with pre-decode extension32KB Instruction cache with pre-decode extension
 Branch Trace Buffer, Gshare predictorBranch Trace Buffer, Gshare predictor
 Return stack buffers (@ fetch, @ decode)Return stack buffers (@ fetch, @ decode)
 Per-thread Instruction Scheduling QueuesPer-thread Instruction Scheduling Queues
 Scheduler that can pick 2 ops from either thread per clockScheduler that can pick 2 ops from either thread per clock
 128b SIMD Integer datapath (2 SIMD alus, 1 shuffle unit)128b SIMD Integer datapath (2 SIMD alus, 1 shuffle unit)
 64b FP, SIMD integer multipliers64b FP, SIMD integer multipliers
 FP adder with 128b support for SP adds (64b for others)FP adder with 128b support for SP adds (64b for others)
 SIR (Safe Instruction Recognition) support to allow out-of-order commitsSIR (Safe Instruction Recognition) support to allow out-of-order commits
 24KB Writeback Data cache24KB Writeback Data cache
 Two-level Data TLB hierarchy with large and small page structuresTwo-level Data TLB hierarchy with large and small page structures
 Hardware page walker (for instruction and data TLB misses)Hardware page walker (for instruction and data TLB misses)
 Integer store-> load forwarding supportInteger store-> load forwarding support
 512KB way Level2 Cache (256b per access) with inline ECC protection512KB way Level2 Cache (256b per access) with inline ECC protection
 L2 cache, Data Cache hardware prefetchersL2 cache, Data Cache hardware prefetchers
Fetch
And
Decode
Scheduling
FP/SIMD
execution
Memory
execution
6
Power C-StatesPower C-States
Core voltage
Core clock
PLL
L1 caches
L2 cache
Wakeup time
Power
C0 HFM C0 LFM C1/C2 C4 C6
OFF OFF OFF
OFF OFF
Partial flush of
f
active active
of
f
flushedflushed
7
Global Power ManagementGlobal Power Management
• Global clock gating (C states)Global clock gating (C states)
• Dynamic frequency and voltage scaling with Enhanced IntelDynamic frequency and voltage scaling with Enhanced Intel
SpeedStep® Technology (enhanced C states)SpeedStep® Technology (enhanced C states)
• PLL and Front Side Bus IOs shutdown and core voltagePLL and Front Side Bus IOs shutdown and core voltage
reduction (deep C states)reduction (deep C states)
• Sleep transistors allow dynamic power down of L2 cacheSleep transistors allow dynamic power down of L2 cache
ways (Ultra-drowsy mode in deep C states)ways (Ultra-drowsy mode in deep C states)
• Sleep transistors in L2 cache allow voltage to be reduced toSleep transistors in L2 cache allow voltage to be reduced to
inactive logic (Drowsy mode)inactive logic (Drowsy mode)
• CMOS mode on Front Side BusCMOS mode on Front Side Bus
8
AgendaAgenda
• Atom IntroductionAtom Introduction
• Verification challengesVerification challenges
• High level verification methodologyHigh level verification methodology
• Verification development process and indicatorsVerification development process and indicators
• Results & key learning'sResults & key learning's
• Future challengesFuture challenges
9
Verification ChallengesVerification Challenges
 New ground up uArchitectureNew ground up uArchitecture
• Large scale development on validation collateralsLarge scale development on validation collaterals
 Aggressive development schedule to first TOAggressive development schedule to first TO
 Constrained verification resource environmentConstrained verification resource environment
• More acute for the first half of the project where newMore acute for the first half of the project where new
development was taking placedevelopment was taking place
 Introduction of significant power mgmt featuresIntroduction of significant power mgmt features
 Minimize post-si validation qualification cycle, at least fromMinimize post-si validation qualification cycle, at least from
function/logic perspectivefunction/logic perspective
10
AgendaAgenda
• Atom introductionAtom introduction
• Verification challengesVerification challenges
• High level verification methodologyHigh level verification methodology
• Verification development process and indicatorsVerification development process and indicators
• Results & key learning'sResults & key learning's
• Future challengesFuture challenges
11
High level verification approachesHigh level verification approaches
 Cluster level validation & functional coverageCluster level validation & functional coverage
 New x86 test case generator for Architecture validationNew x86 test case generator for Architecture validation
 New power mgmt validation collaterals developmentNew power mgmt validation collaterals development
 Formal property & model checking verificationFormal property & model checking verification
 X86 legacy database reuse for verifying against ArchitectureX86 legacy database reuse for verifying against Architecture
 Focus on debug, testability & survivability features from theFocus on debug, testability & survivability features from the
startstart
 Complementary full chip test content from post-si domainComplementary full chip test content from post-si domain
 EmulationEmulation
12
Cluster level validationCluster level validation
 Developed new test benches for all clusters (5 HW clusters) in SpecmanDeveloped new test benches for all clusters (5 HW clusters) in Specman
“e”“e”
 Test bench architecture enabled building “super clusters” as appropriateTest bench architecture enabled building “super clusters” as appropriate
• Bus + Memory, Front end + schedulerBus + Memory, Front end + scheduler
• Allowed exercising complicated inter-cluster protocol more effectivelyAllowed exercising complicated inter-cluster protocol more effectively
 Scoreboard + checkers reused at full chip environmentScoreboard + checkers reused at full chip environment
 Detailed scoreboard + checker reviews with Architecture + design teamsDetailed scoreboard + checker reviews with Architecture + design teams
 Random templates + Specman constraints generated testsRandom templates + Specman constraints generated tests
 New Arch simulator facilitated standalone Ucode validation environmentNew Arch simulator facilitated standalone Ucode validation environment
13
Functional CoverageFunctional Coverage
 Significant emphasis on use of functional coverage in allSignificant emphasis on use of functional coverage in all
clusters & full chipclusters & full chip
 Propriety coverage language specification/collection/analysisPropriety coverage language specification/collection/analysis
flowflow
 Coverage space specification was result of collaboration withCoverage space specification was result of collaboration with
architecture, design teamsarchitecture, design teams
 Use coverage to identify test generation & content gap atUse coverage to identify test generation & content gap at
cluster and FC environmentscluster and FC environments
 Set goals for cluster and full chip functional coverage goalsSet goals for cluster and full chip functional coverage goals
 Could not attain high tape out coverage goals in some areasCould not attain high tape out coverage goals in some areas
- Detailed coverage analysis reviews with all stakeholders- Detailed coverage analysis reviews with all stakeholders
- Post-si validation focused on low coverage areas- Post-si validation focused on low coverage areas
14
Architectural ValidationArchitectural Validation
 Goal is to verify implementation adheres to all aspects of X86Goal is to verify implementation adheres to all aspects of X86
architecture (i.e Intel blue books)architecture (i.e Intel blue books)
 Leveraged large legacy test base suiteLeveraged large legacy test base suite
 Developed new X86 instruction set random test generatorDeveloped new X86 instruction set random test generator
 Leveraged X86 Architectural simulator as reference model toLeveraged X86 Architectural simulator as reference model to
check Architectural correctnesscheck Architectural correctness
• Complemented by key cluster uArch checkersComplemented by key cluster uArch checkers
 Developed special libraries to exercise power mgmt featuresDeveloped special libraries to exercise power mgmt features
15
It is all about Power stupid……It is all about Power stupid……
 Imperative to get the power saving features right at A0Imperative to get the power saving features right at A0
• Early post-si power characterization was key requirementEarly post-si power characterization was key requirement
• Enablement of power management SW development and debug asEnablement of power management SW development and debug as
soon as possiblesoon as possible
 Developed validation collaterals to inject random power eventsDeveloped validation collaterals to inject random power events
& dynamic checkers for correct behavior& dynamic checkers for correct behavior
• Power mgmt is collaboration of Ucode, HW & chipsetPower mgmt is collaboration of Ucode, HW & chipset
• High level of random variables to create scenariosHigh level of random variables to create scenarios
• Detailed checker to verify state & other power related featuresDetailed checker to verify state & other power related features
 State corruption capability as power domain turned offState corruption capability as power domain turned off
• Power specification approach was in its infancyPower specification approach was in its infancy
• Developed tools to identify sequential elements & corrupt state asDeveloped tools to identify sequential elements & corrupt state as
neededneeded
 Clock gating logic traversal capability and coverageClock gating logic traversal capability and coverage
specificationspecification
16
Formal VerificationFormal Verification
 Applied model checkingApplied model checking
• Selective areas on floating point algorithms with data pathSelective areas on floating point algorithms with data path
contentcontent
• Instruction length decode data pathInstruction length decode data path
 Extensive use of assertionsExtensive use of assertions
• Assumption checkingAssumption checking
• ProtocolsProtocols
• Multi-cycle pathsMulti-cycle paths
• ~50% of non-complex assertions proven formally~50% of non-complex assertions proven formally
17
Si Debug & SurvivabilitySi Debug & Survivability
 DebugDebug
• Numerous debug features were incorporated in the designNumerous debug features were incorporated in the design
• Treated debug features as important as functionalTreated debug features as important as functional
• Involved post-si validation to define usage model, test contentInvolved post-si validation to define usage model, test content
development, execution on pre-sidevelopment, execution on pre-si
 Survivability – Making progress in the presence of bugsSurvivability – Making progress in the presence of bugs
• Complex algorithms divided into smaller sub-algorithms, selectable withComplex algorithms divided into smaller sub-algorithms, selectable with
ON/OFF flopON/OFF flop
• Behavior on OFF reverts to simpler or slower performance or lowerBehavior on OFF reverts to simpler or slower performance or lower
power behaviorpower behavior
• Helps with bug isolationHelps with bug isolation
• Randomized all ON/OFF flops in cluster & full chip random templates,Randomized all ON/OFF flops in cluster & full chip random templates,
with checkers and coverage measurement enabledwith checkers and coverage measurement enabled
18
Post-Si content & EmulationPost-Si content & Emulation
 Test content diversificationTest content diversification
• Actively sought post-si test contentActively sought post-si test content
– Allowed for post-si content development to start & mature earlyAllowed for post-si content development to start & mature early
– Identified areas of improvement for the newly deployed ISAIdentified areas of improvement for the newly deployed ISA
generatorgenerator
– Additional, complementary source for coverageAdditional, complementary source for coverage
 Focused on full chip emulationFocused on full chip emulation
• Post-si team used emulation to develop & debug testPost-si team used emulation to develop & debug test
contentcontent
• Accelerate replay of failure tracesAccelerate replay of failure traces
• Successful boot of Unix-like kernel before TOSuccessful boot of Unix-like kernel before TO
19
AgendaAgenda
• Atom introductionAtom introduction
• Verification challengesVerification challenges
• High level verification methodologyHigh level verification methodology
• Verification development process and indicatorsVerification development process and indicators
• Results & key learning'sResults & key learning's
• Future challengesFuture challenges
20
Verification Development ProcessVerification Development Process
 Light verificationLight verification
• Split RTL & test bench development into distinct phases & have enoughSplit RTL & test bench development into distinct phases & have enough
features to allow full chip executionfeatures to allow full chip execution
• At end of each phase, set of clear acceptance criteria had to be met toAt end of each phase, set of clear acceptance criteria had to be met to
call phase donecall phase done
• Increasingly stringent quality control to check RTL & test bench code inIncreasingly stringent quality control to check RTL & test bench code in
data base with each phasedata base with each phase
 Heavy verificationHeavy verification
• Test planning, coverage specification phase, initial exerciseTest planning, coverage specification phase, initial exercise
• Template writing, directed test writing, start coverage roll upTemplate writing, directed test writing, start coverage roll up
• Coverage analysis with feedback to test generation & environmentCoverage analysis with feedback to test generation & environment
• More randomness, more injectors, more stress scenarios as new bugMore randomness, more injectors, more stress scenarios as new bug
arrival rate showed downward trendarrival rate showed downward trend
21
Verification MetricsVerification Metrics
 Light ValidationLight Validation
• Stage plan basedStage plan based
• Verification collateral development was tightly coupled with RTL featuresVerification collateral development was tightly coupled with RTL features
• Tight control on phase acceptance criteria contentTight control on phase acceptance criteria content
 Heavy verification phase indicatorsHeavy verification phase indicators
• % Coverage specification complete% Coverage specification complete
• % Functional coverage% Functional coverage
• % Tests written & passing rate for directed scenarios (DFT, DFV)% Tests written & passing rate for directed scenarios (DFT, DFV)
• % Legacy tests passing rate% Legacy tests passing rate
• Bug indicators such as incoming rate, open, ready-for-closure, type, etcBug indicators such as incoming rate, open, ready-for-closure, type, etc
• Last but certainly not least, health of model indicatorLast but certainly not least, health of model indicator
– Used by many previous generation CPU projectsUsed by many previous generation CPU projects
– Intended to act as RTL functional health indicator to design, project mgmt, etcIntended to act as RTL functional health indicator to design, project mgmt, etc
– Incorporates impact of unresolved bugs on validation progressIncorporates impact of unresolved bugs on validation progress
– Incorporates subjective “progress” factor as voted by different areas ofIncorporates subjective “progress” factor as voted by different areas of
validationvalidation
22
Health of the model indicatorHealth of the model indicator
Atom Health of the Model
0
10
20
30
40
50
60
70
80
90
100
0
Qtr 4 Qtr 3 Qtr 2 Qtr 1 Qtr 0
Time to Tapeout
Score
Black – Atom
Pink, Blue, Orange - Previous CPU projects
Green Line – Desired zone
Red Line - Urgent action required zone
23
AgendaAgenda
• Atom introductionAtom introduction
• Verification challengesVerification challenges
• High level verification methodologyHigh level verification methodology
• Verification development process and indicatorsVerification development process and indicators
• Results & key learning'sResults & key learning's
• Future challengesFuture challenges
24
ResultsResults
 10 Hr boot from part arrival to Windows & Linux boot10 Hr boot from part arrival to Windows & Linux boot
 Post-si activities were not blocked or curtailed as result ofPost-si activities were not blocked or curtailed as result of
functional bugsfunctional bugs
 Significant % of post-si found bugs were corner case, multi-Significant % of post-si found bugs were corner case, multi-
dimensional, timing-dependent sequence scenariosdimensional, timing-dependent sequence scenarios
 A small number of “we should have hit this” type of escapesA small number of “we should have hit this” type of escapes
 All debug/survivability features worked. Significantly helpedAll debug/survivability features worked. Significantly helped
with isolation & debug effortwith isolation & debug effort
25
SLT Total Bugs by Assigned Cluster
Bugs subm itted betw een 2006_01 & 2006_52
BNL
2%
BUS
22%
DFT
5%
FEC
16%
FPC
5%
IEC
13%
MEC
8%
uCODE
29%
Results
26
ResultsResults
Pre-si bug escapes to post-si
0
1
2
3
4
5
6
7
8
Intel
Pentium
Pro
Intel
Pentium 4
Intel
Pentium 4
- 65 nm
Intel Core-
Duo
Intel Core-
Duo - 45
nm
Intel Atom
Intel Architecture CPU
%oftotalpre-silicon
bugs
27
Ultimate indicator of successful result is
Intel® Atom™ powering many products in stores near you !
Results
28
Key Learning'sKey Learning's
 Build & keep full chip healthy as soon as possibleBuild & keep full chip healthy as soon as possible
 Keep stringent quality control to check-in RTL and/orKeep stringent quality control to check-in RTL and/or
validation collaterals in the design databasevalidation collaterals in the design database
 Putting same focus on DFT, DFV, etc as main streamPutting same focus on DFT, DFV, etc as main stream
functionality pays off in long termfunctionality pays off in long term
 Dig behind the indicators, especially when they show rosyDig behind the indicators, especially when they show rosy
picturespictures
 Small, highly focused, motivated team can do wonderfulSmall, highly focused, motivated team can do wonderful
thingsthings
29
AgendaAgenda
• Atom introductionAtom introduction
• Verification challengesVerification challenges
• High level verification methodologyHigh level verification methodology
• Verification development process and indicatorsVerification development process and indicators
• ResultsResults & key learnings& key learnings
• Future challengesFuture challenges
30
Future challengesFuture challenges
 Effective & systematic coverage specification andEffective & systematic coverage specification and
measurement for post-si and how to tie ormeasurement for post-si and how to tie or
leverage pre-sileverage pre-si
 Ways to capture state & debug, especially asWays to capture state & debug, especially as
move to even more integrated multi-core, multi-IPmove to even more integrated multi-core, multi-IP
worldworld
 Analog/digital, circuit, process i.e non-logic issuesAnalog/digital, circuit, process i.e non-logic issues
are starting to dominate post-si qualification cycleare starting to dominate post-si qualification cycle

More Related Content

What's hot (20)

PPTX
RISC-V Introduction
RISC-V International
 
PPTX
Raspberry pi
Prashant Kumar
 
PPTX
SOC Verification using SystemVerilog
Ramdas Mozhikunnath
 
PPTX
Architecture Exploration of RISC-V Processor and Comparison with ARM Cortex-A53
KarthiSugumar
 
PPTX
RISC-V Introduction
Yi-Hsiu Hsu
 
PPTX
03_03_Implementing_PCIe_ATS_in_ARM-based_SoCs_Final
Gopi Krishnamurthy
 
PPTX
AXI Protocol.pptx
Yazan Yousef
 
PDF
How to create SystemVerilog verification environment?
Sameh El-Ashry
 
PDF
CPU Verification
Ramdas Mozhikunnath
 
PPTX
Riscv 20160507-patterson
Krste Asanovic
 
PDF
IBM Think 2020 Openshift on IBM Z and LinuxONE
Filipe Miranda
 
PPT
Red hat linux 9 ppt2003
ashishsjcit
 
PDF
Cracking Digital VLSI Verification Interview: Interview Success
Ramdas Mozhikunnath
 
PDF
DPDK IPSec Security Gateway Application
Michelle Holley
 
PPTX
VMworld 2017 vSAN Network Design
Cormac Hogan
 
PDF
Session 8,9 PCI Express
Subhash Iyer
 
PPTX
Monitoramento e Gerenciamento de Infraestrutura com Zabbix - Patrícia Ladislau
Patricia Ladislau Silva
 
PPT
Hardware accelerated Virtualization in the ARM Cortex™ Processors
The Linux Foundation
 
PDF
RISC-V Online Tutor
RISC-V International
 
PDF
Pcie basic
Saifuddin Kaijar
 
RISC-V Introduction
RISC-V International
 
Raspberry pi
Prashant Kumar
 
SOC Verification using SystemVerilog
Ramdas Mozhikunnath
 
Architecture Exploration of RISC-V Processor and Comparison with ARM Cortex-A53
KarthiSugumar
 
RISC-V Introduction
Yi-Hsiu Hsu
 
03_03_Implementing_PCIe_ATS_in_ARM-based_SoCs_Final
Gopi Krishnamurthy
 
AXI Protocol.pptx
Yazan Yousef
 
How to create SystemVerilog verification environment?
Sameh El-Ashry
 
CPU Verification
Ramdas Mozhikunnath
 
Riscv 20160507-patterson
Krste Asanovic
 
IBM Think 2020 Openshift on IBM Z and LinuxONE
Filipe Miranda
 
Red hat linux 9 ppt2003
ashishsjcit
 
Cracking Digital VLSI Verification Interview: Interview Success
Ramdas Mozhikunnath
 
DPDK IPSec Security Gateway Application
Michelle Holley
 
VMworld 2017 vSAN Network Design
Cormac Hogan
 
Session 8,9 PCI Express
Subhash Iyer
 
Monitoramento e Gerenciamento de Infraestrutura com Zabbix - Patrícia Ladislau
Patricia Ladislau Silva
 
Hardware accelerated Virtualization in the ARM Cortex™ Processors
The Linux Foundation
 
RISC-V Online Tutor
RISC-V International
 
Pcie basic
Saifuddin Kaijar
 

Viewers also liked (15)

PDF
Pre-Si Verification for Post-Si Validation
DVClub
 
PPT
Validating Next Generation CPUs
DVClub
 
PDF
Intel Atom X Mobile Processors Announcement Slides MWC 2015
Ronen Mendezitsky
 
PDF
Verification of Graphics ASICs (Part II)
DVClub
 
PDF
Verification of Graphics ASICs (Part I)
DVClub
 
PDF
AMD_11th_Intl_SoC_Conf_UCI_Irvine
Pankaj Singh
 
PPTX
Intel_Low Power Intelligent Solutions with Intel Atom Processor
Işınsu Akçetin
 
PDF
Efficiency Through Methodology
DVClub
 
PDF
Intel Xeon Pre-Silicon Validation: Introduction and Challenges
DVClub
 
PDF
UVM: Basic Sequences
Arrow Devices
 
PDF
Verification challenges and methodologies - SoC and ASICs
Dr. Shivananda Koteshwar
 
PDF
Functional verification techniques EW16 session
Sameh El-Ashry
 
PDF
UVM Methodology Tutorial
Arrow Devices
 
PDF
Coverage and Introduction to UVM
Dr. Shivananda Koteshwar
 
PPTX
Intel Processors
home
 
Pre-Si Verification for Post-Si Validation
DVClub
 
Validating Next Generation CPUs
DVClub
 
Intel Atom X Mobile Processors Announcement Slides MWC 2015
Ronen Mendezitsky
 
Verification of Graphics ASICs (Part II)
DVClub
 
Verification of Graphics ASICs (Part I)
DVClub
 
AMD_11th_Intl_SoC_Conf_UCI_Irvine
Pankaj Singh
 
Intel_Low Power Intelligent Solutions with Intel Atom Processor
Işınsu Akçetin
 
Efficiency Through Methodology
DVClub
 
Intel Xeon Pre-Silicon Validation: Introduction and Challenges
DVClub
 
UVM: Basic Sequences
Arrow Devices
 
Verification challenges and methodologies - SoC and ASICs
Dr. Shivananda Koteshwar
 
Functional verification techniques EW16 session
Sameh El-Ashry
 
UVM Methodology Tutorial
Arrow Devices
 
Coverage and Introduction to UVM
Dr. Shivananda Koteshwar
 
Intel Processors
home
 
Ad

Similar to Intel Atom Processor Pre-Silicon Verification Experience (20)

PDF
Sharam salamian
Obsidian Software
 
PDF
Zehr dv club_12052006
Obsidian Software
 
PDF
The Cortex-A15 Verification Story
DVClub
 
PDF
FUSION APU & TRENDS/ CHALLENGES IN FUTURE SoC DESIGN
Pankaj Singh
 
PPT
Vishakantaiah validating
Obsidian Software
 
PDF
OVERCOMING KEY CHALLENGES OF TODAY'S COMPLEX SOC: PERFORMANCE OPTIMIZATION AN...
Pankaj Singh
 
PPTX
Caqa5e ch1 with_review_and_examples
Aravindharamanan S
 
PPTX
Mirabilis_Presentation_SCC_July_2024.pptx
Deepak Shankar
 
PPTX
Mirabilis_Presentation_DAC_June_2024.pptx
Deepak Shankar
 
PDF
HC24.29.625-IA-23-Wide-Ruhl-Intel_2012_NTV_iA
Saurabh Dighe
 
PDF
Priorities Shift In IC Design
Abacus Technologies
 
PPTX
CAQA5e_ch1 (3).pptx
SPOCSumaLatha
 
PPTX
Embedded system hardware architecture ii
Grace Abraham
 
PPTX
Introduction to architecture exploration
Deepak Shankar
 
PDF
Low-Power Design and Verification
DVClub
 
PDF
Systems on chip (so c)
sandeep1721
 
PPTX
Gary Brown (Movidius, Intel): Deep Learning in AR: the 3 Year Horizon
AugmentedWorldExpo
 
PPT
QSpiders - Basic intel architecture
Qspiders - Software Testing Training Institute
 
PDF
Heterogeneous Computing : The Future of Systems
Anand Haridass
 
PDF
Keynote Speech - Low Power Seminar, Jain College, October 5th 2012
Dr. Shivananda Koteshwar
 
Sharam salamian
Obsidian Software
 
Zehr dv club_12052006
Obsidian Software
 
The Cortex-A15 Verification Story
DVClub
 
FUSION APU & TRENDS/ CHALLENGES IN FUTURE SoC DESIGN
Pankaj Singh
 
Vishakantaiah validating
Obsidian Software
 
OVERCOMING KEY CHALLENGES OF TODAY'S COMPLEX SOC: PERFORMANCE OPTIMIZATION AN...
Pankaj Singh
 
Caqa5e ch1 with_review_and_examples
Aravindharamanan S
 
Mirabilis_Presentation_SCC_July_2024.pptx
Deepak Shankar
 
Mirabilis_Presentation_DAC_June_2024.pptx
Deepak Shankar
 
HC24.29.625-IA-23-Wide-Ruhl-Intel_2012_NTV_iA
Saurabh Dighe
 
Priorities Shift In IC Design
Abacus Technologies
 
CAQA5e_ch1 (3).pptx
SPOCSumaLatha
 
Embedded system hardware architecture ii
Grace Abraham
 
Introduction to architecture exploration
Deepak Shankar
 
Low-Power Design and Verification
DVClub
 
Systems on chip (so c)
sandeep1721
 
Gary Brown (Movidius, Intel): Deep Learning in AR: the 3 Year Horizon
AugmentedWorldExpo
 
QSpiders - Basic intel architecture
Qspiders - Software Testing Training Institute
 
Heterogeneous Computing : The Future of Systems
Anand Haridass
 
Keynote Speech - Low Power Seminar, Jain College, October 5th 2012
Dr. Shivananda Koteshwar
 
Ad

More from DVClub (20)

PDF
IP Reuse Impact on Design Verification Management Across the Enterprise
DVClub
 
PDF
Cisco Base Environment Overview
DVClub
 
PDF
Stop Writing Assertions! Efficient Verification Methodology
DVClub
 
PPT
Verification Automation Using IPXACT
DVClub
 
PDF
Validation and Design in a Small Team Environment
DVClub
 
PDF
Trends in Mixed Signal Validation
DVClub
 
PDF
Verification In A Global Design Community
DVClub
 
PDF
Design Verification Using SystemC
DVClub
 
PDF
Verification Strategy for PCI-Express
DVClub
 
PDF
SystemVerilog Assertions (SVA) in the Design/Verification Process
DVClub
 
PDF
OpenSPARC T1 Processor
DVClub
 
PDF
Using Assertions in AMS Verification
DVClub
 
PDF
UVM Update: Register Package
DVClub
 
PDF
Verification of the QorIQ Communication Platform Containing CoreNet Fabric wi...
DVClub
 
PDF
Verification of Wireless SoCs: No Longer in the Dark Ages
DVClub
 
PDF
I Never Thought I Would Grow Up to be This Formal
DVClub
 
PDF
OMAP Verification
DVClub
 
PDF
Roy aeroVerifying Power Domains in AeroFONE
DVClub
 
PDF
Challenges in Using UVM at SoC Level
DVClub
 
PDF
Advanced Methodologies Used for Top-Level Verification of Mixed Signal Products
DVClub
 
IP Reuse Impact on Design Verification Management Across the Enterprise
DVClub
 
Cisco Base Environment Overview
DVClub
 
Stop Writing Assertions! Efficient Verification Methodology
DVClub
 
Verification Automation Using IPXACT
DVClub
 
Validation and Design in a Small Team Environment
DVClub
 
Trends in Mixed Signal Validation
DVClub
 
Verification In A Global Design Community
DVClub
 
Design Verification Using SystemC
DVClub
 
Verification Strategy for PCI-Express
DVClub
 
SystemVerilog Assertions (SVA) in the Design/Verification Process
DVClub
 
OpenSPARC T1 Processor
DVClub
 
Using Assertions in AMS Verification
DVClub
 
UVM Update: Register Package
DVClub
 
Verification of the QorIQ Communication Platform Containing CoreNet Fabric wi...
DVClub
 
Verification of Wireless SoCs: No Longer in the Dark Ages
DVClub
 
I Never Thought I Would Grow Up to be This Formal
DVClub
 
OMAP Verification
DVClub
 
Roy aeroVerifying Power Domains in AeroFONE
DVClub
 
Challenges in Using UVM at SoC Level
DVClub
 
Advanced Methodologies Used for Top-Level Verification of Mixed Signal Products
DVClub
 

Intel Atom Processor Pre-Silicon Verification Experience

  • 1. 1 Shahram Salamian Ultra Mobility Group Intel Corporation Intel® Atom™ Processor Pre-silicon Verification Experience
  • 2. 2 AgendaAgenda • Atom IntroductionAtom Introduction • Verification challengesVerification challenges • High level verification methodologyHigh level verification methodology • Verification development process and indicatorsVerification development process and indicators • Results & key learning'sResults & key learning's • Future challengesFuture challenges
  • 3. 3 ONLY THE MOST POWER EFFICIENT FEATURES WERE ADDED TO FURTHER IMPROVE PERFORMANCE 1% perf for 3% or more power 1% perf for 2% power 1% perf for 1% or less power ApplicationPower Performance Baseline core Micro-architecting for Low PowerMicro-architecting for Low Power • Started with single-issue, in order pipeline • Kept adding new features • Iterated until higher performance and performance/watt efficiency goals were met
  • 4. 4 Key Architecture Level FeaturesKey Architecture Level Features • ISA compatible with Intel® Core™ 2  including Intel® SSE3, SSSE3 and Intel®64 • Dual issue pipeline with dual-thread support • Full width SIMD integer and SP FP add support • Aggressive TDP/average/idle power management  Support for Enhanced Intel SpeedStep® Technology and ACPI states including Intel® Deep Power down Technology (C6) • Intel® Virtualization technology support • Burst mode capability to enable higher frequency operation on thermally limited designs • 800 MHZ – 1.8 GHz, .65W – 2.4W TDP
  • 5. 5 Micro-architecture FeaturesMicro-architecture Features  32KB Instruction cache with pre-decode extension32KB Instruction cache with pre-decode extension  Branch Trace Buffer, Gshare predictorBranch Trace Buffer, Gshare predictor  Return stack buffers (@ fetch, @ decode)Return stack buffers (@ fetch, @ decode)  Per-thread Instruction Scheduling QueuesPer-thread Instruction Scheduling Queues  Scheduler that can pick 2 ops from either thread per clockScheduler that can pick 2 ops from either thread per clock  128b SIMD Integer datapath (2 SIMD alus, 1 shuffle unit)128b SIMD Integer datapath (2 SIMD alus, 1 shuffle unit)  64b FP, SIMD integer multipliers64b FP, SIMD integer multipliers  FP adder with 128b support for SP adds (64b for others)FP adder with 128b support for SP adds (64b for others)  SIR (Safe Instruction Recognition) support to allow out-of-order commitsSIR (Safe Instruction Recognition) support to allow out-of-order commits  24KB Writeback Data cache24KB Writeback Data cache  Two-level Data TLB hierarchy with large and small page structuresTwo-level Data TLB hierarchy with large and small page structures  Hardware page walker (for instruction and data TLB misses)Hardware page walker (for instruction and data TLB misses)  Integer store-> load forwarding supportInteger store-> load forwarding support  512KB way Level2 Cache (256b per access) with inline ECC protection512KB way Level2 Cache (256b per access) with inline ECC protection  L2 cache, Data Cache hardware prefetchersL2 cache, Data Cache hardware prefetchers Fetch And Decode Scheduling FP/SIMD execution Memory execution
  • 6. 6 Power C-StatesPower C-States Core voltage Core clock PLL L1 caches L2 cache Wakeup time Power C0 HFM C0 LFM C1/C2 C4 C6 OFF OFF OFF OFF OFF Partial flush of f active active of f flushedflushed
  • 7. 7 Global Power ManagementGlobal Power Management • Global clock gating (C states)Global clock gating (C states) • Dynamic frequency and voltage scaling with Enhanced IntelDynamic frequency and voltage scaling with Enhanced Intel SpeedStep® Technology (enhanced C states)SpeedStep® Technology (enhanced C states) • PLL and Front Side Bus IOs shutdown and core voltagePLL and Front Side Bus IOs shutdown and core voltage reduction (deep C states)reduction (deep C states) • Sleep transistors allow dynamic power down of L2 cacheSleep transistors allow dynamic power down of L2 cache ways (Ultra-drowsy mode in deep C states)ways (Ultra-drowsy mode in deep C states) • Sleep transistors in L2 cache allow voltage to be reduced toSleep transistors in L2 cache allow voltage to be reduced to inactive logic (Drowsy mode)inactive logic (Drowsy mode) • CMOS mode on Front Side BusCMOS mode on Front Side Bus
  • 8. 8 AgendaAgenda • Atom IntroductionAtom Introduction • Verification challengesVerification challenges • High level verification methodologyHigh level verification methodology • Verification development process and indicatorsVerification development process and indicators • Results & key learning'sResults & key learning's • Future challengesFuture challenges
  • 9. 9 Verification ChallengesVerification Challenges  New ground up uArchitectureNew ground up uArchitecture • Large scale development on validation collateralsLarge scale development on validation collaterals  Aggressive development schedule to first TOAggressive development schedule to first TO  Constrained verification resource environmentConstrained verification resource environment • More acute for the first half of the project where newMore acute for the first half of the project where new development was taking placedevelopment was taking place  Introduction of significant power mgmt featuresIntroduction of significant power mgmt features  Minimize post-si validation qualification cycle, at least fromMinimize post-si validation qualification cycle, at least from function/logic perspectivefunction/logic perspective
  • 10. 10 AgendaAgenda • Atom introductionAtom introduction • Verification challengesVerification challenges • High level verification methodologyHigh level verification methodology • Verification development process and indicatorsVerification development process and indicators • Results & key learning'sResults & key learning's • Future challengesFuture challenges
  • 11. 11 High level verification approachesHigh level verification approaches  Cluster level validation & functional coverageCluster level validation & functional coverage  New x86 test case generator for Architecture validationNew x86 test case generator for Architecture validation  New power mgmt validation collaterals developmentNew power mgmt validation collaterals development  Formal property & model checking verificationFormal property & model checking verification  X86 legacy database reuse for verifying against ArchitectureX86 legacy database reuse for verifying against Architecture  Focus on debug, testability & survivability features from theFocus on debug, testability & survivability features from the startstart  Complementary full chip test content from post-si domainComplementary full chip test content from post-si domain  EmulationEmulation
  • 12. 12 Cluster level validationCluster level validation  Developed new test benches for all clusters (5 HW clusters) in SpecmanDeveloped new test benches for all clusters (5 HW clusters) in Specman “e”“e”  Test bench architecture enabled building “super clusters” as appropriateTest bench architecture enabled building “super clusters” as appropriate • Bus + Memory, Front end + schedulerBus + Memory, Front end + scheduler • Allowed exercising complicated inter-cluster protocol more effectivelyAllowed exercising complicated inter-cluster protocol more effectively  Scoreboard + checkers reused at full chip environmentScoreboard + checkers reused at full chip environment  Detailed scoreboard + checker reviews with Architecture + design teamsDetailed scoreboard + checker reviews with Architecture + design teams  Random templates + Specman constraints generated testsRandom templates + Specman constraints generated tests  New Arch simulator facilitated standalone Ucode validation environmentNew Arch simulator facilitated standalone Ucode validation environment
  • 13. 13 Functional CoverageFunctional Coverage  Significant emphasis on use of functional coverage in allSignificant emphasis on use of functional coverage in all clusters & full chipclusters & full chip  Propriety coverage language specification/collection/analysisPropriety coverage language specification/collection/analysis flowflow  Coverage space specification was result of collaboration withCoverage space specification was result of collaboration with architecture, design teamsarchitecture, design teams  Use coverage to identify test generation & content gap atUse coverage to identify test generation & content gap at cluster and FC environmentscluster and FC environments  Set goals for cluster and full chip functional coverage goalsSet goals for cluster and full chip functional coverage goals  Could not attain high tape out coverage goals in some areasCould not attain high tape out coverage goals in some areas - Detailed coverage analysis reviews with all stakeholders- Detailed coverage analysis reviews with all stakeholders - Post-si validation focused on low coverage areas- Post-si validation focused on low coverage areas
  • 14. 14 Architectural ValidationArchitectural Validation  Goal is to verify implementation adheres to all aspects of X86Goal is to verify implementation adheres to all aspects of X86 architecture (i.e Intel blue books)architecture (i.e Intel blue books)  Leveraged large legacy test base suiteLeveraged large legacy test base suite  Developed new X86 instruction set random test generatorDeveloped new X86 instruction set random test generator  Leveraged X86 Architectural simulator as reference model toLeveraged X86 Architectural simulator as reference model to check Architectural correctnesscheck Architectural correctness • Complemented by key cluster uArch checkersComplemented by key cluster uArch checkers  Developed special libraries to exercise power mgmt featuresDeveloped special libraries to exercise power mgmt features
  • 15. 15 It is all about Power stupid……It is all about Power stupid……  Imperative to get the power saving features right at A0Imperative to get the power saving features right at A0 • Early post-si power characterization was key requirementEarly post-si power characterization was key requirement • Enablement of power management SW development and debug asEnablement of power management SW development and debug as soon as possiblesoon as possible  Developed validation collaterals to inject random power eventsDeveloped validation collaterals to inject random power events & dynamic checkers for correct behavior& dynamic checkers for correct behavior • Power mgmt is collaboration of Ucode, HW & chipsetPower mgmt is collaboration of Ucode, HW & chipset • High level of random variables to create scenariosHigh level of random variables to create scenarios • Detailed checker to verify state & other power related featuresDetailed checker to verify state & other power related features  State corruption capability as power domain turned offState corruption capability as power domain turned off • Power specification approach was in its infancyPower specification approach was in its infancy • Developed tools to identify sequential elements & corrupt state asDeveloped tools to identify sequential elements & corrupt state as neededneeded  Clock gating logic traversal capability and coverageClock gating logic traversal capability and coverage specificationspecification
  • 16. 16 Formal VerificationFormal Verification  Applied model checkingApplied model checking • Selective areas on floating point algorithms with data pathSelective areas on floating point algorithms with data path contentcontent • Instruction length decode data pathInstruction length decode data path  Extensive use of assertionsExtensive use of assertions • Assumption checkingAssumption checking • ProtocolsProtocols • Multi-cycle pathsMulti-cycle paths • ~50% of non-complex assertions proven formally~50% of non-complex assertions proven formally
  • 17. 17 Si Debug & SurvivabilitySi Debug & Survivability  DebugDebug • Numerous debug features were incorporated in the designNumerous debug features were incorporated in the design • Treated debug features as important as functionalTreated debug features as important as functional • Involved post-si validation to define usage model, test contentInvolved post-si validation to define usage model, test content development, execution on pre-sidevelopment, execution on pre-si  Survivability – Making progress in the presence of bugsSurvivability – Making progress in the presence of bugs • Complex algorithms divided into smaller sub-algorithms, selectable withComplex algorithms divided into smaller sub-algorithms, selectable with ON/OFF flopON/OFF flop • Behavior on OFF reverts to simpler or slower performance or lowerBehavior on OFF reverts to simpler or slower performance or lower power behaviorpower behavior • Helps with bug isolationHelps with bug isolation • Randomized all ON/OFF flops in cluster & full chip random templates,Randomized all ON/OFF flops in cluster & full chip random templates, with checkers and coverage measurement enabledwith checkers and coverage measurement enabled
  • 18. 18 Post-Si content & EmulationPost-Si content & Emulation  Test content diversificationTest content diversification • Actively sought post-si test contentActively sought post-si test content – Allowed for post-si content development to start & mature earlyAllowed for post-si content development to start & mature early – Identified areas of improvement for the newly deployed ISAIdentified areas of improvement for the newly deployed ISA generatorgenerator – Additional, complementary source for coverageAdditional, complementary source for coverage  Focused on full chip emulationFocused on full chip emulation • Post-si team used emulation to develop & debug testPost-si team used emulation to develop & debug test contentcontent • Accelerate replay of failure tracesAccelerate replay of failure traces • Successful boot of Unix-like kernel before TOSuccessful boot of Unix-like kernel before TO
  • 19. 19 AgendaAgenda • Atom introductionAtom introduction • Verification challengesVerification challenges • High level verification methodologyHigh level verification methodology • Verification development process and indicatorsVerification development process and indicators • Results & key learning'sResults & key learning's • Future challengesFuture challenges
  • 20. 20 Verification Development ProcessVerification Development Process  Light verificationLight verification • Split RTL & test bench development into distinct phases & have enoughSplit RTL & test bench development into distinct phases & have enough features to allow full chip executionfeatures to allow full chip execution • At end of each phase, set of clear acceptance criteria had to be met toAt end of each phase, set of clear acceptance criteria had to be met to call phase donecall phase done • Increasingly stringent quality control to check RTL & test bench code inIncreasingly stringent quality control to check RTL & test bench code in data base with each phasedata base with each phase  Heavy verificationHeavy verification • Test planning, coverage specification phase, initial exerciseTest planning, coverage specification phase, initial exercise • Template writing, directed test writing, start coverage roll upTemplate writing, directed test writing, start coverage roll up • Coverage analysis with feedback to test generation & environmentCoverage analysis with feedback to test generation & environment • More randomness, more injectors, more stress scenarios as new bugMore randomness, more injectors, more stress scenarios as new bug arrival rate showed downward trendarrival rate showed downward trend
  • 21. 21 Verification MetricsVerification Metrics  Light ValidationLight Validation • Stage plan basedStage plan based • Verification collateral development was tightly coupled with RTL featuresVerification collateral development was tightly coupled with RTL features • Tight control on phase acceptance criteria contentTight control on phase acceptance criteria content  Heavy verification phase indicatorsHeavy verification phase indicators • % Coverage specification complete% Coverage specification complete • % Functional coverage% Functional coverage • % Tests written & passing rate for directed scenarios (DFT, DFV)% Tests written & passing rate for directed scenarios (DFT, DFV) • % Legacy tests passing rate% Legacy tests passing rate • Bug indicators such as incoming rate, open, ready-for-closure, type, etcBug indicators such as incoming rate, open, ready-for-closure, type, etc • Last but certainly not least, health of model indicatorLast but certainly not least, health of model indicator – Used by many previous generation CPU projectsUsed by many previous generation CPU projects – Intended to act as RTL functional health indicator to design, project mgmt, etcIntended to act as RTL functional health indicator to design, project mgmt, etc – Incorporates impact of unresolved bugs on validation progressIncorporates impact of unresolved bugs on validation progress – Incorporates subjective “progress” factor as voted by different areas ofIncorporates subjective “progress” factor as voted by different areas of validationvalidation
  • 22. 22 Health of the model indicatorHealth of the model indicator Atom Health of the Model 0 10 20 30 40 50 60 70 80 90 100 0 Qtr 4 Qtr 3 Qtr 2 Qtr 1 Qtr 0 Time to Tapeout Score Black – Atom Pink, Blue, Orange - Previous CPU projects Green Line – Desired zone Red Line - Urgent action required zone
  • 23. 23 AgendaAgenda • Atom introductionAtom introduction • Verification challengesVerification challenges • High level verification methodologyHigh level verification methodology • Verification development process and indicatorsVerification development process and indicators • Results & key learning'sResults & key learning's • Future challengesFuture challenges
  • 24. 24 ResultsResults  10 Hr boot from part arrival to Windows & Linux boot10 Hr boot from part arrival to Windows & Linux boot  Post-si activities were not blocked or curtailed as result ofPost-si activities were not blocked or curtailed as result of functional bugsfunctional bugs  Significant % of post-si found bugs were corner case, multi-Significant % of post-si found bugs were corner case, multi- dimensional, timing-dependent sequence scenariosdimensional, timing-dependent sequence scenarios  A small number of “we should have hit this” type of escapesA small number of “we should have hit this” type of escapes  All debug/survivability features worked. Significantly helpedAll debug/survivability features worked. Significantly helped with isolation & debug effortwith isolation & debug effort
  • 25. 25 SLT Total Bugs by Assigned Cluster Bugs subm itted betw een 2006_01 & 2006_52 BNL 2% BUS 22% DFT 5% FEC 16% FPC 5% IEC 13% MEC 8% uCODE 29% Results
  • 26. 26 ResultsResults Pre-si bug escapes to post-si 0 1 2 3 4 5 6 7 8 Intel Pentium Pro Intel Pentium 4 Intel Pentium 4 - 65 nm Intel Core- Duo Intel Core- Duo - 45 nm Intel Atom Intel Architecture CPU %oftotalpre-silicon bugs
  • 27. 27 Ultimate indicator of successful result is Intel® Atom™ powering many products in stores near you ! Results
  • 28. 28 Key Learning'sKey Learning's  Build & keep full chip healthy as soon as possibleBuild & keep full chip healthy as soon as possible  Keep stringent quality control to check-in RTL and/orKeep stringent quality control to check-in RTL and/or validation collaterals in the design databasevalidation collaterals in the design database  Putting same focus on DFT, DFV, etc as main streamPutting same focus on DFT, DFV, etc as main stream functionality pays off in long termfunctionality pays off in long term  Dig behind the indicators, especially when they show rosyDig behind the indicators, especially when they show rosy picturespictures  Small, highly focused, motivated team can do wonderfulSmall, highly focused, motivated team can do wonderful thingsthings
  • 29. 29 AgendaAgenda • Atom introductionAtom introduction • Verification challengesVerification challenges • High level verification methodologyHigh level verification methodology • Verification development process and indicatorsVerification development process and indicators • ResultsResults & key learnings& key learnings • Future challengesFuture challenges
  • 30. 30 Future challengesFuture challenges  Effective & systematic coverage specification andEffective & systematic coverage specification and measurement for post-si and how to tie ormeasurement for post-si and how to tie or leverage pre-sileverage pre-si  Ways to capture state & debug, especially asWays to capture state & debug, especially as move to even more integrated multi-core, multi-IPmove to even more integrated multi-core, multi-IP worldworld  Analog/digital, circuit, process i.e non-logic issuesAnalog/digital, circuit, process i.e non-logic issues are starting to dominate post-si qualification cycleare starting to dominate post-si qualification cycle