Australia’s National Science Agency
Liming Zhu
Research Director, CSIRO’s Data61
Conjoint Professor, UNSW
Responsible/Trustworthy AI in the Era of Foundation Models
All pencil drawings in this presentation are created by AI
What’s Responsible AI?
Responsible AI is the practice of developing
and using AI systems in a way that provides
benefits to individuals, groups, and wider
society, while minimizing the risk of
negative consequences.
Not model/algorithm
System requirements/quality
linked to benefit/risk impact
What about the System/SE Level?
2014-2015 2020-2022
ICSE23 TechDebt Keynote - Technical Debt in AI-based
Software Systems: Challenges and Approaches.
CSIRO’s Data61, Sherry Xu
ICSE23 DeepTest Keynote - Testing Generative Large Language
Model: Mission Impossible or Where Lies the Path?
CSIRO’s Data61, Zhenchang Xing
Trust Debt
Architecture Debt
Explainability Debt
Prompt Controllability/Testability
Modular/Testable AI Chains
Beyond Accuracy
Build/Evaluate -> Discover/Oversee
intentions -> agents -> oversee
• data foraging/synthesis
• emerging capabilities
• scalable (AI) oversight
https://medium.com/@itamar_f/software-3-0-the-era-of-intelligent-software-development-acd3cafe6cd7
https://karpathy.medium.com/software-2-0-a64152b37c35
requirements -> build -> evaluate
examples -> discover -> assess risk
Future directions
• (Learned) Guardrails
• Radical observability
• Understand rather than build
at the system-level
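The guardrail and observability directions above can be sketched at the system level, outside the model. This is a minimal illustration under stated assumptions, not the approach of any cited paper: `GuardedSystem`, `echo_model`, and `no_digits` are hypothetical names, the rules are hand-written rather than learned, and the model is a stand-in function.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class GuardedSystem:
    """Wraps a model call with input/output guardrails and an audit trace."""
    model: Callable[[str], str]  # hypothetical stand-in for a foundation-model call
    input_rules: List[Callable[[str], bool]] = field(default_factory=list)
    output_rules: List[Callable[[str], bool]] = field(default_factory=list)
    trace: List[Dict[str, object]] = field(default_factory=list)

    def run(self, prompt: str) -> str:
        # Input guardrail: every rule must accept the prompt.
        if not all(rule(prompt) for rule in self.input_rules):
            self.trace.append({"stage": "input", "allowed": False, "text": prompt})
            return "[blocked: input guardrail]"
        self.trace.append({"stage": "input", "allowed": True, "text": prompt})

        # Output guardrail: the answer is checked before it leaves the system.
        answer = self.model(prompt)
        allowed = all(rule(answer) for rule in self.output_rules)
        self.trace.append({"stage": "output", "allowed": allowed, "text": answer})
        return answer if allowed else "[blocked: output guardrail]"


def echo_model(prompt: str) -> str:
    """Toy model used only for illustration."""
    return f"echo: {prompt}"


def no_digits(text: str) -> bool:
    """Toy rule: reject any text containing digits (e.g. account numbers)."""
    return not any(ch.isdigit() for ch in text)
```

Because every decision is appended to `trace`, the system can be inspected and overseen without understanding the model's internals.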
Challenges & Trends
Principles: Australia’s AI ethics framework, OECD AI principles
Standards/Frameworks: NIST AI RMF, ISO Standards
Algorithms/Models
SE for RAI
…
1. The Vertical Gap – Alignment & Practices
Model Alignment != System Alignment
Principles/Standards != Eng. Practices
Lu, Q., Luo, Y., Zhu, L., Tang, M., Xu, X., Whittle, J., 2023. Operationalising Responsible AI Using a
Pattern-Oriented Approach: A Case Study on Chatbots in Financial Services. IEEE Intelligent Systems.
2. The Understanding Gap - Inscrutable
Do we have to fully understand the AI model?
Can system-level understanding help?
One More Thing – Here Come the LLMs
Lu, Q., Zhu, L., Xu, X., Xing, Z., Whittle, J., 2023. Towards Responsible AI in the Era of ChatGPT: A Reference
Architecture for Designing Foundation Model-based AI Systems. https://arxiv.org/abs/2304.11090
Directions & Questions
1. Close the Gaps – engineering practices
Lu, Q., Zhu, L., Xu, X., Whittle, J., Xing, Z., 2022. Towards a Roadmap on Software Engineering for
Responsible AI, in: 1st International Conference on AI Engineering (CAIN)
Measurements/Metrics, Evaluation/Verification/Validation Methods
Close the Gaps – operationalisable
Xia, B., Lu, Q., Perera, H., Zhu, L., Xing, Z., Liu, Y., Whittle, J., 2023. Towards Concrete and
Connected AI Risk Assessment (C2AIRA). 2nd International Conference on AI Engineering (CAIN)
Dozens of Frameworks
Which methods & tools
for which stakeholders?
Close the Gaps – Connected Patterns
Lu, Q., Zhu, L., Xu, X., Whittle, J., 2023. Responsible-AI-by-Design: A Pattern Collection for Designing Responsible
AI Systems. IEEE Software https://research.csiro.au/ss/science/projects/responsible-ai-pattern-catalogue/
Lee, S.U., Perera, H., Xia, B., Liu, Y., Lu, Q., Zhu, L., Salvado, O., Whittle, J., 2023. QB4AIRA: A Question Bank for AI
Risk Assessment. https://doi.org/10.48550/arXiv.2305.09300
2. Understand at the System Level
Increasingly, the study of these trained
(but un-designed) systems seems
destined to become a kind of natural
science…
… they are similar to the grand goals
of biology, which is to "figure out"
while being content to get by without
proofs or guarantees …
“AI as (an Ersatz) Natural Science?”
by Subbarao Kambhampati
Understanding via “Testing”
Zhuo, T.Y., Huang, Y., Chen, C., Xing, Z., 2023. Exploring AI Ethics of ChatGPT: A
Diagnostic Analysis https://arxiv.org/abs/2301.12867
ICSE23 DeepTest Keynote - Testing Generative Large Language Model:
Mission Impossible or Where Lies the Path? Zhenchang Xing, CSIRO’s Data61
Capability +/-/⊥ Alignment
Waluigi Effect prevents
model-level solution
Understanding via Accountability
No Agreed Best Practices
No Agreed Safety Test
Verifiable investment in safety
Accountability enforced by law/market
Understanding via Accountability
Xu, X., Wang, C., Wang, Jeff, Lu, Q., Zhu, L., 2022. Dependency tracking for risk
mitigation in machine learning systems, in: 44th ICSE
Xia, B., Bi, T., Xing, Z., Lu, Q., Zhu, L., 2023. An Empirical Study on Software
Bill of Materials: Where We Stand and the Road Ahead, in: 45th ICSE
Software Bills of Materials (SBOM)/AIBOM
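As a rough illustration of how an AIBOM could support accountability and dependency tracking, the sketch below records models, datasets, and services as components and walks their dependencies. It assumes a made-up record layout: `Component`, `build_aibom`, and `upstream_of` are hypothetical names and do not follow any SBOM standard or the formats studied in the cited papers.

```python
import hashlib
import json
from dataclasses import asdict, dataclass, field
from typing import Dict, List, Set


@dataclass
class Component:
    name: str
    kind: str        # e.g. "model", "dataset", "service"
    version: str
    supplier: str
    depends_on: List[str] = field(default_factory=list)


def build_aibom(system_name: str, components: List[Component]) -> Dict[str, object]:
    """Build a bill of materials with a content hash so later changes are detectable."""
    ordered = sorted(components, key=lambda c: c.name)  # stable order, stable hash
    body = {"system": system_name, "components": [asdict(c) for c in ordered]}
    digest = hashlib.sha256(json.dumps(body, sort_keys=True).encode("utf-8")).hexdigest()
    return {**body, "sha256": digest}


def upstream_of(aibom: Dict[str, object], name: str) -> Set[str]:
    """Return every component that `name` depends on, directly or transitively."""
    index = {c["name"]: c for c in aibom["components"]}
    seen: Set[str] = set()
    stack = list(index[name]["depends_on"])
    while stack:
        dep = stack.pop()
        if dep not in seen:
            seen.add(dep)
            # Unknown dependencies are kept in the result but not expanded.
            stack.extend(index.get(dep, {}).get("depends_on", []))
    return seen
```

When a risk is found in one component (say, a dataset), `upstream_of` can be run for each deployed service to see which of them inherit it.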
3. Design Foundation Model-based Systems
Lu, Q., Zhu, L., Xu, X., Xing, Z., Whittle, J., 2023. A Framework for Designing
Foundation Model based Systems https://arxiv.org/abs/2305.05352v1
LLM eating the traditional system functions
Moving boundaries, e.g. emerging capabilities
• Design with capabilities, not functionalities
• Design for capability evolution and agility
Tools being optimized for LLM/Agents
• Selected/Used by both human and LLM/Agents
• Trusted by human and LLM/Agents
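One possible reading of "design with capabilities, not functionalities" is that tools advertise what they can do in a form both people and agents can read, and callers select by capability rather than by a fixed function name. The sketch below is an assumption-laden illustration: `Tool`, `ToolRegistry`, and the capability strings are hypothetical and are not taken from the cited framework.

```python
from dataclasses import dataclass
from typing import Callable, Dict, FrozenSet, Optional


@dataclass(frozen=True)
class Tool:
    name: str
    description: str              # plain text, readable by a human or an LLM/agent
    capabilities: FrozenSet[str]  # what the tool can do, not how it is invoked
    run: Callable[[str], str]


class ToolRegistry:
    def __init__(self) -> None:
        self._tools: Dict[str, Tool] = {}

    def register(self, tool: Tool) -> None:
        # Re-registering a name replaces the tool, so capabilities can evolve.
        self._tools[tool.name] = tool

    def select(self, needed: str) -> Optional[Tool]:
        """Pick a tool by capability; ties are broken alphabetically by name."""
        matches = sorted(
            (t for t in self._tools.values() if needed in t.capabilities),
            key=lambda t: t.name,
        )
        return matches[0] if matches else None

    def catalogue(self) -> str:
        """Text listing that could be shown to a person or placed in a prompt."""
        ordered = sorted(self._tools.values(), key=lambda t: t.name)
        return "\n".join(f"{t.name}: {t.description}" for t in ordered)
```

Selecting by capability means a tool can be swapped for a more capable one without changing the callers, which is one way to leave room for capability evolution.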
Responsible AI for LLM-based Applications
Lu, Q., Zhu, L., Xu, X., Xing, Z., Whittle, J., 2023. Towards Responsible AI in the Era of ChatGPT: A Reference
Architecture for Designing Foundation Model-based AI Systems. http://arxiv.org/abs/2304.11090
RAI in the Era of Foundation Models
AI Engineering Directions
• (Learned) Guardrails
• Radical observability
• Understand rather than build
Responsible AI Engineering
• Close the principle-alg. gaps
• Engineering practices/methods
• Measurement/metrics
• Connected patterns
• Understand at the system level
• AIBOM & accountability
More info & Contact
https://research.csiro.au/ss/
Liming.Zhu@data61.csiro.au
Brendan.Omalley@data61.csiro.au
Coming out late 2023
Foundation Models
• Design with capabilities, not func.
• Design for system evolution
• Tools optimised for LLM/Agents
• Special RAI patterns
Collaborate with CSIRO’s Data61 on
• RAI Engineering best practices & evaluation
• LLM/Foundation model-based system design/eval
For the latest, follow me on
Twitter: @limingz
LinkedIn: Liming Zhu

Editor's Notes

  • #3: Not AI algorithms and models Functional and non-functional requirements AI alignment + existential risks; AI safety; ethical/law risks
  • #4: Entanglements, Cascades, Dependency, Unstable Data Dependencies, Hidden Feedback Loops Debt: Abstraction, Reproducibility “Federated data collection, storage, model, and infrastructure” Interaction with other teams “co-design and co-versioning”…
  • #8: Mechanics/physics Bridges and buildings Fully understand the human brain to trust No Empirical software engineering and testing. Level of understanding; I am not talking about you fully Why? My wife expecting, apology
  • #11: Governance to connect with management Process to connect with other practices
  • #14: The "science" suffix of computer science has sometimes been questioned and caricatured; perhaps not any longer, as AI becomes an ersatz natural science studying large learned artifacts. Likewise, LLMs are produced by a relatively simple training process (minimizing loss on next-token prediction, using a large training set from the internet, GitHub, Wikipedia etc.) but the resulting 175 billion parameter model is extremely inscrutable. This is why the field of “AI interpretability” exists at all: to probe large models such as LLMs, and understand how they are producing the incredible results they are producing. Increasingly, the study of these large trained (but un-designed) systems seems destined to become a kind of natural science, even if an ersatz one: observing the capabilities they seem to have, doing a few ablation studies here and there, and trying to develop at least a qualitative understanding of the best practices for getting good performance out of them. Modulo the fact that these are going to be studies of in vitro rather than in vivo artifacts, they are similar to the grand goals of biology, which is to "figure out" while being content to get by without proofs or guarantees. Indeed, machine learning is replete with research efforts focused more on why the system is doing what it is doing (sort of "FMRI studies" of large learned systems, if you will), instead of proving that we designed the system to do so. The knowledge we glean from such studies might allow us to intervene in modulating the system's behavior a little (as medicine does). The in vitro part does, of course, allow for far more targeted interventions than in vivo settings do. AI's turn to natural science also has implications for computer science at large, given the outsized impact AI seems to be having on almost all areas of computing.
Of course, there might be significant methodological resistance and reservations to this shift. After all, CS has long been used to the "correct by construction" holy grail, and from there it is quite a shift to getting used to living with systems that are at best incentivized ("dog trained") to be sort of correct, sort of like us humans! Indeed, in a 2003 lecture, Turing laureate Leslie Lamport sounded alarms about the very possibility of the future of computing belonging to biology rather than logic, saying it will lead us to living in a world of homeopathy and faith healing! To think that his angst was mostly at complex software systems that were still human-coded, rather than about these even more mysterious large learned models!
  • #18: Everyone is a requirements engineer, architect and tester/verifier.