SlideShare a Scribd company logo
Agility in
Software 2.0
– Notebook Interfaces
and MLOps with
Buttresses and Rebars
6th Int’l. Conference on Lean and
Agile Software Development
January 22, 2022
RISE Research Institutes of Sweden
CC
BY-NC
2.0
K.
Edblom
Markus Borg
@mrksbrg
mrksbrg.com
Agile iterates fast.
Data science moves faster!
(CC
CC BY-NC 2.0 K. Edblom
Who is Markus?
• Development engineer, ABB 2007-2010
– Process automation
– Editor and compiler development
• PhD student, LundUniversity 2010-2015
– Requirements engineering and testing
– Traceability, change impact analysis
• Senior researcher, RISE 2015-
– Software engineering for AI/ML
Agility in Software 2.0 - Notebook Interfaces and MLOps with Buttresses and Rebars
Also Markus…
• Member of the board, Swedsoft
– Influence decision makers
– Write comment letters
– Facilitate networking
• Adjunct lecturer, Lund University
– Teaching software engineering
Software 2.0
Computational
Notebooks
MLOps
(CC BY-NY-ND 2.0 Flick: *Hajee)
Agility in Software 2.0 - Notebook Interfaces and MLOps with Buttresses and Rebars
Software 2.0?
”a large portion of real-world problems have the
property that it is significantly easier
to collect the data than to
explicitly write the program”
https://medium.com/@karpathy/software-2-0-a64152b37c35
Andrej Karpathy
Director of AI at Tesla
9
Karpathy’s Software 2.0
Software 1.0
• Humans write source code
• Other humans comprehend the
source code
Software 2.0
• Humans curate data and specify goals
• Backprop. and gradient descent produces
millions of weights
• Humans cannot comprehend mapping
from input to output
“computers’ ability to learn without
being explicitly programmed”
- Arthur Samuel (1959)
11
12
Cars
Car
f(x)
Not cars
Supervised learning
Neural
network
YOLO (You Only Look Once) by Redmon et al. (2016)
13
CC BY-NC 2.0
Flickr: @andreas_komodromos
15
Another ware!
Hardware
Software
AI
MLware
16
17
Established software quality
assurance no longer sufficient
18
MLware is different
Software engineering practices throughout the lifecycle must evolve
– New types of requirements (explainability, fairness, …)
– New architectures (system, neural networks, …)
– Configuration management (data, models, parameters, …)
– Testing levels (data, model, …)
– Operations (scaling, monitoring, retraining, …)
Computational
Notebooks
Markus
Borg
Martin
Jakobsson
Johan
Henriksson
20
Oskar
Handmark
21
Data science is not software engineering
Established best practices might not apply
Biggest difference is how experimental it is
• Maximum agility needed to quickly reach insights
CACE principle
• “Changing Anything Changes Everything”
22
Also different peopleware
Data scientists are not software engineers
• and maybe not software developers
• perhaps not even computer scientists
Data scientists master the art of taming data and train models
Analogy: research on development of scientific software
23
“Literate programming”
- Donald E. Knuth (1984)
24
Mix source code and explanatory text
Computational notebooks
Extended into “literate computing” with
three types of cells
• (Python) Source code
• Explanatory text describing the code
• Visual content
Promotes interaction!
Interpreter runs in the background
Cells can be executed in any order
Very agile, but very messy
26
27
Notebook collaboration pain points
Concurrent editing is confusing
Code management and refactoring
Replicability is low
Productization of a Notebook proof-of-concept is a big step
CHI’20
28
28
Very agile!
Loads of tools!
29
30
30
Jakobsson and Henriksson (2021)
https://lup.lub.lu.se/student-papers/search/publication/9066685
https://github.com/backtick-se/cowait
dataset versioning?
dependency management?
deployment?
real-time performance?
version control?
repeatability?
scalability?
monitoring?
feature store? model management?
Data Features Training Model
MLware sounds simple!
31
Sculley et al. (2015), Hidden Technical Debt in
Machine Learning Systems
In Proc. of the Advances in Neural Information Processing Systems 28
MLOps
34
What is
MLOps
anyway?
36
“MLOps is the standardization and
streamlining of machine learning life
cycle management”
As an engineering discipline, MLOps is a set of practices
that combines Machine Learning, DevOps and Data Engineering,
which aims to
deploy and maintain ML systems in production reliably and efficiently.
- Treveil et al., Introducing MLOps, O'Reilley
Media, Inc., 2020.
(CC BY 2.0 Flick: Kuhnt)
DevOps with ML specific additions
• Experiment tracking
• Model management
37
MLOps
Please!
How?
Fully managed end-to-end
solution by the hyperscalers
Custom-built on-prem pipelines
38
Agility in Software 2.0 - Notebook Interfaces and MLOps with Buttresses and Rebars
40
Data
Validation
Model
Training
Model
Validation
Packaging Deploy in
Test Env.
Deploy in
Prod. Env.
Training Deployment
Model
Monitoring
Software Development
ML Development
Data
Repos
Code
Repos
ML Ops
Release
Management
40
Published April 21, 2021
High Risk
Unacceptable
Prohibited:
Social Scoring, Public Facial Detection,
User Manipulation, …
Minimal Risk
Business as usual:
Video games, Camera
Effects, Spamfilters, …
Limited Risk
Increased Transparency:
Chatbots, Deep Fakes,
Emotion Recognition, …
Conformity Assessment:
1) X under product safety regulations
2) Education, employment, healthcare,
immigration, justice, …
43
High-risk AI providers must
• Document internal rigorous engineering activities
• Quality assurance, fairness, traceability, auditability, …
• Pass independent conformance assessment
• National Supervisory Authority
• Monitoring to continuously check compliance
If not? GDPR style punishment…
• Up to 6% global annual turnover!
44
Agile iterates fast.
Data science moves faster!
(CC
CC BY-NC 2.0 K. Edblom
Agility in
Software 2.0
– Notebook Interfaces
and MLOps with
Buttresses and Rebars
6th Int’l. Conference on Lean and
Agile Software Development
January 22, 2022
RISE Research Institutes of Sweden
CC
BY-NC
2.0
K.
Edblom
Markus Borg
@mrksbrg
mrksbrg.com

More Related Content

Similar to Agility in Software 2.0 - Notebook Interfaces and MLOps with Buttresses and Rebars (20)

PDF
Practical machine learning
Faizan Javed
 
KEY
Become Efficient or Die: The Story of BackType
nathanmarz
 
PDF
116 Machine learning for Product Managers
ProductCamp Boston
 
PPTX
Machine learning for product managers. Presented at Boston ProductCamp (June...
Mukund Seshadri
 
PDF
David Michels: DevOps My AI at AWS Community Day Midwest 2024
AWS Chicago
 
PDF
10 more lessons learned from building Machine Learning systems - MLConf
Xavier Amatriain
 
PDF
Xavier Amatriain, VP of Engineering, Quora at MLconf SF - 11/13/15
MLconf
 
PDF
10 more lessons learned from building Machine Learning systems
Xavier Amatriain
 
PDF
Sustainable & Composable Generative AI
Debmalya Biswas
 
PDF
Fixing the program my computer learned: End-user debugging of machine-learned...
City University London
 
PDF
From DevOps to MLOps: practical steps for a smooth transition
Anne-Marie Tousch
 
PDF
Dances with unicorns
EspritAgile
 
PPTX
2024-02-24_Session 1 - PMLE_UPDATED.pptx
gdgsurrey
 
PPTX
Agile Mumbai 2022 - Ashwinee Singh | Agile in AI or AI in Agile?
AgileNetwork
 
PDF
Productionising Machine Learning Models
Tash Bickley
 
PPTX
MLOps.pptx
sundharakumarkb1
 
PDF
Building Data Products with Python (Georgetown)
Benjamin Bengfort
 
DOC
jlettvin.resume.20160922.STAR
Jonathan Lettvin
 
PDF
EclipseCon France 2015 - Science Track
Boris Adryan
 
PDF
TechEvent 2019: Artificial Intelligence in Dev & Ops; Martin Luckow - Trivadis
Trivadis
 
Practical machine learning
Faizan Javed
 
Become Efficient or Die: The Story of BackType
nathanmarz
 
116 Machine learning for Product Managers
ProductCamp Boston
 
Machine learning for product managers. Presented at Boston ProductCamp (June...
Mukund Seshadri
 
David Michels: DevOps My AI at AWS Community Day Midwest 2024
AWS Chicago
 
10 more lessons learned from building Machine Learning systems - MLConf
Xavier Amatriain
 
Xavier Amatriain, VP of Engineering, Quora at MLconf SF - 11/13/15
MLconf
 
10 more lessons learned from building Machine Learning systems
Xavier Amatriain
 
Sustainable & Composable Generative AI
Debmalya Biswas
 
Fixing the program my computer learned: End-user debugging of machine-learned...
City University London
 
From DevOps to MLOps: practical steps for a smooth transition
Anne-Marie Tousch
 
Dances with unicorns
EspritAgile
 
2024-02-24_Session 1 - PMLE_UPDATED.pptx
gdgsurrey
 
Agile Mumbai 2022 - Ashwinee Singh | Agile in AI or AI in Agile?
AgileNetwork
 
Productionising Machine Learning Models
Tash Bickley
 
MLOps.pptx
sundharakumarkb1
 
Building Data Products with Python (Georgetown)
Benjamin Bengfort
 
jlettvin.resume.20160922.STAR
Jonathan Lettvin
 
EclipseCon France 2015 - Science Track
Boris Adryan
 
TechEvent 2019: Artificial Intelligence in Dev & Ops; Martin Luckow - Trivadis
Trivadis
 

More from Markus Borg (20)

PDF
Quality Assurance Of Generative Dialog Models in an evolving Conversationa...
Markus Borg
 
PDF
Test Automation with Grad-CAM Heatmaps - A Future Pipe Segment in MLOps for V...
Markus Borg
 
PDF
Digital Twins Are Not Monozygotic - Cross-Replicating ADAS Testing in Two Ind...
Markus Borg
 
PPTX
Illuminating a Blind Spot in Digitalization - Software Development in Sweden’...
Markus Borg
 
PPTX
Trained, Not Coded - Still Safe?
Markus Borg
 
PDF
SZZ Unleashed: An Open Implementation of the SZZ Algorithm
Markus Borg
 
PPTX
Explainability First! Cousteauing the Depths of Neural Networks
Markus Borg
 
PDF
Test Automation Research... Is That Really Needed in 2018?
Markus Borg
 
PPTX
Supporting Change Impact Analysis Using a Recommendation System - An Industri...
Markus Borg
 
PPTX
Component Source Origin Decisions in Practice - A Survey of Decision Making i...
Markus Borg
 
PPTX
Enabling Visual Analytics with Unity - Exploring Regression Test Results in A...
Markus Borg
 
PDF
Testing Quality Requirements of a System-of-Systems in the Public Sector - Ch...
Markus Borg
 
PDF
From Bugs to Decision Support - Selected Research Highlights
Markus Borg
 
PDF
Comparing Cousins – A Harmonized Analysis of Racket Sport Set Scores using Ra...
Markus Borg
 
PPTX
Automation in the Bug Flow - Machine Learning for Triaging and Tracing
Markus Borg
 
PPTX
Revisiting the Challenges in Aligning RE and V&V: Experiences from the Public...
Markus Borg
 
PDF
Enabling Traceability Reuse for Impact Analyses - Toward a Recommendation Sys...
Markus Borg
 
PPTX
Analyzing networks of issue reports
Markus Borg
 
PPTX
Findability through Traceability - A Realistic Application of Candidate Tr...
Markus Borg
 
PPTX
Recommendation Systems for Issue Management
Markus Borg
 
Quality Assurance Of Generative Dialog Models in an evolving Conversationa...
Markus Borg
 
Test Automation with Grad-CAM Heatmaps - A Future Pipe Segment in MLOps for V...
Markus Borg
 
Digital Twins Are Not Monozygotic - Cross-Replicating ADAS Testing in Two Ind...
Markus Borg
 
Illuminating a Blind Spot in Digitalization - Software Development in Sweden’...
Markus Borg
 
Trained, Not Coded - Still Safe?
Markus Borg
 
SZZ Unleashed: An Open Implementation of the SZZ Algorithm
Markus Borg
 
Explainability First! Cousteauing the Depths of Neural Networks
Markus Borg
 
Test Automation Research... Is That Really Needed in 2018?
Markus Borg
 
Supporting Change Impact Analysis Using a Recommendation System - An Industri...
Markus Borg
 
Component Source Origin Decisions in Practice - A Survey of Decision Making i...
Markus Borg
 
Enabling Visual Analytics with Unity - Exploring Regression Test Results in A...
Markus Borg
 
Testing Quality Requirements of a System-of-Systems in the Public Sector - Ch...
Markus Borg
 
From Bugs to Decision Support - Selected Research Highlights
Markus Borg
 
Comparing Cousins – A Harmonized Analysis of Racket Sport Set Scores using Ra...
Markus Borg
 
Automation in the Bug Flow - Machine Learning for Triaging and Tracing
Markus Borg
 
Revisiting the Challenges in Aligning RE and V&V: Experiences from the Public...
Markus Borg
 
Enabling Traceability Reuse for Impact Analyses - Toward a Recommendation Sys...
Markus Borg
 
Analyzing networks of issue reports
Markus Borg
 
Findability through Traceability - A Realistic Application of Candidate Tr...
Markus Borg
 
Recommendation Systems for Issue Management
Markus Borg
 
Ad

Recently uploaded (20)

PDF
Balancing Resource Capacity and Workloads with OnePlan – Avoid Overloading Te...
OnePlan Solutions
 
PDF
Troubleshooting Virtual Threads in Java!
Tier1 app
 
PDF
MiniTool Power Data Recovery Crack New Pre Activated Version Latest 2025
imang66g
 
PPTX
Employee salary prediction using Machine learning Project template.ppt
bhanuk27082004
 
PDF
SAP GUI Installation Guide for macOS (iOS) | Connect to SAP Systems on Mac
SAP Vista, an A L T Z E N Company
 
PDF
System Center 2025 vs. 2022; What’s new, what’s next_PDF.pdf
Q-Advise
 
PDF
advancepresentationskillshdhdhhdhdhdhhfhf
jasmenrojas249
 
PDF
Infrastructure planning and resilience - Keith Hastings.pptx.pdf
Safe Software
 
PPT
Activate_Methodology_Summary presentatio
annapureddyn
 
PDF
What companies do with Pharo (ESUG 2025)
ESUG
 
PDF
Protecting the Digital World Cyber Securit
dnthakkar16
 
PDF
How to Download and Install ADT (ABAP Development Tools) for Eclipse IDE | SA...
SAP Vista, an A L T Z E N Company
 
PDF
AWS_Agentic_AI_in_Indian_BFSI_A_Strategic_Blueprint_for_Customer.pdf
siddharthnetsavvies
 
PDF
Summary Of Odoo 18.1 to 18.4 : The Way For Odoo 19
CandidRoot Solutions Private Limited
 
PDF
Enhancing Healthcare RPM Platforms with Contextual AI Integration
Cadabra Studio
 
PDF
On Software Engineers' Productivity - Beyond Misleading Metrics
Romén Rodríguez-Gil
 
PDF
AI Image Enhancer: Revolutionizing Visual Quality”
docmasoom
 
PPTX
Explanation about Structures in C language.pptx
Veeral Rathod
 
PDF
New Download MiniTool Partition Wizard Crack Latest Version 2025
imang66g
 
PDF
New Download FL Studio Crack Full Version [Latest 2025]
imang66g
 
Balancing Resource Capacity and Workloads with OnePlan – Avoid Overloading Te...
OnePlan Solutions
 
Troubleshooting Virtual Threads in Java!
Tier1 app
 
MiniTool Power Data Recovery Crack New Pre Activated Version Latest 2025
imang66g
 
Employee salary prediction using Machine learning Project template.ppt
bhanuk27082004
 
SAP GUI Installation Guide for macOS (iOS) | Connect to SAP Systems on Mac
SAP Vista, an A L T Z E N Company
 
System Center 2025 vs. 2022; What’s new, what’s next_PDF.pdf
Q-Advise
 
advancepresentationskillshdhdhhdhdhdhhfhf
jasmenrojas249
 
Infrastructure planning and resilience - Keith Hastings.pptx.pdf
Safe Software
 
Activate_Methodology_Summary presentatio
annapureddyn
 
What companies do with Pharo (ESUG 2025)
ESUG
 
Protecting the Digital World Cyber Securit
dnthakkar16
 
How to Download and Install ADT (ABAP Development Tools) for Eclipse IDE | SA...
SAP Vista, an A L T Z E N Company
 
AWS_Agentic_AI_in_Indian_BFSI_A_Strategic_Blueprint_for_Customer.pdf
siddharthnetsavvies
 
Summary Of Odoo 18.1 to 18.4 : The Way For Odoo 19
CandidRoot Solutions Private Limited
 
Enhancing Healthcare RPM Platforms with Contextual AI Integration
Cadabra Studio
 
On Software Engineers' Productivity - Beyond Misleading Metrics
Romén Rodríguez-Gil
 
AI Image Enhancer: Revolutionizing Visual Quality”
docmasoom
 
Explanation about Structures in C language.pptx
Veeral Rathod
 
New Download MiniTool Partition Wizard Crack Latest Version 2025
imang66g
 
New Download FL Studio Crack Full Version [Latest 2025]
imang66g
 
Ad

Agility in Software 2.0 - Notebook Interfaces and MLOps with Buttresses and Rebars

  • 1. Agility in Software 2.0 – Notebook Interfaces and MLOps with Buttresses and Rebars 6th Int’l. Conference on Lean and Agile Software Development January 22, 2022 RISE Research Institutes of Sweden CC BY-NC 2.0 K. Edblom Markus Borg @mrksbrg mrksbrg.com
  • 2. Agile iterates fast. Data science moves faster! (CC CC BY-NC 2.0 K. Edblom
  • 3. Who is Markus? • Development engineer, ABB 2007-2010 – Process automation – Editor and compiler development • PhD student, LundUniversity 2010-2015 – Requirements engineering and testing – Traceability, change impact analysis • Senior researcher, RISE 2015- – Software engineering for AI/ML
  • 5. Also Markus… • Member of the board, Swedsoft – Influence decision makers – Write comment letters – Facilitate networking • Adjunct lecturer, Lund University – Teaching software engineering
  • 9. ”a large portion of real-world problems have the property that it is significantly easier to collect the data than to explicitly write the program” https://medium.com/@karpathy/software-2-0-a64152b37c35 Andrej Karpathy Director of AI at Tesla 9
  • 10. Karpathy’s Software 2.0 Software 1.0 • Humans write source code • Other humans comprehend the source code Software 2.0 • Humans curate data and specify goals • Backprop. and gradient descent produces millions of weights • Humans cannot comprehend mapping from input to output
  • 11. “computers’ ability to learn without being explicitly programmed” - Arthur Samuel (1959) 11
  • 13. Neural network YOLO (You Only Look Once) by Redmon et al. (2016) 13
  • 14. CC BY-NC 2.0 Flickr: @andreas_komodromos
  • 16. 16
  • 18. 18 MLware is different Software engineering practices throughout the lifecycle must evolve – New types of requirements (explainability, fairness, …) – New architectures (system, neural networks, …) – Configuration management (data, models, parameters, …) – Testing levels (data, model, …) – Operations (scaling, monitoring, retraining, …)
  • 21. 21 Data science is not software engineering Established best practices might not apply Biggest difference is how experimental it is • Maximum agility needed to quickly reach insights CACE principle • “Changing Anything Changes Everything”
  • 22. 22 Also different peopleware Data scientists are not software engineers • and maybe not software developers • perhaps not even computer scientists Data scientists master the art of taming data and train models Analogy: research on development of scientific software
  • 23. 23
  • 24. “Literate programming” - Donald E. Knuth (1984) 24 Mix source code and explanatory text
  • 25. Computational notebooks Extended into “literate computing” with three types of cells • (Python) Source code • Explanatory text describing the code • Visual content Promotes interaction! Interpreter runs in the background Cells can be executed in any order Very agile, but very messy
  • 26. 26
  • 27. 27 Notebook collaboration pain points Concurrent editing is confusing Code management and refactoring Replicability is low Productization of a Notebook proof-of-concept is a big step CHI’20
  • 29. 29
  • 30. 30 30 Jakobsson and Henriksson (2021) https://lup.lub.lu.se/student-papers/search/publication/9066685 https://github.com/backtick-se/cowait
  • 31. dataset versioning? dependency management? deployment? real-time performance? version control? repeatability? scalability? monitoring? feature store? model management? Data Features Training Model MLware sounds simple! 31
  • 32. Sculley et al. (2015), Hidden Technical Debt in Machine Learning Systems In Proc. of the Advances in Neural Information Processing Systems 28
  • 33. MLOps
  • 34. 34
  • 36. 36 “MLOps is the standardization and streamlining of machine learning life cycle management” As an engineering discipline, MLOps is a set of practices that combines Machine Learning, DevOps and Data Engineering, which aims to deploy and maintain ML systems in production reliably and efficiently. - Treveil et al., Introducing MLOps, O'Reilley Media, Inc., 2020.
  • 37. (CC BY 2.0 Flick: Kuhnt) DevOps with ML specific additions • Experiment tracking • Model management 37
  • 38. MLOps Please! How? Fully managed end-to-end solution by the hyperscalers Custom-built on-prem pipelines 38
  • 40. 40 Data Validation Model Training Model Validation Packaging Deploy in Test Env. Deploy in Prod. Env. Training Deployment Model Monitoring Software Development ML Development Data Repos Code Repos ML Ops Release Management 40
  • 42. High Risk Unacceptable Prohibited: Social Scoring, Public Facial Detection, User Manipulation, … Minimal Risk Business as usual: Video games, Camera Effects, Spamfilters, … Limited Risk Increased Transparency: Chatbots, Deep Fakes, Emotion Recognition, … Conformity Assessment: 1) X under product safety regulations 2) Education, employment, healthcare, immigration, justice, …
  • 43. 43 High-risk AI providers must • Document internal rigorous engineering activities • Quality assurance, fairness, traceability, auditability, … • Pass independent conformance assessment • National Supervisory Authority • Monitoring to continuously check compliance If not? GDPR style punishment… • Up to 6% global annual turnover!
  • 44. 44
  • 45. Agile iterates fast. Data science moves faster! (CC CC BY-NC 2.0 K. Edblom
  • 46. Agility in Software 2.0 – Notebook Interfaces and MLOps with Buttresses and Rebars 6th Int’l. Conference on Lean and Agile Software Development January 22, 2022 RISE Research Institutes of Sweden CC BY-NC 2.0 K. Edblom Markus Borg @mrksbrg mrksbrg.com