SlideShare a Scribd company logo
Delivery Excellence:
Love is in the Air –
Rendezvous Architecture
for Analytics in Production
Munich, October 25, 2019, Daniel Schulz
AAC_RendezvousArchitecture.pptx
Love is in the Air
What is Rendezvous Architecture
and why is it Important?
AAC_RendezvousArchitecture.pptx
Why is Rendezvous Architecture Important?
• An Event-driven Architecture (EDA)
• Applies Event stream processing style for Supervised Learning tasks
• Solves common obstacles in productionizing Machine Learning Models for both
• Initial Go-Live &
• Updates in production
• Analogous to the Ivory Tower Effect in Enterprise Architecture
• Like the architect needs feedback from production systems to improve this & future architectures
• Data Scientists need production feedback from their ML Models in order to improve the upcoming versions of it
© 2019 Daniel Schulz. All rights reserved. 3
Taking Machine Learning Models to Production is Challenging
AAC_RendezvousArchitecture.pptx
Traditional Green/Blue Deployments – a Load Balancer Forwards
any Request to Exactly One Machine Learning Model
© 2019 Daniel Schulz. All rights reserved. 4
The Reverse Proxy Enables Green/Blue Deployments Only
Source & image courtesy: [MLL]
AAC_RendezvousArchitecture.pptx
Load Balanced,
Parallelized Models
Enable Multiple ML Models at Once
AAC_RendezvousArchitecture.pptx
Multiple Machine Learning Models in Parallel –
Adding a Message Queue Enables Concurrency of Models
© 2019 Daniel Schulz. All rights reserved. 6
But which Prediction to Choose in the End?
Source & image courtesy: [MLL]
AAC_RendezvousArchitecture.pptx
Multiple Machine Learning Models in Parallel –
Adding a Message Queue Enables Concurrency of Models
• Shortcomings:
• Somewhat challenging various models for reference against one another
• No comparing many model’s accuracies against one another
• Continuous improvement in Data Science resp. DataOps team unlikely due to lack of feedback
• Need to add return address (as an URI) and boolean flag to whether to return anything at all in the queue –
for the consumer to respond to the original (HTTP) request, which might be completed then
• Persistent Message Queues tend to be slower than pure in-memory ones;
Spark Streaming working in micro-batches might have higher latencies as well
© 2019 Daniel Schulz. All rights reserved. 7
But which Prediction to Choose in the End?
AAC_RendezvousArchitecture.pptx
Rendezvous
Architecture
Stop-back Request Guarantees,
Parallelized, Comparable,
Rapid Model Updates,
QoS Guarantees,
etc.
AAC_RendezvousArchitecture.pptx
The ML Model’s Rendezvous – a Scoring Stream Collects Various
Predictions & Returns them to the Original Request’s Client
© 2019 Daniel Schulz. All rights reserved. 9
Many Models – One Final Prediction
Source & image courtesy: [MLL]
AAC_RendezvousArchitecture.pptx
Stateless Models are Easier to Replicate –
but Data Augmentation Might be Helpful or Necessary
© 2019 Daniel Schulz. All rights reserved. 10
Any Additional Piece of or State Information Helpful to our Models Shall be Added in One Central Place
Source & image courtesy: [MLL]
AAC_RendezvousArchitecture.pptx
The Decoy Model –
Collect Production Data for Debugging, Optimizations and Reproducibility
© 2019 Daniel Schulz. All rights reserved. 11
The “Unit Tests” of AI Models in Production is Real-World Data
Source & image courtesy: [MLL]
AAC_RendezvousArchitecture.pptx
Discussion when to Add External Information
© 2019 Daniel Schulz. All rights reserved. 12
No Augmentation
▪ Models only receive request information
▪ More reliable for technical failure
▪ More overhead when many models fetch the same data –
use caches here
AAC_RendezvousArchitecture.pptx
Discussion when to Add External Information
© 2019 Daniel Schulz. All rights reserved. 13
No Augmentation
▪ Models only receive request information
▪ More reliable for technical failure
▪ More overhead when many models fetch the same data –
use caches here
All Augmentation in One Central Place
▪ Models receive all complete information
▪ Ideal case for reproducibility –
as the “Decoy Model” stores all information for
debugging and to explain predictions later-on
▪ Usually faster due to smaller overhead
→ Best Practice by Ellen Friedman & Ted Dunning,
Chapter 3, sub-section “Stateful Models” in [MLL]
AAC_RendezvousArchitecture.pptx
Add Metrics & the Canary Model for Monitoring & Optimizations
© 2019 Daniel Schulz. All rights reserved. 14
Metrics are Crucial to Compare Many Models Against One Another & Hence Supports Accuracy Improvements
Source & image courtesy: [MLL]
AAC_RendezvousArchitecture.pptx
Add Metrics & the Canary Model for Monitoring & Optimizations
© 2019 Daniel Schulz. All rights reserved. 15
Metrics
▪ Metrics monitor all model’s predictions and compares
their performances
▪ Hence, we are able to judge challenging model’s
performances compared to the incumbent model’s ones
▪ Metrics are helpful to detect outliers in predictions –
• e.g. Adversarial Images, where models might predict
obscure classifications
• e.g. detect swayed models, shift in input data’s
distributions, etc.
▪ Both technological SLA, timing & AI metrics
• SLA metrics:
latency, throughput, etc.
• Timing metrics:
computation time in threads, time for requests by
source/endpoint, etc.
• AI metrics:
accuracy, error metrics, AUC, F-statistics, etc.
AAC_RendezvousArchitecture.pptx
Add Metrics & the Canary Model for Monitoring & Optimizations
© 2019 Daniel Schulz. All rights reserved. 16
Metrics
▪ Metrics monitor all model’s predictions and compares
their performances
▪ Hence, we are able to judge challenging model’s
performances compared to the incumbent model’s ones
▪ Metrics are helpful to detect outliers in predictions –
• e.g. Adversarial Images, where models might predict
obscure classifications
• e.g. detect swayed models, shift in input data’s
distributions, etc.
▪ Both technological SLA, timing & AI metrics
• SLA metrics:
latency, throughput, etc.
• Timing metrics:
computation time in threads, time for requests by
source/endpoint, etc.
• AI metrics:
accuracy, error metrics, AUC, F-statistics, etc.
The Canary Model
▪ Also a Best Practice by [MLL] for finding anomalies
▪ Is a rather dated model that keeps predicting to compare
the newer models’ predictions with it – to help detect
how production-ready they really are
▪ The difference in Canary Model and later ones is proof of
progress – or lack thereof for the Data Scientists
AAC_RendezvousArchitecture.pptx
Rendezvous Architecture – a Mixture of Models in Harmony
• Advantages:
• Model “warm-up” in production-like environments
• Switch models in an instant – un-deploy & deploy AI models swiftly
• Introduce time guarantees: all models work in parallel like Cassandra queries
• Mix simple, technically robust (not failing) models along w/ more sophisticated ones, which might break suddenly
• Incumbent vs challenging – collect raw data and performance metrics for various models (e.g. XGBoost vs Random
Forests; e.g. Linear Model vs SVM; e.g. PCA vs T-SNE) and differing versions in model streams (version 0.1, 0.2, …)
• Backstop:
when taking too long, a simpler, less robust model may answer as a backstop for more complex, more sophisticated
models; the same applies when longer-term performance metrics might indicate another model would perform better
© 2019 Daniel Schulz. All rights reserved. 17
My Suggestion for Reliable AI Systems Due to…
AAC_RendezvousArchitecture.pptx
Résumé
Resilient, Extendable &
Production-ready Architecture for
Predictive Analytics
AAC_RendezvousArchitecture.pptx
Resumé on Rendezvous Architecture
© 2019 Daniel Schulz. All rights reserved. 19
Limitations
▪ Is no silver bullet – does not solve all logistical obstacles in AI projects
▪ Focusses on production-side architecture – development, test, QA and LTE environments might benefit from it
▪ Focus on technological, software architecture – does not cover ML Metrics, Hyperparameter Tuning, etc.
▪ Latencies increased a bit by Message-Queue-dependency
▪ I am not aware of any implementation as of now – neither OSS, nor to purchase, nor Cloud-based
AAC_RendezvousArchitecture.pptx
Resumé on Rendezvous Architecture
© 2019 Daniel Schulz. All rights reserved. 20
Major Advantages
▪ Manage multiple models in production and alike
environments
▪ Test-drive incumbent & challenging models against one
another
▪ Reproducibility & transparency of model’s predictions
▪ Solves common obstacles in productionizing
Machine Learning Models for both
• Initial Go-Live &
• Updates in production
AAC_RendezvousArchitecture.pptx
Resumé on Rendezvous Architecture
© 2019 Daniel Schulz. All rights reserved. 21
Major Advantages
▪ Manage multiple models in production and alike
environments
▪ Test-drive incumbent & challenging models against one
another
▪ Reproducibility & transparency of model’s predictions
▪ Solves common obstacles in productionizing
Machine Learning Models for both
• Initial Go-Live &
• Updates in production
Minor Benefits
▪ Collect real-world data for future development
▪ Establish baseline performance values for Predictive
Analytics
▪ Rapid development due to default, fallback models
▪ Latency guarantees for model predictions
resilient, robust Predictive models
in modern Agile & DevOps projects
AAC_RendezvousArchitecture.pptx
Rendezvous Architecture
is the Modern Bedrock
of Robust Predictive models for
Today’s Agile & DevOps Projects
AAC_RendezvousArchitecture.pptx
Thank You for Your Attention
© 2019 Daniel Schulz. All rights reserved. 23
Please Feel Free to Ask any Open Questions, Suggestions or Voice Your Opinion…
A global leader in consulting, technology services and digital transformation,
Capgemini is at the forefront of innovation to address the entire breadth of clients’
opportunities in the evolving world of cloud, digital and platforms. Building on its
strong 50-year heritage and deep industry-specific expertise, Capgemini enables
organizations to realize their business ambitions through an array of services from
strategy to operations. Capgemini is driven by the conviction that the business
value of technology comes from and through people. It is a multicultural company
of over 200,000 team members in more than 40 countries. The Group reported
2018 global revenues of EUR 13.2 billion.
About Capgemini
Learn more about us at
www.capgemini.com
This presentation contains information that may be privileged or confidential and
is the property of the Capgemini Group.
Copyright © 2019 Daniel Schulz. All rights reserved.
People matter, results count.
AAC_RendezvousArchitecture.pptx
Source & Image Courtesy from Book “Machine Learning Logistics”
© 2019 Daniel Schulz. All rights reserved. 25
“Machine Learning Logistics” by Ellen Friedman & Ted Dunning
▪ Authors: Ellen Friedman & Ted Dunning
▪ Publisher: O'Reilly Media, Inc.
▪ Release Date: October 2017
▪ ISBN: 9 7814 9199 7611
▪ Picture source ID: MLL
AAC_RendezvousArchitecture.pptx
Source from Book “Hands-on Machine Learning with Scikit-Learn,
Keras, and TensorFlow, 2nd Edition”
© 2019 Daniel Schulz. All rights reserved. 26
“Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow” by Aurélien Géron
▪ Author: Aurélien Géron
▪ Publisher: O'Reilly Media, Inc.
▪ Release Date: October 2019
▪ ISBN: 9 7814 9203 2649

More Related Content

What's hot (14)

PPT
Cp Repton
guestea711d0
 
PDF
Future Facilities 6SigmaDCX Overview HD
Robert Schmidt
 
PPTX
Designing for the Cloud Tutorial - QCon SF 2009
Stuart Charlton
 
PDF
Avoiding Cloud Computing Planning & Implementation Failure
Nathaniel Payne
 
PDF
Successfully Integrating MBSE Data Without Replication Using OSLC
Joseph Lopez, M.ISM
 
PPTX
A Software Engineering Perspective on SDN Programmability
Felipe Alencar
 
PDF
Migrating to Cloud: Inhouse Hadoop to Databricks (3)
Knoldus Inc.
 
PDF
Modeling and Forecasting – Effective Baselines for Capacity Management
Precisely
 
PDF
Modern Agile Software Architecture
Kannan Durairaj
 
PPTX
A Model-Driven Approach to Support Cloud Migration Process- A Language Infras...
Mahdi_Fahmideh
 
PPTX
Show Me the Money: Connecting Performance Engineering to Real Business Results
Correlsense
 
PDF
PROSTEP Api for the Digital Thread
Joseph Lopez, M.ISM
 
PDF
Adam boczek 2015 agile architecture in 10 steps v1.0
iasaglobal
 
PPTX
Monolith to serverless service based architectures in the enterprise
Sameh Deabes
 
Cp Repton
guestea711d0
 
Future Facilities 6SigmaDCX Overview HD
Robert Schmidt
 
Designing for the Cloud Tutorial - QCon SF 2009
Stuart Charlton
 
Avoiding Cloud Computing Planning & Implementation Failure
Nathaniel Payne
 
Successfully Integrating MBSE Data Without Replication Using OSLC
Joseph Lopez, M.ISM
 
A Software Engineering Perspective on SDN Programmability
Felipe Alencar
 
Migrating to Cloud: Inhouse Hadoop to Databricks (3)
Knoldus Inc.
 
Modeling and Forecasting – Effective Baselines for Capacity Management
Precisely
 
Modern Agile Software Architecture
Kannan Durairaj
 
A Model-Driven Approach to Support Cloud Migration Process- A Language Infras...
Mahdi_Fahmideh
 
Show Me the Money: Connecting Performance Engineering to Real Business Results
Correlsense
 
PROSTEP Api for the Digital Thread
Joseph Lopez, M.ISM
 
Adam boczek 2015 agile architecture in 10 steps v1.0
iasaglobal
 
Monolith to serverless service based architectures in the enterprise
Sameh Deabes
 

Similar to Productionizing Predictive Analytics using the Rendezvous Architecture - for Architects (20)

PDF
Productionizing Predictive Analytics using the Rendezvous Architecture - for ...
danielschulz2005
 
PPTX
ML Workshop 1: A New Architecture for Machine Learning Logistics
MapR Technologies
 
PPTX
Machine Learning Logistics
Ted Dunning
 
PDF
DataOps: An Agile Method for Data-Driven Organizations
Ellen Friedman
 
PPTX
Machine Learning Success: The Key to Easier Model Management
MapR Technologies
 
PPTX
The Hive Think Tank: Rendezvous Architecture Makes Machine Learning Logistics...
The Hive
 
PPTX
Machine Learning logistics
Ted Dunning
 
PDF
Distributed Near Real-Time Processing of Sensor Network Data Flows for Smart ...
Otávio Carvalho
 
PDF
How to build containerized architectures for deep learning - Data Festival 20...
Antje Barth
 
PDF
Future of AI-powered automation in business
Louis Dorard
 
PDF
Smart Data - The Foundation for Better Business Outcomes
DATAVERSITY
 
PDF
Open Blueprint for Real-Time Analytics in Retail: Strata Hadoop World 2017 S...
Grid Dynamics
 
PDF
Platforming the Major Analytic Use Cases for Modern Engineering
DATAVERSITY
 
PDF
1710 track3 zhu
Rising Media, Inc.
 
PDF
Latest Smadav Pro Crack latest version 2025
areebaramzanrr
 
PDF
Discover Lumion Pro - Lumion 3D Rendering Software for ...
mohsinbha003
 
PDF
Avast Premium Security 25.2.9898 Crack + License Key Till 2050
areebaramzanrr
 
PPTX
Lumion Pro Crack Licence Key 2025 Latest
leliaschamberger821
 
PPTX
IDM Crack 6.42 Build 26 Patch With Activation
leliaschamberger81
 
PDF
Analytics&IoT
Selvaraj Kesavan
 
Productionizing Predictive Analytics using the Rendezvous Architecture - for ...
danielschulz2005
 
ML Workshop 1: A New Architecture for Machine Learning Logistics
MapR Technologies
 
Machine Learning Logistics
Ted Dunning
 
DataOps: An Agile Method for Data-Driven Organizations
Ellen Friedman
 
Machine Learning Success: The Key to Easier Model Management
MapR Technologies
 
The Hive Think Tank: Rendezvous Architecture Makes Machine Learning Logistics...
The Hive
 
Machine Learning logistics
Ted Dunning
 
Distributed Near Real-Time Processing of Sensor Network Data Flows for Smart ...
Otávio Carvalho
 
How to build containerized architectures for deep learning - Data Festival 20...
Antje Barth
 
Future of AI-powered automation in business
Louis Dorard
 
Smart Data - The Foundation for Better Business Outcomes
DATAVERSITY
 
Open Blueprint for Real-Time Analytics in Retail: Strata Hadoop World 2017 S...
Grid Dynamics
 
Platforming the Major Analytic Use Cases for Modern Engineering
DATAVERSITY
 
1710 track3 zhu
Rising Media, Inc.
 
Latest Smadav Pro Crack latest version 2025
areebaramzanrr
 
Discover Lumion Pro - Lumion 3D Rendering Software for ...
mohsinbha003
 
Avast Premium Security 25.2.9898 Crack + License Key Till 2050
areebaramzanrr
 
Lumion Pro Crack Licence Key 2025 Latest
leliaschamberger821
 
IDM Crack 6.42 Build 26 Patch With Activation
leliaschamberger81
 
Analytics&IoT
Selvaraj Kesavan
 
Ad

Recently uploaded (20)

PPTX
AI Project Cycle and Ethical Frameworks.pptx
RiddhimaVarshney1
 
PPTX
Usage of Power BI for Pharmaceutical Data analysis.pptx
Anisha Herala
 
PDF
What does good look like - CRAP Brighton 8 July 2025
Jan Kierzyk
 
PPTX
Rocket-Launched-PowerPoint-Template.pptx
Arden31
 
PPTX
apidays Munich 2025 - Building an AWS Serverless Application with Terraform, ...
apidays
 
PDF
MusicVideoProjectRubric Animation production music video.pdf
ALBERTIANCASUGA
 
PDF
apidays Helsinki & North 2025 - API-Powered Journeys: Mobility in an API-Driv...
apidays
 
PDF
Building Production-Ready AI Agents with LangGraph.pdf
Tamanna
 
PDF
WEF_Future_of_Global_Fintech_Second_Edition_2025.pdf
AproximacionAlFuturo
 
PPTX
GenAI-Introduction-to-Copilot-for-Bing-March-2025-FOR-HUB.pptx
cleydsonborges1
 
PDF
Choosing the Right Database for Indexing.pdf
Tamanna
 
PDF
R Cookbook - Processing and Manipulating Geological spatial data with R.pdf
OtnielSimopiaref2
 
PDF
Avatar for apidays apidays PRO June 07, 2025 0 5 apidays Helsinki & North 2...
apidays
 
PPT
01 presentation finyyyal معهد معايره.ppt
eltohamym057
 
PPTX
Advanced_NLP_with_Transformers_PPT_final 50.pptx
Shiwani Gupta
 
PDF
List of all the AI prompt cheat codes.pdf
Avijit Kumar Roy
 
PPTX
apidays Munich 2025 - Building Telco-Aware Apps with Open Gateway APIs, Subhr...
apidays
 
PPTX
Climate Action.pptx action plan for climate
justfortalabat
 
PDF
Web Scraping with Google Gemini 2.0 .pdf
Tamanna
 
PDF
Performance Report Sample (Draft7).pdf
AmgadMaher5
 
AI Project Cycle and Ethical Frameworks.pptx
RiddhimaVarshney1
 
Usage of Power BI for Pharmaceutical Data analysis.pptx
Anisha Herala
 
What does good look like - CRAP Brighton 8 July 2025
Jan Kierzyk
 
Rocket-Launched-PowerPoint-Template.pptx
Arden31
 
apidays Munich 2025 - Building an AWS Serverless Application with Terraform, ...
apidays
 
MusicVideoProjectRubric Animation production music video.pdf
ALBERTIANCASUGA
 
apidays Helsinki & North 2025 - API-Powered Journeys: Mobility in an API-Driv...
apidays
 
Building Production-Ready AI Agents with LangGraph.pdf
Tamanna
 
WEF_Future_of_Global_Fintech_Second_Edition_2025.pdf
AproximacionAlFuturo
 
GenAI-Introduction-to-Copilot-for-Bing-March-2025-FOR-HUB.pptx
cleydsonborges1
 
Choosing the Right Database for Indexing.pdf
Tamanna
 
R Cookbook - Processing and Manipulating Geological spatial data with R.pdf
OtnielSimopiaref2
 
Avatar for apidays apidays PRO June 07, 2025 0 5 apidays Helsinki & North 2...
apidays
 
01 presentation finyyyal معهد معايره.ppt
eltohamym057
 
Advanced_NLP_with_Transformers_PPT_final 50.pptx
Shiwani Gupta
 
List of all the AI prompt cheat codes.pdf
Avijit Kumar Roy
 
apidays Munich 2025 - Building Telco-Aware Apps with Open Gateway APIs, Subhr...
apidays
 
Climate Action.pptx action plan for climate
justfortalabat
 
Web Scraping with Google Gemini 2.0 .pdf
Tamanna
 
Performance Report Sample (Draft7).pdf
AmgadMaher5
 
Ad

Productionizing Predictive Analytics using the Rendezvous Architecture - for Architects

  • 1. Delivery Excellence: Love is in the Air – Rendezvous Architecture for Analytics in Production Munich, October 25, 2019, Daniel Schulz
  • 2. AAC_RendezvousArchitecture.pptx Love is in the Air What is Rendezvous Architecture and why is it Important?
  • 3. AAC_RendezvousArchitecture.pptx Why is Rendezvous Architecture Important? • An Event-driven Architecture (EDA) • Applies Event stream processing style for Supervised Learning tasks • Solves common obstacles in productionizing Machine Learning Models for both • Initial Go-Live & • Updates in production • Analogous to the Ivory Tower Effect in Enterprise Architecture • Like the architect needs feedback from production systems to improve this & future architectures • Data Scientists need production feedback from their ML Models in order to improve the upcoming versions of it © 2019 Daniel Schulz. All rights reserved. 3 Taking Machine Learning Models to Production is Challenging
  • 4. AAC_RendezvousArchitecture.pptx Traditional Green/Blue Deployments – a Load Balancer Forwards any Request to Exactly One Machine Learning Model © 2019 Daniel Schulz. All rights reserved. 4 The Reverse Proxy Enables Green/Blue Deployments Only Source & image courtesy: [MLL]
  • 6. AAC_RendezvousArchitecture.pptx Multiple Machine Learning Models in Parallel – Adding a Message Queue Enables Concurrency of Models © 2019 Daniel Schulz. All rights reserved. 6 But which Prediction to Choose in the End? Source & image courtesy: [MLL]
  • 7. AAC_RendezvousArchitecture.pptx Multiple Machine Learning Models in Parallel – Adding a Message Queue Enables Concurrency of Models • Shortcomings: • Somewhat challenging various models for reference against one another • No comparing many model’s accuracies against one another • Continuous improvement in Data Science resp. DataOps team unlikely due to lack of feedback • Need to add return address (as an URI) and boolean flag to whether to return anything at all in the queue – for the consumer to respond to the original (HTTP) request, which might be completed then • Persistent Message Queues tend to be slower than pure in-memory ones; Spark Streaming working in micro-batches might have higher latencies as well © 2019 Daniel Schulz. All rights reserved. 7 But which Prediction to Choose in the End?
  • 9. AAC_RendezvousArchitecture.pptx The ML Model’s Rendezvous – a Scoring Stream Collects Various Predictions & Returns them to the Original Request’s Client © 2019 Daniel Schulz. All rights reserved. 9 Many Models – One Final Prediction Source & image courtesy: [MLL]
  • 10. AAC_RendezvousArchitecture.pptx Stateless Models are Easier to Replicate – but Data Augmentation Might be Helpful or Necessary © 2019 Daniel Schulz. All rights reserved. 10 Any Additional Piece of or State Information Helpful to our Models Shall be Added in One Central Place Source & image courtesy: [MLL]
  • 11. AAC_RendezvousArchitecture.pptx The Decoy Model – Collect Production Data for Debugging, Optimizations and Reproducibility © 2019 Daniel Schulz. All rights reserved. 11 The “Unit Tests” of AI Models in Production is Real-World Data Source & image courtesy: [MLL]
  • 12. AAC_RendezvousArchitecture.pptx Discussion when to Add External Information © 2019 Daniel Schulz. All rights reserved. 12 No Augmentation ▪ Models only receive request information ▪ More reliable for technical failure ▪ More overhead when many models fetch the same data – use caches here
  • 13. AAC_RendezvousArchitecture.pptx Discussion when to Add External Information © 2019 Daniel Schulz. All rights reserved. 13 No Augmentation ▪ Models only receive request information ▪ More reliable for technical failure ▪ More overhead when many models fetch the same data – use caches here All Augmentation in One Central Place ▪ Models receive all complete information ▪ Ideal case for reproducibility – as the “Decoy Model” stores all information for debugging and to explain predictions later-on ▪ Usually faster due to smaller overhead → Best Practice by Ellen Friedman & Ted Dunning, Chapter 3, sub-section “Stateful Models” in [MLL]
  • 14. AAC_RendezvousArchitecture.pptx Add Metrics & the Canary Model for Monitoring & Optimizations © 2019 Daniel Schulz. All rights reserved. 14 Metrics are Crucial to Compare Many Models Against One Another & Hence Supports Accuracy Improvements Source & image courtesy: [MLL]
  • 15. AAC_RendezvousArchitecture.pptx Add Metrics & the Canary Model for Monitoring & Optimizations © 2019 Daniel Schulz. All rights reserved. 15 Metrics ▪ Metrics monitor all model’s predictions and compares their performances ▪ Hence, we are able to judge challenging model’s performances compared to the incumbent model’s ones ▪ Metrics are helpful to detect outliers in predictions – • e.g. Adversarial Images, where models might predict obscure classifications • e.g. detect swayed models, shift in input data’s distributions, etc. ▪ Both technological SLA, timing & AI metrics • SLA metrics: latency, throughput, etc. • Timing metrics: computation time in threads, time for requests by source/endpoint, etc. • AI metrics: accuracy, error metrics, AUC, F-statistics, etc.
  • 16. AAC_RendezvousArchitecture.pptx Add Metrics & the Canary Model for Monitoring & Optimizations © 2019 Daniel Schulz. All rights reserved. 16 Metrics ▪ Metrics monitor all model’s predictions and compares their performances ▪ Hence, we are able to judge challenging model’s performances compared to the incumbent model’s ones ▪ Metrics are helpful to detect outliers in predictions – • e.g. Adversarial Images, where models might predict obscure classifications • e.g. detect swayed models, shift in input data’s distributions, etc. ▪ Both technological SLA, timing & AI metrics • SLA metrics: latency, throughput, etc. • Timing metrics: computation time in threads, time for requests by source/endpoint, etc. • AI metrics: accuracy, error metrics, AUC, F-statistics, etc. The Canary Model ▪ Also a Best Practice by [MLL] for finding anomalies ▪ Is a rather dated model that keeps predicting to compare the newer models’ predictions with it – to help detect how production-ready they really are ▪ The difference in Canary Model and later ones is proof of progress – or lack thereof for the Data Scientists
  • 17. AAC_RendezvousArchitecture.pptx Rendezvous Architecture – a Mixture of Models in Harmony • Advantages: • Model “warm-up” in production-like environments • Switch models in an instant – un-deploy & deploy AI models swiftly • Introduce time guarantees: all models work in parallel like Cassandra queries • Mix simple, technically robust (not failing) models along w/ more sophisticated ones, which might break suddenly • Incumbent vs challenging – collect raw data and performance metrics for various models (e.g. XGBoost vs Random Forests; e.g. Linear Model vs SVM; e.g. PCA vs T-SNE) and differing versions in model streams (version 0.1, 0.2, …) • Backstop: when taking too long, a simpler, less robust model may answer as a backstop for more complex, more sophisticated models; the same applies when longer-term performance metrics might indicate another model would perform better © 2019 Daniel Schulz. All rights reserved. 17 My Suggestion for Reliable AI Systems Due to…
  • 19. AAC_RendezvousArchitecture.pptx Resumé on Rendezvous Architecture © 2019 Daniel Schulz. All rights reserved. 19 Limitations ▪ Is no silver bullet – does not solve all logistical obstacles in AI projects ▪ Focusses on production-side architecture – development, test, QA and LTE environments might benefit from it ▪ Focus on technological, software architecture – does not cover ML Metrics, Hyperparameter Tuning, etc. ▪ Latencies increased a bit by Message-Queue-dependency ▪ I am not aware of any implementation as of now – neither OSS, nor to purchase, nor Cloud-based
  • 20. AAC_RendezvousArchitecture.pptx Resumé on Rendezvous Architecture © 2019 Daniel Schulz. All rights reserved. 20 Major Advantages ▪ Manage multiple models in production and alike environments ▪ Test-drive incumbent & challenging models against one another ▪ Reproducibility & transparency of model’s predictions ▪ Solves common obstacles in productionizing Machine Learning Models for both • Initial Go-Live & • Updates in production
  • 21. AAC_RendezvousArchitecture.pptx Resumé on Rendezvous Architecture © 2019 Daniel Schulz. All rights reserved. 21 Major Advantages ▪ Manage multiple models in production and alike environments ▪ Test-drive incumbent & challenging models against one another ▪ Reproducibility & transparency of model’s predictions ▪ Solves common obstacles in productionizing Machine Learning Models for both • Initial Go-Live & • Updates in production Minor Benefits ▪ Collect real-world data for future development ▪ Establish baseline performance values for Predictive Analytics ▪ Rapid development due to default, fallback models ▪ Latency guarantees for model predictions resilient, robust Predictive models in modern Agile & DevOps projects
  • 22. AAC_RendezvousArchitecture.pptx Rendezvous Architecture is the Modern Bedrock of Robust Predictive models for Today’s Agile & DevOps Projects
  • 23. AAC_RendezvousArchitecture.pptx Thank You for Your Attention © 2019 Daniel Schulz. All rights reserved. 23 Please Feel Free to Ask any Open Questions, Suggestions or Voice Your Opinion…
  • 24. A global leader in consulting, technology services and digital transformation, Capgemini is at the forefront of innovation to address the entire breadth of clients’ opportunities in the evolving world of cloud, digital and platforms. Building on its strong 50-year heritage and deep industry-specific expertise, Capgemini enables organizations to realize their business ambitions through an array of services from strategy to operations. Capgemini is driven by the conviction that the business value of technology comes from and through people. It is a multicultural company of over 200,000 team members in more than 40 countries. The Group reported 2018 global revenues of EUR 13.2 billion. About Capgemini Learn more about us at www.capgemini.com This presentation contains information that may be privileged or confidential and is the property of the Capgemini Group. Copyright © 2019 Daniel Schulz. All rights reserved. People matter, results count.
  • 25. AAC_RendezvousArchitecture.pptx Source & Image Courtesy from Book “Machine Learning Logistics” © 2019 Daniel Schulz. All rights reserved. 25 “Machine Learning Logistics” by Ellen Friedman & Ted Dunning ▪ Authors: Ellen Friedman & Ted Dunning ▪ Publisher: O'Reilly Media, Inc. ▪ Release Date: October 2017 ▪ ISBN: 9 7814 9199 7611 ▪ Picture source ID: MLL
  • 26. AAC_RendezvousArchitecture.pptx Source from Book “Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow, 2nd Edition” © 2019 Daniel Schulz. All rights reserved. 26 “Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow” by Aurélien Géron ▪ Author: Aurélien Géron ▪ Publisher: O'Reilly Media, Inc. ▪ Release Date: October 2019 ▪ ISBN: 9 7814 9203 2649