SlideShare a Scribd company logo
QoA4ML – A Framework for
Supporting Contracts in
Machine Learning Services
Hong-Linh Truong, Minh-Tri Nguyen
Department of Computer Science
https://rdsea.github.io
Outline
▪ Context, scenario and research questions
▪ Key components of the QoA4ML framework
▪ Prototype and experiments
▪ Conclusions and future work
September 9, 2021
IEEE International Conference on Web Services (ICWS) 2021
2
Context
▪ Machine learning as a service becomes popular
▪ ML service providers offer ML services for different consumers
▪ Different stakeholders and interaction models
▪ Two stakeholders engagement: consumer and ML service provider
▪ Three stakeholders engagement: consumer, ML service provider
and ML infrastructure/platform provider
▪ Key issue
▪ How do we support contracts between the ML service provider and
other stakeholders? It is not just about performance!
▪ ML has several distinguishable attributes
September 9, 2021
IEEE International Conference on Web Services (ICWS) 2021
3
Scenario: predictive maintenance in
Base Transceiver Stations (BTS)
▪ Dynamic inference from IoT data about equipment and infrastructure
components in a BTS
September 9, 2021
IEEE International Conference on Web Services (ICWS) 2021
4
Key research questions & our approach
▪ Which are key attributes for ML contracts?
▪ How would ML attributes and constraints be specified?
▪ How would ML-specific attributes/constraints be monitored and
evaluated?
▪ Approach
▪ Focus on ML-specific attributes
▪ Researchers have identified many attributes for ML models and systems
▪ Design ML contract specs suitable for cloud-native services
▪ Constraints, policies and monitoring reports
▪ Monitor ML attributes for contract monitoring
September 9, 2021
IEEE International Conference on Web Services (ICWS) 2021
5
QoA4ML framework – important attributes
for ML-specific contracts
▪ Focus on important categories
▪ Inference Accuracy, Reliability and Elasticity, Quality of Data,
Security and Privacy, Fairness and Interpretability and Cost
September 9, 2021
IEEE International Conference on Web Services (ICWS) 2021
6
QoA4ML specifications
▪ Decoupling attributes/constraints vs policies
▪ Required attributes and their constraints can be changed and
updated at runtime
▪ Policies to check attributes and constraints can be implemented in
different ways
▪ Monitoring probes and other utilities supporting observability
▪ New probes for quality of data and ML models
▪ Need to be instrumented and deployed to capture runtime attributes
▪ Must be well integrated with common monitoring features
September 9, 2021
IEEE International Conference on Web Services (ICWS) 2021
7
QoA4ML
specification -
the initial
implementation
September 9, 2021
IEEE International Conference on Web Services (ICWS) 2021
8
Constraints for the BTS ML service
▪ Use terms in the
QoA4ML specs
▪ Attributes and
constraints can be
changed
September 9, 2021
IEEE International Conference on Web Services (ICWS) 2021
9
Example of policies for validating contract
constraints
▪ Is based on Rego
▪ Can load contract
terms from JSON
and compare with
runtime monitoring
▪ Can be changed at
runtime
September 9, 2021
IEEE International Conference on Web Services (ICWS) 2021
10
Monitoring utilities and Observability
Service
September 9, 2021
IEEE International Conference on Web Services (ICWS) 2021
11
Design for different
engines to be used
Reuse well-known
monitoring systems
Monitor ML-specific
attributes
Current prototype
▪ QoA4ML Specs: initial version based on JSON
▪ Use OPA (https://www.openpolicyagent.org/) as engine
▪ Rego and JSON are used for policies, attributes and constraints
▪ QoA4ML Observability as microservices
▪ Using state-of-the-art monitoring tools like
Prometheus/Grafana
▪ Testing environments
▪ Edge and cloud infrastructures
▪ Source code is currently being pushed into:
▪ https://github.com/rdsea/QoA4ML
September 9, 2021
IEEE International Conference on Web Services (ICWS) 2021
12
Experiments
▪ Dynamic inferences of BTS load of power grid
▪ LSTM, TensorFlow
▪ IoT data from BTS (several months)
▪ Training in cloud and export to the edge (BTS-model-edge)
and retraining several times in the cloud (BTS-model-cloud)
▪ Deployment
▪ Contracts:
▪ ResponseTime
▪ Inference Accuracy
▪ Data Quality
September 9, 2021
IEEE International Conference on Web Services (ICWS) 2021
13
Effect of edge and cloud serving
platform deployment in ML contracts.
September 9, 2021
IEEE International Conference on Web Services (ICWS) 2021
14
Both consumer and service are in the
same edge; 3000 records per 15 minutes
Both consumer and broker are in the same edge
Broker is in the cloud
Impact of violation monitoring
All services in the edge (except the observability)
September 9, 2021
IEEE International Conference on Web Services (ICWS) 2021
15
Help to detect outdated models in ML
services: violation changes when
retraining models
Help to see
correlations
among
attributes:
data quality
and inference
accuracy
Conclusions and future work
▪ QoA4ML is a framework to support ML service contracts
▪ Contract specifications (constraints and policies), tools and services
▪ QoA4ML benefits
▪ Establish contracts, moving to the step of continuous testing and
observability of ML production
▪ Support flexible contracts and policies, enabling reuses and real-
world ML services integration
▪ Future work
▪ Extending ML attributes and specifications; integration with cloud
service contracts; new probes and observability capabilities
September 9, 2021
IEEE International Conference on Web Services (ICWS) 2021
16
Thanks!
Hong-Linh Truong
Department of Computer Science
rdsea.github.io
IEEE International Conference on Web Services (ICWS) 2021
17
September 9, 2021

More Related Content

Similar to QoA4ML – A Framework for Supporting Contracts in Machine Learning Services (20)

PPTX
MLOps and Data Quality: Deploying Reliable ML Models in Production
Provectus
 
PDF
Data ops: Machine Learning in production
Stepan Pushkarev
 
PDF
CI/CD for Machine Learning
C4Media
 
PDF
Sustainable & Composable Generative AI
Debmalya Biswas
 
PDF
Slides-Артем Коваль-Cloud-Native MLOps Framework - DataFest 2021.pdf
vitm11
 
PPTX
MOPs & ML Pipelines on GCP - Session 6, RGDC
gdgsurrey
 
PDF
Introducing MLOps.pdf
Dr. Anish Cheriyan (PhD)
 
PDF
Ml ops intro session
Avinash Patil
 
PDF
Machine learning at scale challenges and solutions
Stavros Kontopoulos
 
PPTX
Big Data Pipelines and Machine Learning at Uber
Sudhir Tonse
 
PPTX
ICML'16 Scaling ML System@Twitter
Jack Xiaojiang Guo
 
PPTX
Compositional AI: Fusion of AI/ML Services
Debmalya Biswas
 
PDF
Considerations for Abstracting Complexities of a Real-Time ML Platform, Zhenz...
HostedbyConfluent
 
PDF
Rsqrd AI: How to Design a Reliable and Reproducible Pipeline
Sanjana Chowdhury
 
PDF
MLSD18. Automating Machine Learning Workflows
BigML, Inc
 
PPTX
Open, Secure & Transparent AI Pipelines
Nick Pentreath
 
PDF
Rete network slicing for Model Queries
Zoltán Ujhelyi
 
PDF
Maintainability Challenges inML:ASLR
SEAA 2022
 
PPTX
Feature Store as a Data Foundation for Machine Learning
Provectus
 
PDF
DutchMLSchool. ML: A Technical Perspective
BigML, Inc
 
MLOps and Data Quality: Deploying Reliable ML Models in Production
Provectus
 
Data ops: Machine Learning in production
Stepan Pushkarev
 
CI/CD for Machine Learning
C4Media
 
Sustainable & Composable Generative AI
Debmalya Biswas
 
Slides-Артем Коваль-Cloud-Native MLOps Framework - DataFest 2021.pdf
vitm11
 
MOPs & ML Pipelines on GCP - Session 6, RGDC
gdgsurrey
 
Introducing MLOps.pdf
Dr. Anish Cheriyan (PhD)
 
Ml ops intro session
Avinash Patil
 
Machine learning at scale challenges and solutions
Stavros Kontopoulos
 
Big Data Pipelines and Machine Learning at Uber
Sudhir Tonse
 
ICML'16 Scaling ML System@Twitter
Jack Xiaojiang Guo
 
Compositional AI: Fusion of AI/ML Services
Debmalya Biswas
 
Considerations for Abstracting Complexities of a Real-Time ML Platform, Zhenz...
HostedbyConfluent
 
Rsqrd AI: How to Design a Reliable and Reproducible Pipeline
Sanjana Chowdhury
 
MLSD18. Automating Machine Learning Workflows
BigML, Inc
 
Open, Secure & Transparent AI Pipelines
Nick Pentreath
 
Rete network slicing for Model Queries
Zoltán Ujhelyi
 
Maintainability Challenges inML:ASLR
SEAA 2022
 
Feature Store as a Data Foundation for Machine Learning
Provectus
 
DutchMLSchool. ML: A Technical Perspective
BigML, Inc
 

More from Hong-Linh Truong (20)

PDF
Measuring, Quantifying, & Predicting the Cost-Accuracy Tradeoff
Hong-Linh Truong
 
PDF
DevOps for Dynamic Interoperability of IoT, Edge and Cloud Systems
Hong-Linh Truong
 
PDF
Dynamic IoT data, protocol, and middleware interoperability with resource sli...
Hong-Linh Truong
 
PDF
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
Hong-Linh Truong
 
PDF
Modeling and Provisioning IoT Cloud Systems for Testing Uncertainties
Hong-Linh Truong
 
PDF
Characterizing Incidents in Cloud-based IoT Data Analytics
Hong-Linh Truong
 
PDF
Enabling Edge Analytics of IoT Data: The Case of LoRaWAN
Hong-Linh Truong
 
PDF
Analytics of Performance and Data Quality for Mobile Edge Cloud Applications
Hong-Linh Truong
 
PDF
Testing Uncertainty of Cyber-Physical Systems in IoT Cloud Infrastructures: C...
Hong-Linh Truong
 
PDF
Deep Context-Awareness: Context Coupling and New Types of Context Information...
Hong-Linh Truong
 
PDF
Managing and Testing Ensembles of IoT, Network functions, and Clouds
Hong-Linh Truong
 
PDF
Towards a Resource Slice Interoperability Hub for IoT
Hong-Linh Truong
 
PDF
On Supporting Contract-aware IoT Dataspace Services
Hong-Linh Truong
 
PDF
Towards the Realization of Multi-dimensional Elasticity for Distributed Cloud...
Hong-Linh Truong
 
PDF
On Engineering Analytics of Elastic IoT Cloud Systems
Hong-Linh Truong
 
PDF
HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...
Hong-Linh Truong
 
PDF
SINC – An Information-Centric Approach for End-to-End IoT Cloud Resource Prov...
Hong-Linh Truong
 
PDF
Governing Elastic IoT Cloud Systems under Uncertainties
Hong-Linh Truong
 
PDF
SmartSociety – A Platform for Collaborative People-Machine Computation
Hong-Linh Truong
 
PDF
On Developing and Operating of Data Elasticity Management Process
Hong-Linh Truong
 
Measuring, Quantifying, & Predicting the Cost-Accuracy Tradeoff
Hong-Linh Truong
 
DevOps for Dynamic Interoperability of IoT, Edge and Cloud Systems
Hong-Linh Truong
 
Dynamic IoT data, protocol, and middleware interoperability with resource sli...
Hong-Linh Truong
 
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
Hong-Linh Truong
 
Modeling and Provisioning IoT Cloud Systems for Testing Uncertainties
Hong-Linh Truong
 
Characterizing Incidents in Cloud-based IoT Data Analytics
Hong-Linh Truong
 
Enabling Edge Analytics of IoT Data: The Case of LoRaWAN
Hong-Linh Truong
 
Analytics of Performance and Data Quality for Mobile Edge Cloud Applications
Hong-Linh Truong
 
Testing Uncertainty of Cyber-Physical Systems in IoT Cloud Infrastructures: C...
Hong-Linh Truong
 
Deep Context-Awareness: Context Coupling and New Types of Context Information...
Hong-Linh Truong
 
Managing and Testing Ensembles of IoT, Network functions, and Clouds
Hong-Linh Truong
 
Towards a Resource Slice Interoperability Hub for IoT
Hong-Linh Truong
 
On Supporting Contract-aware IoT Dataspace Services
Hong-Linh Truong
 
Towards the Realization of Multi-dimensional Elasticity for Distributed Cloud...
Hong-Linh Truong
 
On Engineering Analytics of Elastic IoT Cloud Systems
Hong-Linh Truong
 
HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...
Hong-Linh Truong
 
SINC – An Information-Centric Approach for End-to-End IoT Cloud Resource Prov...
Hong-Linh Truong
 
Governing Elastic IoT Cloud Systems under Uncertainties
Hong-Linh Truong
 
SmartSociety – A Platform for Collaborative People-Machine Computation
Hong-Linh Truong
 
On Developing and Operating of Data Elasticity Management Process
Hong-Linh Truong
 
Ad

Recently uploaded (20)

PPTX
Heart Bleed Bug - A case study (Course: Cryptography and Network Security)
Adri Jovin
 
PPTX
Introduction to Design of Machine Elements
PradeepKumarS27
 
PDF
Set Relation Function Practice session 24.05.2025.pdf
DrStephenStrange4
 
PPTX
MobileComputingMANET2023 MobileComputingMANET2023.pptx
masterfake98765
 
PPTX
GitOps_Without_K8s_Training_detailed git repository
DanialHabibi2
 
PDF
Biomechanics of Gait: Engineering Solutions for Rehabilitation (www.kiu.ac.ug)
publication11
 
PDF
Basic_Concepts_in_Clinical_Biochemistry_2018كيمياء_عملي.pdf
AdelLoin
 
PPTX
Mechanical Design of shell and tube heat exchangers as per ASME Sec VIII Divi...
shahveer210504
 
PPTX
Day2 B2 Best.pptx
helenjenefa1
 
PPTX
GitOps_Without_K8s_Training simple one without k8s
DanialHabibi2
 
PPTX
GitOps_Repo_Structure for begeinner(Scaffolindg)
DanialHabibi2
 
PPTX
Solar Thermal Energy System Seminar.pptx
Gpc Purapuza
 
PPT
PPT2_Metal formingMECHANICALENGINEEIRNG .ppt
Praveen Kumar
 
PDF
Design Thinking basics for Engineers.pdf
CMR University
 
PPTX
Lecture 1 Shell and Tube Heat exchanger-1.pptx
mailforillegalwork
 
DOC
MRRS Strength and Durability of Concrete
CivilMythili
 
PPTX
artificial intelligence applications in Geomatics
NawrasShatnawi1
 
PPTX
Introduction to Neural Networks and Perceptron Learning Algorithm.pptx
Kayalvizhi A
 
PPTX
265587293-NFPA 101 Life safety code-PPT-1.pptx
chandermwason
 
PDF
Unified_Cloud_Comm_Presentation anil singh ppt
anilsingh298751
 
Heart Bleed Bug - A case study (Course: Cryptography and Network Security)
Adri Jovin
 
Introduction to Design of Machine Elements
PradeepKumarS27
 
Set Relation Function Practice session 24.05.2025.pdf
DrStephenStrange4
 
MobileComputingMANET2023 MobileComputingMANET2023.pptx
masterfake98765
 
GitOps_Without_K8s_Training_detailed git repository
DanialHabibi2
 
Biomechanics of Gait: Engineering Solutions for Rehabilitation (www.kiu.ac.ug)
publication11
 
Basic_Concepts_in_Clinical_Biochemistry_2018كيمياء_عملي.pdf
AdelLoin
 
Mechanical Design of shell and tube heat exchangers as per ASME Sec VIII Divi...
shahveer210504
 
Day2 B2 Best.pptx
helenjenefa1
 
GitOps_Without_K8s_Training simple one without k8s
DanialHabibi2
 
GitOps_Repo_Structure for begeinner(Scaffolindg)
DanialHabibi2
 
Solar Thermal Energy System Seminar.pptx
Gpc Purapuza
 
PPT2_Metal formingMECHANICALENGINEEIRNG .ppt
Praveen Kumar
 
Design Thinking basics for Engineers.pdf
CMR University
 
Lecture 1 Shell and Tube Heat exchanger-1.pptx
mailforillegalwork
 
MRRS Strength and Durability of Concrete
CivilMythili
 
artificial intelligence applications in Geomatics
NawrasShatnawi1
 
Introduction to Neural Networks and Perceptron Learning Algorithm.pptx
Kayalvizhi A
 
265587293-NFPA 101 Life safety code-PPT-1.pptx
chandermwason
 
Unified_Cloud_Comm_Presentation anil singh ppt
anilsingh298751
 
Ad

QoA4ML – A Framework for Supporting Contracts in Machine Learning Services

  • 1. QoA4ML – A Framework for Supporting Contracts in Machine Learning Services Hong-Linh Truong, Minh-Tri Nguyen Department of Computer Science https://rdsea.github.io
  • 2. Outline ▪ Context, scenario and research questions ▪ Key components of the QoA4ML framework ▪ Prototype and experiments ▪ Conclusions and future work September 9, 2021 IEEE International Conference on Web Services (ICWS) 2021 2
  • 3. Context ▪ Machine learning as a service becomes popular ▪ ML service providers offer ML services for different consumers ▪ Different stakeholders and interaction models ▪ Two stakeholders engagement: consumer and ML service provider ▪ Three stakeholders engagement: consumer, ML service provider and ML infrastructure/platform provider ▪ Key issue ▪ How do we support contracts between the ML service provider and other stakeholders? It is not just about performance! ▪ ML has several distinguishable attributes September 9, 2021 IEEE International Conference on Web Services (ICWS) 2021 3
  • 4. Scenario: predictive maintenance in Base Transceiver Stations (BTS) ▪ Dynamic inference from IoT data about equipment and infrastructure components in a BTS September 9, 2021 IEEE International Conference on Web Services (ICWS) 2021 4
  • 5. Key research questions & our approach ▪ Which are key attributes for ML contracts? ▪ How would ML attributes and constraints be specified? ▪ How would ML-specific attributes/constraints be monitored and evaluated? ▪ Approach ▪ Focus on ML-specific attributes ▪ Researchers have identified many attributes for ML models and systems ▪ Design ML contract specs suitable for cloud-native services ▪ Constraints, policies and monitoring reports ▪ Monitor ML attributes for contract monitoring September 9, 2021 IEEE International Conference on Web Services (ICWS) 2021 5
  • 6. QoA4ML framework – important attributes for ML-specific contracts ▪ Focus on important categories ▪ Inference Accuracy, Reliability and Elasticity, Quality of Data, Security and Privacy, Fairness and Interpretability and Cost September 9, 2021 IEEE International Conference on Web Services (ICWS) 2021 6
  • 7. QoA4ML specifications ▪ Decoupling attributes/constraints vs policies ▪ Required attributes and their constraints can be changed and updated at runtime ▪ Policies to check attributes and constraints can be implemented in different ways ▪ Monitoring probes and other utilities supporting observability ▪ New probes for quality of data and ML models ▪ Need to be instrumented and deployed to capture runtime attributes ▪ Must be well integrated with common monitoring features September 9, 2021 IEEE International Conference on Web Services (ICWS) 2021 7
  • 8. QoA4ML specification - the initial implementation September 9, 2021 IEEE International Conference on Web Services (ICWS) 2021 8
  • 9. Constraints for the BTS ML service ▪ Use terms in the QoA4ML specs ▪ Attributes and constraints can be changed September 9, 2021 IEEE International Conference on Web Services (ICWS) 2021 9
  • 10. Example of policies for validating contract constraints ▪ Is based on Rego ▪ Can load contract terms from JSON and compare with runtime monitoring ▪ Can be changed at runtime September 9, 2021 IEEE International Conference on Web Services (ICWS) 2021 10
  • 11. Monitoring utilities and Observability Service September 9, 2021 IEEE International Conference on Web Services (ICWS) 2021 11 Design for different engines to be used Reuse well-known monitoring systems Monitor ML-specific attributes
  • 12. Current prototype ▪ QoA4ML Specs: initial version based on JSON ▪ Use OPA (https://www.openpolicyagent.org/) as engine ▪ Rego and JSON are used for policies, attributes and constraints ▪ QoA4ML Observability as microservices ▪ Using state-of-the-art monitoring tools like Prometheus/Grafana ▪ Testing environments ▪ Edge and cloud infrastructures ▪ Source code is currently being pushed into: ▪ https://github.com/rdsea/QoA4ML September 9, 2021 IEEE International Conference on Web Services (ICWS) 2021 12
  • 13. Experiments ▪ Dynamic inferences of BTS load of power grid ▪ LSTM, TensorFlow ▪ IoT data from BTS (several months) ▪ Training in cloud and export to the edge (BTS-model-edge) and retraining several times in the cloud (BTS-model-cloud) ▪ Deployment ▪ Contracts: ▪ ResponseTime ▪ Inference Accuracy ▪ Data Quality September 9, 2021 IEEE International Conference on Web Services (ICWS) 2021 13
  • 14. Effect of edge and cloud serving platform deployment in ML contracts. September 9, 2021 IEEE International Conference on Web Services (ICWS) 2021 14 Both consumer and service are in the same edge; 3000 records per 15 minutes Both consumer and broker are in the same edge Broker is in the cloud
  • 15. Impact of violation monitoring All services in the edge (except the observability) September 9, 2021 IEEE International Conference on Web Services (ICWS) 2021 15 Help to detect outdated models in ML services: violation changes when retraining models Help to see correlations among attributes: data quality and inference accuracy
  • 16. Conclusions and future work ▪ QoA4ML is a framework to support ML service contracts ▪ Contract specifications (constraints and policies), tools and services ▪ QoA4ML benefits ▪ Establish contracts, moving to the step of continuous testing and observability of ML production ▪ Support flexible contracts and policies, enabling reuses and real- world ML services integration ▪ Future work ▪ Extending ML attributes and specifications; integration with cloud service contracts; new probes and observability capabilities September 9, 2021 IEEE International Conference on Web Services (ICWS) 2021 16
  • 17. Thanks! Hong-Linh Truong Department of Computer Science rdsea.github.io IEEE International Conference on Web Services (ICWS) 2021 17 September 9, 2021