DataOps
The continuous deployment cycle in data analytics
Olivier Perard | Data Scientist at Oracle
DataOps
Definitions
VP Technology Strategy, MapR
DataOps is an agile methodology for developing and deploying data-intensive
applications, including data science and machine learning. A DataOps workflow supports
cross-functional collaboration and fast time to value.
http://www.gartner.com/it-glossary/data-ops/
A hub for collecting and distributing data, with a mandate to provide controlled access to systems
of record for customer and marketing performance data, while protecting privacy, usage
restrictions, and data integrity.
Tamr CEO Andy Palmer
DataOps is an enterprise collaboration framework that aligns data-management
objectives with data-consumption ideals to maximize data-derived value.
Nexla CEO
DataOps is the function within an organization that controls the data journey from
source to value.
DataOps
Gartner
Data & Analytics Summit 2018
DataOps, private cloud database platform as a service (dbPaaS), and
machine-learning-enabled data management.
DataOps is a new practice with no standards or frameworks
Nick Heudecker, research vice president at Gartner
COMPARING DEVOPS AND DATAOPS
WHAT’S DIFFERENT OR THE SAME?
Developers & Architects
Data Engineers
Data Scientists
Security & Governance
Operations
DevOps
DataOps
DataOps
Brings Flexibility & Focus
Expands DevOps to include data-heavy roles
Organized around data-related goals
Better collaboration and communication between roles
DataOps
AN AGILE METHODOLOGY FOR DATA-DRIVEN ORGANIZATIONS
AXIOMS:
Data is central to disruptive enterprise applications
• Lightweight, stateless functions do not represent the majority of workloads
Data science and machine learning are an important paradigm
• Scientists become active users -- no longer just application developers
• Iterative workflow with different data usage patterns
Data volumes continue to grow
Moving data is a performance bottleneck
DataOps Goals:
Continuous model deployment
Promote repeatability
Promote productivity -- focus on core competencies
Promote agility
Promote self-service
DataOps
Analyze and Visualize / Store and Process / Connect and Integrate
Structured Data
Unstructured Data
Sandboxes
Data lakes
Varying data types
Quick and actionable business insights
Focus on algorithms, not infrastructure
Data available from structured and unstructured sources
Data marts / warehouses
DATA PLATFORM / DATA STREAM / DATA ANALYTICS
Data Science Platforms
CLOUD PROVIDERS
ETL & DATA ENGINEERING
VERTICAL APPLICATIONS
BI & VISUALIZATION TOOLS
SECURITY
INFRASTRUCTURE
LIBRARIES
TOOLS
DATA PLATFORMS
DATA SCIENCE PLATFORMS
DataOps
Approach Advantages
Data Self-Service
• Data scientists need to develop use cases quickly using the enterprise’s data, without restrictions from IT.
Improved efficiency and better use of the team’s time
• Deploy the analytics platform in one click
Faster Time-to-Value
Improve productivity
• Implement use cases in parallel on the same data, with a dedicated platform for each analytics team.
Storage
Compute
LIBRARIES
TOOLS
DATA SCIENCE PLATFORMS
DataOps
Continuous Model Deployment
Key Building Blocks for Agility:
• Unified data platform
• Data governance
• Self-service data and compute access
• Multitenancy and resource management
Data Engineering
Model Development
Model Management
Model Deployment
Model Monitoring & Rescoring
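The five lifecycle stages listed above can be sketched as a small state machine. This is only an illustrative sketch: the `Stage` enum and the `next_stage` function are hypothetical names, and the rule that monitoring loops back to data engineering when drift is detected is an assumption made here to show what "continuous" could mean, not something the slide specifies.

```python
from enum import Enum


class Stage(Enum):
    """Lifecycle stages, named as on the slide."""
    DATA_ENGINEERING = "Data Engineering"
    MODEL_DEVELOPMENT = "Model Development"
    MODEL_MANAGEMENT = "Model Management"
    MODEL_DEPLOYMENT = "Model Deployment"
    MODEL_MONITORING = "Model Monitoring & Rescoring"


# Enum iteration follows definition order, so this is the slide's order.
ORDER = list(Stage)


def next_stage(current: Stage, drift_detected: bool = False) -> Stage:
    """Return the stage that follows `current`.

    Assumption (not from the slide): when monitoring detects drift the
    cycle restarts at data engineering; otherwise monitoring continues.
    """
    if current is Stage.MODEL_MONITORING:
        return Stage.DATA_ENGINEERING if drift_detected else current
    return ORDER[ORDER.index(current) + 1]


if __name__ == "__main__":
    stage = Stage.DATA_ENGINEERING
    while stage is not Stage.MODEL_MONITORING:
        print(stage.value)
        stage = next_stage(stage)
    print(stage.value)
```

Modeling the loop explicitly makes the retraining trigger a single, testable decision rather than an ad hoc manual step.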
DataOps
Storage
Compute
Data Lab
Sandbox
Data Pod
DataOps
Data Platform Deployment
Oracle GitHub OCI Ansible Modules
Oracle Database 12c
Jupyter
Zeppelin OML
1
2
Data Integration
CDC / ETL
3
Data Lab
DataOps
Data-Driven Architecture
Traditional and Modern
Legacy, Custom, Mainframe, SaaS, Microservices, …
Source: Oracle Insight
Data Platform
Analytics
• Advanced Analytics
• Self-service
• Predictive
Data Science
• Machine Learning
• Deep Learning
Modern Data Platform
Security & Compliance
X Data
Applications
Real-time Analytics
• Real-time Marketing
• Fraud detection
• Exec Dashboarding
Real-time
Real-time Services
{OOP}
SparklineData
• Accessing multiple sources of data (technologies, silos/locations, clouds) …
• … with high performance …
• … for broader cross multi-model queries/algorithms on real-time data as well as historical data
Applications
BigData SQL
DataOps
Cloud Native & Open Source
Community
Artificial Intelligence
Blockchain
Internet of Things
Container Native Microservices
Open Serverless Computing DevOps
Prometheus
Open Source Cloud Native Innovation
Open Source Cloud Native Development
Istio
Cloud-Native and Community Driven Innovation
Open Source Managed and Autonomous Cloud Native
DataOps
Data Stream
Data Preparation
Data Replication
Data ETL
Logs
Oracle Cloud Infrastructure
Analytics
Consumers
Data Platform
BI
NL / AI
Data Integration
CDC / ETL
Discovering, Structuring, Cleaning, Enriching, Validating, Deploying
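The six preparation steps named above (discovering, structuring, cleaning, enriching, validating, deploying) might be chained as plain functions. The sketch below is purely illustrative: the record layout ("name,amount" text lines), the `large` flag, and every function name are hypothetical and are not taken from the slides or from any Oracle tool.

```python
def discover(lines):
    """Profile the raw input: line count and how many lines carry an amount."""
    with_amount = sum(1 for line in lines if line.split(",")[1].strip())
    return {"lines": len(lines), "with_amount": with_amount}


def structure(lines):
    """Parse each 'name,amount' line into a dict of stripped strings."""
    rows = []
    for line in lines:
        name, amount = line.split(",")
        rows.append({"name": name.strip(), "amount": amount.strip()})
    return rows


def clean(rows):
    """Drop rows with a missing amount and convert the rest to integers."""
    return [dict(row, amount=int(row["amount"])) for row in rows if row["amount"]]


def enrich(rows):
    """Add a derived field (hypothetical rule: amounts of 10 or more are 'large')."""
    return [dict(row, large=row["amount"] >= 10) for row in rows]


def validate(rows):
    """Reject the batch if any amount is negative."""
    for row in rows:
        if row["amount"] < 0:
            raise ValueError(f"negative amount for {row['name']}")
    return rows


def deploy(rows, target):
    """Publish rows to a target (here just a list standing in for a data store)."""
    target.extend(rows)
    return len(rows)


def run(lines, target):
    """Run all six steps in order and return the profile and published row count."""
    profile = discover(lines)
    rows = validate(enrich(clean(structure(lines))))
    return profile, deploy(rows, target)


if __name__ == "__main__":
    store = []
    print(run(["alice, 10", "bob,", " carol ,7"], store))
    print(store)
```

Keeping each step a separate function lets a pipeline tool schedule, test, and monitor the steps independently.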
DataOps
Data Stream
Lineage
Pipeline
Quality
Speed
Efficiency
Oracle Data Science
Data Science Requires a Comprehensive Platform to Simplify Operations and Deliver Value at Scale
• Accelerate use of proper tools, frameworks and infrastructure
• Overcome restricted skillsets with a simple, collaborative platform
• Quickly leverage predictive analytics to drive positive business outcomes
Collaborate securely
Power business
Work in standardized environments
A Robust, Easy-to-Use Data Science Platform Removes Barriers to Deploying Valuable Machine Learning Models in Production
Manage data and tools
Oracle Data Science
Projects LifeCycle
Reproducibility
Data Versioning
Code Versioning
Model Versioning
Environment Management
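One common way to make a run reproducible is to record a content fingerprint for each of the four items above (data, code, model, environment). The sketch below assumes SHA-256 content hashing from the Python standard library; the `fingerprint` and `run_manifest` names and the manifest layout are hypothetical and do not describe how Oracle Data Science implements versioning.

```python
import hashlib
import json


def fingerprint(payload: bytes) -> str:
    """Return a short, stable identifier derived from the content itself."""
    return hashlib.sha256(payload).hexdigest()[:12]


def run_manifest(data: bytes, code: bytes, model: bytes, environment: dict) -> dict:
    """Record one version identifier per reproducibility dimension.

    The environment is a dict of package name to version (an assumption
    for this sketch); keys are sorted so that ordering does not change
    the fingerprint.
    """
    env_bytes = json.dumps(environment, sort_keys=True).encode("utf-8")
    return {
        "data_version": fingerprint(data),
        "code_version": fingerprint(code),
        "model_version": fingerprint(model),
        "environment": fingerprint(env_bytes),
    }


if __name__ == "__main__":
    manifest = run_manifest(
        data=b"id,amount\n1,10\n",
        code=b"def train(): ...",
        model=b"serialized-model-bytes",
        environment={"python": "3.11", "numpy": "1.26"},
    )
    print(json.dumps(manifest, indent=2))
```

Two runs with identical manifests used the same inputs; any differing field points to what changed between runs.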
Model Deployment
Operationalize Models as Scalable APIs
Model Management
Monitor and Optimize Model Performance
Data Exploration
Collaborative Data Analysis / Feature Engineering
Model Build and Train
with Open Source Frameworks
Collaborators
∙ Data Scientists
∙ Business Stakeholders
∙ App Developers
∙ IT Admins
Business Analyst/Leader
Defines the business problem and the objective of the analyses.
Data Engineer
Prepares data, builds pipelines, and provides data access for analytical or operational uses.
IT Admin
Oversees underlying processes, architecture, operations, and resource constraints.
Data Scientist
Analyzes data using statistical methods and coding languages like Python, R, and Scala.
Application Developer
Deploys data science models into applications. Builds data products.
Oracle Data Science
Modules
Collaborative
Integrated
Enterprise-Grade
Oracle Data Science Cloud
Oracle PaaS & IaaS
Projects
Notebooks
Open Source Languages & Libraries
Version Control
Use Case Templates
Model Build & Train
Self-Service Scalable Compute (OCI)
Object Store
Catalog
Data Lake
Streaming
Autonomous Data Warehouse
Model Deployment
Model Monitoring
Access Controls & Security
Project-driven UI enables teams to easily work together on end-to-end modeling workflows with self-service access to data and resources
Support for the latest open source tools, version control, and tight integration with OCI and Oracle Big Data Platform
A fully managed platform built to meet the needs of the modern enterprise
Oracle Data Science
Environment complexity
Oracle Data Science
Configure, Train & Deploy
Oracle PaaS
Language
Image
Video
HR
Emotion
Easy Deployment
3 Deploy
Model Train
Data Definition
Model Test
Publish API
Data Select
Code Notebook
2 Train
• Frameworks
• AI libraries
• Samples
• GPU clusters
• Connect to data
• Auto scale, updates
• HS network, storage
• Object Stores
• Database CS
• Spark
Easy Data Access
1 Configure
Autonomous Setup
Model Sharing
Model Library
APIs
Model Analytics
IT Persona
DevOps
Data Scientist
Data Scientist
Easy Development
Easy setup
Oracle Data Science
Build & Train
DEV
TEST
PROD
Oracle Data Science
Deploy
DEV
TEST
PROD
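Promotion through the DEV, TEST, and PROD environments shown above could be guarded by a simple gate. This is a minimal sketch under stated assumptions: the `promote` function is hypothetical, and the rule that a model only advances when its checks pass is an assumption added for illustration, since the slides show only the three environments.

```python
# Environment order as shown on the slides.
ENVIRONMENTS = ["DEV", "TEST", "PROD"]


def promote(current: str, checks_passed: bool) -> str:
    """Return the environment a model should be in after a promotion attempt.

    Assumption (not from the slides): a model advances exactly one
    environment at a time, only when its checks pass, and stays in
    PROD once it gets there.
    """
    if current not in ENVIRONMENTS:
        raise ValueError(f"unknown environment: {current}")
    if not checks_passed or current == ENVIRONMENTS[-1]:
        return current
    return ENVIRONMENTS[ENVIRONMENTS.index(current) + 1]


if __name__ == "__main__":
    env = "DEV"
    for passed in (True, False, True):
        env = promote(env, passed)
        print(env)
```

Advancing one environment at a time keeps the same artifact moving through each stage, so what was tested is what gets deployed.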
DataOps
Conclusions
Multi-Model Data Access
Interoperability
Data preparation and pipeline
Automation
Elasticity
Multidimensional agility
Automated governance
Next Generation Platform for All Data
Complete, Integrated, Open
AI and Machine Learning
ALL IN ONE
ORACLE PROVIDES
Thank you
Olivier Perard
https://twitter.com/oracle_es?lang=es

DevOps Spain 2019. Olivier Perard-Oracle
