SlideShare a Scribd company logo
Building a
Machine Learning
Recommendation Engine
in SQL
@garyorenstein @memsql
MemSQL 1
Today’s Talk
1. State of Data 2018 according to Gartner
2. Rise of Machine Learning
3. Live Demo - A SQL Recommendation Engine
MemSQL 2
SECTION 1
The State of DataAccording to Gartner 2018
MemSQL 3
Hype Cycle for Data
Management
26 July 2017
Donald Feinberg
Adam M. Ronthal
G00313950
MemSQL 4
MemSQL 5
Multimodel has the potential
to support both relational and
nonrelational use cases
while reducing the number of
disparate DBMS products
in an organization.
MemSQL 6
the idea of a
Hadoop distribution
will become obsolete
before it reaches
the Plateau of Productivity
MemSQL 7
Penetration continues to increase and organizations
should be evaluating these resources for
— cost-efficiency
— infrastructure simplification and
— new use cases, such as Hybrid Transactional/
Analytical Processing (HTAP)
MemSQL 8
Build Your Digital Business
Platform Around Data and
Analytics
31 January 2018
Andrew White
W. Roy Schulte
Roxane Edjlali
Joao Tapadinhas
Svetlana Sicular
G00350435
MemSQL 9
Select Challenges
Data and analytics investments that are tied to
measurable business outcomes are more likely to
produce reportable benefits.
MemSQL 10
Magic Quadrant for Data
Management Solutions for
Analytics
13 February 2018
Adam M. Ronthal
Roxane Edjlali
Rick Greenwald
G00326691
MemSQL 11
We define four primary use cases for DMSAs that reflect
this diversity of data and use cases:
— Traditional data warehouse
— Real-time data warehouse
— Context-independent data warehouse
— Logical data warehouse
MemSQL 12
MemSQL 13
MemSQL 14
Real-Time Data Warehouse
This use case adds a real-time component to analytics
use cases, with the aim of reducing latency — the time
lag between when data is generated and when it can be
analyzed.
MemSQL 15
MemSQL 16
Other Vendors to Consider for
Operational DBMSs
23 November 2017
Donald Feinberg
Merv Adrian
Nick Heudecker
G00327284
MemSQL 17
Other Vendors to Consider for Operational DBMSs
Actian
Aerospike
Alibaba Cloud
Altibase
ArangoDB
Cloudera
Clustrix
Couchbase
FairCom
Fujitsu
General Data Technology
Hortonworks
MariaDB
MemSQL
MongoDB
Neo4j
NuoDB
Percona
Redis Labs
SequoiaDB
TmaxSoft
VoltDB
MemSQL 18
Other Vendors to Consider for Operational DBMSs
also listed as Challenger or Leader
in the Magic Quadrant
for Data Management Solutions for Analytics
MemSQL
MemSQL 19
MemSQL 20
Over the next five years,
the OPDBMS and DMSA
markets converge to a
single DBMS market.
MemSQL 21
Look to your operational DBMS
vendor for both transactional
and analytical workloads.
MemSQL 22
SECTION 2
Rise of Machine Learning
MemSQL 23
MemSQL 24
MemSQL 25
MemSQL 26
MemSQL 27
MemSQL 28
MemSQL 29
2018 Outlook Survey
MemSQL and O’Reilly
1600+ respondents
memsql.com/MLsurvey
MemSQL 30
MemSQL 31
MemSQL 32
Machine Learning and
Databases
MemSQL 33
MemSQL 34
MemSQL 35
MemSQL 36
MemSQL 37
MemSQL 38
MemSQL 39
MemSQL 40
MemSQL 41
MemSQL 42
MemSQL 43
MemSQL 44
MemSQL 45
MemSQL 46
MemSQL 47
SECTION 3
DEMO with Yelp Dataset
MemSQL 48
MemSQL 49
MemSQL 50
MemSQL 51
MemSQL 52
Can you build a machine
learning recommendation
engine in SQL?
Yes
MemSQL 53
Can you build a machine learning
recommendation engine in SQL?
Yes
Should you?
For training? Maybe, maybe not.
For Operational Scoring?
Absolutely!
MemSQL 54
MemSQL 55
MemSQL 56
Secret Weapons to Machine Learning in SQL
— Extensibility
— Stored Procedures
— User Defined Functions
— User Defined Aggregates
— DOT_PRODUCT
— Compare two vectors
MemSQL 57
MemSQL 58
MemSQL 59
Sequel Pro Mac app for MySQL databases
MemSQL 60
MemSQL in one slide
— Distributed SQL database
— Massively parallel, lock-free, fast
— Full ACID features
— In-memory and on-disk
— JSON, key-value, geospatial, full-text search
— Robust security
— Built for transactions and analytics
MemSQL 61
MemSQL 62
MemSQL 63
Why do ML in SQL?
— Train in any number of systems
— Score in the database for applications from real-time
drilling to fraud detection to personalization
— Complete certain functions within the database to
radically simplify operational infrastructure
MemSQL 64
“It is a fine line between
a well executed SQL query on
live data and ML/AI”
MemSQL 65
MemSQL 66
Thank you!
Please visit our booth
www.memsql.com
@garyorenstein
@memsql
MemSQL 67
Abstract: Building a Machine Learning Recommendation Engine in SQL
Modern businesses constantly seek deeper customer relationships and more
compelling experiences.
To accomplish this, companies are looking to machine learning and artificial
intelligence solutions; however, that often involves a host of new systems and
approaches.
With a modern database architecture, it is possible to build compelling machine
learning solutions with SQL, deliver real-time engagements, and rapidly move to
operational applications.
See live, how a modern database can accomplish these feats within a single
integrated solution.
MemSQL 68

More Related Content

What's hot (20)

PPTX
See who is using MemSQL
jenjermain
 
PPT
Google App Engine
Dave Nielsen
 
PPTX
Introducing MemSQL 4
SingleStore
 
PPTX
Real-Time Geospatial Intelligence at Scale
SingleStore
 
PPTX
Bringing olap fully online analyze changing datasets in mem sql and spark wi...
SingleStore
 
PPTX
In-Memory Database Performance on AWS M4 Instances
SingleStore
 
PPTX
Internet of Things and Multi-model Data Infrastructure
SingleStore
 
PDF
Denodo DataFest 2017: Integrating Big Data and Streaming Data with Enterprise...
Denodo
 
PDF
Democratizing Data
Databricks
 
PPTX
Getting It Right Exactly Once: Principles for Streaming Architectures
SingleStore
 
PDF
Add Historical Analysis of Operational Data with Easy Configurations in Fivet...
Databricks
 
PDF
MemSQL
Ramzi Alqrainy
 
PDF
Presto: Fast SQL on Everything
David Phillips
 
PDF
Columbia Migrates from Legacy Data Warehouse to an Open Data Platform with De...
Databricks
 
PDF
The Future of Data Science and Machine Learning at Scale: A Look at MLflow, D...
Databricks
 
PDF
Making Data Timelier and More Reliable with Lakehouse Technology
Matei Zaharia
 
PDF
Ebooks - Accelerating Time to Value of Big Data of Apache Spark | Qubole
Vasu S
 
PDF
Personalization Journey: From Single Node to Cloud Streaming
Databricks
 
PDF
Presto: Fast SQL-on-Anything (including Delta Lake, Snowflake, Elasticsearch ...
Databricks
 
PDF
IBM Cloud Day January 2021 - A well architected data lake
Torsten Steinbach
 
See who is using MemSQL
jenjermain
 
Google App Engine
Dave Nielsen
 
Introducing MemSQL 4
SingleStore
 
Real-Time Geospatial Intelligence at Scale
SingleStore
 
Bringing olap fully online analyze changing datasets in mem sql and spark wi...
SingleStore
 
In-Memory Database Performance on AWS M4 Instances
SingleStore
 
Internet of Things and Multi-model Data Infrastructure
SingleStore
 
Denodo DataFest 2017: Integrating Big Data and Streaming Data with Enterprise...
Denodo
 
Democratizing Data
Databricks
 
Getting It Right Exactly Once: Principles for Streaming Architectures
SingleStore
 
Add Historical Analysis of Operational Data with Easy Configurations in Fivet...
Databricks
 
Presto: Fast SQL on Everything
David Phillips
 
Columbia Migrates from Legacy Data Warehouse to an Open Data Platform with De...
Databricks
 
The Future of Data Science and Machine Learning at Scale: A Look at MLflow, D...
Databricks
 
Making Data Timelier and More Reliable with Lakehouse Technology
Matei Zaharia
 
Ebooks - Accelerating Time to Value of Big Data of Apache Spark | Qubole
Vasu S
 
Personalization Journey: From Single Node to Cloud Streaming
Databricks
 
Presto: Fast SQL-on-Anything (including Delta Lake, Snowflake, Elasticsearch ...
Databricks
 
IBM Cloud Day January 2021 - A well architected data lake
Torsten Steinbach
 

Similar to Building a Machine Learning Recommendation Engine in SQL (20)

PDF
Get a clearer picture of potential cloud performance by looking beyond SPECra...
Principled Technologies
 
PDF
Making Sense of NoSQL and Big Data Amidst High Expectations
Rackspace
 
PPTX
Logical Data Warehouse: The Foundation of Modern Data and Analytics
Denodo
 
PPT
Mule microsoft
D.Rajesh Kumar
 
PPT
Mule esb-microsoft
D.Rajesh Kumar
 
PDF
Guide to NoSQL with MySQL
Samuel Rohaut
 
PDF
bigdatasqloverview21jan2015-2408000
Kartik Padmanabhan
 
PDF
Microsoft Sql Server 2016 Is Now Live
Amber Moore
 
PDF
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
Matt Stubbs
 
PDF
Connecting Silos in Real Time with Data Virtualization
Denodo
 
PDF
SQL Saturday 119 Chicago -- Enterprise Data Mining with SQL Server
Mark Tabladillo
 
PDF
SQL Saturday 108 -- Enterprise Data Mining with SQL Server
Mark Tabladillo
 
PPT
Migrating legacy ERP data into Hadoop
DataWorks Summit
 
PPTX
SAP and Microsoft Manufacturing Solution
SAP Technology
 
PDF
Webinar: Faster Big Data Analytics with MongoDB
MongoDB
 
PDF
Microsoft SQL Server 2012 Data Warehouse on Hitachi Converged Platform
Hitachi Vantara
 
PDF
¿Cómo modernizar una arquitectura de TI con la virtualización de datos?
Denodo
 
PDF
How a Semantic Layer Makes Data Mesh Work at Scale
DATAVERSITY
 
PDF
2016 Sept 1st - IBM Consultants & System Integrators Interchange - Big Data -...
Anand Haridass
 
PDF
GigaOm-sector-roadmap-cloud-analytic-databases-2017
Jeremy Maranitch
 
Get a clearer picture of potential cloud performance by looking beyond SPECra...
Principled Technologies
 
Making Sense of NoSQL and Big Data Amidst High Expectations
Rackspace
 
Logical Data Warehouse: The Foundation of Modern Data and Analytics
Denodo
 
Mule microsoft
D.Rajesh Kumar
 
Mule esb-microsoft
D.Rajesh Kumar
 
Guide to NoSQL with MySQL
Samuel Rohaut
 
bigdatasqloverview21jan2015-2408000
Kartik Padmanabhan
 
Microsoft Sql Server 2016 Is Now Live
Amber Moore
 
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
Matt Stubbs
 
Connecting Silos in Real Time with Data Virtualization
Denodo
 
SQL Saturday 119 Chicago -- Enterprise Data Mining with SQL Server
Mark Tabladillo
 
SQL Saturday 108 -- Enterprise Data Mining with SQL Server
Mark Tabladillo
 
Migrating legacy ERP data into Hadoop
DataWorks Summit
 
SAP and Microsoft Manufacturing Solution
SAP Technology
 
Webinar: Faster Big Data Analytics with MongoDB
MongoDB
 
Microsoft SQL Server 2012 Data Warehouse on Hitachi Converged Platform
Hitachi Vantara
 
¿Cómo modernizar una arquitectura de TI con la virtualización de datos?
Denodo
 
How a Semantic Layer Makes Data Mesh Work at Scale
DATAVERSITY
 
2016 Sept 1st - IBM Consultants & System Integrators Interchange - Big Data -...
Anand Haridass
 
GigaOm-sector-roadmap-cloud-analytic-databases-2017
Jeremy Maranitch
 
Ad

More from SingleStore (20)

PPTX
MemSQL 201: Advanced Tips and Tricks Webcast
SingleStore
 
PDF
Introduction to MemSQL
SingleStore
 
PPTX
Building a Fault Tolerant Distributed Architecture
SingleStore
 
PDF
Stream Processing with Pipelines and Stored Procedures
SingleStore
 
PPTX
Curriculum Associates Strata NYC 2017
SingleStore
 
PPTX
Image Recognition on Streaming Data
SingleStore
 
PPTX
Spark Summit Dublin 2017 - MemSQL - Real-Time Image Recognition
SingleStore
 
PDF
How Database Convergence Impacts the Coming Decades of Data Management
SingleStore
 
PPTX
Teaching Databases to Learn in the World of AI
SingleStore
 
PDF
Gartner Catalyst 2017: The Data Warehouse Blueprint for ML, AI, and Hybrid Cloud
SingleStore
 
PPTX
Gartner Catalyst 2017: Image Recognition on Streaming Data
SingleStore
 
PPTX
Spark Summit West 2017: Real-Time Image Recognition with MemSQL and Spark
SingleStore
 
PDF
Real-Time Analytics at Uber Scale
SingleStore
 
PDF
Machines and the Magic of Fast Learning
SingleStore
 
PPTX
Machines and the Magic of Fast Learning - Strata Keynote
SingleStore
 
PDF
Enabling Real-Time Analytics for IoT
SingleStore
 
PPTX
Driving the On-Demand Economy with Predictive Analytics
SingleStore
 
PPTX
Tapjoy: Building a Real-Time Data Science Service for Mobile Advertising
SingleStore
 
PPTX
The Real-Time CDO and the Cloud-Forward Path to Predictive Analytics
SingleStore
 
PDF
Enabling Real-Time Analytics for IoT
SingleStore
 
MemSQL 201: Advanced Tips and Tricks Webcast
SingleStore
 
Introduction to MemSQL
SingleStore
 
Building a Fault Tolerant Distributed Architecture
SingleStore
 
Stream Processing with Pipelines and Stored Procedures
SingleStore
 
Curriculum Associates Strata NYC 2017
SingleStore
 
Image Recognition on Streaming Data
SingleStore
 
Spark Summit Dublin 2017 - MemSQL - Real-Time Image Recognition
SingleStore
 
How Database Convergence Impacts the Coming Decades of Data Management
SingleStore
 
Teaching Databases to Learn in the World of AI
SingleStore
 
Gartner Catalyst 2017: The Data Warehouse Blueprint for ML, AI, and Hybrid Cloud
SingleStore
 
Gartner Catalyst 2017: Image Recognition on Streaming Data
SingleStore
 
Spark Summit West 2017: Real-Time Image Recognition with MemSQL and Spark
SingleStore
 
Real-Time Analytics at Uber Scale
SingleStore
 
Machines and the Magic of Fast Learning
SingleStore
 
Machines and the Magic of Fast Learning - Strata Keynote
SingleStore
 
Enabling Real-Time Analytics for IoT
SingleStore
 
Driving the On-Demand Economy with Predictive Analytics
SingleStore
 
Tapjoy: Building a Real-Time Data Science Service for Mobile Advertising
SingleStore
 
The Real-Time CDO and the Cloud-Forward Path to Predictive Analytics
SingleStore
 
Enabling Real-Time Analytics for IoT
SingleStore
 
Ad

Recently uploaded (20)

PPTX
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
PDF
Transforming Utility Networks: Large-scale Data Migrations with FME
Safe Software
 
PDF
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
PDF
Transcript: New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
PPTX
Future Tech Innovations 2025 – A TechLists Insight
TechLists
 
PDF
What Makes Contify’s News API Stand Out: Key Features at a Glance
Contify
 
PPTX
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
PPTX
AUTOMATION AND ROBOTICS IN PHARMA INDUSTRY.pptx
sameeraaabegumm
 
PDF
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
PDF
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
PDF
Mastering Financial Management in Direct Selling
Epixel MLM Software
 
PDF
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
DOCX
Python coding for beginners !! Start now!#
Rajni Bhardwaj Grover
 
PDF
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 
PDF
"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...
Fwdays
 
PPTX
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
PPTX
Building Search Using OpenSearch: Limitations and Workarounds
Sease
 
PPTX
OpenID AuthZEN - Analyst Briefing July 2025
David Brossard
 
PDF
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
PDF
Biography of Daniel Podor.pdf
Daniel Podor
 
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
Transforming Utility Networks: Large-scale Data Migrations with FME
Safe Software
 
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
Transcript: New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
Future Tech Innovations 2025 – A TechLists Insight
TechLists
 
What Makes Contify’s News API Stand Out: Key Features at a Glance
Contify
 
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
AUTOMATION AND ROBOTICS IN PHARMA INDUSTRY.pptx
sameeraaabegumm
 
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
Mastering Financial Management in Direct Selling
Epixel MLM Software
 
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
Python coding for beginners !! Start now!#
Rajni Bhardwaj Grover
 
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 
"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...
Fwdays
 
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
Building Search Using OpenSearch: Limitations and Workarounds
Sease
 
OpenID AuthZEN - Analyst Briefing July 2025
David Brossard
 
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
Biography of Daniel Podor.pdf
Daniel Podor
 

Building a Machine Learning Recommendation Engine in SQL

  • 1. Building a Machine Learning Recommendation Engine in SQL @garyorenstein @memsql MemSQL 1
  • 2. Today’s Talk 1. State of Data 2018 according to Gartner 2. Rise of Machine Learning 3. Live Demo - A SQL Recommendation Engine MemSQL 2
  • 3. SECTION 1 The State of DataAccording to Gartner 2018 MemSQL 3
  • 4. Hype Cycle for Data Management 26 July 2017 Donald Feinberg Adam M. Ronthal G00313950 MemSQL 4
  • 6. Multimodel has the potential to support both relational and nonrelational use cases while reducing the number of disparate DBMS products in an organization. MemSQL 6
  • 7. the idea of a Hadoop distribution will become obsolete before it reaches the Plateau of Productivity MemSQL 7
  • 8. Penetration continues to increase and organizations should be evaluating these resources for — cost-efficiency — infrastructure simplification and — new use cases, such as Hybrid Transactional/ Analytical Processing (HTAP) MemSQL 8
  • 9. Build Your Digital Business Platform Around Data and Analytics 31 January 2018 Andrew White W. Roy Schulte Roxane Edjlali Joao Tapadinhas Svetlana Sicular G00350435 MemSQL 9
  • 10. Select Challenges Data and analytics investments that are tied to measurable business outcomes are more likely to produce reportable benefits. MemSQL 10
  • 11. Magic Quadrant for Data Management Solutions for Analytics 13 February 2018 Adam M. Ronthal Roxane Edjlali Rick Greenwald G00326691 MemSQL 11
  • 12. We define four primary use cases for DMSAs that reflect this diversity of data and use cases: — Traditional data warehouse — Real-time data warehouse — Context-independent data warehouse — Logical data warehouse MemSQL 12
  • 15. Real-Time Data Warehouse This use case adds a real-time component to analytics use cases, with the aim of reducing latency — the time lag between when data is generated and when it can be analyzed. MemSQL 15
  • 17. Other Vendors to Consider for Operational DBMSs 23 November 2017 Donald Feinberg Merv Adrian Nick Heudecker G00327284 MemSQL 17
  • 18. Other Vendors to Consider for Operational DBMSs Actian Aerospike Alibaba Cloud Altibase ArangoDB Cloudera Clustrix Couchbase FairCom Fujitsu General Data Technology Hortonworks MariaDB MemSQL MongoDB Neo4j NuoDB Percona Redis Labs SequoiaDB TmaxSoft VoltDB MemSQL 18
  • 19. Other Vendors to Consider for Operational DBMSs also listed as Challenger or Leader in the Magic Quadrant for Data Management Solutions for Analytics MemSQL MemSQL 19
  • 21. Over the next five years, the OPDBMS and DMSA markets converge to a single DBMS market. MemSQL 21
  • 22. Look to your operational DBMS vendor for both transactional and analytical workloads. MemSQL 22
  • 23. SECTION 2 Rise of Machine Learning MemSQL 23
  • 30. 2018 Outlook Survey MemSQL and O’Reilly 1600+ respondents memsql.com/MLsurvey MemSQL 30
  • 48. SECTION 3 DEMO with Yelp Dataset MemSQL 48
  • 53. Can you build a machine learning recommendation engine in SQL? Yes MemSQL 53
  • 54. Can you build a machine learning recommendation engine in SQL? Yes Should you? For training? Maybe, maybe not. For Operational Scoring? Absolutely! MemSQL 54
  • 57. Secret Weapons to Machine Learning in SQL — Extensibility — Stored Procedures — User Defined Functions — User Defined Aggregates — DOT_PRODUCT — Compare two vectors MemSQL 57
  • 60. Sequel Pro Mac app for MySQL databases MemSQL 60
  • 61. MemSQL in one slide — Distributed SQL database — Massively parallel, lock-free, fast — Full ACID features — In-memory and on-disk — JSON, key-value, geospatial, full-text search — Robust security — Built for transactions and analytics MemSQL 61
  • 64. Why do ML in SQL? — Train in any number of systems — Score in the database for applications from real-time drilling to fraud detection to personalization — Complete certain functions within the database to radically simplify operational infrastructure MemSQL 64
  • 65. “It is a fine line between a well executed SQL query on live data and ML/AI” MemSQL 65
  • 67. Thank you! Please visit our booth www.memsql.com @garyorenstein @memsql MemSQL 67
  • 68. Abstract: Building a Machine Learning Recommendation Engine in SQL Modern businesses constantly seek deeper customer relationships and more compelling experiences. To accomplish this, companies are looking to machine learning and artificial intelligence solutions; however, that often involves a host of new systems and approaches. With a modern database architecture, it is possible to build compelling machine learning solutions with SQL, deliver real-time engagements, and rapidly move to operational applications. See live, how a modern database can accomplish these feats within a single integrated solution. MemSQL 68