SlideShare a Scribd company logo
RTB Optimizer: Behind the scenes witha Predictive API 
Nicolas KruchtenPAPIs.io –November 18, 2014 
REAL TIME MACHINE LEARNING 
DECISIONS AS A SERVICE
About Datacratic 
•Software company specializing in high performance systems andmachine learning 
•30 employees, founded in 2009, based in Montréal, Québec, Canada with an office in New York 
•3 Predictive APIs in market today 
•Building a Machine Learning Database to help others build Predictive APIs and Apps
Real-Time Bidding for online advertising 
Real-Time 
Exchange 
Bidder 
Bidder 
Bidder 
Bidder 
Web 
Browser 
GET ad 
bid requests
Real-Time Bidding for online advertising 
Real-Time 
Exchange 
Bidder 
Bidder 
Bidder 
Bidder 
Web 
Browser 
ad 
bids 
auction
Real-Time Bidding for online advertising 
Real-Time 
Exchange 
Bidder 
Bidder 
Bidder 
Bidder 
Web 
Browser 
This happens millions of times per second 
Bidders must respond within 100 milliseconds 
ad 
bids 
auction
Real-Time Bidding for online advertising 
Real-Time 
Exchange 
Bidder 
Bidder 
Bidder 
Bidder 
Web 
Browser 
RTB Optimizer enables bidders to achieve campaign goals 
ad 
bids 
auction
Campaign goals 
•Advertising campaignsare typically outcome-oriented 
–Clicks 
–Video views 
–Conversions: app installs, purchases, sign-ups 
•e.g. Ad network has sold someone 1,000 outcomes for $1,000 
•e.g. Advertiser has $1,000 to get as many outcomes as possible 
•Essentially maximize profit or minimize cost-per-outcome
Datacratic’s RTB Optimizer 
•Client bidder relays bid-requests to API, API tells it how to bid 
•Handles 100,000 queries per second, for 100s of campaign 
•API says which campaign should bid and how much 
•API also needs outcomes in real-time and campaign goals
RTB Optimizer 
Bids API 
Outcomes API
A Predictive API that learns 
•Datacratichas no proprietary data set 
•API can learn from scratch from the bid-request stream what works for each campaign: 
–Contextual features: website, time of day, banner size and placement 
–User features: geo-location, browser, language, # of impressions shown 
–Customer-provided data: about the user, about the website 
•Provides insightsinto what features are driving performance 
•Can re-use learningsfrom previous campaigns
Second price auctions 
•First Price Auctions 
–You bid $1, I bid $2: I win, and I pay $2 
•RTB uses Second Price Auctions 
–You bid $1, I bid $2: I win, and I pay $1 
•Optimal bid = E[ value ] 
–Say it’s worth $2 to me 
–I will never bid more than $2 
–If I bid $1.50 and you bid $1.75: I’ve lost an opportunity for $0.25 surplus! 
–I should always bid $2
Don’t buy lottery tickets! 
E[ value ] = payout * P( getting the payout )
What’s it to you? 
•If client gets paid $10,000 for 1,000 then payout = $10E[ value | bid-request ] = $10 * P( conversion | bid-request ) 
•What was an economics problem is now a prediction problem 
•We need to calibrate to predict true probabilities
RTB Optimizer 
Bids API 
E[ value ] 
Outcomes API 
P( outcome )
Collecting the data 
•To compute P( X | Y ) we need examples of Y’s with an X label 
•RTB Optimizer uses mix of strategies to meet campaign goals 
•Probe strategy bids randomly to collect data 
•Optimized strategy bids with E[ value] 
•Automatic training/retraining when API see enough examples
RTB Optimizer 
Probe 
Bids API 
E[ value ] 
Training 
Outcomes API 
P( outcome )
Bias control 
•Never stop the probe strategy 
•Always need control group for evaluation, retraining 
•Risk of filter bubbles: future models trained on previous output 
•Bid requests are randomly routed to probe, less often over time 
•Models automatically back-tested before deployment
How to learn in real-time 
•Classify using bagged generalized linear models 
•Generate non-linear features with statistics tables 
•Periodically retrain classifier 
•Continuously update stats tables
Statistics Table by example 
Table 
Bucket 
Impressions 
Outcomes 
Outcomes/Impressions 
95%Confidence 
Lower Bound on 
Outcomes/Impressions 
Browser 
Chrome 
5M 
3k 
0.060% 
0.058% 
Firefox 
3M 
1k 
0.033% 
0.031% 
Website 
abc.com 
4M 
2k 
0.050% 
0.048% 
xyz.com 
1k 
10 
1.000% 
0.481%
RTB Optimizer 
Probe 
Bids API 
E[ value ] 
Training 
Outcomes API 
GLZ Classifier 
Stats Tables 
Real-Time 
Batch
Implementation details (are everything) 
•100k requests per second, 10 millisecond latency, running 24/7,1 trillion predictions to date 
•Distributed system, written in C++ 11 
•AWS: data in S3, training runs on Amazon EC2 spot market 
•http://opensource.datacratic.com/ 
–RTBkit 
–JML 
–StarCluster
Does it work? 
Classification success? ROC or calibration curves…
Does it work? 
Classification success? ROC and calibration curves… 
Optimization success? 80% reductions in cost-per-outcome…
Does it work? 
Classification success? ROC or calibration curves… 
Optimization success? 80% reductions in cost-per-outcome… 
Customer success! 25% monthly growth
Thanks! 
nicolas@datacratic.com 
REAL TIME MACHINE LEARNING 
DECISIONS AS A SERVICE

More Related Content

PPTX
Fast Data Driving Personalization - Nick Gorski
Hakka Labs
 
PPTX
Aerospike at Tapad
Toby Matejovsky
 
PDF
CPEx Audience – Adform PAB
Matěj Novák
 
PPTX
主題二、糧食安全與產業衝擊(蔡培慧)
育宗 劉
 
PPTX
RTBkit Meetup - Developer Spotlight, Behind the Scenes of RTBkit and Intro to...
Datacratic
 
PPT
Int'l Conference on Predictive APIs: RTB Optimizer presentation
Datacratic
 
PPTX
Computational Advertising in Yelp Local Ads
soupsranjan
 
PDF
Yelp Ad Targeting at Scale with Apache Spark with Inaz Alaei-Novin and Joe Ma...
Databricks
 
Fast Data Driving Personalization - Nick Gorski
Hakka Labs
 
Aerospike at Tapad
Toby Matejovsky
 
CPEx Audience – Adform PAB
Matěj Novák
 
主題二、糧食安全與產業衝擊(蔡培慧)
育宗 劉
 
RTBkit Meetup - Developer Spotlight, Behind the Scenes of RTBkit and Intro to...
Datacratic
 
Int'l Conference on Predictive APIs: RTB Optimizer presentation
Datacratic
 
Computational Advertising in Yelp Local Ads
soupsranjan
 
Yelp Ad Targeting at Scale with Apache Spark with Inaz Alaei-Novin and Joe Ma...
Databricks
 

Similar to Nicolas Kruchten @ Datacratic (20)

PPTX
Mobile Ad Monetization for Games | Christian Calderon
Jessica Tams
 
PPTX
Szetela practcal ad words ai rocks
David Szetela
 
PPTX
Artificial Intelligence and Machine Learning in PPC - David Szetela
State of Search Conference
 
PDF
RTBMA ECIR 2016 tutorial
Shuai Yuan
 
PPTX
AI and Machine Language in PPC
David Szetela
 
PPTX
Response prediction for display advertising - WSDM 2014
Olivier Chapelle
 
PPTX
RTB Bid Landscape in Adform
PrzemysawPiotrowski6
 
PPTX
Vpon - 廣告效果導向為基礎的行動廣告系統
Vpon
 
PDF
Preffered account structure and performance tracking in Apple Search Ads
Anna Yurchuk
 
PPTX
Offer Platform
3Di
 
PPTX
Smartone v1.0
Jinyean Tan
 
PPTX
Building Low Latency ML Systems for Real-Time Model Predictions at Xandr
ScyllaDB
 
PPTX
Data-Driven Reserve Prices for Social Advertising Auctions at LinkedIn
Kun Liu
 
PDF
Digital Marketing Course Week 8: Online Advertising
Ayca Turhan
 
PDF
apidays LIVE London 2021 - Productization of APIs by Jay Dembani, Avalara
apidays
 
PDF
SiteScout August Buyer Strategy Webinar
sitescout
 
PPTX
Making advertising personal, 4th NL Recommenders Meetup
Olivier Koch
 
PDF
How to buy traffic from Facebook, Instagram and Facebook Audience Network
Travelpayouts
 
PPT
Maximize Your Revenue in Paid Search
Hanapin Marketing
 
PPTX
Why is programmatic taking off? What is this revolution all about?
Datacratic
 
Mobile Ad Monetization for Games | Christian Calderon
Jessica Tams
 
Szetela practcal ad words ai rocks
David Szetela
 
Artificial Intelligence and Machine Learning in PPC - David Szetela
State of Search Conference
 
RTBMA ECIR 2016 tutorial
Shuai Yuan
 
AI and Machine Language in PPC
David Szetela
 
Response prediction for display advertising - WSDM 2014
Olivier Chapelle
 
RTB Bid Landscape in Adform
PrzemysawPiotrowski6
 
Vpon - 廣告效果導向為基礎的行動廣告系統
Vpon
 
Preffered account structure and performance tracking in Apple Search Ads
Anna Yurchuk
 
Offer Platform
3Di
 
Smartone v1.0
Jinyean Tan
 
Building Low Latency ML Systems for Real-Time Model Predictions at Xandr
ScyllaDB
 
Data-Driven Reserve Prices for Social Advertising Auctions at LinkedIn
Kun Liu
 
Digital Marketing Course Week 8: Online Advertising
Ayca Turhan
 
apidays LIVE London 2021 - Productization of APIs by Jay Dembani, Avalara
apidays
 
SiteScout August Buyer Strategy Webinar
sitescout
 
Making advertising personal, 4th NL Recommenders Meetup
Olivier Koch
 
How to buy traffic from Facebook, Instagram and Facebook Audience Network
Travelpayouts
 
Maximize Your Revenue in Paid Search
Hanapin Marketing
 
Why is programmatic taking off? What is this revolution all about?
Datacratic
 
Ad

More from PAPIs.io (20)

PDF
Shortening the time from analysis to deployment with ml as-a-service — Luiz A...
PAPIs.io
 
PDF
Feature engineering — HJ Van Veen (Nubank) @@PAPIs Connect — São Paulo 2017
PAPIs.io
 
PDF
Extracting information from images using deep learning and transfer learning ...
PAPIs.io
 
PDF
Discovering the hidden treasure of data using graph analytic — Ana Paula Appe...
PAPIs.io
 
PDF
Deep learning for sentiment analysis — André Barbosa (elo7) @PAPIs Connect — ...
PAPIs.io
 
PDF
Building machine learning service in your business — Eric Chen (Uber) @PAPIs ...
PAPIs.io
 
PDF
Building machine learning applications locally with Spark — Joel Pinho Lucas ...
PAPIs.io
 
PDF
Battery log data mining — Ramon Oliveira (Datart) @PAPIs Connect — São Paulo ...
PAPIs.io
 
PDF
A tensorflow recommending system for news — Fabrício Vargas Matos (Hearst tv)...
PAPIs.io
 
PDF
Scaling machine learning as a service at Uber — Li Erran Li at #papis2016
PAPIs.io
 
PDF
Real-world applications of AI - Daniel Hulme @ PAPIs Connect
PAPIs.io
 
PDF
Past, Present and Future of AI: a Fascinating Journey - Ramon Lopez de Mantar...
PAPIs.io
 
PDF
Revolutionizing Offline Retail Pricing & Promotions with ML - Daniel Guhl @ P...
PAPIs.io
 
PDF
Demystifying Deep Learning - Roberto Paredes Palacios @ PAPIs Connect
PAPIs.io
 
PDF
Predictive APIs: What about Banking? - Natalino Busa @ PAPIs Connect
PAPIs.io
 
PDF
Microdecision making in financial services - Greg Lamp @ PAPIs Connect
PAPIs.io
 
PDF
Engineering the Future of Our Choice with General AI - JoEllen Lukavec Koeste...
PAPIs.io
 
PDF
Distributed deep learning with spark on AWS - Vincent Van Steenbergen @ PAPIs...
PAPIs.io
 
PDF
How to predict the future of shopping - Ulrich Kerzel @ PAPIs Connect
PAPIs.io
 
PDF
The emergent opportunity of Big Data for Social Good - Nuria Oliver @ PAPIs C...
PAPIs.io
 
Shortening the time from analysis to deployment with ml as-a-service — Luiz A...
PAPIs.io
 
Feature engineering — HJ Van Veen (Nubank) @@PAPIs Connect — São Paulo 2017
PAPIs.io
 
Extracting information from images using deep learning and transfer learning ...
PAPIs.io
 
Discovering the hidden treasure of data using graph analytic — Ana Paula Appe...
PAPIs.io
 
Deep learning for sentiment analysis — André Barbosa (elo7) @PAPIs Connect — ...
PAPIs.io
 
Building machine learning service in your business — Eric Chen (Uber) @PAPIs ...
PAPIs.io
 
Building machine learning applications locally with Spark — Joel Pinho Lucas ...
PAPIs.io
 
Battery log data mining — Ramon Oliveira (Datart) @PAPIs Connect — São Paulo ...
PAPIs.io
 
A tensorflow recommending system for news — Fabrício Vargas Matos (Hearst tv)...
PAPIs.io
 
Scaling machine learning as a service at Uber — Li Erran Li at #papis2016
PAPIs.io
 
Real-world applications of AI - Daniel Hulme @ PAPIs Connect
PAPIs.io
 
Past, Present and Future of AI: a Fascinating Journey - Ramon Lopez de Mantar...
PAPIs.io
 
Revolutionizing Offline Retail Pricing & Promotions with ML - Daniel Guhl @ P...
PAPIs.io
 
Demystifying Deep Learning - Roberto Paredes Palacios @ PAPIs Connect
PAPIs.io
 
Predictive APIs: What about Banking? - Natalino Busa @ PAPIs Connect
PAPIs.io
 
Microdecision making in financial services - Greg Lamp @ PAPIs Connect
PAPIs.io
 
Engineering the Future of Our Choice with General AI - JoEllen Lukavec Koeste...
PAPIs.io
 
Distributed deep learning with spark on AWS - Vincent Van Steenbergen @ PAPIs...
PAPIs.io
 
How to predict the future of shopping - Ulrich Kerzel @ PAPIs Connect
PAPIs.io
 
The emergent opportunity of Big Data for Social Good - Nuria Oliver @ PAPIs C...
PAPIs.io
 
Ad

Recently uploaded (20)

PPTX
Multiscale Segmentation of Survey Respondents: Seeing the Trees and the Fores...
Sione Palu
 
PPTX
Pipeline Automatic Leak Detection for Water Distribution Systems
Sione Palu
 
PPTX
lecture 13 mind test academy it skills.pptx
ggesjmrasoolpark
 
PDF
oop_java (1) of ice or cse or eee ic.pdf
sabiquntoufiqlabonno
 
PPTX
INFO8116 - Week 10 - Slides.pptx big data architecture
guddipatel10
 
PPTX
short term project on AI Driven Data Analytics
JMJCollegeComputerde
 
PDF
SUMMER INTERNSHIP REPORT[1] (AutoRecovered) (6) (1).pdf
pandeydiksha814
 
PPTX
Web dev -ppt that helps us understand web technology
shubhragoyal12
 
PPTX
Power BI in Business Intelligence with AI
KPR Institute of Engineering and Technology
 
PDF
The_Future_of_Data_Analytics_by_CA_Suvidha_Chaplot_UPDATED.pdf
CA Suvidha Chaplot
 
PDF
An Uncut Conversation With Grok | PDF Document
Mike Hydes
 
PDF
717629748-Databricks-Certified-Data-Engineer-Professional-Dumps-by-Ball-21-03...
pedelli41
 
PDF
Company Presentation pada Perusahaan ADB.pdf
didikfahmi
 
PPTX
Future_of_AI_Presentation for everyone.pptx
boranamanju07
 
PPTX
World-population.pptx fire bunberbpeople
umutunsalnsl4402
 
PPT
Grade 5 PPT_Science_Q2_W6_Methods of reproduction.ppt
AaronBaluyut
 
PPTX
Complete_STATA_Introduction_Beginner.pptx
mbayekebe
 
PPTX
Data-Driven Machine Learning for Rail Infrastructure Health Monitoring
Sione Palu
 
PPTX
IP_Journal_Articles_2025IP_Journal_Articles_2025
mishell212144
 
PPTX
Employee Salary Presentation.l based on data science collection of data
barridevakumari2004
 
Multiscale Segmentation of Survey Respondents: Seeing the Trees and the Fores...
Sione Palu
 
Pipeline Automatic Leak Detection for Water Distribution Systems
Sione Palu
 
lecture 13 mind test academy it skills.pptx
ggesjmrasoolpark
 
oop_java (1) of ice or cse or eee ic.pdf
sabiquntoufiqlabonno
 
INFO8116 - Week 10 - Slides.pptx big data architecture
guddipatel10
 
short term project on AI Driven Data Analytics
JMJCollegeComputerde
 
SUMMER INTERNSHIP REPORT[1] (AutoRecovered) (6) (1).pdf
pandeydiksha814
 
Web dev -ppt that helps us understand web technology
shubhragoyal12
 
Power BI in Business Intelligence with AI
KPR Institute of Engineering and Technology
 
The_Future_of_Data_Analytics_by_CA_Suvidha_Chaplot_UPDATED.pdf
CA Suvidha Chaplot
 
An Uncut Conversation With Grok | PDF Document
Mike Hydes
 
717629748-Databricks-Certified-Data-Engineer-Professional-Dumps-by-Ball-21-03...
pedelli41
 
Company Presentation pada Perusahaan ADB.pdf
didikfahmi
 
Future_of_AI_Presentation for everyone.pptx
boranamanju07
 
World-population.pptx fire bunberbpeople
umutunsalnsl4402
 
Grade 5 PPT_Science_Q2_W6_Methods of reproduction.ppt
AaronBaluyut
 
Complete_STATA_Introduction_Beginner.pptx
mbayekebe
 
Data-Driven Machine Learning for Rail Infrastructure Health Monitoring
Sione Palu
 
IP_Journal_Articles_2025IP_Journal_Articles_2025
mishell212144
 
Employee Salary Presentation.l based on data science collection of data
barridevakumari2004
 

Nicolas Kruchten @ Datacratic

  • 1. RTB Optimizer: Behind the scenes witha Predictive API Nicolas KruchtenPAPIs.io –November 18, 2014 REAL TIME MACHINE LEARNING DECISIONS AS A SERVICE
  • 2. About Datacratic •Software company specializing in high performance systems andmachine learning •30 employees, founded in 2009, based in Montréal, Québec, Canada with an office in New York •3 Predictive APIs in market today •Building a Machine Learning Database to help others build Predictive APIs and Apps
  • 3. Real-Time Bidding for online advertising Real-Time Exchange Bidder Bidder Bidder Bidder Web Browser GET ad bid requests
  • 4. Real-Time Bidding for online advertising Real-Time Exchange Bidder Bidder Bidder Bidder Web Browser ad bids auction
  • 5. Real-Time Bidding for online advertising Real-Time Exchange Bidder Bidder Bidder Bidder Web Browser This happens millions of times per second Bidders must respond within 100 milliseconds ad bids auction
  • 6. Real-Time Bidding for online advertising Real-Time Exchange Bidder Bidder Bidder Bidder Web Browser RTB Optimizer enables bidders to achieve campaign goals ad bids auction
  • 7. Campaign goals •Advertising campaignsare typically outcome-oriented –Clicks –Video views –Conversions: app installs, purchases, sign-ups •e.g. Ad network has sold someone 1,000 outcomes for $1,000 •e.g. Advertiser has $1,000 to get as many outcomes as possible •Essentially maximize profit or minimize cost-per-outcome
  • 8. Datacratic’s RTB Optimizer •Client bidder relays bid-requests to API, API tells it how to bid •Handles 100,000 queries per second, for 100s of campaign •API says which campaign should bid and how much •API also needs outcomes in real-time and campaign goals
  • 9. RTB Optimizer Bids API Outcomes API
  • 10. A Predictive API that learns •Datacratichas no proprietary data set •API can learn from scratch from the bid-request stream what works for each campaign: –Contextual features: website, time of day, banner size and placement –User features: geo-location, browser, language, # of impressions shown –Customer-provided data: about the user, about the website •Provides insightsinto what features are driving performance •Can re-use learningsfrom previous campaigns
  • 11. Second price auctions •First Price Auctions –You bid $1, I bid $2: I win, and I pay $2 •RTB uses Second Price Auctions –You bid $1, I bid $2: I win, and I pay $1 •Optimal bid = E[ value ] –Say it’s worth $2 to me –I will never bid more than $2 –If I bid $1.50 and you bid $1.75: I’ve lost an opportunity for $0.25 surplus! –I should always bid $2
  • 12. Don’t buy lottery tickets! E[ value ] = payout * P( getting the payout )
  • 13. What’s it to you? •If client gets paid $10,000 for 1,000 then payout = $10E[ value | bid-request ] = $10 * P( conversion | bid-request ) •What was an economics problem is now a prediction problem •We need to calibrate to predict true probabilities
  • 14. RTB Optimizer Bids API E[ value ] Outcomes API P( outcome )
  • 15. Collecting the data •To compute P( X | Y ) we need examples of Y’s with an X label •RTB Optimizer uses mix of strategies to meet campaign goals •Probe strategy bids randomly to collect data •Optimized strategy bids with E[ value] •Automatic training/retraining when API see enough examples
  • 16. RTB Optimizer Probe Bids API E[ value ] Training Outcomes API P( outcome )
  • 17. Bias control •Never stop the probe strategy •Always need control group for evaluation, retraining •Risk of filter bubbles: future models trained on previous output •Bid requests are randomly routed to probe, less often over time •Models automatically back-tested before deployment
  • 18. How to learn in real-time •Classify using bagged generalized linear models •Generate non-linear features with statistics tables •Periodically retrain classifier •Continuously update stats tables
  • 19. Statistics Table by example Table Bucket Impressions Outcomes Outcomes/Impressions 95%Confidence Lower Bound on Outcomes/Impressions Browser Chrome 5M 3k 0.060% 0.058% Firefox 3M 1k 0.033% 0.031% Website abc.com 4M 2k 0.050% 0.048% xyz.com 1k 10 1.000% 0.481%
  • 20. RTB Optimizer Probe Bids API E[ value ] Training Outcomes API GLZ Classifier Stats Tables Real-Time Batch
  • 21. Implementation details (are everything) •100k requests per second, 10 millisecond latency, running 24/7,1 trillion predictions to date •Distributed system, written in C++ 11 •AWS: data in S3, training runs on Amazon EC2 spot market •http://opensource.datacratic.com/ –RTBkit –JML –StarCluster
  • 22. Does it work? Classification success? ROC or calibration curves…
  • 23. Does it work? Classification success? ROC and calibration curves… Optimization success? 80% reductions in cost-per-outcome…
  • 24. Does it work? Classification success? ROC or calibration curves… Optimization success? 80% reductions in cost-per-outcome… Customer success! 25% monthly growth
  • 25. Thanks! [email protected] REAL TIME MACHINE LEARNING DECISIONS AS A SERVICE