SlideShare a Scribd company logo
RecSys FR: 4th Session
6th October 2016
Etienne Sanson – manager R&D Engine
About Criteo
1
3 | Copyright © 2016 Criteo
Our mission
TARGET THE
RIGHT USER
AT THE
RIGHT TIME
WITH THE RIGHT
MESSAGE
4 | Copyright © 2016 Criteo
Key Figures
16 000
PUBLISHERS
90%
RETENTION RATE2
+130
COUNTRIES
LISTED ON THE
NASDAQ
SINCE
OCTOBER 2013
R&D REPRESENTS 21%
OF THE WORKFORCE
2000
EMPLOYEES
21
BILLIONS $3
11 000
ADVERTISERS
1.19 bn€1
31
OFFICES
1: REVENUE IN 2015
2: ANNUAL RATE 2015
3: $ OF TURNOVER GENERATED TO OUR CLIENTS - TURNOVER POST-CLICK WW FROM JANUARY TO DECEMBER
2015
5 | Copyright © 2016 Criteo
Revenue Growth
2009 2010 2011 2012 2013
22M$
86M$
199M$
349M$
589M$
988M$
2014
1,3MM$
2015
6 | Copyright © 2016 Criteo
GENERAL CONCEPT
Users visit an
advertiser’s website
1
Criteo identifies the users
(via cookies)
2
Users leave the advertiser’s website
& browse publisher on the Internet
3
Criteo identifies users on
these pages
(via cookie)
4
Criteo displays an advertising
banner, personalized for
each user
5
Click through directly
to the advertiser’s
page
6
@
Retargeting principles
7 | Copyright © 2016 Criteo
Infrastructure Key Figures
Sunnyvale
2 PoP
500 kVA
1 559 Servers
New York
2 PoP
930 kVA
2 625 Servers
Hong Kong
2 PoP
472 kVA
2185 Servers
Paris
4 Pop
1 800 kVA
3 625 Servers
Amsterdam
2 PoP
+2 500 kVA
3 609 Servers
Tokyo
2 PoP
455 kVA
2 564 Servers
Shanghai
1 PoP
200 kVA
931 Servers
World Wide
15 PoP
6,8 MVA
17 098 Servers
> 55Gbps
+ 2.5M req/s
Hosting Global Partners :
About ML@Criteo
2
9 | Copyright © 2016 Criteo
Our challenges – Product recommendation
• Select the best ~10 products to show to a user
• >1B users
• Product catalog contains ~1M items, up to 1B
• Time constraints: 20ms
• Combination of offline/online processing steps
• CF
• Product embeddings (word2vec -> prod2vec)
• CNN for image features
What products should
we recommend?
10 | Copyright © 2016 Criteo
Our challenges - Bidding
How much should we
bid for this display?
What is the best
campaign to display?
My company
BUY! BUY! BUY!
BUY!
• Select the best campaign to display and evaluate its
value in a few ms
• Large scale regression models
• >1B daily displays (but few positive examples!)
• >1M parameters
• Distributed optimization (SGD, L-BFGS)
• Feature Engineering
• Transfer learning, FFM, Policy learning
• Marketplace, game theory, auction theory
11 | Copyright © 2016 Criteo
Our challenges – X-device
• Build a huge graph (Billions of nodes/edges):
• Nodes = devices
• Edge = the 2 devices belong to the same user
• How to connect 2 devices?
• How to know the ground truth?
• How to keep it stable?
• At scale & taking care about privacy
Who is the user
behind the device?
12 | Copyright © 2016 Criteo
Our challenges – Testing
• We test everything!
• Offline tests / AB Tests
• Infrastructure to perform large-scale tests
• >100K offline tests / year
• >1K AB Tests / year
• Dedicated teams
• Technical / Business Metrics
• Randomization
• Counterfactual evaluation
Thank you!
…and we’re hiring!

More Related Content

PDF
Pulpix - Video Recommendation at Scale
recsysfr
 
PDF
New machine learning challenges at Criteo
Olivier Koch
 
PPTX
Machine learning at Criteo - Paris Datageeks
Nicolas Le Roux
 
PDF
New challenges for scalable machine learning in online advertising
Olivier Koch
 
PPSX
EN - Criteo - BD Deck -July 2014 - rebrand
Djilali Zitouni
 
PDF
[Correction 250+ Attendees] Event Report - ADP ReThink 2019 - Global Success,...
Holger Mueller
 
PDF
Ingrid IHASZ: Hungarian digital & programmatic advertising market overview / ...
Ingrid Ihasz
 
PDF
Cwin16 tls-s daniel-georice
Capgemini
 
Pulpix - Video Recommendation at Scale
recsysfr
 
New machine learning challenges at Criteo
Olivier Koch
 
Machine learning at Criteo - Paris Datageeks
Nicolas Le Roux
 
New challenges for scalable machine learning in online advertising
Olivier Koch
 
EN - Criteo - BD Deck -July 2014 - rebrand
Djilali Zitouni
 
[Correction 250+ Attendees] Event Report - ADP ReThink 2019 - Global Success,...
Holger Mueller
 
Ingrid IHASZ: Hungarian digital & programmatic advertising market overview / ...
Ingrid Ihasz
 
Cwin16 tls-s daniel-georice
Capgemini
 

Viewers also liked (9)

PDF
Injecting semantic links into a graph-based recommender system
recsysfr
 
PDF
What can bring library metadata to the web? Trust, links and love
recsysfr
 
PDF
Dictionary Learning for Massive Matrix Factorization
recsysfr
 
PDF
Predictive quality metrics @ tinyclues - Artem Kozhevnikov - Tinyclues
recsysfr
 
PDF
Recommendation @ Meetic
recsysfr
 
PDF
Meta-Prod2Vec: Simple Product Embeddings with Side-Information
recsysfr
 
PDF
Sequential Learning in the Position-Based Model
recsysfr
 
PDF
CONTENT2VEC: a Joint Architecture to use Product Image and Text for the task ...
recsysfr
 
PDF
Highlights on most interesting RecSys papers - Elena Smirnova, Lowik Chanusso...
recsysfr
 
Injecting semantic links into a graph-based recommender system
recsysfr
 
What can bring library metadata to the web? Trust, links and love
recsysfr
 
Dictionary Learning for Massive Matrix Factorization
recsysfr
 
Predictive quality metrics @ tinyclues - Artem Kozhevnikov - Tinyclues
recsysfr
 
Recommendation @ Meetic
recsysfr
 
Meta-Prod2Vec: Simple Product Embeddings with Side-Information
recsysfr
 
Sequential Learning in the Position-Based Model
recsysfr
 
CONTENT2VEC: a Joint Architecture to use Product Image and Text for the task ...
recsysfr
 
Highlights on most interesting RecSys papers - Elena Smirnova, Lowik Chanusso...
recsysfr
 
Ad

Similar to RecsysFR: Criteo presentation (20)

PPTX
Selling Display to Your SMB Clients
Acquisio
 
PPTX
Selling display display to your smb clients final
Acquisio
 
PDF
Thomvest Advertising Technology overview - Sept 2014
andrewtweed1
 
PDF
Algoth presentation regarding trading automation
maksymkryvenko2
 
PPTX
Tag Management : A Key Component of an International Digital Strategy
Ensighten
 
PPTX
Seedtag Media Kit 2024 Faaor UAE and KSA
AdilHussain801079
 
PDF
Oct-2016_AdAsia Credentials
Dang Pham
 
PPTX
Machine Learning for Computational Advertising
Paris Women in Machine Learning and Data Science
 
PPTX
EBU (Geneva, April 2016)
Nice People At Work
 
PPTX
Big Data at Tube: Events to Insights to Action
Murtaza Doctor
 
PPTX
From Analytics to Intelligence
Catherine Mylinh
 
PDF
SCREENlens - People Tracking with Quividi
pilot Screentime GmbH
 
PPTX
The Importance of a Data-Driven Dynamic Creative Strategy
In Marketing We Trust
 
PDF
Smaato - NOAH16 Berlin
NOAH Advisors
 
PDF
S4M - NOAH15 London
NOAH Advisors
 
PDF
Media Management Workshop by OpenText and Skillz Middle East
Dieter Hovorka
 
PDF
Zinnov Zones - Media & Technology 2016
Zinnov
 
PDF
Odoo Strategy and Roadmap
Odoo
 
PDF
Keynotes Odoo - 2014 Opendays by Fabien (CEO @ Odoo)
Mustufa Rangwala
 
PDF
The Very Best Intranets & Digital Workplace from the 2016 Intranet Global Forum
Prescient Digital Media
 
Selling Display to Your SMB Clients
Acquisio
 
Selling display display to your smb clients final
Acquisio
 
Thomvest Advertising Technology overview - Sept 2014
andrewtweed1
 
Algoth presentation regarding trading automation
maksymkryvenko2
 
Tag Management : A Key Component of an International Digital Strategy
Ensighten
 
Seedtag Media Kit 2024 Faaor UAE and KSA
AdilHussain801079
 
Oct-2016_AdAsia Credentials
Dang Pham
 
Machine Learning for Computational Advertising
Paris Women in Machine Learning and Data Science
 
EBU (Geneva, April 2016)
Nice People At Work
 
Big Data at Tube: Events to Insights to Action
Murtaza Doctor
 
From Analytics to Intelligence
Catherine Mylinh
 
SCREENlens - People Tracking with Quividi
pilot Screentime GmbH
 
The Importance of a Data-Driven Dynamic Creative Strategy
In Marketing We Trust
 
Smaato - NOAH16 Berlin
NOAH Advisors
 
S4M - NOAH15 London
NOAH Advisors
 
Media Management Workshop by OpenText and Skillz Middle East
Dieter Hovorka
 
Zinnov Zones - Media & Technology 2016
Zinnov
 
Odoo Strategy and Roadmap
Odoo
 
Keynotes Odoo - 2014 Opendays by Fabien (CEO @ Odoo)
Mustufa Rangwala
 
The Very Best Intranets & Digital Workplace from the 2016 Intranet Global Forum
Prescient Digital Media
 
Ad

More from recsysfr (15)

PPTX
Multi Task DPP for Basket Completion by Romain WARLOP, Fifty Five
recsysfr
 
PDF
Building a recommender system with Annoy and Word2Vec by Cristian PEREZ, Kern...
recsysfr
 
PDF
An Homophily-based Approach for Fast Post Recommendation in Microblogging Sys...
recsysfr
 
PDF
Recommendations @ Rakuten Group
recsysfr
 
PDF
Data-Driven Recommender Systems
recsysfr
 
PPTX
Recommender systems
recsysfr
 
PDF
Recommendation @Deezer
recsysfr
 
PPTX
Flexible recommender systems based on graphs
recsysfr
 
PPTX
Using Neural Networks to predict user ratings
recsysfr
 
PDF
Preference Elicitation in Mangaki: Is Your Taste Kinda Weird?
recsysfr
 
PDF
Recommendation @ PriceMinister-Rakuten - Road to personalization
recsysfr
 
PDF
Rakuten Institute of Technology Paris
recsysfr
 
PDF
Tailor-made personalization and recommendation - Sailendra
recsysfr
 
PDF
New tools from the bandit literature to improve A/B Testing
recsysfr
 
PDF
Story of the algorithms behind Deezer Flow
recsysfr
 
Multi Task DPP for Basket Completion by Romain WARLOP, Fifty Five
recsysfr
 
Building a recommender system with Annoy and Word2Vec by Cristian PEREZ, Kern...
recsysfr
 
An Homophily-based Approach for Fast Post Recommendation in Microblogging Sys...
recsysfr
 
Recommendations @ Rakuten Group
recsysfr
 
Data-Driven Recommender Systems
recsysfr
 
Recommender systems
recsysfr
 
Recommendation @Deezer
recsysfr
 
Flexible recommender systems based on graphs
recsysfr
 
Using Neural Networks to predict user ratings
recsysfr
 
Preference Elicitation in Mangaki: Is Your Taste Kinda Weird?
recsysfr
 
Recommendation @ PriceMinister-Rakuten - Road to personalization
recsysfr
 
Rakuten Institute of Technology Paris
recsysfr
 
Tailor-made personalization and recommendation - Sailendra
recsysfr
 
New tools from the bandit literature to improve A/B Testing
recsysfr
 
Story of the algorithms behind Deezer Flow
recsysfr
 

Recently uploaded (20)

PPTX
Microsoft PowerPoint Student PPT slides.pptx
Garleys Putin
 
PPTX
Parallel & Concurrent ...
yashpavasiya892
 
PDF
PDF document: World Game (s) Great Redesign.pdf
Steven McGee
 
PPTX
原版北不列颠哥伦比亚大学毕业证文凭UNBC成绩单2025年新版在线制作学位证书
e7nw4o4
 
PDF
LOGENVIDAD DANNYFGRETRRTTRRRTRRRRRRRRR.pdf
juan456ytpro
 
PPTX
Perkembangan Perangkat jaringan komputer dan telekomunikasi 3.pptx
Prayudha3
 
PDF
DNSSEC Made Easy, presented at PHNOG 2025
APNIC
 
PDF
Project English Paja Jara Alejandro.jpdf
AlejandroAlonsoPajaJ
 
PPTX
Crypto Recovery California Services.pptx
lionsgate network
 
PDF
UI/UX Developer Guide: Tools, Trends, and Tips for 2025
Penguin peak
 
PDF
LB# 820-1889_051-7370_C000.schematic.pdf
matheusalbuquerqueco3
 
PPT
Introduction to dns domain name syst.ppt
MUHAMMADKAVISHSHABAN
 
PPTX
办理方法西班牙假毕业证蒙德拉贡大学成绩单MULetter文凭样本
xxxihn4u
 
PPTX
Generics jehfkhkshfhskjghkshhhhlshluhueheuhuhhlhkhk.pptx
yashpavasiya892
 
PDF
Generative AI Foundations: AI Skills for the Future of Work
hemal sharma
 
PDF
Data Protection & Resilience in Focus.pdf
AmyPoblete3
 
PPT
1965 INDO PAK WAR which Pak will never forget.ppt
sanjaychief112
 
PPTX
The Internet of Things (IoT) refers to a vast network of interconnected devic...
chethana8182
 
PPTX
dns domain name system history work.pptx
MUHAMMADKAVISHSHABAN
 
PPTX
ppt lighfrsefsefesfesfsefsefsefsefserrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrt.pptx
atharvawafgaonkar
 
Microsoft PowerPoint Student PPT slides.pptx
Garleys Putin
 
Parallel & Concurrent ...
yashpavasiya892
 
PDF document: World Game (s) Great Redesign.pdf
Steven McGee
 
原版北不列颠哥伦比亚大学毕业证文凭UNBC成绩单2025年新版在线制作学位证书
e7nw4o4
 
LOGENVIDAD DANNYFGRETRRTTRRRTRRRRRRRRR.pdf
juan456ytpro
 
Perkembangan Perangkat jaringan komputer dan telekomunikasi 3.pptx
Prayudha3
 
DNSSEC Made Easy, presented at PHNOG 2025
APNIC
 
Project English Paja Jara Alejandro.jpdf
AlejandroAlonsoPajaJ
 
Crypto Recovery California Services.pptx
lionsgate network
 
UI/UX Developer Guide: Tools, Trends, and Tips for 2025
Penguin peak
 
LB# 820-1889_051-7370_C000.schematic.pdf
matheusalbuquerqueco3
 
Introduction to dns domain name syst.ppt
MUHAMMADKAVISHSHABAN
 
办理方法西班牙假毕业证蒙德拉贡大学成绩单MULetter文凭样本
xxxihn4u
 
Generics jehfkhkshfhskjghkshhhhlshluhueheuhuhhlhkhk.pptx
yashpavasiya892
 
Generative AI Foundations: AI Skills for the Future of Work
hemal sharma
 
Data Protection & Resilience in Focus.pdf
AmyPoblete3
 
1965 INDO PAK WAR which Pak will never forget.ppt
sanjaychief112
 
The Internet of Things (IoT) refers to a vast network of interconnected devic...
chethana8182
 
dns domain name system history work.pptx
MUHAMMADKAVISHSHABAN
 
ppt lighfrsefsefesfesfsefsefsefsefserrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrt.pptx
atharvawafgaonkar
 

RecsysFR: Criteo presentation

  • 1. RecSys FR: 4th Session 6th October 2016 Etienne Sanson – manager R&D Engine
  • 3. 3 | Copyright © 2016 Criteo Our mission TARGET THE RIGHT USER AT THE RIGHT TIME WITH THE RIGHT MESSAGE
  • 4. 4 | Copyright © 2016 Criteo Key Figures 16 000 PUBLISHERS 90% RETENTION RATE2 +130 COUNTRIES LISTED ON THE NASDAQ SINCE OCTOBER 2013 R&D REPRESENTS 21% OF THE WORKFORCE 2000 EMPLOYEES 21 BILLIONS $3 11 000 ADVERTISERS 1.19 bn€1 31 OFFICES 1: REVENUE IN 2015 2: ANNUAL RATE 2015 3: $ OF TURNOVER GENERATED TO OUR CLIENTS - TURNOVER POST-CLICK WW FROM JANUARY TO DECEMBER 2015
  • 5. 5 | Copyright © 2016 Criteo Revenue Growth 2009 2010 2011 2012 2013 22M$ 86M$ 199M$ 349M$ 589M$ 988M$ 2014 1,3MM$ 2015
  • 6. 6 | Copyright © 2016 Criteo GENERAL CONCEPT Users visit an advertiser’s website 1 Criteo identifies the users (via cookies) 2 Users leave the advertiser’s website & browse publisher on the Internet 3 Criteo identifies users on these pages (via cookie) 4 Criteo displays an advertising banner, personalized for each user 5 Click through directly to the advertiser’s page 6 @ Retargeting principles
  • 7. 7 | Copyright © 2016 Criteo Infrastructure Key Figures Sunnyvale 2 PoP 500 kVA 1 559 Servers New York 2 PoP 930 kVA 2 625 Servers Hong Kong 2 PoP 472 kVA 2185 Servers Paris 4 Pop 1 800 kVA 3 625 Servers Amsterdam 2 PoP +2 500 kVA 3 609 Servers Tokyo 2 PoP 455 kVA 2 564 Servers Shanghai 1 PoP 200 kVA 931 Servers World Wide 15 PoP 6,8 MVA 17 098 Servers > 55Gbps + 2.5M req/s Hosting Global Partners :
  • 9. 9 | Copyright © 2016 Criteo Our challenges – Product recommendation • Select the best ~10 products to show to a user • >1B users • Product catalog contains ~1M items, up to 1B • Time constraints: 20ms • Combination of offline/online processing steps • CF • Product embeddings (word2vec -> prod2vec) • CNN for image features What products should we recommend?
  • 10. 10 | Copyright © 2016 Criteo Our challenges - Bidding How much should we bid for this display? What is the best campaign to display? My company BUY! BUY! BUY! BUY! • Select the best campaign to display and evaluate its value in a few ms • Large scale regression models • >1B daily displays (but few positive examples!) • >1M parameters • Distributed optimization (SGD, L-BFGS) • Feature Engineering • Transfer learning, FFM, Policy learning • Marketplace, game theory, auction theory
  • 11. 11 | Copyright © 2016 Criteo Our challenges – X-device • Build a huge graph (Billions of nodes/edges): • Nodes = devices • Edge = the 2 devices belong to the same user • How to connect 2 devices? • How to know the ground truth? • How to keep it stable? • At scale & taking care about privacy Who is the user behind the device?
  • 12. 12 | Copyright © 2016 Criteo Our challenges – Testing • We test everything! • Offline tests / AB Tests • Infrastructure to perform large-scale tests • >100K offline tests / year • >1K AB Tests / year • Dedicated teams • Technical / Business Metrics • Randomization • Counterfactual evaluation

Editor's Notes

  • #10: How to connect? We have some data that allows for deterministic match, but we also have to build a probabilistic match
  • #11: Hashing trick, one-hot encoding, distributed optimization Tradeoff between freshness of model and historical data Irma
  • #12: How to connect? We have some data that allows for deterministic match, but we also have to build a probabilistic match
  • #13: trade-off fast-cheap / expensive-truth