Find out how DataScience has revolutionized SEO for OVH
DATA-SEO
NEXT LEVEL
VINCENT TERRASI / REMI BACHA
HEAD OF DATA / HEAD OF SEO
@vincentterrasi / @remibacha
Any sufficiently advanced technology
is indistinguishable from magic
ARTHUR CLARKE
Find out how DataScience has revolutionized SEO for OVH
BIG DATA
ARTIFICIAL INTELLIGENCE
DATA SCIENCE PROJECT
SEO PROJECTOVH
Find out how DataScience has revolutionized SEO for OVH
Find out how DataScience has revolutionized SEO for OVH
Find out how DataScience has revolutionized SEO for OVH
Find out how DataScience has revolutionized SEO for OVH
Find out how DataScience has revolutionized SEO for OVH
SEO
IS A BIG DATA JOB
UNDERSTAND
DATA
MANIPULATE
& ANALYSE
BRING VALUE
TO DATA
DATA SCIENCE
EMPIRICISM
01.
MAKE OBSERVATIONS
05BIS.
REFINE, ALTER, EXPAND,
OR REJECT HYPOTHESES
04.
DEVELOP TESTABLE
PREDICTIONS
02.
THINK OF INTERESTING
QUESTIONS
06.
DEVELOP
GENERAL THEORIES
05.
GATHER DATA TO TEST
PREDICTIONS
03.
FORMULATE
HYPOTHESES
DATA CENTRIC
EMPIRICISM
RANK BRAIN
Find out how DataScience has revolutionized SEO for OVH
01 02 03 04
CHANGING SEO FACTORS NEW FACTORS RANKING MISTAKES ULTRA-
PERSONNALISATION
IT’S TIME
TO UPGRADE SEO
MACHINE LEARNING
IA
BIG DATA
DATA SCIENCE
DEEP LEARNING
RANKBRAIN
WELCOME TO THE
DATA SEO ERA
NEW JOB
DATA SCIENTIST SEO
LEARNING DATA
SCIENCE
– Data Scientist Toolbox
– Getting & cleaning Data
– R / Python Programming
– Explorary data
– Machine Learning
– Big Data
SEO DATAMART
COMPETITORS
OTHER TRAFFIC SOURCES DATA
SOCIAL NETWORK
SEARCH CONSOLE
CRAWLS
STOCK, PRICES, SALES DATA
CUSTOMERS DATA
EVENTS
WEB ANALYTICS
NETLINKING
SEMANTICAL
WEBPERFS
SEARCH TRENDS
SERVER LOGS
SEO DATAMART
COMPETITORS
CRAWLS
NETLINKING
SEMANTICAL
WEBPERFS
XGBOOST
33TREES
10MAX DEPTH
100WAS GRID OF SIZE
ROC AUC : 0.915
?
?
?
?
?
?
?
MOST IMPORTANT VARIABLES
Screamingfrog_in_csv
Semrush_out_csv
Screamingfrog_in_csv_pre
pared
Semrush_
screamingfrog_out
_postgres
Majestic_out_
postgres
Visiblis_out_
postgres
Semrush_
screamingfrog_
majestic_visiblis_
Prediction
(XGBOOST_
CLASSIFICATION) on
DATAIKU DSS
The most complete
Data Science platform
Data
Preparation
Machine
Learning
Deployment Collaboration
WHY PREDICT
GOOGLE RANKINGS?
HOW TO PREDICT
GOOGLE RANKINGS?
GETTING SERP
DATA FROM SEMRUSH
Find out how DataScience has revolutionized SEO for OVH
Find out how DataScience has revolutionized SEO for OVH
CLEAN
DATA
REMOVE INVALID URLS
Slow Crawl
Rate
Non-HTML
Content
Network
Problems
Slow
Web Servers
WAIT TIMES
Errors from
Web Servers
URL Moved Permanently
Redirect (301)
URL Moved Temporarily
Redirect (302)
Authentication Required (401)
or Document Not Found (404)
Cyclic
Redirects
Find out how DataScience has revolutionized SEO for OVH
Find out how DataScience has revolutionized SEO for OVH
CREATE PREDICTION MODEL
XGBOOST
Adaptive boosting
Gradient boosting
Bagging
Random forest
BIAS RELATED
ERRORS
VARIANCE RELATED
ERRORS
Find out how DataScience has revolutionized SEO for OVH
Find out how DataScience has revolutionized SEO for OVH
?
?
?
?
?
?
?
XGBOOST
33TREES
10MAX DEPTH
100WAS GRID OF SIZE
ROC AUC : 0.915
MOST IMPORTANT VARIABLES
ExtBackLinks
RefDomains
TrustFlow
External Outlinks
Response Time
CitationFlow
TAKE AWAY
…
AUTOMATED MACHINE
LEARNING WITH DATAIKU
AUTOMATED KPI REPORTING SEO DATALAKE TEXT GENERATION
OPPORTUNITIES DETECTION PREDICTIVE ANALYSIS PROCESS MINING
AUTOMATED MACHINE
LEARNING WITH DATAIKU
SEO DATAMART
NOW, MACHINES CAN LEARN
AND ADAPT, IT IS TIME TO TAKE
ADVANTAGE OF THE OPPORTUNITY TO
CREATE NEW JOBS.
Data-SEO, Data-Doctor,
Data-Journalist …
THANK YOU
GET ALL OUR LAST DISCOVERIES AND UPDATES
Vincent TERRASI
@vincentterrasi
Remi BACHA
@remibacha
Data-seo.com Remibacha.com

More Related Content

PPTX
How Data Science can boost your SEO ?
PPTX
All About HTML Tags
PDF
Keeping Things Lean & Mean: Crawl Optimisation - Search Marketing Summit AU
PPTX
Tom Capper Mozcon 2021 - Core Web Vitals - The Fast & The Spurious
PDF
Advanced data-driven technical SEO - SMX London 2019
PDF
FoundConf 2018 Signals Speak - Alexis Sanders
PDF
Crawl Budget - Some Insights & Ideas @ seokomm 2015
PPTX
Everyone Screws Up HTTPS
How Data Science can boost your SEO ?
All About HTML Tags
Keeping Things Lean & Mean: Crawl Optimisation - Search Marketing Summit AU
Tom Capper Mozcon 2021 - Core Web Vitals - The Fast & The Spurious
Advanced data-driven technical SEO - SMX London 2019
FoundConf 2018 Signals Speak - Alexis Sanders
Crawl Budget - Some Insights & Ideas @ seokomm 2015
Everyone Screws Up HTTPS

What's hot (20)

PDF
A Hybrid Recommender with Yelp Challenge Data
PPT
A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox
PDF
SEO for Librarians
PPTX
Google's Top 3 Ranking Factors - Content, Links, and RankBrain - Raleigh SEO ...
PPTX
Scrape box presentation
PDF
SearchLeeds 2018 - Steve Chambers - Stickyeyes - How not to F**K up a Migration
PDF
Infinite Loops Dirty Architecture And Too Many Indexed URLs
PPTX
Mobile-First Indexing and AMP - SMX Advanced 2018
PDF
Log analysis and pro use cases for search marketers online version (1)
PDF
Negotiating crawl budget with googlebots
PPTX
Better Safe Than Sorry with HTTPS - SMX East 2016 - Patrick Stox
PPTX
React JS and Search Engines - Patrick Stox at Triangle ReactJS Meetup
PDF
Location-Free Local SEO
PPTX
the SEO cyborg - Moz 2018 (full edition)
PPTX
.htaccess for SEOs - A presentation by Roxana Stingu
PPTX
Search Like a Pro: Sources and Strategies for Conducting Effective Research
PPTX
SMX Advanced 2018 SEO for Javascript Frameworks by Patrick Stox
PPT
Pubcon Vegas 2017 You're Going To Screw Up International SEO - Patrick Stox
PPT
Search Hubs and Custom Search Engines (ILI2007)
PPTX
TechSEO Boost 2017: The State of Technical SEO
A Hybrid Recommender with Yelp Challenge Data
A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox
SEO for Librarians
Google's Top 3 Ranking Factors - Content, Links, and RankBrain - Raleigh SEO ...
Scrape box presentation
SearchLeeds 2018 - Steve Chambers - Stickyeyes - How not to F**K up a Migration
Infinite Loops Dirty Architecture And Too Many Indexed URLs
Mobile-First Indexing and AMP - SMX Advanced 2018
Log analysis and pro use cases for search marketers online version (1)
Negotiating crawl budget with googlebots
Better Safe Than Sorry with HTTPS - SMX East 2016 - Patrick Stox
React JS and Search Engines - Patrick Stox at Triangle ReactJS Meetup
Location-Free Local SEO
the SEO cyborg - Moz 2018 (full edition)
.htaccess for SEOs - A presentation by Roxana Stingu
Search Like a Pro: Sources and Strategies for Conducting Effective Research
SMX Advanced 2018 SEO for Javascript Frameworks by Patrick Stox
Pubcon Vegas 2017 You're Going To Screw Up International SEO - Patrick Stox
Search Hubs and Custom Search Engines (ILI2007)
TechSEO Boost 2017: The State of Technical SEO
Ad

Similar to Find out how DataScience has revolutionized SEO for OVH (20)

PDF
Talent42 2017: Building the Best Recruiting Tech Stack - Nick Mailey and Will...
PDF
What Managers Need to Know about Data Science
PPTX
Just ask Watson Seminar
PDF
How to Build Data Science Teams that Deliver Business Value
PDF
From Rocket Science to Data Science
PPTX
Data Science Demystified
PDF
The Right Data Warehouse: Automation Now, Business Value Thereafter
PDF
EDW 2015 cognitive computing panel session
PPTX
Keynote Dubai
PPTX
Matt McIlwain opening keynote
PDF
Business in the Driver’s Seat – An Improved Model for Integration
PDF
Python for Data Science - TDC 2015
PDF
Big Data Concepts Technologies And Applications Mohammad Shahid Husain
PDF
The Analytic Platform: Empowering the Business Now
PDF
From Volume to Value - A Guide to Data Engineering
PDF
GalvanizeU Seattle: Eleven Almost-Truisms About Data
PDF
CalPoly App Dev Club Presentation
PPTX
Predictive analytics from a to z
PDF
How to crack Big Data and Data Science roles
PDF
Data Science for Marketing
Talent42 2017: Building the Best Recruiting Tech Stack - Nick Mailey and Will...
What Managers Need to Know about Data Science
Just ask Watson Seminar
How to Build Data Science Teams that Deliver Business Value
From Rocket Science to Data Science
Data Science Demystified
The Right Data Warehouse: Automation Now, Business Value Thereafter
EDW 2015 cognitive computing panel session
Keynote Dubai
Matt McIlwain opening keynote
Business in the Driver’s Seat – An Improved Model for Integration
Python for Data Science - TDC 2015
Big Data Concepts Technologies And Applications Mohammad Shahid Husain
The Analytic Platform: Empowering the Business Now
From Volume to Value - A Guide to Data Engineering
GalvanizeU Seattle: Eleven Almost-Truisms About Data
CalPoly App Dev Club Presentation
Predictive analytics from a to z
How to crack Big Data and Data Science roles
Data Science for Marketing
Ad

More from Vincent Terrasi (14)

PDF
SEO CAMP'us Paris 2024 - Déploiement de l'IA générative privée dans les organ...
PDF
IA générative : Menace ou Opportunité pour le SEO
PPTX
slides SEO CAMP'us Paris 2022 - Google et tools SEO On vous a menti
PPTX
Une IA pour votre SEO, une méthode inédite pour accélérer vos projets Data SEO
PPTX
SEO AnswerBox, une méthode inédite pour interroger vos données et créer vos d...
PPTX
Génération de contenu pour le SEO
PPTX
Comment faire du Data SEO sans savoir programmer ?
PPTX
Explainable Machine Learning for Ranking Factors
PPTX
Fausses données et Bad Data : restez vigilant !
PPTX
Comment les plateformes de Data Science métamorphosent le SEO ?
PPTX
How to boost your datamanagement with Dremio ?
PPTX
How to automate all your SEO projects
PPTX
Meetup Data-science OVH
PDF
Analyse your SEO Data with R and Kibana
SEO CAMP'us Paris 2024 - Déploiement de l'IA générative privée dans les organ...
IA générative : Menace ou Opportunité pour le SEO
slides SEO CAMP'us Paris 2022 - Google et tools SEO On vous a menti
Une IA pour votre SEO, une méthode inédite pour accélérer vos projets Data SEO
SEO AnswerBox, une méthode inédite pour interroger vos données et créer vos d...
Génération de contenu pour le SEO
Comment faire du Data SEO sans savoir programmer ?
Explainable Machine Learning for Ranking Factors
Fausses données et Bad Data : restez vigilant !
Comment les plateformes de Data Science métamorphosent le SEO ?
How to boost your datamanagement with Dremio ?
How to automate all your SEO projects
Meetup Data-science OVH
Analyse your SEO Data with R and Kibana

Recently uploaded (20)

PPTX
SET 1 Compulsory MNH machine learning intro
PPTX
DS-40-Pre-Engagement and Kickoff deck - v8.0.pptx
PPT
DU, AIS, Big Data and Data Analytics.ppt
PDF
A biomechanical Functional analysis of the masitary muscles in man
PDF
Tetra Pak Index 2023 - The future of health and nutrition - Full report.pdf
PDF
CS3352FOUNDATION OF DATA SCIENCE _1_MAterial.pdf
PPTX
ai agent creaction with langgraph_presentation_
PPTX
Tapan_20220802057_Researchinternship_final_stage.pptx
PDF
Navigating the Thai Supplements Landscape.pdf
PDF
technical specifications solar ear 2025.
PPTX
PPT for Diseases.pptx, there are 3 types of diseases
PPT
Image processing and pattern recognition 2.ppt
PPTX
machinelearningoverview-250809184828-927201d2.pptx
PPT
dsa Lec-1 Introduction FOR THE STUDENTS OF bscs
PPTX
865628565-Pertemuan-2-chapter-03-NUMERICAL-MEASURES.pptx
PPTX
recommendation Project PPT with details attached
PPTX
Statisticsccdxghbbnhhbvvvvvvvvvv. Dxcvvvhhbdzvbsdvvbbvv ccc
PPTX
MBA JAPAN: 2025 the University of Waseda
PPTX
indiraparyavaranbhavan-240418134200-31d840b3.pptx
PPTX
Machine Learning and working of machine Learning
SET 1 Compulsory MNH machine learning intro
DS-40-Pre-Engagement and Kickoff deck - v8.0.pptx
DU, AIS, Big Data and Data Analytics.ppt
A biomechanical Functional analysis of the masitary muscles in man
Tetra Pak Index 2023 - The future of health and nutrition - Full report.pdf
CS3352FOUNDATION OF DATA SCIENCE _1_MAterial.pdf
ai agent creaction with langgraph_presentation_
Tapan_20220802057_Researchinternship_final_stage.pptx
Navigating the Thai Supplements Landscape.pdf
technical specifications solar ear 2025.
PPT for Diseases.pptx, there are 3 types of diseases
Image processing and pattern recognition 2.ppt
machinelearningoverview-250809184828-927201d2.pptx
dsa Lec-1 Introduction FOR THE STUDENTS OF bscs
865628565-Pertemuan-2-chapter-03-NUMERICAL-MEASURES.pptx
recommendation Project PPT with details attached
Statisticsccdxghbbnhhbvvvvvvvvvv. Dxcvvvhhbdzvbsdvvbbvv ccc
MBA JAPAN: 2025 the University of Waseda
indiraparyavaranbhavan-240418134200-31d840b3.pptx
Machine Learning and working of machine Learning

Find out how DataScience has revolutionized SEO for OVH