The Latest in Advanced Technical SEO
Index Bloat & Discovery: from Facets to Frameworks
Hi!
Good Afternoon.
Ari Nahmani
CEO / Founder
Kahena Digital Marketing
ari@kahenadigital.com
Team
Clients
index bloat
crawl budget
Advanced Technical SEO - Index Bloat & Discovery: from Facets to Javascript Frameworks - SMX Munich 2016
web-tech > googlebot
discoverability
Today’s Session
• Technical SEO issues around e-commerce / large site architecture
• Preventing index bloat & preserving crawl budget as a core methodology
• Current solutions & upcoming threats (JS, AJAX, new frameworks, pre-rendering)
Index Bloat Prevention
A bloated index = indexed URLs > “unique pages”
On an ecommerce site: a bloated index = indexed URLs > sum(CAT + PDP + Static), i.e. category pages, product detail pages and static pages
On a ‘content’ site: a bloated index = indexed URLs > sum(Articles + Static)
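The definition above can be turned into a quick check. This is a minimal sketch, not a standard metric: the function names and the page counts in the usage note are hypothetical, and the indexed URL count is assumed to come from whatever source you trust (site: estimates, Search Console, a crawl).

```python
def index_bloat_ratio(indexed_urls, categories, products, static_pages):
    """Return indexed URLs divided by the number of unique pages.

    A ratio above 1.0 means more URLs are indexed than there are
    unique pages, i.e. the index is bloated by the definition above.
    """
    unique_pages = categories + products + static_pages
    if unique_pages <= 0:
        raise ValueError("unique page count must be positive")
    return indexed_urls / unique_pages


def is_bloated(indexed_urls, categories, products, static_pages):
    """True when indexed URLs exceed sum(CAT + PDP + Static)."""
    return index_bloat_ratio(indexed_urls, categories, products, static_pages) > 1.0
```

For a hypothetical shop with 200 category pages, 9,500 product pages and 300 static pages, `index_bloat_ratio(30000, 200, 9500, 300)` gives 3.0: three indexed URLs for every unique page.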
cannibalization
Index Bloat Prevention: Cannibalization
Index Bloat Prevention: Sorts & Facets
Index Bloat Prevention: Sorts & Filters
http://www.site.com/guys/tees/?prefn1=bvAverageRating&prefn2=colorGroup&prefv3=LG&srule=sortingNewArrival&prefv1=4&prefv2=RED&prefn3=size
Index Bloat Prevention: Sorts & Filters
<link rel="canonical" href="http://www.site.com/guys/tees/" />
• Basic Solution: Strip out the unnecessary
parameters
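The basic solution can be sketched in a few lines of Python. This is an illustration only, assuming every query parameter on a category URL is a sort or facet parameter that should be dropped; the function names are made up for the example.

```python
from html import escape
from urllib.parse import urlsplit, urlunsplit


def canonical_without_params(url):
    """Drop the query string and fragment, keeping scheme, host and path."""
    parts = urlsplit(url)
    return urlunsplit((parts.scheme, parts.netloc, parts.path, "", ""))


def canonical_tag(url):
    """Build the canonical link element for the stripped URL."""
    href = escape(canonical_without_params(url), quote=True)
    return f'<link rel="canonical" href="{href}" />'
```

Calling `canonical_tag()` on any sorted or filtered variant of `http://www.site.com/guys/tees/` returns the same tag, pointing at the bare category URL.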
Solution: Filtering Out All Facet Params
• PROS:
– Avoids diluted / duplicate URLs (the canonical tag is a request, not a directive)
• CONS:
– If you want or need specific parameters indexed and exposed (size, color), you need properly coded canonical tag logic; otherwise it is a recipe for major leaks and confusion.
– Considerations with pagination & view-all pages
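When some facets should stay indexable, the canonical logic needs an allowlist instead of stripping everything. A minimal sketch, assuming `color` and `size` are the only parameters worth exposing (a hypothetical choice for illustration) and that sorting the kept parameters is enough to collapse different click orders onto one URL:

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Hypothetical allowlist: facets that deserve their own indexable URL.
INDEXABLE_PARAMS = ("color", "size")


def canonical_with_allowlist(url, allowed=INDEXABLE_PARAMS):
    """Keep only allowlisted parameters, in a stable (sorted) order."""
    parts = urlsplit(url)
    pairs = parse_qsl(parts.query)
    kept = sorted((key, value) for key, value in pairs if key in allowed)
    return urlunsplit((parts.scheme, parts.netloc, parts.path, urlencode(kept), ""))
```

Both `?size=large&sort=new&color=green` and `?color=green&size=large` then canonicalize to `?color=green&size=large`, while a sort-only URL falls back to the bare category URL. Pagination and view-all handling are not covered here.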
Crawl Budget: Facet Parameter URLs
JS / AJAX Indexation
Index Bloat VS Discovery: JS + AJAX
Index Bloat Prevention: JS + AJAX
AJAX Refinement V1 = NO URL CHANGE
Index Bloat Prevention: JS + AJAX
AJAX Refinement V1 = NO URL CHANGE, but an inactive, different href= URL exists
AJAX Facet Refinements V1 (NO URL CHANGE)
• PROS:
– Theoretically no parameters exposed to bloat the index
• CONS:
– Users can’t share refined / filtered content with friends, and there is no accurate bookmarking. (Terrible UX)
– Googlebot will still crawl hidden href= URLs or other JS framework links such as Angular’s ng-href= (check canonical logic!!)
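One way to find the hidden links mentioned above is to scan the rendered HTML for anchor attributes that carry parameter URLs. This is a rough sketch using only the standard library; the attribute list and the "contains a ?" test are assumptions, and a real audit would also need to look at other templates and frameworks.

```python
from html.parser import HTMLParser


class LinkAttributeCollector(HTMLParser):
    """Collect href-like attributes from <a> tags, including Angular's ng-href."""

    LINK_ATTRS = ("href", "ng-href", "data-ng-href")

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name in self.LINK_ATTRS and value:
                self.links.append((name, value))


def find_parameter_links(html_text):
    """Return (attribute, url) pairs whose URL carries a query string."""
    collector = LinkAttributeCollector()
    collector.feed(html_text)
    return [(name, url) for name, url in collector.links if "?" in url]
```

Every URL this returns is a candidate for crawling even though clicking the facet never changes the address bar, so each one should resolve to a sensible canonical.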
Index Bloat Prevention: JS + AJAX
AJAX Refinement V2 = HTML5 history.pushState()
Index Bloat Prevention: JS + AJAX
HTML5 history.pushState()
http://www.site.com/guys/tees/?color=green&size=large
Consistent URL Signals - Navigation
Ideal consistency:
Navigation URLs =
Pushstate() URLs =
Canonical URLs =
XML Sitemap URLs =
Consistent URL Signals - Navigation
Ideal consistency:
Navigation URLs =
Pushstate() URLs ≠
Canonical URLs =
XML Sitemap URLs =
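A consistency check like this is easy to automate per page. The sketch below assumes you have already collected, for one page state, the URL each signal exposes; the dictionary keys and the example URLs in the usage note are hypothetical.

```python
def inconsistent_signals(signals):
    """Return the signal sources whose URL differs from the canonical URL.

    `signals` maps a source name ("navigation", "pushstate", "canonical",
    "sitemap") to the URL that source exposes for the same page state.
    """
    canonical = signals["canonical"]
    return {source: url for source, url in signals.items() if url != canonical}
```

If navigation, canonical and sitemap all say `?color=green&size=large` but pushState() writes `?size=large&color=green`, the function returns only the `pushstate` entry. An empty result means all four signals agree.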
Index Bloat Prevention: JS + AJAX
Google preferred the pushState URL version, so we had to reinforce URL signals (via normal inline href=, canonical, XML sitemap)
AJAX Facet Refinements V2 (PushState URL Change)
• PROS:
– Users can now share / bookmark the correct content
– Added to browser history
• CONS:
– Still need a consistent canonical structure, because Googlebot crawls pushState() URLs
– A different hidden URL structure via AJAX facets may require further, unpredictable canonicalization logic / further dev work
Indexing AJAX & JS Frameworks
Indexing AJAX & JS Frameworks
What method exists that we know still works?
Indexing AJAX & JS Frameworks
HTML SNAPSHOT
<head>
<meta name="fragment" content="!">
Google / Bing crawls with:
_escaped_fragment_=
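To see which URL a crawler following this scheme would request, the mapping can be sketched as below. This is my reading of the AJAX crawling scheme rather than reference code: a `#!` fragment moves into the `_escaped_fragment_` parameter, and a page that only opts in through the meta tag gets an empty value. `quote()` percent-encodes more characters than the scheme strictly requires, which is an intentional simplification.

```python
from urllib.parse import quote, urlsplit, urlunsplit


def escaped_fragment_url(url):
    """Map a page URL to the _escaped_fragment_ URL a crawler would fetch."""
    parts = urlsplit(url)
    if parts.fragment.startswith("!"):
        # Hash-bang state: everything after "#!" becomes the parameter value.
        state = quote(parts.fragment[1:], safe="=")
    else:
        # Meta-tag opt-in (or a plain #anchor, which is ignored): empty value.
        state = ""
    param = "_escaped_fragment_=" + state
    query = parts.query + "&" + param if parts.query else param
    return urlunsplit((parts.scheme, parts.netloc, parts.path, query, ""))
```

For `http://www.site.com/guys/tees/` with the meta tag in place, this yields `http://www.site.com/guys/tees/?_escaped_fragment_=`, which is the URL to request when testing what bots receive.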
Indexing AJAX & JS: HTML Snapshot
Indexing AJAX & JS: How To Decide?
• HTML snapshot (_escaped_fragment_=)
• ‘Dumbed down’ HTML template
• Pre-rendered (to bots)
• 3rd-party service (prerender.io)
• Server side (PhantomJS / headless browser)
• Pre- or real-time rendered (to users & bots)
• Progressive enhancement
• Trust Googlebot (VALIDATE!)
Indexing AJAX & JS: HTML Snapshot
• Upon crawl of a URL with _escaped_fragment_=, serve a ‘dumbed down’ HTML version of the page.
• Not pre-rendered, rather simplified.
• For example, on ecommerce → a view-all category listing with no dynamic facets.
Amazing results from our clients.
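The routing decision behind the simplified snapshot is small. A minimal sketch, assuming the server picks a template by name; both template names are invented for the example and the real rendering layer is left out.

```python
from urllib.parse import parse_qs, urlsplit


def wants_snapshot(url):
    """True when the request carries the _escaped_fragment_ parameter."""
    query = parse_qs(urlsplit(url).query, keep_blank_values=True)
    return "_escaped_fragment_" in query


def choose_template(url):
    """Pick the simplified view-all template for bots, the JS app otherwise."""
    if wants_snapshot(url):
        return "category_view_all_static.html"  # hypothetical template name
    return "category_app.html"  # hypothetical template name
```

`keep_blank_values=True` matters here: the parameter usually arrives with an empty value, which `parse_qs` would otherwise discard.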
Indexing AJAX & JS: Pre-rendering
Upon crawl of a URL with _escaped_fragment_=
1. prerender.io – middleware via reverse proxy that serves a pre-rendered, cached HTML page to bots
OR
2. Server side – the server pre-renders the JS into cached HTML pages to serve to bots, or renders it in real time (headless browser).
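Both options share the same shape: detect the bot request, serve cached HTML if it exists, render on a miss. The sketch below shows only that shape. It is not how prerender.io or any headless browser is actually wired up; the `render` callable stands in for whichever renderer is used, and the sketch assumes pushState-style URLs where `_escaped_fragment_` arrives with an empty value.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def split_bot_request(url):
    """Return (is_bot_request, url_without_escaped_fragment)."""
    parts = urlsplit(url)
    pairs = parse_qsl(parts.query, keep_blank_values=True)
    is_bot = any(key == "_escaped_fragment_" for key, _ in pairs)
    kept = [(key, value) for key, value in pairs if key != "_escaped_fragment_"]
    clean = urlunsplit((parts.scheme, parts.netloc, parts.path, urlencode(kept), ""))
    return is_bot, clean


class PrerenderCache:
    """Serve cached pre-rendered HTML to bots, rendering on a cache miss."""

    def __init__(self, render):
        self._render = render  # hypothetical callable: clean URL -> HTML string
        self._pages = {}

    def respond(self, url):
        is_bot, clean = split_bot_request(url)
        if not is_bot:
            return None  # regular visitors get the normal JS application
        if clean not in self._pages:
            self._pages[clean] = self._render(clean)  # real-time render on miss
        return self._pages[clean]
```

Filling `_pages` ahead of time corresponds to the pre-rendered variant; leaving it empty and letting `respond()` call `render` corresponds to the real-time one.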
Indexing AJAX & JS: Prerender.io
Indexing AJAX & JS: BromBone
Indexing AJAX & JS: Server Prerender
Indexing AJAX & JS: Server Side
bit.ly/javascriptseo
Indexing AJAX & JS: Trust Googlebot
read these first…
Testing JS Indexation: Jscrawlability.com
Validation & Testing:
Discovery vs Bloat
Testing: Fetch & Render JS / AJAX
Testing: Slice and Dice the Index
Advanced Site Operators
site:yoursite.com -inurl:cat.jsp -inurl:prod.jsp -inurl:store.jsp
site:yoursite.com inurl:size inurl:cat.jsp -inurl:cid
site:yoursite.com inurl:pdp intext:"write a review"
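Queries like these are easier to keep consistent when generated. A small sketch that only builds the query strings; the function and its parameters are made up for illustration, and it does not query any search engine.

```python
def site_query(domain, include=(), exclude=(), intext=None):
    """Build a site: query with inurl: / -inurl: / intext: operators."""
    terms = [f"site:{domain}"]
    terms += [f"inurl:{token}" for token in include]
    terms += [f"-inurl:{token}" for token in exclude]
    if intext:
        terms.append(f'intext:"{intext}"')
    return " ".join(terms)
```

For example, `site_query("yoursite.com", exclude=["cat.jsp", "prod.jsp", "store.jsp"])` reproduces the first query above: everything indexed that is not a category, product or store template. Note that exclusions need a plain hyphen, not a typographic dash.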
Testing: Automate Bloat + Discovery Check
Testing: Search Analytics for Bloat / Discovery
Testing: Go To The Source: Server Logs!
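A first pass over the logs can simply count how much Googlebot activity lands on parameter URLs versus clean ones. This sketch assumes access logs in the common "combined" format and identifies Googlebot by user-agent string only; user agents can be spoofed, so treat the numbers as a rough indication.

```python
import re
from collections import Counter

# Request path and user agent from a combined-format log line (assumed format).
LOG_LINE = re.compile(
    r'"(?:GET|HEAD) (?P<path>\S+) [^"]*" \d{3} \S+ "[^"]*" "(?P<agent>[^"]*)"'
)


def googlebot_crawl_split(lines):
    """Count Googlebot requests to parameter URLs vs. clean URLs."""
    counts = Counter()
    for line in lines:
        match = LOG_LINE.search(line)
        if not match or "Googlebot" not in match.group("agent"):
            continue
        kind = "parameter" if "?" in match.group("path") else "clean"
        counts[kind] += 1
    return counts
```

A high share of `parameter` hits suggests crawl budget is going to facet and sort URLs rather than to category and product pages.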
Summing It Up
• Index Bloat, Crawl Budget, & Testing: Large sites are prone to serious index bloat and wasted crawl budget. They need diligent testing and meticulous attention to detail with the basics. Test often & automate!
• JS/AJAX: pushState(), JS frameworks and AJAX present both discovery and bloat challenges. Know the options: short-term fixes like the HTML snapshot (Google + Bing), and long-term redesigns using modern frameworks with built-in server-side rendering.
Dankeschön!
Questions?
Ari Nahmani
CEO / Founder
Kahena Digital Marketing
ari@kahenadigital.com
@AriNahmani
References:
• Can You Now Trust Google To Crawl Ajax Sites?
• Search Engine Optimization Best Practices for AJAX URLs | Webmaster Blog
• We Tested How Googlebot Crawls Javascript And Here's What We Learned
• Prerender - AngularJS SEO, BackboneJS SEO, or EmberJS SEO
• SMX Munich Advanced Technical SEO Brainstorm - Google Docs
• www.simoahava.com/seo/dynamically-added-meta-data-indexed-google-crawlers/
• Speakers | Search Marketing Expo – SMX Munich
• JavaScript + SEO: Better Together (Medium)
• SEO AJAX Crawlability in a Responsive Publisher World
• SEO Strategies for JavaScript-Heavy Single Page Applications or AJAX Sites | Search Engine Watch
• The Basics of JavaScript Framework SEO in AngularJS - Builtvisible
• Can Search Engines Crawl Javascript?
• https://www.w3.org/wiki/Graceful_degradation_versus_progressive_enhancement#Graceful_degradation_and_progressive_enhancement_in_a_nutshell
• SEO and JS: New Challenges
• BromBone | SEO for your AngularJS, EmberJS, or BackboneJS website.
• DIY AngularJS SEO with PhantomJS (the easy way!) | Lawsonry
• https://scotch.io/tutorials/angularjs-seo-with-prerender-io
Image Credits:
fat-american-1.jpg (1280×955)
bigbrands1.jpg (570×383)
consistencydemotivator_large.jpeg (480×338)
04-godfather-keep-friend.jpg (518×300)
4da1a1a23dba011a7ba6918986a6b818302b949ae694b27d559cf8e73308bf7b.jpg (604×392)
the-17-craziest-cannibal-attacks-in-history-u2.jpg (520×272)
taxonomy-types-800x450.png (800×450)
wireframes-homecat.png (1000×460)
Check-yoself.jpg (800×1025)
Dangerous-Curve-Ahead-Sign-K-6513.gif (400×400)
crawlerserver2.png (884×445)
beach.png (1196×838)
