Adam Rogers, Data Scientist at Jane.com
Jane Recommendation
Engine
Jane Recommendation Overview
Amazon’s percent of sales from recommendation 35% (2006)
Netflix estimates that 75 percent of viewer activity is driven by recommendation.
(2013 - Wired)
Why Recommendations?
How does it work?
Application
User
Events
Kinesis
Lambda
Lambda
Lambda
DB
Collaborative Filtering
Amazon’s “Users also Purchased”
Recommend products based on shared activity with other users
Predicts what other product-user mappings are likely based on current ones
www.amazon.com
Jane Recommendation Engines
The Tools: Spark, Mahout, Cloudsearch
Spark:
Fast Parallel Data Processing and Machine Learning
Scales to massive amounts of data
Mahout:
Parallel Linear Algebra (Matrix Operations) and Machine Learning
Spark and Mahout together enable fast collaborative filtering on massive
datasets
Cloudsearch:
http://s6.postimg.org/r0m8bpjw1/recommender_architecture.png
Jane’s Recommendation Challenges
“Cold Start Problem” To the Max
No long-lived products to use as baseline for new ones
Every day ⅓ of products are brand new
Means we need to use events as far back as we
reasonably can in our calculation
http://www.beautifulonraw.com/raw-food-blog/wp-
content/uploads/2010/06/Shivering.jpg
Other Types of Recommenders
Content
Popular
User Similarity
"Collaborative Filtering in Recommender Systems" by Moshanin - Own work. Licensed under CC BY-SA 3.0 via Commons -
https://commons.wikimedia.org/wiki/File:Collaborative_Filtering_in_Recommender_Systems.jpg#/media/File:Collaborative_Filtering_in_Recommender_Systems.jpg
Content Recommendations
Recommend items that are similar to the given item
Based on information contained in the item - title, description, images, etc.
Avoids the “Cold Start” problem
User may not want to buy 2 very similar things though
Word Embeddings
Word Embeddings
http://spark-public.s3.amazonaws.com/neuralnets/images/Lecture4/turian.png
Content Recommendations with Word Embeddings
Calculate word embeddings on text within product (description, title, tags, etc.)
Compute distances between “embedded” product information
Euclidean distance is poor in such high dimensions - try cosine, mahalanobis, others
N nearest neighbors to the product in question are your recommendation
Improving Content Recommendations
Remove meaningless, common stopwords
Weight your embedded vectors on given criteria
Use category information
Get creative with your data - different patterns in each dataset
Improving, cont.
Can “embed” images in a similar fashion using deep networks
Compute distance between embedded images
Combine image distances and text distances to give combined distance metric
Determine nearest neighbors from new distance metric
Summary
Recommendations are a powerful (and these days, standard and necessary) tool
for improving customer interaction, conversion, etc.
Collaborative filtering is a proven algorithm for relevant recommendations (given
lots of user data and products)
Great tools for building collaborative filtering recommendation systems exist
(AWS, Spark, etc.) but you need to adapt to your specific needs
Content recommendations can supplement the weaknesses of collaborative
filtering
Get creative to improve the quality of your recommendations
Sources
http://www.cs.umd.edu/~samir/498/Amazon-Recommendations.pdf
"Collaborative Filtering in Recommender Systems" by Moshanin - Own work.
Licensed under CC BY-SA 3.0 via Commons -
https://commons.wikimedia.org/wiki/File:Collaborative_Filtering_in_Recommender_
Systems.jpg#/media/File:Collaborative_Filtering_in_Recommender_Systems.jpg
https://aws.amazon.com/kinesis/streams/

More Related Content

PPTX
JoTechies -Azure Machine Learning
PPTX
Collab365 Empower-Your-Applications-With-Azure-Machine-Learning
PDF
Architecting for Data Science
PPTX
Data science 101 Masterclass
PPTX
Introduction to Azure Machine Learning
PPTX
Building Real Time Targeting Capabilities - Ryan Zotti, Subbu Thiruppathy - C...
PPTX
Azure Machine Learning 101
PDF
Insider's introduction to microsoft azure machine learning: 201411 Seattle Bu...
JoTechies -Azure Machine Learning
Collab365 Empower-Your-Applications-With-Azure-Machine-Learning
Architecting for Data Science
Data science 101 Masterclass
Introduction to Azure Machine Learning
Building Real Time Targeting Capabilities - Ryan Zotti, Subbu Thiruppathy - C...
Azure Machine Learning 101
Insider's introduction to microsoft azure machine learning: 201411 Seattle Bu...

What's hot (20)

PDF
Webinar - Comparative Analysis of Cloud based Machine Learning Platforms
PPTX
Introduction to Azure machine learning
PDF
Case Study to build a tablet based app that is a shopping assistant.
PDF
Modern Machine Learning Infrastructure and Practices
PDF
Azure Machine Learning 101
PPTX
Production machine learning_infrastructure
PPT
Building Personalized Data Products with Dato
PPTX
Predicting the Future with Azure Machine Learning
PPTX
Webinar - Product Matching - Palombo (20160428)
PDF
H2O for Medicine and Intro to H2O in Python
PDF
Industrial Data Science
PDF
Welcome to a new state of find
PPTX
DevOps: Monitorando aplicação com App Insights
PDF
Introduction to Recommender Systems
PPTX
Introduction to Machine learning and Deep Learning
PDF
Neo4j Theory and Practice - Tareq Abedrabbo @ GraphConnect London 2013
PDF
A business level introduction to Artificial Intelligence - Louis Dorard @ PAP...
PDF
Azure Machine Learning and Data Journeys
PDF
Leveraging NLP and Deep Learning for Document Recommendations in the Cloud
PDF
Dynamic generation of personalized hybrid recommender systems
Webinar - Comparative Analysis of Cloud based Machine Learning Platforms
Introduction to Azure machine learning
Case Study to build a tablet based app that is a shopping assistant.
Modern Machine Learning Infrastructure and Practices
Azure Machine Learning 101
Production machine learning_infrastructure
Building Personalized Data Products with Dato
Predicting the Future with Azure Machine Learning
Webinar - Product Matching - Palombo (20160428)
H2O for Medicine and Intro to H2O in Python
Industrial Data Science
Welcome to a new state of find
DevOps: Monitorando aplicação com App Insights
Introduction to Recommender Systems
Introduction to Machine learning and Deep Learning
Neo4j Theory and Practice - Tareq Abedrabbo @ GraphConnect London 2013
A business level introduction to Artificial Intelligence - Louis Dorard @ PAP...
Azure Machine Learning and Data Journeys
Leveraging NLP and Deep Learning for Document Recommendations in the Cloud
Dynamic generation of personalized hybrid recommender systems
Ad

Similar to Jane Recommendation Engines (20)

PDF
Real-time personalized recommendations using product embeddings
PDF
Recommendation
PDF
Recommender systems
PDF
Building a Recommender systems by Vivek Murugesan - Technical Architect at Cr...
PPTX
Recommended System.pptx
PDF
Introduction to Recommendation Systems (Vietnam Web Submit)
PPTX
Lecture Notes on Recommender System Introduction
PDF
Introduction to Recommendation Systems
PDF
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...
PPTX
videorecommendationsystemfornewseducationandentertainment-170519183703.pptx
PDF
Embeddings! embeddings everywhere!
PPT
Cs548 s15 showcase_web_mining
PDF
A recommendation engine for your applications - M.Orselli - Codemotion Rome 17
PPTX
Recommender systems for E-commerce
PDF
Best data science courses in pune
PDF
best online data science courses
PDF
Big data certification training mumbai
PDF
Top data science institutes in hyderabad
PDF
AI in Entertainment – Movie Recommendation System
PPTX
Teacher training material
Real-time personalized recommendations using product embeddings
Recommendation
Recommender systems
Building a Recommender systems by Vivek Murugesan - Technical Architect at Cr...
Recommended System.pptx
Introduction to Recommendation Systems (Vietnam Web Submit)
Lecture Notes on Recommender System Introduction
Introduction to Recommendation Systems
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...
videorecommendationsystemfornewseducationandentertainment-170519183703.pptx
Embeddings! embeddings everywhere!
Cs548 s15 showcase_web_mining
A recommendation engine for your applications - M.Orselli - Codemotion Rome 17
Recommender systems for E-commerce
Best data science courses in pune
best online data science courses
Big data certification training mumbai
Top data science institutes in hyderabad
AI in Entertainment – Movie Recommendation System
Teacher training material
Ad

Recently uploaded (20)

PPTX
inbound2857676998455010149.pptxmmmmmmmmm
PDF
General category merit rank list for neet pg
PPT
What is life? We never know the answer exactly
PPTX
Sheep Seg. Marketing Plan_C2 2025 (1).pptx
PPTX
1 hour to get there before the game is done so you don’t need a car seat for ...
PPTX
9 Bioterrorism.pptxnsbhsjdgdhdvkdbebrkndbd
PDF
2025-08 San Francisco FinOps Meetup: Tiering, Intelligently.
PDF
Book Trusted Companions in Delhi – 24/7 Available Delhi Personal Meeting Ser...
PDF
The Role of Pathology AI in Translational Cancer Research and Education
PPTX
lung disease detection using transfer learning approach.pptx
PPTX
C programming msc chemistry pankaj pandey
PPTX
Statisticsccdxghbbnhhbvvvvvvvvvv. Dxcvvvhhbdzvbsdvvbbvv ccc
PPTX
machinelearningoverview-250809184828-927201d2.pptx
PPTX
Hushh.ai: Your Personal Data, Your Business
PPTX
865628565-Pertemuan-2-chapter-03-NUMERICAL-MEASURES.pptx
PPTX
Introduction to Fundamentals of Data Security
PPTX
GPS sensor used agriculture land for automation
PPTX
Machine Learning and working of machine Learning
PDF
Hikvision-IR-PPT---EN.pdfSADASDASSAAAAAAAAAAAAAAA
PPTX
transformers as a tool for understanding advance algorithms in deep learning
inbound2857676998455010149.pptxmmmmmmmmm
General category merit rank list for neet pg
What is life? We never know the answer exactly
Sheep Seg. Marketing Plan_C2 2025 (1).pptx
1 hour to get there before the game is done so you don’t need a car seat for ...
9 Bioterrorism.pptxnsbhsjdgdhdvkdbebrkndbd
2025-08 San Francisco FinOps Meetup: Tiering, Intelligently.
Book Trusted Companions in Delhi – 24/7 Available Delhi Personal Meeting Ser...
The Role of Pathology AI in Translational Cancer Research and Education
lung disease detection using transfer learning approach.pptx
C programming msc chemistry pankaj pandey
Statisticsccdxghbbnhhbvvvvvvvvvv. Dxcvvvhhbdzvbsdvvbbvv ccc
machinelearningoverview-250809184828-927201d2.pptx
Hushh.ai: Your Personal Data, Your Business
865628565-Pertemuan-2-chapter-03-NUMERICAL-MEASURES.pptx
Introduction to Fundamentals of Data Security
GPS sensor used agriculture land for automation
Machine Learning and working of machine Learning
Hikvision-IR-PPT---EN.pdfSADASDASSAAAAAAAAAAAAAAA
transformers as a tool for understanding advance algorithms in deep learning

Jane Recommendation Engines

  • 1. Adam Rogers, Data Scientist at Jane.com Jane Recommendation Engine
  • 3. Amazon’s percent of sales from recommendation 35% (2006) Netflix estimates that 75 percent of viewer activity is driven by recommendation. (2013 - Wired) Why Recommendations?
  • 4. How does it work? Application User Events Kinesis Lambda Lambda Lambda DB
  • 5. Collaborative Filtering Amazon’s “Users also Purchased” Recommend products based on shared activity with other users Predicts what other product-user mappings are likely based on current ones www.amazon.com
  • 7. The Tools: Spark, Mahout, Cloudsearch Spark: Fast Parallel Data Processing and Machine Learning Scales to massive amounts of data Mahout: Parallel Linear Algebra (Matrix Operations) and Machine Learning Spark and Mahout together enable fast collaborative filtering on massive datasets Cloudsearch:
  • 9. Jane’s Recommendation Challenges “Cold Start Problem” To the Max No long-lived products to use as baseline for new ones Every day ⅓ of products are brand new Means we need to use events as far back as we reasonably can in our calculation http://www.beautifulonraw.com/raw-food-blog/wp- content/uploads/2010/06/Shivering.jpg
  • 10. Other Types of Recommenders Content Popular User Similarity "Collaborative Filtering in Recommender Systems" by Moshanin - Own work. Licensed under CC BY-SA 3.0 via Commons - https://commons.wikimedia.org/wiki/File:Collaborative_Filtering_in_Recommender_Systems.jpg#/media/File:Collaborative_Filtering_in_Recommender_Systems.jpg
  • 11. Content Recommendations Recommend items that are similar to the given item Based on information contained in the item - title, description, images, etc. Avoids the “Cold Start” problem User may not want to buy 2 very similar things though
  • 14. Content Recommendations with Word Embeddings Calculate word embeddings on text within product (description, title, tags, etc.) Compute distances between “embedded” product information Euclidean distance is poor in such high dimensions - try cosine, mahalanobis, others N nearest neighbors to the product in question are your recommendation
  • 15. Improving Content Recommendations Remove meaningless, common stopwords Weight your embedded vectors on given criteria Use category information Get creative with your data - different patterns in each dataset
  • 16. Improving, cont. Can “embed” images in a similar fashion using deep networks Compute distance between embedded images Combine image distances and text distances to give combined distance metric Determine nearest neighbors from new distance metric
  • 17. Summary Recommendations are a powerful (and these days, standard and necessary) tool for improving customer interaction, conversion, etc. Collaborative filtering is a proven algorithm for relevant recommendations (given lots of user data and products) Great tools for building collaborative filtering recommendation systems exist (AWS, Spark, etc.) but you need to adapt to your specific needs Content recommendations can supplement the weaknesses of collaborative filtering Get creative to improve the quality of your recommendations
  • 18. Sources http://www.cs.umd.edu/~samir/498/Amazon-Recommendations.pdf "Collaborative Filtering in Recommender Systems" by Moshanin - Own work. Licensed under CC BY-SA 3.0 via Commons - https://commons.wikimedia.org/wiki/File:Collaborative_Filtering_in_Recommender_ Systems.jpg#/media/File:Collaborative_Filtering_in_Recommender_Systems.jpg https://aws.amazon.com/kinesis/streams/