SlideShare a Scribd company logo
Content and Popularity based
RecoSys
Learning Objectives
• Introduction to recommendation system and its applications.
• Why recommendation system is important.
• Key approaches of Recommendation system.
• Market basket analysis.
• Content Based Recommendation system and its applications.
• Case studies for Content based and Popularity based RecoSys.
2
Introduction to recommendation system
and its applications
• Recommender systems are one of the most successful and widespread application of machine
learning technologies in business.
• You can apply recommender systems in scenarios where many users interact with many items.
Intro to RecoSys Contd.
• You can find large scale recommender systems in retail, video on demand, or music streaming.
• Recommendation of news or videos for media, product recommendation or personalization in travel
and retail can be handled by similar machine learning algorithms.
• There are also recommender systems for experts, collaborators, jokes, restaurants, garments, financial
services, life insurance, romantic partners (online dating), and Twitter pages.
Why recommendation system is important
Recommendation systems can help match users to items
They help item provider deliver their items to right user.
• Identify products most relevant to the user.
• Personalized content
• Eg. Top and Offers
Help website improve user engagement and increase online sales
Key approaches of Recommendation system
The key approaches to RecSys
• Collaborative filtering
• Content based
• Popularity based
Popularity based Recommendation system
• Recommended items viewed/purchased by most people.
• Recommendations : ranked list of items by their purchase count / viewed count “Popular
news”.
• It uses:
– Text only
– Purchase history
– User and item features
– Scale
Note: The problem with popularity based recommendation system is that the personalization is not
available with this method . For eg: for a song recommendation if the model is popularity based
recommender, it is not personalised towards any user and will output the same list of
recommended songs
Content Based Recommendation system
and its applications
• A content based recommender works with data that the user provides, either explicitly (rating) or
implicitly (clicking on a link).
• Based on that data, a user profile is generated, which is then used to make suggestions to the user.
• As the user provides more inputs or takes actions on the recommendations, the engine becomes more
and more accurate.
• The information source that content-based filtering systems are mostly used with are text documents.
A standard approach for term parsing selects single words from documents.
Content based RecoSys Contd.
Advantages
• Content-based recommender systems do not use a lot of user data.
• Only requires item data and you can start giving recommendations to users.
• It is viable to give recommendations to even your first customer as long as you have sufficient data to
build his user profile
Challenges
• The item data needs to be well distributed
• Recommendations to the user will be mostly substitutes, and not complements.
• Less dynamic
Bag of Words (BOW) vectorization
• Text data cannot be directly used for modelling.
• One approach to extract features involves the BOW approach.
• The BOW approach generates a Document-Term Matrix (DTM) where columns represents all the
unique words in the corpus and the cell value indicates the frequency of that word in a given
document.
TF-IDF
TF-IDF is composed of 2 terms:
• Term Frequency (TF) measures how frequently a term appears in the document.
• Inverse Document Frequency (IDF) measures the relative importance of words in a corpus.
Word Embedding Models (Word2Vec)
• Word vectors are trained by setting up a fake training task.
• The weights of the hidden layer learnt during the training process provides the feature vector for each
word – words that appear in similar context ends up having similar feature vectors.
Case Study
Anonymous Ratings from the Jester Online Joke Recommender System
Abstract
Ratings are real values ranging from -10.00 to +10.00 (the value "99" corresponds to "null" = "not rated").
One row per user
The first column gives the number of jokes rated by that user. The next 100 columns give the ratings for jokes 01
- 100.
Dataset
http://eigentaste.berkeley.edu/dataset/
13
Steps to follow
1. Read the dataset
2. Take care about the header as there are no column names given in the dataset.
3. Consider ratings named dataframe with only first 200 rows and all columns from
1(first column is 0) of dataset.
4. Change the column indices from 0 to 99
5. In the dataset, the null ratings are given as 99.00, so replace all 99.00s with 0
6. Normalize the ratings using StandardScaler and save them in ratings_diff variable
7. Find the mean for each column in ratings_diff i.e, for each joke
8. Consider all the mean ratings and find the jokes with highest mean value and
display the top 10 joke IDs.
14
Case study on Content based RS
Context:
This dataset contains ratings for ten thousand popular books. As to the source, let's say
that these ratings were found on the internet. Generally, there are 100 reviews for each
book, although some have less - fewer - ratings. Ratings go from one to five
Both book IDs and user IDs are contiguous. For books, they are 1-10000, for users, 1-
53424. All users have made at least two ratings. Median number of ratings per user is 8.
There are also books marked to read by the users, book metadata (author, year, etc.) and
tags.
Here is to recommend good book
15
Attribute information
1. id 13. average_rating
2. book_id 14. ratings_count
3. best_book_id 15. work_ratings_count
4. work_id 16. work_text_reviews_count
5. books_count 17. ratings_1
6. isbn 18. ratings_2
7. isbn13 19. ratings_3
8. authors 20. ratings_4
9. original_publication_year 21. ratings_5
10.original_title 22. image_url
11.title 23. small_image_url
12.language_code
16
Steps to follow
1. Get the data.
2. Perform data pre-processing
3. Vectorize title
4. What happens when we change the value of min_df value? What is this argument
controlling?
5. Find Similarity Between Items
6. What are the most similiar books? Why does "First to Kill", "Kill me if you can" etc
all have same ratings?
7. Find out what features have been considered by the vectorizer for a given title?
17
Reasesrty djhjan S - explanation required.pptx
Questions
?

More Related Content

PDF
Recommender Systems, Matrices and Graphs
Roelof Pieters
 
PPT
Content based recommendation systems
Aravindharamanan S
 
PPTX
Immersive Recommendation Workshop, NYC Media Lab'17
Longqi Yang
 
PDF
[系列活動] 人工智慧與機器學習在推薦系統上的應用
台灣資料科學年會
 
PDF
Recommender Systems and Linked Open Data
Polytechnic University of Bari
 
PDF
Demystifying Recommendation Systems
Rumman Chowdhury
 
PDF
Recommender Systems
Chu-Yu Hsu
 
PPT
CS8091_BDA_Unit_III_Content_Based_Recommendation
Palani Kumar
 
Recommender Systems, Matrices and Graphs
Roelof Pieters
 
Content based recommendation systems
Aravindharamanan S
 
Immersive Recommendation Workshop, NYC Media Lab'17
Longqi Yang
 
[系列活動] 人工智慧與機器學習在推薦系統上的應用
台灣資料科學年會
 
Recommender Systems and Linked Open Data
Polytechnic University of Bari
 
Demystifying Recommendation Systems
Rumman Chowdhury
 
Recommender Systems
Chu-Yu Hsu
 
CS8091_BDA_Unit_III_Content_Based_Recommendation
Palani Kumar
 

Similar to Reasesrty djhjan S - explanation required.pptx (20)

PDF
Real-time personalized recommendations using product embeddings
Jakub Macina
 
PPTX
Design Recommender systems from scratch
Dr. Amit Sachan
 
PPT
Social Recommender Systems Tutorial - WWW 2011
idoguy
 
PDF
Research on Recommender Systems: Beyond Ratings and Lists
Denis Parra Santander
 
PDF
Highlights from the 8th ACM Conference on Recommender Systems (RecSys 2014)
David Zibriczky
 
PDF
Recommandation systems -
Yousef Fadila
 
PPTX
Recommenders Systems
Tariq Hassan
 
PPT
Recommender systems
Tamer Rezk
 
PPTX
Recommender systems: Content-based and collaborative filtering
Viet-Trung TRAN
 
PDF
A recommendation engine for your applications - M.Orselli - Codemotion Rome 17
Codemotion
 
PDF
IntroductionRecommenderSystems_Petroni.pdf
AlphaIssaghaDiallo
 
PDF
Introduction to Recommender Systems
Turi, Inc.
 
PDF
PyCon Balkans 2018 // Recommender systems - collaborative filtering and dimen...
Mladen Jovanovic
 
PDF
Netflix Recommendations - Beyond the 5 Stars
Xavier Amatriain
 
PDF
A recommendation engine for your php application
Michele Orselli
 
PPT
Recommender lecture
Aravindharamanan S
 
PDF
ESSIR 2013 Recommender Systems tutorial
Alexandros Karatzoglou
 
PDF
Collaborative Filtering and Recommender Systems By Navisro Analytics
Navisro Analytics
 
PDF
Aaa ped-21-Recommender Systems: Content-based Filtering
AminaRepo
 
PPTX
Movie Recommendation System.pptx
randominfo
 
Real-time personalized recommendations using product embeddings
Jakub Macina
 
Design Recommender systems from scratch
Dr. Amit Sachan
 
Social Recommender Systems Tutorial - WWW 2011
idoguy
 
Research on Recommender Systems: Beyond Ratings and Lists
Denis Parra Santander
 
Highlights from the 8th ACM Conference on Recommender Systems (RecSys 2014)
David Zibriczky
 
Recommandation systems -
Yousef Fadila
 
Recommenders Systems
Tariq Hassan
 
Recommender systems
Tamer Rezk
 
Recommender systems: Content-based and collaborative filtering
Viet-Trung TRAN
 
A recommendation engine for your applications - M.Orselli - Codemotion Rome 17
Codemotion
 
IntroductionRecommenderSystems_Petroni.pdf
AlphaIssaghaDiallo
 
Introduction to Recommender Systems
Turi, Inc.
 
PyCon Balkans 2018 // Recommender systems - collaborative filtering and dimen...
Mladen Jovanovic
 
Netflix Recommendations - Beyond the 5 Stars
Xavier Amatriain
 
A recommendation engine for your php application
Michele Orselli
 
Recommender lecture
Aravindharamanan S
 
ESSIR 2013 Recommender Systems tutorial
Alexandros Karatzoglou
 
Collaborative Filtering and Recommender Systems By Navisro Analytics
Navisro Analytics
 
Aaa ped-21-Recommender Systems: Content-based Filtering
AminaRepo
 
Movie Recommendation System.pptx
randominfo
 
Ad

More from AnkitaVerma776806 (20)

PPTX
fundamental of full stack development.pptx
AnkitaVerma776806
 
PPTX
yuuuuuuuuuuuCloud-Storage-PPT[1][1].pptx
AnkitaVerma776806
 
PPT
Technical Seminar of Mca computer vision .ppt
AnkitaVerma776806
 
PPTX
introduction the vision of the computer.pptx
AnkitaVerma776806
 
PPTX
engineering topic SEMINAR BIOMETRIC2.pptx
AnkitaVerma776806
 
PPTX
1000000000606084982-mini-project-ppt.pptx
AnkitaVerma776806
 
PPTX
Internship full stack developer ppt report .pptx
AnkitaVerma776806
 
PPT
ppt presentation 555_Spring12_topic01.ppt
AnkitaVerma776806
 
PPTX
the reference of book detail Hospital Booking.pptx
AnkitaVerma776806
 
PPTX
ARTIFICIAL INTELLLLIGENCEE modul11_AI.pptx
AnkitaVerma776806
 
PPT
voicecontrolhomeautomation-130627021245-phpapp01 (1).ppt
AnkitaVerma776806
 
PPTX
the refernce of programming C notes ppt.pptx
AnkitaVerma776806
 
PPTX
detail the reference of Hospital Booking.pptx
AnkitaVerma776806
 
PPTX
Advances in ML learning process require. ppt.pptx
AnkitaVerma776806
 
PPTX
SEMINAR_BIOMETRIC of hand fingerprint,voice bsed biometric ,eye based biometric
AnkitaVerma776806
 
PPTX
ch8.pptx
AnkitaVerma776806
 
PPTX
AI(Module1).pptx
AnkitaVerma776806
 
PPTX
continuity of module 2.pptx
AnkitaVerma776806
 
PPTX
Advances in ML. ppt.pptx
AnkitaVerma776806
 
PPTX
Module 3_DAA (2).pptx
AnkitaVerma776806
 
fundamental of full stack development.pptx
AnkitaVerma776806
 
yuuuuuuuuuuuCloud-Storage-PPT[1][1].pptx
AnkitaVerma776806
 
Technical Seminar of Mca computer vision .ppt
AnkitaVerma776806
 
introduction the vision of the computer.pptx
AnkitaVerma776806
 
engineering topic SEMINAR BIOMETRIC2.pptx
AnkitaVerma776806
 
1000000000606084982-mini-project-ppt.pptx
AnkitaVerma776806
 
Internship full stack developer ppt report .pptx
AnkitaVerma776806
 
ppt presentation 555_Spring12_topic01.ppt
AnkitaVerma776806
 
the reference of book detail Hospital Booking.pptx
AnkitaVerma776806
 
ARTIFICIAL INTELLLLIGENCEE modul11_AI.pptx
AnkitaVerma776806
 
voicecontrolhomeautomation-130627021245-phpapp01 (1).ppt
AnkitaVerma776806
 
the refernce of programming C notes ppt.pptx
AnkitaVerma776806
 
detail the reference of Hospital Booking.pptx
AnkitaVerma776806
 
Advances in ML learning process require. ppt.pptx
AnkitaVerma776806
 
SEMINAR_BIOMETRIC of hand fingerprint,voice bsed biometric ,eye based biometric
AnkitaVerma776806
 
AI(Module1).pptx
AnkitaVerma776806
 
continuity of module 2.pptx
AnkitaVerma776806
 
Advances in ML. ppt.pptx
AnkitaVerma776806
 
Module 3_DAA (2).pptx
AnkitaVerma776806
 
Ad

Recently uploaded (20)

PDF
PC228USLC-3E0 Komatsu Hydraulic Excavator Parts Manual SN 40001-UP
Heavy Equipment Manual
 
PDF
PC450LC-8R KOMATSU HYDRAULIC EXCAVATOR PARTS MANUAL SN 75001-UP
Heavy Equipment Manual
 
PPTX
July 2025 - Automobile_Industry_Trends_Presentation.pptx
savithrir7
 
PPTX
MEASURING-TOOLS FOR GRADE 8 AUTOMOTIVE SERVICING
RosalieUsmanLegaspi
 
PDF
Hitachi 130 EXCAVATOR Repair Manual Download
Service Repair Manual
 
PDF
Transform Your Lexus for the Trails with Expert Off-Road Customization Services
MW4 Outfitters
 
PPTX
Yogi_Allcase_February 3rd aaaaaa 2025.pptx
DorothyKaryaYogiLubi
 
PPTX
Presentation Homologation Kendaraan Roda 3
delapanpaduprima
 
PDF
INGLES AERONAUTICO PARA TCP - VERSION LARGA PARA ESTUDIO (CENEAS).pdf
MariaDelValleUrisa
 
PPTX
托莱多大学文凭办理|办理UT毕业证i20购买学位证书电子版
xxxihn4u
 
PPTX
My analysis on GM_History_Presentation.pptx
savithrir7
 
PDF
PC1400-1 KOMATSU Hydraulic Mining Shovels Parts Manual
Heavy Equipment Manual
 
PDF
Instant Access Hitachi 125US 135US Excavator Service Manual.pdf
Service Repair Manual
 
PDF
deloitte-nl-integrated-annual-report-2018-2019.pdf
dsoham206
 
PPTX
power of social media hackertghfdgfgdfgdfggh
xoviva2925
 
PDF
PW60-3 KOMATSU WHEELED EXCAVATOR PARTS MANUAL 2001-UP
Heavy Equipment Manual
 
PPTX
Detroit Business Travel Made Easy with Detroit DTW Cars
Detroit DTW Car
 
PDF
D6E Volvo EW160C EW180C Engine SM Download
Service Repair Manual
 
PDF
Coco Robotics: From Dorm Rooms to Sidewalks
ricky228571
 
PDF
Instant Download ec300dnl manual Volvo Excavator Manual.pdf
Service Repair Manual
 
PC228USLC-3E0 Komatsu Hydraulic Excavator Parts Manual SN 40001-UP
Heavy Equipment Manual
 
PC450LC-8R KOMATSU HYDRAULIC EXCAVATOR PARTS MANUAL SN 75001-UP
Heavy Equipment Manual
 
July 2025 - Automobile_Industry_Trends_Presentation.pptx
savithrir7
 
MEASURING-TOOLS FOR GRADE 8 AUTOMOTIVE SERVICING
RosalieUsmanLegaspi
 
Hitachi 130 EXCAVATOR Repair Manual Download
Service Repair Manual
 
Transform Your Lexus for the Trails with Expert Off-Road Customization Services
MW4 Outfitters
 
Yogi_Allcase_February 3rd aaaaaa 2025.pptx
DorothyKaryaYogiLubi
 
Presentation Homologation Kendaraan Roda 3
delapanpaduprima
 
INGLES AERONAUTICO PARA TCP - VERSION LARGA PARA ESTUDIO (CENEAS).pdf
MariaDelValleUrisa
 
托莱多大学文凭办理|办理UT毕业证i20购买学位证书电子版
xxxihn4u
 
My analysis on GM_History_Presentation.pptx
savithrir7
 
PC1400-1 KOMATSU Hydraulic Mining Shovels Parts Manual
Heavy Equipment Manual
 
Instant Access Hitachi 125US 135US Excavator Service Manual.pdf
Service Repair Manual
 
deloitte-nl-integrated-annual-report-2018-2019.pdf
dsoham206
 
power of social media hackertghfdgfgdfgdfggh
xoviva2925
 
PW60-3 KOMATSU WHEELED EXCAVATOR PARTS MANUAL 2001-UP
Heavy Equipment Manual
 
Detroit Business Travel Made Easy with Detroit DTW Cars
Detroit DTW Car
 
D6E Volvo EW160C EW180C Engine SM Download
Service Repair Manual
 
Coco Robotics: From Dorm Rooms to Sidewalks
ricky228571
 
Instant Download ec300dnl manual Volvo Excavator Manual.pdf
Service Repair Manual
 

Reasesrty djhjan S - explanation required.pptx

  • 1. Content and Popularity based RecoSys
  • 2. Learning Objectives • Introduction to recommendation system and its applications. • Why recommendation system is important. • Key approaches of Recommendation system. • Market basket analysis. • Content Based Recommendation system and its applications. • Case studies for Content based and Popularity based RecoSys. 2
  • 3. Introduction to recommendation system and its applications • Recommender systems are one of the most successful and widespread application of machine learning technologies in business. • You can apply recommender systems in scenarios where many users interact with many items.
  • 4. Intro to RecoSys Contd. • You can find large scale recommender systems in retail, video on demand, or music streaming. • Recommendation of news or videos for media, product recommendation or personalization in travel and retail can be handled by similar machine learning algorithms. • There are also recommender systems for experts, collaborators, jokes, restaurants, garments, financial services, life insurance, romantic partners (online dating), and Twitter pages.
  • 5. Why recommendation system is important Recommendation systems can help match users to items They help item provider deliver their items to right user. • Identify products most relevant to the user. • Personalized content • Eg. Top and Offers Help website improve user engagement and increase online sales
  • 6. Key approaches of Recommendation system The key approaches to RecSys • Collaborative filtering • Content based • Popularity based
  • 7. Popularity based Recommendation system • Recommended items viewed/purchased by most people. • Recommendations : ranked list of items by their purchase count / viewed count “Popular news”. • It uses: – Text only – Purchase history – User and item features – Scale Note: The problem with popularity based recommendation system is that the personalization is not available with this method . For eg: for a song recommendation if the model is popularity based recommender, it is not personalised towards any user and will output the same list of recommended songs
  • 8. Content Based Recommendation system and its applications • A content based recommender works with data that the user provides, either explicitly (rating) or implicitly (clicking on a link). • Based on that data, a user profile is generated, which is then used to make suggestions to the user. • As the user provides more inputs or takes actions on the recommendations, the engine becomes more and more accurate. • The information source that content-based filtering systems are mostly used with are text documents. A standard approach for term parsing selects single words from documents.
  • 9. Content based RecoSys Contd. Advantages • Content-based recommender systems do not use a lot of user data. • Only requires item data and you can start giving recommendations to users. • It is viable to give recommendations to even your first customer as long as you have sufficient data to build his user profile Challenges • The item data needs to be well distributed • Recommendations to the user will be mostly substitutes, and not complements. • Less dynamic
  • 10. Bag of Words (BOW) vectorization • Text data cannot be directly used for modelling. • One approach to extract features involves the BOW approach. • The BOW approach generates a Document-Term Matrix (DTM) where columns represents all the unique words in the corpus and the cell value indicates the frequency of that word in a given document.
  • 11. TF-IDF TF-IDF is composed of 2 terms: • Term Frequency (TF) measures how frequently a term appears in the document. • Inverse Document Frequency (IDF) measures the relative importance of words in a corpus.
  • 12. Word Embedding Models (Word2Vec) • Word vectors are trained by setting up a fake training task. • The weights of the hidden layer learnt during the training process provides the feature vector for each word – words that appear in similar context ends up having similar feature vectors.
  • 13. Case Study Anonymous Ratings from the Jester Online Joke Recommender System Abstract Ratings are real values ranging from -10.00 to +10.00 (the value "99" corresponds to "null" = "not rated"). One row per user The first column gives the number of jokes rated by that user. The next 100 columns give the ratings for jokes 01 - 100. Dataset http://eigentaste.berkeley.edu/dataset/ 13
  • 14. Steps to follow 1. Read the dataset 2. Take care about the header as there are no column names given in the dataset. 3. Consider ratings named dataframe with only first 200 rows and all columns from 1(first column is 0) of dataset. 4. Change the column indices from 0 to 99 5. In the dataset, the null ratings are given as 99.00, so replace all 99.00s with 0 6. Normalize the ratings using StandardScaler and save them in ratings_diff variable 7. Find the mean for each column in ratings_diff i.e, for each joke 8. Consider all the mean ratings and find the jokes with highest mean value and display the top 10 joke IDs. 14
  • 15. Case study on Content based RS Context: This dataset contains ratings for ten thousand popular books. As to the source, let's say that these ratings were found on the internet. Generally, there are 100 reviews for each book, although some have less - fewer - ratings. Ratings go from one to five Both book IDs and user IDs are contiguous. For books, they are 1-10000, for users, 1- 53424. All users have made at least two ratings. Median number of ratings per user is 8. There are also books marked to read by the users, book metadata (author, year, etc.) and tags. Here is to recommend good book 15
  • 16. Attribute information 1. id 13. average_rating 2. book_id 14. ratings_count 3. best_book_id 15. work_ratings_count 4. work_id 16. work_text_reviews_count 5. books_count 17. ratings_1 6. isbn 18. ratings_2 7. isbn13 19. ratings_3 8. authors 20. ratings_4 9. original_publication_year 21. ratings_5 10.original_title 22. image_url 11.title 23. small_image_url 12.language_code 16
  • 17. Steps to follow 1. Get the data. 2. Perform data pre-processing 3. Vectorize title 4. What happens when we change the value of min_df value? What is this argument controlling? 5. Find Similarity Between Items 6. What are the most similiar books? Why does "First to Kill", "Kill me if you can" etc all have same ratings? 7. Find out what features have been considered by the vectorizer for a given title? 17