UNIT-5
Recommendation Systems: Introduction, A Model for Recommendation
Systems, Collaborative Filtering System and Content Based
Recommendations.
Q) What is a Recommender System? How is it useful to its users?
Recommender systems are systems designed to recommend items to a user
based on various factors. They take as input a user model (ratings,
preferences, demographics, etc.) and items with their descriptions,
compute a relevance score that is used for ranking, and finally recommend
the items that are most relevant to the user.
Recommendations are commonly seen in e-commerce systems, job
recommendations on LinkedIn, friend recommendations on Facebook, song
recommendations on music apps, news recommendations on Forbes.com, etc.
Companies like Netflix and Amazon use recommender systems to help their
users identify the right products or movies for them.
The recommender system deals with the large volume of information
available by filtering out the most important information, based on the
data provided by a user and other factors that reflect the user’s
preferences and interests. It finds matches between users and items and
imputes the similarities between users and items for recommendation.
Q) Explain a model for Recommendation System.
A recommendation system is a facility that predicts user responses to
options in web applications.
Generally we see the following recommendations:
1. “You may also like these…”, “People who liked this also liked…”.
2. If you download presentations from SlideShare, it suggests “similar
content you can save and browse later”.
Use-Cases Of Recommendation System
There are many use-cases; some are:
A. Personalized content: helps improve the on-site experience by creating
dynamic recommendations for different kinds of audiences, as Netflix does.
B. Better product search experience: helps categorize products based on
their features, e.g. material, season, etc.
These suggestions are from the recommender system. The models used are
as follows:
1. Collaborative-filtering system: It uses community data from peer
groups for recommendations, surfacing the items that are popular among
the peers.
Collaborative filtering systems recommend items based on similarity
measures between users and/or items. The items recommended to a user
are those preferred by similar users (community data).
Eg.
When we shop on Amazon, it recommends new products saying “Customers
who bought this also bought…”.
1.1 User-Based Collaborative Filtering
It is based on the notion of users’ similarity.
Eg. Consider three children named A, B, and C, and four fruits: grapes,
strawberry, watermelon, and orange. Assume A purchased all four fruits,
B purchased only strawberry, and C purchased strawberry as well as
watermelon. Here A and C are similar kinds of users, so C will be
recommended grapes and orange (shown as dotted lines in the figure).
Fig. User Based Filtering
1.2 Item-Based Collaborative Filtering
It is based on the notion of item similarity.
Eg. Here the only difference is that we look at similar items, not
similar users. If you look at grapes and watermelon, watermelon is
purchased by all of the children, while grapes are purchased by children
A and B; since the two items are co-purchased and hence similar, child C
is recommended grapes.
Fig. Item Based Filtering
If the number of items is greater than the number of users, go with
user-based collaborative filtering, as it reduces the computation
required; if the number of users is greater than the number of items, go
with item-based collaborative filtering.
Advantages
It works well even if the data is small.
This model can help users discover new interests: even if a user has
shown no interest in a given item, the model might still recommend it
because similar users are interested in that item.
No need for domain knowledge.
Disadvantages
It cannot handle new items because the model doesn’t get trained on the
newly added items in the database. This problem is known as Cold Start
Problem.
The sparsity problem occurs when there is too little information to work
with, in order to provide the user base with decent approximations as to
which products they would likely prefer.
A gray sheep is a user who has no obvious closest neighbour and seems
to rate content in an unpredictable pattern unlike that of any other user.
2. Content-based systems: In a content-based recommendation system,
relevant items are shown using the content of the items previously
searched by the user. Here content refers to the attributes/tags of the
product that the user likes.
Eg. 1. If a user has watched many “science fiction” movies, then the
recommender system will recommend movies classified in the database as
having the “science fiction” genre.
2. If a user has read a book, then the recommender system will recommend
other similar books.
Fig. Content Based Filtering
Advantages
The model doesn’t need data about other users, since recommendations are
specific to a single user. This makes it easier to scale to a large
number of users. The model can capture the specific interests of a user
and can recommend items that very few other users are interested in.
Disadvantages
Feature representation of items is hand-engineered to some extent; this
technique requires a lot of domain knowledge.
The model can only make recommendations based on the user’s existing
interests.
Recommendation System can be implemented in 2 ways:
1. Memory based: The entire user-item data set is used to generate a
recommendation.
It uses statistical techniques to approximate similar users or items,
e.g. cosine similarity, Pearson coefficient, Euclidean distance, etc.
(see the sketch after this list).
2. Model based: A model of users is developed in an attempt to learn
their preferences. Models can be created using ML techniques like
regression, clustering, classification, etc.
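As a minimal illustration of the memory-based measures named above (a
sketch with hypothetical rating vectors, not tied to any dataset in these
notes):

import numpy as np

# Hypothetical rating vectors of two users over the same five items
u = np.array([4.0, 1.0, 5.0, 3.0, 2.0])
v = np.array([5.0, 2.0, 4.0, 3.0, 1.0])

# Cosine similarity: dot product divided by the product of the norms
cosine = u @ v / (np.linalg.norm(u) * np.linalg.norm(v))

# Pearson coefficient: cosine similarity of the mean-centred vectors
uc, vc = u - u.mean(), v - v.mean()
pearson = uc @ vc / (np.linalg.norm(uc) * np.linalg.norm(vc))

# Euclidean distance: straight-line distance between the rating vectors
euclidean = np.linalg.norm(u - v)

print(cosine, pearson, euclidean)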
Q) Explain the Nearest Neighbor technique for collaborative filtering
with a suitable example.
Nearest Neighbor Technique
The idea is to predict the rating an “Active User” would give for an unseen
item. The first step is to select a set of users (peers) who liked the same
items as the “Active User” in the past and rated them too.
In order to predict the Active User’s rating for the unseen item, we can use
the average rating given by the peers for that unseen item.
i. Steps for User-Based Collaborative Filtering:
1. Finding the similarity of users to the target user U
Using Pearson’s correlation in peer-based collaborative filtering, the
formula turns out to be:

sim(a, b) = Σ_{p ∈ P} (r_{a,p} − r̄_a)(r_{b,p} − r̄_b) / [ √( Σ_{p ∈ P} (r_{a,p} − r̄_a)² ) × √( Σ_{p ∈ P} (r_{b,p} − r̄_b)² ) ]

where a and b are users; r_{a,p} is the rating of user a for item p;
r̄_a and r̄_b are the average ratings of users a and b; and P is the set
of items that are rated by both a and b.
2. Prediction of missing rating of an item using Nearest
Neighbor technique:
The similarity measure lies in the range of −1 to 1, where −1 indicates
complete dissimilarity and 1 indicates perfect similarity. A common
prediction function using the above similarity function is

pred(a, p) = r̄_a + Σ_{b ∈ N} sim(a, b) × (r_{b,p} − r̄_b) / Σ_{b ∈ N} |sim(a, b)|

where N is the set of neighbours (peers) of user a who have rated item p.
Using this, we calculate whether the neighbours’ ratings for the unseen
item p are higher or lower than their own averages, then combine the
rating differences using the similarity as a weight.
Finally the weighted neighbour bias is added to (or subtracted from) the
Active User’s average and used as the prediction.
Eg.
Consider a matrix which shows four users Alice, U1, U2 and U3 rating on
different news apps. The rating range is from 1 to 5 on the basis of users
likability of the news app. The ‘?’ indicates that the app has not been rated
by the user.
Fig. Dataset
Calculating the similarity between Alice and all the other users
First, we calculate the average of the ratings of each user, excluding I5
as it is not rated by Alice.
Hence we get the following matrix:
Now, we calculate the similarity between Alice and all the other users:
Predicting the rating of I5 by Alice:
If we assume the threshold rating as 3.5, we recommend I5 to Alice as
the predicted rating is 3.83.
But a problem arises with a cold start: how do we recommend new items,
and what do we recommend to new users? In the initial phase, the user can
be asked to rate a set of items; otherwise, either demographic data or
non-personalized data can be used.
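The Pearson similarity and prediction formulas above translate into a
short sketch. The ratings matrix below is hypothetical (the actual values
from the dataset figure are not reproduced in these notes), so the printed
prediction is illustrative only:

import numpy as np

def pearson_sim(a, b, R):
    # Pearson similarity between users a and b over their co-rated items P
    co = [p for p in range(R.shape[1])
          if not np.isnan(R[a, p]) and not np.isnan(R[b, p])]
    ra, rb = np.nanmean(R[a]), np.nanmean(R[b])  # user means over rated items
    num = sum((R[a, p] - ra) * (R[b, p] - rb) for p in co)
    den = (np.sqrt(sum((R[a, p] - ra) ** 2 for p in co)) *
           np.sqrt(sum((R[b, p] - rb) ** 2 for p in co)))
    return num / den if den else 0.0

def predict(a, p, R):
    # pred(a, p) = mean(a) + sum_b sim(a,b)*(R[b,p] - mean(b)) / sum_b |sim(a,b)|
    peers = [b for b in range(R.shape[0]) if b != a and not np.isnan(R[b, p])]
    sims = {b: pearson_sim(a, b, R) for b in peers}
    num = sum(sims[b] * (R[b, p] - np.nanmean(R[b])) for b in peers)
    den = sum(abs(s) for s in sims.values())
    base = np.nanmean(R[a])
    return base + num / den if den else base

# Hypothetical 4-user x 5-item ratings matrix; np.nan marks unrated items
R = np.array([[5, 3, 4, 4, np.nan],   # Alice
              [3, 1, 2, 3, 3],        # U1
              [4, 3, 4, 3, 5],        # U2
              [3, 3, 1, 5, 4]])       # U3
print(predict(0, 4, R))               # Alice's predicted rating for item I5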
ii. Steps for Item-Based Collaborative Filtering:
1. Build the model by finding the similarity between all item pairs. The
similarity between item pairs can be found in different ways; one of the
most common methods is cosine similarity.
Formula for cosine similarity between two item vectors A and B:

sim(A, B) = cos θ = (A · B) / (‖A‖ × ‖B‖) = Σ_i A_i B_i / [ √(Σ_i A_i²) × √(Σ_i B_i²) ]
2. Execute the recommendation. It uses the items (already rated by the
user) that are most similar to the missing item to generate the rating.
Eg.
Given below is a table that contains some items and the users who have
rated those items. The ratings are explicit, on a scale of 1 to 5. Each
entry in the table denotes the rating given by the i-th user to the j-th
item. We need to find the missing ratings for the respective users.
Fig. Dataset
1. Finding similarities of all the item pairs.
Sim(Item1, Item2):
In the table, we can see that only User_2 and User_3 have rated both
items 1 and 2.
Thus, let I1 be the vector for Item_1 and I2 for Item_2. Then,
I1 = 5U2 + 3U3 and
I2 = 2U2 + 3U3,
giving Sim(Item1, Item2) = (5×2 + 3×3) / (√34 × √13) ≈ 0.90.
Sim(Item2, Item3):
In the table, we can see that only User_3 and User_4 have rated both
items 2 and 3.
Thus, let I2 be the vector for Item_2 and I3 for Item_3. Then,
I2 = 3U3 + 2U4 and
I3 = 1U3 + 2U4,
giving Sim(Item2, Item3) = (3×1 + 2×2) / (√13 × √5) ≈ 0.87.
Sim(Item1, Item3):
In the table, we can see that only User_1 and User_3 have rated both
items 1 and 3.
Thus, let I1 be the vector for Item_1 and I3 for Item_3. Then,
I1 = 2U1 + 3U3 and
I3 = 3U1 + 1U3,
giving Sim(Item1, Item3) = (2×3 + 3×1) / (√13 × √10) ≈ 0.79.
2. Generating the missing ratings in the table
If we assume the threshold rating as 3 then we only recommend I3 to
U2.
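These three pairwise similarities can be verified with a few lines of
numpy, using exactly the co-rating vectors listed above:

import numpy as np

def cosine(a, b):
    # Cosine similarity between two co-rating vectors
    return a @ b / (np.linalg.norm(a) * np.linalg.norm(b))

I1_I2 = cosine(np.array([5, 3]), np.array([2, 3]))  # ratings by U2, U3 -> ~0.90
I2_I3 = cosine(np.array([3, 2]), np.array([1, 2]))  # ratings by U3, U4 -> ~0.87
I1_I3 = cosine(np.array([2, 3]), np.array([3, 1]))  # ratings by U1, U3 -> ~0.79
print(I1_I2, I2_I3, I1_I3)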
Q) Briefly explain Content-based Recommendation techniques.
i. Discovering Features of Documents:
Document collections and images are classes of items where the values of
features are not immediately apparent. There are many kinds of documents
for which a recommendation system can prove to be useful.
Eg. Web pages are a collection of documents. For example, there are many
news articles published each day, and a user cannot read all of them; a
recommendation system can suggest articles on topics the user is
interested in.
To measure the similarity of two documents, there are several natural
distance measures we can use:
1. The Jaccard distance between the sets of words.
Eg.
doc_1 = "Data is the new oil of the digital economy"
doc_2 = "Data is a new oil"
Let’s get the set of unique words for each document.
words_doc1 = {'data', 'is', 'the', 'new', 'oil', 'of', 'digital', 'economy'}
words_doc2 = {'data', 'is', 'a', 'new', 'oil'}
Now, we calculate the intersection and union of these two sets of words
and measure the Jaccard similarity between doc_1 and doc_2 (see the
sketch below).
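In code, this is a direct set computation on the word sets given above:

# Jaccard similarity between the two word sets
words_doc1 = {'data', 'is', 'the', 'new', 'oil', 'of', 'digital', 'economy'}
words_doc2 = {'data', 'is', 'a', 'new', 'oil'}

intersection = words_doc1 & words_doc2               # {'data', 'is', 'new', 'oil'}
union = words_doc1 | words_doc2                      # 9 distinct words
jaccard_similarity = len(intersection) / len(union)  # 4/9 ≈ 0.444
jaccard_distance = 1 - jaccard_similarity            # 5/9 ≈ 0.556
print(jaccard_similarity, jaccard_distance)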
2. The cosine distance between the sets, treated as vectors.
Cosine Similarity:
Suppose that x and y are two term-frequency vectors. Then the cosine
similarity is

cos(x, y) = (x · y) / (‖x‖ × ‖y‖)

where x · y is the dot product of the vectors and ‖x‖ is the Euclidean
norm of x.
Eg.
doc_1 = "Data is the oil of the digital economy"
doc_2 = "Data is a new oil"
# Vector representation of the document
doc_1_vector = [1, 1, 1, 1, 0, 1, 1, 2]
doc_2_vector = [1, 0, 0, 1, 1, 0, 1, 0]
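The vocabulary ordering behind these vectors is not shown in the notes,
but taking the vectors exactly as given, the cosine similarity works out
as follows:

import numpy as np

# Term-frequency vectors as listed above
doc_1_vector = np.array([1, 1, 1, 1, 0, 1, 1, 2])
doc_2_vector = np.array([1, 0, 0, 1, 1, 0, 1, 0])

# Cosine similarity = dot product / (product of Euclidean norms)
cos_sim = (doc_1_vector @ doc_2_vector) / (
    np.linalg.norm(doc_1_vector) * np.linalg.norm(doc_2_vector))
print(cos_sim)  # 3 / (sqrt(10) * 2) ≈ 0.474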
The following two kinds of document similarity exist:
1. Lexical similarity: Documents are similar if they contain large, identical
sequences of characters.
2. Semantic similarity: Semantic similarity is defined over a set of
documents or terms, where the idea of distance between them is based on
the likeness of their meaning or semantic content.
Semantic similarity can be estimated by defining a topological
similarity, using ontologies to define the distance between
terms/concepts. For example, a partially ordered set, represented as the
nodes of a directed acyclic graph, can be used to compare ordered
concepts. Based on text analyses, the semantic relatedness between units
is the shortest path linking the two concept nodes. It can also be
estimated using statistical means, such as a vector space model, to
correlate words and textual contexts from a suitable text corpus.
For recommendation systems, the notion of similarity is different. We
are interested only in the occurrences of many important words in both
documents, even if there is little lexical similarity between the documents.
Item Profile:
In a content-based system, a profile for each item is first constructed;
the profile is a record or a collection of records representing important
characteristics of that item. For example, for a movie recommendation
system, the important characteristics are:
1. The set of actors of the movie.
2. The director.
3. The year in which the movie was made.
4. The genre or general type of movie, and so on.
The objective of content-based recommendation systems is to find and rank
things (documents) according to the user preferences.
To find the similarity between items, the Dice coefficient given below is
used. Let b1, b2 be two items with keyword sets keywords(b1) and
keywords(b2). The similarity between the two is given by

sim(b1, b2) = 2 × |keywords(b1) ∩ keywords(b2)| / ( |keywords(b1)| + |keywords(b2)| )

This approach makes the basic assumption that all keywords are of equal
importance.
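A minimal sketch of the Dice coefficient, assuming two hypothetical
keyword sets (the tags below are made up for illustration):

def dice(b1_keywords, b2_keywords):
    # Dice coefficient between the keyword sets of two items
    overlap = len(b1_keywords & b2_keywords)
    return 2 * overlap / (len(b1_keywords) + len(b2_keywords))

# Hypothetical item keyword sets
b1 = {'action', 'thriller', 'heist'}
b2 = {'action', 'heist', 'comedy', 'crime'}
print(dice(b1, b2))  # 2*2 / (3+4) = 4/7 ≈ 0.571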
ii. Term Frequency−Inverse Document Frequency
TF-IDF stands for “Term Frequency − Inverse Document Frequency”.
TF-IDF is a numerical statistic which measures the importance of a word
in a document.
Term Frequency: the number of times a word appears in a text document.
Inverse Document Frequency: measures whether the word is rare or common
across the documents.
tf(t,d) = (Number of times term t appears in a document) / (Total
number of terms in the document)
Where,
tf(t,d) - Term Frequency, t = term, d = document
idf(t) = log [ n / df(t) ] + 1
where,
idf(t) - Inverse Document Frequency
n - Total number of documents
df(t) is the document frequency of term t;
tf-idf(t, d) = tf(t, d) * idf(t)
Eg.
Consider a document which has a total of 100 words and the
word “book” has occurred 5 times in a document.
Term frequency (tf) = 5 / 100 = 0.05
Let’s assume we have 10,000 documents and the word “book” has occurred
in 1000 of these. Then idf is:
Inverse Document Frequency(IDF) = log[10000/1000] + 1 = 2
TF-IDF = 0.05 * 2 = 0.1
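The same arithmetic in code (using the base-10 logarithm, which matches
the idf value computed above):

import math

tf = 5 / 100                        # "book" occurs 5 times in a 100-word document
idf = math.log10(10000 / 1000) + 1  # "book" appears in 1,000 of 10,000 documents
tf_idf = tf * idf
print(tf, idf, tf_idf)              # 0.05, 2.0, 0.1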
iii. Obtaining Item Features from Tags
Consider items, such as images, for which features are hard to obtain.
The problem with images is that their data, which is an array of pixels,
does not tell us anything useful about their features. In order to obtain
information about the features of such items, users can be asked to tag
the items by entering words or phrases that describe them.
The problem with tagging as an approach to feature discovery is that the
process only works if users are willing to take the trouble to create the
tags, and if the number of erroneous tags is negligible compared to the
total number of tags.
Eg.
To see how the cosine distance between vectors is computed, consider the
movie recommendation system. Suppose the only features of movies are the
set of actors and the average rating. Consider two movies with five
actors each, two of whom are in both movies. Also, one movie has an
average rating of 3 and the other has an average of 4. The vectors look
something like:

[0, 1, 1, 0, 1, 1, 0, 1, 3α]
[1, 1, 0, 1, 0, 1, 1, 0, 4α]
The last component represents the average rating with an unknown scaling
factor α. Computing in terms of α, the cosine of the angle between the
vectors is

cos θ = (2 + 12α²) / [ √(5 + 9α²) × √(5 + 16α²) ]
For α = 1, the cosine is 0.816.
For α = 2, the cosine is 0.940; the vectors are closer in direction than
when α = 1 is used.
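Both cosines can be verified numerically; the sketch below builds the two
vectors for a given α exactly as listed:

import numpy as np

def movie_cosine(alpha):
    # Boolean actor components plus a rating component scaled by alpha
    v1 = np.array([0, 1, 1, 0, 1, 1, 0, 1, 3 * alpha])
    v2 = np.array([1, 1, 0, 1, 0, 1, 1, 0, 4 * alpha])
    return v1 @ v2 / (np.linalg.norm(v1) * np.linalg.norm(v2))

print(movie_cosine(1))  # ≈ 0.816
print(movie_cosine(2))  # ≈ 0.940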
iv. User Profiles
To build user profiles, vectors with the same components, describing the
user’s preferences, are created.
The utility matrix represents the connection between the users and the
items. The entries in the utility matrix, representing user purchases or
a similar connection, could be just 1s, or they could be any numbers
representing a rating or the degree of liking that the user has for the
item.
This information can be used to find the best estimate regarding which
items the user likes. This is taken as aggregation of the profiles of those
items.
If the utility matrix has only 1s, then the natural aggregate is the
average of the components of the vectors representing the item profiles for
the items in which the utility matrix has 1 for that user.
Eg.
Suppose items are movies, represented by Boolean profiles with components
corresponding to actors.
If 25% of the movies that user U likes have Tom Hanks as one of the
actors, then the user profile for U will have 0.25 in the component for Tom
Hanks.
If the utility matrix has ratings 1–5, then we can weigh the vectors
representing the profiles of items by the utility value. The utilities are
normalized by subtracting the average value for a user.
Eg.
Suppose user U has an average rating of 3 and rated three movies
featuring a particular actor as 1, 3 and 4. Then the component of U’s
profile for that actor will be the average of 1 − 3, 3 − 3 and 4 − 3,
that is, the value −1/3. Therefore, items with a below-average rating get
negative weights, and items with above-average ratings get positive
weights.
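The −1/3 comes from mean-centring; a tiny sketch with the ratings from
the example makes the sign behaviour explicit:

ratings = [1, 3, 4]   # user U's ratings for three movies featuring the actor
user_avg = 3          # U's overall average rating, as stated in the example
deviations = [r - user_avg for r in ratings]    # [-2, 0, 1]
component = sum(deviations) / len(deviations)   # -1/3
print(component)  # below-average ratings push the actor's component negative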
The vector for a user will have positive numbers for actors who appear in
movies the user likes and have negative numbers for actors appearing in
movies the user does not like.
1. Case 1: Consider a movie with many actors the user likes, and only a few
or none that the user does not like. The cosine of the angle between the
user’s and movie’s vectors will be a large positive fraction. That implies an
angle close to 0, and therefore a small cosine distance between the vectors.
2. Case 2: Consider a movie with as many actors that the user likes as
those the user does not like. In this situation, the cosine of the angle
between the user and movie is around 0, and therefore the angle between
the two vectors is around 90°.
3. Case 3: Consider a movie with mostly actors the user does not like. In
that case, the cosine will be a large negative fraction, and the angle
between the two vectors will be close to 180°, the maximum possible
cosine distance.
v. Classification Algorithms
Another approach to building a recommendation system is to treat it as a
machine learning problem, using item profiles and utility matrices.
The given data is taken as a training set, and for each user, a
classifier that predicts the rating of all items is built.
There are a number of different classifiers; the most common one is the
decision tree.
A decision tree is a collection of nodes arranged as a binary tree. The
leaves render decisions; in a movie recommendation system, the decision
would be “likes” or “does not like”. Each interior node is a condition on
the objects being classified; here the condition would be a predicate
involving one or more features of an item.
To classify an item, we start at the root and apply the root’s predicate
to the item. If the predicate is true, the path to the left child is
taken; if it is false, the path to the right child is taken. The same
process is repeated at each node visited until a leaf is reached. It is
that leaf which classifies the item as liked or not.
After selecting a predicate for a node N, the items are divided into two
groups: one that satisfies the predicate and the other that does not. For
each group, the predicate that best separates the positive and negative
examples in that group is then found. These predicates are assigned to the
children of N. This process of dividing the examples and building children
can proceed to any number of levels.
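As a hedged sketch of this idea, the snippet below trains scikit-learn’s
DecisionTreeClassifier on hypothetical Boolean item profiles for one
user; the library chooses the splitting predicates automatically:

from sklearn.tree import DecisionTreeClassifier

# Hypothetical item profiles: [has_actor_A, has_actor_B, is_scifi, year > 2010]
item_features = [
    [1, 0, 1, 1],
    [1, 1, 0, 0],
    [0, 1, 1, 1],
    [0, 0, 0, 1],
]
likes = [1, 1, 0, 0]  # one user's known likes/dislikes (the training set)

tree = DecisionTreeClassifier(max_depth=2).fit(item_features, likes)
print(tree.predict([[1, 0, 0, 1]]))  # predicted like/dislike for an unseen item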
Q) Differentiate between user-based and item-based collaborative
filtering recommendation systems.
Collaborative filtering works by finding out similarities between two users or
two items.
User based: Recommend items by finding similar users.
Eg. If you have seen 10 movies and 7 of them have also been seen by
someone else, that implies you both have similar tastes, and you’d be
recommended movies that they have watched and you haven’t.
Item based: Calculate similarity between items and make recommendations.
Eg. If you gave a high rating to a restaurant, then you would be
recommended other restaurants rated highly by people who also rated this
restaurant highly.