SlideShare a Scribd company logo
Sentiment Analysis
An Overview of Concepts and
Selected Techniques
Terms
 Sentiment
 A thought, view, or attitude, especially one
based mainly on emotion instead of reason
 Sentiment Analysis
 aka opinion mining
 use of natural language processing (NLP) and
computational techniques to automate the
extraction or classification of sentiment from
typically unstructured text
Motivation
 Consumer information
 Product reviews
 Marketing
 Consumer attitudes
 Trends
 Politics
 Politicians want to know voters’ views
 Voters want to know policitians’ stances and who else
supports them
 Social
 Find like-minded individuals or communities
Problem
 Which features to use?
 Words (unigrams)
 Phrases/n-grams
 Sentences
 How to interpret features for sentiment
detection?
 Bag of words (IR)
 Annotated lexicons (WordNet, SentiWordNet)
 Syntactic patterns
 Paragraph structure
Challenges
 Harder than topical classification, with
which bag of words features perform well
 Must consider other features due to…
 Subtlety of sentiment expression
 irony
 expression of sentiment using neutral words
 Domain/context dependence
 words/phrases can mean different things in different
contexts and domains
 Effect of syntax on semantics
Approaches
 Machine learning
 Naïve Bayes
 Maximum Entropy Classifier
 SVM
 Markov Blanket Classifier
 Accounts for conditional feature dependencies
 Allowed reduction of discriminating features from
thousands of words to about 20 (movie review
domain)
 Unsupervised methods
 Use lexicons
Assume pairwise
independent features
LingPipe Polarity Classifier
 First eliminate objective sentences, then
use remaining sentences to classify
document polarity (reduce noise)
LingPipe Polarity Classifier
 Uses unigram features extracted from
movie review data
 Assumes that adjacent sentences are
likely to have similar subjective-objective
(SO) polarity
 Uses a min-cut algorithm to efficiently
extract subjective sentences
LingPipe Polarity Classifier
Graph for classifying three items.
LingPipe Polarity Classifier
 Accurate as baseline but uses only 22% of
content in test data (average)
 Metrics suggests properties of movie
review structure
SentiWordNet
 Based on WordNet “synsets”
 http://wordnet.princeton.edu/
 Ternary classifier
 Positive, negative, and neutral scores for each
synset
 Provides means of gauging sentiment for
a text
SentiWordNet: Construction
 Created training sets of synsets, Lp and Ln
 Start with small number of synsets with fundamentally
positive or negative semantics, e.g., “nice” and “nasty”
 Use WordNet relations, e.g., direct antonymy, similarity,
derived-from, to expand Lp and Ln over K iterations
 Lo (objective) is set of synsets not in Lp or Ln
 Trained classifiers on training set
 Rocchio and SVM
 Use four values of K to create eight classifiers with
different precision/recall characteristics
 As K increases, P decreases and R increases
SentiWordNet: Results
 24.6% synsets with Objective<1.0
 Many terms are classified with some degree of
subjectivity
 10.45% with Objective<=0.5
 0.56% with Objective<=0.125
 Only a few terms are classified as definitively
subjective
 Difficult (if not impossible) to accurately
assess performance
SentiWordNet: How to use it
 Use score to select features (+/-)
 e.g. Zhang and Zhang (2006) used words in
corpus with subjectivity score of 0.5 or greater
 Combine pos/neg/objective scores to
calculate document-level score
 e.g. Devitt and Ahmad (2007) conflated
polarity scores with a Wordnet-based graph
representation of documents to create
predictive metrics
References
1. http://www.answers.com/sentiment, 9/22/08
 B. Pang, L. Lee, and S. Vaithyanathan, “Thumbs up? Sentiment
classification using machine learning techniques,” in Proc Conf
on Empirical Methods in Natural Language Processing (EMNLP),
pp. 79–86, 2002.
 Esuli A, Sebastiani F. SentiWordNet: A Publicly Available Lexical
Resource for Opinion Mining. In: Proc of LREC 2006 - 5th Conf
on Language Resources and Evaluation, 2006.
 Zhang E, Zhang Y. UCSC on TREC 2006 Blog Opinion Mining.
TREC 2006 Blog Track, Opinion Retrieval Task.
 Devitt A, Ahmad K. Sentiment Polarity Identification in Financial
News: A Cohesion-based Approach. ACL 2007.
 Bo Pang , Lillian Lee, A sentimental education: sentiment
analysis using subjectivity summarization based on minimum
cuts, Proceedings of the 42nd Annual Meeting on Association for
Computational Linguistics, p.271-es, July 21-26, 2004.

More Related Content

Similar to Sentiment+Analysis.ppt (20)

PPTX
Sentiment Analysis
ishan0019
 
PDF
N01741100102
IOSR Journals
 
PDF
Implementation of Semantic Analysis Using Domain Ontology
IOSR Journals
 
PPTX
Fyp ca2
Haha Teh
 
PPTX
A review on sentiment analysis and emotion detection.pptx
voicemail1
 
PDF
SENTIMENT ANALYSIS-AN OBJECTIVE VIEW
Journal For Research
 
PDF
Experiences with Sentiment Analysis with Peter Zadrozny
padatascience
 
PDF
Sentence level sentiment polarity calculation for customer reviews by conside...
eSAT Publishing House
 
PDF
A Survey On Sentiment Analysis Of Movie Reviews
Shannon Green
 
PDF
Emotion Detection from Text
IJERD Editor
 
PDF
Opinion mining on newspaper headlines using SVM and NLP
IJECEIAES
 
PPTX
Dissertation defense slides on "Semantic Analysis for Improved Multi-document...
Quinsulon Israel
 
PPTX
New
devikamd09
 
DOCX
NLP Techniques for Sentiment Anaysis.docx
KevinSims18
 
PDF
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
Journal For Research
 
PDF
A SURVEY OF S ENTIMENT CLASSIFICATION TECHNIQUES USED FOR I NDIAN REGIONA...
ijcsa
 
PPTX
Sentiment analysis using ml
Pravin Katiyar
 
PPTX
Sentiment analysis
Jennifer D. Davis, Ph.D.
 
PPTX
detect emotion from text
Safayet Hossain
 
PPTX
A presentation on Sentiment Analysis....
ShraddhaShrivastava30
 
Sentiment Analysis
ishan0019
 
N01741100102
IOSR Journals
 
Implementation of Semantic Analysis Using Domain Ontology
IOSR Journals
 
Fyp ca2
Haha Teh
 
A review on sentiment analysis and emotion detection.pptx
voicemail1
 
SENTIMENT ANALYSIS-AN OBJECTIVE VIEW
Journal For Research
 
Experiences with Sentiment Analysis with Peter Zadrozny
padatascience
 
Sentence level sentiment polarity calculation for customer reviews by conside...
eSAT Publishing House
 
A Survey On Sentiment Analysis Of Movie Reviews
Shannon Green
 
Emotion Detection from Text
IJERD Editor
 
Opinion mining on newspaper headlines using SVM and NLP
IJECEIAES
 
Dissertation defense slides on "Semantic Analysis for Improved Multi-document...
Quinsulon Israel
 
NLP Techniques for Sentiment Anaysis.docx
KevinSims18
 
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
Journal For Research
 
A SURVEY OF S ENTIMENT CLASSIFICATION TECHNIQUES USED FOR I NDIAN REGIONA...
ijcsa
 
Sentiment analysis using ml
Pravin Katiyar
 
Sentiment analysis
Jennifer D. Davis, Ph.D.
 
detect emotion from text
Safayet Hossain
 
A presentation on Sentiment Analysis....
ShraddhaShrivastava30
 

Recently uploaded (20)

PDF
Basic_Concepts_in_Clinical_Biochemistry_2018كيمياء_عملي.pdf
AdelLoin
 
PPT
PPT2_Metal formingMECHANICALENGINEEIRNG .ppt
Praveen Kumar
 
PDF
Reasons for the succes of MENARD PRESSUREMETER.pdf
majdiamz
 
PPTX
MobileComputingMANET2023 MobileComputingMANET2023.pptx
masterfake98765
 
PDF
PORTFOLIO Golam Kibria Khan — architect with a passion for thoughtful design...
MasumKhan59
 
PPTX
The Role of Information Technology in Environmental Protectio....pptx
nallamillisriram
 
PPTX
GitOps_Repo_Structure for begeinner(Scaffolindg)
DanialHabibi2
 
PDF
Water Design_Manual_2005. KENYA FOR WASTER SUPPLY AND SEWERAGE
DancanNgutuku
 
PDF
Design Thinking basics for Engineers.pdf
CMR University
 
PDF
Zilliz Cloud Demo for performance and scale
Zilliz
 
PPTX
Hashing Introduction , hash functions and techniques
sailajam21
 
PPTX
Introduction to Design of Machine Elements
PradeepKumarS27
 
PDF
Book.pdf01_Intro.ppt algorithm for preperation stu used
archu26
 
PPTX
原版一样(Acadia毕业证书)加拿大阿卡迪亚大学毕业证办理方法
Taqyea
 
PDF
Biomechanics of Gait: Engineering Solutions for Rehabilitation (www.kiu.ac.ug)
publication11
 
PDF
Ethics and Trustworthy AI in Healthcare – Governing Sensitive Data, Profiling...
AlqualsaDIResearchGr
 
PDF
MAD Unit - 2 Activity and Fragment Management in Android (Diploma IT)
JappanMavani
 
PPTX
Heart Bleed Bug - A case study (Course: Cryptography and Network Security)
Adri Jovin
 
DOCX
8th International Conference on Electrical Engineering (ELEN 2025)
elelijjournal653
 
PPTX
Solar Thermal Energy System Seminar.pptx
Gpc Purapuza
 
Basic_Concepts_in_Clinical_Biochemistry_2018كيمياء_عملي.pdf
AdelLoin
 
PPT2_Metal formingMECHANICALENGINEEIRNG .ppt
Praveen Kumar
 
Reasons for the succes of MENARD PRESSUREMETER.pdf
majdiamz
 
MobileComputingMANET2023 MobileComputingMANET2023.pptx
masterfake98765
 
PORTFOLIO Golam Kibria Khan — architect with a passion for thoughtful design...
MasumKhan59
 
The Role of Information Technology in Environmental Protectio....pptx
nallamillisriram
 
GitOps_Repo_Structure for begeinner(Scaffolindg)
DanialHabibi2
 
Water Design_Manual_2005. KENYA FOR WASTER SUPPLY AND SEWERAGE
DancanNgutuku
 
Design Thinking basics for Engineers.pdf
CMR University
 
Zilliz Cloud Demo for performance and scale
Zilliz
 
Hashing Introduction , hash functions and techniques
sailajam21
 
Introduction to Design of Machine Elements
PradeepKumarS27
 
Book.pdf01_Intro.ppt algorithm for preperation stu used
archu26
 
原版一样(Acadia毕业证书)加拿大阿卡迪亚大学毕业证办理方法
Taqyea
 
Biomechanics of Gait: Engineering Solutions for Rehabilitation (www.kiu.ac.ug)
publication11
 
Ethics and Trustworthy AI in Healthcare – Governing Sensitive Data, Profiling...
AlqualsaDIResearchGr
 
MAD Unit - 2 Activity and Fragment Management in Android (Diploma IT)
JappanMavani
 
Heart Bleed Bug - A case study (Course: Cryptography and Network Security)
Adri Jovin
 
8th International Conference on Electrical Engineering (ELEN 2025)
elelijjournal653
 
Solar Thermal Energy System Seminar.pptx
Gpc Purapuza
 
Ad

Sentiment+Analysis.ppt

  • 1. Sentiment Analysis An Overview of Concepts and Selected Techniques
  • 2. Terms  Sentiment  A thought, view, or attitude, especially one based mainly on emotion instead of reason  Sentiment Analysis  aka opinion mining  use of natural language processing (NLP) and computational techniques to automate the extraction or classification of sentiment from typically unstructured text
  • 3. Motivation  Consumer information  Product reviews  Marketing  Consumer attitudes  Trends  Politics  Politicians want to know voters’ views  Voters want to know policitians’ stances and who else supports them  Social  Find like-minded individuals or communities
  • 4. Problem  Which features to use?  Words (unigrams)  Phrases/n-grams  Sentences  How to interpret features for sentiment detection?  Bag of words (IR)  Annotated lexicons (WordNet, SentiWordNet)  Syntactic patterns  Paragraph structure
  • 5. Challenges  Harder than topical classification, with which bag of words features perform well  Must consider other features due to…  Subtlety of sentiment expression  irony  expression of sentiment using neutral words  Domain/context dependence  words/phrases can mean different things in different contexts and domains  Effect of syntax on semantics
  • 6. Approaches  Machine learning  Naïve Bayes  Maximum Entropy Classifier  SVM  Markov Blanket Classifier  Accounts for conditional feature dependencies  Allowed reduction of discriminating features from thousands of words to about 20 (movie review domain)  Unsupervised methods  Use lexicons Assume pairwise independent features
  • 7. LingPipe Polarity Classifier  First eliminate objective sentences, then use remaining sentences to classify document polarity (reduce noise)
  • 8. LingPipe Polarity Classifier  Uses unigram features extracted from movie review data  Assumes that adjacent sentences are likely to have similar subjective-objective (SO) polarity  Uses a min-cut algorithm to efficiently extract subjective sentences
  • 9. LingPipe Polarity Classifier Graph for classifying three items.
  • 10. LingPipe Polarity Classifier  Accurate as baseline but uses only 22% of content in test data (average)  Metrics suggests properties of movie review structure
  • 11. SentiWordNet  Based on WordNet “synsets”  http://wordnet.princeton.edu/  Ternary classifier  Positive, negative, and neutral scores for each synset  Provides means of gauging sentiment for a text
  • 12. SentiWordNet: Construction  Created training sets of synsets, Lp and Ln  Start with small number of synsets with fundamentally positive or negative semantics, e.g., “nice” and “nasty”  Use WordNet relations, e.g., direct antonymy, similarity, derived-from, to expand Lp and Ln over K iterations  Lo (objective) is set of synsets not in Lp or Ln  Trained classifiers on training set  Rocchio and SVM  Use four values of K to create eight classifiers with different precision/recall characteristics  As K increases, P decreases and R increases
  • 13. SentiWordNet: Results  24.6% synsets with Objective<1.0  Many terms are classified with some degree of subjectivity  10.45% with Objective<=0.5  0.56% with Objective<=0.125  Only a few terms are classified as definitively subjective  Difficult (if not impossible) to accurately assess performance
  • 14. SentiWordNet: How to use it  Use score to select features (+/-)  e.g. Zhang and Zhang (2006) used words in corpus with subjectivity score of 0.5 or greater  Combine pos/neg/objective scores to calculate document-level score  e.g. Devitt and Ahmad (2007) conflated polarity scores with a Wordnet-based graph representation of documents to create predictive metrics
  • 15. References 1. http://www.answers.com/sentiment, 9/22/08  B. Pang, L. Lee, and S. Vaithyanathan, “Thumbs up? Sentiment classification using machine learning techniques,” in Proc Conf on Empirical Methods in Natural Language Processing (EMNLP), pp. 79–86, 2002.  Esuli A, Sebastiani F. SentiWordNet: A Publicly Available Lexical Resource for Opinion Mining. In: Proc of LREC 2006 - 5th Conf on Language Resources and Evaluation, 2006.  Zhang E, Zhang Y. UCSC on TREC 2006 Blog Opinion Mining. TREC 2006 Blog Track, Opinion Retrieval Task.  Devitt A, Ahmad K. Sentiment Polarity Identification in Financial News: A Cohesion-based Approach. ACL 2007.  Bo Pang , Lillian Lee, A sentimental education: sentiment analysis using subjectivity summarization based on minimum cuts, Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, p.271-es, July 21-26, 2004.