SlideShare a Scribd company logo
INFORMATIK
Using Complex Event Processing for Modeling




                                                                       FZI FORSCHUNGSZENTRUM
Semantic Requests in Real-Time Social Media
Monitoring
Dominik Riemer, Nenad Stojanovic, Ljiljana Stojanovic
FZI Research Center for Information Technologies, Karlsruhe, Germany

RAMSS 2012, 04 June 2012
Dublin, Ireland
Agenda


 Introduction

 Use Cases

 System Architecture

 Semantic Requests

 Tools

 Discussion
INTRODUCTION
Why is Social Media Monitoring important?

 Social Media Monitoring
    Continuous process of monitoring a specific subject of matter in social
     media
    Continuous, not necessarily real-time


 Usage of Social Media is emerging
    Wider range of age groups
    New business opportunities for companies


 State of the Art in SMM
    Today: mainly image reputation, early identification of harmful
     messages
    Predictions (e.g., earthquake identification)
    Aggregations over crawled data in order to find overall scores for a
     brand
Search in Social Media

 Historical Search:
    Search in past social media data
    Social network analysis
    Applications:
       market research


 Real-Time Search:
    Search for patterns when data arrives
    Applications:
       proactive marketing
       real-time spinning


 Proactive Search:
    Forecast future developments
    E.g., movie box office revenues
Complex Event Processing

   Event processing is a form of computing that performs operations on events

   CEP is an enabling technology that supports on the fly, (business-) real-
    time processing of huge event streams

   CEP is about a timely (or in head of time) recognition of the situations of
    interest and corresponding reaction
        A complex event pattern describes a situation of interests
USE CASES
Proactive Marketing

 As-Is:
    Buying decisions are often based on recommendations through social
     media channels
    Users share experiences and problems with products in social networks


 Business threads:
      Negative feedback spreads very fast in social networks

 Business opportunities:
    New customers
    More satisfied customers


 Proactive Marketing
    Establishing relations to consumers and customers based on detected
     situations
    Real-time advertising, real-time problem solving
        Example: automatic helpdesk ticket generation
Real-Time Spinning

 As-Is:
    TV debates before elections are an important event for political
     candidates
    Direct feedback of users from social networks


 Business threads
      Early opinions of voters are the basis for future developments


 Business opportunities
      Direct feedback to voters can influence their opinion


 Real-Time spinning
    Additional links to information sources
    Early identification of opinions, increased awareness for specific topics
SYSTEM ARCHITECTURE
Objectives
Some User Requirements                                        Approach

  Real-Time Processing
     Enrichment of social data                    (Distributed) CEP
     High throughput
     Advanced filtering and pattern detection


  Advanced Pattern Definitions
     Usage of domain knowledge                    Semantic Requests
     Pattern definition hides technical details


  Triggering of Actions & Visualizations
                                                      Visual Pattern
       e.g., Notifications                             Modeling
Overall Architecture
SEMANTIC REQUESTS
Challenges of Stream Search
                        Web search                            Stream search
Order of occurrence     Data->Query                           Query->Data
Scope                   Information Needs                     Notification Needs

Data Collection         Crawling                              Real-Time Sources

Optimization Priority   Data                                  Query


Goal                    Fast Processing of Queries            Fast Processing of Data




                                                 Challenge:

                                                 How can we simplify the user-centric
                                                 process of pattern definition?
Conceptual Model

 Semantic Requests
      event pattern extensions
      make use of background knowledge
      should simplify definition of situations of interest
      design-time translation to non-semantic queries


   Relevancy
        Advanced filtering based on background knowledge
        Users are often interested in generalized concepts (e.g., automotive
         manufactures)
   Importance
        Requires extensions with characteristic terms of a domain
        What is when important?
   Unusuality
        Computation of pattern fulfilment degree (white-box event processing)
        Makes use of data mining techniques
Modeling Semantic Requests

 Event pattern definition (Esper)
    select * from UpdateEvent where content like „%apple%“ or content
    like „%ipad%“ or content like „%iphone%“ or content like „%ios%“


 Semantic request-enhanced pattern definition:
        select * from UpdateEvent where sr:anythingRelated(content, like,
        Company, [hasName:Apple]



 Semantic requests:
       sr:anything: Find any instances of a given concept
       sr:anythingRelated: Find any related instances of a given concept
       sr:all: Find all instances of a given concept
       sr:allRelated: Find all related instances of a given concept
       sr:geo: Find all related locations that are related to a given concept
Semantic Requests: Pros and Cons

 Pros:
    No extension of the underlying engine required
    Background knowledge separated from pattern definition
    Easier to maintain, knowledge is defined at a single point
    Faster definition of similar, but frequently changing patterns




 Cons:
    Knowledge base has to be modeled and maintained
    Patterns have to be redeployed after changes in the knowledge base
     occured
    Performance loss due to long pattern specifications
Translation Process




   Definition of background knowledge (Ontology)
   Extraction of semantic requests from the original pattern
   Ontology is loaded
   Generation of SPARQL queries based on semantic requests
   Keywords are added to the resulting EPL
   Pattern is deployed to the event processing engine.
TOOLS
Prototype
Discussion & Future Work

 Semantic Request
   select * from TwitterEvent where content like „%apple%“ or content
   like „%ipad%“ or content like „%iphone%“ or content like „%ios%“



 Problem
    Patterns probably return many unimportant messages
    Example: links to iTunes, fruit, etc.


 Approach:
    Enhancing semantic requests with importance
    e.g., Apple + technology-related topics
    e.g., Apple + technical or finance-related words
THANK YOU! QUESTIONS?
BACKUP
Related Work

 Social Media Monitoring
    Twarql (Mendes et. al. 2010):
       Publishing Twitter Streams as Linked Data
       allows definition of SPARQL queries
    Domain-specific Topic Detection (Schirru et. al., 2010)
       weighting of terms and concepts
       uses data mining methods, not real-time computations


 Semantic Requests / Semantic CEP
      Event Query Pre-Processing (Teymourian 2011)
         rewrite queries based upon background knowledge
         not grounded in social media domain

More Related Content

Similar to Using Complex Event Processing for Modeling Semantic Requests in Real-Time Social Media Monitoring (20)

PPTX
Exploratory Search upon Semantically Described Web Data Sources: Service regi...
Marco Brambilla
 
PDF
ESWC SS 2012 - Friday Keynote Marko Grobelnik: Big Data Tutorial
eswcsummerschool
 
PDF
Big Data Tutorial - Marko Grobelnik - 25 May 2012
Marko Grobelnik
 
PDF
Building Social Life Networks 130818
Ramesh Jain
 
PPT
Socialsensor project overview and topic discovery in tweeter streams
Yiannis Kompatsiaris
 
PDF
EDF2013: Big Data Tutorial: Marko Grobelnik
European Data Forum
 
PDF
Big data tutorial_part4
heyramzz
 
PPTX
Enabling Value Added Services in the Event-based Internet
Yiannis Verginadis
 
PDF
On demand access to Big Data through Semantic Technologies
Peter Haase
 
PDF
Wims2012
Elena Simperl
 
PPTX
Unstructured data to structured meaning for nyu itp camp - 6-22-12 ms
Marshall Sponder
 
PPTX
Siddhi: A Second Look at Complex Event Processing Implementations
Srinath Perera
 
PDF
IRJET- Towards Efficient Framework for Semantic Query Search Engine in Large-...
IRJET Journal
 
PDF
CS6010 Social Network Analysis Unit IV
pkaviya
 
PDF
Real Time Event Processing and In-­memory analysis of Big Data - StampedeCon ...
StampedeCon
 
PDF
Lambda Architecture 2.0 Convergence between Real-Time Analytics, Context-awar...
Sabri Skhiri
 
PDF
2012 02-14-digitale fabriek v - Een praktische aanpak voor implementatie van ...
Sirris
 
PDF
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...
Edward Curry
 
PDF
Real timeeventmonitoringsystem(1)
Atyam Sriharsha
 
PPTX
CHAPTER -12 it.pptx
Koteswari Kasireddy
 
Exploratory Search upon Semantically Described Web Data Sources: Service regi...
Marco Brambilla
 
ESWC SS 2012 - Friday Keynote Marko Grobelnik: Big Data Tutorial
eswcsummerschool
 
Big Data Tutorial - Marko Grobelnik - 25 May 2012
Marko Grobelnik
 
Building Social Life Networks 130818
Ramesh Jain
 
Socialsensor project overview and topic discovery in tweeter streams
Yiannis Kompatsiaris
 
EDF2013: Big Data Tutorial: Marko Grobelnik
European Data Forum
 
Big data tutorial_part4
heyramzz
 
Enabling Value Added Services in the Event-based Internet
Yiannis Verginadis
 
On demand access to Big Data through Semantic Technologies
Peter Haase
 
Wims2012
Elena Simperl
 
Unstructured data to structured meaning for nyu itp camp - 6-22-12 ms
Marshall Sponder
 
Siddhi: A Second Look at Complex Event Processing Implementations
Srinath Perera
 
IRJET- Towards Efficient Framework for Semantic Query Search Engine in Large-...
IRJET Journal
 
CS6010 Social Network Analysis Unit IV
pkaviya
 
Real Time Event Processing and In-­memory analysis of Big Data - StampedeCon ...
StampedeCon
 
Lambda Architecture 2.0 Convergence between Real-Time Analytics, Context-awar...
Sabri Skhiri
 
2012 02-14-digitale fabriek v - Een praktische aanpak voor implementatie van ...
Sirris
 
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...
Edward Curry
 
Real timeeventmonitoringsystem(1)
Atyam Sriharsha
 
CHAPTER -12 it.pptx
Koteswari Kasireddy
 

Recently uploaded (20)

PDF
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
PDF
Staying Human in a Machine- Accelerated World
Catalin Jora
 
PPTX
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
PDF
"AI Transformation: Directions and Challenges", Pavlo Shaternik
Fwdays
 
PDF
Automating Feature Enrichment and Station Creation in Natural Gas Utility Net...
Safe Software
 
DOCX
Cryptography Quiz: test your knowledge of this important security concept.
Rajni Bhardwaj Grover
 
DOCX
Python coding for beginners !! Start now!#
Rajni Bhardwaj Grover
 
PDF
What’s my job again? Slides from Mark Simos talk at 2025 Tampa BSides
Mark Simos
 
PDF
"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...
Fwdays
 
PDF
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
PDF
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
PDF
Transcript: New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
PPTX
Future Tech Innovations 2025 – A TechLists Insight
TechLists
 
PDF
The Rise of AI and IoT in Mobile App Tech.pdf
IMG Global Infotech
 
PDF
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 
PDF
Exolore The Essential AI Tools in 2025.pdf
Srinivasan M
 
PDF
The 2025 InfraRed Report - Redpoint Ventures
Razin Mustafiz
 
PPTX
The Project Compass - GDG on Campus MSIT
dscmsitkol
 
PPTX
AI Penetration Testing Essentials: A Cybersecurity Guide for 2025
defencerabbit Team
 
PDF
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
Staying Human in a Machine- Accelerated World
Catalin Jora
 
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
"AI Transformation: Directions and Challenges", Pavlo Shaternik
Fwdays
 
Automating Feature Enrichment and Station Creation in Natural Gas Utility Net...
Safe Software
 
Cryptography Quiz: test your knowledge of this important security concept.
Rajni Bhardwaj Grover
 
Python coding for beginners !! Start now!#
Rajni Bhardwaj Grover
 
What’s my job again? Slides from Mark Simos talk at 2025 Tampa BSides
Mark Simos
 
"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...
Fwdays
 
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
Transcript: New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
Future Tech Innovations 2025 – A TechLists Insight
TechLists
 
The Rise of AI and IoT in Mobile App Tech.pdf
IMG Global Infotech
 
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 
Exolore The Essential AI Tools in 2025.pdf
Srinivasan M
 
The 2025 InfraRed Report - Redpoint Ventures
Razin Mustafiz
 
The Project Compass - GDG on Campus MSIT
dscmsitkol
 
AI Penetration Testing Essentials: A Cybersecurity Guide for 2025
defencerabbit Team
 
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
Ad

Using Complex Event Processing for Modeling Semantic Requests in Real-Time Social Media Monitoring

  • 1. INFORMATIK Using Complex Event Processing for Modeling FZI FORSCHUNGSZENTRUM Semantic Requests in Real-Time Social Media Monitoring Dominik Riemer, Nenad Stojanovic, Ljiljana Stojanovic FZI Research Center for Information Technologies, Karlsruhe, Germany RAMSS 2012, 04 June 2012 Dublin, Ireland
  • 2. Agenda Introduction Use Cases System Architecture Semantic Requests Tools Discussion
  • 4. Why is Social Media Monitoring important?  Social Media Monitoring  Continuous process of monitoring a specific subject of matter in social media  Continuous, not necessarily real-time  Usage of Social Media is emerging  Wider range of age groups  New business opportunities for companies  State of the Art in SMM  Today: mainly image reputation, early identification of harmful messages  Predictions (e.g., earthquake identification)  Aggregations over crawled data in order to find overall scores for a brand
  • 5. Search in Social Media  Historical Search:  Search in past social media data  Social network analysis  Applications:  market research  Real-Time Search:  Search for patterns when data arrives  Applications:  proactive marketing  real-time spinning  Proactive Search:  Forecast future developments  E.g., movie box office revenues
  • 6. Complex Event Processing  Event processing is a form of computing that performs operations on events  CEP is an enabling technology that supports on the fly, (business-) real- time processing of huge event streams  CEP is about a timely (or in head of time) recognition of the situations of interest and corresponding reaction  A complex event pattern describes a situation of interests
  • 8. Proactive Marketing  As-Is:  Buying decisions are often based on recommendations through social media channels  Users share experiences and problems with products in social networks  Business threads:  Negative feedback spreads very fast in social networks  Business opportunities:  New customers  More satisfied customers  Proactive Marketing  Establishing relations to consumers and customers based on detected situations  Real-time advertising, real-time problem solving  Example: automatic helpdesk ticket generation
  • 9. Real-Time Spinning  As-Is:  TV debates before elections are an important event for political candidates  Direct feedback of users from social networks  Business threads  Early opinions of voters are the basis for future developments  Business opportunities  Direct feedback to voters can influence their opinion  Real-Time spinning  Additional links to information sources  Early identification of opinions, increased awareness for specific topics
  • 11. Objectives Some User Requirements Approach  Real-Time Processing  Enrichment of social data (Distributed) CEP  High throughput  Advanced filtering and pattern detection  Advanced Pattern Definitions  Usage of domain knowledge Semantic Requests  Pattern definition hides technical details  Triggering of Actions & Visualizations Visual Pattern  e.g., Notifications Modeling
  • 14. Challenges of Stream Search Web search Stream search Order of occurrence Data->Query Query->Data Scope Information Needs Notification Needs Data Collection Crawling Real-Time Sources Optimization Priority Data Query Goal Fast Processing of Queries Fast Processing of Data Challenge: How can we simplify the user-centric process of pattern definition?
  • 15. Conceptual Model  Semantic Requests  event pattern extensions  make use of background knowledge  should simplify definition of situations of interest  design-time translation to non-semantic queries  Relevancy  Advanced filtering based on background knowledge  Users are often interested in generalized concepts (e.g., automotive manufactures)  Importance  Requires extensions with characteristic terms of a domain  What is when important?  Unusuality  Computation of pattern fulfilment degree (white-box event processing)  Makes use of data mining techniques
  • 16. Modeling Semantic Requests  Event pattern definition (Esper) select * from UpdateEvent where content like „%apple%“ or content like „%ipad%“ or content like „%iphone%“ or content like „%ios%“  Semantic request-enhanced pattern definition: select * from UpdateEvent where sr:anythingRelated(content, like, Company, [hasName:Apple]  Semantic requests:  sr:anything: Find any instances of a given concept  sr:anythingRelated: Find any related instances of a given concept  sr:all: Find all instances of a given concept  sr:allRelated: Find all related instances of a given concept  sr:geo: Find all related locations that are related to a given concept
  • 17. Semantic Requests: Pros and Cons  Pros:  No extension of the underlying engine required  Background knowledge separated from pattern definition  Easier to maintain, knowledge is defined at a single point  Faster definition of similar, but frequently changing patterns  Cons:  Knowledge base has to be modeled and maintained  Patterns have to be redeployed after changes in the knowledge base occured  Performance loss due to long pattern specifications
  • 18. Translation Process  Definition of background knowledge (Ontology)  Extraction of semantic requests from the original pattern  Ontology is loaded  Generation of SPARQL queries based on semantic requests  Keywords are added to the resulting EPL  Pattern is deployed to the event processing engine.
  • 19. TOOLS
  • 21. Discussion & Future Work  Semantic Request select * from TwitterEvent where content like „%apple%“ or content like „%ipad%“ or content like „%iphone%“ or content like „%ios%“  Problem  Patterns probably return many unimportant messages  Example: links to iTunes, fruit, etc.  Approach:  Enhancing semantic requests with importance  e.g., Apple + technology-related topics  e.g., Apple + technical or finance-related words
  • 24. Related Work  Social Media Monitoring  Twarql (Mendes et. al. 2010):  Publishing Twitter Streams as Linked Data  allows definition of SPARQL queries  Domain-specific Topic Detection (Schirru et. al., 2010)  weighting of terms and concepts  uses data mining methods, not real-time computations  Semantic Requests / Semantic CEP  Event Query Pre-Processing (Teymourian 2011)  rewrite queries based upon background knowledge  not grounded in social media domain