SlideShare a Scribd company logo
Presentation Web Search
Xander Koning
Justin Verhulst
Problem
● Which political parties appear in the news? Is there an
influence on electoral support?
● Interesting topic for political communication scholars
● Manual content analysis too labor-intensive
➢ Solution: a system that enables users to analyse political
news coverage
Chosen approach
● Entity prominence → How often are political
parties mentioned?
● Time → When are political parties mentioned
in news articles?
● Focus on Dutch news sites NOS and nu.nl
Will the party that is covered most
frequently in the news win the elections?
System: data acquisition
● Crawling of politics section of Nu.nl and NOS website.
● Python’s Scrapy library
● Extract:
- Name of news site
- Article Title
- Full text
- Date of publication
NOS.nl
Web search - Political news analysis tool
Nu.nl: AJAX calls
Web search - Political news analysis tool
System: data acquisition
System: data acquisition
System: data acquisition
Create the index using
mappings
PUT politics
{
"mappings": {
"article": {
"_all": { "enabled": false },
"properties": {
"newssite": { "type": "keyword" },
"title": { "type": "text" },
"text": { "type": "text" },
"date": {
"type": "date",
"format": "dd-MM-yy HH:mm||date_time_no_millis"
} } } }}
System: data processing
and indexing
● Index json files with curl, using elasticsearch’s
bulk API:
curl -s -XPOST 'http://localhost:9200/_bulk' --data-binary @nos.json
curl -s -XPOST 'http://localhost:9200/_bulk' --data-binary @nu.json
● 559 NOS.nl articles, 392 nu.nl articles
● Period: 02-01-2017 - 12-03-2017
Web search - Political news analysis tool
User Interface:
Kibana Dashboard
Web search - Political news analysis tool
Web search - Political news analysis tool
Example analysis
Order of party news coverage (#
of mentions)
1. VVD
2. PVDA
3. PVV
4. D66
5. CDA
6. SP
7. GroenLinks
8. ChristenUnie
9. Denk
10. 50+
11. SGP
12. Partij voor de Dieren
13. FvD
Order of party size (# of seats)
after elections
1. VVD
2. PVV
3. D66
4. CDA
5. SP
6. Groenlinks
7. PVDA
8. ChristenUnie
9. Partij voor de Dieren
10. 50+
11. Denk
12. SGP
13. FvD
Conclusion
● System enables the analysis of large quantities of
party-political news
● We showed a clear correlation between news exposure
and election results → demonstrates the usefulness of
this analysis tool
● Much more to be researched!
Future work
● Expand timeframe
● Add more news sites
● Retrieve more information from articles, e.g.
sentiment
● Evaluate interface with real users (media / political
communication scholars)
Thank you!

More Related Content

PDF
Harris County: Using Elastic to Accelerate Investigations
Elasticsearch
 
ODP
Mining the Web of Linked Data with RapidMiner
Heiko Paulheim
 
PDF
Forschungsdaten und Semantische Interoperabiblität
kostaedt
 
PPTX
Elasticsearch: Getting Started Part 3 Aggregations
Suyog Kale
 
PDF
Health Sciences Research Informatics, Powered by Globus
Globus
 
PPTX
Elasticsearch: Getting Started Part 1
Suyog Kale
 
PDF
Finding Data Sets
Anja Jentzsch
 
PPTX
d:swarm - A Library Data Management Platform Based on a Linked Open Data Appr...
Jens Mittelbach
 
Harris County: Using Elastic to Accelerate Investigations
Elasticsearch
 
Mining the Web of Linked Data with RapidMiner
Heiko Paulheim
 
Forschungsdaten und Semantische Interoperabiblität
kostaedt
 
Elasticsearch: Getting Started Part 3 Aggregations
Suyog Kale
 
Health Sciences Research Informatics, Powered by Globus
Globus
 
Elasticsearch: Getting Started Part 1
Suyog Kale
 
Finding Data Sets
Anja Jentzsch
 
d:swarm - A Library Data Management Platform Based on a Linked Open Data Appr...
Jens Mittelbach
 

What's hot (20)

PDF
Data Sharing via Globus in the NIH Intramural Program
Globus
 
PPT
The Power of Semantic Technologies to Explore Linked Open Data
Ontotext
 
PDF
Deep Web
Gol D Roger
 
PDF
VenmoPlus demo week6
Qingpeng "Q.P." Zhang
 
PDF
20170501 Distributed Network of Digital Heritage Information
Enno Meijers
 
PPT
The Modern Palimpsest
Leigh Dodds
 
PPTX
Using historical open data for family history - and the value of GB1900 data
Tom Pert
 
PPTX
PhD Projects in Text Mining Research Topics With Source Code
PhD Services
 
PDF
DBpedia Viewer - LDOW 2014
Dimitris Kontokostas
 
PDF
SC4 Workshop 2 : Pieter Colpaert - Maximizing the reuse of open transport data
BigData_Europe
 
PPTX
Semantic web 101: Benefits for geologists
dgarijo
 
PPTX
Linked Data media experiment
MediArena
 
PPTX
Clustering in Data Mining
S.Mostafa Sayyedi
 
PDF
DBpedia mobile
Kishoj Bajracharya
 
PDF
DBPedia-past-present-future
Data Science Society
 
PDF
Csdh sbg clariah_intr01
Richard Zijdeman
 
PDF
MongoDB API Talk @ HackPrinceton
Valeri Karpov
 
PPTX
Introduction to Web Mining and Spatial Data Mining
AarshDhokai
 
PDF
Linked data experience at Macmillan: Building discovery services for scientif...
Michele Pasin
 
Data Sharing via Globus in the NIH Intramural Program
Globus
 
The Power of Semantic Technologies to Explore Linked Open Data
Ontotext
 
Deep Web
Gol D Roger
 
VenmoPlus demo week6
Qingpeng "Q.P." Zhang
 
20170501 Distributed Network of Digital Heritage Information
Enno Meijers
 
The Modern Palimpsest
Leigh Dodds
 
Using historical open data for family history - and the value of GB1900 data
Tom Pert
 
PhD Projects in Text Mining Research Topics With Source Code
PhD Services
 
DBpedia Viewer - LDOW 2014
Dimitris Kontokostas
 
SC4 Workshop 2 : Pieter Colpaert - Maximizing the reuse of open transport data
BigData_Europe
 
Semantic web 101: Benefits for geologists
dgarijo
 
Linked Data media experiment
MediArena
 
Clustering in Data Mining
S.Mostafa Sayyedi
 
DBpedia mobile
Kishoj Bajracharya
 
DBPedia-past-present-future
Data Science Society
 
Csdh sbg clariah_intr01
Richard Zijdeman
 
MongoDB API Talk @ HackPrinceton
Valeri Karpov
 
Introduction to Web Mining and Spatial Data Mining
AarshDhokai
 
Linked data experience at Macmillan: Building discovery services for scientif...
Michele Pasin
 
Ad

Similar to Web search - Political news analysis tool (10)

PDF
Groningen nl pgroep
maartenmarx
 
PDF
Keynote Exploring and Exploiting Official Publications
maartenmarx
 
PPT
Expression of Political Opinions in Press
University Politehnica Bucharest
 
PPTX
PoliMedia - Analysing Mediacoverage of political debates in newspapers, radio...
Martijn Kleppe
 
PDF
Party Transformations In European Democracies Andr Krouwel
semranching69
 
PDF
Connecting political data to media data
Laura Hollink
 
PDF
Assessing the Benefits of Open Government Data: The Case of Meu Congresso Nac...
Kellyton Brito
 
PPTX
Media Policy Research Centre
Centre for Media Pluralism and Media Freedom
 
KEY
VMS @ HNS Denk & Drink weekend
Ben Teeuwen
 
PDF
Connecting political data to media data
Laura Hollink
 
Groningen nl pgroep
maartenmarx
 
Keynote Exploring and Exploiting Official Publications
maartenmarx
 
Expression of Political Opinions in Press
University Politehnica Bucharest
 
PoliMedia - Analysing Mediacoverage of political debates in newspapers, radio...
Martijn Kleppe
 
Party Transformations In European Democracies Andr Krouwel
semranching69
 
Connecting political data to media data
Laura Hollink
 
Assessing the Benefits of Open Government Data: The Case of Meu Congresso Nac...
Kellyton Brito
 
Media Policy Research Centre
Centre for Media Pluralism and Media Freedom
 
VMS @ HNS Denk & Drink weekend
Ben Teeuwen
 
Connecting political data to media data
Laura Hollink
 
Ad

Recently uploaded (20)

PDF
January_onlinecopy_2025_compressed.pdf.municip
municipluseonline
 
PDF
FROM ISRAEL'S USURPATION OF PALESTINIAN TERRITORY TO THE GENOCIDE IN THE GAZA...
Faga1939
 
PPTX
The Branches of Philosophy-Grade 12.pptx
arculankrishellemae
 
DOCX
Maharashtra Election Result 2024: Trends Compared to 2019 and 2024
maharashtraelection6
 
PDF
Self Defence Workshop In Dwarka Delhi.pdf
Sandhya Singh
 
PDF
25072025_First India Newspaper Jaipur.pdf
FIRST INDIA
 
PPTX
EU adopts 18th Sanctions Package against Russia
RT Consulting Limited
 
PDF
Kerala From Literacy Hub to Strategic radicals 2.pdf
Aaryan Kansari
 
PPTX
Stay Updated with Apna Punjab Media: Your Daily Punjab News in Punjabi
ApnaPunjabMedia1
 
PDF
Starlink Suffers Major Global Outage Due to Internal Software Failure.pdf
New India Abroad
 
PPTX
Chiang Rai Battles Severe Flooding Affecting Over 4,400 Households
hepburnindia
 
PDF
Thailand_Cambodia_Travel_Advisories_Blogs.pdf
bumppy48
 
PDF
29072025_First India Newspaper Jaipur.pdf
FIRST INDIA
 
PDF
Reporter Travel: Telling Stories from the Field – Powered by 5wh.com
5whnewsletter
 
PDF
Making Britain work for everyone How can employers go further to recruit and ...
ResolutionFoundation
 
PPTX
Naija Blog - Naija News, Top Nigerian Stories
Naija Blog
 
PDF
An Uncut Conversation With My Co-Pilot | PDF Document
Mike Hydes
 
PDF
U.S. Lawmakers Urge Action on Indian Student Visa Delays.pdf
New India Abroad
 
PDF
Summary of Final Report of the UN OEWG on State Use of ICT (Cyber) 2025.pdf
Benjamin Ang
 
PPTX
11HUMSS-LESSSONS-POLITICSGOVERNANCE.pptx
CarlVillanueva11
 
January_onlinecopy_2025_compressed.pdf.municip
municipluseonline
 
FROM ISRAEL'S USURPATION OF PALESTINIAN TERRITORY TO THE GENOCIDE IN THE GAZA...
Faga1939
 
The Branches of Philosophy-Grade 12.pptx
arculankrishellemae
 
Maharashtra Election Result 2024: Trends Compared to 2019 and 2024
maharashtraelection6
 
Self Defence Workshop In Dwarka Delhi.pdf
Sandhya Singh
 
25072025_First India Newspaper Jaipur.pdf
FIRST INDIA
 
EU adopts 18th Sanctions Package against Russia
RT Consulting Limited
 
Kerala From Literacy Hub to Strategic radicals 2.pdf
Aaryan Kansari
 
Stay Updated with Apna Punjab Media: Your Daily Punjab News in Punjabi
ApnaPunjabMedia1
 
Starlink Suffers Major Global Outage Due to Internal Software Failure.pdf
New India Abroad
 
Chiang Rai Battles Severe Flooding Affecting Over 4,400 Households
hepburnindia
 
Thailand_Cambodia_Travel_Advisories_Blogs.pdf
bumppy48
 
29072025_First India Newspaper Jaipur.pdf
FIRST INDIA
 
Reporter Travel: Telling Stories from the Field – Powered by 5wh.com
5whnewsletter
 
Making Britain work for everyone How can employers go further to recruit and ...
ResolutionFoundation
 
Naija Blog - Naija News, Top Nigerian Stories
Naija Blog
 
An Uncut Conversation With My Co-Pilot | PDF Document
Mike Hydes
 
U.S. Lawmakers Urge Action on Indian Student Visa Delays.pdf
New India Abroad
 
Summary of Final Report of the UN OEWG on State Use of ICT (Cyber) 2025.pdf
Benjamin Ang
 
11HUMSS-LESSSONS-POLITICSGOVERNANCE.pptx
CarlVillanueva11
 

Web search - Political news analysis tool

  • 1. Presentation Web Search Xander Koning Justin Verhulst
  • 2. Problem ● Which political parties appear in the news? Is there an influence on electoral support? ● Interesting topic for political communication scholars ● Manual content analysis too labor-intensive ➢ Solution: a system that enables users to analyse political news coverage
  • 3. Chosen approach ● Entity prominence → How often are political parties mentioned? ● Time → When are political parties mentioned in news articles? ● Focus on Dutch news sites NOS and nu.nl
  • 4. Will the party that is covered most frequently in the news win the elections?
  • 5. System: data acquisition ● Crawling of politics section of Nu.nl and NOS website. ● Python’s Scrapy library ● Extract: - Name of news site - Article Title - Full text - Date of publication
  • 13. Create the index using mappings PUT politics { "mappings": { "article": { "_all": { "enabled": false }, "properties": { "newssite": { "type": "keyword" }, "title": { "type": "text" }, "text": { "type": "text" }, "date": { "type": "date", "format": "dd-MM-yy HH:mm||date_time_no_millis" } } } }}
  • 14. System: data processing and indexing ● Index json files with curl, using elasticsearch’s bulk API: curl -s -XPOST 'http://localhost:9200/_bulk' --data-binary @nos.json curl -s -XPOST 'http://localhost:9200/_bulk' --data-binary @nu.json ● 559 NOS.nl articles, 392 nu.nl articles ● Period: 02-01-2017 - 12-03-2017
  • 19. Example analysis Order of party news coverage (# of mentions) 1. VVD 2. PVDA 3. PVV 4. D66 5. CDA 6. SP 7. GroenLinks 8. ChristenUnie 9. Denk 10. 50+ 11. SGP 12. Partij voor de Dieren 13. FvD Order of party size (# of seats) after elections 1. VVD 2. PVV 3. D66 4. CDA 5. SP 6. Groenlinks 7. PVDA 8. ChristenUnie 9. Partij voor de Dieren 10. 50+ 11. Denk 12. SGP 13. FvD
  • 20. Conclusion ● System enables the analysis of large quantities of party-political news ● We showed a clear correlation between news exposure and election results → demonstrates the usefulness of this analysis tool ● Much more to be researched!
  • 21. Future work ● Expand timeframe ● Add more news sites ● Retrieve more information from articles, e.g. sentiment ● Evaluate interface with real users (media / political communication scholars)