Presentation Web Search
Xander Koning
Justin Verhulst
Problem
● Which political parties appear in the news? Is there an
influence on electoral support?
● Interesting topic for political communication scholars
● Manual content analysis too labor-intensive
➢ Solution: a system that enables users to analyse political
news coverage
Chosen approach
● Entity prominence → How often are political
parties mentioned?
● Time → When are political parties mentioned
in news articles?
● Focus on Dutch news sites NOS and nu.nl
Will the party that is covered most
frequently in the news win the elections?
System: data acquisition
● Crawling of politics section of Nu.nl and NOS website.
● Python’s Scrapy library
● Extract:
- Name of news site
- Article Title
- Full text
- Date of publication
NOS.nl
Web search - Political news analysis tool
Nu.nl: AJAX calls
Web search - Political news analysis tool
System: data acquisition
System: data acquisition
System: data acquisition
Create the index using
mappings
PUT politics
{
"mappings": {
"article": {
"_all": { "enabled": false },
"properties": {
"newssite": { "type": "keyword" },
"title": { "type": "text" },
"text": { "type": "text" },
"date": {
"type": "date",
"format": "dd-MM-yy HH:mm||date_time_no_millis"
} } } }}
System: data processing
and indexing
● Index json files with curl, using elasticsearch’s
bulk API:
curl -s -XPOST 'http://localhost:9200/_bulk' --data-binary @nos.json
curl -s -XPOST 'http://localhost:9200/_bulk' --data-binary @nu.json
● 559 NOS.nl articles, 392 nu.nl articles
● Period: 02-01-2017 - 12-03-2017
Web search - Political news analysis tool
User Interface:
Kibana Dashboard
Web search - Political news analysis tool
Web search - Political news analysis tool
Example analysis
Order of party news coverage (#
of mentions)
1. VVD
2. PVDA
3. PVV
4. D66
5. CDA
6. SP
7. GroenLinks
8. ChristenUnie
9. Denk
10. 50+
11. SGP
12. Partij voor de Dieren
13. FvD
Order of party size (# of seats)
after elections
1. VVD
2. PVV
3. D66
4. CDA
5. SP
6. Groenlinks
7. PVDA
8. ChristenUnie
9. Partij voor de Dieren
10. 50+
11. Denk
12. SGP
13. FvD
Conclusion
● System enables the analysis of large quantities of
party-political news
● We showed a clear correlation between news exposure
and election results → demonstrates the usefulness of
this analysis tool
● Much more to be researched!
Future work
● Expand timeframe
● Add more news sites
● Retrieve more information from articles, e.g.
sentiment
● Evaluate interface with real users (media / political
communication scholars)
Thank you!

More Related Content

PDF
Harris County: Using Elastic to Accelerate Investigations
ODP
Mining the Web of Linked Data with RapidMiner
PDF
Forschungsdaten und Semantische Interoperabiblität
PPTX
Elasticsearch: Getting Started Part 3 Aggregations
PDF
Health Sciences Research Informatics, Powered by Globus
PPTX
Elasticsearch: Getting Started Part 1
PDF
Finding Data Sets
PPTX
d:swarm - A Library Data Management Platform Based on a Linked Open Data Appr...
Harris County: Using Elastic to Accelerate Investigations
Mining the Web of Linked Data with RapidMiner
Forschungsdaten und Semantische Interoperabiblität
Elasticsearch: Getting Started Part 3 Aggregations
Health Sciences Research Informatics, Powered by Globus
Elasticsearch: Getting Started Part 1
Finding Data Sets
d:swarm - A Library Data Management Platform Based on a Linked Open Data Appr...

What's hot (20)

PDF
Data Sharing via Globus in the NIH Intramural Program
PPT
The Power of Semantic Technologies to Explore Linked Open Data
PDF
Deep Web
PDF
VenmoPlus demo week6
PDF
20170501 Distributed Network of Digital Heritage Information
PPT
The Modern Palimpsest
PPTX
Using historical open data for family history - and the value of GB1900 data
PPTX
PhD Projects in Text Mining Research Topics With Source Code
PDF
DBpedia Viewer - LDOW 2014
PDF
SC4 Workshop 2 : Pieter Colpaert - Maximizing the reuse of open transport data
PPTX
Semantic web 101: Benefits for geologists
PPTX
Linked Data media experiment
PPTX
Clustering in Data Mining
PDF
DBpedia mobile
PDF
DBPedia-past-present-future
PDF
Csdh sbg clariah_intr01
PDF
MongoDB API Talk @ HackPrinceton
PPTX
Introduction to Web Mining and Spatial Data Mining
PDF
Linked data experience at Macmillan: Building discovery services for scientif...
Data Sharing via Globus in the NIH Intramural Program
The Power of Semantic Technologies to Explore Linked Open Data
Deep Web
VenmoPlus demo week6
20170501 Distributed Network of Digital Heritage Information
The Modern Palimpsest
Using historical open data for family history - and the value of GB1900 data
PhD Projects in Text Mining Research Topics With Source Code
DBpedia Viewer - LDOW 2014
SC4 Workshop 2 : Pieter Colpaert - Maximizing the reuse of open transport data
Semantic web 101: Benefits for geologists
Linked Data media experiment
Clustering in Data Mining
DBpedia mobile
DBPedia-past-present-future
Csdh sbg clariah_intr01
MongoDB API Talk @ HackPrinceton
Introduction to Web Mining and Spatial Data Mining
Linked data experience at Macmillan: Building discovery services for scientif...
Ad

Similar to Web search - Political news analysis tool (10)

PDF
Groningen nl pgroep
PDF
Keynote Exploring and Exploiting Official Publications
PPT
Expression of Political Opinions in Press
PPTX
PoliMedia - Analysing Mediacoverage of political debates in newspapers, radio...
PDF
Party Transformations In European Democracies Andr Krouwel
PDF
Connecting political data to media data
PDF
Assessing the Benefits of Open Government Data: The Case of Meu Congresso Nac...
PPTX
Media Policy Research Centre
KEY
VMS @ HNS Denk & Drink weekend
PDF
Connecting political data to media data
Groningen nl pgroep
Keynote Exploring and Exploiting Official Publications
Expression of Political Opinions in Press
PoliMedia - Analysing Mediacoverage of political debates in newspapers, radio...
Party Transformations In European Democracies Andr Krouwel
Connecting political data to media data
Assessing the Benefits of Open Government Data: The Case of Meu Congresso Nac...
Media Policy Research Centre
VMS @ HNS Denk & Drink weekend
Connecting political data to media data
Ad

Recently uploaded (20)

PPTX
CIVIL-SOCIETY-AND-NON-GOVERNMENTAL-ORGANIZATIONS.pptx
PPTX
politics and the day to day trading and power of government
DOC
买USF毕业证学历认证,卡罗大学毕业证本科/硕士学位
PDF
SOLUTIONS TO THE GEOPOLITICAL SECURITY THREATS.pdf
PPTX
Thailand and India Shine at PATA Travel Mart 2025 in Bangkok.
PPTX
482687029-The-Beginning-Of-Greek-Mathematics-pptx.pptx
PPTX
Apna Punjab Media: The Definitive Punjabi Source
PDF
New Light on India’s Plight
DOCX
Coinography: Breaking Down the Latest Cryptocurrency and Bitcoin News
PDF
Beyond the Buzz: Consumer Tech and Lifestyle Innovations
PDF
Public Attention on IRB System Problems: Key Trigger Explained
PDF
B. Flight Log_Released in U.S. v. Maxwell_Part6_0.pdf
PDF
GAZA STRIP: Famine confirmed in Gaza Governorate, projected to expand
DOC
办理UofM毕业证学历认证,克莱顿州立大学毕业证国外硕士毕业证
PPTX
Thailand Fashion Week 2025 Set to Ignite Bangkok with Daring Spring-Summer 20...
PDF
Houston City Life - Intown Magazine
PPTX
Flood Disaster in Buner, Khyber Pakhtunkhwa
PPTX
Examining_the_Potential_Relocation_of_Gazans_to_So.pptx
PPTX
British Colonialism in the Subcontinent.pptx
PPTX
political economy meaning and importance.pptx
CIVIL-SOCIETY-AND-NON-GOVERNMENTAL-ORGANIZATIONS.pptx
politics and the day to day trading and power of government
买USF毕业证学历认证,卡罗大学毕业证本科/硕士学位
SOLUTIONS TO THE GEOPOLITICAL SECURITY THREATS.pdf
Thailand and India Shine at PATA Travel Mart 2025 in Bangkok.
482687029-The-Beginning-Of-Greek-Mathematics-pptx.pptx
Apna Punjab Media: The Definitive Punjabi Source
New Light on India’s Plight
Coinography: Breaking Down the Latest Cryptocurrency and Bitcoin News
Beyond the Buzz: Consumer Tech and Lifestyle Innovations
Public Attention on IRB System Problems: Key Trigger Explained
B. Flight Log_Released in U.S. v. Maxwell_Part6_0.pdf
GAZA STRIP: Famine confirmed in Gaza Governorate, projected to expand
办理UofM毕业证学历认证,克莱顿州立大学毕业证国外硕士毕业证
Thailand Fashion Week 2025 Set to Ignite Bangkok with Daring Spring-Summer 20...
Houston City Life - Intown Magazine
Flood Disaster in Buner, Khyber Pakhtunkhwa
Examining_the_Potential_Relocation_of_Gazans_to_So.pptx
British Colonialism in the Subcontinent.pptx
political economy meaning and importance.pptx

Web search - Political news analysis tool

  • 1. Presentation Web Search Xander Koning Justin Verhulst
  • 2. Problem ● Which political parties appear in the news? Is there an influence on electoral support? ● Interesting topic for political communication scholars ● Manual content analysis too labor-intensive ➢ Solution: a system that enables users to analyse political news coverage
  • 3. Chosen approach ● Entity prominence → How often are political parties mentioned? ● Time → When are political parties mentioned in news articles? ● Focus on Dutch news sites NOS and nu.nl
  • 4. Will the party that is covered most frequently in the news win the elections?
  • 5. System: data acquisition ● Crawling of politics section of Nu.nl and NOS website. ● Python’s Scrapy library ● Extract: - Name of news site - Article Title - Full text - Date of publication
  • 13. Create the index using mappings PUT politics { "mappings": { "article": { "_all": { "enabled": false }, "properties": { "newssite": { "type": "keyword" }, "title": { "type": "text" }, "text": { "type": "text" }, "date": { "type": "date", "format": "dd-MM-yy HH:mm||date_time_no_millis" } } } }}
  • 14. System: data processing and indexing ● Index json files with curl, using elasticsearch’s bulk API: curl -s -XPOST 'http://localhost:9200/_bulk' --data-binary @nos.json curl -s -XPOST 'http://localhost:9200/_bulk' --data-binary @nu.json ● 559 NOS.nl articles, 392 nu.nl articles ● Period: 02-01-2017 - 12-03-2017
  • 19. Example analysis Order of party news coverage (# of mentions) 1. VVD 2. PVDA 3. PVV 4. D66 5. CDA 6. SP 7. GroenLinks 8. ChristenUnie 9. Denk 10. 50+ 11. SGP 12. Partij voor de Dieren 13. FvD Order of party size (# of seats) after elections 1. VVD 2. PVV 3. D66 4. CDA 5. SP 6. Groenlinks 7. PVDA 8. ChristenUnie 9. Partij voor de Dieren 10. 50+ 11. Denk 12. SGP 13. FvD
  • 20. Conclusion ● System enables the analysis of large quantities of party-political news ● We showed a clear correlation between news exposure and election results → demonstrates the usefulness of this analysis tool ● Much more to be researched!
  • 21. Future work ● Expand timeframe ● Add more news sites ● Retrieve more information from articles, e.g. sentiment ● Evaluate interface with real users (media / political communication scholars)