Carolin Gerlitz
(based on joint work with Liliana Bounegru & Jonathan Gray)
University of Siegen – Digital Methods Initiative Amsterdam
Infrastructuring eResearch Workshop, Dec 7 2016
DE- & REASSEMBLING DATA
INFRASTRUCTURES
DIGITAL METHODS
Widely used term.
Repurposing of (1) digital data (2)
technical features (3) analytical
capacities.
How can links, likes, shares, comments
etc. be used for research? (Rogers
2013)
DIGITAL METHODS
Structured extraction & analysis of data.
Two main objectives: (1) study sociality
online (2) understand medium
specificity and socio-technical
configurations
Tool development, repurposing &
training.
EXAMPLE: TCAT
Twitter Capture and Analysis Toolkit
(TCAT).
Developed by Erik Borra & Bernhard
Rieder.
Data collection & analysis of Twitter data
based on Streaming API.
DE-ASSEMBLING
Digital methods rely on the
participation of a variety of actors
and entities.
Data & tool chaining.
What are the data infrastructures
that underpin dm work? What
challenges do they pose?
DATA PRODUCTION
Data production as distributed
accomplishment of users,
platform activities, capture
mechanisms, third party apps &
cross-platform syndication.
(1) COMMENSURATION
Cross-platform syndication, different
interpretation of platform features,
bots & automation.
How to commensurate data from
heterogeneous sources
(Espeland & Stevens 1998)?
s
(2) MULTIVALENT YET
BIASED
Data set out to cater to different
analytical interests of
stakeholders.
At the same time: support some
forms of analysis more than others
(interestedness).
DATA EXTRACTION
Scraping, crawling, API retrieval.
Reliant on platform data structures
and API politics, tools, plugins
and scripts.
Platforms determine the conditions of
access to their data.
Instagram Hashtag Explorer
DATA ANALYSIS
Reliant on further tools for querying
data, calculating metrics, stats or
combination of data formats.
DMI TCAT
(3) METHODOLOGICAL
UNCANNY
Open or commercial tools resonate
with known methods – but not
quite (Marres & Gerlitz 2015).
DATA VISUALISATION
Visualisation standards and data
outputs.
Which data formats are amenable for
which visualisation technique?
What interestedness does
visualisation introduce?
D3, tableau, Gephi
(4) TOOL CHAINNG
Assembling of different data sources
& tools for different tasks into a
methodological apparatus.
Cascades of inscriptions (Ruppert
et al 2013).
(5) DISTRIBUTED TOOL
MAKING
Many general purpose tools (incl.
extensive documentation).
Heterogeneous developers and
emergent standards.
Which tools can be chained?
How can open source tools be
maintained and scaled up?
(6) DATA PUBLICS
Data assemble heterogeneous publics
with different objectives, interests,
skills & needs (Ruppert 2015,
Birchall 2015).
Researchers, companies,
organisations, activists, journalism.
ALLINGING DATA
INFRATSRUCTURES
Methodological work as de- &
reassembly.
Specific to needs of publics.
Alignment & mal-alignment of
data sources, tools,
visualisations and research
objectives: need for
repositories and shared dev.
(RE)IMAGINING DATA
INSTRUCTURES
From data literacy to data
infrastructure literacy (Gray et
al. 2017).
Accounting for inscription, alignment
and malalignment.
Enable to re-think, re-assemble and
re-align infrastructures.
Methodological infrastuctural
imagination (Bowker 2014).
ANTWORT
:
SITUIERTE
ALGORITH
MEN
THANK YOU.
carolin.gerlitz@uni-siegen.de

More Related Content

PPTX
Mdst 3559-01-25-data-journalism
PDF
Using graph technology for multi-INT investigations
PDF
Big Data Analytics IEEE 2015 Projects
PDF
Bi g data_urban modeling_applications_23092013
PDF
The Nature of Digitally-Produced Data: Towards Social-Scientific Tool Criticism
PDF
Pricing and business model Fusepool
PDF
Open Data Analytics for Parliamentary Monitoring in Finland
PDF
Use of graphs for political analysis
Mdst 3559-01-25-data-journalism
Using graph technology for multi-INT investigations
Big Data Analytics IEEE 2015 Projects
Bi g data_urban modeling_applications_23092013
The Nature of Digitally-Produced Data: Towards Social-Scientific Tool Criticism
Pricing and business model Fusepool
Open Data Analytics for Parliamentary Monitoring in Finland
Use of graphs for political analysis

What's hot (18)

PPTX
Quantum4D - Big Thinking - View Samples
PPTX
From Big Linked Data to Linked Big Data - DBpedia as a framework for data int...
PDF
Computational Journalism
PPTX
FSF innovation tools for strengthening integrity and risk adjusted certification
PDF
20200901 ECCB M. Kutmon
PPTX
IR tutorial
PDF
PID Services for FAIR data
PDF
PID services - understandability and findability of data
PDF
High-value datasets: from publication to impact
PPTX
Web Mining Project Ideas
PDF
Narrata_Final
PDF
Associating e-government and e-participation indexes with governmental twitte...
PPTX
20200130_Mannocci_OpenAIRE_ResearchGraph
PDF
Text mining through Non Negative Matrix Factorizations
PPTX
Graph database
PPTX
Introduction to OpenDataCommunities | Linda O'Halloran
PPTX
Mdst 3559-01-27-data-journalism-studio
PPT
MPROP Pal: Helping Planners Work With Property Data
Quantum4D - Big Thinking - View Samples
From Big Linked Data to Linked Big Data - DBpedia as a framework for data int...
Computational Journalism
FSF innovation tools for strengthening integrity and risk adjusted certification
20200901 ECCB M. Kutmon
IR tutorial
PID Services for FAIR data
PID services - understandability and findability of data
High-value datasets: from publication to impact
Web Mining Project Ideas
Narrata_Final
Associating e-government and e-participation indexes with governmental twitte...
20200130_Mannocci_OpenAIRE_ResearchGraph
Text mining through Non Negative Matrix Factorizations
Graph database
Introduction to OpenDataCommunities | Linda O'Halloran
Mdst 3559-01-27-data-journalism-studio
MPROP Pal: Helping Planners Work With Property Data
Ad

Viewers also liked (13)

PDF
Creation of Social Housing with Private Investors
PPTX
Company_profile_public_Nov2016
PDF
Anbefalingshæfte-ebog
DOCX
Tìm hiểu cách dùng pic để chạy motor bước
PDF
Reference letter from Tim Cottrell (1)
PDF
Diseño sin título
PDF
Certificate of Commendation Attendance 2006 Term 2
PDF
Certificate of Volunteer Service_James Welch
PDF
DIVYA RESUME_Final
PDF
WORLD’S LATEST 3D DIGITAL MAMMOGRAPHY SYSTEM
PDF
SAF Service Transcript
PPTX
"Hyvä mieli on päällimmäisin ajatus" - Kokemuksia järjestöistä oppimisympäris...
PPTX
Introduction
Creation of Social Housing with Private Investors
Company_profile_public_Nov2016
Anbefalingshæfte-ebog
Tìm hiểu cách dùng pic để chạy motor bước
Reference letter from Tim Cottrell (1)
Diseño sin título
Certificate of Commendation Attendance 2006 Term 2
Certificate of Volunteer Service_James Welch
DIVYA RESUME_Final
WORLD’S LATEST 3D DIGITAL MAMMOGRAPHY SYSTEM
SAF Service Transcript
"Hyvä mieli on päällimmäisin ajatus" - Kokemuksia järjestöistä oppimisympäris...
Introduction
Ad

Similar to De- and Reassembling Data Infrastructures (20)

PDF
Opportunities and methodological challenges of Big Data for official statist...
PDF
Towards a Community-driven Data Science Body of Knowledge – Data Management S...
PPTX
Analyzing Social Media with Digital Methods. Possibilities, Requirements, and...
PDF
Application and Methods of Deep Learning in IoT
PDF
A SURVEY OF BIG DATA ANALYTICS..........
PDF
A SURVEY OF BIG DATA ANALYTICS
PDF
KIT-601 Lecture Notes-UNIT-1.pdf
PDF
Fundamentals of data mining and its applications
PPTX
Researching Social Media – Big Data and Social Media Analysis
PDF
Big data Mining Using Very-Large-Scale Data Processing Platforms
DOCX
Big Data Analytics
PDF
RESEARCH IN BIG DATA – AN OVERVIEW
PDF
RESEARCH IN BIG DATA – AN OVERVIEW
PDF
RESEARCH IN BIG DATA – AN OVERVIEW
PDF
Research in Big Data - An Overview
PPT
using big-data methods analyse the Cross platform aviation
PDF
Participatory public data infrastructure: open data standards and the turn to...
PDF
A Comprehensive Overview of Advance Techniques, Applications and Challenges i...
PDF
Scraping and Clustering Techniques for the Characterization of Linkedin Profiles
PDF
Scraping and clustering techniques
Opportunities and methodological challenges of Big Data for official statist...
Towards a Community-driven Data Science Body of Knowledge – Data Management S...
Analyzing Social Media with Digital Methods. Possibilities, Requirements, and...
Application and Methods of Deep Learning in IoT
A SURVEY OF BIG DATA ANALYTICS..........
A SURVEY OF BIG DATA ANALYTICS
KIT-601 Lecture Notes-UNIT-1.pdf
Fundamentals of data mining and its applications
Researching Social Media – Big Data and Social Media Analysis
Big data Mining Using Very-Large-Scale Data Processing Platforms
Big Data Analytics
RESEARCH IN BIG DATA – AN OVERVIEW
RESEARCH IN BIG DATA – AN OVERVIEW
RESEARCH IN BIG DATA – AN OVERVIEW
Research in Big Data - An Overview
using big-data methods analyse the Cross platform aviation
Participatory public data infrastructure: open data standards and the turn to...
A Comprehensive Overview of Advance Techniques, Applications and Challenges i...
Scraping and Clustering Techniques for the Characterization of Linkedin Profiles
Scraping and clustering techniques

More from cgrltz (6)

PPTX
App ecologies: Mapping apps and their support networks
PPTX
AoIR 2016 Digital Methods Workshop - Tracking the Trackers
PDF
The Numbering Life of Platforms. Organising Value and Relations in Social Media.
PDF
What counts in social media? - Politics of Big Data conference
PDF
Becoming data point
PDF
One percent of twitter
App ecologies: Mapping apps and their support networks
AoIR 2016 Digital Methods Workshop - Tracking the Trackers
The Numbering Life of Platforms. Organising Value and Relations in Social Media.
What counts in social media? - Politics of Big Data conference
Becoming data point
One percent of twitter

Recently uploaded (20)

PPTX
Integrated Management of Neonatal and Childhood Illnesses (IMNCI) – Unit IV |...
PPTX
Climate Change and Its Global Impact.pptx
PPTX
What’s under the hood: Parsing standardized learning content for AI
PDF
LIFE & LIVING TRILOGY - PART - (2) THE PURPOSE OF LIFE.pdf
PDF
The TKT Course. Modules 1, 2, 3.for self study
PDF
Journal of Dental Science - UDMY (2022).pdf
PDF
MBA _Common_ 2nd year Syllabus _2021-22_.pdf
PDF
International_Financial_Reporting_Standa.pdf
PDF
semiconductor packaging in vlsi design fab
PDF
LIFE & LIVING TRILOGY- PART (1) WHO ARE WE.pdf
PDF
MICROENCAPSULATION_NDDS_BPHARMACY__SEM VII_PCI Syllabus.pdf
PPTX
Thinking Routines and Learning Engagements.pptx
PDF
Comprehensive Lecture on the Appendix.pdf
PDF
Disorder of Endocrine system (1).pdfyyhyyyy
PDF
Fun with Grammar (Communicative Activities for the Azar Grammar Series)
PDF
Compact First Student's Book Cambridge Official
PDF
fundamentals-of-heat-and-mass-transfer-6th-edition_incropera.pdf
PDF
English Textual Question & Ans (12th Class).pdf
PPTX
Education and Perspectives of Education.pptx
PDF
CRP102_SAGALASSOS_Final_Projects_2025.pdf
Integrated Management of Neonatal and Childhood Illnesses (IMNCI) – Unit IV |...
Climate Change and Its Global Impact.pptx
What’s under the hood: Parsing standardized learning content for AI
LIFE & LIVING TRILOGY - PART - (2) THE PURPOSE OF LIFE.pdf
The TKT Course. Modules 1, 2, 3.for self study
Journal of Dental Science - UDMY (2022).pdf
MBA _Common_ 2nd year Syllabus _2021-22_.pdf
International_Financial_Reporting_Standa.pdf
semiconductor packaging in vlsi design fab
LIFE & LIVING TRILOGY- PART (1) WHO ARE WE.pdf
MICROENCAPSULATION_NDDS_BPHARMACY__SEM VII_PCI Syllabus.pdf
Thinking Routines and Learning Engagements.pptx
Comprehensive Lecture on the Appendix.pdf
Disorder of Endocrine system (1).pdfyyhyyyy
Fun with Grammar (Communicative Activities for the Azar Grammar Series)
Compact First Student's Book Cambridge Official
fundamentals-of-heat-and-mass-transfer-6th-edition_incropera.pdf
English Textual Question & Ans (12th Class).pdf
Education and Perspectives of Education.pptx
CRP102_SAGALASSOS_Final_Projects_2025.pdf

De- and Reassembling Data Infrastructures

  • 1. Carolin Gerlitz (based on joint work with Liliana Bounegru & Jonathan Gray) University of Siegen – Digital Methods Initiative Amsterdam Infrastructuring eResearch Workshop, Dec 7 2016 DE- & REASSEMBLING DATA INFRASTRUCTURES
  • 2. DIGITAL METHODS Widely used term. Repurposing of (1) digital data (2) technical features (3) analytical capacities. How can links, likes, shares, comments etc. be used for research? (Rogers 2013)
  • 3. DIGITAL METHODS Structured extraction & analysis of data. Two main objectives: (1) study sociality online (2) understand medium specificity and socio-technical configurations Tool development, repurposing & training.
  • 4. EXAMPLE: TCAT Twitter Capture and Analysis Toolkit (TCAT). Developed by Erik Borra & Bernhard Rieder. Data collection & analysis of Twitter data based on Streaming API.
  • 5. DE-ASSEMBLING Digital methods rely on the participation of a variety of actors and entities. Data & tool chaining. What are the data infrastructures that underpin dm work? What challenges do they pose?
  • 6. DATA PRODUCTION Data production as distributed accomplishment of users, platform activities, capture mechanisms, third party apps & cross-platform syndication.
  • 7. (1) COMMENSURATION Cross-platform syndication, different interpretation of platform features, bots & automation. How to commensurate data from heterogeneous sources (Espeland & Stevens 1998)? s
  • 8. (2) MULTIVALENT YET BIASED Data set out to cater to different analytical interests of stakeholders. At the same time: support some forms of analysis more than others (interestedness).
  • 9. DATA EXTRACTION Scraping, crawling, API retrieval. Reliant on platform data structures and API politics, tools, plugins and scripts. Platforms determine the conditions of access to their data. Instagram Hashtag Explorer
  • 10. DATA ANALYSIS Reliant on further tools for querying data, calculating metrics, stats or combination of data formats. DMI TCAT
  • 11. (3) METHODOLOGICAL UNCANNY Open or commercial tools resonate with known methods – but not quite (Marres & Gerlitz 2015).
  • 12. DATA VISUALISATION Visualisation standards and data outputs. Which data formats are amenable for which visualisation technique? What interestedness does visualisation introduce? D3, tableau, Gephi
  • 13. (4) TOOL CHAINNG Assembling of different data sources & tools for different tasks into a methodological apparatus. Cascades of inscriptions (Ruppert et al 2013).
  • 14. (5) DISTRIBUTED TOOL MAKING Many general purpose tools (incl. extensive documentation). Heterogeneous developers and emergent standards. Which tools can be chained? How can open source tools be maintained and scaled up?
  • 15. (6) DATA PUBLICS Data assemble heterogeneous publics with different objectives, interests, skills & needs (Ruppert 2015, Birchall 2015). Researchers, companies, organisations, activists, journalism.
  • 16. ALLINGING DATA INFRATSRUCTURES Methodological work as de- & reassembly. Specific to needs of publics. Alignment & mal-alignment of data sources, tools, visualisations and research objectives: need for repositories and shared dev.
  • 17. (RE)IMAGINING DATA INSTRUCTURES From data literacy to data infrastructure literacy (Gray et al. 2017). Accounting for inscription, alignment and malalignment. Enable to re-think, re-assemble and re-align infrastructures. Methodological infrastuctural imagination (Bowker 2014).

Editor's Notes

  • #4: Abgrenzung Digital Humanities.
  • #5: Abgrenzung Digital Humanities.
  • #16: Not only researchers
  • #18: heterogenous entities