Simon Price - IT Services R&D / ILRT
Peter Flach - Intelligent Systems Laboratory
Mining and Mapping the Research Landscape
ExaMiner
• Aim: Improve visibility of University of Bristol's research within the research community, the media, prospective students, and government.
• ExaMiner prototype:
• Builds on SubSift (a JISC Rapid Innovations project)
• Funded by and reporting to University Research Committee (URC)
• Steered by Exabyte Group in Dept. of Computer Science
ExaMiner demonstrations
SubSift
SubSift is a prototype application to support academic peer review. SubSift matches submitted conference/journal papers to potential peer reviewers based on similarity to published works.
Website: http://subsift.ilrt.bris.ac.uk
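The matching idea described above can be sketched in a few lines. This is only an illustration under stated assumptions: the slide does not say which similarity measure SubSift uses, so the sketch assumes bag-of-words tf-idf vectors compared by cosine similarity. The names `tokenize`, `tfidf_vectors`, `cosine` and `rank_reviewers`, and the sample texts, are hypothetical and not part of SubSift.

```python
import math
import re
from collections import Counter

# Hypothetical key under which the submitted paper is stored.
PAPER_KEY = "__submitted_paper__"


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def tfidf_vectors(docs):
    """Map each document name to a {term: tf-idf weight} dictionary."""
    counts = {name: Counter(tokenize(text)) for name, text in docs.items()}
    n_docs = len(counts)
    # Document frequency: number of documents in which each term occurs.
    df = Counter(term for c in counts.values() for term in c)
    vectors = {}
    for name, c in counts.items():
        total = sum(c.values()) or 1
        vectors[name] = {
            # Smoothed idf keeps every weight strictly positive.
            term: (freq / total) * (math.log((1 + n_docs) / (1 + df[term])) + 1.0)
            for term, freq in c.items()
        }
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse {term: weight} vectors."""
    dot = sum(w * b[t] for t, w in a.items() if t in b)
    norm = math.sqrt(sum(w * w for w in a.values())) * math.sqrt(
        sum(w * w for w in b.values())
    )
    return dot / norm if norm else 0.0


def rank_reviewers(paper_text, reviewer_texts):
    """Return (reviewer, score) pairs, most similar reviewer first.

    reviewer_texts maps a reviewer name to the text of their published works.
    """
    vectors = tfidf_vectors({**reviewer_texts, PAPER_KEY: paper_text})
    paper = vectors.pop(PAPER_KEY)
    scores = [(name, cosine(paper, vec)) for name, vec in vectors.items()]
    return sorted(scores, key=lambda pair: (-pair[1], pair[0]))


if __name__ == "__main__":
    reviewers = {
        "alice": "machine learning classification kernel methods",
        "bob": "medieval history manuscripts archives",
    }
    for name, score in rank_reviewers("kernel methods for classification", reviewers):
        print(f"{name}: {score:.3f}")
```

Reviewers whose published text shares no terms with the submitted paper score zero, so they fall to the bottom of the ranking.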
SubSift Tools
SubSift System Architecture
[Architecture diagram. Components shown: SubSift REST API, Web, Filestore, SubSift Harvester, XSLT, Client. Formats shown: XML, CSV, Terms, JSON, YAML, RDF.]
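The format labels in the architecture diagram suggest that one result set can be delivered in several representations. A minimal sketch of that idea follows, using only the Python standard library and covering three of the listed formats. The `render` function, the element names `matches` and `match`, and the sample row are assumptions for illustration; they are not SubSift's actual output schema.

```python
import csv
import io
import json
import xml.etree.ElementTree as ET


def render(rows, fmt):
    """Serialize a list of flat dictionaries as JSON, CSV or XML text."""
    if fmt == "json":
        return json.dumps(rows)
    if fmt == "csv":
        buf = io.StringIO()
        fieldnames = list(rows[0]) if rows else []
        writer = csv.DictWriter(buf, fieldnames=fieldnames, lineterminator="\n")
        writer.writeheader()
        writer.writerows(rows)
        return buf.getvalue()
    if fmt == "xml":
        # Hypothetical element names; one <match> element per row.
        root = ET.Element("matches")
        for row in rows:
            item = ET.SubElement(root, "match")
            for key, value in row.items():
                ET.SubElement(item, key).text = str(value)
        return ET.tostring(root, encoding="unicode")
    raise ValueError(f"unsupported format: {fmt}")


if __name__ == "__main__":
    rows = [{"reviewer": "alice", "score": 0.5}]
    for fmt in ("json", "csv", "xml"):
        print(render(rows, fmt))
```

Keeping the data as plain rows and choosing the representation at the last step means a new format only needs one more branch.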
SubSift – canonical workflow
SubSift REST API
Profiles
Matches
SubSift has been used for...
ExaMiner
ExaMiner demos
Demo 1 - Find a researcher (workflow)
[Workflow diagram. Workflow Inputs (A), Workflow Inputs (B) and Workflow Outputs, with data labels: Staff URIs, Abstract, URIs, Text, Profiles, Similarity, JSON, Null. Services shown: Bookmarks Service, Harvester Robot, Document Service, Profile Service, Match Service.]
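One plausible reading of the workflow labels above is a chain in which staff URIs are harvested into text, text is turned into profiles, and the profile of an abstract is matched against the staff profiles to give a JSON result. The sketch below follows that reading only as an assumption; the wiring between services is not recoverable from the slide text. Every function here is a local stand-in rather than a SubSift web service call, and `fetch` is a hypothetical callable that returns the page text for a URI.

```python
import json
import math
import re
from collections import Counter


def harvest(uris, fetch):
    """Harvester stand-in: turn each URI into a text document via `fetch`."""
    return {uri: fetch(uri) for uri in uris}


def profile(documents):
    """Profile stand-in: raw term counts for each document."""
    return {
        doc_id: Counter(re.findall(r"[a-z]+", text.lower()))
        for doc_id, text in documents.items()
    }


def cosine(a, b):
    """Cosine similarity between two term-count profiles."""
    dot = sum(w * b[t] for t, w in a.items() if t in b)
    norm = math.sqrt(sum(w * w for w in a.values())) * math.sqrt(
        sum(w * w for w in b.values())
    )
    return dot / norm if norm else 0.0


def find_researcher(staff_uris, abstract, fetch):
    """Return a JSON list of staff URIs ranked by similarity to the abstract."""
    staff_profiles = profile(harvest(staff_uris, fetch))
    abstract_profile = profile({"abstract": abstract})["abstract"]
    matches = [
        {"uri": uri, "similarity": round(cosine(abstract_profile, p), 3)}
        for uri, p in staff_profiles.items()
    ]
    matches.sort(key=lambda m: m["similarity"], reverse=True)
    return json.dumps(matches)


if __name__ == "__main__":
    # Made-up pages keep the example offline; a real harvester would fetch them.
    pages = {
        "http://example.org/a": "robot vision and robot navigation",
        "http://example.org/b": "poetry of the romantic period",
    }
    print(find_researcher(list(pages), "vision for robot navigation", pages.get))
```

Passing `fetch` in as a parameter keeps the pipeline testable without network access.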
Demo 2 - Find similar research
Demo 3 - Find a researcher (workflow)
[Workflow diagram. Workflow Inputs and Workflow Outputs, with data labels: Staff URIs, URIs, Text, Profiles, XML, DOT, Graph. Services shown: Bookmarks Service, Harvester Robot, Document Service, Profile Service, Match Service, DOT Generator.]
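The DOT Generator step in this workflow presumably turns match results into Graphviz DOT text that can be drawn as a graph. A minimal sketch of such a step follows, assuming the input is a dictionary of pairwise similarity scores and that only pairs above a threshold become edges. The function name `to_dot`, the threshold value and the graph name are illustrative assumptions, not taken from the slides.

```python
def quote(name):
    """Quote a node name for DOT, escaping embedded double quotes."""
    return '"' + name.replace('"', '\\"') + '"'


def to_dot(similarities, threshold=0.1):
    """Render {(a, b): score} as an undirected Graphviz DOT graph.

    Every name becomes a node; a pair becomes an edge only when its
    score is at least `threshold`.
    """
    lines = ["graph research {"]
    nodes = sorted({name for pair in similarities for name in pair})
    for name in nodes:
        lines.append(f"  {quote(name)};")
    for (a, b), score in sorted(similarities.items()):
        if score >= threshold:
            lines.append(f'  {quote(a)} -- {quote(b)} [label="{score:.2f}"];')
    lines.append("}")
    return "\n".join(lines)


if __name__ == "__main__":
    scores = {("A", "B"): 0.5, ("A", "C"): 0.05}
    print(to_dot(scores))
```

The resulting text can be saved to a file and passed to a Graphviz layout tool such as `dot` or `neato` to produce an image.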
Demo 4 - Profile email recipients
And finally…
Summary
• ExaMiner prototype:
• SubSift – provides back-end processing via REST web services
• ExaMiner demos – allowed us to explore requirements for our aim
• Informing (ongoing) redesign of corporate researcher homepages
• Future work:
• Move away from fixed (text-centric) model
• Scale up to work at whole institution scale
• ... and between institutions

