NISO – NFAIS Webinar 
www.accessinn.com 
www.dataharmony.com 
505-998-0800 
Marjorie M.K. Hlava 
President and Chief Scientist 
Access Innovations, Inc. 
Linked Data: 
Making it a Reality
Outline of the talk 
• Linked data potential
• Leveraging the Thesaurus / Taxonomy / Ontology
• Automating the linking
• Workflow possibilities
• Linked data principles
• A few cautions
Linked Data: Many definitions 
• Mash Ups
• Live linking from multiple sources
• Linking out to external datasets
• Linking persistent URIs to datasets
• Linked Data Repositories
• Defining relationships in RDF triples
• Taxonomies, thesauri, ontologies
• Triple stores
• SKOS or OWL format
Authors at a place 
MASHUP locations to a 
GPS grid of an area 
Two data points 
GPS Coordinates 
Taxonomy description of the place
Live linking from multiple sources 
Copyright © 2013 Access Innovations, Inc.
Watch Crime in Action
Time, Place, Type of Activity
Consider more personnel 
at these locations 
Two data points 
GPS Coordinates 
Taxonomy description of the crime
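A mashup of this kind can be sketched in a few lines of Python. This is only an illustration, not the tool shown in the talk: the sample records, the grid size, and the function names `grid_cell` and `mash_up` are all assumptions. Each record carries the two data points named above, GPS coordinates and a taxonomy term, and the sketch counts records per grid cell and term.

```python
from collections import Counter
from math import floor

# Hypothetical records: (latitude, longitude, taxonomy term). Values are made up.
incidents = [
    (35.0844, -106.6504, "Burglary"),
    (35.0849, -106.6509, "Burglary"),
    (35.1107, -106.6105, "Vandalism"),
]

def grid_cell(lat, lon, size=0.01):
    """Map a coordinate to the integer index of the grid cell containing it."""
    return (floor(lat / size), floor(lon / size))

def mash_up(records, size=0.01):
    """Count records per (grid cell, taxonomy term) pair."""
    return Counter((grid_cell(lat, lon, size), term) for lat, lon, term in records)

counts = mash_up(incidents)
# Cells where the same kind of activity shows up at least twice.
hot_spots = [key for key, n in counts.items() if n >= 2]
```

Cells that end up in `hot_spots` are the ones a planner might review when considering more personnel at a location.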
Points to Linked Data 
• Point to relevant resources via URLs
• Leverage the thesaurus for rich ontology
• Link to other data repositories
• Databases
• People nets
• Resource files
• DBpedia
More Like This - Recommender 
Cancer Epidemiology Biomarkers & Prevention 
Vol. 12, 161-164, 
February 2003 
© 2003 American Association for Cancer Research 
Short Communications 
Related Press Releases 
•How What and How Much We Eat (And Drink) Affects Our 
Risk of Cancer 
•Novel COX-2 Combination Treatment May Reduce Colon 
Cancer Risk Combination Regimen of COX-2 Inhibitor and 
Fish Oil Causes Cell Death 
•COX-2 Levels Are Elevated in Smokers 
Alcohol, Folate, Methionine, and Risk of Incident Breast Cancer in the 
American Cancer Society Cancer Prevention Study II Nutrition Cohort 
Heather Spencer Feigelson, Carolyn R. Jonas, Andreas S. Robertson, 
Marjorie L. McCullough, Michael J. Thun and Eugenia E. Calle Department 
of Epidemiology and Surveillance Research, American Cancer Society, 
National Home Office, Atlanta, Georgia 30329-4251 
Related AACR Workshops and Conferences 
•Frontiers in Cancer Prevention Research 
•Continuing Medical Education (CME) 
•Molecular Targets and Cancer Therapeutics 
Related Meeting Abstracts 
•Association between dietary folate intake, alcohol intake, and 
methylenetetrahydrofolate reductase C677T and A1298C 
polymorphisms and subsequent breast 
•Folate, folate cofactor, and alcohol intakes and risk for 
colorectal adenoma 
•Dietary folate intake and risk of prostate cancer in a large 
prospective cohort study 
Recent studies suggest that the increased risk of breast cancer associated 
with alcohol consumption may be reduced by adequate folate intake. We 
examined this question among 66,561 postmenopausal women in the 
American Cancer Society Cancer Prevention Study II Nutrition Cohort. 
Related Working Groups 
•Finance 
•Charter 
•Molecular Epidemiology 
Related Education Book Content 
Oral Contraceptives, Postmenopausal Hormones, 
and Breast Cancer 
Physical Activity and Cancer 
Hormonal Interventions: From Adjuvant Therapy to 
Breast Cancer Prevention 
Think Tank Report 
Related Think Tank Report 
Content 
Webcasts 
Related Webcasts 
Related Awards 
•AACR-GlaxoSmithKline Clinical Cancer Research Scholar Awards 
•ACS Award 
•Weinstein Distinguished Lecture
Link to Many Resources
• Journal Article on Topic A
• Other Journal Articles on Topic A
• Upcoming Conference on Topic A
• Podcast Interview with Researcher Working on Topic A
• Grant Available for Researchers Working on Topic A
• CME Activity on Topic A
• Job Posting for Expert on Topic A
Selected Article Search “thin film 
sputtering” 
More Articles on the same topic 
Grants available 
Upcoming conferences on this topic 
Authors working in this space
Optics 
• Definition of the concept
• Links to concept pages in other sources (OSA, SPIE, IOP, AIP, etc.)
• Link to Journals that publish on the subject
• People and companies in the space
• Optics DBpedia http://dbpedia.org/page/Optics
• Etc.
Linking Optics
NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider World:  Successful Applications of Linked Data
Linking Workflow 
• Link content to external databank
• Make Potential URI matches
• QC for the thesaurus domain
• Matched URIs enrich the content
Linking Workflow (flowchart)
Nodes: Taxonomy Term; SILK Query; DBpedia; Potential Match; QC: Match? (YES / NO); Returns URI; Add URI to Thesaurus; SPARQL Definition Query; Add Definition to Thesaurus; Retry?; Add to Statistics Report
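The SPARQL step of the workflow might look something like the sketch below. It is a guess at the shape of such a query, not the query used in the talk: the helper names are hypothetical, and the choice of `rdfs:label` for matching and `dbo:abstract` as the "definition" is an assumption. The code only builds the query and the request URL; it does not contact the endpoint.

```python
from urllib.parse import urlencode

# Public DBpedia endpoint; availability and the "format" parameter are assumptions.
DBPEDIA_SPARQL = "https://dbpedia.org/sparql"

def build_definition_query(term, lang="en", limit=5):
    """Return a SPARQL query for resources whose label equals `term`."""
    # Escape backslashes and quotes so the term is a valid SPARQL string literal.
    safe = term.replace("\\", "\\\\").replace('"', '\\"')
    return (
        "PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>\n"
        "PREFIX dbo: <http://dbpedia.org/ontology/>\n"
        "SELECT ?uri ?abstract WHERE {\n"
        f'  ?uri rdfs:label "{safe}"@{lang} .\n'
        "  OPTIONAL { ?uri dbo:abstract ?abstract .\n"
        f'             FILTER (lang(?abstract) = "{lang}") }}\n'
        f"}} LIMIT {limit}\n"
    )

def build_request_url(term):
    """Return a GET URL that would submit the query and ask for JSON results."""
    params = {
        "query": build_definition_query(term),
        "format": "application/sparql-results+json",
    }
    return DBPEDIA_SPARQL + "?" + urlencode(params)

query = build_definition_query("Optics")
url = build_request_url("Optics")
```

Any URI that comes back is still only a potential match; the QC step decides whether it is added to the thesaurus.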
Phrasing of Concepts will Vary 
• Exact concept match
• Add the URI to a field in the thesaurus.
• Different phrasing
• “Research funding” vs. “Funding of science”
• SILK http://personal.sirma.bg/vladimir/misc/silk-book.pdf
• False matches
• “Ecosystem engineering” vs. “Ecosystem engineer”
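The three cases above can be illustrated with a small string-comparison sketch. It uses Python's standard `difflib`, not SILK, and the `classify` function, its labels, and the 0.85 threshold are assumptions chosen for illustration. The point is that spelling similarity alone cannot settle a match: a near-identical string may be a different concept, and a differently phrased label may be the same one.

```python
from difflib import SequenceMatcher

def normalize(label):
    """Lower-case and collapse whitespace so trivial differences do not matter."""
    return " ".join(label.lower().split())

def classify(term, candidate, threshold=0.85):
    """Sort a candidate label into 'exact', 'needs QC', or 'no string match'."""
    a, b = normalize(term), normalize(candidate)
    if a == b:
        return "exact"
    if SequenceMatcher(None, a, b).ratio() >= threshold:
        # Close spelling, but possibly a different concept: send to a reviewer.
        return "needs QC"
    # Spelling differs; the labels may still be synonyms recorded in the thesaurus.
    return "no string match"

results = {
    "optics": classify("Optics", " optics "),
    "ecosystem": classify("Ecosystem engineering", "Ecosystem engineer"),
    "funding": classify("Research funding", "Funding of science"),
}
```

Here the "Ecosystem" pair lands in the QC queue and the "funding" pair is missed by string comparison, which is why synonyms in the thesaurus and a human check both matter.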
Automating the Linking 
• Not every concept will have a match
• Or a resource page
• Semantic functionality –
• Lots of synonyms will help
• Proximity and other rules
• Create new resources or landing pages
Linking Out to External 
Datasets 
• Link Thesaurus Preferred Terms
• Resource describing the thesaurus concept
• In SKOS parlance, this is “the same as”
• Identify DBpedia pages for each term
• Identify other sources
• Backfill knowledge gaps
• Concept exists
• No content pages yet available
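One way to record such a link is a SKOS mapping statement. The sketch below writes a concept and its external matches as Turtle text; it assumes `skos:exactMatch` is the intended mapping property, uses a made-up `example.org` thesaurus URI, and does not escape quotes inside labels. The function name `concept_to_turtle` is hypothetical.

```python
SKOS_PREFIX = "@prefix skos: <http://www.w3.org/2004/02/skos/core#> .\n\n"

def concept_to_turtle(concept_uri, pref_label, exact_matches):
    """Serialize one thesaurus concept and its external matches as Turtle text."""
    lines = [
        f"<{concept_uri}> a skos:Concept ;",
        f'    skos:prefLabel "{pref_label}"@en',
    ]
    for uri in exact_matches:
        lines[-1] += " ;"                      # more statements follow
        lines.append(f"    skos:exactMatch <{uri}>")
    lines[-1] += " ."                          # close the statement block
    return SKOS_PREFIX + "\n".join(lines) + "\n"

turtle = concept_to_turtle(
    "http://example.org/thesaurus/optics",     # hypothetical thesaurus URI
    "Optics",
    ["http://dbpedia.org/resource/Optics"],
)
```

A concept with no match yet is written with an empty list, which leaves the gap visible for later backfilling.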
Linked (Open) Data
Every circle a link to other data … or ads
The Glue 
• To connect – a communication point
• APIs
• Application Programming Interface
• JDBC, ODBC
• Web Calls – Web Services
• Data transfer formats
• RDF Serialization formats
RDF serialization formats 
• Turtle: a compact, human-friendly format.
• N-Triples: a very simple, easy-to-parse, line-based format that is not as compact as Turtle.
• N-Quads: a superset of N-Triples, for serializing multiple RDF graphs.
• JSON-LD: a JSON-based serialization.
• N3 or Notation 3: a non-standard serialization that is very similar to Turtle, but has some additional features, such as the ability to define inference rules.
• RDF/XML: an XML-based syntax that was the first standard format for serializing RDF.
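To make the differences concrete, the sketch below writes one statement in three of these formats using plain string handling. It is hand-rolled for illustration and is not a general serializer: the thesaurus URI is made up, and `as_turtle` assumes the predicate lives in the SKOS namespace.

```python
import json

S = "http://example.org/thesaurus/optics"            # hypothetical thesaurus concept
P = "http://www.w3.org/2004/02/skos/core#exactMatch"
O = "http://dbpedia.org/resource/Optics"

def as_ntriples(s, p, o):
    """One full statement per line, every URI written out."""
    return f"<{s}> <{p}> <{o}> ."

def as_turtle(s, p, o):
    """Same statement with a prefix; assumes `p` is in the SKOS namespace."""
    namespace, local = p.rsplit("#", 1)
    return f"@prefix skos: <{namespace}#> .\n<{s}> skos:{local} <{o}> ."

def as_jsonld(s, p, o):
    """A minimal JSON-LD node object with the full predicate URI as its key."""
    return json.dumps({"@id": s, p: {"@id": o}}, indent=2)

ntriples, turtle_text, jsonld = as_ntriples(S, P, O), as_turtle(S, P, O), as_jsonld(S, P, O)
```

All three strings carry the same statement; only the notation differs.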

But What about Triples? 
• SKOS
• Simple Knowledge Organization System
• Triples
• RDF Statements
• Resource Description Framework
• Subject – Predicate – Object
• OWL
• Web Ontology Language
• Formats
Recursive triple challenges 
• The Edition is in London
• The Edition is a hotel
• The book has a second edition
• Therefore = The book is a hotel
• Margie is a member of NFAIS
• NFAIS is in Baltimore
• Therefore = Margie is in Baltimore
• Need clear disambiguation = thesaurus
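The first example can be reproduced with a toy sketch. When triples are keyed on bare words, the word "edition" joins statements about a hotel to statements about a book; when each concept has its own identifier, the chain never forms. The `ex:` identifiers and the `chains` helper are made up for illustration.

```python
def chains(triples):
    """Return pairs of triples where the object of one is the subject of another."""
    return [
        (first, second)
        for first in triples
        for second in triples
        if first[2] == second[0]
    ]

# Keyed on bare words: "edition" means both the hotel and the book edition.
word_triples = [
    ("edition", "is in", "London"),
    ("edition", "is a", "hotel"),
    ("book", "has", "edition"),
]

# Keyed on disambiguated identifiers (hypothetical ex: names).
uri_triples = [
    ("ex:TheEditionHotel", "is in", "ex:London"),
    ("ex:TheEditionHotel", "is a", "ex:Hotel"),
    ("ex:Book", "has", "ex:SecondEdition"),
]

ambiguous_links = chains(word_triples)   # book -> edition -> London / hotel
clean_links = chains(uri_triples)        # no accidental chain
```

A thesaurus does this disambiguation work by giving each sense of a word its own concept and identifier.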
Metrics – Measuring 
Accuracy 
• The level of accuracy with which we matched concepts;
• How many match correctly?
• How many match incorrectly?
• The number of concepts with no match
• Number of autolink populated pages
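These counts could be tallied along the following lines. The outcome labels, the `linking_metrics` function, and the sample data are assumptions for illustration; "precision" here means correct matches divided by all matches made, and "coverage" means the share of concepts that received any match.

```python
from collections import Counter

def linking_metrics(outcomes):
    """Summarize QC outcomes, each one of 'correct', 'incorrect', or 'no match'."""
    counts = Counter(outcomes)
    matched = counts["correct"] + counts["incorrect"]
    total = matched + counts["no match"]
    return {
        "correct": counts["correct"],
        "incorrect": counts["incorrect"],
        "no_match": counts["no match"],
        # Share of proposed matches that survived QC.
        "precision": counts["correct"] / matched if matched else 0.0,
        # Share of concepts for which any match was proposed.
        "coverage": matched / total if total else 0.0,
    }

# Hypothetical QC results for ten thesaurus concepts.
sample = ["correct"] * 6 + ["incorrect"] * 2 + ["no match"] * 2
report = linking_metrics(sample)
```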
5 Star Merits
Two Linked Data Camps 
• Linked data
• Linked OPEN data
• Free or security gate
• Linking within a collection
• Linking with permission
• Linking freely on the web
Linked Data is about 
• Using the Web to connect related data that wasn't previously linked,
• Using the Web to lower the barriers to linking data currently linked using other methods.
• A recommended best practice for exposing, sharing, and connecting pieces of data, information, and knowledge
• Using URIs and RDF to create a semantic web
Linked Data Principles 
• Use URIs as names for things
• Use HTTP URIs so that people can look up those names.
• When someone looks up a URI, provide useful information, using the standards (RDF*, SPARQL)
• Include links to other URIs, so that they can discover more things.
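Looking up a URI in practice usually means an HTTP request that asks for an RDF representation. The sketch below prepares such a request with Python's standard library but does not send it; the helper name `rdf_lookup_request` is hypothetical, and whether a given server honors the `Accept` header is an assumption.

```python
from urllib.request import Request

def rdf_lookup_request(uri, media_type="text/turtle"):
    """Prepare a GET request that asks the server for an RDF representation."""
    return Request(uri, headers={"Accept": media_type})

req = rdf_lookup_request("http://dbpedia.org/resource/Optics")
# To fetch for real (network needed, not run here):
#   from urllib.request import urlopen
#   body = urlopen(req).read()
```

The response, if the server supports it, would contain statements about the thing named by the URI, including links to other URIs to follow.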
The Linked Data Community 
• W3C standards and working groups
• RDF
• Linked Open Data Repositories
• Dublin Core – DCMI
More Buzzwords 
• FOAF
• Subject – Predicate – Object
• Graph view – two ends of a link
• Dereference
• Dog food
• SPARQL
• … it's easy to quickly get into the weeds
Linking Open Data Cloud
Linked Data Cautions 
• Never change your URIs
• Changing them will break the links, unless you maintain a map…
• Need persistent identifiers
• ..SQL indicates a relational database
• JAVA & Object Oriented Databases not broadly supported yet.
• Ensure that your triples are not recursive loops
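If a URI does have to change, the map mentioned above might be as simple as a lookup table from old to current identifiers. The sketch below follows such a table until it reaches a URI with no further redirect, and refuses to follow a loop. The `resolve` function and the `example.org` URIs are hypothetical.

```python
def resolve(uri, redirects, max_hops=10):
    """Follow old-to-new URI redirects until a URI with no redirect is reached."""
    seen = {uri}
    for _ in range(max_hops):
        if uri not in redirects:
            return uri                      # current, persistent identifier
        uri = redirects[uri]
        if uri in seen:
            raise ValueError(f"redirect loop at {uri}")
        seen.add(uri)
    raise ValueError("too many redirects")

# Hypothetical history of one concept's identifier.
redirects = {
    "http://example.org/term/12": "http://example.org/concept/optics",
    "http://example.org/concept/optics": "http://example.org/id/optics",
}
current = resolve("http://example.org/term/12", redirects)
```

Keeping the table small is itself an argument for choosing persistent identifiers from the start.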
It’s What We Do With the Data 
• The formats will continue to vary
• Words will continue to be a challenge
• It's what we do with the data that is important.
• The delivery
• The concepts
• Allowing the user to find the thread and follow it instead of giving them yet another resource to go to.
We covered… 
• Linked data potential
• Leveraging the Thesaurus / Taxonomy / Ontology
• Automating the linking
• Linked data principles
• A few cautions
• Now…
It Just Takes 
a Little 
Imagination 
Thank you 
Marjorie M.K. Hlava, President 
Access Innovations 
505-998-0800 
mhlava@accessinn.com
What we do 
• Access Innovations
• Ensure clean, well-formed content
• Create Knowledge Organization Systems (KOS)
• Data Harmony Tools
• To automatically index content
• To manage KOS and more
• To semantically enrich the content
• To organize the content
• Access Integrity
• Automated Medical Coding Support
About Access Innovations 
Access Innovations are experts in content creation, enrichment, and 
conversion services. We provide services to semantically enrich and tag raw 
text into highly structured data. We deliver clean, well-formed, metadata-enriched 
content so our clients can reuse, repurpose, store, and find their 
knowledge assets. We go beyond the standards to build taxonomies and 
other data control structures as a solid foundation for your information. 
Our services and software allow organizations to use and present their 
information to both internal and external constituents by leveraging search, 
presentation, e-commerce and linking. We change search to found! 
Quick Facts 
• Founded in 1978 
• Headquartered in Albuquerque, NM 
• Privately held 
• Delivered more than 2000 engagements
Data, Information, Knowledge 
Abstraction Interpretation 
Data Information Knowledge 
Data = height of Mt. Everest
Information = a book on Mt. Everest geological characteristics
Knowledge = a report containing practical information on the best way to reach Mt. Everest's peak



Editor's Notes

  • #12: Thanks to Helen Atkins of AACR for this illustration. The real power of this is that the links can all go in all directions, so we take advantage of having the user’s attention regardless of how they step into our “web”