New York City and Baltimore Semantic Web Meetups 20130221/20120226
Discount code:
   13ldev
 at manning.com
Our Rapidly Changing
      Internet


                               35 hours of video
                              uploaded per minute


51% of Internet traffic
   is non-human
                         >2.3 billon Internet users,
                         >1 billion in Asia
New York City and Baltimore Semantic Web Meetups 20130221/20120226
New York City and Baltimore Semantic Web Meetups 20130221/20120226
08 Oct 2007   07 Nov 2007   10 Nov 2007   28 Feb 2008   31 Mar 2008




18 Sep 2008   05 Mar 2009   27 Mar 2009   14 Jul 2009   22 Sep 2010
New York City and Baltimore Semantic Web Meetups 20130221/20120226
New York City and Baltimore Semantic Web Meetups 20130221/20120226
New York City and Baltimore Semantic Web Meetups 20130221/20120226
New York City and Baltimore Semantic Web Meetups 20130221/20120226
David Wood has co-founded several Open Source Software
projects related to the Semantic Web, including Persistent
URLs, Mulgara and the Callimachus Project. He is co-chair of
the W3C’s RDF Working Group.



Marsha Zaidman is Associate Professor Emerita of
Computer Science at the University of Mary Washington.




Luke Ruth is a Linked Data developer supporting the
Callimachus Project (http://callimachusproject.org).




Michael Hausenblas is Chief Data Engineer at MapR.
He formerly led the Linked Data Research Centre in
Galway, Ireland.
New York City and Baltimore Semantic Web Meetups 20130221/20120226
http://www.manning.com/dwood/
Linked Data
 about the
   book!
The first chapter is free
Manning Early Access Program
          (MEAP)

 • Concept:
  • Give away the first chapter
  • Sell a low-resolution PDF to early readers
  • Readers get PDF updates and a print copy
     when it becomes available
Author Forum
http://www.manning-sandbox.com/forum.jspa?forumID=838
Success Criteria
Success Criteria
• Sell 100 copies in MEAP in one month
Success Criteria
• Sell 100 copies in MEAP in one month
• 598 copies in one month
Success Criteria
• Sell 100 copies in MEAP in one month
• 598 copies in one month
• Lots of interest in Linked Data!
What’s Inside
• What Linked Data is
• Find Linked Data you can reuse
• Use Linked Data in your applications
• Create your own Linked Data
• Build Linked Data applications using
  standard Web techniques
Coding Examples

             Callimachus
1. Linked Data to the rescue                        Available
2. RDF - the data model for Linked Data             Available
3. Consuming Linked Data                            Available
4. Creating Linked Data                             Available
5. Querying Linked Data                             Available
6. Enhancing results from search engines            Available
7. Collecting Linked Data                           Available
8. Datasets                                         Completed
9. Callimachus - a Linked Data management system     In draft
10. Building a read-write Linked Data application    In draft
Callimachus
http://callimachusproject.org
http://3roundstones.com
Partners




     Callimachus
                           Sesame


                      Sesame
(in progress)
New York City and Baltimore Semantic Web Meetups 20130221/20120226
New York City and Baltimore Semantic Web Meetups 20130221/20120226
From EPA
From Wikipedia




Open Street Map
Subject


Object
(Predicate is defined in a template)
RDF	
  “Describe”	
  View




          Subject
Predicate Object
New York City and Baltimore Semantic Web Meetups 20130221/20120226
New York City and Baltimore Semantic Web Meetups 20130221/20120226
New York City and Baltimore Semantic Web Meetups 20130221/20120226
Active PURLs for Clinical Study Aggregation
                                                                     David Wood1 and Tom Plasterer2
                                                         1   david@3roundstones.com, 2Tom.Plasterer@astrazeneca.com

The problem: No coordinated view of clinical study information. Information is distributed across departments, subsidiaries and government data sources.

The solution: Gather, convert, aggregate and format for display
      3 Round Stones and AstraZeneca created a system to allow coordinated views of distributed clinical trial information. The system extended the Callimachus
   Project, an Open Source management system for Linked Data.
      Persistent URLs, or PURLs, were used to provide globally unique and resolvable identifiers for each clinical study. The PURL concept was extended to enable
   PURLs to have multiple targets and for the results of each target to undergo arbitrary transformation. PURLs which have such capabilities are called Active PURLs.
      Information sources relevant to clinical studies were identified, regardless of whether their location was internal or external to the pharmaceutical company's
   network. Active PURLs were used to resolve data sources having HTTP endpoints capable of returning XML or textual results. Each information source is
   dynamically transformed into Resource Description Framework (RDF) formats and all sources' results then merged into a single, temporary graph of RDF data.
   Information is rendered to end users as coordinated HTML descriptions regarding each clinical trial using the Callimachus template engine. Machine-readable
   versions of the data are also available.
How semantic technologies help
      Linked Data techniques can help to address both the availability of clinical trial information and provide a means to build effective information systems using it.
   Linked Data techniques allow for "cooperation without coordination". Publishers of data provide context for use by third parties in other portions of a distributed
   enterprise. Users of Linked Data can combine information from multiple sources. Subsequent publication can create a virtuous circle of positive feedback, allowing
   researchers, informaticists and support staff to collaboratively and distributively build a reusable knowledge base.

User experience                                                                                                                         Challenges
                                                   HTTP-accessible endpoints capable of returning XML or textual content                     Distributed queries have many known
   1       Users resolve a URL that                                                                                                      limitations, such as the introduction of
       provides a unique identifier for                                                                                                   multiple single points of failure in any
       a clinical study, drug, chemical                                                                                                  given PURL resolution. HTTP timeouts,
       or other concept managed by                                                                                                       auth/auth errors or other network failures
       this system. The user may                                                                                                         can slow or stop a pipeline from returning
       be presented with the URL on                                                                                                      correctly.
       HTML pages, search it via full-                                                                                                       Similarly, distributed queries can result
       text techniques or discover it                                                                                                    in variant query-time performance due to
       via semantic search.                                                                                                              complex network and endpoint perform-
                                               Multiple targets queried
                                               independently                                                                             ance variances.
                                                                                                    Convert XML or textual results to
   2       Users are presented with a                                                               RDF                                      Proactive caching and cache manage-
       dynamically generated Web                                                                                                         meant strategies can improve runtime
       page representing aggregated        1                                                                                             performance and protect end users from
       clinical study information. Users       User resolves a
                                               single URI to an                                      Render RDF to HTML via template
                                                                                                                                         the limitations inherent in a distributed
       are isolated from the complex           Active PURL                                                                               query architecture. Caching of
       and distributed information                                                                                                       intermediate results from endpoints has
       environment.                                                                                                                      not yet been implemented.
References                                                                                                                              Next steps
✔
                                          DocBook 5

                                          ✔
                                          XHTML 5

                                            ✔
                                            ePub 3




Credit: Bradley P. Allen, Elsevier Labs
http://www.manning.com/dwood/
Discount code:
   13ldev
 at manning.com

More Related Content

PDF
MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
PDF
ANALYSIS STUDY ON CACHING AND REPLICA PLACEMENT ALGORITHM FOR CONTENT DISTRIB...
PDF
Welcome to International Journal of Engineering Research and Development (IJERD)
PDF
Effective Approach For Content Based Image Retrieval In Peer-Peer To Networks
PDF
Mca & diplamo java titles
PDF
2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )
PDF
Analyse the performance of mobile peer to Peer network using ant colony optim...
PDF
Toward Personalized Peer-to-Peer Top-k Processing
MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
ANALYSIS STUDY ON CACHING AND REPLICA PLACEMENT ALGORITHM FOR CONTENT DISTRIB...
Welcome to International Journal of Engineering Research and Development (IJERD)
Effective Approach For Content Based Image Retrieval In Peer-Peer To Networks
Mca & diplamo java titles
2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )
Analyse the performance of mobile peer to Peer network using ant colony optim...
Toward Personalized Peer-to-Peer Top-k Processing

What's hot (10)

PDF
Caching on Named Data Network: a Survey and Future Research
PDF
ANALYSE THE PERFORMANCE OF MOBILE PEER TO PEER NETWORK USING ANT COLONY OPTIM...
PDF
Privacy Preserved Distributed Data Sharing with Load Balancing Scheme
PDF
Peer to peer cache resolution mechanism for mobile ad hoc networks
PDF
Modified query roles based access
PDF
Group Finder
PDF
RESEARCH ON DISTRIBUTED SOFTWARE TESTING PLATFORM BASED ON CLOUD RESOURCE
PDF
STUDY OF VARIOUS FACTORS AFFECTING PERFORMANCE OF MULTI-CORE PROCESSORS
PDF
Sqlmr
PDF
01 intro
Caching on Named Data Network: a Survey and Future Research
ANALYSE THE PERFORMANCE OF MOBILE PEER TO PEER NETWORK USING ANT COLONY OPTIM...
Privacy Preserved Distributed Data Sharing with Load Balancing Scheme
Peer to peer cache resolution mechanism for mobile ad hoc networks
Modified query roles based access
Group Finder
RESEARCH ON DISTRIBUTED SOFTWARE TESTING PLATFORM BASED ON CLOUD RESOURCE
STUDY OF VARIOUS FACTORS AFFECTING PERFORMANCE OF MULTI-CORE PROCESSORS
Sqlmr
01 intro
Ad

Similar to New York City and Baltimore Semantic Web Meetups 20130221/20120226 (20)

PDF
The MADlib Analytics Library
 
PDF
Linked Data: Opportunities for Entrepreneurs
PPTX
Clustercomputingpptl2 120204125126-phpapp01
PPTX
Cluster computing pptl (2)
PDF
Linked Data and Semantic Web Application Development by Peter Haase
PPT
Resource Description Framework Approach to Data Publication and Federation
PPT
Resource
PDF
Reactive Stream Processing for Data-centric Publish/Subscribe
PDF
Adaptive Semantic Data Management Techniques for Federations of Endpoints
PDF
DISTRIBUTED SYSTEM SOLVED PAPER 2024.pdf
PPT
Knowledge Discovery in an Agents Environment
PDF
The google file system
PDF
Gfs论文
PDF
NIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus Poster
PPT
Managing Multidimensional Historical
PPT
Climbing the beanstalk
PDF
Towards an Infrastructure for Enabling Systematic Development and Research of...
PPTX
Cluster computing
PDF
P24120125
PDF
A Comprehensive Technical Report on Retrieval.pdf
The MADlib Analytics Library
 
Linked Data: Opportunities for Entrepreneurs
Clustercomputingpptl2 120204125126-phpapp01
Cluster computing pptl (2)
Linked Data and Semantic Web Application Development by Peter Haase
Resource Description Framework Approach to Data Publication and Federation
Resource
Reactive Stream Processing for Data-centric Publish/Subscribe
Adaptive Semantic Data Management Techniques for Federations of Endpoints
DISTRIBUTED SYSTEM SOLVED PAPER 2024.pdf
Knowledge Discovery in an Agents Environment
The google file system
Gfs论文
NIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus Poster
Managing Multidimensional Historical
Climbing the beanstalk
Towards an Infrastructure for Enabling Systematic Development and Research of...
Cluster computing
P24120125
A Comprehensive Technical Report on Retrieval.pdf
Ad

More from 3 Round Stones (20)

PDF
Brief on Linked Data for U.S. EPA's Chief Data Scientist
PDF
US EPA Resource Conservation and Recovery Act published as Linked Open Data
PDF
Briefing on US EPA Open Data Strategy using a Linked Data Approach
PDF
W3C Data Shapes Working Group 2014
PDF
Open by Default
PDF
Lightning Talk SLIDES for Callimachus Enterprise by 3 Round Stones
PDF
Why Your Next Product Should be Semantic by Dr. David Wood
PDF
Celebrating 10 years of the Semantic Technology Conference 2014
PDF
Enterprise & Scientific Data Interoperability Using Linked Data at the Health...
PDF
Publising Data on the Web
PDF
Callimachus Enterprise 1.3 Tutorial
PDF
Improving Scientific Information Sharing by Fostering Reuse - Presentation at...
PDF
Linked Data Overview - structured data on the web for US EPA 20140203
PDF
Data Transparency 2013 - OrgPedia by 3 Round Stones
PDF
ORGpedia: The Open Organizational Data Project
PDF
Linked Data: The Jargon-free Primer on Integrating Data on the Web
PDF
Delivering on Standards for Publishing Government Linked Data
PDF
The Power of Linked Data for Government & Healthcare Information Integration
PDF
Sharing Data on the Web
PDF
Sharing data on the web (2013)
Brief on Linked Data for U.S. EPA's Chief Data Scientist
US EPA Resource Conservation and Recovery Act published as Linked Open Data
Briefing on US EPA Open Data Strategy using a Linked Data Approach
W3C Data Shapes Working Group 2014
Open by Default
Lightning Talk SLIDES for Callimachus Enterprise by 3 Round Stones
Why Your Next Product Should be Semantic by Dr. David Wood
Celebrating 10 years of the Semantic Technology Conference 2014
Enterprise & Scientific Data Interoperability Using Linked Data at the Health...
Publising Data on the Web
Callimachus Enterprise 1.3 Tutorial
Improving Scientific Information Sharing by Fostering Reuse - Presentation at...
Linked Data Overview - structured data on the web for US EPA 20140203
Data Transparency 2013 - OrgPedia by 3 Round Stones
ORGpedia: The Open Organizational Data Project
Linked Data: The Jargon-free Primer on Integrating Data on the Web
Delivering on Standards for Publishing Government Linked Data
The Power of Linked Data for Government & Healthcare Information Integration
Sharing Data on the Web
Sharing data on the web (2013)

Recently uploaded (20)

PDF
Diabetes Mellitus , types , clinical picture, investigation and managment
PPT
hsl powerpoint resource goyloveh feb 07.ppt
PPTX
Neurology of Systemic disease all systems
PDF
Fun with Grammar (Communicative Activities for the Azar Grammar Series)
PPT
hemostasis and its significance, physiology
PDF
The TKT Course. Modules 1, 2, 3.for self study
PPTX
4. Diagnosis and treatment planning in RPD.pptx
PDF
Health aspects of bilberry: A review on its general benefits
PDF
Review of Related Literature & Studies.pdf
PPTX
Theoretical for class.pptxgshdhddhdhdhgd
PPTX
Cite It Right: A Compact Illustration of APA 7th Edition.pptx
PPTX
Q2 Week 1.pptx Lesson on Kahalagahan ng Pamilya sa Edukasyon
PDF
Disorder of Endocrine system (1).pdfyyhyyyy
PPTX
PLASMA AND ITS CONSTITUENTS 123.pptx
PPTX
principlesofmanagementsem1slides-131211060335-phpapp01 (1).ppt
PPTX
UNIT_2-__LIPIDS[1].pptx.................
PPTX
Thinking Routines and Learning Engagements.pptx
PPTX
pharmaceutics-1unit-1-221214121936-550b56aa.pptx
PDF
Everyday Spelling and Grammar by Kathi Wyldeck
PDF
Lecture on Viruses: Structure, Classification, Replication, Effects on Cells,...
Diabetes Mellitus , types , clinical picture, investigation and managment
hsl powerpoint resource goyloveh feb 07.ppt
Neurology of Systemic disease all systems
Fun with Grammar (Communicative Activities for the Azar Grammar Series)
hemostasis and its significance, physiology
The TKT Course. Modules 1, 2, 3.for self study
4. Diagnosis and treatment planning in RPD.pptx
Health aspects of bilberry: A review on its general benefits
Review of Related Literature & Studies.pdf
Theoretical for class.pptxgshdhddhdhdhgd
Cite It Right: A Compact Illustration of APA 7th Edition.pptx
Q2 Week 1.pptx Lesson on Kahalagahan ng Pamilya sa Edukasyon
Disorder of Endocrine system (1).pdfyyhyyyy
PLASMA AND ITS CONSTITUENTS 123.pptx
principlesofmanagementsem1slides-131211060335-phpapp01 (1).ppt
UNIT_2-__LIPIDS[1].pptx.................
Thinking Routines and Learning Engagements.pptx
pharmaceutics-1unit-1-221214121936-550b56aa.pptx
Everyday Spelling and Grammar by Kathi Wyldeck
Lecture on Viruses: Structure, Classification, Replication, Effects on Cells,...

New York City and Baltimore Semantic Web Meetups 20130221/20120226

  • 2. Discount code: 13ldev at manning.com
  • 3. Our Rapidly Changing Internet 35 hours of video uploaded per minute 51% of Internet traffic is non-human >2.3 billon Internet users, >1 billion in Asia
  • 6. 08 Oct 2007 07 Nov 2007 10 Nov 2007 28 Feb 2008 31 Mar 2008 18 Sep 2008 05 Mar 2009 27 Mar 2009 14 Jul 2009 22 Sep 2010
  • 11. David Wood has co-founded several Open Source Software projects related to the Semantic Web, including Persistent URLs, Mulgara and the Callimachus Project. He is co-chair of the W3C’s RDF Working Group. Marsha Zaidman is Associate Professor Emerita of Computer Science at the University of Mary Washington. Luke Ruth is a Linked Data developer supporting the Callimachus Project (http://callimachusproject.org). Michael Hausenblas is Chief Data Engineer at MapR. He formerly led the Linked Data Research Centre in Galway, Ireland.
  • 14. Linked Data about the book!
  • 16. Manning Early Access Program (MEAP) • Concept: • Give away the first chapter • Sell a low-resolution PDF to early readers • Readers get PDF updates and a print copy when it becomes available
  • 19. Success Criteria • Sell 100 copies in MEAP in one month
  • 20. Success Criteria • Sell 100 copies in MEAP in one month • 598 copies in one month
  • 21. Success Criteria • Sell 100 copies in MEAP in one month • 598 copies in one month • Lots of interest in Linked Data!
  • 22. What’s Inside • What Linked Data is • Find Linked Data you can reuse • Use Linked Data in your applications • Create your own Linked Data • Build Linked Data applications using standard Web techniques
  • 23. Coding Examples Callimachus
  • 24. 1. Linked Data to the rescue Available 2. RDF - the data model for Linked Data Available 3. Consuming Linked Data Available 4. Creating Linked Data Available 5. Querying Linked Data Available 6. Enhancing results from search engines Available 7. Collecting Linked Data Available 8. Datasets Completed 9. Callimachus - a Linked Data management system In draft 10. Building a read-write Linked Data application In draft
  • 27. Partners Callimachus Sesame Sesame (in progress)
  • 32. RDF  “Describe”  View Subject Predicate Object
  • 36. Active PURLs for Clinical Study Aggregation David Wood1 and Tom Plasterer2 1 [email protected], [email protected] The problem: No coordinated view of clinical study information. Information is distributed across departments, subsidiaries and government data sources. The solution: Gather, convert, aggregate and format for display 3 Round Stones and AstraZeneca created a system to allow coordinated views of distributed clinical trial information. The system extended the Callimachus Project, an Open Source management system for Linked Data. Persistent URLs, or PURLs, were used to provide globally unique and resolvable identifiers for each clinical study. The PURL concept was extended to enable PURLs to have multiple targets and for the results of each target to undergo arbitrary transformation. PURLs which have such capabilities are called Active PURLs. Information sources relevant to clinical studies were identified, regardless of whether their location was internal or external to the pharmaceutical company's network. Active PURLs were used to resolve data sources having HTTP endpoints capable of returning XML or textual results. Each information source is dynamically transformed into Resource Description Framework (RDF) formats and all sources' results then merged into a single, temporary graph of RDF data. Information is rendered to end users as coordinated HTML descriptions regarding each clinical trial using the Callimachus template engine. Machine-readable versions of the data are also available. How semantic technologies help Linked Data techniques can help to address both the availability of clinical trial information and provide a means to build effective information systems using it. Linked Data techniques allow for "cooperation without coordination". Publishers of data provide context for use by third parties in other portions of a distributed enterprise. Users of Linked Data can combine information from multiple sources. Subsequent publication can create a virtuous circle of positive feedback, allowing researchers, informaticists and support staff to collaboratively and distributively build a reusable knowledge base. User experience Challenges HTTP-accessible endpoints capable of returning XML or textual content Distributed queries have many known 1 Users resolve a URL that limitations, such as the introduction of provides a unique identifier for multiple single points of failure in any a clinical study, drug, chemical given PURL resolution. HTTP timeouts, or other concept managed by auth/auth errors or other network failures this system. The user may can slow or stop a pipeline from returning be presented with the URL on correctly. HTML pages, search it via full- Similarly, distributed queries can result text techniques or discover it in variant query-time performance due to via semantic search. complex network and endpoint perform- Multiple targets queried independently ance variances. Convert XML or textual results to 2 Users are presented with a RDF Proactive caching and cache manage- dynamically generated Web meant strategies can improve runtime page representing aggregated 1 performance and protect end users from clinical study information. Users User resolves a single URI to an Render RDF to HTML via template the limitations inherent in a distributed are isolated from the complex Active PURL query architecture. Caching of and distributed information intermediate results from endpoints has environment. not yet been implemented. References Next steps
  • 37. DocBook 5 ✔ XHTML 5 ✔ ePub 3 Credit: Bradley P. Allen, Elsevier Labs
  • 39. Discount code: 13ldev at manning.com