Making Semantics Work
                        The Role of Linked Data in
                           Scholarly Publishing
                                              1 June 2012
                                          Arlington VA USA


                                     Brief by Bernadette Hyland,
                        co-chair, W3C Government Linked Data Working Group
                                      CEO, 3 Round Stones, Inc

                                  Email: bhyland@3roundstones.com
                                         Twitter: @BernHyland
                         This presentation: http://slideshare.net/3roundstones




Wednesday, May 30, 12                                                            1
What is the semantic web?
                                                 from the W3C web site

                                The Semantic Web is a web of data.

                                The Semantic Web is about two things. It is about common
                                formats for integration and combination of data drawn from
                                diverse sources… It is also about language for recording
                                how data relates to real world objects.




                                               Slide credit: Scott Brinker @chiefmartec
Data, not content, is King
                            Slide credit: Scott Brinker @chiefmartec
Data is really
                        flowing...

Linked data is about data
      that is reusable


                        A simple yet
                        revolutionary change in
                        perspective.



We’re living in
                                      a golden
                                        age ...
                        Photo credit: http://www.flickr.com/photos/sjungling/5974860/
“Knowledge is of two kinds.
                         We know a subject ourselves, or we know
                        where we can find information upon it.”

                                           by Samuel Johnson
                                  18th Century British author, linguist & lexicographer




from:


                        to:     Linked Enterprise Data




Book Publishers
       Goals: Improve internal manuscript pipelines, expose
           additional ways of finding and using content




New Media




Scholars' pain point ...
       #1 - Data access and reuse
       Large amounts of diverse data produced by complex
       experiments, simulations & observations
         • The growth rate of PubMed alone is one paper per
           minute
         • Scientific data is hard to validate, reproduce & leverage
         • It is neither easily accessible nor interlinked
                 (The exception is ‘omics’ research, where deposit of
                 sequences is required for publication)


Publishers looking to ...
            1. Lower costs of combining data silos
            2. Control data quality & protect data/brand standards
            3. Produce high-quality data for external consumption
            4. Leverage structured data increasingly available via the Web
            5. Distribute & promote content (SEO++)
            6. Increase paid subscriptions
            7. Provide new data initiatives, i.e., a “kitchen” for mashups


Business decisions are yours...
                                                        “Marketing”
               Determining how
               much data to
               share…


                         …or not to share.
  “Legal”



                                  Slide courtesy of Scott Brinker @chiefmartec
Some data may be
                        better harnessed as
                        an incentive for other
                        business goals


                        • For internal use
                        • For external use by
                          • new & existing authors
                          • new editors
                          • new subscribers
                          • new partners

Why Linked Data matters ...
             • It scales ... to Web-scale
             • Does not require a super model
             • Based on International Data Exchange
               Standards (RDF, SPARQL)
             • Lingua franca for data exchange
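
To make the "no super model" point concrete, here is a minimal sketch in plain Python (no RDF library). It assumes two independently produced sets of subject-predicate-object statements and shows that combining them is a set union, with no shared schema agreed beforehand. The example.org URIs and the sample records are hypothetical; the Dublin Core and FOAF property URIs are real vocabulary terms. The output follows the N-Triples line format, but this is an illustration rather than a full serializer.

```python
# Each statement is a (subject, predicate, object) triple.
# All example.org URIs and the sample records below are hypothetical.
DC_TITLE = "http://purl.org/dc/terms/title"
DC_CREATOR = "http://purl.org/dc/terms/creator"
FOAF_NAME = "http://xmlns.com/foaf/0.1/name"

# Data held by one party (say, a publisher).
publisher_data = {
    ("http://example.org/article/42", DC_TITLE, '"Making Semantics Work"'),
}

# Data held by another party (say, a library), created independently.
library_data = {
    ("http://example.org/article/42", DC_CREATOR, "http://example.org/person/7"),
    ("http://example.org/person/7", FOAF_NAME, '"A. Author"'),
}


def merge(*graphs):
    """Combine graphs by set union; no global schema has to be agreed first."""
    merged = set()
    for graph in graphs:
        merged |= graph
    return merged


def to_ntriples(graph):
    """Write triples one per line; objects starting with a quote are literals."""
    lines = []
    for s, p, o in sorted(graph):
        obj = o if o.startswith('"') else f"<{o}>"
        lines.append(f"<{s}> <{p}> {obj} .")
    return "\n".join(lines)


merged = merge(publisher_data, library_data)
print(to_ntriples(merged))
```

Because both sources use the same URI for the article, their statements about it line up after the merge without any mapping step.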


       • Linked Data is about publishing and consuming
         data using international data standards
       • Based on a 20-year-old idea
       • A system of linked information systems




Data landscape
       [Diagram labels: Semantic Technologies, Semantic Web, Linked Open Data,
       Linked Enterprise Data, RDBMS, CRM, BI]



Agreement on priorities may be
        contentious among stakeholders.




YouTube                  HDTV
Watch videos             Watch better videos
Publish videos
Share videos
Rate videos
Discuss videos

Linked Data              RDBMS
Use data                 Use data
Publish data
Share data
Rate data
Discuss data

Credit: Bradley P. Allen, Elsevier Labs
Linked Data
                        Management
                          platform
CONTENT MANAGEMENT SYSTEM          LINKED DATA MANAGEMENT SYSTEM
Data                               Text
Unstructured text                  Structured data




         • Callimachus is a framework
            for data-driven applications
            based on Linked Data
            principles
         • Callimachus allows Web
            developers to easily create
            data-driven applications for
            the Web
         • Available as Open Source
            (FLOSS) and as a commercially
            supported version



Espaces Semantiques
 US Government Printing Office




Publishing Linked Data
      will require continual
      nurturing but the
      rewards are worth it




Goal:
                        Achieve balance between
                        openness vs. protection,
                        distributed vs. controlled,
                        standardized vs. loosely-coupled
                        data relationships.




                                 Slide credit: Scott Brinker @chiefmartec
Recommendations
           • Seek balance for sharing and reuse
           • Data is king
           • Publish in a reusable format (RDF family of standards)
           • Use open rather than proprietary data formats
           • Define a URI Policy and Strategy, document it and
               ensure editors & authors use it
           • Best practices and vocabularies exist; don’t reinvent
               the wheel
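
As one illustration of what a documented URI policy might look like in practice, the sketch below mints identifiers from a single pattern and checks whether a given URI conforms to it. Everything here is an assumption made for the example: the base domain `http://example.org`, the `/id/{kind}/{slug}` pattern, the list of resource kinds, and the helper names `mint_uri` and `follows_policy`. A real policy would be defined by the publisher.

```python
import re

# Hypothetical policy: http://example.org/id/{kind}/{slug}
BASE = "http://example.org"               # assumed publisher domain
KINDS = ("article", "author", "journal")  # assumed resource types

# Slugs are lowercase letters and digits, separated by single hyphens.
URI_PATTERN = re.compile(
    r"^http://example\.org/id/(" + "|".join(KINDS) + r")/[a-z0-9]+(-[a-z0-9]+)*$"
)


def mint_uri(kind: str, local_id: str) -> str:
    """Build a URI for a resource according to the hypothetical policy."""
    if kind not in KINDS:
        raise ValueError(f"unknown resource kind: {kind!r}")
    # Lowercase, then collapse every run of other characters into one hyphen.
    slug = re.sub(r"[^a-z0-9]+", "-", local_id.lower()).strip("-")
    if not slug:
        raise ValueError("local_id produces an empty slug")
    return f"{BASE}/id/{kind}/{slug}"


def follows_policy(uri: str) -> bool:
    """Check an existing URI against the same pattern."""
    return bool(URI_PATTERN.match(uri))


uri = mint_uri("article", "Making Semantics Work!")
print(uri, follows_policy(uri))
```

Keeping minting and validation next to each other gives editors and authors one place to look when they need an identifier, which is the point of documenting the policy.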


Bernadette Hyland
                        bhyland@3roundstones.com
                              @BernHyland




                          http://3RoundStones.com
                        Email info@3RoundStones.com
                          Twitter @3RoundStones

This work is Copyright © 2011-2012 3 Round Stones Inc.
                        It is licensed under the Creative Commons Attribution 3.0 Unported License
                        Full details at: http://creativecommons.org/licenses/by/3.0/

                        You are free:

                                to Share — to copy, distribute and transmit the work



                                to Remix — to adapt the work



                        Under the following conditions:
                                Attribution. You must attribute the work in the manner specified by the
                                author or licensor (but not in any way that suggests that they endorse
                                you or your use of the work).

                                Share Alike. If you alter, transform, or build upon this work, you may
                                distribute the resulting work only under the same or similar license to this
                                one.




