Richard	
  Wallis	
  
OCLC	
  Technology	
  Evangelist	
  
@rjw
Web	
  Driven	
  Revolution	
  
For	
  Library	
  Data
Washington,	
  DC	
  28th	
  April	
  2015
Image	
  courtesy	
  of:	
  Shropshire	
  County	
  Council1779	
  (c.)
The Industrial Revolution
The	
  Web	
  of	
  …
The	
  Web	
  of	
  …
Documents
The	
  Web	
  of	
  …
Documents
Active	
  Documents
The	
  Web	
  of	
  …
Documents
Active	
  Documents
Discovery ☌
The	
  Web	
  of	
  …
Documents
Active	
  Documents
Discovery
Data
☌☌
The	
  Web	
  of	
  …
Documents
Active	
  Documents
Discovery
Data
☌☌
The	
  Web	
  of	
  …
Documents
Active	
  Documents
Discovery
Data
☌☌
✔
The	
  Web	
  of	
  …
Documents
Active	
  Documents
Discovery
Data
☌☌
✔
✔
The	
  Web	
  of	
  …
Documents
Active	
  Documents
Discovery
Data
☌☌
✔
✔
✗
The	
  Web	
  of	
  …
Documents
Active	
  Documents
Discovery
Data
☌☌
✔
✔
✔✗
✗
The	
  Web	
  of	
  …
Documents
Active	
  Documents
Discovery
Data
Knowledge
☌☌
✔
✔
✔✗
✗☌
The	
  Web	
  of	
  …
Documents
Active	
  Documents
Discovery
Data
Knowledge
☌☌
✔
✔
✔✗
✗
?
☌
http://www.opte.org/
The	
  Web	
  of	
  Data	
  
http://www.opte.org/
The	
  Web	
  of	
  Data	
  
A	
  Web	
  of	
  related	
  entities
http://www.opte.org/
The	
  Web	
  of	
  Data	
  
http://www.opte.org/
The	
  Web	
  of	
  Data	
  
A	
  Library	
  Shaped	
  Black	
  Hole	
  ?
record	
  
/ˈrɛkɔːd/	
  
noun	
  
!
a	
  thing	
  constituting	
  a	
  piece	
  of	
  evidence	
  about	
  
the	
  past,	
  especially	
  an	
  account	
  kept	
  in	
  writing	
  
or	
  some	
  other	
  permanent	
  form.
entity	
  
/ˈɛntɪti/	
  
noun	
  
a	
  thing	
  with	
  distinct	
  and	
  independent	
  
existence.
entity	
  
/ˈɛntɪti/	
  
noun	
  
a	
  thing	
  with	
  distinct	
  and	
  independent	
  
existence.
relationship	
  
/rɪˈleɪʃ(ə)nʃɪp/	
  
noun	
  
the	
  way	
  in	
  which	
  two	
  or	
  more	
  people	
  or	
  
things	
  are	
  connected	
  
Record
Title:	
  	
  "War	
  and	
  Peace"	
  
Author:	
  	
  "Leo	
  Tolstoy	
  1828-­‐1910"	
  
ISBN:	
  0307266931
Type:	
  Work	
  
Name:	
  	
  "War	
  and	
  Peace"	
  
Author:	
  	
  http://worldcat.org/entity/person/id/1234
Entity	
  (http://worldcat.org/entity/work/id/115206288)
Record
Title:	
  	
  "War	
  and	
  Peace"	
  
Author:	
  	
  "Leo	
  Tolstoy	
  1828-­‐1910"	
  
ISBN:	
  0307266931
Type:	
  Work	
  
Name:	
  	
  "War	
  and	
  Peace"	
  
Author:	
  	
  http://worldcat.org/entity/person/id/1234
Entity	
  (http://worldcat.org/entity/work/id/115206288)
Record
Title:	
  	
  "War	
  and	
  Peace"	
  
Author:	
  	
  "Leo	
  Tolstoy	
  1828-­‐1910"	
  
ISBN:	
  0307266931
Type:	
  Work	
  
Name:	
  	
  "War	
  and	
  Peace"	
  
Author:	
  	
  http://worldcat.org/entity/person/id/1234
Entity	
  (http://worldcat.org/entity/work/id/115206288)
Type:	
  Person	
  
Name:	
  	
  "Leo	
  Tolstoy	
  "	
  
Born:	
  	
  1828	
  
Died:	
  1910	
  
Birthplace:	
  http://worldcat.org/entity/place/id/8976
Entity	
  (http://worldcat.org/entity/person/id/1234)
⤵
Record
Title:	
  	
  "War	
  and	
  Peace"	
  
Author:	
  	
  "Leo	
  Tolstoy	
  1828-­‐1910"	
  
ISBN:	
  0307266931
Type:	
  Work	
  
Name:	
  	
  "War	
  and	
  Peace"	
  
Author:	
  	
  http://worldcat.org/entity/person/id/1234
Entity	
  (http://worldcat.org/entity/work/id/115206288)
Type:	
  Person	
  
Name:	
  	
  "Leo	
  Tolstoy	
  "	
  
Born:	
  	
  1828	
  
Died:	
  1910	
  
Birthplace:	
  http://worldcat.org/entity/place/id/8976
Entity	
  (http://worldcat.org/entity/person/id/1234)
Type:	
  Place	
  
Name:	
  	
  "Yasnaya	
  Polyana"	
  
SameAs:	
  	
  http://geonames.org/468686
Entity	
  (http://worldcat.org/entity/place/id/8976)
⤵
⤵
⟶
Many great LD Projects
So today …..
Where are we on the web?
Where are we on the web?
Invisible on the web!
Invisible on the web!
Open	
  Linked	
  Data	
  -­‐	
  Silos
Library	
  Linked	
  Data
British	
  
Library
German	
  
National	
  
Library
Spanish	
  
National	
  
Library
Swedish	
  
National	
  
Library
Open	
  Linked	
  Data	
  -­‐	
  Silos
Library	
  Linked	
  Data
British	
  
Library
German	
  
National	
  
Library
Spanish	
  
National	
  
Library
Swedish	
  
National	
  
Library
Open	
  Linked	
  Data	
  -­‐	
  Silos
Library	
  Linked	
  Data
British	
  
Library
German	
  
National	
  
Library
Spanish	
  
National	
  
Library
Swedish	
  
National	
  
Library
Open	
  Linked	
  Data	
  -­‐	
  Silos
Library	
  Linked	
  Data
British	
  
Library
German	
  
National	
  
Library
Spanish	
  
National	
  
Library
Swedish	
  
National	
  
Library
Open	
  Linked	
  Data	
  -­‐	
  Silos
Library	
  Linked	
  Data
British	
  
Library
German	
  
National	
  
Library
Spanish	
  
National	
  
Library
Swedish	
  
National	
  
Library
Open	
  Linked	
  Data	
  -­‐	
  Silos
Library	
  Linked	
  Data
British	
  
Library
German	
  
National	
  
Library
Spanish	
  
National	
  
Library
Swedish	
  
National	
  
Library
Open	
  Linked	
  Data	
  -­‐	
  Silos
Behind	
  A	
  Vocabulary	
  Barrier
Library	
  Linked	
  Data
A	
  general	
  purpose	
  vocabulary	
  for	
  
describing	
  things	
  on	
  the	
  web
A	
  general	
  purpose	
  vocabulary	
  for	
  
describing	
  things	
  on	
  the	
  web
"Used	
  by	
  5	
  million	
  
domains"
A	
  general	
  purpose	
  vocabulary	
  for	
  
describing	
  things	
  on	
  the	
  web
"Used	
  by	
  5	
  million	
  
domains" "25%	
  of	
  pages	
  in	
  our	
  
indexes"
A	
  general	
  purpose	
  vocabulary	
  for	
  
describing	
  things	
  on	
  the	
  web
"Used	
  by	
  5	
  million	
  
domains" "25%	
  of	
  pages	
  in	
  our	
  
indexes"
"15%	
  of	
  the	
  Web"
A	
  general	
  purpose	
  vocabulary	
  for	
  
describing	
  things	
  on	
  the	
  web
"Used	
  by	
  5	
  million	
  
domains" "25%	
  of	
  pages	
  in	
  our	
  
indexes"
de	
  facto
y
"15%	
  of	
  the	
  Web"
A	
  general	
  purpose	
  vocabulary	
  for	
  
describing	
  things	
  on	
  the	
  web
"Used	
  by	
  5	
  million	
  
domains" "25%	
  of	
  pages	
  in	
  our	
  
indexes"
de	
  facto
y
• Linked	
  Data	
  
"15%	
  of	
  the	
  Web"
A	
  general	
  purpose	
  vocabulary	
  for	
  
describing	
  things	
  on	
  the	
  web
"Used	
  by	
  5	
  million	
  
domains" "25%	
  of	
  pages	
  in	
  our	
  
indexes"
de	
  facto
y
• Linked	
  Data	
  
• Embedded	
  in	
  HTML
"15%	
  of	
  the	
  Web"
A	
  general	
  purpose	
  vocabulary	
  for	
  
describing	
  things	
  on	
  the	
  web
"Used	
  by	
  5	
  million	
  
domains" "25%	
  of	
  pages	
  in	
  our	
  
indexes"
de	
  facto
y
• Linked	
  Data	
  
• Embedded	
  in	
  HTML
• RDFa,	
  Microdata,	
  JSON-­‐LD
"15%	
  of	
  the	
  Web"
A	
  general	
  purpose	
  vocabulary	
  for	
  
describing	
  things	
  on	
  the	
  web
"Used	
  by	
  5	
  million	
  
domains" "25%	
  of	
  pages	
  in	
  our	
  
indexes"
de	
  facto
y
• Linked	
  Data	
  
• Embedded	
  in	
  HTML
• RDFa,	
  Microdata,	
  JSON-­‐LD
• Descriptive	
  data
"15%	
  of	
  the	
  Web"
A	
  general	
  purpose	
  vocabulary	
  for	
  
describing	
  things	
  on	
  the	
  web
"Used	
  by	
  5	
  million	
  
domains" "25%	
  of	
  pages	
  in	
  our	
  
indexes"
de	
  facto
y
• Linked	
  Data	
  
• Embedded	
  in	
  HTML
• RDFa,	
  Microdata,	
  JSON-­‐LD
• Descriptive	
  data
• Active	
  links
"15%	
  of	
  the	
  Web"
• Foundation	
  for	
  the	
  future	
  of	
  bibliographic	
  description
• Foundation	
  for	
  the	
  future	
  of	
  bibliographic	
  description
• Eventual	
  replacement	
  for	
  Marc	
  21
• Foundation	
  for	
  the	
  future	
  of	
  bibliographic	
  description
• Eventual	
  replacement	
  for	
  Marc	
  21
• Identify	
  information	
  entities
• Foundation	
  for	
  the	
  future	
  of	
  bibliographic	
  description
• Eventual	
  replacement	
  for	
  Marc	
  21
• Identify	
  information	
  entities
• Conversion	
  from	
  Marc
• Foundation	
  for	
  the	
  future	
  of	
  bibliographic	
  description
• Eventual	
  replacement	
  for	
  Marc	
  21
• Identify	
  information	
  entities
• Conversion	
  from	
  Marc
• Publish	
  in	
  RDF	
  –	
  Linked	
  Data
• Foundation	
  for	
  the	
  future	
  of	
  bibliographic	
  description
• Eventual	
  replacement	
  for	
  Marc	
  21
• Identify	
  information	
  entities
• Conversion	
  from	
  Marc
• Publish	
  in	
  RDF	
  –	
  Linked	
  Data
• White	
  Paper
Common Ground: Exploring Compatibilities between the
Linked Data Models of the Library of Congress and OCLC
http://oc.lc/CommonGround
Why	
  Catalog?
Why	
  Catalog?
So	
  we	
  can	
  find	
  things
Why	
  Catalog?
So	
  we	
  can	
  find	
  things
Why	
  Share	
  on	
  the	
  Web?
Why	
  Catalog?
So	
  we	
  can	
  find	
  things
Why	
  Share	
  on	
  the	
  Web?
So	
  today’s	
  users	
  
can	
  find	
  our	
  things
Where	
  are	
  our	
  users?
Where	
  are	
  our	
  users?
Entities:	
  Getting	
  from	
  here	
  to	
  there
Data from one
converted record does
not an entity make
Entities:	
  Getting	
  from	
  here	
  to	
  there
Data from one
converted record does
not an entity make
Transformation	
  into	
  Linked	
  Data	
  is	
  just	
  a	
  beginning	
  …
• Mine	
  and	
  analyse	
  an	
  aggregate
Entities:	
  Getting	
  from	
  here	
  to	
  there
Data from one
converted record does
not an entity make
Transformation	
  into	
  Linked	
  Data	
  is	
  just	
  a	
  beginning	
  …
• Mine	
  and	
  analyse	
  an	
  aggregate
• Identify,	
  map,	
  merge	
  -­‐	
  evidence	
  based
Entities:	
  Getting	
  from	
  here	
  to	
  there
Data from one
converted record does
not an entity make
Transformation	
  into	
  Linked	
  Data	
  is	
  just	
  a	
  beginning	
  …
• Mine	
  and	
  analyse	
  an	
  aggregate
• Identify,	
  map,	
  merge	
  -­‐	
  evidence	
  based
• Relate	
  to	
  external	
  sources
Entities:	
  Getting	
  from	
  here	
  to	
  there
Data from one
converted record does
not an entity make
Transformation	
  into	
  Linked	
  Data	
  is	
  just	
  a	
  beginning	
  …
• Mine	
  and	
  analyse	
  an	
  aggregate
• Identify,	
  map,	
  merge	
  -­‐	
  evidence	
  based
• Relate	
  to	
  external	
  sources
• Establish	
  the	
  entities
Entities:	
  Getting	
  from	
  here	
  to	
  there
Entities	
  and	
  library	
  workflows

Discovery
The  Name  of  the  Rose
Summary:	
  The	
  year	
  is	
  1327.	
  Franciscans	
  in	
  a	
  wealthy	
  
Italian	
  abbey	
  are	
  suspected	
  of	
  heresy,	
  and	
  Brother	
  
William	
  of	
  Baskerville	
  arrives	
  to	
  investigate.	
  His	
  delicate	
  
mission	
  is	
  suddenly	
  overshadowed	
  by	
  seven	
  bizarre	
  
deaths	
  that	
  take	
  place	
  in	
  seven	
  days	
  and	
  nights	
  of	
  
apocalyptic	
  terror.	
  
Subjects
Borrowing	
  Options	
  
eBooks	
  |	
  Printed	
  Books	
  |	
  Audio	
  Books	
  
Other	
  Languages	
  
!
Monastic	
  libraries	
  -­‐-­‐	
  Italy	
  –	
  Fiction	
  |	
  Semiotics	
  -­‐-­‐	
  Fiction	
  
http://www.opte.org/
A	
  Web	
  of	
  Data	
  
http://www.opte.org/
A	
  Web	
  of	
  Data	
  
person place
object concept
organization work
The	
  solution	
  
starts	
  here.
The	
  library	
  knowledge	
  graph

A	
  graph	
  of	
  relationships
person place
object concept
organization work
author
The	
  solution	
  
starts	
  here.
The	
  library	
  knowledge	
  graph

A	
  graph	
  of	
  relationships
person place
object concept
organization work
author
subject
The	
  solution	
  
starts	
  here.
The	
  library	
  knowledge	
  graph

A	
  graph	
  of	
  relationships
person place
object concept
organization work
author
subjectitem

availability
The	
  solution	
  
starts	
  here.
The	
  library	
  knowledge	
  graph

A	
  graph	
  of	
  relationships
The	
  library	
  knowledge	
  graph

A	
  graph	
  of	
  relationships
person place
object concept
organization work
What	
  will	
  be	
  better?
The	
  library	
  knowledge	
  graph

Lots	
  of	
  things….if	
  we	
  do	
  it	
  right.
ILL	
  and	
  AnalyticsCataloging
Discovery Integration	
  with	
  the	
  web
What	
  will	
  be	
  better?
Entities	
  and	
  library	
  workflows

Cataloging
Cataloging	
  will	
  be	
  different…	
  
▪ Managing	
  the	
  quality	
  of	
  Works	
  
• Improving	
  clusters	
  
▪ Managing	
  the	
  quality	
  of	
  Persons	
  
• Links	
  to	
  works,	
  Other	
  IDs
What	
  has	
  OCLC	
  done?
What	
  has	
  OCLC	
  done?
So	
  what	
  progress	
  have	
  
we	
  made?
• 197+	
  million	
  Work	
  descriptions	
  and	
  URIs	
  
• Schema.org	
  +	
  BiblioGraph.net	
  
• RDF	
  Data	
  formats	
  
• RDF/XML,	
  Turtle,	
  Triples,	
  JSON-­‐LD	
  
• Links	
  to	
  WorldCat	
  manifestations	
  
• Links	
  to	
  Dewey,	
  LCSH,	
  LCNAF,	
  VIAF,	
  FAST	
  
• Open	
  Data	
  license	
  via	
  Linked	
  Data	
  Explorer	
  
• 	
  2015:	
  Discovery	
  API,	
  Metadata	
  API	
  
• Released	
  April	
  2014
http://www.oclc.org/dataThe  Work  Entity
• 98+	
  million	
  Person	
  descriptions	
  and	
  URIs	
  
• Person	
  entities	
  with	
  authority:	
  20.2	
  million	
  
• Person	
  entities	
  without	
  authority:	
  78.3	
  million	
  
• Schema.org	
  +	
  BiblioGraph.net	
  
• Harvested	
  from	
  WorldCat	
  data	
  and	
  enriched	
  from	
  other	
  hubs	
  RDF	
  
Data	
  formats	
  
• RDF/XML,	
  Turtle,	
  Triples,	
  JSON-­‐LD	
  
• Links	
  to	
  WorldCat	
  Works.	
  	
  Added	
  links	
  from	
  WC	
  Works.	
  
• Open	
  Data	
  license	
  via	
  Linked	
  Data	
  Explorer	
  
• 	
  2015:	
  Linked	
  Data	
  Explorer,	
  Discovery	
  API
http://www.oclc.org/dataThe  Person  Entity
Success
Can	
  we	
  measure	
  impact?
Success
Monthly	
  Unique	
  Visitors
OCLC	
  Entity	
  Based	
  Data	
  Strategy
2012	
  
2013
2010
OCLC	
  Entity	
  Based	
  Data	
  Strategy
✓VIAF,	
  ISNI,	
  FAST	
  Publish	
  Linked	
  Data
✓WorldCat.org	
  Linked	
  Data	
  Release	
  –	
  using	
  Schema.org
✓Data	
  mining	
  of	
  WorldCat	
  resources
✓WorldCat	
  Works	
  Released	
  –	
  using	
  Schema.org
✓Schema.org	
  added	
  to	
  VIAF	
  RDF
✓WorldCat	
  Discovery	
  API	
  Returns	
  Schema.org	
  RDF	
  (Beta)
2012	
  
2014
2013
2010
OCLC	
  Entity	
  Based	
  Data	
  Strategy
✓VIAF,	
  ISNI,	
  FAST	
  Publish	
  Linked	
  Data
✓WorldCat.org	
  Linked	
  Data	
  Release	
  –	
  using	
  Schema.org
✓Data	
  mining	
  of	
  WorldCat	
  resources
✓WorldCat	
  Works	
  Released	
  –	
  using	
  Schema.org
✓Schema.org	
  added	
  to	
  VIAF	
  RDF
✓WorldCat	
  Discovery	
  API	
  Returns	
  Schema.org	
  RDF	
  (Beta)
2012	
  
2014
➢Application	
  Integration	
  
➢WorldCat	
  Discovery	
  
➢Analytics	
  
➢Discovery	
  API	
  
➢Cataloging
2015
2013
2010
OCLC	
  Entity	
  Based	
  Data	
  Strategy
✓VIAF,	
  ISNI,	
  FAST	
  Publish	
  Linked	
  Data
✓WorldCat.org	
  Linked	
  Data	
  Release	
  –	
  using	
  Schema.org
✓Data	
  mining	
  of	
  WorldCat	
  resources
✓WorldCat	
  Works	
  Released	
  –	
  using	
  Schema.org
✓Schema.org	
  added	
  to	
  VIAF	
  RDF
✓WorldCat	
  Discovery	
  API	
  Returns	
  Schema.org	
  RDF	
  (Beta)
2012	
  
2014
➢Application	
  Integration	
  
➢WorldCat	
  Discovery	
  
➢Analytics	
  
➢Discovery	
  API	
  
➢Cataloging
2015
➢More	
  Entities	
  Released	
  
➢Person	
  
➢Organization	
  
➢Event	
  
➢Concept
2013
2010
OCLC	
  Entity	
  Based	
  Data	
  Strategy
✓VIAF,	
  ISNI,	
  FAST	
  Publish	
  Linked	
  Data
✓WorldCat.org	
  Linked	
  Data	
  Release	
  –	
  using	
  Schema.org
✓Data	
  mining	
  of	
  WorldCat	
  resources
✓WorldCat	
  Works	
  Released	
  –	
  using	
  Schema.org
✓Schema.org	
  added	
  to	
  VIAF	
  RDF
✓WorldCat	
  Discovery	
  API	
  Returns	
  Schema.org	
  RDF	
  (Beta)
2012	
  
2014
➢Application	
  Integration	
  
➢WorldCat	
  Discovery	
  
➢Analytics	
  
➢Discovery	
  API	
  
➢Cataloging
2015
➢More	
  Entities	
  Released	
  
➢Person	
  
➢Organization	
  
➢Event	
  
➢Concept
➢New	
  Products	
  	
  	
  	
  	
  	
  	
  
➢Continuing	
  Evangelism
➢New	
  Services
➢Continuing	
  Innovation
2013
2016
2010
!
Many great Library Linked
Data Initiatives
but
!
Many great Library Linked
Data Initiatives
but
!
Many great Library Linked
Data Initiatives
If	
  users	
  can't	
  discover	
  our	
  resources
but
!
Many great Library Linked
Data Initiatives
If	
  users	
  can't	
  discover	
  our	
  resources
What	
  is	
  the	
  point?
but
!
Many great Library Linked
Data Initiatives
If	
  users	
  can't	
  discover	
  our	
  resources
What	
  is	
  the	
  point?
Give	
  the	
  Web	
  what	
  it	
  wants!
Linked	
  Data	
  has	
  benefits	
  for	
  library	
  workflows	
  ….
….by	
  giving	
  the	
  Web	
  what	
  it	
  wants
Web	
  Driven	
  Revolution	
  
For	
  Library	
  Data
We	
  Can	
  Lead	
  The
Web	
  Driven	
  Revolution	
  
For	
  Library	
  Data
We	
  Can	
  Lead	
  The
Richard	
  Wallis	
  
OCLC	
  Technology	
  Evangelist	
  
@rjw
Web	
  Driven	
  Revolution	
  
For	
  Library	
  Data
Washington,	
  DC	
  28th	
  April	
  2015
Richard	
  Wallis	
  
OCLC	
  Technology	
  Evangelist	
  
@rjw
Web	
  Driven	
  Revolution	
  
For	
  Library	
  Data
Washington,	
  DC	
  28th	
  April	
  2015
http://slideshare.net/rjw

Web Driven Revolution For Library Data

  • 1.
    Richard  Wallis   OCLC  Technology  Evangelist   @rjw Web  Driven  Revolution   For  Library  Data Washington,  DC  28th  April  2015
  • 4.
    Image  courtesy  of:  Shropshire  County  Council1779  (c.) The Industrial Revolution
  • 6.
  • 7.
    The  Web  of  … Documents
  • 8.
    The  Web  of  … Documents Active  Documents
  • 9.
    The  Web  of  … Documents Active  Documents Discovery ☌
  • 10.
    The  Web  of  … Documents Active  Documents Discovery Data ☌☌
  • 11.
    The  Web  of  … Documents Active  Documents Discovery Data ☌☌
  • 12.
    The  Web  of  … Documents Active  Documents Discovery Data ☌☌ ✔
  • 13.
    The  Web  of  … Documents Active  Documents Discovery Data ☌☌ ✔ ✔
  • 14.
    The  Web  of  … Documents Active  Documents Discovery Data ☌☌ ✔ ✔ ✗
  • 15.
    The  Web  of  … Documents Active  Documents Discovery Data ☌☌ ✔ ✔ ✔✗ ✗
  • 16.
    The  Web  of  … Documents Active  Documents Discovery Data Knowledge ☌☌ ✔ ✔ ✔✗ ✗☌
  • 17.
    The  Web  of  … Documents Active  Documents Discovery Data Knowledge ☌☌ ✔ ✔ ✔✗ ✗ ? ☌
  • 18.
  • 19.
  • 20.
  • 21.
  • 25.
    record   /ˈrɛkɔːd/   noun   ! a  thing  constituting  a  piece  of  evidence  about   the  past,  especially  an  account  kept  in  writing   or  some  other  permanent  form.
  • 26.
    entity   /ˈɛntɪti/   noun   a  thing  with  distinct  and  independent   existence.
  • 27.
    entity   /ˈɛntɪti/   noun   a  thing  with  distinct  and  independent   existence. relationship   /rɪˈleɪʃ(ə)nʃɪp/   noun   the  way  in  which  two  or  more  people  or   things  are  connected  
  • 28.
    Record Title:    "War  and  Peace"   Author:    "Leo  Tolstoy  1828-­‐1910"   ISBN:  0307266931 Type:  Work   Name:    "War  and  Peace"   Author:    http://worldcat.org/entity/person/id/1234 Entity  (http://worldcat.org/entity/work/id/115206288)
  • 29.
    Record Title:    "War  and  Peace"   Author:    "Leo  Tolstoy  1828-­‐1910"   ISBN:  0307266931 Type:  Work   Name:    "War  and  Peace"   Author:    http://worldcat.org/entity/person/id/1234 Entity  (http://worldcat.org/entity/work/id/115206288)
  • 30.
    Record Title:    "War  and  Peace"   Author:    "Leo  Tolstoy  1828-­‐1910"   ISBN:  0307266931 Type:  Work   Name:    "War  and  Peace"   Author:    http://worldcat.org/entity/person/id/1234 Entity  (http://worldcat.org/entity/work/id/115206288) Type:  Person   Name:    "Leo  Tolstoy  "   Born:    1828   Died:  1910   Birthplace:  http://worldcat.org/entity/place/id/8976 Entity  (http://worldcat.org/entity/person/id/1234) ⤵
  • 31.
    Record Title:    "War  and  Peace"   Author:    "Leo  Tolstoy  1828-­‐1910"   ISBN:  0307266931 Type:  Work   Name:    "War  and  Peace"   Author:    http://worldcat.org/entity/person/id/1234 Entity  (http://worldcat.org/entity/work/id/115206288) Type:  Person   Name:    "Leo  Tolstoy  "   Born:    1828   Died:  1910   Birthplace:  http://worldcat.org/entity/place/id/8976 Entity  (http://worldcat.org/entity/person/id/1234) Type:  Place   Name:    "Yasnaya  Polyana"   SameAs:    http://geonames.org/468686 Entity  (http://worldcat.org/entity/place/id/8976) ⤵ ⤵ ⟶
  • 39.
    Many great LDProjects So today ….. Where are we on the web?
  • 40.
    Where are weon the web?
  • 49.
  • 50.
  • 51.
    Open  Linked  Data  -­‐  Silos Library  Linked  Data
  • 52.
    British   Library German   National   Library Spanish   National   Library Swedish   National   Library Open  Linked  Data  -­‐  Silos Library  Linked  Data
  • 53.
    British   Library German   National   Library Spanish   National   Library Swedish   National   Library Open  Linked  Data  -­‐  Silos Library  Linked  Data
  • 54.
    British   Library German   National   Library Spanish   National   Library Swedish   National   Library Open  Linked  Data  -­‐  Silos Library  Linked  Data
  • 55.
    British   Library German   National   Library Spanish   National   Library Swedish   National   Library Open  Linked  Data  -­‐  Silos Library  Linked  Data
  • 56.
    British   Library German   National   Library Spanish   National   Library Swedish   National   Library Open  Linked  Data  -­‐  Silos Library  Linked  Data
  • 57.
    British   Library German   National   Library Spanish   National   Library Swedish   National   Library Open  Linked  Data  -­‐  Silos Behind  A  Vocabulary  Barrier Library  Linked  Data
  • 60.
    A  general  purpose  vocabulary  for   describing  things  on  the  web
  • 61.
    A  general  purpose  vocabulary  for   describing  things  on  the  web "Used  by  5  million   domains"
  • 62.
    A  general  purpose  vocabulary  for   describing  things  on  the  web "Used  by  5  million   domains" "25%  of  pages  in  our   indexes"
  • 63.
    A  general  purpose  vocabulary  for   describing  things  on  the  web "Used  by  5  million   domains" "25%  of  pages  in  our   indexes" "15%  of  the  Web"
  • 64.
    A  general  purpose  vocabulary  for   describing  things  on  the  web "Used  by  5  million   domains" "25%  of  pages  in  our   indexes" de  facto y "15%  of  the  Web"
  • 65.
    A  general  purpose  vocabulary  for   describing  things  on  the  web "Used  by  5  million   domains" "25%  of  pages  in  our   indexes" de  facto y • Linked  Data   "15%  of  the  Web"
  • 66.
    A  general  purpose  vocabulary  for   describing  things  on  the  web "Used  by  5  million   domains" "25%  of  pages  in  our   indexes" de  facto y • Linked  Data   • Embedded  in  HTML "15%  of  the  Web"
  • 67.
    A  general  purpose  vocabulary  for   describing  things  on  the  web "Used  by  5  million   domains" "25%  of  pages  in  our   indexes" de  facto y • Linked  Data   • Embedded  in  HTML • RDFa,  Microdata,  JSON-­‐LD "15%  of  the  Web"
  • 68.
    A  general  purpose  vocabulary  for   describing  things  on  the  web "Used  by  5  million   domains" "25%  of  pages  in  our   indexes" de  facto y • Linked  Data   • Embedded  in  HTML • RDFa,  Microdata,  JSON-­‐LD • Descriptive  data "15%  of  the  Web"
  • 69.
    A  general  purpose  vocabulary  for   describing  things  on  the  web "Used  by  5  million   domains" "25%  of  pages  in  our   indexes" de  facto y • Linked  Data   • Embedded  in  HTML • RDFa,  Microdata,  JSON-­‐LD • Descriptive  data • Active  links "15%  of  the  Web"
  • 71.
    • Foundation  for  the  future  of  bibliographic  description
  • 72.
    • Foundation  for  the  future  of  bibliographic  description • Eventual  replacement  for  Marc  21
  • 73.
    • Foundation  for  the  future  of  bibliographic  description • Eventual  replacement  for  Marc  21 • Identify  information  entities
  • 74.
    • Foundation  for  the  future  of  bibliographic  description • Eventual  replacement  for  Marc  21 • Identify  information  entities • Conversion  from  Marc
  • 75.
    • Foundation  for  the  future  of  bibliographic  description • Eventual  replacement  for  Marc  21 • Identify  information  entities • Conversion  from  Marc • Publish  in  RDF  –  Linked  Data
  • 76.
    • Foundation  for  the  future  of  bibliographic  description • Eventual  replacement  for  Marc  21 • Identify  information  entities • Conversion  from  Marc • Publish  in  RDF  –  Linked  Data • White  Paper Common Ground: Exploring Compatibilities between the Linked Data Models of the Library of Congress and OCLC http://oc.lc/CommonGround
  • 78.
  • 79.
    Why  Catalog? So  we  can  find  things
  • 80.
    Why  Catalog? So  we  can  find  things Why  Share  on  the  Web?
  • 81.
    Why  Catalog? So  we  can  find  things Why  Share  on  the  Web? So  today’s  users   can  find  our  things
  • 83.
  • 84.
  • 85.
    Entities:  Getting  from  here  to  there
  • 86.
    Data from one convertedrecord does not an entity make Entities:  Getting  from  here  to  there
  • 87.
    Data from one convertedrecord does not an entity make Transformation  into  Linked  Data  is  just  a  beginning  … • Mine  and  analyse  an  aggregate Entities:  Getting  from  here  to  there
  • 88.
    Data from one convertedrecord does not an entity make Transformation  into  Linked  Data  is  just  a  beginning  … • Mine  and  analyse  an  aggregate • Identify,  map,  merge  -­‐  evidence  based Entities:  Getting  from  here  to  there
  • 89.
    Data from one convertedrecord does not an entity make Transformation  into  Linked  Data  is  just  a  beginning  … • Mine  and  analyse  an  aggregate • Identify,  map,  merge  -­‐  evidence  based • Relate  to  external  sources Entities:  Getting  from  here  to  there
  • 90.
    Data from one convertedrecord does not an entity make Transformation  into  Linked  Data  is  just  a  beginning  … • Mine  and  analyse  an  aggregate • Identify,  map,  merge  -­‐  evidence  based • Relate  to  external  sources • Establish  the  entities Entities:  Getting  from  here  to  there
  • 97.
    Entities  and  library  workflows
 Discovery The  Name  of  the  Rose Summary:  The  year  is  1327.  Franciscans  in  a  wealthy   Italian  abbey  are  suspected  of  heresy,  and  Brother   William  of  Baskerville  arrives  to  investigate.  His  delicate   mission  is  suddenly  overshadowed  by  seven  bizarre   deaths  that  take  place  in  seven  days  and  nights  of   apocalyptic  terror.   Subjects Borrowing  Options   eBooks  |  Printed  Books  |  Audio  Books   Other  Languages   ! Monastic  libraries  -­‐-­‐  Italy  –  Fiction  |  Semiotics  -­‐-­‐  Fiction  
  • 98.
  • 99.
  • 100.
    person place object concept organizationwork The  solution   starts  here. The  library  knowledge  graph
 A  graph  of  relationships
  • 101.
    person place object concept organizationwork author The  solution   starts  here. The  library  knowledge  graph
 A  graph  of  relationships
  • 102.
    person place object concept organizationwork author subject The  solution   starts  here. The  library  knowledge  graph
 A  graph  of  relationships
  • 103.
    person place object concept organizationwork author subjectitem
 availability The  solution   starts  here. The  library  knowledge  graph
 A  graph  of  relationships
  • 104.
    The  library  knowledge  graph
 A  graph  of  relationships person place object concept organization work
  • 105.
  • 106.
    The  library  knowledge  graph
 Lots  of  things….if  we  do  it  right. ILL  and  AnalyticsCataloging Discovery Integration  with  the  web What  will  be  better?
  • 107.
    Entities  and  library  workflows
 Cataloging Cataloging  will  be  different…   ▪ Managing  the  quality  of  Works   • Improving  clusters   ▪ Managing  the  quality  of  Persons   • Links  to  works,  Other  IDs
  • 108.
  • 109.
    What  has  OCLC  done? So  what  progress  have   we  made?
  • 110.
    • 197+  million  Work  descriptions  and  URIs   • Schema.org  +  BiblioGraph.net   • RDF  Data  formats   • RDF/XML,  Turtle,  Triples,  JSON-­‐LD   • Links  to  WorldCat  manifestations   • Links  to  Dewey,  LCSH,  LCNAF,  VIAF,  FAST   • Open  Data  license  via  Linked  Data  Explorer   •  2015:  Discovery  API,  Metadata  API   • Released  April  2014 http://www.oclc.org/dataThe  Work  Entity
  • 111.
    • 98+  million  Person  descriptions  and  URIs   • Person  entities  with  authority:  20.2  million   • Person  entities  without  authority:  78.3  million   • Schema.org  +  BiblioGraph.net   • Harvested  from  WorldCat  data  and  enriched  from  other  hubs  RDF   Data  formats   • RDF/XML,  Turtle,  Triples,  JSON-­‐LD   • Links  to  WorldCat  Works.    Added  links  from  WC  Works.   • Open  Data  license  via  Linked  Data  Explorer   •  2015:  Linked  Data  Explorer,  Discovery  API http://www.oclc.org/dataThe  Person  Entity
  • 112.
  • 113.
    Can  we  measure  impact? Success
  • 115.
  • 116.
    OCLC  Entity  Based  Data  Strategy 2012   2013 2010
  • 117.
    OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked  Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF ✓WorldCat  Discovery  API  Returns  Schema.org  RDF  (Beta) 2012   2014 2013 2010
  • 118.
    OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked  Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF ✓WorldCat  Discovery  API  Returns  Schema.org  RDF  (Beta) 2012   2014 ➢Application  Integration   ➢WorldCat  Discovery   ➢Analytics   ➢Discovery  API   ➢Cataloging 2015 2013 2010
  • 119.
    OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked  Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF ✓WorldCat  Discovery  API  Returns  Schema.org  RDF  (Beta) 2012   2014 ➢Application  Integration   ➢WorldCat  Discovery   ➢Analytics   ➢Discovery  API   ➢Cataloging 2015 ➢More  Entities  Released   ➢Person   ➢Organization   ➢Event   ➢Concept 2013 2010
  • 120.
    OCLC  Entity  Based  Data  Strategy ✓VIAF,  ISNI,  FAST  Publish  Linked  Data ✓WorldCat.org  Linked  Data  Release  –  using  Schema.org ✓Data  mining  of  WorldCat  resources ✓WorldCat  Works  Released  –  using  Schema.org ✓Schema.org  added  to  VIAF  RDF ✓WorldCat  Discovery  API  Returns  Schema.org  RDF  (Beta) 2012   2014 ➢Application  Integration   ➢WorldCat  Discovery   ➢Analytics   ➢Discovery  API   ➢Cataloging 2015 ➢More  Entities  Released   ➢Person   ➢Organization   ➢Event   ➢Concept ➢New  Products               ➢Continuing  Evangelism ➢New  Services ➢Continuing  Innovation 2013 2016 2010
  • 121.
    ! Many great LibraryLinked Data Initiatives
  • 122.
    but ! Many great LibraryLinked Data Initiatives
  • 123.
    but ! Many great LibraryLinked Data Initiatives If  users  can't  discover  our  resources
  • 124.
    but ! Many great LibraryLinked Data Initiatives If  users  can't  discover  our  resources What  is  the  point?
  • 125.
    but ! Many great LibraryLinked Data Initiatives If  users  can't  discover  our  resources What  is  the  point? Give  the  Web  what  it  wants!
  • 127.
    Linked  Data  has  benefits  for  library  workflows  …. ….by  giving  the  Web  what  it  wants Web  Driven  Revolution   For  Library  Data We  Can  Lead  The
  • 128.
    Web  Driven  Revolution   For  Library  Data We  Can  Lead  The
  • 129.
    Richard  Wallis   OCLC  Technology  Evangelist   @rjw Web  Driven  Revolution   For  Library  Data Washington,  DC  28th  April  2015
  • 130.
    Richard  Wallis   OCLC  Technology  Evangelist   @rjw Web  Driven  Revolution   For  Library  Data Washington,  DC  28th  April  2015 http://slideshare.net/rjw