Fuller Disclosure:  Getting More Collections into the Network Flow  Karen Smith-Yoshimura OCLC Programs and Research NELINET Conference Revealing Hidden Collections: Making the Lost Found Again Worcester, MA November 14, 2008
“ It is not the strongest of the species that survives, nor the most intelligent  that survives.  It is the one that is the most adaptable to change.” — Charles Darwin Image: Auckland Museum
Collections Grid A framework for representing content Published Content • Books • Journals • Newspapers • Gov. docs • CD, DVD • Maps • Scores Special Collections • Rare books • Local/Historical newspapers • Local history materials • Photographs • Archives & Manuscripts • Theses & Dissertations Museum objects Source: OCLC Office of Research 2003 Institutional Content • ePrints/tech reports • Learning objects • Courseware • Local government reports • Training manuals • Research data Open Web Content • Freely-available  web resources • Open source software • Newspaper archives • Images digital print HIGH LOW HIGH LOW stewardship uniqueness The Collective Collection
David Lewis (Dean of University Library, Indiana University Purdue) predicts in 20 years less than 50% of a library’s investments will go to purchasing collections Investment in  locally curated  content increases as library engagement in research support increases Investment in  acquired content  decreases as print collection is retired and shared purchasing of e-resources increases
  Recent attention given to hidden collections   Council on Library and Information Resources: Cataloging Hidden Special Collections and Archives (Mellon funded $4.27 million) http://www.clir.org/hiddencollections/index.html National Historical Publications and Records Commission grants “ activities to reveal collections that researchers cannot easily discover” Association of Research Libraries Special Collections Task Force 2001-2006 http://www.arl.org/rtl/speccoll/spcolltf/status0706.shtml
Funding for streamlined processes
http://www.loc.gov/bibliographic-future/news/lcwg-ontherecord-jan08-final.pdf
Enhance access to rare, unique, and other special hidden materials   2.1.1 Make the discovery of rare, unique and other special hidden materials a high priority 2.1.2 Streamline cataloging for rare, unique and other special hidden materials, emphasizing greater coverage and broader access 2.1.3 Integrate access to rare, unique and other special hidden materials with other library materials 2.1.4 Encourage digitization to allow broader access 2.1.5 Share access to rare, unique and other special hidden materials
Image: from the end of “Raiders of the Lost Ark” Who knows what’s hidden in our  collections?
What percentage of your collection do you estimate has not been adequately described – and is unlikely to be described without additional resources, funding, or both? RLG Programs Descriptive Metadata Practices Survey Results: Data Supplement   http://www.oclc.org/programs/publications/reports/2007-04.pdf 18% 35% 24% 22%
Princeton University Archives Even processed materials need description
 
Yale University
Where do you begin an online search for information on a topic? College Students’ Perceptions of Libraries and Information Resources: a Report to the OCLC Membership :  http://www.oclc.org/reports/perceptionscollege.htm
Be where the users go Image: informationarchitects.jp/web-trend-map-2008-beta/
Image:  http://hello.eboy.com/eboy/wp-content/uploads/shop/EBY_FooBar_35t.png
Discovery happens elsewhere – Be  there!
 
 
So how much metadata is really needed?
Diffusion
 
Relationships and controlled vocabularies contributed by all  catalogers of all editions Concentration
 
 
Where the users are
 
 
Going mobile, part of diffusion
Concentration Diffusion Aggregation of data at the network level Descriptive Mining the clickstream: “Database of intentions” Social Network effects Syndication to many destinations A feed based universe Data APIs Widgets Mobilization in user workflows Encourage social participation
Yale University Archives vault Digitize for access
Don’t get further behind Trinity College Dublin
University of Aberdeen Stop thinking about item-level description
 
Quantity over quality Images: labs.live.com/photosynth/
Images: Harry Ransom Center, UT Austin
Combine approaches
Do it   all Images: A View to Hugh (blog)
Share your findings Images: A View to Hugh (blog) www.lib.unc.edu/blogs/morton/
Images: various
Preserve right to reuse/remix
Work with partners to get what you want
Describing collections How much detail is necessary – if discovery will happen elsewhere?  … only 27 MARC fields are in 10% or more of WorldCat records Focus on collection-level descriptions Start with basic description, then … allow serious researchers to contact you for more detail, and … engage your user community to add to the descriptions More product, less process   (Greene-Meissner)
The changing context for metadata management B.W. (Before the Web) For finding and managing  library materials (mostly print) Catalog records (well-understood rules and encoding conventions) Shared cooperative cataloging systems Usually handcrafted, one at a time A.W. (After the Web) For finding and managing many types of materials, for many user communities Many types of records, many sources Loosely coupled metadata management, reuse and exchange services among multiple repositories Multiple batch creation and metadata extract, conversion, mapping, ingest and transfer services
Most common crosswalks between schemas used (from 2008 survey of RLG Partners) 63% of 100 respondents convert between MARC and non-MARC formats Metadata can be repurposed %
Fuller disclosure! Get discovered! Digitize or scan, whatever you can Expose your collections  where people congregate Share your metadata Quantity trumps quality “ Include and postpone” – items can be organized and metadata enriched over time
Resources Shifting Gears :  Gearing Up to Get Into the Flow  Ricky Erway and Jennifer Schaffner  http://www.oclc.org/programs/publications/reports/2007-02.pdf Blogs  http://hangingtogether.org/     http://orweblog.oclc.org/ Web  www.oclc.org/programs Karen Smith-Yoshimura  [email_address] Thanks to Karen Calhoun, Lorcan Dempsey, Ricky Erway, Thom Hickey, Roy Tennant, and other OCLC contributors.

Fuller Disclosure: Getting More Collections into the Network Flow

  • 1.
    Fuller Disclosure: Getting More Collections into the Network Flow Karen Smith-Yoshimura OCLC Programs and Research NELINET Conference Revealing Hidden Collections: Making the Lost Found Again Worcester, MA November 14, 2008
  • 2.
    “ It isnot the strongest of the species that survives, nor the most intelligent that survives. It is the one that is the most adaptable to change.” — Charles Darwin Image: Auckland Museum
  • 3.
    Collections Grid Aframework for representing content Published Content • Books • Journals • Newspapers • Gov. docs • CD, DVD • Maps • Scores Special Collections • Rare books • Local/Historical newspapers • Local history materials • Photographs • Archives & Manuscripts • Theses & Dissertations Museum objects Source: OCLC Office of Research 2003 Institutional Content • ePrints/tech reports • Learning objects • Courseware • Local government reports • Training manuals • Research data Open Web Content • Freely-available web resources • Open source software • Newspaper archives • Images digital print HIGH LOW HIGH LOW stewardship uniqueness The Collective Collection
  • 4.
    David Lewis (Deanof University Library, Indiana University Purdue) predicts in 20 years less than 50% of a library’s investments will go to purchasing collections Investment in locally curated content increases as library engagement in research support increases Investment in acquired content decreases as print collection is retired and shared purchasing of e-resources increases
  • 5.
    Recentattention given to hidden collections Council on Library and Information Resources: Cataloging Hidden Special Collections and Archives (Mellon funded $4.27 million) http://www.clir.org/hiddencollections/index.html National Historical Publications and Records Commission grants “ activities to reveal collections that researchers cannot easily discover” Association of Research Libraries Special Collections Task Force 2001-2006 http://www.arl.org/rtl/speccoll/spcolltf/status0706.shtml
  • 6.
  • 7.
  • 8.
    Enhance access torare, unique, and other special hidden materials 2.1.1 Make the discovery of rare, unique and other special hidden materials a high priority 2.1.2 Streamline cataloging for rare, unique and other special hidden materials, emphasizing greater coverage and broader access 2.1.3 Integrate access to rare, unique and other special hidden materials with other library materials 2.1.4 Encourage digitization to allow broader access 2.1.5 Share access to rare, unique and other special hidden materials
  • 9.
    Image: from theend of “Raiders of the Lost Ark” Who knows what’s hidden in our collections?
  • 10.
    What percentage ofyour collection do you estimate has not been adequately described – and is unlikely to be described without additional resources, funding, or both? RLG Programs Descriptive Metadata Practices Survey Results: Data Supplement http://www.oclc.org/programs/publications/reports/2007-04.pdf 18% 35% 24% 22%
  • 11.
    Princeton University ArchivesEven processed materials need description
  • 12.
  • 13.
  • 14.
    Where do youbegin an online search for information on a topic? College Students’ Perceptions of Libraries and Information Resources: a Report to the OCLC Membership : http://www.oclc.org/reports/perceptionscollege.htm
  • 15.
    Be where theusers go Image: informationarchitects.jp/web-trend-map-2008-beta/
  • 16.
  • 17.
  • 18.
  • 19.
  • 20.
    So how muchmetadata is really needed?
  • 21.
  • 22.
  • 23.
    Relationships and controlledvocabularies contributed by all catalogers of all editions Concentration
  • 24.
  • 25.
  • 26.
  • 27.
  • 28.
  • 29.
    Going mobile, partof diffusion
  • 30.
    Concentration Diffusion Aggregationof data at the network level Descriptive Mining the clickstream: “Database of intentions” Social Network effects Syndication to many destinations A feed based universe Data APIs Widgets Mobilization in user workflows Encourage social participation
  • 31.
    Yale University Archivesvault Digitize for access
  • 32.
    Don’t get furtherbehind Trinity College Dublin
  • 33.
    University of AberdeenStop thinking about item-level description
  • 34.
  • 35.
    Quantity over qualityImages: labs.live.com/photosynth/
  • 36.
    Images: Harry RansomCenter, UT Austin
  • 37.
  • 38.
    Do it all Images: A View to Hugh (blog)
  • 39.
    Share your findingsImages: A View to Hugh (blog) www.lib.unc.edu/blogs/morton/
  • 40.
  • 41.
    Preserve right toreuse/remix
  • 42.
    Work with partnersto get what you want
  • 43.
    Describing collections Howmuch detail is necessary – if discovery will happen elsewhere? … only 27 MARC fields are in 10% or more of WorldCat records Focus on collection-level descriptions Start with basic description, then … allow serious researchers to contact you for more detail, and … engage your user community to add to the descriptions More product, less process (Greene-Meissner)
  • 44.
    The changing contextfor metadata management B.W. (Before the Web) For finding and managing library materials (mostly print) Catalog records (well-understood rules and encoding conventions) Shared cooperative cataloging systems Usually handcrafted, one at a time A.W. (After the Web) For finding and managing many types of materials, for many user communities Many types of records, many sources Loosely coupled metadata management, reuse and exchange services among multiple repositories Multiple batch creation and metadata extract, conversion, mapping, ingest and transfer services
  • 45.
    Most common crosswalksbetween schemas used (from 2008 survey of RLG Partners) 63% of 100 respondents convert between MARC and non-MARC formats Metadata can be repurposed %
  • 46.
    Fuller disclosure! Getdiscovered! Digitize or scan, whatever you can Expose your collections where people congregate Share your metadata Quantity trumps quality “ Include and postpone” – items can be organized and metadata enriched over time
  • 47.
    Resources Shifting Gears: Gearing Up to Get Into the Flow Ricky Erway and Jennifer Schaffner http://www.oclc.org/programs/publications/reports/2007-02.pdf Blogs http://hangingtogether.org/ http://orweblog.oclc.org/ Web www.oclc.org/programs Karen Smith-Yoshimura [email_address] Thanks to Karen Calhoun, Lorcan Dempsey, Ricky Erway, Thom Hickey, Roy Tennant, and other OCLC contributors.