Institutional Identifiers in Practice
16 July 2015
Today’s Agenda
1. Ringgold Introduction
2. Entity Management: People, Places, and
Things
3. Identifiers in the Scholarly Space
4. Open Access Case Study
5. Recommendations & Best Practices
Provide structured, authoritative
data about two entity types
Institutions
• Identify Database
• Identify Audit Service: data
normalization
• Consortia Directory Online
Scholarly Works
Books & ebooks: Abstracts and
bibliographic data to drive
discovery, purchase, and use
Ringgold’s Vision
A scholarly supply chain where
information and data about
subscribers, authors, readers,
and content is able to flow easily
and without confusion. Trust in
data increases, and more
confident decisions can be made.
Founded 2005
Identify & Ringgold ID:
Broad Adoption by Publishers &
Intermediaries…
• Copyright Clearance Center – RightsLink for Open Access
• Aries Editorial Manager
• ScholarOne Manuscripts (coming soon)
• ORCID – Identify used for institutional affiliation module
• Advantage Computing Systems – multiple client integrations
• SalesForce – multiple client integrations
• 50+ scholarly publishers such as Elsevier, OUP, CUP, ProQuest,
PLOS, Sage, T&F, Wolters Kluwer, Kudos…
…and by Associations
• AAAS
• American Academy of Pediatrics
• American Chemical Society
• American College of Physicians
• American Institute of Aeronautics and
Astronautics
• American Institute of Physics
• American Medical Association
• American Psychological Association
• American Society for Microbiology
• American Society of Hematology
• American Thoracic Society
• ASTM International
• British Editorial Society of Bone &
Joint Surgery
• Institute of Engineering and
Technology (IET)
• IOP Publishing
• Massachusetts Medical Society (NEJM)
• Radiological Society of North America
• RCN Publishing
• The Royal Society
• Royal Society of Chemistry
• SAO/NASA Astrophysical Data System
• Society for Exploration Geophysicists
• Society for Neuroscience
ENTITY MANAGEMENT IN
SCHOLARLY RESEARCH
People, Places, and Content
What do
we need
to identify
&
describe?
Authors, Members,
Editors, Readers,
Researchers
Licensees,
Publishers,
Funders,
Intermediaries,
Affiliations
Books,
Journals,
Articles,
Grants,
Citations
What are standard identifiers?
• Numeric or alpha-numeric persistent designations associated with a
single entity
• Entities can be an institution, person, or piece of content
Disambiguate aka enforce
uniqueness
Institutions that have the same name, but are actually different…
Ringgold ID 1848
Northeastern University (Boston, USA)
Ringgold ID 12434
Northeastern University (Shenyang, China)
*This means data that can be linked together through
unambiguous identification and exchanged with others
Governed
Trusted
Transparent
And link to appropriate metadata
In order to be effective, identifiers must be:
What can high
quality data
help us to
achieve?
Tactical
support for
day to day
operations
Strategic
planning
Develop
new
information
IDENTIFIERS: A SAMPLING
Differences in scope & purpose
Personal Identifiers
International Standard Name Identifier (ISNI)
www.isni.org
Open Researcher and Contributor ID (ORCID)
www.orcid.org
And many proprietary IDs:
Scopus Author ID - Elsevier
ResearcherID - Thomson Reuters
ORCID: Open
Researcher &
Contributor ID
ISNI: International Standard Name
Identifier
• ISO Standard 27729
• ISNI is designed to be
a “bridge identifier”
to join other data
sets.
• Covers any type of
entity that might
appear in a library
catalogue record:
person or institution.
ISNI Number
Party ID 2Ringgold ID
Identify
Metadata &
Hierarchies
Proprietary
Information and/or
Metadata
ISNI – Personal Record
Institutional Identifiers
• ISNI
• Ringgold Identifier
• FundRef
ISNI – Institutional Record
Ringgold Identifier:
Institutions currently active in the scholarly
supply chain
• Numerical ID applied to each record
in the Identify Database
• Format is 4-6 digits
• Created by Ringgold’s team of
researchers (manual process)
• Global coverage
• All sectors: .edu, .gov., .com, .org
• All roles: licensees, publishers,
funders, intermediaries
• 415K institutions and growing
Identify Database in 3:
Identifies unique institutions…..
Catalogs & describes each using
Identify Data Elements
Current
• Ringgold Identifier
• Name: official & alternatives
• Location
• URL/domain
• Size metrics
• Tier assignments: JISC,
Carnegie, Ringgold
• Authentication: Athens, IPs
• Ringgold Type: sector & subject
• Links: Hierarchical & consortia
• ISNI matched to each Identify
record
• Expanded descriptive
metadata:
– Granular subjects
– Reach, sites
– Economic model, governance
– Level within hierarchy
– Mission, description
– Activity status
Organizes them into hierarchies
(aka “family trees”)
….of institutions
FundRef
IDENTIFIERS IN SUPPORT OF
OPEN ACCESS
CCC’s RightsLink for Open Access
Challenge: Correct Application
of APC Discounts / Waivers
• Multiple systems & data sources involved
• Lack of standard references
• Complex criteria + complex institutional
relationships
Solution: Get everyone speaking the same
language
RightsLink for Open Access: Typical Workflow
Publisher
•Provides CCC a list of eligible institutions with Ringgold IDs,
per Ringgold Identify Audit or CCC mapping
Author
•Chooses affiliation from Ringgold-enabled list via
•MSS submission (Aries, ScholarOne) OR
•CCC RightsLink for Open Access
CCC
•Captures author affiliation and Ringgold ID
•Compares affiliation to list of eligible institutions using
Ringgold IDs and hierarchies
•Calculates proper APC
Author view: CCC
Discounts can be triggered via Identify hierarchy
or other metadata elements
• Correct APC
discounts &
waivers applied
• Publisher’s
business rules
supported
• Records enhanced
with key metadata
and Ringgold IDs
• Reporting fueled
by accurate data
Elements of Best Practice at Work
 Data integration & linking
 Free text entries for
affiliations are minimized in
webforms
 Single institutional
authority file
 Hierarchical links provide
relational authority
 Powers business intelligence
Individual + Institution IDs
=
Benefits for Stakeholders
• Funders – Want to track areas of interest, identify worthwhile pursuits, and see where
their money goes
• Institutions – Demonstrate research output more accurately and precisely describe the
institution’s contribution and who is affiliated with that work
• Publishers – Facilitate transactions of all types from content discovery to delivery of
author royalties. Improved market analysis and targeted advertising. Accurately deliver
content, calculate APCs
• Societies – Understand their membership, total constituency, and institutional impact
RECOMMENDATIONS &
CONSIDERATIONS
Practical advice about what to do when approaching identifiers
Recommendations
• Create the most complete metadata
possible – for all entity types & roles
• Adhere to standards and identifiers
which are broadly adopted (e.g.
ORCID, ISBN, ISSN, DOIs, Ringgold ID,
ISNI)
• Require authors & members to
establish an ORCID profile
• Minimize manual entry of data
• Improve data capture to require an ID
upon record creation
Considerations
• Which records & entities require IDs?
– Old vs new
• How to apply them?
– In-house resources required to join existing records to IDs or authority
file
– Outside partners to help w metadata development, application of IDs,
data strategy
• How to leverage them?
– New reporting & analysis tools to leverage newly linked datasets
THANK YOU
Christine Orr
Sales Director, North America
christine.orr@ringgold.com
Tel: 540.359.6620
http://orcid.org/0000-0003-1362-3330
www.ringgold.com @ringgoldinc
Clean Data. Confident Decisions.

Institutional Identifiers in Practice: Christine Orr at CESSE 2015

  • 1.
    Institutional Identifiers inPractice 16 July 2015
  • 2.
    Today’s Agenda 1. RinggoldIntroduction 2. Entity Management: People, Places, and Things 3. Identifiers in the Scholarly Space 4. Open Access Case Study 5. Recommendations & Best Practices
  • 3.
    Provide structured, authoritative dataabout two entity types Institutions • Identify Database • Identify Audit Service: data normalization • Consortia Directory Online Scholarly Works Books & ebooks: Abstracts and bibliographic data to drive discovery, purchase, and use
  • 4.
    Ringgold’s Vision A scholarlysupply chain where information and data about subscribers, authors, readers, and content is able to flow easily and without confusion. Trust in data increases, and more confident decisions can be made. Founded 2005
  • 5.
    Identify & RinggoldID: Broad Adoption by Publishers & Intermediaries… • Copyright Clearance Center – RightsLink for Open Access • Aries Editorial Manager • ScholarOne Manuscripts (coming soon) • ORCID – Identify used for institutional affiliation module • Advantage Computing Systems – multiple client integrations • SalesForce – multiple client integrations • 50+ scholarly publishers such as Elsevier, OUP, CUP, ProQuest, PLOS, Sage, T&F, Wolters Kluwer, Kudos…
  • 6.
    …and by Associations •AAAS • American Academy of Pediatrics • American Chemical Society • American College of Physicians • American Institute of Aeronautics and Astronautics • American Institute of Physics • American Medical Association • American Psychological Association • American Society for Microbiology • American Society of Hematology • American Thoracic Society • ASTM International • British Editorial Society of Bone & Joint Surgery • Institute of Engineering and Technology (IET) • IOP Publishing • Massachusetts Medical Society (NEJM) • Radiological Society of North America • RCN Publishing • The Royal Society • Royal Society of Chemistry • SAO/NASA Astrophysical Data System • Society for Exploration Geophysicists • Society for Neuroscience
  • 7.
    ENTITY MANAGEMENT IN SCHOLARLYRESEARCH People, Places, and Content
  • 8.
    What do we need toidentify & describe? Authors, Members, Editors, Readers, Researchers Licensees, Publishers, Funders, Intermediaries, Affiliations Books, Journals, Articles, Grants, Citations
  • 9.
    What are standardidentifiers? • Numeric or alpha-numeric persistent designations associated with a single entity • Entities can be an institution, person, or piece of content
  • 10.
    Disambiguate aka enforce uniqueness Institutionsthat have the same name, but are actually different… Ringgold ID 1848 Northeastern University (Boston, USA) Ringgold ID 12434 Northeastern University (Shenyang, China)
  • 11.
    *This means datathat can be linked together through unambiguous identification and exchanged with others Governed Trusted Transparent And link to appropriate metadata In order to be effective, identifiers must be:
  • 12.
    What can high qualitydata help us to achieve? Tactical support for day to day operations Strategic planning Develop new information
  • 13.
  • 14.
    Personal Identifiers International StandardName Identifier (ISNI) www.isni.org Open Researcher and Contributor ID (ORCID) www.orcid.org And many proprietary IDs: Scopus Author ID - Elsevier ResearcherID - Thomson Reuters
  • 15.
  • 16.
    ISNI: International StandardName Identifier • ISO Standard 27729 • ISNI is designed to be a “bridge identifier” to join other data sets. • Covers any type of entity that might appear in a library catalogue record: person or institution. ISNI Number Party ID 2Ringgold ID Identify Metadata & Hierarchies Proprietary Information and/or Metadata
  • 17.
  • 18.
    Institutional Identifiers • ISNI •Ringgold Identifier • FundRef
  • 19.
  • 20.
    Ringgold Identifier: Institutions currentlyactive in the scholarly supply chain • Numerical ID applied to each record in the Identify Database • Format is 4-6 digits • Created by Ringgold’s team of researchers (manual process) • Global coverage • All sectors: .edu, .gov., .com, .org • All roles: licensees, publishers, funders, intermediaries • 415K institutions and growing
  • 21.
    Identify Database in3: Identifies unique institutions…..
  • 22.
    Catalogs & describeseach using Identify Data Elements Current • Ringgold Identifier • Name: official & alternatives • Location • URL/domain • Size metrics • Tier assignments: JISC, Carnegie, Ringgold • Authentication: Athens, IPs • Ringgold Type: sector & subject • Links: Hierarchical & consortia • ISNI matched to each Identify record • Expanded descriptive metadata: – Granular subjects – Reach, sites – Economic model, governance – Level within hierarchy – Mission, description – Activity status
  • 23.
    Organizes them intohierarchies (aka “family trees”)
  • 24.
  • 25.
  • 26.
    IDENTIFIERS IN SUPPORTOF OPEN ACCESS CCC’s RightsLink for Open Access
  • 27.
    Challenge: Correct Application ofAPC Discounts / Waivers • Multiple systems & data sources involved • Lack of standard references • Complex criteria + complex institutional relationships Solution: Get everyone speaking the same language
  • 28.
    RightsLink for OpenAccess: Typical Workflow Publisher •Provides CCC a list of eligible institutions with Ringgold IDs, per Ringgold Identify Audit or CCC mapping Author •Chooses affiliation from Ringgold-enabled list via •MSS submission (Aries, ScholarOne) OR •CCC RightsLink for Open Access CCC •Captures author affiliation and Ringgold ID •Compares affiliation to list of eligible institutions using Ringgold IDs and hierarchies •Calculates proper APC
  • 29.
  • 30.
    Discounts can betriggered via Identify hierarchy or other metadata elements
  • 31.
    • Correct APC discounts& waivers applied • Publisher’s business rules supported • Records enhanced with key metadata and Ringgold IDs • Reporting fueled by accurate data
  • 32.
    Elements of BestPractice at Work  Data integration & linking  Free text entries for affiliations are minimized in webforms  Single institutional authority file  Hierarchical links provide relational authority  Powers business intelligence
  • 33.
    Individual + InstitutionIDs = Benefits for Stakeholders • Funders – Want to track areas of interest, identify worthwhile pursuits, and see where their money goes • Institutions – Demonstrate research output more accurately and precisely describe the institution’s contribution and who is affiliated with that work • Publishers – Facilitate transactions of all types from content discovery to delivery of author royalties. Improved market analysis and targeted advertising. Accurately deliver content, calculate APCs • Societies – Understand their membership, total constituency, and institutional impact
  • 34.
    RECOMMENDATIONS & CONSIDERATIONS Practical adviceabout what to do when approaching identifiers
  • 35.
    Recommendations • Create themost complete metadata possible – for all entity types & roles • Adhere to standards and identifiers which are broadly adopted (e.g. ORCID, ISBN, ISSN, DOIs, Ringgold ID, ISNI) • Require authors & members to establish an ORCID profile • Minimize manual entry of data • Improve data capture to require an ID upon record creation
  • 36.
    Considerations • Which records& entities require IDs? – Old vs new • How to apply them? – In-house resources required to join existing records to IDs or authority file – Outside partners to help w metadata development, application of IDs, data strategy • How to leverage them? – New reporting & analysis tools to leverage newly linked datasets
  • 38.
    THANK YOU Christine Orr SalesDirector, North America [email protected] Tel: 540.359.6620 http://orcid.org/0000-0003-1362-3330 www.ringgold.com @ringgoldinc Clean Data. Confident Decisions.

Editor's Notes

  • #9  And very often, the same person might play different roles – and therefore be found in multiple data sets. SLA example.
  • #11 Also: there are plenty of institutions which share the same acronym, or are known by different official names – think different languages. This happens more often than you think…… So, you cannot disambiguate on name alone – you’d need a second data element, location…and that’s inefficient, and makes for poor data quality control & management.
  • #12 You’ll note that I’ve used the term “Standard Identifiers” as opposed to just “Standards”… I’ll be focusing on using standard identifiers as the main data hooks that will allow us to aggregate information for the purpose of synthesizing knowledge. Interoperability implies communication; how we communicate something is very different than how we describe things. I should take a moment to clarify that the Ringgold ID is not a standard – not an ISO Certified standard, in any case, but in many cases, our data has become a defacto standard through application; some of you might be wondering what then constitutes a big “S” Standards – if any system uses a predefined taxonomy as an authority file to validate data (thereby achieving identify data entries for each and every instance it is needed) then a standard has been achieved. How data is exchanged is quite different than the data itself, and of course, standards may be applied to both. For my part, I’m going to talk about the data itself, not how it is exchanged– So, in terms of the data itself, what are we trying to standardize? Descriptions – the wrapper around highly unique content. More importantly, as an industry—as a species, really—we are creating data elements that can be interpreted by machines – I should say, easily interpreted by machines – I’ll come back to this topic near the end of my presentation.
  • #13 Regardless of our business model, open access or traditional subscriptions, we need to make decisions upon a solid foundation, interoperate with all manner of systems, and facilitate sharing of data throughout the virtuous cycle of research, from researcher to reader. Tactical: customer service, APC calculation & application, helping different systems to interoperate & share data. Strategic: move away from “anecdata” make business decisions based on facts and trends. Analysis – how’s that OA journal doing? Develop new information: Create data overlays that may tell us new stories, and help us develop new services. Answer questions we’ve never been able to confidently state before: where have we got authors and no subscriptions? Overlay authors & reviewers to ensure there is no conflict of interest. Reveal new interdisciplinary collaborations.
  • #16 More than 1.4M and counting,
  • #18 International Standard Name Identifier: Scope = entities that may be found in library catalogs, ie, published works. Creators, publishers, subjects. Covers people (past, present, real, fictional, groups) and institutions such as publishers.
  • #19 Of most interest in the OA discussion…..
  • #21 ID applied to each record in a Ringgold Identify Database record; here is a sample.
  • #22 What it is:
  • #26 Funders currently active. This is a registry run by CrossRef, populated via data which is sourced from Elsevier.
  • #28 Most publishers offer some level of discount or waiver based on affiliation: specific institutions, classes of institutions (per country or org type).
  • #33 Authors don’t need to know their RIN, multiple orgs using the same standards & vocabulary. Prevent duplicate account / record creation Breaks down silos by enabling systems to be compared and linked Keep data up-to-date and systems synchronised Improve accuracy in reporting
  • #35 34
  • #36 35
  • #38 Identifiers become more meaningful, more powerful, more effective, when they are used broadly and frequently.