SlideShare a Scribd company logo
dkNET: Capabilities and the FAIR Principles
Dr. Jeffrey Grethe & Dr. Maryann Martone
June 16, 2016
The NIDDK Information Network (dkNET) - A
Community Research Resource, Information,
and Data Discovery Portal
Jeffrey S. Grethe, Ph.D.
September 8, 2016
bioCADDIE Webinar
Community Data and
Information Resources
• Over 3500 resources have been identified
– Researchers can’t visit them all
– Most content from these resources not easily found
through standard search engines
– Even more structured content on the web
• Resources provide domain specific views of data
– Must provide a snapshot of information in a simple to
understand form that can be further explored in the native
database
– Must provide a biomedical science based semantic
framework for resource description and search
Building a Community Resource,
Information, and Data Index
• It is quite difficult to access, collate, and filter the incredible array of
information and data in the public domain.
• dkNET was established in recognition of the need to interconnect
research communities, both basic and clinical, by providing seamless
access to large pools of resources, information, and data relevant to the
mission of the Neuroscience Blueprint.
•dkNET provides a single point-of-entry for discovering this information
to allow researchers to make good decisions in the research process,
thereby increasing both the speed and efficiency of the process.
The FAIR Principles and dkNET
Findable: A core aspect of dkNET’s mission
- facilitate access to a collection of diverse
research resources
Accessible: Assist users by directing them
to available resources
Interoperable: Provide unified views of
certain data and information across resources
Reusable: Supports community standards
and develops standard representations for
resources
Attribution and Recognition: Leading
initiative to unambiguously cite resources via
Research Resource Identifiers
Applies to data,
resources, knowledge,
and information
dkNET: A “Resource” Discovery Index
Datab
ase
Software
Application
Data Analysis
Service
Topical
Portal
Core
Facility
Ontol
ogy
Software
Resource
Year
s:
dkNET Offers…
• A search portal to find community-
vetted research resources:
materials, data and tools for your
research
• Access to hundreds of databases
across biomedicine with one easy
search
• Personalized search and display
of results
• Information hub for community
news and social networking
• Support for resource providers
dkNET Investigator's RetreatJune16, 2014
Shared
Resources
dkNET is built on the SciCrunch platform
Resource Identification Portal
Neuroscience
dkNET
Drug Design
• Pull from and
push to a
common pool
of data
• Customize
according to
communities
• Common
interface
elements
across
platforms
National Institute of
Diabetes and
Digestive and
Kidney Diseases
Information
Network
Neuroscience
Information
Framework
Open Data
Commons For
Spinal Cord Injury
Shared Infrastructure…
Drug Design Data Resource Resource Identification
Initiative
Shared Costs…
Community Specific Views
Shared Infrastructure: dkNET and SciCrunch
• Community “Apps”:
Portal features driven by
individual communities
can be re-used by other
communities
• Community Resource
Curation: All
communities have
access to shared data
platform
Can utilize portal infrastructure for challenges being
developed by another SciCrunch project
Resource IDs from aggregated databases
RRIDs: a single portal
for authors
• >25 authoritative
databases
• One search
interface
• Simple directions
• Prominent “Cite
This” button
Specialized Portals
Overview of the dkNET Portal
Results are returned in specific tabs. Community resources
are presented first if they are available. But all searches
can be executed across all resources.
Community
More
resources
Literature
Specialized
NIDDK-relevant
resources
200+ database
and information
resources
across
biomedicine
Literature
dkNET’s Three Primary Resource Views
Community resources
NURSA
BCBC
DiaComp
GUDMAP
T1DB
MMPC
NIDDK Central Repository
dkNET Community Funding
AddGene: Plasmids
Clinical trials.gov: Clinical trials and data
Antibody Registry: 2M+ antibodies
Integrated animals: Model organism databases
SciCrunch Resource Registry: 13K+ tools
Grants.gov
NIDDK-funded resources and centers Additional relevant databases vis SciCrunch
dkNET connects resources
Social media
Research resource analytics
Other SciCrunch communities
More
resources
LiteratureCommunity
RRID’s
Each index is organized slightly differently
Community More resources Literature
Navigation differs across indices
Resource
categories
Sources
Data
and/or
resource
type
System
level
Individual
records
Source
Functions
and filters
Results
display
Functions
and filters
Results
display
Collections
and
Analytics
Analytics
Facets
Articles
Functions
and filters
Results
display
Structure of dkNET Resources
● 3 main sections
○ Community
resources
○ More resources
○ Literature
● Faceted search to
drill down into a
single source
● Deep search into
information and data
resources
bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Research Resource, Information, and Data Discovery Portal
Viewing Community Resources
Viewing More Resources
Viewing the Literature
Community Resource
Curation and Maintenance
Pilot and
Feasibility
Program
Database
Information Backstop for Retired Resources
User Notifications: Save Searches
Provide user regular updates associated with user’s
saved search via webpage alert and e-mail.
A core community resource: SciCrunch Registry
• Automated text mining is used to
look for “web page last updated” or
copyright dates
Identified for 570 resources
373 were not updated within the last 2
years (65%)
• Manual review of ~200 resources
38 not updated within the past 2 years
(~20%)
8 migrated to new addresses or
institutions
7 are no longer in service (~3%)
3 were deemed no longer appropriate
Tracking digital resources since 2008
Text Mining for Resources
User Notifications: Resource Subscriptions
1. Users can track their
resources
2. Provides updated
information on citations of
their resources
3. Resource mentions are
retrieved from external
data source, text mining,
and user submission
Subscribe
User Notifications: Resource Subscriptions
Resource Analytics
Facilitating broad use of
biomedical digital assets
Aggregation and Discovery platforms like dkNETcan help
mediate this complex world, much as the journals do for
publishing articles
RRIDs
An easy, practical method for improving
reproducibility and transparency
Unique ID’s for all! Resource Identification Initiative
• It is currently impossible to
query the biomedical literature
to find out what research
resources have been used to
produce the results of a study
• authors don’t provide enough
information to unambiguously
identify key research resources
• Impossible to find all studies
that used a resource
• Critical for reproducibility and
data mining
• Critical for trouble-shooting
Faulty Antibodies Continue to Enter
US and European Markets, Warns Top
Clinical Chemistry Researcher-
Genome Web Daily, October 11, 2013
Digital objects are a new beast
RRID: Provides foundation for establishing
an alerting service for research resources
Trust: Not just
who produced it
but what
produced it
bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Research Resource, Information, and Data Discovery Portal
But the author knows what was used!
This author got back to us within 2 hours with the
stock number of this mouse
Hypothes.is annotation
overlay allows anyone to
leave notes like this
stock number from JAX
Authors supply database accession numbers for
key research resources used in research
Paper published in participating journal
Materials automatically detected by SciBot(s)
Made openly available
Increased identifiability of resources after the
Resource Identification Initiative Pilot
Bandrowski et al, 2015
New Authentication of Key
Biological Resources
Guidelines are already affecting
most NIH applications (May
2016 deadline)
Which Journals now ask for RRIDs?
~25 Elsevier Journals –
typesetting + App
BMC – checklist,
typesetting*
Frontiers – moving to
typesetting*
Cell Press – author
nagging, typesetting
Wiley – author nagging,
typesetting
Working with
Endocrine Society for
incorporation of
RRIDs
dkNET for Developers…
Application Programming Interfaces (APIs)
1. API specification, infrastructure
and initial services for push and
pull of data
2. Service documentation using
Swagger.io
3. Portal dashboard for requesting API
keys (e.g. Project based keys)
4. Future: Additional services for
resources
5. Future: Applets using the published
APIs that can be embedded on
external sites
RRID Resolver Service
dkNET and the NIH BD2K
Data Discovery Index
(bioCADDIE)
A Focus on the Community
e.g. NIH Commons
Interactions with Community Aggregators
Global Data Set Index
Community
Aggregation
Best
Practices
Community
Resources
$ Funding
Protocols
Thank You...

More Related Content

PDF
dkNET Poster ENDO 2016
dkNET
 
PDF
dkNET-NURSA Challenge Kick-Off Webinar 04/27/2017
dkNET
 
PDF
dkNET ESP Meeting - February 2016
dkNET
 
PDF
dkNET Introductory Webinar 05/10/2017
dkNET
 
PDF
Dk net webinar tutorial pen
Maryann Martone
 
PPTX
What's new in dkNET 2.0
dkNET
 
PPTX
NISO Training Thursday Crafting a Scientific Data Management Plan
National Information Standards Organization (NISO)
 
PDF
Open Source Tools Facilitating Sharing/Protecting Privacy: Dataverse and Data...
Merce Crosas
 
dkNET Poster ENDO 2016
dkNET
 
dkNET-NURSA Challenge Kick-Off Webinar 04/27/2017
dkNET
 
dkNET ESP Meeting - February 2016
dkNET
 
dkNET Introductory Webinar 05/10/2017
dkNET
 
Dk net webinar tutorial pen
Maryann Martone
 
What's new in dkNET 2.0
dkNET
 
NISO Training Thursday Crafting a Scientific Data Management Plan
National Information Standards Organization (NISO)
 
Open Source Tools Facilitating Sharing/Protecting Privacy: Dataverse and Data...
Merce Crosas
 

What's hot (20)

PDF
DataTags, The Tags Toolset, and Dataverse Integration
Michael Bar-Sinai
 
PDF
Valen Metadata and the [Data] Repository
National Information Standards Organization (NISO)
 
PPTX
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
National Information Standards Organization (NISO)
 
PDF
Preparing Data for Sharing: The FAIR Principles
London School of Hygiene and Tropical Medicine
 
PPTX
ICPSR Data Exploration Tools
ICPSR
 
PDF
Data Repositories Impact
Merce Crosas
 
PPT
A Data Citation Roadmap for Scholarly Data Repositories
LIBER Europe
 
PPTX
BioPharma and FAIR Data, a Collaborative Advantage
Tom Plasterer
 
PPTX
The agINFRA Germplasm Working Group
Vassilis Protonotarios
 
PPTX
Linking Scientific Metadata (presented at DC2010)
Jian Qin
 
PDF
Dataverse, Cloud Dataverse, and DataTags
Merce Crosas
 
PDF
McGeary Data Curation Network: Developing and Scaling
National Information Standards Organization (NISO)
 
PPTX
FundRef, or Name That Funder!
Crossref
 
PDF
Managing, Sharing and Curating Your Research Data in a Digital Environment
philipdurbin
 
PDF
Johnston - How to Curate Research Data
National Information Standards Organization (NISO)
 
PDF
Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...
Merce Crosas
 
PDF
Integration of research literature and data (InFoLiS)
Philipp Zumstein
 
PPTX
FAIR data overview
Luiz Olavo Bonino da Silva Santos
 
PPTX
Data Sharing with ICPSR: Fueling the Cycle of Science through Discovery, Acce...
ICPSR
 
PDF
Alain Frey Research Data for universities and information producers
Incisive_Events
 
DataTags, The Tags Toolset, and Dataverse Integration
Michael Bar-Sinai
 
Valen Metadata and the [Data] Repository
National Information Standards Organization (NISO)
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
National Information Standards Organization (NISO)
 
Preparing Data for Sharing: The FAIR Principles
London School of Hygiene and Tropical Medicine
 
ICPSR Data Exploration Tools
ICPSR
 
Data Repositories Impact
Merce Crosas
 
A Data Citation Roadmap for Scholarly Data Repositories
LIBER Europe
 
BioPharma and FAIR Data, a Collaborative Advantage
Tom Plasterer
 
The agINFRA Germplasm Working Group
Vassilis Protonotarios
 
Linking Scientific Metadata (presented at DC2010)
Jian Qin
 
Dataverse, Cloud Dataverse, and DataTags
Merce Crosas
 
McGeary Data Curation Network: Developing and Scaling
National Information Standards Organization (NISO)
 
FundRef, or Name That Funder!
Crossref
 
Managing, Sharing and Curating Your Research Data in a Digital Environment
philipdurbin
 
Johnston - How to Curate Research Data
National Information Standards Organization (NISO)
 
Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...
Merce Crosas
 
Integration of research literature and data (InFoLiS)
Philipp Zumstein
 
Data Sharing with ICPSR: Fueling the Cycle of Science through Discovery, Acce...
ICPSR
 
Alain Frey Research Data for universities and information producers
Incisive_Events
 
Ad

Similar to bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Research Resource, Information, and Data Discovery Portal (20)

PDF
dkNET Poster Experimental Biology 2019
dkNET
 
PPTX
dkNET Introduction for Librarians
dkNET
 
PDF
dkNET Webinar: Discover the Latest from dkNET - Biomed Resource Watch 06/02/2023
dkNET
 
PPTX
dkNET 2.0 Tutorial
dkNET
 
PDF
dkNET Webinar: Discovering and Evaluating Antibodies, Cell Lines, Software To...
dkNET
 
PPTX
Identifying and tracking research resources using RRIDs: a practical approach
dkNET
 
PDF
Research Data Alliance (RDA) Webinar: What do you really know about that anti...
dkNET
 
PPTX
Martone acs presentation
Neuroscience Information Framework
 
PDF
dkNET Webinar: dkNET Hypothesis Center Live Demo 09/24/2021
dkNET
 
PDF
dkNET Annual Meeting - June 2017
dkNET
 
PDF
dkNET Poster ENDO 2019
dkNET
 
PPTX
Neuroscience as networked science
Neuroscience Information Framework
 
PDF
dkNET Introductory Webinar 03/22/2019
dkNET
 
PPTX
The Uniform Resource Layer
Neuroscience Information Framework
 
PPTX
Resource Identification Initiative
Maryann Martone
 
PPTX
Data-knowledge transition zones within the biomedical research ecosystem
Maryann Martone
 
PPTX
EMBL Australian Bioinformatics Resource AHM - Data Commons
Vivien Bonazzi
 
PPT
Yale Day of Data
Philip Bourne
 
PPTX
Will Biomedical Research Fundamentally Change in the Era of Big Data?
Philip Bourne
 
PPTX
Data commons bonazzi bd2 k fundamentals of science feb 2017
Vivien Bonazzi
 
dkNET Poster Experimental Biology 2019
dkNET
 
dkNET Introduction for Librarians
dkNET
 
dkNET Webinar: Discover the Latest from dkNET - Biomed Resource Watch 06/02/2023
dkNET
 
dkNET 2.0 Tutorial
dkNET
 
dkNET Webinar: Discovering and Evaluating Antibodies, Cell Lines, Software To...
dkNET
 
Identifying and tracking research resources using RRIDs: a practical approach
dkNET
 
Research Data Alliance (RDA) Webinar: What do you really know about that anti...
dkNET
 
Martone acs presentation
Neuroscience Information Framework
 
dkNET Webinar: dkNET Hypothesis Center Live Demo 09/24/2021
dkNET
 
dkNET Annual Meeting - June 2017
dkNET
 
dkNET Poster ENDO 2019
dkNET
 
Neuroscience as networked science
Neuroscience Information Framework
 
dkNET Introductory Webinar 03/22/2019
dkNET
 
The Uniform Resource Layer
Neuroscience Information Framework
 
Resource Identification Initiative
Maryann Martone
 
Data-knowledge transition zones within the biomedical research ecosystem
Maryann Martone
 
EMBL Australian Bioinformatics Resource AHM - Data Commons
Vivien Bonazzi
 
Yale Day of Data
Philip Bourne
 
Will Biomedical Research Fundamentally Change in the Era of Big Data?
Philip Bourne
 
Data commons bonazzi bd2 k fundamentals of science feb 2017
Vivien Bonazzi
 
Ad

More from dkNET (20)

PDF
dkNET Webinar "Introduction to AI-READI, Studying Salutogenesis in T2DM" 10/1...
dkNET
 
PDF
dkNET Webinar: Single Cell Multi-Omics Analysis of Beta Cell Heterogeneity an...
dkNET
 
PDF
dkNET Webinar: The 4DN Data Portal - Data, Resources and Tools to Help Elucid...
dkNET
 
PDF
dkNET Office Hours: NIH Data Management and Sharing Mandate 05/03/2024
dkNET
 
PDF
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET
 
PDF
dkNET Webinar: Unlocking the Power of FAIR Data Sharing with ImmPort 04/12/2024
dkNET
 
PDF
dkNET Webinar: Tabula Sapiens 03/22/2024
dkNET
 
PDF
dkNET Webinar "The Multi-Omic Response to Exercise Training Across Rat Tissue...
dkNET
 
PDF
dkNET Webinar: The Collaborative Microbial Metabolite Center – Democratizing ...
dkNET
 
PDF
dkNET Webinar: An Encyclopedia of the Adipose Tissue Secretome to Identify Me...
dkNET
 
PDF
dkNET Webinar: A Single Cell Atlas of Human and Mouse White Adipose Tissue 11...
dkNET
 
PDF
dkNET Webinar "The National Sleep Research Resource (NSRR) - Opportunities fo...
dkNET
 
PDF
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
dkNET
 
PDF
dkNET Webinar: Leveraging Computational Strategies to Identify Type 1 Diabete...
dkNET
 
PDF
dkNET Webinar: Estimating Relative Beta-Cell Function During Continuous Gluco...
dkNET
 
PDF
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
dkNET
 
PDF
dkNET Webinar: Postpartum Glucose Screening Among Homeless Women with Gestati...
dkNET
 
PDF
dkNET Webinar: Choosing Sample Sizes for Multilevel and Longitudinal Studies ...
dkNET
 
PDF
dkNET Webinar: : FAIR Data Curation of Antibody/B-cell and T-cell Receptor Se...
dkNET
 
PDF
dkNET Office Hours - "Are You Ready for 2023? New NIH Data Management and Sha...
dkNET
 
dkNET Webinar "Introduction to AI-READI, Studying Salutogenesis in T2DM" 10/1...
dkNET
 
dkNET Webinar: Single Cell Multi-Omics Analysis of Beta Cell Heterogeneity an...
dkNET
 
dkNET Webinar: The 4DN Data Portal - Data, Resources and Tools to Help Elucid...
dkNET
 
dkNET Office Hours: NIH Data Management and Sharing Mandate 05/03/2024
dkNET
 
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET
 
dkNET Webinar: Unlocking the Power of FAIR Data Sharing with ImmPort 04/12/2024
dkNET
 
dkNET Webinar: Tabula Sapiens 03/22/2024
dkNET
 
dkNET Webinar "The Multi-Omic Response to Exercise Training Across Rat Tissue...
dkNET
 
dkNET Webinar: The Collaborative Microbial Metabolite Center – Democratizing ...
dkNET
 
dkNET Webinar: An Encyclopedia of the Adipose Tissue Secretome to Identify Me...
dkNET
 
dkNET Webinar: A Single Cell Atlas of Human and Mouse White Adipose Tissue 11...
dkNET
 
dkNET Webinar "The National Sleep Research Resource (NSRR) - Opportunities fo...
dkNET
 
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
dkNET
 
dkNET Webinar: Leveraging Computational Strategies to Identify Type 1 Diabete...
dkNET
 
dkNET Webinar: Estimating Relative Beta-Cell Function During Continuous Gluco...
dkNET
 
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
dkNET
 
dkNET Webinar: Postpartum Glucose Screening Among Homeless Women with Gestati...
dkNET
 
dkNET Webinar: Choosing Sample Sizes for Multilevel and Longitudinal Studies ...
dkNET
 
dkNET Webinar: : FAIR Data Curation of Antibody/B-cell and T-cell Receptor Se...
dkNET
 
dkNET Office Hours - "Are You Ready for 2023? New NIH Data Management and Sha...
dkNET
 

Recently uploaded (20)

PPTX
Temperature Mapping in Pharmaceutical.pptx
Shehar Bano
 
PPTX
Pharmacotherapy of Myasthenia Gravis- Dr. Anurag Sharma (1).pptx
Anurag Sharma
 
PPTX
Anatomy of eyelids basic anatomy covered along with abnormalities of eyelids
SummyBhatnagar1
 
PPTX
IMPORTANCE of WORLD ORS DAY July 29 & ORS.pptx
MedicalSuperintenden19
 
PPTX
perioperative management and ERAS protocol.pptx
Fahad Ahmad
 
PPTX
Statistical Method For Evaluating Medication Safety Data Pharmacology (1).pptx
Sakshi Ravankar
 
PPTX
5.Gene therapy for musculoskeletal system disorders.pptx
Bolan University of Medical and Health Sciences ,Quetta
 
PPTX
Describe Thyroid storm & it’s Pharmacotherapy Drug Interaction: Pyridoxine + ...
Dr. Deepa Singh Rana
 
PPTX
Models of screening of Adrenergic Blocking Drugs.pptx
Dr Fatima Rani
 
DOCX
RUHS II MBBS Pharmacology Paper-I with Answer Key | 26 July 2025 (New Scheme)
Shivankan Kakkar
 
DOCX
RUHS II MBBS Pharmacology Paper-II with Answer Key | 28 July 2025 (New Scheme)
Shivankan Kakkar
 
PPTX
Chemical Burn, Etiology, Types and Management.pptx
Dr. Junaid Khurshid
 
PPTX
Sources, types and collection of data.pptx
drmadhulikakgmu
 
PPTX
Surgical management of colorectal cancer.pptx
Oladele Situ
 
PPTX
LOW GRADE GLIOMA MANAGEMENT BY DR KANHU CHARAN PATRO
Kanhu Charan
 
PPTX
Models for screening of Local Anaesthetics.pptx
AntoRajiv1
 
PPTX
A Detailed Overview of Sterols Chemistry, Sources, Functions and Applications...
Indranil Karmakar
 
PPTX
Transfusion of Blood Components – A Guide for Nursing Faculty.pptx
AbrarKabir3
 
PPT
Diagnosis-and-treatment-planning-in-CD - DR.SONIA.ppt
drsoniabithi1987
 
PPTX
Omphalocele: PowerPoint presentation
Nathan Lupiya
 
Temperature Mapping in Pharmaceutical.pptx
Shehar Bano
 
Pharmacotherapy of Myasthenia Gravis- Dr. Anurag Sharma (1).pptx
Anurag Sharma
 
Anatomy of eyelids basic anatomy covered along with abnormalities of eyelids
SummyBhatnagar1
 
IMPORTANCE of WORLD ORS DAY July 29 & ORS.pptx
MedicalSuperintenden19
 
perioperative management and ERAS protocol.pptx
Fahad Ahmad
 
Statistical Method For Evaluating Medication Safety Data Pharmacology (1).pptx
Sakshi Ravankar
 
5.Gene therapy for musculoskeletal system disorders.pptx
Bolan University of Medical and Health Sciences ,Quetta
 
Describe Thyroid storm & it’s Pharmacotherapy Drug Interaction: Pyridoxine + ...
Dr. Deepa Singh Rana
 
Models of screening of Adrenergic Blocking Drugs.pptx
Dr Fatima Rani
 
RUHS II MBBS Pharmacology Paper-I with Answer Key | 26 July 2025 (New Scheme)
Shivankan Kakkar
 
RUHS II MBBS Pharmacology Paper-II with Answer Key | 28 July 2025 (New Scheme)
Shivankan Kakkar
 
Chemical Burn, Etiology, Types and Management.pptx
Dr. Junaid Khurshid
 
Sources, types and collection of data.pptx
drmadhulikakgmu
 
Surgical management of colorectal cancer.pptx
Oladele Situ
 
LOW GRADE GLIOMA MANAGEMENT BY DR KANHU CHARAN PATRO
Kanhu Charan
 
Models for screening of Local Anaesthetics.pptx
AntoRajiv1
 
A Detailed Overview of Sterols Chemistry, Sources, Functions and Applications...
Indranil Karmakar
 
Transfusion of Blood Components – A Guide for Nursing Faculty.pptx
AbrarKabir3
 
Diagnosis-and-treatment-planning-in-CD - DR.SONIA.ppt
drsoniabithi1987
 
Omphalocele: PowerPoint presentation
Nathan Lupiya
 

bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Research Resource, Information, and Data Discovery Portal

  • 1. dkNET: Capabilities and the FAIR Principles Dr. Jeffrey Grethe & Dr. Maryann Martone June 16, 2016 The NIDDK Information Network (dkNET) - A Community Research Resource, Information, and Data Discovery Portal Jeffrey S. Grethe, Ph.D. September 8, 2016 bioCADDIE Webinar
  • 2. Community Data and Information Resources • Over 3500 resources have been identified – Researchers can’t visit them all – Most content from these resources not easily found through standard search engines – Even more structured content on the web • Resources provide domain specific views of data – Must provide a snapshot of information in a simple to understand form that can be further explored in the native database – Must provide a biomedical science based semantic framework for resource description and search
  • 3. Building a Community Resource, Information, and Data Index • It is quite difficult to access, collate, and filter the incredible array of information and data in the public domain. • dkNET was established in recognition of the need to interconnect research communities, both basic and clinical, by providing seamless access to large pools of resources, information, and data relevant to the mission of the Neuroscience Blueprint. •dkNET provides a single point-of-entry for discovering this information to allow researchers to make good decisions in the research process, thereby increasing both the speed and efficiency of the process.
  • 4. The FAIR Principles and dkNET Findable: A core aspect of dkNET’s mission - facilitate access to a collection of diverse research resources Accessible: Assist users by directing them to available resources Interoperable: Provide unified views of certain data and information across resources Reusable: Supports community standards and develops standard representations for resources Attribution and Recognition: Leading initiative to unambiguously cite resources via Research Resource Identifiers Applies to data, resources, knowledge, and information
  • 5. dkNET: A “Resource” Discovery Index
  • 6. Datab ase Software Application Data Analysis Service Topical Portal Core Facility Ontol ogy Software Resource Year s: dkNET Offers… • A search portal to find community- vetted research resources: materials, data and tools for your research • Access to hundreds of databases across biomedicine with one easy search • Personalized search and display of results • Information hub for community news and social networking • Support for resource providers
  • 7. dkNET Investigator's RetreatJune16, 2014 Shared Resources dkNET is built on the SciCrunch platform Resource Identification Portal Neuroscience dkNET Drug Design • Pull from and push to a common pool of data • Customize according to communities • Common interface elements across platforms
  • 8. National Institute of Diabetes and Digestive and Kidney Diseases Information Network Neuroscience Information Framework Open Data Commons For Spinal Cord Injury Shared Infrastructure… Drug Design Data Resource Resource Identification Initiative Shared Costs… Community Specific Views
  • 9. Shared Infrastructure: dkNET and SciCrunch • Community “Apps”: Portal features driven by individual communities can be re-used by other communities • Community Resource Curation: All communities have access to shared data platform Can utilize portal infrastructure for challenges being developed by another SciCrunch project
  • 10. Resource IDs from aggregated databases RRIDs: a single portal for authors • >25 authoritative databases • One search interface • Simple directions • Prominent “Cite This” button Specialized Portals
  • 11. Overview of the dkNET Portal
  • 12. Results are returned in specific tabs. Community resources are presented first if they are available. But all searches can be executed across all resources. Community More resources Literature Specialized NIDDK-relevant resources 200+ database and information resources across biomedicine Literature dkNET’s Three Primary Resource Views
  • 13. Community resources NURSA BCBC DiaComp GUDMAP T1DB MMPC NIDDK Central Repository dkNET Community Funding AddGene: Plasmids Clinical trials.gov: Clinical trials and data Antibody Registry: 2M+ antibodies Integrated animals: Model organism databases SciCrunch Resource Registry: 13K+ tools Grants.gov NIDDK-funded resources and centers Additional relevant databases vis SciCrunch
  • 14. dkNET connects resources Social media Research resource analytics Other SciCrunch communities More resources LiteratureCommunity RRID’s
  • 15. Each index is organized slightly differently Community More resources Literature Navigation differs across indices Resource categories Sources Data and/or resource type System level Individual records Source Functions and filters Results display Functions and filters Results display Collections and Analytics Analytics Facets Articles Functions and filters Results display
  • 16. Structure of dkNET Resources ● 3 main sections ○ Community resources ○ More resources ○ Literature ● Faceted search to drill down into a single source ● Deep search into information and data resources
  • 21. Community Resource Curation and Maintenance Pilot and Feasibility Program Database
  • 22. Information Backstop for Retired Resources
  • 23. User Notifications: Save Searches Provide user regular updates associated with user’s saved search via webpage alert and e-mail.
  • 24. A core community resource: SciCrunch Registry
  • 25. • Automated text mining is used to look for “web page last updated” or copyright dates Identified for 570 resources 373 were not updated within the last 2 years (65%) • Manual review of ~200 resources 38 not updated within the past 2 years (~20%) 8 migrated to new addresses or institutions 7 are no longer in service (~3%) 3 were deemed no longer appropriate Tracking digital resources since 2008
  • 26. Text Mining for Resources
  • 27. User Notifications: Resource Subscriptions 1. Users can track their resources 2. Provides updated information on citations of their resources 3. Resource mentions are retrieved from external data source, text mining, and user submission Subscribe
  • 30. Facilitating broad use of biomedical digital assets Aggregation and Discovery platforms like dkNETcan help mediate this complex world, much as the journals do for publishing articles
  • 31. RRIDs An easy, practical method for improving reproducibility and transparency
  • 32. Unique ID’s for all! Resource Identification Initiative • It is currently impossible to query the biomedical literature to find out what research resources have been used to produce the results of a study • authors don’t provide enough information to unambiguously identify key research resources • Impossible to find all studies that used a resource • Critical for reproducibility and data mining • Critical for trouble-shooting Faulty Antibodies Continue to Enter US and European Markets, Warns Top Clinical Chemistry Researcher- Genome Web Daily, October 11, 2013
  • 33. Digital objects are a new beast RRID: Provides foundation for establishing an alerting service for research resources Trust: Not just who produced it but what produced it
  • 35. But the author knows what was used! This author got back to us within 2 hours with the stock number of this mouse Hypothes.is annotation overlay allows anyone to leave notes like this stock number from JAX
  • 36. Authors supply database accession numbers for key research resources used in research
  • 37. Paper published in participating journal
  • 38. Materials automatically detected by SciBot(s) Made openly available
  • 39. Increased identifiability of resources after the Resource Identification Initiative Pilot Bandrowski et al, 2015
  • 40. New Authentication of Key Biological Resources Guidelines are already affecting most NIH applications (May 2016 deadline)
  • 41. Which Journals now ask for RRIDs? ~25 Elsevier Journals – typesetting + App BMC – checklist, typesetting* Frontiers – moving to typesetting* Cell Press – author nagging, typesetting Wiley – author nagging, typesetting Working with Endocrine Society for incorporation of RRIDs
  • 43. Application Programming Interfaces (APIs) 1. API specification, infrastructure and initial services for push and pull of data 2. Service documentation using Swagger.io 3. Portal dashboard for requesting API keys (e.g. Project based keys) 4. Future: Additional services for resources 5. Future: Applets using the published APIs that can be embedded on external sites
  • 45. dkNET and the NIH BD2K Data Discovery Index (bioCADDIE) A Focus on the Community
  • 47. Interactions with Community Aggregators Global Data Set Index Community Aggregation Best Practices Community Resources $ Funding Protocols