SlideShare a Scribd company logo
Introduction to
Metadata
Application Profiles
DCMI Webinar
Karen Coyle
2018
Data silos
Data silos
DC BIBO MARC21
Data silos
MARC21 MARC21C21 MARC21
What are application profiles?
• Record your institution or project's choices
• Form a basis for developing a consensus around your own
data
• Express specific practices, rules
• Tell data consumers what to expect
Why do we need them?
• How can someone else understand your
data well enough to make use of it?
• Not unlike open source problem: you can
declare your code ‘open’ and wish people
‘good luck’ or you can provide support.
Who needs them?
• Creators: anyone providing data
• Users
• anyone who can/is allowed to access the data
• both people AND machines - not an either/or, but
should be both
What are they?
• Basic structure of the data
• the story that the data tells; what you are trying to say
• what are the things? how are they described?
• What are the properties and the rules for property use?
• What are the values?
How are they?
• What will a profile be? How can it be implemented?
• Documents (PDF)
• Spreadsheets
• Code (RDF, JSON, XML)
What does an application profile
look like?
An introduction to Metadata Application Profiles
An introduction to Metadata Application Profiles
An introduction to Metadata Application Profiles
An introduction to Metadata Application Profiles
Dublin Core and
Application Profiles
Dublin Core Singapore Framework for
Application Profiles (2007)
An introduction to Metadata Application Profiles
An introduction to Metadata Application Profiles
An introduction to Metadata Application Profiles
An introduction to Metadata Application Profiles
Domain model
Domain
model
An introduction to Metadata Application Profiles
Functional requirements
• Before developing any solutions, define problems
• Decide which problems you can solve
• State the requirements for success
An introduction to Metadata Application Profiles
Vocabularies
• Profiles reuse vocabularies
• Profiles can select from a single
vocabulary
• Profiles can extend a vocabulary
• Profiles can combine vocabularies
An introduction to Metadata Application Profiles
An introduction to Metadata Application Profiles
Term reuse & semantics
• Reuse can narrow semantics but should never contradict how
the term is defined at its origin
• Terms with strict definitions (e.g. OWL constraints, limits on
valid values, disjoint with other terms) are the hardest to
reuse
• Base vocabularies are best if they employ minimum semantic
commitment
An introduction to Metadata Application Profiles
Components of a profile
• Vocabulary
• Definitions
• Usage rules
• Cardinality of terms and values
• Examples
• Validation rules
This is not a full list!
An introduction to Metadata Application Profiles
An introduction to Metadata Application Profiles
An introduction to Metadata Application Profiles
An introduction to Metadata Application Profiles
An introduction to Metadata Application Profiles
An introduction to Metadata Application Profiles
Validation rules
• Can have foaf:name or (foaf:foreName + foaf:familyName)
• dct:date cannot be > 2020
• Subjects must be from http://id.loc.gov/authorities/subjects/
Validation
• Non-RDF (e.g. XML schema)
• SHACL – W3C recommendation (SHApes Constraint Language)
• https://www.w3.org/TR/shacl/
• ShEx – W3C community group (Shape Expressions)
• http://shex.io/
Validation
• Non-RDF (e.g. XML schema)
• SHACL – W3C recommendation (SHApes Constraint Language)
• https://www.w3.org/TR/shacl/
• ShEx – W3C community group (Shape Expressions)
• http://shex.io/
Validation
• Non-RDF (e.g. XML schema)
• SHACL – W3C recommendation (SHApes Constraint Language)
• https://www.w3.org/TR/shacl/
• ShEx – W3C community group (Shape Expressions)
• http://shex.io/
my:IssueShape {
ex:state [ex:unassigned
ex:assigned];
}
Not everything can be validated
• "Recommended" "Mandatory if applicable"
• Names, resource titles, other string-based data
Maintaining profiles
Profile maintenance
• Who maintains the profile?
• How will new terms be added?
• What can be changed?
• How can the profile be extended?
What we need so that
we can (easily) create
profiles
Some profile-related efforts
• Dublin Core (since the late 1990's) based on Singapore Framework
• http://dublincore.org/documents/singapore-framework/
• http://dublincore.org/documents/profile-guidelines/
• DXWG – Data eXchange Working Group, W3C, application profile guidance
(2017, due 2019)
• https://www.w3.org/2017/dxwg/wiki/Main_Page
Standard profile language(s)
• Core for the simplest needs, or for getting started
• shows domain model
• lists vocabulary terms
• can express basic rules for vocabulary members, especially cardinality & values
• documentation for human readers
Generic domain model - DC
Profile
Resource
Property
Value
"things"
"terms or elements"
"data"
MyBookCase
Profile: MyBookCase
Resource: Book
Resource: Person
http://dublincore.org/documents/profile-guidelines/
MyBookCase
Profile: MyBookCase
Resource: Book
Property: title
Property: author
Property: size
Resource: Person
Property: name
MyBookCase
Profile: MyBookCase
Resource: Book
Property: title
min:1, max:1
Property: author
min:0, max:3
Property: size
min:1, max:1
Resource: Person
Property: name
MyBookCase
Profile: MyBookCase
Resource: Book
Property: title
min:1, max:1
value type: literal
Property: author
min:0, max:3
value type: IRI
Property: size
min:1, max:1
value type: integer
Resource: Person
Property: name
An introduction to Metadata Application Profiles
APs as spreadsheets?
Can we make validation "easy"?
• Valid properties ✔
• Valid values ✔
• Value types
• Value lists (text or URIs)
• Conditional rules 
• If A not B
• A or (B & C)
Validation – bridging the gap
• Profile may need validation pseudo-code
• Pseudo-code -> validation standard (SHACL, ShEx)?
• What to do with non-actionable statements of validation (“mandatory if
applicable”)?
Summary: Functions of a profile
• Consensus-building
• Documentation
• Input/output control
• Validation (input and output and sharing)
Thank you
kcoyle@kcoyle.net
https://github.com/kcoyle/RDF-AP

More Related Content

What's hot (20)

PPT
Ontology Web services for Semantic Applications
Trish Whetzel
 
PPTX
RDA Toolkit Essentials 2015-09-24
jhennelly
 
PPTX
RDA Toolkit Essentials 2015-06-11
jhennelly
 
PPTX
RDA Toolkit Essentials 2014-12-17
jhennelly
 
PPTX
Sharepoint taxonomy introduction us
QUONTRASOLUTIONS
 
PPTX
RDA Toolkit Essentials 09.17.2014
jhennelly
 
PPTX
RDA Toolkit Essentials 2015-03-18
jhennelly
 
PPTX
01 18 rda toolkit essentials v6
jhennelly
 
PPTX
NISO/DCMI May 22 Webinar: Semantic Mashups Across Large, Heterogeneous Insti...
National Information Standards Organization (NISO)
 
PDF
Open Data Management for Public Automated Translation
Dave Lewis
 
PPTX
03.21 rda toolkit essentials
jhennelly
 
PPTX
RDA Toolkit Essentials - 06.18.2014
jhennelly
 
PPTX
RDA Toolkit Essentials webinar 03.19.14
jhennelly
 
PPTX
05.16 rda toolkit essentials
jhennelly
 
PDF
4. pentz orcid outreach_20121016
ORCID, Inc
 
PPTX
ORCID Update & Other Researcher Identifiers (2011 Annual Meeting)
Crossref
 
PPTX
2013 CrossRef Annual Meeting United in Preservation - Randy Kiefer and Kate W...
Crossref
 
PPT
Ed Pentz: Executive Summary #crossref15
Crossref
 
Ontology Web services for Semantic Applications
Trish Whetzel
 
RDA Toolkit Essentials 2015-09-24
jhennelly
 
RDA Toolkit Essentials 2015-06-11
jhennelly
 
RDA Toolkit Essentials 2014-12-17
jhennelly
 
Sharepoint taxonomy introduction us
QUONTRASOLUTIONS
 
RDA Toolkit Essentials 09.17.2014
jhennelly
 
RDA Toolkit Essentials 2015-03-18
jhennelly
 
01 18 rda toolkit essentials v6
jhennelly
 
NISO/DCMI May 22 Webinar: Semantic Mashups Across Large, Heterogeneous Insti...
National Information Standards Organization (NISO)
 
Open Data Management for Public Automated Translation
Dave Lewis
 
03.21 rda toolkit essentials
jhennelly
 
RDA Toolkit Essentials - 06.18.2014
jhennelly
 
RDA Toolkit Essentials webinar 03.19.14
jhennelly
 
05.16 rda toolkit essentials
jhennelly
 
4. pentz orcid outreach_20121016
ORCID, Inc
 
ORCID Update & Other Researcher Identifiers (2011 Annual Meeting)
Crossref
 
2013 CrossRef Annual Meeting United in Preservation - Randy Kiefer and Kate W...
Crossref
 
Ed Pentz: Executive Summary #crossref15
Crossref
 

Similar to An introduction to Metadata Application Profiles (20)

PPTX
Introduction to Application Profiles
Diane Hillmann
 
PPTX
DXWG Profiles Guidance & Vocabulary
njcar
 
PPTX
Exploiting the value of Dublin Core through pragmatic development
Paul Walk
 
PDF
State of the Art on methodologies for the development of Dublin Core Applicat...
Mariana Curado Malta
 
PPT
Application Profiles
Diane Hillmann
 
ODP
RDA-DCAM and Application Profiles
Mikael Nilsson
 
ODP
Learning Resource Metadata Initiative: Vocabulary Development Best Practices
Mike Linksvayer
 
ODP
DC-2008 Architecture Forum Open session
Mikael Nilsson
 
PDF
xAPI Live - The State of Standardization
RISC Inc
 
PPT
Semantics In Declarative Systems
Optum
 
PDF
Better Statements with Experience API (xAPI) Profiles
Russell Duhon
 
PDF
From UML Profiles to EMF Profiles and Beyond (TOOLS'11)
Philip Langer
 
PPT
Dublin Core Metadata Initiative Education Application Profile Task Group Meet...
Sarah Currier
 
PDF
Introduction to Data Management Planning
Erin Owens
 
PPT
Dcap Ja Progmeet 2007 07 05
Julie Allinson
 
PPT
SWAP : A Dublin Core Application Profile for desribing scholarly works
Julie Allinson
 
PDF
Smart Data Webinar: Choosing the Right Data Management Architecture for Cogni...
DATAVERSITY
 
PPTX
RIOXX: a Modern Metadata Application Profile
Paul Walk
 
PDF
Smart Data Webinar: Emerging Data Management Options
DATAVERSITY
 
Introduction to Application Profiles
Diane Hillmann
 
DXWG Profiles Guidance & Vocabulary
njcar
 
Exploiting the value of Dublin Core through pragmatic development
Paul Walk
 
State of the Art on methodologies for the development of Dublin Core Applicat...
Mariana Curado Malta
 
Application Profiles
Diane Hillmann
 
RDA-DCAM and Application Profiles
Mikael Nilsson
 
Learning Resource Metadata Initiative: Vocabulary Development Best Practices
Mike Linksvayer
 
DC-2008 Architecture Forum Open session
Mikael Nilsson
 
xAPI Live - The State of Standardization
RISC Inc
 
Semantics In Declarative Systems
Optum
 
Better Statements with Experience API (xAPI) Profiles
Russell Duhon
 
From UML Profiles to EMF Profiles and Beyond (TOOLS'11)
Philip Langer
 
Dublin Core Metadata Initiative Education Application Profile Task Group Meet...
Sarah Currier
 
Introduction to Data Management Planning
Erin Owens
 
Dcap Ja Progmeet 2007 07 05
Julie Allinson
 
SWAP : A Dublin Core Application Profile for desribing scholarly works
Julie Allinson
 
Smart Data Webinar: Choosing the Right Data Management Architecture for Cogni...
DATAVERSITY
 
RIOXX: a Modern Metadata Application Profile
Paul Walk
 
Smart Data Webinar: Emerging Data Management Options
DATAVERSITY
 
Ad

Recently uploaded (20)

PDF
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
PDF
Using FME to Develop Self-Service CAD Applications for a Major UK Police Force
Safe Software
 
PDF
CIFDAQ Token Spotlight for 9th July 2025
CIFDAQ
 
PDF
"AI Transformation: Directions and Challenges", Pavlo Shaternik
Fwdays
 
PDF
Empower Inclusion Through Accessible Java Applications
Ana-Maria Mihalceanu
 
PDF
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
PDF
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
PDF
July Patch Tuesday
Ivanti
 
PPTX
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
PDF
HubSpot Main Hub: A Unified Growth Platform
Jaswinder Singh
 
PDF
New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
PDF
Transcript: New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
PPTX
OpenID AuthZEN - Analyst Briefing July 2025
David Brossard
 
PDF
From Code to Challenge: Crafting Skill-Based Games That Engage and Reward
aiyshauae
 
PDF
Biography of Daniel Podor.pdf
Daniel Podor
 
PDF
The Rise of AI and IoT in Mobile App Tech.pdf
IMG Global Infotech
 
PPTX
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
PDF
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
PDF
CIFDAQ Market Insights for July 7th 2025
CIFDAQ
 
PDF
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
Using FME to Develop Self-Service CAD Applications for a Major UK Police Force
Safe Software
 
CIFDAQ Token Spotlight for 9th July 2025
CIFDAQ
 
"AI Transformation: Directions and Challenges", Pavlo Shaternik
Fwdays
 
Empower Inclusion Through Accessible Java Applications
Ana-Maria Mihalceanu
 
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
July Patch Tuesday
Ivanti
 
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
HubSpot Main Hub: A Unified Growth Platform
Jaswinder Singh
 
New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
Transcript: New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
OpenID AuthZEN - Analyst Briefing July 2025
David Brossard
 
From Code to Challenge: Crafting Skill-Based Games That Engage and Reward
aiyshauae
 
Biography of Daniel Podor.pdf
Daniel Podor
 
The Rise of AI and IoT in Mobile App Tech.pdf
IMG Global Infotech
 
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
CIFDAQ Market Insights for July 7th 2025
CIFDAQ
 
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
Ad

An introduction to Metadata Application Profiles

  • 5. What are application profiles? • Record your institution or project's choices • Form a basis for developing a consensus around your own data • Express specific practices, rules • Tell data consumers what to expect
  • 6. Why do we need them? • How can someone else understand your data well enough to make use of it? • Not unlike open source problem: you can declare your code ‘open’ and wish people ‘good luck’ or you can provide support.
  • 7. Who needs them? • Creators: anyone providing data • Users • anyone who can/is allowed to access the data • both people AND machines - not an either/or, but should be both
  • 8. What are they? • Basic structure of the data • the story that the data tells; what you are trying to say • what are the things? how are they described? • What are the properties and the rules for property use? • What are the values?
  • 9. How are they? • What will a profile be? How can it be implemented? • Documents (PDF) • Spreadsheets • Code (RDF, JSON, XML)
  • 10. What does an application profile look like?
  • 16. Dublin Core Singapore Framework for Application Profiles (2007)
  • 24. Functional requirements • Before developing any solutions, define problems • Decide which problems you can solve • State the requirements for success
  • 26. Vocabularies • Profiles reuse vocabularies • Profiles can select from a single vocabulary • Profiles can extend a vocabulary • Profiles can combine vocabularies
  • 29. Term reuse & semantics • Reuse can narrow semantics but should never contradict how the term is defined at its origin • Terms with strict definitions (e.g. OWL constraints, limits on valid values, disjoint with other terms) are the hardest to reuse • Base vocabularies are best if they employ minimum semantic commitment
  • 31. Components of a profile • Vocabulary • Definitions • Usage rules • Cardinality of terms and values • Examples • Validation rules This is not a full list!
  • 38. Validation rules • Can have foaf:name or (foaf:foreName + foaf:familyName) • dct:date cannot be > 2020 • Subjects must be from http://id.loc.gov/authorities/subjects/
  • 39. Validation • Non-RDF (e.g. XML schema) • SHACL – W3C recommendation (SHApes Constraint Language) • https://www.w3.org/TR/shacl/ • ShEx – W3C community group (Shape Expressions) • http://shex.io/
  • 40. Validation • Non-RDF (e.g. XML schema) • SHACL – W3C recommendation (SHApes Constraint Language) • https://www.w3.org/TR/shacl/ • ShEx – W3C community group (Shape Expressions) • http://shex.io/
  • 41. Validation • Non-RDF (e.g. XML schema) • SHACL – W3C recommendation (SHApes Constraint Language) • https://www.w3.org/TR/shacl/ • ShEx – W3C community group (Shape Expressions) • http://shex.io/ my:IssueShape { ex:state [ex:unassigned ex:assigned]; }
  • 42. Not everything can be validated • "Recommended" "Mandatory if applicable" • Names, resource titles, other string-based data
  • 44. Profile maintenance • Who maintains the profile? • How will new terms be added? • What can be changed? • How can the profile be extended?
  • 45. What we need so that we can (easily) create profiles
  • 46. Some profile-related efforts • Dublin Core (since the late 1990's) based on Singapore Framework • http://dublincore.org/documents/singapore-framework/ • http://dublincore.org/documents/profile-guidelines/ • DXWG – Data eXchange Working Group, W3C, application profile guidance (2017, due 2019) • https://www.w3.org/2017/dxwg/wiki/Main_Page
  • 47. Standard profile language(s) • Core for the simplest needs, or for getting started • shows domain model • lists vocabulary terms • can express basic rules for vocabulary members, especially cardinality & values • documentation for human readers
  • 48. Generic domain model - DC Profile Resource Property Value "things" "terms or elements" "data"
  • 49. MyBookCase Profile: MyBookCase Resource: Book Resource: Person http://dublincore.org/documents/profile-guidelines/
  • 50. MyBookCase Profile: MyBookCase Resource: Book Property: title Property: author Property: size Resource: Person Property: name
  • 51. MyBookCase Profile: MyBookCase Resource: Book Property: title min:1, max:1 Property: author min:0, max:3 Property: size min:1, max:1 Resource: Person Property: name
  • 52. MyBookCase Profile: MyBookCase Resource: Book Property: title min:1, max:1 value type: literal Property: author min:0, max:3 value type: IRI Property: size min:1, max:1 value type: integer Resource: Person Property: name
  • 55. Can we make validation "easy"? • Valid properties ✔ • Valid values ✔ • Value types • Value lists (text or URIs) • Conditional rules  • If A not B • A or (B & C)
  • 56. Validation – bridging the gap • Profile may need validation pseudo-code • Pseudo-code -> validation standard (SHACL, ShEx)? • What to do with non-actionable statements of validation (“mandatory if applicable”)?
  • 57. Summary: Functions of a profile • Consensus-building • Documentation • Input/output control • Validation (input and output and sharing)

Editor's Notes

  • #3: We' e all gotten the message about data silos and how bad they are. They prevent data sharing, they keep us from taking advantage of the work of others.
  • #4: We think of data silos as being the result of using different standards or different data formats.
  • #5: But just because we use the same basic standard, it doesn't mean that we are producing identical data. As anyone who has tried to consume data from another institution or project knows, there are always local variations – variations in which fields or terms are used, variations in how the data has been recorded. Even when we think we are not creating data in a silo, in many cases we are. It's difficult to avoid have your data be your data. I'm going to talk about how application profiles can help with data sharing and data reuse.
  • #6: They are many things
  • #8: Rather like the schema.org focus in terms of data providers = anyone with data on the web. These folks are not always coders, and may have experience limited to a only a few technologies. Any solution has to fit into their toolbox; we can’t require them to re-tool to use this solution.
  • #11: Today there is no standard format for application profiles
  • #12: Profiles exist today, and take many forms. This is an example of a profile based on the Dataset Catalog (DCAT) vocabulary. There a number of different application profiles based on this vocabulary, which is was developed to support government open data in the European Union, although it also used elsewhere. Many countries have created application profiles that are specific to their country because they each had some specific needs. Those choices are recorded in the application profiles which generally take the form of documents.
  • #13: BIBFRAME has a number of profiles.
  • #14: This is a profile based on the BIBFRAME vocabulary.
  • #15: There is information about each term, including what type of value is expect (whether it will be a URI for a thing or will be text).
  • #16: Dublin Core has had the ide aof application profiles since the late 1990s. This fits in well with the fact that DC is intended as a highly reusable vocabulary that can fit many different situations and needs.
  • #17: First in 2007. In keeping with the use of the place name to name things, this was first presented in Singapore.
  • #23: It's the picture that you have in your head that tells you what aspect of the world your data covers, what are the "parts" or entities that make up your description of that world, and how the parts fit together. Domain models can vary in their complexity and detail, and may have different levels of detail depending on the view that is needed at some moment in time.
  • #27: Profiles generally are a reuse of vocabularies. Like BIBFRAME, the profile can be a selection from a single vocabulary. In this case the profile's message is: I am a profile of BIBFRAME. I use some but not all of the BIBFRAME-defined vocabulary. Profiles can make use of all or part of a vocabulary, but can add terms that they need but that are missing from the base vocabulary. This is the case with the DCAT profiles: they are based on the DCAT vocabulary, but in some cases the users of DCAT needed to add some information that was not covered by that vocabulary. If you have worked with the Europeana Data Model you may be familiar with profiles that have some Europeana elements but that also add their own terms or equivalents. The DCAT profiles have much overlap between them but each one has information that is not included in DCAT. But profiles can also be not related to any single vocabulary. They can be a mix and match that essentially creates a new vocabulary. Oftentimes these vocabularies are not treated as profiles, and the line between a new vocabulary made up of existing terms and a profile is not distinct. We can say, however, that a profile does not have to be primarily based on any one vocabulary. Even Europeana and DCAT vocabularies make use of terms from Dublin Core,
  • #28: bibframe is an example of a profile that is a selection from a single vocabulary.
  • #29: DCAT uses terms from its own vocabulary, from dublin core terms and foaf, among others.
  • #30: You have to pay attention to how a term is defined before you reuse it.
  • #31: When you have all of this together, then it is time to create your profile.
  • #38: Examples
  • #41: The problem is the validation code usually is pretty complex. This is an extremely simple example that say that my property "status" can be one of two things: "assigned" or "unassigned". So you an imagine how much code it takes to say something much more complicated. It isn't reasonable to assume that everyone who creates metadata is capable of writing the needed validation code.
  • #43: There are a lot of things in data that cannot be validated, especially for those in the cultural heritage area where much of there metadata consists of text, and is based on decisions made by human beings, not on calculations. So although validation is important, validation alone won't describe a profile.
  • #45: These are community decisions. If you do not include in your development of profiles the means to maintain and evolve the profiles, within a short time they cease being living solutions to your metadata needs. https://pro.europeana.eu/project/creation-and-governance-of-edm-mappings-profiles-and-extensions-task-force http://makxdekkers.com/DXWG/DCAT-AP.pdf
  • #47: Just to mention here that I am co-chairing the W3C group representing DC. That group will be creating a kind of "best practices" document but nothing so specific as code. The Dublin Core work is more detailed, but has not yet yielded a usable schema for profiles. Hopefully that is in progress.
  • #52: This looks like an entity-relation diagram at this point. But more is needed.
  • #53: This looks like an entity-relation diagram at this point. But more is needed.
  • #54: In a sense, an application profile for an application profile. Not yet "finished", but may demonstrate that we can use something as simple as a spreadsheet to allow people to easily create application profiles that can be converted to a form that would allow for ingest and validation. All without the metadata schema creator having to write code. Note in particular that very few of the elements here are required. (Those with an initial zero in the third column are optional.) An application profile could be simply a list of terms that are used for one or more resources, with a resource being a document, a person, a place, a subject. Whatever you want it to be.
  • #55: Spreadsheets – which then become CSV files (comma separated files).
  • #56: It would be ideal to be able to express all of the rules that would be necessary to validate your data. That can get to be quite complex.
  • #57: Can we include in this simple view of profiles an easy way to include validation rules, or at least the most basic validation rules? That would go a long way to aiding interoperability of datasets.