Some ThoughtsJuan Esteva, Ph. D..751 Malena Dr., Ann Arbor, MI 48103Tel:  734-786-0233 Cell  734-277-4962Fax  734-821-0235SkypeDrEstevajuan.esteva@Ajatella.comOntology Data Integration For Competitive Decision Making
Not Just The Facts3/4/2010Juan Esteva, Ph. D.2“Good decisions are based on information that is analyzed and transformed into usable knowledge” Eileen Feretic
Information at the point of impact3/4/2010Juan Esteva, Ph. D.3“Information needs to be at the point of impact—at the front lines where people are making decisions. The right analysis needs to be done at the right place. It’s important for organizations to treat information as a strategic asset in order to optimize every decision, every process, everything they do.” AmbujGoyal,
Data in Silos3/4/2010Juan Esteva, Ph. D.4“One of the biggest challenges organizations face is the amount of data sitting in silos, too often, valuable data simply isn’t accessible or available.” Boris Evelson
Business Decisions for Competitive Advantage3/4/2010Juan Esteva, Ph. D.5“In today’s troubled economy and competitive business environment, making good decisions is a matter of survival. But good decisions aren’t based on gut feeling alone. They should be based on information gathered from multiple sources, which is then synthesized and analyzed to generate a road map of options and possible outcomes that transform data into usable knowledge” Eileen Feretic
Business Intelligence3/4/2010Juan Esteva, Ph. D.6Business Intelligence and now Business Analytics systems come into play [However,] it is hard to assemble [heterogeneous data and] disparate pieces of information in a way that provides the intelligence and insight needed to make good business decisions. Eileen FereticAlas enter Ontology Data Integration.
Data Integration3/4/2010Juan Esteva, Ph. D.7Data integration provides the ability to manipulate data transparently across multiple data sources.Based on the architecture there are 2 systems:Central Data IntegrationA central data integration system usually has a global schema, which provides the user with a uniform interface to access information stored in the data sourcesPeer-2-peerIn contrast, in a peer-to-peer data integration system, there are no global points of control on the data sources (or peers). Instead, any peer can accept user queries for the information distributed in the whole system.
Common Approaches for Data Integration3/4/2010Juan Esteva, Ph. D.8Global-as-ViewIn the GaV approach, every entity in the global schema is associated with a view over the source local schema. Therefore querying strategies are simple, but the evolution of the local source schemas is not easily supported.Local-as-ViewOn the contrary, the LaV approach permits changes to source schemas without affecting the global schema, since the local schemas are defined as views over the global schema, but query processing can be complex.
Data Heterogeneity3/4/2010Juan Esteva, Ph. D.9Data sources can be heterogeneous in:SyntaxSyntactic heterogeneity is caused by the use of different models or languages.SchemaSchematic heterogeneity results from structural differences.SemanticsSemantic heterogeneity is caused by different meanings or interpretations of data in various contextsTo achieve data interoperability, the issues posed by data heterogeneity need to be eliminated
Possible Solutions3/4/2010Juan Esteva, Ph. D.10The advent of XML has created a syntactic platform for Web data standardization and exchange. However, schematic data heterogeneity may persist, depending on the XML schemas used (e.g., nesting hierarchies). Likewise, semantic heterogeneity may persist even if both syntactic and schematic heterogeneities do not occur (e.g., naming concepts differently).We should be concerned with solving all three kinds of heterogeneities by bridging syntactic, schematic, and semantic heterogeneities across different sources.
Semantic Data Integration Using Ontologies3/4/2010Juan Esteva, Ph. D.11We call semantic data integration the process of using a conceptual representation of the data and of their relationships to eliminate possible heterogeneities.At the heart of semantic data integration is the concept of ontology, which is an explicit specification of a shared conceptualization
Ontology & Data Integration3/4/2010Juan Esteva, Ph. D.12Metadata Representation. Metadata (i.e., source schemas) in each data source can be explicitly represented by a local ontology, using a single language.Global Conceptualization. The global ontology provides a conceptual view over the schematically-heterogeneous source schemas.Support for High-level Queries. Given a high-level view of the sources, as provided by a global ontology, the user can formulate a query without specific knowledge of the different data sources. The query is then rewritten into queries over the sources, based on the semantic mappings between the global and local ontologies.Declarative Mediation. Query processing in a hybrid peer-to-peer system uses the global ontology as a declarative mediator for query rewriting between peers.Mapping Support. A thesaurus, formalized in terms of an ontology, can be used for the mapping process to facilitate its automation.
What do we need?3/4/2010Juan Esteva, Ph. D.13Increase search capabilitiesFrom discovery to reasoningIncreasing metadata  as to provide strong semanticsFrom glossaries to ontologiesConsequently, moving from syntactic interoperability to structural interoperability and finally to semantic interoperability
Graphically the model progression will be [2] 3/4/2010Juan Esteva, Ph. D.14The point of this graph is that Increasing Metadata (from glossaries to ontologies) is highly correlated with Increasing Search Capability (from discovery to reasoning).
Juan Esteva, Ph. D.3/4/201015References
References3/4/2010Juan Esteva, Ph. D.16Applying 4D ontologies to Enterprise Architecture, Matthew West,  Shell Corp.FHA Data Architecture Working Group: SICoP DRM 2.0 Pilot, 2005The Role of Ontologies in Data Integration, Isabel F. Cruz Huiyong Xiao
Topic Maps3/4/2010Juan Esteva, Ph. D.17Topic Maps is a standard for the representation and interchange of knowledge, with an emphasis on the findability of information. The ISO standard is formally known as ISO/IEC 13250:2003.A topic map represents information using topics (representing any concept, from people, countries, and organizations to software modules, individual files, and events), associations (representing the relationships between topics), and occurrences (representing information resources relevant to a particular topic).
SKOS3/4/2010Juan Esteva, Ph. D.18Simple Knowledge Organization System (SKOS) SKOS is a common data model for sharing and linking knowledge organization systems via the Web.
RDF3/4/2010Juan Esteva, Ph. D.19Resource Description Language RDFRDF is a standard model for data interchange on the Web. RDF has features that facilitate data merging even if the underlying schemas differ, and it specifically supports the evolution of schemas over time without requiring all the data consumers to be changed.
OWL3/4/2010Juan Esteva, Ph. D.20Web Ontology Language OWLis a Semantic Web language designed to represent rich and complex knowledge about things, groups of things, and relations between things. OWL is a computational logic-based language such that knowledge expressed in OWL can be reasoned with by computer programs either to verify the consistency of that knowledge or to make implicit knowledge explicit. OWL documents, known as ontologies, can be published in the World Wide Web and may refer to or be referred from other OWL ontologies. OWL is part of the W3C’s Semantic Web technology stack, which includes RDF, RDFS, SPARQL, etc.

More Related Content

PPTX
ontology based- data_integration.
PPTX
Ontology-based Data Integration
PDF
Hyponymy extraction of domain ontology
PPTX
Ontology Engineering for Big Data
PDF
A category theoretic model of rdf ontology
PDF
Xml based data exchange in the
PDF
Automatically converting tabular data to
PPTX
Ontology integration - Heterogeneity, Techniques and more
ontology based- data_integration.
Ontology-based Data Integration
Hyponymy extraction of domain ontology
Ontology Engineering for Big Data
A category theoretic model of rdf ontology
Xml based data exchange in the
Automatically converting tabular data to
Ontology integration - Heterogeneity, Techniques and more

What's hot (20)

PDF
Using linguistic analysis to translate
PPT
Data Integration Ontology Mapping
PPT
Enhancing Semantic Mining
PDF
Improve information retrieval and e learning using
PPTX
The Standardization of Semantic Web Ontology
PDF
Ontology Mapping
PDF
USING RELATIONAL MODEL TO STORE OWL ONTOLOGIES AND FACTS
PDF
Translating Ontologies in Real-World Settings
PDF
Application of Ontology in Semantic Information Retrieval by Prof Shahrul Azm...
PPTX
Semantic Web, Ontology, and Ontology Learning: Introduction
PPTX
Ontologies for big data
PPT
4 semantic web and ontology
DOC
Are Data Models Superfluous Nov2003
PDF
Learning ontologies
PDF
Towards a Query Rewriting Algorithm Over Proteomics XML Resources
PPT
Ontology Mapping
PDF
Swoogle: Showcasing the Significance of Semantic Search
PDF
A little more semantics goes a lot further!  Getting more out of Linked Data ...
Using linguistic analysis to translate
Data Integration Ontology Mapping
Enhancing Semantic Mining
Improve information retrieval and e learning using
The Standardization of Semantic Web Ontology
Ontology Mapping
USING RELATIONAL MODEL TO STORE OWL ONTOLOGIES AND FACTS
Translating Ontologies in Real-World Settings
Application of Ontology in Semantic Information Retrieval by Prof Shahrul Azm...
Semantic Web, Ontology, and Ontology Learning: Introduction
Ontologies for big data
4 semantic web and ontology
Are Data Models Superfluous Nov2003
Learning ontologies
Towards a Query Rewriting Algorithm Over Proteomics XML Resources
Ontology Mapping
Swoogle: Showcasing the Significance of Semantic Search
A little more semantics goes a lot further!  Getting more out of Linked Data ...

Viewers also liked (6)

PPT
X Som Graduation Presentation
PPTX
Horizontal Integration of Big Intelligence Data
PPT
The Role Of Ontology In Modern Expert Systems Dallas 2008
PDF
8 ontology integration and interoperability (onto i op)
PDF
Enterprise and Data Mining Ontology Integration to Extract Actionable Knowled...
PPT
Management Gurus
X Som Graduation Presentation
Horizontal Integration of Big Intelligence Data
The Role Of Ontology In Modern Expert Systems Dallas 2008
8 ontology integration and interoperability (onto i op)
Enterprise and Data Mining Ontology Integration to Extract Actionable Knowled...
Management Gurus

Similar to Ontology For Data Integration (20)

PDF
An Incremental Method For Meaning Elicitation Of A Domain Ontology
PDF
A semantic framework and software design to enable the transparent integratio...
PDF
An improved technique for ranking semantic associationst07
PPTX
sOCIAL NETWORK ANALYSIS AND ONTOLOGIES A VIEW
PDF
Open Government Data on the Web - A Semantic Approach
PDF
Association Rule Mining Based Extraction of Semantic Relations Using Markov ...
PPT
Semantics in Financial Services -David Newman
PDF
Association Rule Mining Based Extraction of Semantic Relations Using Markov L...
PDF
ONTOLOGY-DRIVEN INFORMATION RETRIEVAL FOR HEALTHCARE INFORMATION SYSTEM : A C...
PDF
ONTOLOGY SERVICE CENTER: A DATAHUB FOR ONTOLOGY APPLICATION
PDF
ONTOLOGY SERVICE CENTER: A DATAHUB FOR ONTOLOGY APPLICATION
PPT
Semantic Web: Technolgies and Applications for Real-World
PPT
The impact of standardized terminologies and domain-ontologies in multilingua...
PDF
A Survey On Ontology Agent Based Distributed Data Mining
PDF
Concept integration using edit distance and n gram match
PPTX
Neuroinformatics_Databses_Ontologies_Federated Database.pptx
PPTX
Neuroinformatics Databases Ontologies Federated Database.pptx
PDF
Session III Census and registers - M. Scannapieco,The Italian Integrated Syst...
DOC
Poster Abstracts
An Incremental Method For Meaning Elicitation Of A Domain Ontology
A semantic framework and software design to enable the transparent integratio...
An improved technique for ranking semantic associationst07
sOCIAL NETWORK ANALYSIS AND ONTOLOGIES A VIEW
Open Government Data on the Web - A Semantic Approach
Association Rule Mining Based Extraction of Semantic Relations Using Markov ...
Semantics in Financial Services -David Newman
Association Rule Mining Based Extraction of Semantic Relations Using Markov L...
ONTOLOGY-DRIVEN INFORMATION RETRIEVAL FOR HEALTHCARE INFORMATION SYSTEM : A C...
ONTOLOGY SERVICE CENTER: A DATAHUB FOR ONTOLOGY APPLICATION
ONTOLOGY SERVICE CENTER: A DATAHUB FOR ONTOLOGY APPLICATION
Semantic Web: Technolgies and Applications for Real-World
The impact of standardized terminologies and domain-ontologies in multilingua...
A Survey On Ontology Agent Based Distributed Data Mining
Concept integration using edit distance and n gram match
Neuroinformatics_Databses_Ontologies_Federated Database.pptx
Neuroinformatics Databases Ontologies Federated Database.pptx
Session III Census and registers - M. Scannapieco,The Italian Integrated Syst...
Poster Abstracts

Recently uploaded (20)

PDF
NewMind AI Journal Monthly Chronicles - August 2025
PDF
Introduction to c language from lecture slides
PDF
Advancements in abstractive text summarization: a deep learning approach
PPTX
Strategic Picks — Prioritising the Right Agentic Use Cases [2/6]
PPT
Overviiew on Intellectual property right
PDF
CCUS-as-the-Missing-Link-to-Net-Zero_AksCurious.pdf
PDF
Revolutionizing recommendations a survey: a comprehensive exploration of mode...
PDF
EGCB_Solar_Project_Presentation_and Finalcial Analysis.pdf
PDF
State of AI in Business 2025 - MIT NANDA
PDF
GDG Cloud Southlake #45: Patrick Debois: The Impact of GenAI on Development a...
PDF
Human Computer Interaction Miterm Lesson
PDF
The Digital Engine Room: Unlocking APAC’s Economic and Digital Potential thro...
PDF
【AI論文解説】高速・高品質な生成を実現するFlow Map Models(Part 1~3)
PDF
Peak of Data & AI Encore: Scalable Design & Infrastructure
PDF
Ebook - The Future of AI A Comprehensive Guide.pdf
PDF
Uncertainty-aware contextual multi-armed bandits for recommendations in e-com...
PDF
Slides World Game (s) Great Redesign Eco Economic Epochs.pdf
PDF
FASHION-DRIVEN TEXTILES AS A CRYSTAL OF A NEW STREAM FOR STAKEHOLDER CAPITALI...
PPTX
AQUEEL MUSHTAQUE FAKIH COMPUTER CENTER .
PDF
Decision Optimization - From Theory to Practice
NewMind AI Journal Monthly Chronicles - August 2025
Introduction to c language from lecture slides
Advancements in abstractive text summarization: a deep learning approach
Strategic Picks — Prioritising the Right Agentic Use Cases [2/6]
Overviiew on Intellectual property right
CCUS-as-the-Missing-Link-to-Net-Zero_AksCurious.pdf
Revolutionizing recommendations a survey: a comprehensive exploration of mode...
EGCB_Solar_Project_Presentation_and Finalcial Analysis.pdf
State of AI in Business 2025 - MIT NANDA
GDG Cloud Southlake #45: Patrick Debois: The Impact of GenAI on Development a...
Human Computer Interaction Miterm Lesson
The Digital Engine Room: Unlocking APAC’s Economic and Digital Potential thro...
【AI論文解説】高速・高品質な生成を実現するFlow Map Models(Part 1~3)
Peak of Data & AI Encore: Scalable Design & Infrastructure
Ebook - The Future of AI A Comprehensive Guide.pdf
Uncertainty-aware contextual multi-armed bandits for recommendations in e-com...
Slides World Game (s) Great Redesign Eco Economic Epochs.pdf
FASHION-DRIVEN TEXTILES AS A CRYSTAL OF A NEW STREAM FOR STAKEHOLDER CAPITALI...
AQUEEL MUSHTAQUE FAKIH COMPUTER CENTER .
Decision Optimization - From Theory to Practice

Ontology For Data Integration

  • 1. Some ThoughtsJuan Esteva, Ph. D..751 Malena Dr., Ann Arbor, MI 48103Tel: 734-786-0233 Cell 734-277-4962Fax [email protected] Data Integration For Competitive Decision Making
  • 2. Not Just The Facts3/4/2010Juan Esteva, Ph. D.2“Good decisions are based on information that is analyzed and transformed into usable knowledge” Eileen Feretic
  • 3. Information at the point of impact3/4/2010Juan Esteva, Ph. D.3“Information needs to be at the point of impact—at the front lines where people are making decisions. The right analysis needs to be done at the right place. It’s important for organizations to treat information as a strategic asset in order to optimize every decision, every process, everything they do.” AmbujGoyal,
  • 4. Data in Silos3/4/2010Juan Esteva, Ph. D.4“One of the biggest challenges organizations face is the amount of data sitting in silos, too often, valuable data simply isn’t accessible or available.” Boris Evelson
  • 5. Business Decisions for Competitive Advantage3/4/2010Juan Esteva, Ph. D.5“In today’s troubled economy and competitive business environment, making good decisions is a matter of survival. But good decisions aren’t based on gut feeling alone. They should be based on information gathered from multiple sources, which is then synthesized and analyzed to generate a road map of options and possible outcomes that transform data into usable knowledge” Eileen Feretic
  • 6. Business Intelligence3/4/2010Juan Esteva, Ph. D.6Business Intelligence and now Business Analytics systems come into play [However,] it is hard to assemble [heterogeneous data and] disparate pieces of information in a way that provides the intelligence and insight needed to make good business decisions. Eileen FereticAlas enter Ontology Data Integration.
  • 7. Data Integration3/4/2010Juan Esteva, Ph. D.7Data integration provides the ability to manipulate data transparently across multiple data sources.Based on the architecture there are 2 systems:Central Data IntegrationA central data integration system usually has a global schema, which provides the user with a uniform interface to access information stored in the data sourcesPeer-2-peerIn contrast, in a peer-to-peer data integration system, there are no global points of control on the data sources (or peers). Instead, any peer can accept user queries for the information distributed in the whole system.
  • 8. Common Approaches for Data Integration3/4/2010Juan Esteva, Ph. D.8Global-as-ViewIn the GaV approach, every entity in the global schema is associated with a view over the source local schema. Therefore querying strategies are simple, but the evolution of the local source schemas is not easily supported.Local-as-ViewOn the contrary, the LaV approach permits changes to source schemas without affecting the global schema, since the local schemas are defined as views over the global schema, but query processing can be complex.
  • 9. Data Heterogeneity3/4/2010Juan Esteva, Ph. D.9Data sources can be heterogeneous in:SyntaxSyntactic heterogeneity is caused by the use of different models or languages.SchemaSchematic heterogeneity results from structural differences.SemanticsSemantic heterogeneity is caused by different meanings or interpretations of data in various contextsTo achieve data interoperability, the issues posed by data heterogeneity need to be eliminated
  • 10. Possible Solutions3/4/2010Juan Esteva, Ph. D.10The advent of XML has created a syntactic platform for Web data standardization and exchange. However, schematic data heterogeneity may persist, depending on the XML schemas used (e.g., nesting hierarchies). Likewise, semantic heterogeneity may persist even if both syntactic and schematic heterogeneities do not occur (e.g., naming concepts differently).We should be concerned with solving all three kinds of heterogeneities by bridging syntactic, schematic, and semantic heterogeneities across different sources.
  • 11. Semantic Data Integration Using Ontologies3/4/2010Juan Esteva, Ph. D.11We call semantic data integration the process of using a conceptual representation of the data and of their relationships to eliminate possible heterogeneities.At the heart of semantic data integration is the concept of ontology, which is an explicit specification of a shared conceptualization
  • 12. Ontology & Data Integration3/4/2010Juan Esteva, Ph. D.12Metadata Representation. Metadata (i.e., source schemas) in each data source can be explicitly represented by a local ontology, using a single language.Global Conceptualization. The global ontology provides a conceptual view over the schematically-heterogeneous source schemas.Support for High-level Queries. Given a high-level view of the sources, as provided by a global ontology, the user can formulate a query without specific knowledge of the different data sources. The query is then rewritten into queries over the sources, based on the semantic mappings between the global and local ontologies.Declarative Mediation. Query processing in a hybrid peer-to-peer system uses the global ontology as a declarative mediator for query rewriting between peers.Mapping Support. A thesaurus, formalized in terms of an ontology, can be used for the mapping process to facilitate its automation.
  • 13. What do we need?3/4/2010Juan Esteva, Ph. D.13Increase search capabilitiesFrom discovery to reasoningIncreasing metadata as to provide strong semanticsFrom glossaries to ontologiesConsequently, moving from syntactic interoperability to structural interoperability and finally to semantic interoperability
  • 14. Graphically the model progression will be [2] 3/4/2010Juan Esteva, Ph. D.14The point of this graph is that Increasing Metadata (from glossaries to ontologies) is highly correlated with Increasing Search Capability (from discovery to reasoning).
  • 15. Juan Esteva, Ph. D.3/4/201015References
  • 16. References3/4/2010Juan Esteva, Ph. D.16Applying 4D ontologies to Enterprise Architecture, Matthew West, Shell Corp.FHA Data Architecture Working Group: SICoP DRM 2.0 Pilot, 2005The Role of Ontologies in Data Integration, Isabel F. Cruz Huiyong Xiao
  • 17. Topic Maps3/4/2010Juan Esteva, Ph. D.17Topic Maps is a standard for the representation and interchange of knowledge, with an emphasis on the findability of information. The ISO standard is formally known as ISO/IEC 13250:2003.A topic map represents information using topics (representing any concept, from people, countries, and organizations to software modules, individual files, and events), associations (representing the relationships between topics), and occurrences (representing information resources relevant to a particular topic).
  • 18. SKOS3/4/2010Juan Esteva, Ph. D.18Simple Knowledge Organization System (SKOS) SKOS is a common data model for sharing and linking knowledge organization systems via the Web.
  • 19. RDF3/4/2010Juan Esteva, Ph. D.19Resource Description Language RDFRDF is a standard model for data interchange on the Web. RDF has features that facilitate data merging even if the underlying schemas differ, and it specifically supports the evolution of schemas over time without requiring all the data consumers to be changed.
  • 20. OWL3/4/2010Juan Esteva, Ph. D.20Web Ontology Language OWLis a Semantic Web language designed to represent rich and complex knowledge about things, groups of things, and relations between things. OWL is a computational logic-based language such that knowledge expressed in OWL can be reasoned with by computer programs either to verify the consistency of that knowledge or to make implicit knowledge explicit. OWL documents, known as ontologies, can be published in the World Wide Web and may refer to or be referred from other OWL ontologies. OWL is part of the W3C’s Semantic Web technology stack, which includes RDF, RDFS, SPARQL, etc.