SlideShare a Scribd company logo
Delivering a Linked Data warehouse and integrating
across the wider enterprise
Ben Gardner – Linklaters LLP
Semantics
September 2016
11
Summary
• Information discovery requirements
• What we did
• Linked Data in Action
• Conclusion
22
Accessing the right information is challenging
Diverse Range of Specialisations
Information Seeking Behaviour
Information is Silo’ed
Information Hierarchy
33
What we did
44
Building a Linked Data Warehouse demo
Excel Reports
XML File
RDF
Management
Triple
Store
Model
UI
S  O
ETL Platform
OData
+
OData4Sparql
Sparql
+
Linked Data Warehouse Data Access Exploration
Linked Data and Model
• Traditional approaches try to identify how the data is to be “captured”
upfront.
• You can do this with the linked data model
• But we don’t…..Why?
• Always leads to “Paralysis by Analysis”
• You will miss so much.
• And take a huge amount of time doing it.
• You will find that there is a huge amount of
information and relationships you never would
of thought if starting from the model.
• Then there are tricks you can do to add huge
value
• The data model evolves very rapidly from the
data and can be further tweaked at anytime.
Let the data express itself
• Source by source, row by row let the data tell
you what it is describing.
• What it is, what relationships and metadata it
has.
• You’ll find a lot more information that you
simply couldn’t describe in a RDMS
• Another source can add to an existing item
without you even having to think
66
Degree
Person
Matter
Jurisdic
tion
Jurisdic
tion
College
Sector
Person
Person
Client
Manager
Partner
Client Area
Client
Person
Manager Area
Linked Data and Model : Individual Model
Fragments
77
Degree
Matter
Jurisdic
tion
College
Sector
Person
Client
Manager
Partner
Client
Area
Client
Manager
Area
Linked Data and Model: Fragments automatically
align
ETL & Linked Data Creation & Management
In4mium Talend modules
• Semantic modules ready to use through
configuration in Talend
• No API knowledge required by users
• Range of modules (over 60 ) for all
aspects of linked data creation and
management
• Create fully semantic apps
• Or pick and mix with traditional
aspects
• Works seamlessly with existing Talend
environment and modules
• Model driven behaviours are now
possible
• Easily add sematic technologies into
existing service architectures
• All the benefits without the hassle
99
OData4Sparql – Simplifying integration
+
• Brings together the strength of a ubiquitous RESTful
interface standard (OData) with the flexibility, federation
ability of RDF/SPARQL.
• SPARQL/OData Interop proposed W3C interoperation proxy
between OData and SPARQL (Kal Ahmed, 2013)
• Opens up many popular user-interface development
frameworks and tools such as Kendo UI, SAPUI5, etc.
• Acts as a Janus-point between application development and
data-sources.
• User interface developers are not, and do not want to be,
database developers. Therefore they want to use a
standardized interface that abstracts away the database,
even to the extent of what type of database: RDBMS,
NoSQL, or RDF/SPARQL
• By providing an OData4SPARQL server, it opens up any
SPARQL data-source to the C#/LINQ development world.
• Opens up many productivity tools such as
Excel/PowerQuery, and SharePoint to be consumers of
SPARQL data such as Dbpedia, Chembl, Chebi, BioPax
and any of the Linked Open Data endpoints!
• Microsoft has been joined by IBM and SAP using OData as
their primary interface method which means there will many
application developers familiar with OData as the means to
communicate with a backend data source.
1010
Model Driven UI
Linklaters Data Model Northwind Data Model
Things
Sample Query Sample Query
Relationships
between
Things
Things
Relationships
between
Things
1111
Demo of Linked Data in action
1212
Strings to Things to Facts
Click on a ‘thing’
displays a ‘Lens’
about that ‘thing’
that shows different
fragments that
displays facts about
the thing
The ‘About’
fragment shows
most relevant
information.
Compare with the
Google
knowledge graph
The ‘Person
Involved’
fragment list all
persons involved
with the matter
The ‘Financial
Summary’
calculates a
financial
summary
… and we can find
associated deal
‘things’. If we want
more details about
any ‘thing’ we can
now navigate to its
‘lens’
1313
Lens Discovery
Navigating through
‘Gerald Grant’, the
managing partner
for the Matter, takes
us to his Lens
Navigating through
the associated deal
takes us to that
deal’s Lens
Or show the Lens
on the client of the
matter
One is not limited to
facts within the
application. In the
case of a client we
can navigate to their
Companies House
page (or it could
have been D&B,
LinkDocs etc)
1414
Composing Questions
Advanced Searches can
be selected from the list
which then displays a
query in a different format
that allows better control
over the search
Advanced Searches can
be selected from the list
which then displays a
query in a different format
that allows better control
over the search
The advanced search
allows conditions to be
added that link to other
‘things’ or limit the values
of ‘facts’ about the
associated ‘thing’. This
allows much more precise
searches to be executed
1515
OData integration with Excel Power Query/Pivot
OData
OData4Sparql
Power Query Data Grabber/Shaper
• Build queries and utilise expand to traverse graph
• Limited data transformation can be incorporated into
the queries
• Create multiple views
Power Pivot Self Service BI
• Integrate across Power Queries and
other sources to build ROLAP models
• Explore model with Pivot tables
Power
View
Power
Map
Pivots, Charts
& Grids
Tableau,
etc.
Power Query
Power Pivot
1616
Conclusion
1717
Linked Data has delivered
• Elimination of silos through creation of logical
data warehouse that is extensible across internal
and external data sources
• Enabled “find and explore” information seeking
behaviours
• Separation of data modelling from integration
provides for easy addition of internal & external
data
• Ability to support diverse range of specialised
domain views onto data
• Introduces a Service Orientated Data
Architecture simplifying application
development
• Based on W3C web standards providing future
proofing and protection of firms IP (data
models)
1818
Building a Linked Data Warehouse pilot
RDF
Management
Triple
Store
Model
UI
S  O
ETL Platform
OData
+
OData4Sparql
Sparql
+







Matter
Time
People
Financials
Deal
Finder
Client
Book
Client
Engage
K_Docs
SAP


One FTE (2x0.5) and nine months delivered
• Integrated 3 years and 9 months of data from 9 sources
• 24 million triples
• 62 Things (People, Matters, Clients, etc.)
• 127 Relationships between Things
• 223 Data attributes
1919
Questions?

More Related Content

PDF
Ontos NLP Stack, Sep. 2016
Martin Voigt
 
PPTX
David Kuilman | Creating a Semantic Enterprise Content model to support conti...
semanticsconference
 
PDF
Edgard Marx, Amrapali Zaveri, Diego Moussallem and Sandro Rautenberg | DBtren...
semanticsconference
 
PDF
Chalitha Perera | Cross Media Concept and Entity Driven Search for Enterprise
semanticsconference
 
PDF
Joe Pairman | Multiplying the Power of Taxonomy with Granular, Structured Con...
semanticsconference
 
PDF
Kerstin Diwisch | Towards a holistic visualization management for knowledge g...
semanticsconference
 
PPTX
Stephen Buxton | Data Integration - a Multi-Model Approach - Documents and Tr...
semanticsconference
 
PDF
Semantic E-Commerce - Use Cases in Enterprise Web Applications
Linked Enterprise Date Services
 
Ontos NLP Stack, Sep. 2016
Martin Voigt
 
David Kuilman | Creating a Semantic Enterprise Content model to support conti...
semanticsconference
 
Edgard Marx, Amrapali Zaveri, Diego Moussallem and Sandro Rautenberg | DBtren...
semanticsconference
 
Chalitha Perera | Cross Media Concept and Entity Driven Search for Enterprise
semanticsconference
 
Joe Pairman | Multiplying the Power of Taxonomy with Granular, Structured Con...
semanticsconference
 
Kerstin Diwisch | Towards a holistic visualization management for knowledge g...
semanticsconference
 
Stephen Buxton | Data Integration - a Multi-Model Approach - Documents and Tr...
semanticsconference
 
Semantic E-Commerce - Use Cases in Enterprise Web Applications
Linked Enterprise Date Services
 

What's hot (20)

PPTX
Robert Isele | eccenca CorporateMemory - Semantically integrated Enterprise D...
semanticsconference
 
PDF
Felix Burkhardt | ARCHITECTURE FOR A QUESTION ANSWERING MACHINE
semanticsconference
 
PPTX
Semantic Technology in Publishing & Finance
Vladimir Alexiev, PhD, PMP
 
DOCX
Evaluation criteria for nosql databases
Ebenezer Daniel
 
PDF
On demand access to Big Data through Semantic Technologies
Peter Haase
 
PPTX
Top 5 Considerations When Evaluating NoSQL
MongoDB
 
PDF
II-SDV 2015, 20 - 21 April, in Nice
Dr. Haxel Consult
 
PPTX
Enterprise search
Emrah M. Işık
 
PPSX
RDF and OWL : the powerful duo | Tara Raafat
Connected Data World
 
PDF
Sebastian Hellmann
Connected Data World
 
PPTX
Charles Ivie
Connected Data World
 
PDF
II-SDV 2015, 20 - 21 April, in Nice
Dr. Haxel Consult
 
PDF
Linked Open Data in the World of Patents
Dr. Haxel Consult
 
PDF
Odp - On demand profiler (ICPE 2018)
Tao Feng
 
PPTX
Enterprise Search Summit Keynote: A Big Data Architecture for Search
Search Technologies
 
PPTX
The Evolution of Search and Big Data
Search Technologies
 
PPTX
Solution architecture for big data projects
Sandeep Sharma IIMK Smart City,IoT,Bigdata,Cloud,BI,DW
 
PPTX
Metadata management in SharePoint
Metataxis
 
PDF
How To Drive Intelligent Migration Webinar
Concept Searching, Inc
 
PPTX
Linked Data Platform as a novel approach for Enterprise Application Integra...
Nandana Mihindukulasooriya
 
Robert Isele | eccenca CorporateMemory - Semantically integrated Enterprise D...
semanticsconference
 
Felix Burkhardt | ARCHITECTURE FOR A QUESTION ANSWERING MACHINE
semanticsconference
 
Semantic Technology in Publishing & Finance
Vladimir Alexiev, PhD, PMP
 
Evaluation criteria for nosql databases
Ebenezer Daniel
 
On demand access to Big Data through Semantic Technologies
Peter Haase
 
Top 5 Considerations When Evaluating NoSQL
MongoDB
 
II-SDV 2015, 20 - 21 April, in Nice
Dr. Haxel Consult
 
Enterprise search
Emrah M. Işık
 
RDF and OWL : the powerful duo | Tara Raafat
Connected Data World
 
Sebastian Hellmann
Connected Data World
 
Charles Ivie
Connected Data World
 
II-SDV 2015, 20 - 21 April, in Nice
Dr. Haxel Consult
 
Linked Open Data in the World of Patents
Dr. Haxel Consult
 
Odp - On demand profiler (ICPE 2018)
Tao Feng
 
Enterprise Search Summit Keynote: A Big Data Architecture for Search
Search Technologies
 
The Evolution of Search and Big Data
Search Technologies
 
Solution architecture for big data projects
Sandeep Sharma IIMK Smart City,IoT,Bigdata,Cloud,BI,DW
 
Metadata management in SharePoint
Metataxis
 
How To Drive Intelligent Migration Webinar
Concept Searching, Inc
 
Linked Data Platform as a novel approach for Enterprise Application Integra...
Nandana Mihindukulasooriya
 
Ad

Viewers also liked (20)

PDF
Linked data the next 5 years - From Hype to Action
Andreas Blumauer
 
PDF
Shuangyong Song, Qingliang Miao and Yao Meng | Linking Images to Semantic Kno...
semanticsconference
 
PPTX
Kostas Kastrantas | Business Opportunities with Linked Open Data
semanticsconference
 
PDF
Victor Charpenay | Standardized Semantics for an Open Web of Things
semanticsconference
 
PPTX
OWL-based validation by Gavin Mendel Gleasonand Bojan Bozic, Trinity College,...
semanticsconference
 
PPTX
Thomas Vavra | New Ways of Handling Old Data
semanticsconference
 
PPTX
Georgios Meditskos and Stamatia Dasiopoulou | Question Answering over Pattern...
semanticsconference
 
PPTX
Sören Auer | Enterprise Knowledge Graphs
semanticsconference
 
PDF
Adam Bartusiak and Jörg Lässig | Semantic Processing for the Conversion of Un...
semanticsconference
 
PPTX
Jörg Waitelonis, Henrik Jürges and Harald Sack | Don't compare Apples to Oran...
semanticsconference
 
PDF
Christian Opitz | Semantic E-Commerce - Use Cases in Enterprise Web Applications
semanticsconference
 
PDF
Nicoletta Fornara and Fabio Marfia | Modeling and Enforcing Access Control Ob...
semanticsconference
 
PDF
Fajar J. Ekaputra, Marta Sabou, Estefania Serral and Stefan Biffl | Knowledge...
semanticsconference
 
PPTX
Reginald Ford, Grit Denker, Daniel Elenius, Wesley Moore and Elie Abi-Lahoud ...
semanticsconference
 
PDF
Tomas Knap | RDF Data Processing and Integration Tasks in UnifiedViews: Use C...
semanticsconference
 
PPTX
Vassilios Peristeras | Promoting Semantic Interoperability for European Publi...
semanticsconference
 
PPTX
Holger Wollschläger | E-government at its best: Open, transparent and useful
semanticsconference
 
PPTX
Jo Kent | ADA – Opening up the BBC archive with linked data
semanticsconference
 
PPTX
Consuming Linked Data SemTech2010
Juan Sequeda
 
PDF
Linked Data Quality Assessment: A Survey
Amrapali Zaveri, PhD
 
Linked data the next 5 years - From Hype to Action
Andreas Blumauer
 
Shuangyong Song, Qingliang Miao and Yao Meng | Linking Images to Semantic Kno...
semanticsconference
 
Kostas Kastrantas | Business Opportunities with Linked Open Data
semanticsconference
 
Victor Charpenay | Standardized Semantics for an Open Web of Things
semanticsconference
 
OWL-based validation by Gavin Mendel Gleasonand Bojan Bozic, Trinity College,...
semanticsconference
 
Thomas Vavra | New Ways of Handling Old Data
semanticsconference
 
Georgios Meditskos and Stamatia Dasiopoulou | Question Answering over Pattern...
semanticsconference
 
Sören Auer | Enterprise Knowledge Graphs
semanticsconference
 
Adam Bartusiak and Jörg Lässig | Semantic Processing for the Conversion of Un...
semanticsconference
 
Jörg Waitelonis, Henrik Jürges and Harald Sack | Don't compare Apples to Oran...
semanticsconference
 
Christian Opitz | Semantic E-Commerce - Use Cases in Enterprise Web Applications
semanticsconference
 
Nicoletta Fornara and Fabio Marfia | Modeling and Enforcing Access Control Ob...
semanticsconference
 
Fajar J. Ekaputra, Marta Sabou, Estefania Serral and Stefan Biffl | Knowledge...
semanticsconference
 
Reginald Ford, Grit Denker, Daniel Elenius, Wesley Moore and Elie Abi-Lahoud ...
semanticsconference
 
Tomas Knap | RDF Data Processing and Integration Tasks in UnifiedViews: Use C...
semanticsconference
 
Vassilios Peristeras | Promoting Semantic Interoperability for European Publi...
semanticsconference
 
Holger Wollschläger | E-government at its best: Open, transparent and useful
semanticsconference
 
Jo Kent | ADA – Opening up the BBC archive with linked data
semanticsconference
 
Consuming Linked Data SemTech2010
Juan Sequeda
 
Linked Data Quality Assessment: A Survey
Amrapali Zaveri, PhD
 
Ad

Similar to Ben Gardner | Delivering a Linked Data warehouse and integrating across the wider enterprise (20)

PPTX
Delivering a Linked Data warehouse and realising the power of graphs
Ben Gardner
 
PDF
Linked Data 1st Edition David Wood Marsha Zaidman Luke Ruth Michael Hausenblas
juradorurua
 
PDF
Tutorial Data Management and workflows
SSSW
 
PDF
Linked Data and Semantic Web Application Development by Peter Haase
Laboratory of Information Science and Semantic Technologies
 
PDF
Linking knowledge spaces
Christophe Guéret
 
PDF
Using Linked Data Resources to generate web pages based on a BBC case study
Leila Zemmouchi-Ghomari
 
PDF
Efficient, Scalable, and Provenance-Aware Management of Linked Data
eXascale Infolab
 
PDF
Llinked open data training for EU institutions
Open Data Support
 
PDF
ALLDATA 2015 - RDF Based Linked Data Management as a DaaS Platform
Seonho Kim
 
PPTX
Iswc 2014-hammond-pasin-presentation-final
Tony Hammond
 
PDF
Linking Open Government Data at Scale
Bernadette Hyland-Wood
 
PDF
Alberto Ciaramella: "Linked patent data: opportunities and challenges for pat...
IntelliSemantic
 
PDF
Linked Data Visualization 1st Edition Laura Po
krruciatanda
 
PDF
An introduction to Linked Open Data
Ali Khalili
 
PDF
Introduction to linked data
Open Data Support
 
PDF
Cloud-based Linked Data Management for Self-service Application Development
Peter Haase
 
PPTX
Linked Data for African Libraries
Getaneh Alemu
 
PDF
Linked Data - Overview and Potentials
Tobias Bürger
 
DOCX
LODLAM Landscape NOTES
Shana McDanold
 
PDF
Weaving a Web of Linked Data - September 26th, 2019
Platform Linked Data Netherlands (PLDN)
 
Delivering a Linked Data warehouse and realising the power of graphs
Ben Gardner
 
Linked Data 1st Edition David Wood Marsha Zaidman Luke Ruth Michael Hausenblas
juradorurua
 
Tutorial Data Management and workflows
SSSW
 
Linked Data and Semantic Web Application Development by Peter Haase
Laboratory of Information Science and Semantic Technologies
 
Linking knowledge spaces
Christophe Guéret
 
Using Linked Data Resources to generate web pages based on a BBC case study
Leila Zemmouchi-Ghomari
 
Efficient, Scalable, and Provenance-Aware Management of Linked Data
eXascale Infolab
 
Llinked open data training for EU institutions
Open Data Support
 
ALLDATA 2015 - RDF Based Linked Data Management as a DaaS Platform
Seonho Kim
 
Iswc 2014-hammond-pasin-presentation-final
Tony Hammond
 
Linking Open Government Data at Scale
Bernadette Hyland-Wood
 
Alberto Ciaramella: "Linked patent data: opportunities and challenges for pat...
IntelliSemantic
 
Linked Data Visualization 1st Edition Laura Po
krruciatanda
 
An introduction to Linked Open Data
Ali Khalili
 
Introduction to linked data
Open Data Support
 
Cloud-based Linked Data Management for Self-service Application Development
Peter Haase
 
Linked Data for African Libraries
Getaneh Alemu
 
Linked Data - Overview and Potentials
Tobias Bürger
 
LODLAM Landscape NOTES
Shana McDanold
 
Weaving a Web of Linked Data - September 26th, 2019
Platform Linked Data Netherlands (PLDN)
 

More from semanticsconference (20)

PPTX
Linear books to open world adventure
semanticsconference
 
PDF
Session 1.2 high-precision, context-free entity linking exploiting unambigu...
semanticsconference
 
PDF
Session 4.3 semantic annotation for enhancing collaborative ideation
semanticsconference
 
PDF
Session 1.1 dalicc - data licenses clearance center
semanticsconference
 
PDF
Session 1.3 context information management across smart city knowledge domains
semanticsconference
 
PDF
Session 0.0 aussenac semanticsnl-pwebsem2017-v4
semanticsconference
 
PPTX
Session 0.0 keynote sandeep sacheti - final hi res
semanticsconference
 
PPTX
Session 1.1 linked data applied: a field report from the netherlands
semanticsconference
 
PDF
Session 1.2 enrich your knowledge graphs: linked data integration with pool...
semanticsconference
 
PDF
Session 1.4 connecting information from legislation and datasets using a ca...
semanticsconference
 
PDF
Session 1.4 a distributed network of heritage information
semanticsconference
 
PDF
Session 0.0 media panel - matthias priem - gtuo - semantics 2017
semanticsconference
 
PDF
Session 1.3 semantic asset management in the dutch rail engineering and con...
semanticsconference
 
PPTX
Session 1.3 energy, smart homes & smart grids: towards interoperability...
semanticsconference
 
PDF
Session 1.2 improving access to digital content by semantic enrichment
semanticsconference
 
PPTX
Session 2.3 semantics for safeguarding & security – a police story
semanticsconference
 
PPTX
Session 2.5 semantic similarity based clustering of license excerpts for im...
semanticsconference
 
PDF
Session 4.2 unleash the triple: leveraging a corporate discovery interface....
semanticsconference
 
PDF
Session 1.6 slovak public metadata governance and management based on linke...
semanticsconference
 
PPTX
Session 5.6 towards a semantic outlier detection framework in wireless sens...
semanticsconference
 
Linear books to open world adventure
semanticsconference
 
Session 1.2 high-precision, context-free entity linking exploiting unambigu...
semanticsconference
 
Session 4.3 semantic annotation for enhancing collaborative ideation
semanticsconference
 
Session 1.1 dalicc - data licenses clearance center
semanticsconference
 
Session 1.3 context information management across smart city knowledge domains
semanticsconference
 
Session 0.0 aussenac semanticsnl-pwebsem2017-v4
semanticsconference
 
Session 0.0 keynote sandeep sacheti - final hi res
semanticsconference
 
Session 1.1 linked data applied: a field report from the netherlands
semanticsconference
 
Session 1.2 enrich your knowledge graphs: linked data integration with pool...
semanticsconference
 
Session 1.4 connecting information from legislation and datasets using a ca...
semanticsconference
 
Session 1.4 a distributed network of heritage information
semanticsconference
 
Session 0.0 media panel - matthias priem - gtuo - semantics 2017
semanticsconference
 
Session 1.3 semantic asset management in the dutch rail engineering and con...
semanticsconference
 
Session 1.3 energy, smart homes & smart grids: towards interoperability...
semanticsconference
 
Session 1.2 improving access to digital content by semantic enrichment
semanticsconference
 
Session 2.3 semantics for safeguarding & security – a police story
semanticsconference
 
Session 2.5 semantic similarity based clustering of license excerpts for im...
semanticsconference
 
Session 4.2 unleash the triple: leveraging a corporate discovery interface....
semanticsconference
 
Session 1.6 slovak public metadata governance and management based on linke...
semanticsconference
 
Session 5.6 towards a semantic outlier detection framework in wireless sens...
semanticsconference
 

Recently uploaded (20)

PDF
Presentation about Hardware and Software in Computer
snehamodhawadiya
 
PPTX
Agile Chennai 18-19 July 2025 | Emerging patterns in Agentic AI by Bharani Su...
AgileNetwork
 
PDF
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
PDF
NewMind AI Weekly Chronicles - July'25 - Week IV
NewMind AI
 
PDF
Automating ArcGIS Content Discovery with FME: A Real World Use Case
Safe Software
 
PDF
How ETL Control Logic Keeps Your Pipelines Safe and Reliable.pdf
Stryv Solutions Pvt. Ltd.
 
PPTX
IT Runs Better with ThousandEyes AI-driven Assurance
ThousandEyes
 
PPTX
Simple and concise overview about Quantum computing..pptx
mughal641
 
PDF
The Future of Artificial Intelligence (AI)
Mukul
 
PDF
AI-Cloud-Business-Management-Platforms-The-Key-to-Efficiency-Growth.pdf
Artjoker Software Development Company
 
PPTX
AI in Daily Life: How Artificial Intelligence Helps Us Every Day
vanshrpatil7
 
PDF
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 
PDF
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
PDF
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
PDF
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 
PDF
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
PDF
OFFOFFBOX™ – A New Era for African Film | Startup Presentation
ambaicciwalkerbrian
 
PDF
Make GenAI investments go further with the Dell AI Factory
Principled Technologies
 
PPTX
Agile Chennai 18-19 July 2025 Ideathon | AI Powered Microfinance Literacy Gui...
AgileNetwork
 
PDF
Using Anchore and DefectDojo to Stand Up Your DevSecOps Function
Anchore
 
Presentation about Hardware and Software in Computer
snehamodhawadiya
 
Agile Chennai 18-19 July 2025 | Emerging patterns in Agentic AI by Bharani Su...
AgileNetwork
 
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
NewMind AI Weekly Chronicles - July'25 - Week IV
NewMind AI
 
Automating ArcGIS Content Discovery with FME: A Real World Use Case
Safe Software
 
How ETL Control Logic Keeps Your Pipelines Safe and Reliable.pdf
Stryv Solutions Pvt. Ltd.
 
IT Runs Better with ThousandEyes AI-driven Assurance
ThousandEyes
 
Simple and concise overview about Quantum computing..pptx
mughal641
 
The Future of Artificial Intelligence (AI)
Mukul
 
AI-Cloud-Business-Management-Platforms-The-Key-to-Efficiency-Growth.pdf
Artjoker Software Development Company
 
AI in Daily Life: How Artificial Intelligence Helps Us Every Day
vanshrpatil7
 
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
OFFOFFBOX™ – A New Era for African Film | Startup Presentation
ambaicciwalkerbrian
 
Make GenAI investments go further with the Dell AI Factory
Principled Technologies
 
Agile Chennai 18-19 July 2025 Ideathon | AI Powered Microfinance Literacy Gui...
AgileNetwork
 
Using Anchore and DefectDojo to Stand Up Your DevSecOps Function
Anchore
 

Ben Gardner | Delivering a Linked Data warehouse and integrating across the wider enterprise

  • 1. Delivering a Linked Data warehouse and integrating across the wider enterprise Ben Gardner – Linklaters LLP Semantics September 2016
  • 2. 11 Summary • Information discovery requirements • What we did • Linked Data in Action • Conclusion
  • 3. 22 Accessing the right information is challenging Diverse Range of Specialisations Information Seeking Behaviour Information is Silo’ed Information Hierarchy
  • 5. 44 Building a Linked Data Warehouse demo Excel Reports XML File RDF Management Triple Store Model UI S  O ETL Platform OData + OData4Sparql Sparql + Linked Data Warehouse Data Access Exploration
  • 6. Linked Data and Model • Traditional approaches try to identify how the data is to be “captured” upfront. • You can do this with the linked data model • But we don’t…..Why? • Always leads to “Paralysis by Analysis” • You will miss so much. • And take a huge amount of time doing it. • You will find that there is a huge amount of information and relationships you never would of thought if starting from the model. • Then there are tricks you can do to add huge value • The data model evolves very rapidly from the data and can be further tweaked at anytime. Let the data express itself • Source by source, row by row let the data tell you what it is describing. • What it is, what relationships and metadata it has. • You’ll find a lot more information that you simply couldn’t describe in a RDMS • Another source can add to an existing item without you even having to think
  • 9. ETL & Linked Data Creation & Management In4mium Talend modules • Semantic modules ready to use through configuration in Talend • No API knowledge required by users • Range of modules (over 60 ) for all aspects of linked data creation and management • Create fully semantic apps • Or pick and mix with traditional aspects • Works seamlessly with existing Talend environment and modules • Model driven behaviours are now possible • Easily add sematic technologies into existing service architectures • All the benefits without the hassle
  • 10. 99 OData4Sparql – Simplifying integration + • Brings together the strength of a ubiquitous RESTful interface standard (OData) with the flexibility, federation ability of RDF/SPARQL. • SPARQL/OData Interop proposed W3C interoperation proxy between OData and SPARQL (Kal Ahmed, 2013) • Opens up many popular user-interface development frameworks and tools such as Kendo UI, SAPUI5, etc. • Acts as a Janus-point between application development and data-sources. • User interface developers are not, and do not want to be, database developers. Therefore they want to use a standardized interface that abstracts away the database, even to the extent of what type of database: RDBMS, NoSQL, or RDF/SPARQL • By providing an OData4SPARQL server, it opens up any SPARQL data-source to the C#/LINQ development world. • Opens up many productivity tools such as Excel/PowerQuery, and SharePoint to be consumers of SPARQL data such as Dbpedia, Chembl, Chebi, BioPax and any of the Linked Open Data endpoints! • Microsoft has been joined by IBM and SAP using OData as their primary interface method which means there will many application developers familiar with OData as the means to communicate with a backend data source.
  • 11. 1010 Model Driven UI Linklaters Data Model Northwind Data Model Things Sample Query Sample Query Relationships between Things Things Relationships between Things
  • 12. 1111 Demo of Linked Data in action
  • 13. 1212 Strings to Things to Facts Click on a ‘thing’ displays a ‘Lens’ about that ‘thing’ that shows different fragments that displays facts about the thing The ‘About’ fragment shows most relevant information. Compare with the Google knowledge graph The ‘Person Involved’ fragment list all persons involved with the matter The ‘Financial Summary’ calculates a financial summary … and we can find associated deal ‘things’. If we want more details about any ‘thing’ we can now navigate to its ‘lens’
  • 14. 1313 Lens Discovery Navigating through ‘Gerald Grant’, the managing partner for the Matter, takes us to his Lens Navigating through the associated deal takes us to that deal’s Lens Or show the Lens on the client of the matter One is not limited to facts within the application. In the case of a client we can navigate to their Companies House page (or it could have been D&B, LinkDocs etc)
  • 15. 1414 Composing Questions Advanced Searches can be selected from the list which then displays a query in a different format that allows better control over the search Advanced Searches can be selected from the list which then displays a query in a different format that allows better control over the search The advanced search allows conditions to be added that link to other ‘things’ or limit the values of ‘facts’ about the associated ‘thing’. This allows much more precise searches to be executed
  • 16. 1515 OData integration with Excel Power Query/Pivot OData OData4Sparql Power Query Data Grabber/Shaper • Build queries and utilise expand to traverse graph • Limited data transformation can be incorporated into the queries • Create multiple views Power Pivot Self Service BI • Integrate across Power Queries and other sources to build ROLAP models • Explore model with Pivot tables Power View Power Map Pivots, Charts & Grids Tableau, etc. Power Query Power Pivot
  • 18. 1717 Linked Data has delivered • Elimination of silos through creation of logical data warehouse that is extensible across internal and external data sources • Enabled “find and explore” information seeking behaviours • Separation of data modelling from integration provides for easy addition of internal & external data • Ability to support diverse range of specialised domain views onto data • Introduces a Service Orientated Data Architecture simplifying application development • Based on W3C web standards providing future proofing and protection of firms IP (data models)
  • 19. 1818 Building a Linked Data Warehouse pilot RDF Management Triple Store Model UI S  O ETL Platform OData + OData4Sparql Sparql +        Matter Time People Financials Deal Finder Client Book Client Engage K_Docs SAP   One FTE (2x0.5) and nine months delivered • Integrated 3 years and 9 months of data from 9 sources • 24 million triples • 62 Things (People, Matters, Clients, etc.) • 127 Relationships between Things • 223 Data attributes

Editor's Notes

  • #10: In this picture we show just two In4mium modules being used alongside standard Talend modules. This workflow is showing filters, transformations and lookup joins before the data is converted to RDF. It is the Rdfiser that converts the standard data on the flow to RDF. The RDf can then be managed in triple stores or as in this case written to files. The RDFizer is itself model driven as it uses an RDF r2rml configuration file. The talend job can be deployed as a stand alone java executable or deployed as a web service within your architecture. Foundation Platform: Talend Gartner Magic Quadrant Open Studio and enterprise versions Composable visual java development environment Solution frameworks for Integration, BPM, MDM, ESB, Data Quality, Big data Configuration 1000’s of module to configure into applications ETL, Amazon Cloud, Hadoop, BI Modules are java injection routines Well supported community Highly scalable efficient code generation Deployable as within service architectures Adds to your existing architecture Not a rip and replace! BUT Lacks any knowledge of Semantic data handling and management