SlideShare a Scribd company logo
Real Semantics: Being Proactive about BCBS 239
Presented by Ontology2 -semantic graph models, dataquality and easy multi-model management
Overview
BCBS 239 is a standard set by the Bank for International Settlements which applies to banks which are
systematically important, either on a global or national basis. It raises the bar for IT systems and the
challenge of compliance is a major concern of both IT organizations and the C-Suite.
A product of years of development at Ontology2, RealSemantics could be the first IT architecture designed
with BCBS 239 in mind. With a universal graph model, a common mechanism can be used for tracing any
decisions made by the system. With the capability to peer into legacy systems at the conceptual, API, object
code and source code level, RealSemantics synchronizes the map and the territory to take the chaos out of
IT, setting the stage for intelligent applications that satisfy customers and grow the top line.
In this chapter, we will look at a few paragraphs from the official BCBS 239 document and explain what
features and characteristics of Real Semantics satisfy their requirements.
IT InfrastructureforOrdinary and Extraordinary times
33. A bank should establish integrated 16 data taxonomies and architecture across the banking group, which
includes information on the characteristics of the data (metadata), as well as use of single identifiers and/or
unified naming conventions for data including legal entities, counterparties, customers and accounts.
(which assumes the associated footnote)
 Banks do not necessarily need to have one data model; rather, there should be robust automated
reconciliation procedures where multiple models are in use.
Real Semantics is based on a Multiple Model Architecture, which is precisely intended to interconnect
systems based upon different data models. This is possible because the RDF/K data model is flexible to itself
model any model in use by legacy systems. A single data model also supports a single model for metadata
about every concept, data record, user, IT artifact or other thing encountered.
Ontology2 has supported an ongoing program to improve the quality and state of knowledge about Legal
Entity Identifiers. As such, Real Semantics is ideal for reconciliation of various numeric identifiers as well as
natural languages names and addresses.
Roles and responsibilities should be established as they relateto the ownership and quality of risk data and
information for both the business and IT functions. The owners (business and IT functions), in partnership
with risk managers, should ensure there areadequate controls throughout the lifecycle of the data and for all
aspects of the technology infrastructure. The role of the business owner includes ensuring data is correctly
entered by the relevant front office unit, kept current and aligned with the data definitions, and also ensuring
that risk data aggregationcapabilities and risk reporting practices are consistent with firms€™policies.
 Real Semantics traceability makes control certain.
Real Semantics captures knowledge from workers across the organization from line workers to subject
matter experts, software developers, risk management experts and executive. With a structured process for
case management, misunderstandings can be understood, identified, repaired and/or documented. Policy
documents are captured directly into the system, linked to rule-based implementations accessible to auditors
and other professionals.
Supporting Accuracy and Integrity
As a precondition, a bank should have a "dictionary" of the concepts used, such that data is defined
consistently across an organization.
The RDF/K model used in RealSemantics not only represents common data structures used in the software
industry, but it works together with RDFS, OWL, SKOS and similar vocabularies to represent concepts used in
natural language and cognition. Real Semantics exceeds traditional tools in that it keeps trackof multiple
contexts (Multiple Model Architecture), setting the stage to record nuances of expression that are are
relevant to various communities, applications and business lines.
There should be an appropriate balance between automated and manual systems. Where professional
judgements are required, human intervention may be appropriate. For many other processes, a higher
degree of automation is desirable to reduce the risk of errors.
Real Semantics pushes the envelope of what can be automated because it is based on a systematic approach
to capturing natural languagedocumentation, implicit knowledge in the form of rules, and explicit knowledge
in the form of examples which can be used to validate rules, explain special cases, and train machine learning
models.
Human input is important, so RealSemantics participates in business process orchestration, sending difficult
tasks to humans as well as sending a statistical sample for review to support statistical quality control (think
Edward Deming, Phil Crosby, and Joseph Juran) to control, understand and improve quality with a focused
and efficient use of human effort.
Supervisors expect banks to document and explain all of their risk data aggregationprocesses whether
automated or manual (judgement based or otherwise). Documentation should include an explanation of the
appropriateness of any manual workarounds, a description of their criticality to the accuracyof risk data
aggregationand proposed actions to reduce the impact.
Experience shows that the structure of commonsense knowledge consists of rules and exceptions. In
conventional software development based on procedural or object-oriented techniques, the result is often
that a clean design becomes corrupted over time with a largenumber of "hacks" put in at the last minute
that areplaced randomly throughout the code base. The result is frequently a largesystem that is
unpredictable, unreliable, and hard to maintain.
 Real Semantics is based on the assumption that simple models are imperfect, so it provides facilities,
such as fact patches and rule patches to create neat layers that separate general purpose, industry
specific, company specific and while many organizations are struggling to meet regulatory
requirements.
 While many organizations are struggling to meet regulatory requirements, we see this as an
opportunity to find alpha, satisfy and delight customers, and grow the bottom line. If you
agree, contact us.. Changes are documented with a case management system, so that clear
documentation exists for both the automatic fast path as well as cases that are specialized or require
human intervention.
Unprecedented Adaptability
Supervisors expect banks to measure and monitor the accuracyof data and to develop appropriate escalation
channels and action plans to be in place to rectify poor data quality
Real Semantics flags quality problems automatically based upon both rules and discovery algorithms. In cases
where data accuracy is in question, a ticket is createdin the integrated case management system, so the
problem can be investigated, possibly confirmed, and referred to the proper person, with the proper
knowledge and skills, inside or outside of the organization -- leaving a paper trail which can be referred to
later.
Adaptability includes: (a) Data aggregationprocesses that are flexible and enable risk data to be aggregated
for assessment and quick decision-making; (b) Capabilities for data customization to user’s needs (e.g.
dashboards, key takeaways, anomalies), to drill down as needed, and to produce quick summary reports; (c)
Capabilities to incorporate new developments on the organization of the business and/or external factors
that influence the banks risk profile; and (d) Capabilities to incorporate changes in the regulatory framework.
Rapid and flexible processing is enabled in Real Semantics by an extensive knowledge base about real world
data and the problem domain. By bundling these capabilities into a structured process, the data cleaning that
takes 80 to 90% of the time of a typical analyst or "data scientist" is built into a repeatable process that
dramatically saves time.
Although Real Semantics can export data for use with any tools, it integrates with Kibi from our partner SIREn
Solutions, which combines classical BI queries with time series and full-text capabilities. Real Semantics
classifies documents, data records, and events, accurately extracting facts and concepts for an unmatched
capability for data exploration and dashboard creation.
With the Multiple Model Architecture, Real Semantics is designed from square one with the assumption that
data, code, documentation and other assets will be repurposed in multiple ways. With data lake capabilities
based on Hadoop and cloud computing, RealSemantics can get the resources to repeat and reproduce data
processing operations in case of any change of the data, rules, or other assumptions about the environment,
business organization or regulatory frameworks quickly.
Supervisors expect banks to be able to generatesubsets of data based on requested scenarios or resulting
from economic events. For example, a bank should be able to aggregaterisk data quickly on country credit
exposures 18 as of a specified date based on a list of countries, as well as industry credit exposures as of a
specified date based on a list of industry types across all business lines and geographic areas.
Real Semantics incorporates reasoning over large Linked Data knowledge bases covering topics such as
places, people, creative works, industry classifications as well as technology, chemistry, biology and other
fields that concern large numbers of creative entities. RealSemantics comes with the Ontology2 Spatial
Hierarchy, derived from Freebase (which Google bought to createthe Google Knowledge Graph).
Containing spatial relationships between millions of locations, as well in names in nearly 100 languages,
the Ontology2 Spatial Hierarchy is a global database that places business locations in postal codes, cities,
countries and other spatial subdivisions (such as US states and counties and Japanese prefectures) to identify
regions and regulatory jurisdictions to support sub reporting as necessary.
Next Steps
Many financial organizations see BCBS 239, Dodd-Frank, EMIR II and other regulations as a source of cost and
risk. We see it is a wake-up call to fix creaking IT infrastructures: while many while many organizations are
struggling to meet regulatory requirements, we see this as an opportunity to satisfy and delight customers,
while helping grow the bottom line.
Bill Freeman - Managing Director 1-774-301-1301 bill.freeman@ontology2.com
Paul Houle Chief Data Scientist paul.houle@ontology2.com

More Related Content

DOC
Vendor comparisons: the end game in business intelligence
Kishore Jethanandani, MBA, MA, MPhil,
 
PDF
Thinking space
Titos Matsakos
 
DOCX
CIO Published Article
Craig Harclerode
 
PDF
Information and Integration Management Vision
Colin Bell
 
PPT
Actionable Architecture
guestd3d2f49
 
PPTX
Incentius - Portfolio of Capabilities
Sujeet Pillai
 
PPT
Design On A Dime Dashboard Edition Laura Edell Gibbons 3
Laura Edell
 
PDF
Data dynamics in IoT Era
Paddy Ramanathan
 
Vendor comparisons: the end game in business intelligence
Kishore Jethanandani, MBA, MA, MPhil,
 
Thinking space
Titos Matsakos
 
CIO Published Article
Craig Harclerode
 
Information and Integration Management Vision
Colin Bell
 
Actionable Architecture
guestd3d2f49
 
Incentius - Portfolio of Capabilities
Sujeet Pillai
 
Design On A Dime Dashboard Edition Laura Edell Gibbons 3
Laura Edell
 
Data dynamics in IoT Era
Paddy Ramanathan
 

What's hot (17)

PPTX
Document Process Automation Solutions - Invensis Technologies
Invensis
 
PPTX
Guide to Business Intelligence
TechnologyAdvice
 
PDF
Canheit-2012-ITRepository
Doug Badger
 
PDF
Graph Database
Ashutosh Sable
 
PPTX
Intelligent Document Processing to Streamline Document Processing
10xDS - Exponential Digital Solutions
 
PDF
Information economics and big data
Mark Albala
 
PDF
Horizon DMS White Paper
AndreaMorris30
 
PDF
BIG DATA IN CLOUD COMPUTING REVIEW AND OPPORTUNITIES
ijcsit
 
PPTX
Business analytics
PenpaTsering1
 
PDF
Oracle Big Data Governance Webcast Charts
Jeffrey T. Pollock
 
PPT
Business intelligent
Migrant Systems
 
PDF
Document Automation for Everyday Efficiency
Chelsey Lambert
 
PDF
brocade-five-pillars-federal-data-centers-wp (1)
Vince Garr
 
PPTX
Module 1
Gaurav Malode
 
PPTX
AI and Marketing: Robot-proofing Your Job
Call Sumo
 
PDF
Architecting a-big-data-platform-for-analytics 24606569
Kun Le
 
PDF
Intro to big data and applications - day 1
Parviz Vakili
 
Document Process Automation Solutions - Invensis Technologies
Invensis
 
Guide to Business Intelligence
TechnologyAdvice
 
Canheit-2012-ITRepository
Doug Badger
 
Graph Database
Ashutosh Sable
 
Intelligent Document Processing to Streamline Document Processing
10xDS - Exponential Digital Solutions
 
Information economics and big data
Mark Albala
 
Horizon DMS White Paper
AndreaMorris30
 
BIG DATA IN CLOUD COMPUTING REVIEW AND OPPORTUNITIES
ijcsit
 
Business analytics
PenpaTsering1
 
Oracle Big Data Governance Webcast Charts
Jeffrey T. Pollock
 
Business intelligent
Migrant Systems
 
Document Automation for Everyday Efficiency
Chelsey Lambert
 
brocade-five-pillars-federal-data-centers-wp (1)
Vince Garr
 
Module 1
Gaurav Malode
 
AI and Marketing: Robot-proofing Your Job
Call Sumo
 
Architecting a-big-data-platform-for-analytics 24606569
Kun Le
 
Intro to big data and applications - day 1
Parviz Vakili
 
Ad

Similar to BCBS -By Ontology2 (20)

PPTX
Research methods group accelarating impact by sharing data
World Agroforestry (ICRAF)
 
PPTX
Semantic Applications for Financial Services
DavidSNewman
 
PDF
Domain Semantics
mlang22
 
PPTX
Data Model vs Ontology Development – a FIBO perspective | Mike Bennett
Connected Data World
 
PDF
Data and Computation Interoperability in Internet Services
Sergey Boldyrev
 
PDF
Data Quality Management With Semantic Technologies 1st Edition Christian Frbe...
kywxxdpmw956
 
PPTX
Ontologies for Emergency & Disaster Management
Stephane Fellah
 
PDF
NIC and the semantic web briefing paper
Michael Wilkinson
 
DOC
ICDMWorkshopProposal.doc
butest
 
PPTX
On the personalization of event-based systems
Opher Etzion
 
PDF
FAIR data_ Superior data visibility and reuse without warehousing.pdf
Alan Morrison
 
PPT
On the Emergent Semantic Web and Overlooked Issues - 2004
Yannis Kalfoglou
 
PDF
Finsec innov-acts-open banking-london080319-4web
innov-acts-ltd
 
PDF
Competency framework: engineers, statisticians, data scientists, librarians, ...
African Open Science Platform
 
PDF
Dc32644652
IJERA Editor
 
PDF
Sustainable Places 2015 - The OPTIMUS project
Álvaro Sicilia
 
PDF
Semantic solution impact analysis
Rajib Saha
 
PDF
Retail Banking - an ontological example by Lauren Madar
Lauren Madar
 
PDF
Alignment: Office of the Chief Data Officer & BCBS 239
Craig Milroy
 
PPTX
Coping with Data Variety in the Big Data Era: The Semantic Computing Approach
Andre Freitas
 
Research methods group accelarating impact by sharing data
World Agroforestry (ICRAF)
 
Semantic Applications for Financial Services
DavidSNewman
 
Domain Semantics
mlang22
 
Data Model vs Ontology Development – a FIBO perspective | Mike Bennett
Connected Data World
 
Data and Computation Interoperability in Internet Services
Sergey Boldyrev
 
Data Quality Management With Semantic Technologies 1st Edition Christian Frbe...
kywxxdpmw956
 
Ontologies for Emergency & Disaster Management
Stephane Fellah
 
NIC and the semantic web briefing paper
Michael Wilkinson
 
ICDMWorkshopProposal.doc
butest
 
On the personalization of event-based systems
Opher Etzion
 
FAIR data_ Superior data visibility and reuse without warehousing.pdf
Alan Morrison
 
On the Emergent Semantic Web and Overlooked Issues - 2004
Yannis Kalfoglou
 
Finsec innov-acts-open banking-london080319-4web
innov-acts-ltd
 
Competency framework: engineers, statisticians, data scientists, librarians, ...
African Open Science Platform
 
Dc32644652
IJERA Editor
 
Sustainable Places 2015 - The OPTIMUS project
Álvaro Sicilia
 
Semantic solution impact analysis
Rajib Saha
 
Retail Banking - an ontological example by Lauren Madar
Lauren Madar
 
Alignment: Office of the Chief Data Officer & BCBS 239
Craig Milroy
 
Coping with Data Variety in the Big Data Era: The Semantic Computing Approach
Andre Freitas
 
Ad

BCBS -By Ontology2

  • 1. Real Semantics: Being Proactive about BCBS 239 Presented by Ontology2 -semantic graph models, dataquality and easy multi-model management Overview BCBS 239 is a standard set by the Bank for International Settlements which applies to banks which are systematically important, either on a global or national basis. It raises the bar for IT systems and the challenge of compliance is a major concern of both IT organizations and the C-Suite. A product of years of development at Ontology2, RealSemantics could be the first IT architecture designed with BCBS 239 in mind. With a universal graph model, a common mechanism can be used for tracing any decisions made by the system. With the capability to peer into legacy systems at the conceptual, API, object code and source code level, RealSemantics synchronizes the map and the territory to take the chaos out of IT, setting the stage for intelligent applications that satisfy customers and grow the top line. In this chapter, we will look at a few paragraphs from the official BCBS 239 document and explain what features and characteristics of Real Semantics satisfy their requirements. IT InfrastructureforOrdinary and Extraordinary times 33. A bank should establish integrated 16 data taxonomies and architecture across the banking group, which includes information on the characteristics of the data (metadata), as well as use of single identifiers and/or unified naming conventions for data including legal entities, counterparties, customers and accounts. (which assumes the associated footnote)  Banks do not necessarily need to have one data model; rather, there should be robust automated reconciliation procedures where multiple models are in use. Real Semantics is based on a Multiple Model Architecture, which is precisely intended to interconnect systems based upon different data models. This is possible because the RDF/K data model is flexible to itself model any model in use by legacy systems. A single data model also supports a single model for metadata about every concept, data record, user, IT artifact or other thing encountered. Ontology2 has supported an ongoing program to improve the quality and state of knowledge about Legal Entity Identifiers. As such, Real Semantics is ideal for reconciliation of various numeric identifiers as well as natural languages names and addresses.
  • 2. Roles and responsibilities should be established as they relateto the ownership and quality of risk data and information for both the business and IT functions. The owners (business and IT functions), in partnership with risk managers, should ensure there areadequate controls throughout the lifecycle of the data and for all aspects of the technology infrastructure. The role of the business owner includes ensuring data is correctly entered by the relevant front office unit, kept current and aligned with the data definitions, and also ensuring that risk data aggregationcapabilities and risk reporting practices are consistent with firms€™policies.  Real Semantics traceability makes control certain. Real Semantics captures knowledge from workers across the organization from line workers to subject matter experts, software developers, risk management experts and executive. With a structured process for case management, misunderstandings can be understood, identified, repaired and/or documented. Policy documents are captured directly into the system, linked to rule-based implementations accessible to auditors and other professionals. Supporting Accuracy and Integrity As a precondition, a bank should have a "dictionary" of the concepts used, such that data is defined consistently across an organization. The RDF/K model used in RealSemantics not only represents common data structures used in the software industry, but it works together with RDFS, OWL, SKOS and similar vocabularies to represent concepts used in natural language and cognition. Real Semantics exceeds traditional tools in that it keeps trackof multiple contexts (Multiple Model Architecture), setting the stage to record nuances of expression that are are relevant to various communities, applications and business lines. There should be an appropriate balance between automated and manual systems. Where professional judgements are required, human intervention may be appropriate. For many other processes, a higher degree of automation is desirable to reduce the risk of errors. Real Semantics pushes the envelope of what can be automated because it is based on a systematic approach to capturing natural languagedocumentation, implicit knowledge in the form of rules, and explicit knowledge in the form of examples which can be used to validate rules, explain special cases, and train machine learning models. Human input is important, so RealSemantics participates in business process orchestration, sending difficult tasks to humans as well as sending a statistical sample for review to support statistical quality control (think Edward Deming, Phil Crosby, and Joseph Juran) to control, understand and improve quality with a focused and efficient use of human effort. Supervisors expect banks to document and explain all of their risk data aggregationprocesses whether automated or manual (judgement based or otherwise). Documentation should include an explanation of the
  • 3. appropriateness of any manual workarounds, a description of their criticality to the accuracyof risk data aggregationand proposed actions to reduce the impact. Experience shows that the structure of commonsense knowledge consists of rules and exceptions. In conventional software development based on procedural or object-oriented techniques, the result is often that a clean design becomes corrupted over time with a largenumber of "hacks" put in at the last minute that areplaced randomly throughout the code base. The result is frequently a largesystem that is unpredictable, unreliable, and hard to maintain.  Real Semantics is based on the assumption that simple models are imperfect, so it provides facilities, such as fact patches and rule patches to create neat layers that separate general purpose, industry specific, company specific and while many organizations are struggling to meet regulatory requirements.  While many organizations are struggling to meet regulatory requirements, we see this as an opportunity to find alpha, satisfy and delight customers, and grow the bottom line. If you agree, contact us.. Changes are documented with a case management system, so that clear documentation exists for both the automatic fast path as well as cases that are specialized or require human intervention. Unprecedented Adaptability Supervisors expect banks to measure and monitor the accuracyof data and to develop appropriate escalation channels and action plans to be in place to rectify poor data quality Real Semantics flags quality problems automatically based upon both rules and discovery algorithms. In cases where data accuracy is in question, a ticket is createdin the integrated case management system, so the problem can be investigated, possibly confirmed, and referred to the proper person, with the proper knowledge and skills, inside or outside of the organization -- leaving a paper trail which can be referred to later. Adaptability includes: (a) Data aggregationprocesses that are flexible and enable risk data to be aggregated for assessment and quick decision-making; (b) Capabilities for data customization to user’s needs (e.g. dashboards, key takeaways, anomalies), to drill down as needed, and to produce quick summary reports; (c) Capabilities to incorporate new developments on the organization of the business and/or external factors that influence the banks risk profile; and (d) Capabilities to incorporate changes in the regulatory framework.
  • 4. Rapid and flexible processing is enabled in Real Semantics by an extensive knowledge base about real world data and the problem domain. By bundling these capabilities into a structured process, the data cleaning that takes 80 to 90% of the time of a typical analyst or "data scientist" is built into a repeatable process that dramatically saves time. Although Real Semantics can export data for use with any tools, it integrates with Kibi from our partner SIREn Solutions, which combines classical BI queries with time series and full-text capabilities. Real Semantics classifies documents, data records, and events, accurately extracting facts and concepts for an unmatched capability for data exploration and dashboard creation. With the Multiple Model Architecture, Real Semantics is designed from square one with the assumption that data, code, documentation and other assets will be repurposed in multiple ways. With data lake capabilities based on Hadoop and cloud computing, RealSemantics can get the resources to repeat and reproduce data processing operations in case of any change of the data, rules, or other assumptions about the environment, business organization or regulatory frameworks quickly. Supervisors expect banks to be able to generatesubsets of data based on requested scenarios or resulting from economic events. For example, a bank should be able to aggregaterisk data quickly on country credit exposures 18 as of a specified date based on a list of countries, as well as industry credit exposures as of a specified date based on a list of industry types across all business lines and geographic areas. Real Semantics incorporates reasoning over large Linked Data knowledge bases covering topics such as places, people, creative works, industry classifications as well as technology, chemistry, biology and other fields that concern large numbers of creative entities. RealSemantics comes with the Ontology2 Spatial Hierarchy, derived from Freebase (which Google bought to createthe Google Knowledge Graph). Containing spatial relationships between millions of locations, as well in names in nearly 100 languages, the Ontology2 Spatial Hierarchy is a global database that places business locations in postal codes, cities, countries and other spatial subdivisions (such as US states and counties and Japanese prefectures) to identify regions and regulatory jurisdictions to support sub reporting as necessary. Next Steps Many financial organizations see BCBS 239, Dodd-Frank, EMIR II and other regulations as a source of cost and risk. We see it is a wake-up call to fix creaking IT infrastructures: while many while many organizations are struggling to meet regulatory requirements, we see this as an opportunity to satisfy and delight customers, while helping grow the bottom line. Bill Freeman - Managing Director 1-774-301-1301 [email protected] Paul Houle Chief Data Scientist [email protected]