©
Fraunhofer
1
ISCRAM 2013, Baden-Baden, Germany
The Seven Main Challenges of an Early
Warning System Architecture
J. Moßgraber, F. Chaves(1), S. Middleton, Z.
Zlatev(2), R. Tao(3)
(1) Fraunhofer Institute of Optronics, System
Technologies and Image Exploitation (IOSB), Germany
(2) IT Innovation Centre, Southampton, UK
(3) Department of Electronic Engineering, Queen Mary
University of London, UK
©
Fraunhofer
2
ISCRAM 2013, Baden-Baden, Germany
Sensors
Decision
Magic
Early Warning System Architecture
©
Fraunhofer
3
ISCRAM 2013, Baden-Baden, Germany
Downhole Drilling
 Purpose
 Exploration & exploitation
of oil & gas
 Retrieval of geothermal
energy
 Controlled disposal of
carbon dioxide
 Scientific drilling
 Safety Constraints
 Prevent crew, equipment
and environment from
injury, damage and
pollution
 Prevent Crises
©
Fraunhofer
4
ISCRAM 2013, Baden-Baden, Germany
Tsunami Warning Systems
©
Fraunhofer
5
ISCRAM 2013, Baden-Baden, Germany
System-of-Systems
 Operational / managerial Independence of
the Elements
 Different governments and institutions
like Warning Centres, Task Forces,
Scientific Institutions, Data Centres, …
 Evolutionary Development
 Integration of new Sensor networks,
analysis algorithms, ..
 Emergent Behaviour
 Combines the knowledge of parts
 Geographic Distribution
 Tsunami Early Warning System for the
Euro-Mediterranean area (> 20 national
and at least one regional centre).
©
Fraunhofer
6
ISCRAM 2013, Baden-Baden, Germany
System-of-Systems (cont.)
Communication
is the key!
1st site
Broker Cluster
2nd site
Broker Cluster
3rd site
Broker Cluster
©
Fraunhofer
7
ISCRAM 2013, Baden-Baden, Germany
The 7 Challenges
1.Build a scalable
communication layer for
a SoS
2.Build a resilient
communication layer for
a SoS
3.Efficiently publish
large volumes of
semantically rich sensor
data
4.Scalable and high
performance storage of
large distributed
datasets
5.Handling federated
multi-domain
heterogeneous data
6.Discovery of resources
Sensors
Decision
Magic
©
Fraunhofer
8
ISCRAM 2013, Baden-Baden, Germany
1) Build a scalable communication layer
 Requirements
 Open, heterogeneity, standard, language/OS
independent
 Design decision
 Scalable communication using a Message-
oriented Middleware (MOM): m-m
 Single MOM technology (Apache QPID): supports
Advanced Message Queuing Protocol (AMQP)
 Discussion
 API standard, e.g., JMS / not a wireline
standard, i.e., AMQP as supported by Qpid
 Text-based protocols, e.g., STOMP / not a
binary protocol that is ↑ scalable
 Point-point systems, e.g., P2P, SOA / not
generic M-M pub-sub model
 Proprietary MOMs / Not open-source, more
difficult to enhance & instrument to monitor
8
©
Fraunhofer
9
ISCRAM 2013, Baden-Baden, Germany
2) Build a resilient communication layer
 Requirements
 Broker Failure, e.g., broker crash due to
underlay system failure
 Link Failure, e.g., low QoS in comms links or
link broken
 Client Failure, e.g., subscriber client failed
 Design decision
 Broker Mirroring, i.e., messages are
replicated to both primary broker and mirror
broker
 Overlay Routing, i.e., messages are auto-
switched to another overlay routing path
according to the link status
 Durable Queue is selected to temporary stored
the messages when sub has failed.
 Discussion
 SOA, distributed stream processing: have no
inherent resilience
9
©
Fraunhofer
10
ISCRAM 2013, Baden-Baden, Germany
Twitter
Crawler
Unconventional / Human Sensors
Source: http://xkcd.
©
Fraunhofer
11
ISCRAM 2013, Baden-Baden, Germany
3) Efficiently publish large volumes of semantically rich sensor data
 Design decision
 Publish data to MOM & metadata to a semantic
registry - fast, expressive metadata
 Combine different approaches (there is no „one
size fits all“)
 SWE O&M XML messages with data embedded in
them – slow, expressive metadata
 Support existing formats: WITS0 JSON encoded
messages – fast, limited metadata
 Binary HDF5 / netCDF – fastest, limited
metadata
 Discussion
 Database query
 via SSH tunnel – tight coupling of machines,
SQL security issues
 HTTP or SQL request over MOM - QPID does not
easily support this, SQL security issues
11
©
Fraunhofer
12
ISCRAM 2013, Baden-Baden, Germany
4) Scalable and high performance storage of large distributed
datasets
Design decision
 Working datasets
 Hybrid databases [OWLIM for
metadata, MySQL for data] –
semantically rich and efficient
queries, clients must understand two
protocols (SPARQL & SQL)
 File storage for larger binary data
– no size limits, any format,
difficult to query data
 Archived datasets
 HDF5 strategy for longer term
storage – compressed format with
embedded metadata
12
©
Fraunhofer
13
ISCRAM 2013, Baden-Baden, Germany
4) Scalable and high performance storage of large distributed
datasets (cont.)
Discussion
 Triple stores – good metadata query
support, poor scalability, standard
SPARQL
 Relational databases – fast, ridgid
structure, poor metadata query support,
standard SQL
 NoSQL type solutions [column stores etc.]
– suitable for our proposed hybrid
solution
 Currently work on Apache Cassandra
integration
 Map Reduce solutions – good for
distributed processing, efficient for
very large datasets, complex to setup
[overkill for real-time data with short
processing time horizons] 13
©
Fraunhofer
14
ISCRAM 2013, Baden-Baden, Germany
5) Handling federated multi-domain heterogeneous data
 Design decision
 Broker pattern for access / transformation of
data between domains – scalable, slow adding
an extra „hop‟ to workflow
 Domain ontologies in semantic registries –
scalable, requires a lookup step
 Federated data queries – scalable, extra
aggregation work for clients
 Discussion
 Data sources and/or apps map data to a global
ontology – efficient, not practical to get
agreement between domains
 Data sources and/or apps locally map between
data models – does not scale, inconsistencies
likely
 Automatic ontology alignment services –
scalable, difficult & error prone, slow adding
an extra „hop‟ to workflow
14
©
Fraunhofer
15
ISCRAM 2013, Baden-Baden, Germany
6) Discovery of resources in a geo-distributed SoS
Design decision
 Multiple semantic registries hosted
by stakeholders – scalable, modular
 Separation of frontend/s and ontology
store/s
 Shared ontology core (classes,
relations, attributes, design
patterns based on SSNO),
registry-specific sub-classes and
individuals – flexibility for
adaptation to different domains
 Data and services described in
semantically rich ways, allowing
search/browse by both humans and
machines – multiple interfaces for
established protocols/standards15
©
Fraunhofer
16
ISCRAM 2013, Baden-Baden, Germany
6) Discovery of resources in a geo-distributed SoS
(cont.)
Discussion
 Classical search-engines and
catalogues – many out-of-the-box
solutions,
limited “semantic” search-
capabilities,
 Monolithic systems – optimized and
good performance for some types of
applications,
many additional tools but dependency
between components
 One central semantic registry – no
“synchronisation” of ontology core
necessary, bottleneck if central
registry not available
16
©
Fraunhofer
17
ISCRAM 2013, Baden-Baden, Germany
7) Coordination of work between geo-distributed
systems
 Design decision
 Decision tables & authoring tool for end users
– self-documenting rule-sets and intuitive
easy-to-use interfaces for non IT-experts,
many available tools; comfortable mapping of
“models” (sets of variables and rules) to
ontology elements not yet
standardized/available,
 Workflow engine(s) coordinating federated –
message-based event processing for complex and
rich choreographies, requirement-specific
flexible adaptation of (standard) workflows
 Discussion
 Ontology reasoning – powerful and expressive
reasoning und rule systems; require high-level
of expertise, performance and (quality of)
results depend on size/complexity of
ontology, persistent storage of big ontologies
still a problem.
17
©
Fraunhofer
18
ISCRAM 2013, Baden-Baden, Germany
System-of-systems
1st site
Broker Cluster
2nd site
Broker Cluster
3rd site
Broker Cluster
©
Fraunhofer
19
ISCRAM 2013, Baden-Baden, Germany
Data Source(s)
1st site
MOM
Data Source(s)
Data Source(s)
Feeder Storage
Historic DataCached Data
Semantic
Registry
Workflow Service
Data Source(s)
Data Source(s)
Processing
Service
R
Receive
realtime data
R
Get cached data and
parameters; write results
User Interface
R
Cache / store data
Query
R
Steers
R
Receive
notifications
R
Invoke &
handle results
RR
Downstream
Dissemination
R
R
Register & request topic
Generic TRIDEC Architecture
©
Fraunhofer
20
ISCRAM 2013, Baden-Baden, Germany
Example Workflow for Tsunami Early Warning
©
Fraunhofer
21
ISCRAM 2013, Baden-Baden, Germany
Data Source(s)
1st site
MOM
Data Source(s)
Data Source(s)
Feeder Storage
Historic DataCached Data
Semantic
Registry
Workflow Service
Data Source(s)
Data Source(s)
Processing
Service
R
Receive
realtime data
R
Get cached data and
parameters; write results
User Interface
R
Cache / store data
Query
R
Steers
R
Receive
notifications
R
Invoke &
handle results
RR
Downstream
Dissemination
R
R
Register & request topic
Generic TRIDEC Architecture
©
Fraunhofer
22
ISCRAM 2013, Baden-Baden, Germany
Does it work?
 Tsunami
 TRIDEC software deployed at
Instituto Português do Mar e
da Atmosfera (IPMA) in
Lissabon and the Kandilli
Observatory and Earthquake
Research Institute (KOERI) in
Istanbul for testing purposes
 Two scenarios in the
European-wide Tsunami
exercise NEAMWave2012 were
successfully validated
 New functionalities “Centre-
to-Centre communication” via
software systems between
Turkey and Portugal and
eyewitness reports sent from
mobile devices via apps were
available for the first time.
©
Fraunhofer
23
ISCRAM 2013, Baden-Baden, Germany
Conclusion and Future Work
 Main problems when dealing with the
architecture of a SoS presented
 requires a scalable and resilient
communication layer
 large amounts of data need to be
published and processed
 requires a scalable storage concept
 respect the geo-distributed nature
 Future Work
 improving the resilience and workload
allocation of the MOM
 improve the resilience of the Semantic
Registry by providing a replication
mechanism
 research the federated access to Big Data
©
Fraunhofer
24
ISCRAM 2013, Baden-Baden, Germany
Acknowledgements
The presented work is done in collaboration
with consortium of the TRIDEC project
which is supported by the European
Commission under the 7th Framework
Programme
(ICT-2009.4.3 Intelligent Information Management
Project Reference: 258723)

More Related Content

PDF
Internet data mining 2006
PPTX
Transient and persistent RDF views over relational databases in the context o...
PDF
Advancing life sciences with IBM reference architecture for genomics
PDF
A New Architecture for Group Replication in Data Grid
PDF
2013 NIST Big Data Subgroups Combined Outputs
DOCX
Robust module based data management
PPTX
Tridec tsunami-decision-support
Internet data mining 2006
Transient and persistent RDF views over relational databases in the context o...
Advancing life sciences with IBM reference architecture for genomics
A New Architecture for Group Replication in Data Grid
2013 NIST Big Data Subgroups Combined Outputs
Robust module based data management
Tridec tsunami-decision-support

Viewers also liked (20)

PPT
Implementation and integration of GPU-accelerated easyWave for instant tsunam...
PDF
UNESCAP_Galle_Report
DOCX
PDF
PDF
TSUNAMI EMERGENCY RESPONSE SYSTEM USING GEO-INFORMATION TECHNOLOGY ALONG THE ...
PDF
report
PDF
TSUNAMI EARLY WARNING SYSTEM ALONG THE GUJARAT COAST, INDIA
DOC
Tsunami on 26th December,2004 Current National Disaster Management System
PDF
Southeastern Pacific Tsunami Warning Solutions
PPTX
The TRIDEC system in the NEAMWave12 exercise
PPT
The Evolution of Disaster Early Warning Systems in the TRIDEC Project
DOCX
Seminar technical abstract
DOCX
Tsunami warning system
PPT
Tsunami warning system
DOCX
tsunami warning system (synofsis)
PPTX
Tsunami Warning System
PPTX
Tsunami warning system
PPTX
Tsunami warning system
PPTX
Tsunami warning system
Implementation and integration of GPU-accelerated easyWave for instant tsunam...
UNESCAP_Galle_Report
TSUNAMI EMERGENCY RESPONSE SYSTEM USING GEO-INFORMATION TECHNOLOGY ALONG THE ...
report
TSUNAMI EARLY WARNING SYSTEM ALONG THE GUJARAT COAST, INDIA
Tsunami on 26th December,2004 Current National Disaster Management System
Southeastern Pacific Tsunami Warning Solutions
The TRIDEC system in the NEAMWave12 exercise
The Evolution of Disaster Early Warning Systems in the TRIDEC Project
Seminar technical abstract
Tsunami warning system
Tsunami warning system
tsunami warning system (synofsis)
Tsunami Warning System
Tsunami warning system
Tsunami warning system
Tsunami warning system
Ad

Similar to The Seven Main Challenges of an Early Warning System Architecture (20)

PPT
Semantics in Sensor Networks
PPTX
Empowering Transformational Science
PPTX
Ogce Workflow Suite
PPTX
Streaming HYpothesis REasoning
PPT
Waters Grid & HPC Course
ODP
Smartblitzmerker
PDF
mapreduce_presentation
PDF
Paper id 25201431
PPT
SciDB : Open Source Data Management System for Data-Intensive Scientific Anal...
PPTX
Red Hat Storage Day Seattle: Stretching A Gluster Cluster for Resilient Messa...
PPTX
240705-랩소개-전산학부-석사신입생.pptx
PPTX
Streaming Hypothesis Reasoning - William Smith, Jan 2016
PDF
Hdf5
PDF
Distributed Systems in Data Engineering
PPTX
New Business Applications Powered by In-Memory Technology @MIT Forum for Supp...
PDF
Memory-Driven Near-Data Acceleration and its application to DOME/SKA
PPT
Shifting the Burden from the User to the Data Provider
PPTX
Data management presentation
PPTX
SSG4Env EGU2010
PDF
Big Data Expo 2015 - Ureason
Semantics in Sensor Networks
Empowering Transformational Science
Ogce Workflow Suite
Streaming HYpothesis REasoning
Waters Grid & HPC Course
Smartblitzmerker
mapreduce_presentation
Paper id 25201431
SciDB : Open Source Data Management System for Data-Intensive Scientific Anal...
Red Hat Storage Day Seattle: Stretching A Gluster Cluster for Resilient Messa...
240705-랩소개-전산학부-석사신입생.pptx
Streaming Hypothesis Reasoning - William Smith, Jan 2016
Hdf5
Distributed Systems in Data Engineering
New Business Applications Powered by In-Memory Technology @MIT Forum for Supp...
Memory-Driven Near-Data Acceleration and its application to DOME/SKA
Shifting the Burden from the User to the Data Provider
Data management presentation
SSG4Env EGU2010
Big Data Expo 2015 - Ureason
Ad

More from streamspotter (20)

PPTX
Understanding Crises: Investigating Organizational Safety Culture by Combinin...
PPTX
ASC Model: A Process Model for the Evaluation of Simulated Field Exercises in...
PPTX
Comparing Performance and Situation Awareness in USAR Unit Tasks in a virtual...
PPTX
Towards a Model-Based Analysis of Place-Related Information in Disaster Respo...
PPTX
Information Infrastructure for Crisis Response Coordination: A Study of local...
PPT
Validating Procedural Knowledge in the Open Virtual Collaboration Environment
PPTX
Context-Based Knowledge Fusion Patterns in Decision Support System for Emerge...
PPTX
Exploring Shared Situational Awareness using Serious Gaming in Supply Chain D...
PPSX
LVC Training Environment for Strategic and Tactical Emergency Operations
PPT
Ethical Challenges of Participatory Sensing for Crisis Information Management
PPT
The Impact of IT on the Management of Mass Casualty Incidents in Germany
PPTX
Towards a Knowledge-Intensive Serious Game for Training Emergency Medical Ser...
PPTX
Communication Interface for Virtual Training of Crisis Management
PPTX
Optimization Modeling and Decision Support for Wireless Infrastructure Deploy...
PPTX
A System Dynamics Model of the 2005 Hatlestad Slide Emergency Management
PPTX
Inter-organizational Collaboration Structures during Emergency Response: A Ca...
PPTX
Unexpected Effects of Rescue Robots’ Team-Membership in a virtual Environment
PPTX
A Typology to facilitate Multi-Agency Coordination
PPTX
Exercises for Crisis Management Training in intra-organizational Settings
PPTX
A Novel Architecture for Disaster Response Workflow Management Systems
Understanding Crises: Investigating Organizational Safety Culture by Combinin...
ASC Model: A Process Model for the Evaluation of Simulated Field Exercises in...
Comparing Performance and Situation Awareness in USAR Unit Tasks in a virtual...
Towards a Model-Based Analysis of Place-Related Information in Disaster Respo...
Information Infrastructure for Crisis Response Coordination: A Study of local...
Validating Procedural Knowledge in the Open Virtual Collaboration Environment
Context-Based Knowledge Fusion Patterns in Decision Support System for Emerge...
Exploring Shared Situational Awareness using Serious Gaming in Supply Chain D...
LVC Training Environment for Strategic and Tactical Emergency Operations
Ethical Challenges of Participatory Sensing for Crisis Information Management
The Impact of IT on the Management of Mass Casualty Incidents in Germany
Towards a Knowledge-Intensive Serious Game for Training Emergency Medical Ser...
Communication Interface for Virtual Training of Crisis Management
Optimization Modeling and Decision Support for Wireless Infrastructure Deploy...
A System Dynamics Model of the 2005 Hatlestad Slide Emergency Management
Inter-organizational Collaboration Structures during Emergency Response: A Ca...
Unexpected Effects of Rescue Robots’ Team-Membership in a virtual Environment
A Typology to facilitate Multi-Agency Coordination
Exercises for Crisis Management Training in intra-organizational Settings
A Novel Architecture for Disaster Response Workflow Management Systems

Recently uploaded (20)

PPTX
Microsoft Excel 365/2024 Beginner's training
PDF
5-Ways-AI-is-Revolutionizing-Telecom-Quality-Engineering.pdf
PDF
The influence of sentiment analysis in enhancing early warning system model f...
PDF
Transform-Your-Streaming-Platform-with-AI-Driven-Quality-Engineering.pdf
PDF
A review of recent deep learning applications in wood surface defect identifi...
PDF
Improvisation in detection of pomegranate leaf disease using transfer learni...
PDF
sbt 2.0: go big (Scala Days 2025 edition)
DOCX
Basics of Cloud Computing - Cloud Ecosystem
PDF
Produktkatalog für HOBO Datenlogger, Wetterstationen, Sensoren, Software und ...
PDF
Accessing-Finance-in-Jordan-MENA 2024 2025.pdf
PDF
UiPath Agentic Automation session 1: RPA to Agents
PDF
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
PDF
CXOs-Are-you-still-doing-manual-DevOps-in-the-age-of-AI.pdf
PDF
The-Future-of-Automotive-Quality-is-Here-AI-Driven-Engineering.pdf
PPT
Galois Field Theory of Risk: A Perspective, Protocol, and Mathematical Backgr...
PDF
Transform-Your-Supply-Chain-with-AI-Driven-Quality-Engineering.pdf
PPTX
Custom Battery Pack Design Considerations for Performance and Safety
PDF
The-2025-Engineering-Revolution-AI-Quality-and-DevOps-Convergence.pdf
DOCX
search engine optimization ppt fir known well about this
PDF
giants, standing on the shoulders of - by Daniel Stenberg
Microsoft Excel 365/2024 Beginner's training
5-Ways-AI-is-Revolutionizing-Telecom-Quality-Engineering.pdf
The influence of sentiment analysis in enhancing early warning system model f...
Transform-Your-Streaming-Platform-with-AI-Driven-Quality-Engineering.pdf
A review of recent deep learning applications in wood surface defect identifi...
Improvisation in detection of pomegranate leaf disease using transfer learni...
sbt 2.0: go big (Scala Days 2025 edition)
Basics of Cloud Computing - Cloud Ecosystem
Produktkatalog für HOBO Datenlogger, Wetterstationen, Sensoren, Software und ...
Accessing-Finance-in-Jordan-MENA 2024 2025.pdf
UiPath Agentic Automation session 1: RPA to Agents
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
CXOs-Are-you-still-doing-manual-DevOps-in-the-age-of-AI.pdf
The-Future-of-Automotive-Quality-is-Here-AI-Driven-Engineering.pdf
Galois Field Theory of Risk: A Perspective, Protocol, and Mathematical Backgr...
Transform-Your-Supply-Chain-with-AI-Driven-Quality-Engineering.pdf
Custom Battery Pack Design Considerations for Performance and Safety
The-2025-Engineering-Revolution-AI-Quality-and-DevOps-Convergence.pdf
search engine optimization ppt fir known well about this
giants, standing on the shoulders of - by Daniel Stenberg

The Seven Main Challenges of an Early Warning System Architecture

  • 1. © Fraunhofer 1 ISCRAM 2013, Baden-Baden, Germany The Seven Main Challenges of an Early Warning System Architecture J. Moßgraber, F. Chaves(1), S. Middleton, Z. Zlatev(2), R. Tao(3) (1) Fraunhofer Institute of Optronics, System Technologies and Image Exploitation (IOSB), Germany (2) IT Innovation Centre, Southampton, UK (3) Department of Electronic Engineering, Queen Mary University of London, UK
  • 2. © Fraunhofer 2 ISCRAM 2013, Baden-Baden, Germany Sensors Decision Magic Early Warning System Architecture
  • 3. © Fraunhofer 3 ISCRAM 2013, Baden-Baden, Germany Downhole Drilling  Purpose  Exploration & exploitation of oil & gas  Retrieval of geothermal energy  Controlled disposal of carbon dioxide  Scientific drilling  Safety Constraints  Prevent crew, equipment and environment from injury, damage and pollution  Prevent Crises
  • 4. © Fraunhofer 4 ISCRAM 2013, Baden-Baden, Germany Tsunami Warning Systems
  • 5. © Fraunhofer 5 ISCRAM 2013, Baden-Baden, Germany System-of-Systems  Operational / managerial Independence of the Elements  Different governments and institutions like Warning Centres, Task Forces, Scientific Institutions, Data Centres, …  Evolutionary Development  Integration of new Sensor networks, analysis algorithms, ..  Emergent Behaviour  Combines the knowledge of parts  Geographic Distribution  Tsunami Early Warning System for the Euro-Mediterranean area (> 20 national and at least one regional centre).
  • 6. © Fraunhofer 6 ISCRAM 2013, Baden-Baden, Germany System-of-Systems (cont.) Communication is the key! 1st site Broker Cluster 2nd site Broker Cluster 3rd site Broker Cluster
  • 7. © Fraunhofer 7 ISCRAM 2013, Baden-Baden, Germany The 7 Challenges 1.Build a scalable communication layer for a SoS 2.Build a resilient communication layer for a SoS 3.Efficiently publish large volumes of semantically rich sensor data 4.Scalable and high performance storage of large distributed datasets 5.Handling federated multi-domain heterogeneous data 6.Discovery of resources Sensors Decision Magic
  • 8. © Fraunhofer 8 ISCRAM 2013, Baden-Baden, Germany 1) Build a scalable communication layer  Requirements  Open, heterogeneity, standard, language/OS independent  Design decision  Scalable communication using a Message- oriented Middleware (MOM): m-m  Single MOM technology (Apache QPID): supports Advanced Message Queuing Protocol (AMQP)  Discussion  API standard, e.g., JMS / not a wireline standard, i.e., AMQP as supported by Qpid  Text-based protocols, e.g., STOMP / not a binary protocol that is ↑ scalable  Point-point systems, e.g., P2P, SOA / not generic M-M pub-sub model  Proprietary MOMs / Not open-source, more difficult to enhance & instrument to monitor 8
  • 9. © Fraunhofer 9 ISCRAM 2013, Baden-Baden, Germany 2) Build a resilient communication layer  Requirements  Broker Failure, e.g., broker crash due to underlay system failure  Link Failure, e.g., low QoS in comms links or link broken  Client Failure, e.g., subscriber client failed  Design decision  Broker Mirroring, i.e., messages are replicated to both primary broker and mirror broker  Overlay Routing, i.e., messages are auto- switched to another overlay routing path according to the link status  Durable Queue is selected to temporary stored the messages when sub has failed.  Discussion  SOA, distributed stream processing: have no inherent resilience 9
  • 10. © Fraunhofer 10 ISCRAM 2013, Baden-Baden, Germany Twitter Crawler Unconventional / Human Sensors Source: http://xkcd.
  • 11. © Fraunhofer 11 ISCRAM 2013, Baden-Baden, Germany 3) Efficiently publish large volumes of semantically rich sensor data  Design decision  Publish data to MOM & metadata to a semantic registry - fast, expressive metadata  Combine different approaches (there is no „one size fits all“)  SWE O&M XML messages with data embedded in them – slow, expressive metadata  Support existing formats: WITS0 JSON encoded messages – fast, limited metadata  Binary HDF5 / netCDF – fastest, limited metadata  Discussion  Database query  via SSH tunnel – tight coupling of machines, SQL security issues  HTTP or SQL request over MOM - QPID does not easily support this, SQL security issues 11
  • 12. © Fraunhofer 12 ISCRAM 2013, Baden-Baden, Germany 4) Scalable and high performance storage of large distributed datasets Design decision  Working datasets  Hybrid databases [OWLIM for metadata, MySQL for data] – semantically rich and efficient queries, clients must understand two protocols (SPARQL & SQL)  File storage for larger binary data – no size limits, any format, difficult to query data  Archived datasets  HDF5 strategy for longer term storage – compressed format with embedded metadata 12
  • 13. © Fraunhofer 13 ISCRAM 2013, Baden-Baden, Germany 4) Scalable and high performance storage of large distributed datasets (cont.) Discussion  Triple stores – good metadata query support, poor scalability, standard SPARQL  Relational databases – fast, ridgid structure, poor metadata query support, standard SQL  NoSQL type solutions [column stores etc.] – suitable for our proposed hybrid solution  Currently work on Apache Cassandra integration  Map Reduce solutions – good for distributed processing, efficient for very large datasets, complex to setup [overkill for real-time data with short processing time horizons] 13
  • 14. © Fraunhofer 14 ISCRAM 2013, Baden-Baden, Germany 5) Handling federated multi-domain heterogeneous data  Design decision  Broker pattern for access / transformation of data between domains – scalable, slow adding an extra „hop‟ to workflow  Domain ontologies in semantic registries – scalable, requires a lookup step  Federated data queries – scalable, extra aggregation work for clients  Discussion  Data sources and/or apps map data to a global ontology – efficient, not practical to get agreement between domains  Data sources and/or apps locally map between data models – does not scale, inconsistencies likely  Automatic ontology alignment services – scalable, difficult & error prone, slow adding an extra „hop‟ to workflow 14
  • 15. © Fraunhofer 15 ISCRAM 2013, Baden-Baden, Germany 6) Discovery of resources in a geo-distributed SoS Design decision  Multiple semantic registries hosted by stakeholders – scalable, modular  Separation of frontend/s and ontology store/s  Shared ontology core (classes, relations, attributes, design patterns based on SSNO), registry-specific sub-classes and individuals – flexibility for adaptation to different domains  Data and services described in semantically rich ways, allowing search/browse by both humans and machines – multiple interfaces for established protocols/standards15
  • 16. © Fraunhofer 16 ISCRAM 2013, Baden-Baden, Germany 6) Discovery of resources in a geo-distributed SoS (cont.) Discussion  Classical search-engines and catalogues – many out-of-the-box solutions, limited “semantic” search- capabilities,  Monolithic systems – optimized and good performance for some types of applications, many additional tools but dependency between components  One central semantic registry – no “synchronisation” of ontology core necessary, bottleneck if central registry not available 16
  • 17. © Fraunhofer 17 ISCRAM 2013, Baden-Baden, Germany 7) Coordination of work between geo-distributed systems  Design decision  Decision tables & authoring tool for end users – self-documenting rule-sets and intuitive easy-to-use interfaces for non IT-experts, many available tools; comfortable mapping of “models” (sets of variables and rules) to ontology elements not yet standardized/available,  Workflow engine(s) coordinating federated – message-based event processing for complex and rich choreographies, requirement-specific flexible adaptation of (standard) workflows  Discussion  Ontology reasoning – powerful and expressive reasoning und rule systems; require high-level of expertise, performance and (quality of) results depend on size/complexity of ontology, persistent storage of big ontologies still a problem. 17
  • 18. © Fraunhofer 18 ISCRAM 2013, Baden-Baden, Germany System-of-systems 1st site Broker Cluster 2nd site Broker Cluster 3rd site Broker Cluster
  • 19. © Fraunhofer 19 ISCRAM 2013, Baden-Baden, Germany Data Source(s) 1st site MOM Data Source(s) Data Source(s) Feeder Storage Historic DataCached Data Semantic Registry Workflow Service Data Source(s) Data Source(s) Processing Service R Receive realtime data R Get cached data and parameters; write results User Interface R Cache / store data Query R Steers R Receive notifications R Invoke & handle results RR Downstream Dissemination R R Register & request topic Generic TRIDEC Architecture
  • 20. © Fraunhofer 20 ISCRAM 2013, Baden-Baden, Germany Example Workflow for Tsunami Early Warning
  • 21. © Fraunhofer 21 ISCRAM 2013, Baden-Baden, Germany Data Source(s) 1st site MOM Data Source(s) Data Source(s) Feeder Storage Historic DataCached Data Semantic Registry Workflow Service Data Source(s) Data Source(s) Processing Service R Receive realtime data R Get cached data and parameters; write results User Interface R Cache / store data Query R Steers R Receive notifications R Invoke & handle results RR Downstream Dissemination R R Register & request topic Generic TRIDEC Architecture
  • 22. © Fraunhofer 22 ISCRAM 2013, Baden-Baden, Germany Does it work?  Tsunami  TRIDEC software deployed at Instituto Português do Mar e da Atmosfera (IPMA) in Lissabon and the Kandilli Observatory and Earthquake Research Institute (KOERI) in Istanbul for testing purposes  Two scenarios in the European-wide Tsunami exercise NEAMWave2012 were successfully validated  New functionalities “Centre- to-Centre communication” via software systems between Turkey and Portugal and eyewitness reports sent from mobile devices via apps were available for the first time.
  • 23. © Fraunhofer 23 ISCRAM 2013, Baden-Baden, Germany Conclusion and Future Work  Main problems when dealing with the architecture of a SoS presented  requires a scalable and resilient communication layer  large amounts of data need to be published and processed  requires a scalable storage concept  respect the geo-distributed nature  Future Work  improving the resilience and workload allocation of the MOM  improve the resilience of the Semantic Registry by providing a replication mechanism  research the federated access to Big Data
  • 24. © Fraunhofer 24 ISCRAM 2013, Baden-Baden, Germany Acknowledgements The presented work is done in collaboration with consortium of the TRIDEC project which is supported by the European Commission under the 7th Framework Programme (ICT-2009.4.3 Intelligent Information Management Project Reference: 258723)