Big Data Europe: Simplifying Development and Deployment of Big Data Applications
Ivan Ermilov @ Meet-up at MSD IT: Bioinformatics
Outline
● About BigDataEurope Project
● Architecture
● BDI Stacks and BDI Stacks Lifecycle
● Supporting BDI Components
BigDataEurope: BDE Project
● Horizon2020 project
● 17 partners
● 7 pilots in various domains
● > 30 Big Data components
● > 250 stars on GitHub
Platform Architecture
● Support Layer: Init Daemon, GUIs, Monitor
● App Layer: Traffic Forecast, Satellite Image Analysis, ...
● Platform Layer: Spark, Flink, Semantic Layer (Ontario, SANSA, Semagrow), Kafka, Real-time Stream Monitoring, ...
● Resource Management Layer (Swarm)
● Hardware Layer: Premises, Cloud (AWS, GCE, MS Azure, …)
● Data Layer: Hadoop, NoSQL Store, Cassandra, Elasticsearch, RDF Store, ...
BDI Stack Lifecycle
● App templates: ready-made components, best practices
● BDI Stack Builder
● BDI Support Components, Instructions
● Swarm UI, docker-compose
● Pipeline/Logging Monitor
BDI Stack Assembly
Stack Builder
Adding Custom Apps
FROM bde2020/spark-submit:2.1.0-hadoop2.7
ENV ENABLE_INIT_DAEMON=false
ENV SPARK_APPLICATION_PYTHON_LOCATION=
ENV SPARK_MASTER_NAME=sc6-spark-master
ENV SPARK_APPLICATION_ARGS=
ENV SPARK_MASTER_URL=spark://sc6-spark-master:7077
ENV SPARK_MASTER_PORT=7077
...
WorkFlow Builder
Logging Monitor
● Network logging for HTTP
○ Capture network interface as PCAP
○ Convert to HAR (json)
○ Expand HAR
○ Dump into ELK stack
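The last two steps of the list above can be sketched in Python. This is only an illustration under assumptions: the slides do not say what "Expand HAR" does exactly, so here it is taken to mean flattening each HAR entry into one document, and the index name `har-logs`, the helper names, and the sample data are all hypothetical. The output follows the newline-delimited format accepted by the Elasticsearch bulk API.

```python
import json


def expand_har(har):
    """Yield one flat document per HAR entry (a request/response pair)."""
    for entry in har["log"]["entries"]:
        request, response = entry["request"], entry["response"]
        yield {
            "timestamp": entry["startedDateTime"],
            "duration_ms": entry["time"],
            "method": request["method"],
            "url": request["url"],
            "status": response["status"],
            # HAR stores headers as a list of {"name": ..., "value": ...} objects.
            "request_headers": {
                h["name"].lower(): h["value"] for h in request.get("headers", [])
            },
        }


def to_bulk_ndjson(docs, index="har-logs"):
    """Format documents as an Elasticsearch bulk body (index name is assumed)."""
    lines = []
    for doc in docs:
        lines.append(json.dumps({"index": {"_index": index}}))  # action line
        lines.append(json.dumps(doc))                           # document line
    return "".join(line + "\n" for line in lines)


# Hypothetical sample capture with a single HTTP exchange.
SAMPLE_HAR = {
    "log": {
        "version": "1.2",
        "entries": [
            {
                "startedDateTime": "2018-01-01T12:00:00.000Z",
                "time": 42,
                "request": {
                    "method": "GET",
                    "url": "http://namenode:50070/",
                    "headers": [{"name": "Host", "value": "namenode:50070"}],
                },
                "response": {"status": 200},
            }
        ],
    }
}

docs = list(expand_har(SAMPLE_HAR))
bulk_body = to_bulk_ndjson(docs)
print(bulk_body, end="")
```

The resulting body could then be posted to the `_bulk` endpoint of the Elasticsearch instance in the ELK stack.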
UI Integrator Application
https://www.big-data-europe.eu/user-interface-integration-in-bdi-platform-integrator-ui-application/
UI Integrator
Reverse Proxy/CSSWrapper
● Simple injection of custom CSS
strabon:
  image: bde2020/strabon
  links:
    - csswrapper
  expose:
    - "8080"
  environment:
    VIRTUAL_HOST: "strabon.big-data-europe.aksw.org"
    VIRTUAL_PORT: "8080"
    CSS_SOURCE: "strabon"
https://www.big-data-europe.eu/using-reverse-proxy-inside-bde-platform-jwildernginx-setup-for-docker-swarm/
https://github.com/big-data-europe/demo-integrator-ui
Swarm UI
BDI Stack Assembly
Example BDI Stack: Halyard
● Which BD components does Halyard use?
○ HDFS
○ YARN (for MapReduce jobs)
○ HBase
● Which interfaces are supported?
○ Shell scripts (bulkload, export, etc.)
○ RDF4J console
○ RDF4J REST Server + Workbench
Halyard: BDI Stack
● Hadoop
○ DFS
○ YARN
● HBase
● Zookeeper
Halyard: BDI Stack (yaml)
namenode:
  image: bde2020/hadoop-namenode:1.2.0-hadoop2.8-java8
  container_name: namenode
  networks:
    - hbase
  volumes:
    - ./data/hadoop/namenode:/hadoop/dfs/name
  environment:
    - CLUSTER_NAME=test
  ports:
    - "50070:50070"
  env_file:
    - ./hadoop.env
Halyard: BDI Stack: Running
Simply execute the command:
docker-compose up -d
Adding Halyard to BDI Stack
FROM bde2020/hadoop-base:1.2.0-hadoop2.8-java8 as hadoop-base
FROM bde2020/hbase-base:1.0.0-hbase1.2.6 as hbase-base
FROM openjdk:8
MAINTAINER Ivan Ermilov <ivan.s.ermilov@gmail.com>
ENV HADOOP_VERSION=2.8.0
COPY --from=hadoop-base /opt/hadoop-$HADOOP_VERSION /opt/hadoop-$HADOOP_VERSION
RUN ln -s /opt/hadoop-$HADOOP_VERSION/etc/hadoop /etc/hadoop
ENV PATH /opt/hadoop-$HADOOP_VERSION/bin:$PATH
ENV HBASE_VERSION=1.2.6
COPY --from=hbase-base /opt/hbase-$HBASE_VERSION /opt/hbase-$HBASE_VERSION
RUN ln -s /opt/hbase-$HBASE_VERSION/conf /etc/hbase
ENV PATH /opt/hbase-$HBASE_VERSION/bin:$PATH
ENV HALYARD_VERSION 1.2
…
Running Halyard SDK
$ docker run -it --rm --network hbase \
    --env-file ./hbase.env \
    bde2020/halyard-sdk:1.0.0-halyard1.2 \
    /bin/bash
$ ./console
Running Halyard SDK
> create hbase
> load http://danbri.org/foaf.rdf
Running Halyard SDK
> sparql
select ?s ?p ?o where { ?s ?p ?o }
.
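The same select-all query could also be sent to the RDF4J REST server listed among Halyard's interfaces, rather than typed into the console. The Python sketch below only builds the HTTP request. The host, port, and repository ID are placeholders that do not come from the slides, and the `limit 10` clause is added here to keep the result small.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Assumed values: adjust to the actual RDF4J server location and repository ID.
RDF4J_SERVER = "http://localhost:8080/rdf4j-server"
REPOSITORY_ID = "halyard-demo"


def build_sparql_request(query, server=RDF4J_SERVER, repository=REPOSITORY_ID):
    """Build a GET request against the repository's SPARQL query endpoint."""
    url = f"{server}/repositories/{repository}?{urlencode({'query': query})}"
    # Ask for results in the SPARQL 1.1 JSON results format.
    return Request(url, headers={"Accept": "application/sparql-results+json"})


request = build_sparql_request("select ?s ?p ?o where { ?s ?p ?o } limit 10")
print(request.full_url)

# To run the query against a live server:
#   import json
#   from urllib.request import urlopen
#   with urlopen(request) as response:
#       bindings = json.load(response)["results"]["bindings"]
```

Keeping request construction separate from execution lets the URL be inspected without a running stack.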
Halyard: BDI Stack (complete)
Thank you
Questions?
GitHub: https://github.com/earthquakesan
@AKSW: http://aksw.org/IvanErmilov.html
Email: iermilov@informatik.uni-leipzig.de
Twitter: @earthquakesan
LinkedIn: https://www.linkedin.com/in/iermilov/
