Seahorse Motivations
● Bring Apache Spark capabilities closer to users who are not code-savvy
○ interactive Big Data analytics
○ faster building of Apache Spark applications
● Provide a collaboration platform for data scientists and software engineers
Seahorse v.1.3
Seahorse Timeline
● In development since mid-2015
● First release in February 2016
● 3 releases so far
● Shown at multiple conferences:
○ Strata (NYC, London)
○ Spark Summit (SF, Amsterdam)
○ Hadoop Summit (San Jose, Dublin)
○ AI World (SF)
○ ODSC (London)
● Included in platforms:
○ IBM’s Big Data University
○ Intel’s Trusted Analytics Platform
● Group of early adopters
Seahorse Roadmap Highlights
Seahorse 1.3 (released in September 2016):
● R support: build custom operations in R and use R notebooks within Seahorse
● Custom cluster connectivity: run Seahorse in Client Mode and connect to any cluster
● Spark 2.0
Seahorse 1.4 (January 2017):
● Scala SDK: create custom operations in Scala and use them in multiple workflows from the operation palette
● Workflow scheduler
● Email reports: generate custom visual reports using Notebooks and get results sent via email
How to Get Seahorse
1. Go to deepsense.io -> Seahorse -> Standalone
2. Fill out a short form and choose between:
a. Docker (set of containers with Docker Compose)
b. Vagrant (virtual machine image)
3. See examples and docs at seahorse.deepsense.io
4. Post feedback to our board or directly to michal@deepsense.io
Visually building Spark applications with the Seahorse tool
