SUSE for Hadoop & Big Data 
Stephen Mogg 
SUSE UK 
October 2014
2 
About SUSE 
• Established 1992 
• Original Provider of Enterprise Linux 
About Me 
• SUSE Employee 4 years 
• Systems Engineer
3 
If you want to know more about SUSE 
• New Certifications 
• New Resources 
• New Lab
SUSE for Hadoop
5 
Big Data Reference Architecture 
Operating System OS / Cloud Platform 
Source: Hortonworks Modern Data Architecture - http://hortonworks.com/partner/suse/
6 
SUSE Big Data Reference Architecture 
Source: Hortonworks Modern Data Architecture - http://hortonworks.com/partner/suse/
7 
SUSE Big Data Partners 
Hadoop Data Systems 
Applications Services
8 
Certified for Leading Hadoop Platforms 
Additional level of testing and quality assurance to make sure SUSE Linux Enterprise Server integrates with partner software, saving our customers time while providing them with an assurance of interoperability.
We hereby declare that SUSE Linux Enterprise Server is officially certified for:
• Cloudera CDH 5
• Hortonworks HDP2
Market Leadership
10 
SUSE in High Performance 
“Teradata's extensive financial, technical, and management resources can create a unique, high-performance Hadoop appliance that few other vendors can match.” – Forrester, Feb 2014
High Performance Computing
‒ Half of the world's largest supercomputer clusters run SUSE Linux Enterprise Server
Mainframe Computing
‒ Over 80% of all Linux running on mainframe computers is SUSE Linux
SAP HANA
‒ SUSE Linux Enterprise Server is the recommended OS for the market-leading analytics appliance, SAP HANA
Teradata
‒ SUSE Linux Enterprise Server is the OS foundation for Hadoop in the Aster Big Analytics Appliance
IBM Watson
‒ The Power-based artificial intelligence computer runs SUSE Linux and Hadoop
11 
What Makes an Optimal Foundation 
for Hadoop? 
SLAs and 
Business Continuity 
Resource Utilization 
and Efficiency 
Security and 
Compliance 
Affordable, No 
Vendor Lock-in
12 
Power, Scalability 
SUSE Linux Enterprise Server
A rock-solid, certified foundation for deploying Hadoop clusters.
Reliability, Availability, Serviceability:
• Swap-over NFS
• Built-in open source multi-path IO
• CPU/Memory hot-plugging
Horizontal/Vertical Scalability:
• Large capacity and faster system interconnect (OFED, InfiniBand)
Huge Data, Massive Compute:
• 4096 logical CPUs
• 64 TiB RAM
Supports latest Intel CPUs:
• Ivy Bridge v2
• Haswell
13 
Flexibility, Agility 
SUSE Cloud
Hadoop in the Cloud: OpenStack-based, enterprise-ready IaaS Cloud Platform.
• Massively scalable private cloud implementations
• Deploy pre-configured Hadoop clusters on KVM, Xen, Hyper-V, ESXi
• Spin up a fully configured and optimized Hadoop cluster in minutes for dev/test
• Scale out Hadoop cluster infrastructure easily
• API for cloud-aware applications
14 
Improve Resource Utilization and Efficiency 
SUSE Manager
A perfect complement to the monitoring and management capabilities provided in the Hadoop cluster management software.
• Batch command speeds up cluster implementation
• Centralized server infrastructure management
• Software and patch management for Linux and Hadoop
• Batch-deploy config files to entire Hadoop cluster
• Asset management and reporting
• Application and infrastructure monitoring
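As a rough illustration of what batch-deploying one config file to every node involves, the sketch below renders a Hadoop-style `*-site.xml` file once and pairs it with each host. It does not use SUSE Manager itself; the host names, target path, and port are hypothetical placeholders, and `plan_batch_deploy` only builds a plan without copying anything.

```python
import xml.etree.ElementTree as ET


def render_hadoop_config(properties):
    """Render a dict of settings as a Hadoop-style *-site.xml document."""
    root = ET.Element("configuration")
    for name, value in sorted(properties.items()):
        prop = ET.SubElement(root, "property")
        ET.SubElement(prop, "name").text = name
        ET.SubElement(prop, "value").text = str(value)
    return ET.tostring(root, encoding="unicode")


def plan_batch_deploy(hosts, path, properties):
    """Return one (host, path, content) tuple per node; nothing is copied here."""
    content = render_hadoop_config(properties)
    return [(host, path, content) for host in hosts]


# Hypothetical cluster: host names, path, and port are placeholders.
hosts = ["node01.example.com", "node02.example.com", "node03.example.com"]
plan = plan_batch_deploy(
    hosts,
    "/etc/hadoop/conf/core-site.xml",
    {"fs.defaultFS": "hdfs://node01.example.com:8020"},
)
for host, path, content in plan:
    print(host, path, len(content))
```

Rendering the file once and reusing it for every host is what keeps the nodes identical; a management tool would take over the actual distribution step.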
15 
Security and Certifications 
90% of companies cite data access and data protection as either extremely or very important security capabilities. - IDG Big Data Survey 2014
Security Features in SUSE Linux Enterprise Server
• System Hardening: YaST2 Security Center
• Application Confinement: AppArmor
• System Confinement: SE Linux (stack support)
• Intrusion Detection (file system): AIDE
• Fine-grained Access Rights: File system POSIX capabilities
• Encryption Capabilities: Three ways: Full disk, Volume, Filesystem (eCryptFS)
Certifications
• Carrier Grade Linux (CGL) 4.0; IPv6 (refresh)
• Measure and Monitor System Integrity During Reboot: Trusted Platform Modules (TPM), Trusted Computing
• System Requirements for Cryptographic Modules: FIPS 140-2 Validation for OpenSSL
• Common Criteria for IT Security Evaluation: Common Criteria Certification for SP2 (x86_64 with KVM; IBM System z)
16 
Summary: Key Features and Benefits 
Key Features and their Benefits, by area
Reliability, Availability, Serviceability, Scalability
• Swap over NFS: cut cost with less expensive diskless servers
• Kernel 3.0: enhanced RAS capabilities
• Intel Ivy Bridge v2 and Haswell support; 4096 logical CPU, 64 TiB RAM support: harnesses the latest CPU technologies and provides excellent vertical scalability
• InfiniBand, iSCSI Target (LIO) and OFED: faster connectivity with networking and storage equipment
Cross-platform Virtualization
• Dual hypervisor support (Xen and KVM); optimized for vSphere, Hyper-V, and open source hypervisors: maximum choice both as a host and as a guest
• Linux Containers: lightweight OS-level virtualization
Security and Compliance
• UEFI Secure Boot: less risk of malicious attack during boot
• FIPS 140-2 Validation and Common Criteria Certification: security standard compliance
• AppArmor: protects from external/internal threats and zero-day attacks
Integrated System Management
• Snapper and Btrfs: snapshot and rollback for easy management
• YaST, AutoYaST and Zypp: integrated single-system management and fast update tools
Interop with Other Platforms
• Samba 3.6: compatible with Windows
• IPv6 compliance: networking with IPv6 equipment
SUSE Big Data Resources
18 
Hadoop on SLES 
Best Practices White Paper: 
• Deployment scenarios 
• Proposed Architecture using SLES 
• Infrastructure considerations 
• Basic optimization of the Linux OS 
• Installation and configuration of Hadoop on SLES
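To give a feel for what a basic OS optimization check might look like, here is a minimal sketch that compares kernel settings in `sysctl` output format against a set of target values. The target values in `DESIRED` are illustrative assumptions only and are not taken from the white paper; the sample text stands in for real `sysctl -a` output.

```python
def parse_sysctl(text):
    """Parse 'key = value' lines (sysctl output style) into a dict of strings."""
    settings = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        settings[key.strip()] = value.strip()
    return settings


def find_deviations(current, desired):
    """Return {key: (current_value, desired_value)} for every mismatch."""
    return {
        key: (current.get(key), want)
        for key, want in desired.items()
        if current.get(key) != want
    }


# Illustrative targets only (assumption); consult the vendor guidance for real values.
DESIRED = {"vm.swappiness": "1", "net.core.somaxconn": "1024"}

# Sample text standing in for the output of `sysctl -a` on a cluster node.
sample = """
# kernel settings
vm.swappiness = 60
net.core.somaxconn = 1024
"""

deviations = find_deviations(parse_sysctl(sample), DESIRED)
print(deviations)
```

Running the same comparison on every node is one way to confirm that tuning has been applied consistently before Hadoop is installed.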
19 
SUSE Manager and Hadoop 
Step-by-step guide for using SUSE Manager to deploy Cloudera on SLES:
• Automate OS provisioning
• Deploy new servers with identical characteristics
• Auto-deployment of RPM-based applications
• Centralize management of configuration files
• Connect to SUSE Customer Center for updates
• Create / manage multiple organizations from a single remote console
• Create customized repositories
• Maintain the security of enterprise systems
• Leverage the SUSE Manager API to create custom scripts to manage tasks or integrate third-party applications and management tools
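A custom script against the SUSE Manager API could look roughly like the sketch below. It assumes the XML-RPC interface that SUSE Manager inherits from Spacewalk, with the `auth.login`, `system.listSystems`, and `auth.logout` calls; the server URL and credentials are hypothetical placeholders, and the exact API surface should be checked against the documentation for the installed version.

```python
import xmlrpc.client


def connect(url):
    """Create an XML-RPC proxy; no network traffic happens until a call is made."""
    return xmlrpc.client.ServerProxy(url)


def list_registered_systems(client, user, password):
    """Log in, return the names of systems visible to the user, and always log out."""
    key = client.auth.login(user, password)
    try:
        return [system["name"] for system in client.system.listSystems(key)]
    finally:
        client.auth.logout(key)


# Usage (placeholder URL and credentials, shown as comments so nothing is contacted):
# client = connect("https://manager.example.com/rpc/api")
# for name in list_registered_systems(client, "admin", "secret"):
#     print(name)
```

Passing the client in as a parameter keeps the function easy to exercise with a stand-in object, and the `finally` clause ensures the session key is released even if the listing call fails.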
20 
Hadoop / HP Reference Architecture 
HP Reference Architecture:
• Written by SUSE, HP & Hortonworks
• Proposed Architecture using SLES 
• HP Recommends SLES
21 
SUSE Big Data Lab 
Big Data Cluster in USA for: 
• Benchmarking 
• Software certification 
• Integration / test 
• Reference architectures
22
SUSE Linux Expert Days
Learn About:
• SUSE and Big Data
• Towards Zero Downtime with SUSE Technology
• SUSE Linux Enterprise Server
Register:
https://www.suse.com/events/slef-2014/#Liste
23 
Learn More 
Visit our web site 
www.suse.com/solutions/platform.html#big_data 
Read our whitepapers 
Deploying Hadoop on SLES 
Deploy and Manage Hadoop with SUSE Manager 
HP Reference Architecture. 
Contact us 
bigdata@suse.com
24
Unpublished Work of SUSE LLC. All Rights Reserved. 
This work is an unpublished work and contains confidential, proprietary and trade secret information of SUSE LLC. 
Access to this work is restricted to SUSE employees who have a need to know to perform tasks within the scope of 
their assignments. No part of this work may be practiced, performed, copied, distributed, revised, modified, translated, 
abridged, condensed, expanded, collected, or adapted without the prior written consent of SUSE. 
Any use or exploitation of this work without authorization could subject the perpetrator to criminal and civil liability. 
General Disclaimer 
This document is not to be construed as a promise by any participating company to develop, deliver, or market a 
product. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making 
purchasing decisions. SUSE makes no representations or warranties with respect to the contents of this document, 
and specifically disclaims any express or implied warranties of merchantability or fitness for any particular purpose. The 
development, release, and timing of features or functionality described for SUSE products remains at the sole 
discretion of SUSE. Further, SUSE reserves the right to revise this document and to make changes to its content, at 
any time, without obligation to notify any person or entity of such revisions or changes. All SUSE marks referenced in 
this presentation are trademarks or registered trademarks of Novell, Inc. in the United States and other countries. All 
third-party trademarks are the property of their respective owners.
