SlideShare a Scribd company logo
SAP Lambda Architecture Point Of View
SAP Digital Enterprise Platform
Doug Martino
January 2016 External
Inside cover
SAP External | 3
Introduction
New Architectures for Big Data
Whether MVC for user interfaces, or Spine and Leaf for data centers, new
architecture patterns in our industry act as sort of historical markers of the
effectiveness and acceptance of new technologies. Practical techniques push the
bounds resulting in a shift. Application of distributed storage and streaming
capabilities such as Kafka and of course Hadoop are shifting Big Data architectures
from a layer cake concept, or North/South oriented approach to one which can be
thought of as an East/West architectural concept. Recently popular is Lambda
Architecture, this article presents an SAP HANA based rendering of the Lambda
Architecture.
Lambda Architecture
Before discussing how SAP products can be used in a Lambda fashion, a brief
overview of Lambda is in order. Attribution must go to Nathan Marz for his work.
Typically consisting of three components, a batch layer, a speed layer, and a
serving layer, Lambda rethinks how data flows through an analytics system. Data
flows left to right in a multi speed architecture; with all data stored in the batch layer
and the speed layer simultaneously, while the serving layer is used to combine and
analyze that data as required. Advantages are fault tolerance, scalability, while at
the same time offering multiple latencies to supporting a a variety of performance
requirements.
With Lambda some business questions are answered using the speed layer, others
answered by the batch layer, and still others answered by combing both. As
algorithms in the speed layer decide which data to keep, answer immediate
questions such as which ad to display or how much discount to offer, it also stores
an immutable copy of the data into the batch layer. In addition to acting as a replay
source, data in the batch layer is suited for use cases such as long tail analytics,
machine learning, or propensity to ‘act’ types of endeavors. Fault tolerance,
resilience, ultimate scale, and back testing are derived from having the immutable
copy of the original data for replay through the downstream components. The
serving layer may persist a copy of the data to be used for additional downstream
use of the data such as dashboards or feeds to another system. Shown here are
several open source options for the components which make up a Lambda
Architecture.
SAP External | 4
Traditional Lambda Architecture
Lambda with SAP HANA Platform
At SAP we are rapidly delivering value with our customers by employing these types of architecture patterns.
In 2012 we started moving the HANA product towards the SAP Real-time data architecture where one already
finds the notions of Lambda. Smart Data Streaming is the obvious choice for the real-time layer. Born on Wall
Street, this component is capable of ingesting millions of rows per second into HANA. Complete with a rich set
of API’s, stream and window compute constructs, and scale out capability, it can send the right data into
HANA and all the data into HDFS as necessary.
As the leading in-memory database platform, HANA meets the needs of the serving component of Lambda
rather well. The data can be stored once, then transformed, aggregated, and calculated dynamically. Easy to
understand in-memory compute engines such as graph, spatial, OLAP, and predictive analytics are available
to manipulate the data where it is stored. Because the immutable data is kept in an append only columnar
SQL database and the serving is done on the fly, our customer’s experience a valuable and unique
combination of expressive, declarative programming capability on data which is managed in a SQL
framework. This allows data access via everyday BI tools via standard ODBC or JDBC SQL as well as ODBO
MDX. Based on Node.js, the HANA app server XS Advanced provides amongst other things noSQL access to
the data via restful ODATA.
SAP External | 5
Often times requirements change such that new algorithms or new derivations of the immutable data are
required. Because of the dynamic nature of materialization employed by the HANA core architecture, these
changes can be made almost instantly. This flexibility changes the logistics of replay, allowing for rapid
experimentation and moves into a production environment. In the case that a schema needs to be extended
HANA does have schema flexibility whereby columns may be added to an existing table structure, leaving any
existing code intact. The amount of flexibility offered by HANA once the immutable data has been stored is a
fundamental differentiator of HANA. This flexibility offers reduced time to value and increased data agility.
While the HANA platform is working extremely well for real-time applications, Lambda is complete by
incorporating Vora. From an architecture standpoint, there are several aspects to add to the discussion. First
is the aspect of scalability.
SAP External | 6
One of the beauties of the lambda is scaling each component dynamically and independently as latency
requirements shift to meet changing business requirements. This notion of independent scaling allows
appropriate resources allocation in a fit-for purpose model. Vora’s use of Hadoop moves the scale of the
overall system up to multi petabyte range. More importantly, addressing long tail analytics and other big data
compute problems are now possible.
This leads to the next concept, which is one of throughput and algorithmic capability. Each of the real-
time/batch/serving layers are best suited for a particular algorithm at a given throughout. All of them can run
‘out of band’ code; the question is at which level of concurrency and complexity. Rather than have JAVA JVM
as the foundation of each, SAP have the appropriate engines with well-documented and supported extension
capabilities. One may argue that open source provides the ultimate extension capability but only as a
consuming organization is willing to merge code. In this case, SAP is providing high performance, fit for
purpose engines to achieve the desired results more efficiently.
This leaves us with the replay topic, an oft forgotten concept, especially in an operational batch context.
Usually left for that – operations, it is often the case that re-running loads into a system to catch up with the
real world, or to modify results based on new algorithms cause over provisioning of hardware in today's ETL
and serving layers. By accounting for this capability up front, the opportunity to make transforms happen at the
right time reduce the time and energy required to keep things in phase. HANA is a fantastic example of this
given its powerful late materialization capability. Eventually elastic capabilities will further reduce replay
workload stress by optimal allocation of compute resources with finer grain and lower latency. By employing
lambda and other emerging big data architecture patterns, our customers remain well positioned as the state
of the art continues forward.
www.sap.com/contactsap
© 2016 SAP SE or an SAP affiliate company.
All rights reserved.
No part of this publication may be reproduced or transmitted in any
form or for any purpose without the express permission of SAP SE
or an SAP affiliate company.
SAP and other SAP products and services mentioned herein as well
as their respective logos are trademarks or registered trademarks of
SAP SE (or an SAP affiliate company) in Germany and other
countries. Please see http://www.sap.com/corporate-
en/legal/copyright/index.epx#trademark for additional trademark
information and notices. Some software products marketed by SAP
SE and its distributors contain proprietary software components of
other software vendors.
National product specifications may vary.
These materials are provided by SAP SE or an SAP affiliate
company for informational purposes only, without representation or
warranty of any kind, and SAP SE or its affiliated companies shall
not be liable for errors or omissions with respect to the materials.
The only warranties for SAP SE or SAP affiliate company products
and services are those that are set forth in the express warranty
statements accompanying such products and services, if any.
Nothing herein should be construed as constituting an additional
warranty.
In particular, SAP SE or its affiliated companies have no obligation to
pursue any course of business outlined in this document or any
related presentation, or to develop or release any functionality
mentioned therein. This document, or any related presentation, and
SAP SE’s or its affiliated companies’ strategy and possible future
developments, products, and/or platform directions and functionality
are all subject to change and may be changed by SAP SE or its
affiliated companies at any time for any reason without notice. The
information in this document is not a commitment, promise, or legal
obligation to deliver any material, code, or functionality. All forward-
looking statements are subject to various risks and uncertainties that
could cause actual results to differ materially from expectations.
Readers are cautioned not to place undue reliance on these forward-
looking statements, which speak only as of their dates, and they
should not be relied upon in making purchasing decisions.

More Related Content

What's hot (18)

PDF
SplunkSummit 2015 - Real World Big Data Architecture
Splunk
 
PDF
Intelligent Integration OOW2017 - Jeff Pollock
Jeffrey T. Pollock
 
PPTX
Operational Analytics Using Spark and NoSQL Data Stores
DATAVERSITY
 
PPTX
In-Memory Database Platform for Big Data
SAP Technology
 
PDF
Flexpod with SAP HANA and SAP Applications
Lishantian
 
PDF
Data Engineer, Patterns & Architecture The future: Deep-dive into Microservic...
Igor De Souza
 
PPTX
Hadoop Powers Modern Enterprise Data Architectures
DataWorks Summit
 
PDF
IlOUG Tech Days 2016 - Unlock the Value in your Data Reservoir using Oracle B...
Mark Rittman
 
PPTX
Next Generation Enterprise Architecture
MapR Technologies
 
PDF
Artur Fejklowicz - “Data Lake architecture” AI&BigDataDay 2017
Lviv Startup Club
 
PDF
Designing the Next Generation Data Lake
Robert Chong
 
PDF
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
Rittman Analytics
 
PDF
Big Data, Big Thinking: Simplified Architecture Webinar Fact Sheet
SAP Technology
 
PDF
Tapping into the Big Data Reservoir (CON7934)
Jeffrey T. Pollock
 
PPTX
Big Data and BI Tools - BI Reporting for Bay Area Startups User Group
Scott Mitchell
 
PDF
The Impact of SAP Hana on the SAP Infrastructure Utility Services Marketplace
Lisa Milani, MBA
 
PDF
Webinar: Data Modeling and Shortcuts to Success in Scaling Time Series Applic...
DATAVERSITY
 
PPTX
Solving Performance Problems on Hadoop
Tyler Mitchell
 
SplunkSummit 2015 - Real World Big Data Architecture
Splunk
 
Intelligent Integration OOW2017 - Jeff Pollock
Jeffrey T. Pollock
 
Operational Analytics Using Spark and NoSQL Data Stores
DATAVERSITY
 
In-Memory Database Platform for Big Data
SAP Technology
 
Flexpod with SAP HANA and SAP Applications
Lishantian
 
Data Engineer, Patterns & Architecture The future: Deep-dive into Microservic...
Igor De Souza
 
Hadoop Powers Modern Enterprise Data Architectures
DataWorks Summit
 
IlOUG Tech Days 2016 - Unlock the Value in your Data Reservoir using Oracle B...
Mark Rittman
 
Next Generation Enterprise Architecture
MapR Technologies
 
Artur Fejklowicz - “Data Lake architecture” AI&BigDataDay 2017
Lviv Startup Club
 
Designing the Next Generation Data Lake
Robert Chong
 
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
Rittman Analytics
 
Big Data, Big Thinking: Simplified Architecture Webinar Fact Sheet
SAP Technology
 
Tapping into the Big Data Reservoir (CON7934)
Jeffrey T. Pollock
 
Big Data and BI Tools - BI Reporting for Bay Area Startups User Group
Scott Mitchell
 
The Impact of SAP Hana on the SAP Infrastructure Utility Services Marketplace
Lisa Milani, MBA
 
Webinar: Data Modeling and Shortcuts to Success in Scaling Time Series Applic...
DATAVERSITY
 
Solving Performance Problems on Hadoop
Tyler Mitchell
 

Similar to SAP Lambda Architecture Point of View (20)

PDF
209 hana-defining-capability-whitepaper
bbenthach
 
PDF
What is Sap HANA Convista Consulting Asia.pdf
ankeetkumar4
 
PDF
What Is SAP HANA And Its Benefits?
ManojAgrawal74
 
PPTX
Analytics Products L2 public 2020-23 Black.pptx
BurakAyan6
 
DOCX
SAP HANA
Saravanan Manoharan
 
DOCX
Strategies To Overcome SAP S/4HANA Data Migration Challenges
Dipak Pimpale
 
PDF
SAP HANA SQL Data Warehousing (Sefan Linders)
Twan van den Broek
 
DOCX
Integration of SAP HANA with Hadoop
Ramkumar Rajendran
 
PDF
S4F01_EN_Col17 Financial Accounting in SAP S4HANA for SAP ERP FI Professional...
lakshmi vara
 
PDF
What's Planned for SAP HANA SPS10
SAP Technology
 
PDF
Empowering SAP HANA Customers and Use Cases
thinkASG
 
PDF
SAP HANA SPS09 - HANA IM Services
SAP Technology
 
PDF
SAP_SLT_Guide_21122015.pdf
ssuser17886a
 
PDF
SAP S/4HANA cloud editions or On Prem? Demystifying the options and cost bene...
IBM
 
PDF
Pol03262 usen
Kaizenlogcom
 
PPTX
Sap hana
saisree92
 
DOCX
Sap deployment
Asha Panda
 
PDF
SUSE Technical Webinar: Build HANA Apps in the Framework of the SAP and SUSE ...
SAP PartnerEdge program for Application Development
 
PDF
1310 success stories_and_lessons_learned_implementing_sap_hana_solutions
Bobby Shah
 
PDF
Cool features 7.4
Mahesh Someshetty
 
209 hana-defining-capability-whitepaper
bbenthach
 
What is Sap HANA Convista Consulting Asia.pdf
ankeetkumar4
 
What Is SAP HANA And Its Benefits?
ManojAgrawal74
 
Analytics Products L2 public 2020-23 Black.pptx
BurakAyan6
 
Strategies To Overcome SAP S/4HANA Data Migration Challenges
Dipak Pimpale
 
SAP HANA SQL Data Warehousing (Sefan Linders)
Twan van den Broek
 
Integration of SAP HANA with Hadoop
Ramkumar Rajendran
 
S4F01_EN_Col17 Financial Accounting in SAP S4HANA for SAP ERP FI Professional...
lakshmi vara
 
What's Planned for SAP HANA SPS10
SAP Technology
 
Empowering SAP HANA Customers and Use Cases
thinkASG
 
SAP HANA SPS09 - HANA IM Services
SAP Technology
 
SAP_SLT_Guide_21122015.pdf
ssuser17886a
 
SAP S/4HANA cloud editions or On Prem? Demystifying the options and cost bene...
IBM
 
Pol03262 usen
Kaizenlogcom
 
Sap hana
saisree92
 
Sap deployment
Asha Panda
 
SUSE Technical Webinar: Build HANA Apps in the Framework of the SAP and SUSE ...
SAP PartnerEdge program for Application Development
 
1310 success stories_and_lessons_learned_implementing_sap_hana_solutions
Bobby Shah
 
Cool features 7.4
Mahesh Someshetty
 
Ad

Recently uploaded (20)

PPTX
Engineering the Java Web Application (MVC)
abhishekoza1981
 
PPTX
3uTools Full Crack Free Version Download [Latest] 2025
muhammadgurbazkhan
 
PDF
Alexander Marshalov - How to use AI Assistants with your Monitoring system Q2...
VictoriaMetrics
 
PPTX
How Apagen Empowered an EPC Company with Engineering ERP Software
SatishKumar2651
 
PDF
Automate Cybersecurity Tasks with Python
VICTOR MAESTRE RAMIREZ
 
PDF
Understanding the Need for Systemic Change in Open Source Through Intersectio...
Imma Valls Bernaus
 
PDF
Linux Certificate of Completion - LabEx Certificate
VICTOR MAESTRE RAMIREZ
 
PPTX
Platform for Enterprise Solution - Java EE5
abhishekoza1981
 
PDF
Capcut Pro Crack For PC Latest Version {Fully Unlocked} 2025
hashhshs786
 
PDF
Build It, Buy It, or Already Got It? Make Smarter Martech Decisions
bbedford2
 
PDF
유니티에서 Burst Compiler+ThreadedJobs+SIMD 적용사례
Seongdae Kim
 
PDF
Revenue streams of the Wazirx clone script.pdf
aaronjeffray
 
PPTX
MailsDaddy Outlook OST to PST converter.pptx
abhishekdutt366
 
PDF
vMix Pro 28.0.0.42 Download vMix Registration key Bundle
kulindacore
 
PDF
Salesforce CRM Services.VALiNTRY360
VALiNTRY360
 
PDF
Thread In Android-Mastering Concurrency for Responsive Apps.pdf
Nabin Dhakal
 
PDF
Mobile CMMS Solutions Empowering the Frontline Workforce
CryotosCMMSSoftware
 
PPTX
Feb 2021 Cohesity first pitch presentation.pptx
enginsayin1
 
PDF
MiniTool Partition Wizard 12.8 Crack License Key LATEST
hashhshs786
 
PPTX
Equipment Management Software BIS Safety UK.pptx
BIS Safety Software
 
Engineering the Java Web Application (MVC)
abhishekoza1981
 
3uTools Full Crack Free Version Download [Latest] 2025
muhammadgurbazkhan
 
Alexander Marshalov - How to use AI Assistants with your Monitoring system Q2...
VictoriaMetrics
 
How Apagen Empowered an EPC Company with Engineering ERP Software
SatishKumar2651
 
Automate Cybersecurity Tasks with Python
VICTOR MAESTRE RAMIREZ
 
Understanding the Need for Systemic Change in Open Source Through Intersectio...
Imma Valls Bernaus
 
Linux Certificate of Completion - LabEx Certificate
VICTOR MAESTRE RAMIREZ
 
Platform for Enterprise Solution - Java EE5
abhishekoza1981
 
Capcut Pro Crack For PC Latest Version {Fully Unlocked} 2025
hashhshs786
 
Build It, Buy It, or Already Got It? Make Smarter Martech Decisions
bbedford2
 
유니티에서 Burst Compiler+ThreadedJobs+SIMD 적용사례
Seongdae Kim
 
Revenue streams of the Wazirx clone script.pdf
aaronjeffray
 
MailsDaddy Outlook OST to PST converter.pptx
abhishekdutt366
 
vMix Pro 28.0.0.42 Download vMix Registration key Bundle
kulindacore
 
Salesforce CRM Services.VALiNTRY360
VALiNTRY360
 
Thread In Android-Mastering Concurrency for Responsive Apps.pdf
Nabin Dhakal
 
Mobile CMMS Solutions Empowering the Frontline Workforce
CryotosCMMSSoftware
 
Feb 2021 Cohesity first pitch presentation.pptx
enginsayin1
 
MiniTool Partition Wizard 12.8 Crack License Key LATEST
hashhshs786
 
Equipment Management Software BIS Safety UK.pptx
BIS Safety Software
 
Ad

SAP Lambda Architecture Point of View

  • 1. SAP Lambda Architecture Point Of View SAP Digital Enterprise Platform Doug Martino January 2016 External
  • 3. SAP External | 3 Introduction New Architectures for Big Data Whether MVC for user interfaces, or Spine and Leaf for data centers, new architecture patterns in our industry act as sort of historical markers of the effectiveness and acceptance of new technologies. Practical techniques push the bounds resulting in a shift. Application of distributed storage and streaming capabilities such as Kafka and of course Hadoop are shifting Big Data architectures from a layer cake concept, or North/South oriented approach to one which can be thought of as an East/West architectural concept. Recently popular is Lambda Architecture, this article presents an SAP HANA based rendering of the Lambda Architecture. Lambda Architecture Before discussing how SAP products can be used in a Lambda fashion, a brief overview of Lambda is in order. Attribution must go to Nathan Marz for his work. Typically consisting of three components, a batch layer, a speed layer, and a serving layer, Lambda rethinks how data flows through an analytics system. Data flows left to right in a multi speed architecture; with all data stored in the batch layer and the speed layer simultaneously, while the serving layer is used to combine and analyze that data as required. Advantages are fault tolerance, scalability, while at the same time offering multiple latencies to supporting a a variety of performance requirements. With Lambda some business questions are answered using the speed layer, others answered by the batch layer, and still others answered by combing both. As algorithms in the speed layer decide which data to keep, answer immediate questions such as which ad to display or how much discount to offer, it also stores an immutable copy of the data into the batch layer. In addition to acting as a replay source, data in the batch layer is suited for use cases such as long tail analytics, machine learning, or propensity to ‘act’ types of endeavors. Fault tolerance, resilience, ultimate scale, and back testing are derived from having the immutable copy of the original data for replay through the downstream components. The serving layer may persist a copy of the data to be used for additional downstream use of the data such as dashboards or feeds to another system. Shown here are several open source options for the components which make up a Lambda Architecture.
  • 4. SAP External | 4 Traditional Lambda Architecture Lambda with SAP HANA Platform At SAP we are rapidly delivering value with our customers by employing these types of architecture patterns. In 2012 we started moving the HANA product towards the SAP Real-time data architecture where one already finds the notions of Lambda. Smart Data Streaming is the obvious choice for the real-time layer. Born on Wall Street, this component is capable of ingesting millions of rows per second into HANA. Complete with a rich set of API’s, stream and window compute constructs, and scale out capability, it can send the right data into HANA and all the data into HDFS as necessary. As the leading in-memory database platform, HANA meets the needs of the serving component of Lambda rather well. The data can be stored once, then transformed, aggregated, and calculated dynamically. Easy to understand in-memory compute engines such as graph, spatial, OLAP, and predictive analytics are available to manipulate the data where it is stored. Because the immutable data is kept in an append only columnar SQL database and the serving is done on the fly, our customer’s experience a valuable and unique combination of expressive, declarative programming capability on data which is managed in a SQL framework. This allows data access via everyday BI tools via standard ODBC or JDBC SQL as well as ODBO MDX. Based on Node.js, the HANA app server XS Advanced provides amongst other things noSQL access to the data via restful ODATA.
  • 5. SAP External | 5 Often times requirements change such that new algorithms or new derivations of the immutable data are required. Because of the dynamic nature of materialization employed by the HANA core architecture, these changes can be made almost instantly. This flexibility changes the logistics of replay, allowing for rapid experimentation and moves into a production environment. In the case that a schema needs to be extended HANA does have schema flexibility whereby columns may be added to an existing table structure, leaving any existing code intact. The amount of flexibility offered by HANA once the immutable data has been stored is a fundamental differentiator of HANA. This flexibility offers reduced time to value and increased data agility. While the HANA platform is working extremely well for real-time applications, Lambda is complete by incorporating Vora. From an architecture standpoint, there are several aspects to add to the discussion. First is the aspect of scalability.
  • 6. SAP External | 6 One of the beauties of the lambda is scaling each component dynamically and independently as latency requirements shift to meet changing business requirements. This notion of independent scaling allows appropriate resources allocation in a fit-for purpose model. Vora’s use of Hadoop moves the scale of the overall system up to multi petabyte range. More importantly, addressing long tail analytics and other big data compute problems are now possible. This leads to the next concept, which is one of throughput and algorithmic capability. Each of the real- time/batch/serving layers are best suited for a particular algorithm at a given throughout. All of them can run ‘out of band’ code; the question is at which level of concurrency and complexity. Rather than have JAVA JVM as the foundation of each, SAP have the appropriate engines with well-documented and supported extension capabilities. One may argue that open source provides the ultimate extension capability but only as a consuming organization is willing to merge code. In this case, SAP is providing high performance, fit for purpose engines to achieve the desired results more efficiently. This leaves us with the replay topic, an oft forgotten concept, especially in an operational batch context. Usually left for that – operations, it is often the case that re-running loads into a system to catch up with the real world, or to modify results based on new algorithms cause over provisioning of hardware in today's ETL and serving layers. By accounting for this capability up front, the opportunity to make transforms happen at the right time reduce the time and energy required to keep things in phase. HANA is a fantastic example of this given its powerful late materialization capability. Eventually elastic capabilities will further reduce replay workload stress by optimal allocation of compute resources with finer grain and lower latency. By employing lambda and other emerging big data architecture patterns, our customers remain well positioned as the state of the art continues forward.
  • 7. www.sap.com/contactsap © 2016 SAP SE or an SAP affiliate company. All rights reserved. No part of this publication may be reproduced or transmitted in any form or for any purpose without the express permission of SAP SE or an SAP affiliate company. SAP and other SAP products and services mentioned herein as well as their respective logos are trademarks or registered trademarks of SAP SE (or an SAP affiliate company) in Germany and other countries. Please see http://www.sap.com/corporate- en/legal/copyright/index.epx#trademark for additional trademark information and notices. Some software products marketed by SAP SE and its distributors contain proprietary software components of other software vendors. National product specifications may vary. These materials are provided by SAP SE or an SAP affiliate company for informational purposes only, without representation or warranty of any kind, and SAP SE or its affiliated companies shall not be liable for errors or omissions with respect to the materials. The only warranties for SAP SE or SAP affiliate company products and services are those that are set forth in the express warranty statements accompanying such products and services, if any. Nothing herein should be construed as constituting an additional warranty. In particular, SAP SE or its affiliated companies have no obligation to pursue any course of business outlined in this document or any related presentation, or to develop or release any functionality mentioned therein. This document, or any related presentation, and SAP SE’s or its affiliated companies’ strategy and possible future developments, products, and/or platform directions and functionality are all subject to change and may be changed by SAP SE or its affiliated companies at any time for any reason without notice. The information in this document is not a commitment, promise, or legal obligation to deliver any material, code, or functionality. All forward- looking statements are subject to various risks and uncertainties that could cause actual results to differ materially from expectations. Readers are cautioned not to place undue reliance on these forward- looking statements, which speak only as of their dates, and they should not be relied upon in making purchasing decisions.