SlideShare a Scribd company logo
Bringing the Power of Big Data
Computation to Salesforce
Arun Bhat
Chief Architect – Model N Inc.
abhat@modeln.com
@parunbhat
Krishna Shekhram
Software Architect – Model N Inc.
kshekhram@modeln.com
@kshekhram
Speaker Introduction
Little bit about us
• Model N is the leading provider of Revenue Management solutions for the life sciences and
technology industries.
• The company helps customers maximize revenues, drive growth and reduce compliance risk by
transforming the revenue lifecycle from inefficient disjointed operation into a strategic end to end
process.
Why do we care about big data
Model N – The Pioneer in Revenue Management
Founded in 1999$120+B
Revenue under management
2+M
Sales lines processed daily
100+
Companies maximizing revenue
with Model N
50,000+
Sales, Sales Ops, FAE’s, Finance,
Marketing, Manufacturing reps and
Distributor users
100+
Countries where Model N
Revenue Management is used
1,000+
Distributors in 50 Countries
Arun Bhat
Chief Architect, Revvy Products
15 years in Model N
19 years in Software Industry
Led Architecture of Model N products
Responsible for architecture of multi-tenant
Revvy products on Salesforce
Passionate about technology but likes to read
comics 
Krishna Shekhram
Architect, Revvy Products
6 years in Model N
14 years in Software Industry
Architected Model N Analytics Products
Lead for Revvy Big Data Architecture
Enjoys exploring new technologies. Love to
watch documentaries to learn more about
world.
Model N – The Pioneer in Revenue Management
Overview
What we will be discussing over this talk
Leveraging Salesforce
Computing using Big Data
Metadata as a common fabric
Integrating into a Cohesive Architecture
Building a Data Driven Application
Demo
Data Pipeline and BigObjects
Summary
Agenda
Big Data
Leveraging Salesforce
To build flexible cloud applications
Availability
Deployment
Elasticity
Customization
Security
Upgradeability
Integration
Device Independence
Multi Tenancy
Metadata
Cloud Computing Force.com Stack Enabling Technology
Leveraging Salesforce Power
User Interface
Logic
Integration
Database
Infrastructure
DeveloperTools
Computing using Big
Data
Realize valuable insights, actions and faster decisions from your
data at scale
Source: logs, social media,
mobile, IOT, POS
Format: structured, text, picture,
video, binary, document
Speed: real-time streams,
transactions, batch upload
Rapid Ingestion
Bigger Storage
Faster Processing
Quicker Retrieval
Better Visualization
Hidden insights discovery
Facts based decision making
Business process automation
Ecosystem engagement
Growth & monetization of data
Data Explosion Technology Evolution Business Opportunities
Why “Big Data” is a Big Deal
Competitive advantage for today, Survival for tomorrow
Big data technology is going through innovation spurt
Big Data Technology Landscape
Components
• HDFS, Map/Reduce, YARN
• Provides fault tolerant and scalable cluster
HDFS as storage
• Supports variety of data formats
• Metadata driven schema evolution
YARN as cluster manager
• Supports Security, Resource Isolation, Multi-tenancy
• Highly available and elastic scaling
Components
• Spark Core, SQL, MLib, Streaming, GraphX
• Can run in variety of clusters (YARN, Mesos,
Standalone)
Data Access
• Data access from HDFS, S3, Cassandra, HBase,
JDBC, Streaming source like Kafka
• Supports multiple formats like Parquet, json, csv, etc.
Compute
• General purpose low latency compute engine
• Batch, Interactive, Query, Predictive, Graph and
Stream processing
Hadoop and Spark Advantage
Data driven, flexible, multi-tenant applications at scale
Hadoop Spark
Metadata
The common fabric
Sales Data Sales Metadata
URL: /tx/sales/Sales.parquet
Columns:
Sale ID: ID
Customer : Relationship (Customer)
Product : Relationship (Product)
Invoice Date: Date
Qty : Integer
Price : Decimal
Metadata Example
Metadata describes data
Sale ID
Customer
Product
Invoice Date
Qty
Price
Product ID
Product #
BU
Customer
ID
Name
Type
Customer
Sales
Product
Calculation Unit Calculation Model
Flexibility & Extensibility
Key for multi tenant cloud applications
Calc
Op
Input
Dataset
Output
Dataset
Define
Metadata
Define
Metadata
Input
Dataset
Input
Dataset
Input
Dataset
Output
Dataset
Output
Dataset
Output
Dataset
Calculation
Model
Metadata MetadataConfiguration
• Metadata Capture & Synchronization
• Define all dataset as objects in Salesforce to capture metadata. Example: Sales, Inventory, Order
• Load actual data in HDFS
• Synchronize metadata on change
• Master Data Sync
• Synchronize the master data from SFDC to HDFS. Example: Accounts, Catalog
• HDFS Schema using metadata
• Use HDFS file formats which supports schema evolution(e.g. Parquet, Avro)
• Use the dataset metadata to read/write HDFS file
• Configure Calculation
• Define Variability in calculation as configuration using Salesforce custom object
Leverage Salesforce to capture metadata
Flexibility & Extensibility using metadata
Integration
Building a cohesive architecture
• Exposes all the REST APIs needed for application.
• Stores application and object metadata
• Provides support for multi-tenancy, error handling and recovery
• Provides secure API for
• Metadata synchronization
• Data Loads
• Batch calculation
• Querying the aggregated results
• Real time calculation/prediction
Exposes big data computation as service
Web Service as Middleware
Compute
Cluster
Cluster Web
Service
• Abstracts out complexity of big data technology
• Translates business specific service calls to calculation jobs
• Uses metadata to build calculation model
• Handles connection to cluster
• Manages multi-tenancy context to submit jobs to cluster
• Interacts with Various cluster components
• HDFS
• YARN
• Spark
Acts as client for cluster
Web Service as Middleware
Compute
Cluster
Cluster Web
Service
Building a Data Driven
Application
Getting best of both world to realize business value
• Unified transactional and analytics application
• Provides real time insights from data in business context
• Calculates KPIs and processes data for business
• Evaluate performance against goal based on data
• Combines intelligence with Action
• Facilitate business process automation
• Learn from data to support fast and accurate decision
Key Concepts
What is a data driven application
Contextual Discovery
Measuring KPIs and
triggering workflow
actions, alerts or
notifications based on KPI.
Claim processing
Fraud detection
Processing large amount
of data and running
business calculation on it
to generate results critical
for business operation.
Tax report generation
Stock portfolio valuation
Intelligent decisions and
actions based on learning
from data. Prediction,
Optimization, Anomaly
detection, AI,
Recommendation.
Google Now, Price
Optimization
Business Process
Automation Data Processing Decision Intelligence
Interactive dashboards
and analysis in the
transactional application
business context.
Account performance
dashboard in CRM
application
Data Driven Application Examples
Guideline for building data driven application
Reference Architecture
Metadata
Manager
Common Library
Data
Manager
Job
Manager
Config
Manager
Application
Account
Catalog
Opportunity
Sales
Segment
Big Data Cluster
Web App Middleware
Cluster Client
Metadata
Service
Data
Service
Application
Service
Data Storage
Calculation Runtime
Demo
Seeing is believing
User enters segment definition
See Sales metadata in Salesforce
Show Sales lines loaded in Hadoop
Trigger segmentation from Salesforce
Show dashboards with segmented customers in Salesforce
Segmenting customers based on revenue
Demo Overview
Data Pipelines
BigObjects
Collaborating with Salesforce on the big data roadmap
Data Pipelines
Brings batch processing using Hadoop to the Salesforce Platform
Apache Pig for data flow control and evaluation
BigObjects
Storage of large amounts of data
Data Pipelines and BigObjects (Pilot)
Features that can be leveraged
BigObjects to store POS, Order and line items
Apache Pig Script and Hadoop through the Data Pipeline API
Features that need to be incorporated
Support Data Pipeline API through Apex (instead of the Metadata API)
Support for low latency jobs e.g. Spark (as compared to batch processing)
To get big data computation in Salesforce
Collaborate with Salesforce on big data roadmap
Reference Architecture
Metadata
Manager
Common Library
Data
Manager
Job
Manager
Config
Manager
Application
Account
Catalog
Opportunity
Sales
Segment
Big Data Cluster
Web App Middleware
Cluster Client
Metadata
Service
Data
Service
Application
Service
Data Storage
Calculation Runtime
Data
Pipeline
Bulk
SOQL
Apex
SObjects
BigObjects
Files
SObjects
BigObjects
Files
SObjects
BigObjects
Files
SObjects
BigObjects
Files
Job
Manager
Config
Manager
Summary
Let’s recap
• How to leverage Salesforce to build flexible cloud applications
• How to use big data computation to realize valuable insights, actions and faster decisions from your data at
scale
• How to fuse Salesforce and Big Data technologies together using metadata and integrations
• How to unlock your business potential using data driven application
• How Salesforce and Big Data technologies can coexist well
What we learnt
Summary
Thank you

More Related Content

What's hot (20)

PPTX
Movie lens movie recommendation system
Gaurav Sawant
 
PDF
Empower Your Security Practitioners with Elastic SIEM
Elasticsearch
 
PDF
Enterprise Security Architecture
Priyanka Aash
 
PPTX
Salesforce sales cloud solutions
JanBask LLC
 
PDF
Azure Machine Learning
Mostafa
 
PPTX
Movie Recommendation System.pptx
randominfo
 
PPT
How Cascading Style Sheets (CSS) Works
Amit Tyagi
 
PPTX
How to Draw an Effective ER diagram
Tech_MX
 
PPTX
Collaborative filtering
Neha Kulkarni
 
PPTX
Security Operations Cloud vs On Prem ISC2 Bangalore SlideShare.pptx
Vikas Singh Yadav
 
PDF
Documenting Software Architectures
Paulo Gandra de Sousa
 
PDF
Security Automation Simplified via NIST OSCAL: We’re Not in Kansas Anymore
Priyanka Aash
 
PDF
Using R for customer segmentation
Kumar P
 
KEY
HTML CSS & Javascript
David Lindkvist
 
PDF
MLOps Bridging the gap between Data Scientists and Ops.
Knoldus Inc.
 
PDF
Ml ops past_present_future
Nisha Talagala
 
PPTX
Data model in salesforce
Chamil Madusanka
 
PDF
Using MLOps to Bring ML to Production/The Promise of MLOps
Weaveworks
 
PPTX
SABSA Implementation(Part III)_ver1-0
Maganathin Veeraragaloo
 
PDF
50 Shades of Sigma
Florian Roth
 
Movie lens movie recommendation system
Gaurav Sawant
 
Empower Your Security Practitioners with Elastic SIEM
Elasticsearch
 
Enterprise Security Architecture
Priyanka Aash
 
Salesforce sales cloud solutions
JanBask LLC
 
Azure Machine Learning
Mostafa
 
Movie Recommendation System.pptx
randominfo
 
How Cascading Style Sheets (CSS) Works
Amit Tyagi
 
How to Draw an Effective ER diagram
Tech_MX
 
Collaborative filtering
Neha Kulkarni
 
Security Operations Cloud vs On Prem ISC2 Bangalore SlideShare.pptx
Vikas Singh Yadav
 
Documenting Software Architectures
Paulo Gandra de Sousa
 
Security Automation Simplified via NIST OSCAL: We’re Not in Kansas Anymore
Priyanka Aash
 
Using R for customer segmentation
Kumar P
 
HTML CSS & Javascript
David Lindkvist
 
MLOps Bridging the gap between Data Scientists and Ops.
Knoldus Inc.
 
Ml ops past_present_future
Nisha Talagala
 
Data model in salesforce
Chamil Madusanka
 
Using MLOps to Bring ML to Production/The Promise of MLOps
Weaveworks
 
SABSA Implementation(Part III)_ver1-0
Maganathin Veeraragaloo
 
50 Shades of Sigma
Florian Roth
 

Viewers also liked (20)

PPTX
Keresőoptimalizálás mobilon: az mSEO eszközei
Norbert Boros
 
PPTX
круиз на ледоколе
PolarStar2017
 
PDF
Luis carlos salazar_topicos de globalizacion.docx
Luis Carlos Salazar Estévez
 
PDF
christopher powell productions
Christopher Powell
 
PPTX
Educacion no presencial
erendida solis
 
PPTX
Mobil rangsolási faktorok
Norbert Boros
 
PDF
BenchMarker Issue 4 2012 -- India Edition
Sewells MSXI
 
PDF
PSD Enablement Session "Mobile Reference Applications"
SAP PartnerEdge program for Application Development
 
PDF
Risk based testing with Jira and Jubula
Daniele Gagliardi
 
PPTX
Mapa conceptual
jaison higuer
 
PPTX
Hbase at Salesforce.com
Salesforce Engineering
 
PPTX
E government dan penerepannya di kota bandung jawa barat
Julio Mamesah
 
PPTX
Salesforce for Nonprofits: Turn Big Data into Social Change
Salesforce.org
 
PDF
Phoenix - A High Performance Open Source SQL Layer over HBase
Salesforce Developers
 
PDF
Unleash the Potential of Big Data on Salesforce
Dreamforce
 
PDF
(359)long pdf repasando la comision angelides
ManfredNolte
 
PDF
Continuous Delivery of Success
Alexander Sutherland
 
PDF
Agile.2013.effecting.a.dev ops.transformation.at.salesforce
Dave Mangot
 
PDF
cardinal health Q2 2007 Earnings Release
finance2
 
Keresőoptimalizálás mobilon: az mSEO eszközei
Norbert Boros
 
круиз на ледоколе
PolarStar2017
 
Luis carlos salazar_topicos de globalizacion.docx
Luis Carlos Salazar Estévez
 
christopher powell productions
Christopher Powell
 
Educacion no presencial
erendida solis
 
Mobil rangsolási faktorok
Norbert Boros
 
BenchMarker Issue 4 2012 -- India Edition
Sewells MSXI
 
PSD Enablement Session "Mobile Reference Applications"
SAP PartnerEdge program for Application Development
 
Risk based testing with Jira and Jubula
Daniele Gagliardi
 
Mapa conceptual
jaison higuer
 
Hbase at Salesforce.com
Salesforce Engineering
 
E government dan penerepannya di kota bandung jawa barat
Julio Mamesah
 
Salesforce for Nonprofits: Turn Big Data into Social Change
Salesforce.org
 
Phoenix - A High Performance Open Source SQL Layer over HBase
Salesforce Developers
 
Unleash the Potential of Big Data on Salesforce
Dreamforce
 
(359)long pdf repasando la comision angelides
ManfredNolte
 
Continuous Delivery of Success
Alexander Sutherland
 
Agile.2013.effecting.a.dev ops.transformation.at.salesforce
Dave Mangot
 
cardinal health Q2 2007 Earnings Release
finance2
 
Ad

Similar to Bringing the Power of Big Data Computation to Salesforce (20)

PPTX
Microsoft cloud big data strategy
James Serra
 
PPTX
How does Microsoft solve Big Data?
James Serra
 
PPTX
Extreme SSAS- SQL 2011
Itay Braun
 
PDF
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Denodo
 
PPTX
Big Data in Azure
DataWorks Summit/Hadoop Summit
 
PPTX
Skillwise Big Data part 2
Skillwise Group
 
PPTX
Skilwise Big data
Skillwise Group
 
PDF
BAR360 open data platform presentation at DAMA, Sydney
Sai Paravastu
 
PPTX
DRM Webinar Series, PART 3: Will DRM Integrate With Our Applications?
US-Analytics
 
PPT
8.17.11 big data and hadoop with informatica slideshare
Julianna DeLua
 
PPTX
Enabling Next Gen Analytics with Azure Data Lake and StreamSets
Streamsets Inc.
 
PPTX
Finding business value in Big Data
James Serra
 
PDF
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
Hortonworks
 
PDF
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Denodo
 
PPTX
Opportunity: Data, Analytic & Azure
Abhimanyu Singhal
 
PDF
Modernizing to a Cloud Data Architecture
Databricks
 
PDF
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
MSAdvAnalytics
 
PDF
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Dataconomy Media
 
PPTX
Building a Big Data Solution
James Serra
 
PPTX
Choosing technologies for a big data solution in the cloud
James Serra
 
Microsoft cloud big data strategy
James Serra
 
How does Microsoft solve Big Data?
James Serra
 
Extreme SSAS- SQL 2011
Itay Braun
 
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Denodo
 
Skillwise Big Data part 2
Skillwise Group
 
Skilwise Big data
Skillwise Group
 
BAR360 open data platform presentation at DAMA, Sydney
Sai Paravastu
 
DRM Webinar Series, PART 3: Will DRM Integrate With Our Applications?
US-Analytics
 
8.17.11 big data and hadoop with informatica slideshare
Julianna DeLua
 
Enabling Next Gen Analytics with Azure Data Lake and StreamSets
Streamsets Inc.
 
Finding business value in Big Data
James Serra
 
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
Hortonworks
 
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Denodo
 
Opportunity: Data, Analytic & Azure
Abhimanyu Singhal
 
Modernizing to a Cloud Data Architecture
Databricks
 
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
MSAdvAnalytics
 
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Dataconomy Media
 
Building a Big Data Solution
James Serra
 
Choosing technologies for a big data solution in the cloud
James Serra
 
Ad

More from Salesforce Developers (20)

PDF
Sample Gallery: Reference Code and Best Practices for Salesforce Developers
Salesforce Developers
 
PDF
Maximizing Salesforce Lightning Experience and Lightning Component Performance
Salesforce Developers
 
PDF
Local development with Open Source Base Components
Salesforce Developers
 
PPTX
TrailheaDX India : Developer Highlights
Salesforce Developers
 
PDF
Why developers shouldn’t miss TrailheaDX India
Salesforce Developers
 
PPTX
CodeLive: Build Lightning Web Components faster with Local Development
Salesforce Developers
 
PPTX
CodeLive: Converting Aura Components to Lightning Web Components
Salesforce Developers
 
PPTX
Enterprise-grade UI with open source Lightning Web Components
Salesforce Developers
 
PPTX
TrailheaDX and Summer '19: Developer Highlights
Salesforce Developers
 
PDF
Live coding with LWC
Salesforce Developers
 
PDF
Lightning web components - Episode 4 : Security and Testing
Salesforce Developers
 
PDF
LWC Episode 3- Component Communication and Aura Interoperability
Salesforce Developers
 
PDF
Lightning web components episode 2- work with salesforce data
Salesforce Developers
 
PDF
Lightning web components - Episode 1 - An Introduction
Salesforce Developers
 
PDF
Migrating CPQ to Advanced Calculator and JSQCP
Salesforce Developers
 
PDF
Scale with Large Data Volumes and Big Objects in Salesforce
Salesforce Developers
 
PDF
Replicate Salesforce Data in Real Time with Change Data Capture
Salesforce Developers
 
PDF
Modern Development with Salesforce DX
Salesforce Developers
 
PDF
Get Into Lightning Flow Development
Salesforce Developers
 
PDF
Integrate CMS Content Into Lightning Communities with CMS Connect
Salesforce Developers
 
Sample Gallery: Reference Code and Best Practices for Salesforce Developers
Salesforce Developers
 
Maximizing Salesforce Lightning Experience and Lightning Component Performance
Salesforce Developers
 
Local development with Open Source Base Components
Salesforce Developers
 
TrailheaDX India : Developer Highlights
Salesforce Developers
 
Why developers shouldn’t miss TrailheaDX India
Salesforce Developers
 
CodeLive: Build Lightning Web Components faster with Local Development
Salesforce Developers
 
CodeLive: Converting Aura Components to Lightning Web Components
Salesforce Developers
 
Enterprise-grade UI with open source Lightning Web Components
Salesforce Developers
 
TrailheaDX and Summer '19: Developer Highlights
Salesforce Developers
 
Live coding with LWC
Salesforce Developers
 
Lightning web components - Episode 4 : Security and Testing
Salesforce Developers
 
LWC Episode 3- Component Communication and Aura Interoperability
Salesforce Developers
 
Lightning web components episode 2- work with salesforce data
Salesforce Developers
 
Lightning web components - Episode 1 - An Introduction
Salesforce Developers
 
Migrating CPQ to Advanced Calculator and JSQCP
Salesforce Developers
 
Scale with Large Data Volumes and Big Objects in Salesforce
Salesforce Developers
 
Replicate Salesforce Data in Real Time with Change Data Capture
Salesforce Developers
 
Modern Development with Salesforce DX
Salesforce Developers
 
Get Into Lightning Flow Development
Salesforce Developers
 
Integrate CMS Content Into Lightning Communities with CMS Connect
Salesforce Developers
 

Recently uploaded (20)

PDF
Mastering Financial Management in Direct Selling
Epixel MLM Software
 
PDF
“NPU IP Hardware Shaped Through Software and Use-case Analysis,” a Presentati...
Edge AI and Vision Alliance
 
PPTX
Building Search Using OpenSearch: Limitations and Workarounds
Sease
 
PDF
"AI Transformation: Directions and Challenges", Pavlo Shaternik
Fwdays
 
PDF
LOOPS in C Programming Language - Technology
RishabhDwivedi43
 
PDF
Transforming Utility Networks: Large-scale Data Migrations with FME
Safe Software
 
PDF
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
PDF
How Startups Are Growing Faster with App Developers in Australia.pdf
India App Developer
 
PPTX
Designing Production-Ready AI Agents
Kunal Rai
 
PDF
Biography of Daniel Podor.pdf
Daniel Podor
 
PPTX
AI Penetration Testing Essentials: A Cybersecurity Guide for 2025
defencerabbit Team
 
PPTX
OpenID AuthZEN - Analyst Briefing July 2025
David Brossard
 
PDF
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
PDF
What Makes Contify’s News API Stand Out: Key Features at a Glance
Contify
 
PDF
IoT-Powered Industrial Transformation – Smart Manufacturing to Connected Heal...
Rejig Digital
 
PDF
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 
PPTX
The Project Compass - GDG on Campus MSIT
dscmsitkol
 
PPTX
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
PPTX
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
PPTX
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
Mastering Financial Management in Direct Selling
Epixel MLM Software
 
“NPU IP Hardware Shaped Through Software and Use-case Analysis,” a Presentati...
Edge AI and Vision Alliance
 
Building Search Using OpenSearch: Limitations and Workarounds
Sease
 
"AI Transformation: Directions and Challenges", Pavlo Shaternik
Fwdays
 
LOOPS in C Programming Language - Technology
RishabhDwivedi43
 
Transforming Utility Networks: Large-scale Data Migrations with FME
Safe Software
 
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
How Startups Are Growing Faster with App Developers in Australia.pdf
India App Developer
 
Designing Production-Ready AI Agents
Kunal Rai
 
Biography of Daniel Podor.pdf
Daniel Podor
 
AI Penetration Testing Essentials: A Cybersecurity Guide for 2025
defencerabbit Team
 
OpenID AuthZEN - Analyst Briefing July 2025
David Brossard
 
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
What Makes Contify’s News API Stand Out: Key Features at a Glance
Contify
 
IoT-Powered Industrial Transformation – Smart Manufacturing to Connected Heal...
Rejig Digital
 
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 
The Project Compass - GDG on Campus MSIT
dscmsitkol
 
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 

Bringing the Power of Big Data Computation to Salesforce

  • 1. Bringing the Power of Big Data Computation to Salesforce Arun Bhat Chief Architect – Model N Inc. [email protected] @parunbhat Krishna Shekhram Software Architect – Model N Inc. [email protected] @kshekhram
  • 3. • Model N is the leading provider of Revenue Management solutions for the life sciences and technology industries. • The company helps customers maximize revenues, drive growth and reduce compliance risk by transforming the revenue lifecycle from inefficient disjointed operation into a strategic end to end process. Why do we care about big data Model N – The Pioneer in Revenue Management Founded in 1999$120+B Revenue under management 2+M Sales lines processed daily 100+ Companies maximizing revenue with Model N 50,000+ Sales, Sales Ops, FAE’s, Finance, Marketing, Manufacturing reps and Distributor users 100+ Countries where Model N Revenue Management is used 1,000+ Distributors in 50 Countries
  • 4. Arun Bhat Chief Architect, Revvy Products 15 years in Model N 19 years in Software Industry Led Architecture of Model N products Responsible for architecture of multi-tenant Revvy products on Salesforce Passionate about technology but likes to read comics  Krishna Shekhram Architect, Revvy Products 6 years in Model N 14 years in Software Industry Architected Model N Analytics Products Lead for Revvy Big Data Architecture Enjoys exploring new technologies. Love to watch documentaries to learn more about world. Model N – The Pioneer in Revenue Management
  • 5. Overview What we will be discussing over this talk
  • 6. Leveraging Salesforce Computing using Big Data Metadata as a common fabric Integrating into a Cohesive Architecture Building a Data Driven Application Demo Data Pipeline and BigObjects Summary Agenda Big Data
  • 7. Leveraging Salesforce To build flexible cloud applications
  • 8. Availability Deployment Elasticity Customization Security Upgradeability Integration Device Independence Multi Tenancy Metadata Cloud Computing Force.com Stack Enabling Technology Leveraging Salesforce Power User Interface Logic Integration Database Infrastructure DeveloperTools
  • 9. Computing using Big Data Realize valuable insights, actions and faster decisions from your data at scale
  • 10. Source: logs, social media, mobile, IOT, POS Format: structured, text, picture, video, binary, document Speed: real-time streams, transactions, batch upload Rapid Ingestion Bigger Storage Faster Processing Quicker Retrieval Better Visualization Hidden insights discovery Facts based decision making Business process automation Ecosystem engagement Growth & monetization of data Data Explosion Technology Evolution Business Opportunities Why “Big Data” is a Big Deal Competitive advantage for today, Survival for tomorrow
  • 11. Big data technology is going through innovation spurt Big Data Technology Landscape
  • 12. Components • HDFS, Map/Reduce, YARN • Provides fault tolerant and scalable cluster HDFS as storage • Supports variety of data formats • Metadata driven schema evolution YARN as cluster manager • Supports Security, Resource Isolation, Multi-tenancy • Highly available and elastic scaling Components • Spark Core, SQL, MLib, Streaming, GraphX • Can run in variety of clusters (YARN, Mesos, Standalone) Data Access • Data access from HDFS, S3, Cassandra, HBase, JDBC, Streaming source like Kafka • Supports multiple formats like Parquet, json, csv, etc. Compute • General purpose low latency compute engine • Batch, Interactive, Query, Predictive, Graph and Stream processing Hadoop and Spark Advantage Data driven, flexible, multi-tenant applications at scale Hadoop Spark
  • 14. Sales Data Sales Metadata URL: /tx/sales/Sales.parquet Columns: Sale ID: ID Customer : Relationship (Customer) Product : Relationship (Product) Invoice Date: Date Qty : Integer Price : Decimal Metadata Example Metadata describes data Sale ID Customer Product Invoice Date Qty Price Product ID Product # BU Customer ID Name Type Customer Sales Product
  • 15. Calculation Unit Calculation Model Flexibility & Extensibility Key for multi tenant cloud applications Calc Op Input Dataset Output Dataset Define Metadata Define Metadata Input Dataset Input Dataset Input Dataset Output Dataset Output Dataset Output Dataset Calculation Model Metadata MetadataConfiguration
  • 16. • Metadata Capture & Synchronization • Define all dataset as objects in Salesforce to capture metadata. Example: Sales, Inventory, Order • Load actual data in HDFS • Synchronize metadata on change • Master Data Sync • Synchronize the master data from SFDC to HDFS. Example: Accounts, Catalog • HDFS Schema using metadata • Use HDFS file formats which supports schema evolution(e.g. Parquet, Avro) • Use the dataset metadata to read/write HDFS file • Configure Calculation • Define Variability in calculation as configuration using Salesforce custom object Leverage Salesforce to capture metadata Flexibility & Extensibility using metadata
  • 18. • Exposes all the REST APIs needed for application. • Stores application and object metadata • Provides support for multi-tenancy, error handling and recovery • Provides secure API for • Metadata synchronization • Data Loads • Batch calculation • Querying the aggregated results • Real time calculation/prediction Exposes big data computation as service Web Service as Middleware Compute Cluster Cluster Web Service
  • 19. • Abstracts out complexity of big data technology • Translates business specific service calls to calculation jobs • Uses metadata to build calculation model • Handles connection to cluster • Manages multi-tenancy context to submit jobs to cluster • Interacts with Various cluster components • HDFS • YARN • Spark Acts as client for cluster Web Service as Middleware Compute Cluster Cluster Web Service
  • 20. Building a Data Driven Application Getting best of both world to realize business value
  • 21. • Unified transactional and analytics application • Provides real time insights from data in business context • Calculates KPIs and processes data for business • Evaluate performance against goal based on data • Combines intelligence with Action • Facilitate business process automation • Learn from data to support fast and accurate decision Key Concepts What is a data driven application
  • 22. Contextual Discovery Measuring KPIs and triggering workflow actions, alerts or notifications based on KPI. Claim processing Fraud detection Processing large amount of data and running business calculation on it to generate results critical for business operation. Tax report generation Stock portfolio valuation Intelligent decisions and actions based on learning from data. Prediction, Optimization, Anomaly detection, AI, Recommendation. Google Now, Price Optimization Business Process Automation Data Processing Decision Intelligence Interactive dashboards and analysis in the transactional application business context. Account performance dashboard in CRM application Data Driven Application Examples
  • 23. Guideline for building data driven application Reference Architecture Metadata Manager Common Library Data Manager Job Manager Config Manager Application Account Catalog Opportunity Sales Segment Big Data Cluster Web App Middleware Cluster Client Metadata Service Data Service Application Service Data Storage Calculation Runtime
  • 25. User enters segment definition See Sales metadata in Salesforce Show Sales lines loaded in Hadoop Trigger segmentation from Salesforce Show dashboards with segmented customers in Salesforce Segmenting customers based on revenue Demo Overview
  • 26. Data Pipelines BigObjects Collaborating with Salesforce on the big data roadmap
  • 27. Data Pipelines Brings batch processing using Hadoop to the Salesforce Platform Apache Pig for data flow control and evaluation BigObjects Storage of large amounts of data Data Pipelines and BigObjects (Pilot)
  • 28. Features that can be leveraged BigObjects to store POS, Order and line items Apache Pig Script and Hadoop through the Data Pipeline API Features that need to be incorporated Support Data Pipeline API through Apex (instead of the Metadata API) Support for low latency jobs e.g. Spark (as compared to batch processing) To get big data computation in Salesforce Collaborate with Salesforce on big data roadmap
  • 29. Reference Architecture Metadata Manager Common Library Data Manager Job Manager Config Manager Application Account Catalog Opportunity Sales Segment Big Data Cluster Web App Middleware Cluster Client Metadata Service Data Service Application Service Data Storage Calculation Runtime Data Pipeline Bulk SOQL Apex SObjects BigObjects Files SObjects BigObjects Files SObjects BigObjects Files SObjects BigObjects Files Job Manager Config Manager
  • 31. • How to leverage Salesforce to build flexible cloud applications • How to use big data computation to realize valuable insights, actions and faster decisions from your data at scale • How to fuse Salesforce and Big Data technologies together using metadata and integrations • How to unlock your business potential using data driven application • How Salesforce and Big Data technologies can coexist well What we learnt Summary