SlideShare a Scribd company logo
Data science at scale
Introduction
Shashank Agarwal
Associate Director, Engineering
Founder, ThirdWatch (Acquired by Razorpay)
www.linkedin.com/in/shashank734/
2
@shashank734
Data Science @ Razorpay
3
Introduction to Mitra
4
Key Features of Mitra
5
● Predict results within 200 milliseconds in the distributed
environment
● Generate Hundreds of features on the fly during model serving
● Serve results from deployed ML models
● Dynamic rule engine on Flink streams
● Heavily use Flink’s in memory states for storing features and
data
● Use CEP (Complex event processing) to handle async events
Core Engine
6
Data
Source App
Control
App
● CEP
● Join Streams
● Features
● Enrichment
● ML serving
● Rule Engine
● Sink
● Database
● Model Serving
Connector
Async IO
Data Lake &
Model training
app
Why Apache flink?
7
● Low latency
● CEP (Complex event processing) and event time streams
● Flink’s in-memory states
● Async IO
● Automatic Checkpointing for fault tolerance.
● Active community support and active development. For
example, Flink just introduced the most awaited features
schema evolution and State TTL.
Model Deployment
8
On Small Scale
On High Scale
● Network Load
● Resource
allocation
● Limit traffic size
Model Serving Scenarios
9
● Static model in-memory integration (classes, packages etc.)
● Static model Standalone service (Web Service, Socket
connections etc)
● Online learning service with integrated training
● Parameter server with integrated model updates
Tools : Kubeflow, Airflow, Mlflow, H2O
Some Takeaways
10
● There is a difference between research and live models.
● Data richness increases the accuracy.
● Accuracy have different meanings for different businesses.
● Don’t over do, start with a simple approach and it’s not wrong to use business logic along with
predictions.
● You should implement some scenario for A/B testing on Live data.
● Start productionize the models..
Thanks
Ready for new challenge, We are Hiring :
shashank.a@razorpay.com
@shashank734www.linkedin.com/in/shashank734

More Related Content

What's hot (20)

PPTX
intern
Shivang Singh
 
PDF
Productionalizing Models through CI/CD Design with MLflow
Databricks
 
PPTX
Server Monitoring 101
Shovon Paulinus Rozario
 
PPT
Apache kafka- Onkar Kadam
Onkar Kadam
 
PPTX
Providers
BeMyApp
 
PDF
Apache Airavata Cloud Integration
Heshan Suriyaarachchi
 
PPTX
Monitoring Splunk: S.o.S, DMC, and Beyond
Splunk
 
PPTX
Access, Alerts and Application Insights
Martin Abbott
 
PPTX
Automatyzacja Microsoft Azure z wykorzystaniem Azure Automation
Lukasz Kaluzny
 
PDF
(ATS6-DEV05) Building Interactive Web Applications with the Reporting Collection
BIOVIA
 
PDF
(ATS6-DEV03) Building an Enterprise Web Solution with AEP
BIOVIA
 
PDF
Splunk conf2014 - Splunk Monitoring - New Native Tools for Monitoring your Sp...
Splunk
 
PPTX
implementing the right website monitoring strategy
ManageEngine, Zoho Corporation
 
PDF
Spring cloud
Milan Ashara
 
PPTX
Cloud applications monitoring in digital transformation era
ManageEngine, Zoho Corporation
 
PDF
Should we manage events like APIs? | Alan Chatt and Kim Clark, IBM
HostedbyConfluent
 
PPTX
Nasscom ml ops webinar
Sameer Mahajan
 
PPTX
Flink Forward Berlin 2017: Hao Wu - Large Scale User Behavior Analytics by Flink
Flink Forward
 
PDF
GraphQL Advanced
LeanIX GmbH
 
PDF
Le monitoring d'infrastructure de l'ingestion aux données : un jeu d'enfants !
Elasticsearch
 
Productionalizing Models through CI/CD Design with MLflow
Databricks
 
Server Monitoring 101
Shovon Paulinus Rozario
 
Apache kafka- Onkar Kadam
Onkar Kadam
 
Providers
BeMyApp
 
Apache Airavata Cloud Integration
Heshan Suriyaarachchi
 
Monitoring Splunk: S.o.S, DMC, and Beyond
Splunk
 
Access, Alerts and Application Insights
Martin Abbott
 
Automatyzacja Microsoft Azure z wykorzystaniem Azure Automation
Lukasz Kaluzny
 
(ATS6-DEV05) Building Interactive Web Applications with the Reporting Collection
BIOVIA
 
(ATS6-DEV03) Building an Enterprise Web Solution with AEP
BIOVIA
 
Splunk conf2014 - Splunk Monitoring - New Native Tools for Monitoring your Sp...
Splunk
 
implementing the right website monitoring strategy
ManageEngine, Zoho Corporation
 
Spring cloud
Milan Ashara
 
Cloud applications monitoring in digital transformation era
ManageEngine, Zoho Corporation
 
Should we manage events like APIs? | Alan Chatt and Kim Clark, IBM
HostedbyConfluent
 
Nasscom ml ops webinar
Sameer Mahajan
 
Flink Forward Berlin 2017: Hao Wu - Large Scale User Behavior Analytics by Flink
Flink Forward
 
GraphQL Advanced
LeanIX GmbH
 
Le monitoring d'infrastructure de l'ingestion aux données : un jeu d'enfants !
Elasticsearch
 

Similar to Data science at scale with Kafka and Flink (Razorpay) (20)

PPTX
Is Spark the right choice for data analysis ?
Ahmed Kamal
 
PDF
Apache Flink London Meetup - Let's Talk ML on Flink
Stavros Kontopoulos
 
PDF
Big Data Everywhere Chicago: Apache Spark Plus Many Other Frameworks -- How S...
BigDataEverywhere
 
PDF
Data science a practitioner's perspective
Amir Ziai
 
PDF
Flink Forward San Francisco 2018: Dave Torok & Sameer Wadkar - "Embedding Fl...
Flink Forward
 
PDF
Data Science with Spark
Krishna Sankar
 
PPTX
Flink Streaming
Gyula Fóra
 
PDF
Data Science as Scale
Conor B. Murphy
 
PPTX
Software engineering practices for the data science and machine learning life...
DataWorks Summit
 
PDF
DevOps for DataScience
Stepan Pushkarev
 
PDF
Spark
newmooxx
 
PDF
Towards Apache Flink 2.0 - Unified Data Processing and Beyond, Bowen Li
Bowen Li
 
PPTX
The Python ecosystem for data science - Landscape Overview
Dr. Ananth Krishnamoorthy
 
PPTX
Spark-Zeppelin-ML on HWX
Kirk Haslbeck
 
PDF
Scalable Algorithm Design with MapReduce
Pietro Michiardi
 
PDF
Machine learning at Scale with Apache Spark
Martin Zapletal
 
PDF
Flink Forward Berlin 2017: Boris Lublinsky, Stavros Kontopoulos - Introducing...
Flink Forward
 
PPTX
Tech Spark Presentation
Stephen Borg
 
PDF
Data Science in Future Tense
Paco Nathan
 
PDF
A New Year in Data Science: ML Unpaused
Paco Nathan
 
Is Spark the right choice for data analysis ?
Ahmed Kamal
 
Apache Flink London Meetup - Let's Talk ML on Flink
Stavros Kontopoulos
 
Big Data Everywhere Chicago: Apache Spark Plus Many Other Frameworks -- How S...
BigDataEverywhere
 
Data science a practitioner's perspective
Amir Ziai
 
Flink Forward San Francisco 2018: Dave Torok & Sameer Wadkar - "Embedding Fl...
Flink Forward
 
Data Science with Spark
Krishna Sankar
 
Flink Streaming
Gyula Fóra
 
Data Science as Scale
Conor B. Murphy
 
Software engineering practices for the data science and machine learning life...
DataWorks Summit
 
DevOps for DataScience
Stepan Pushkarev
 
Spark
newmooxx
 
Towards Apache Flink 2.0 - Unified Data Processing and Beyond, Bowen Li
Bowen Li
 
The Python ecosystem for data science - Landscape Overview
Dr. Ananth Krishnamoorthy
 
Spark-Zeppelin-ML on HWX
Kirk Haslbeck
 
Scalable Algorithm Design with MapReduce
Pietro Michiardi
 
Machine learning at Scale with Apache Spark
Martin Zapletal
 
Flink Forward Berlin 2017: Boris Lublinsky, Stavros Kontopoulos - Introducing...
Flink Forward
 
Tech Spark Presentation
Stephen Borg
 
Data Science in Future Tense
Paco Nathan
 
A New Year in Data Science: ML Unpaused
Paco Nathan
 
Ad

More from KafkaZone (7)

PPTX
Introduction to ksqlDB and stream processing (Vish Srinivasan - Confluent)
KafkaZone
 
PDF
Real time data processing and model inferncing platform with Kafka streams (N...
KafkaZone
 
PDF
Abstractions for managed stream processing platform (Arya Ketan - Flipkart)
KafkaZone
 
PDF
Tale of two streaming frameworks (Karthik D - Walmart)
KafkaZone
 
PDF
Stream processing with Apache Flink (Timo Walther - Ververica)
KafkaZone
 
PPTX
Stream processing at Hotstar
KafkaZone
 
PDF
Key considerations in productionizing streaming applications
KafkaZone
 
Introduction to ksqlDB and stream processing (Vish Srinivasan - Confluent)
KafkaZone
 
Real time data processing and model inferncing platform with Kafka streams (N...
KafkaZone
 
Abstractions for managed stream processing platform (Arya Ketan - Flipkart)
KafkaZone
 
Tale of two streaming frameworks (Karthik D - Walmart)
KafkaZone
 
Stream processing with Apache Flink (Timo Walther - Ververica)
KafkaZone
 
Stream processing at Hotstar
KafkaZone
 
Key considerations in productionizing streaming applications
KafkaZone
 
Ad

Recently uploaded (20)

PDF
Presentation - Vibe Coding The Future of Tech
yanuarsinggih1
 
PPTX
Building Search Using OpenSearch: Limitations and Workarounds
Sease
 
PDF
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
PDF
Jak MŚP w Europie Środkowo-Wschodniej odnajdują się w świecie AI
dominikamizerska1
 
PDF
POV_ Why Enterprises Need to Find Value in ZERO.pdf
darshakparmar
 
PDF
What Makes Contify’s News API Stand Out: Key Features at a Glance
Contify
 
PDF
"AI Transformation: Directions and Challenges", Pavlo Shaternik
Fwdays
 
PDF
July Patch Tuesday
Ivanti
 
PDF
CIFDAQ Weekly Market Wrap for 11th July 2025
CIFDAQ
 
PDF
How Startups Are Growing Faster with App Developers in Australia.pdf
India App Developer
 
PDF
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
PDF
HubSpot Main Hub: A Unified Growth Platform
Jaswinder Singh
 
PDF
The Rise of AI and IoT in Mobile App Tech.pdf
IMG Global Infotech
 
PPTX
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 
PDF
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 
PDF
NewMind AI - Journal 100 Insights After The 100th Issue
NewMind AI
 
PPTX
WooCommerce Workshop: Bring Your Laptop
Laura Hartwig
 
PDF
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
PDF
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
PDF
Blockchain Transactions Explained For Everyone
CIFDAQ
 
Presentation - Vibe Coding The Future of Tech
yanuarsinggih1
 
Building Search Using OpenSearch: Limitations and Workarounds
Sease
 
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
Jak MŚP w Europie Środkowo-Wschodniej odnajdują się w świecie AI
dominikamizerska1
 
POV_ Why Enterprises Need to Find Value in ZERO.pdf
darshakparmar
 
What Makes Contify’s News API Stand Out: Key Features at a Glance
Contify
 
"AI Transformation: Directions and Challenges", Pavlo Shaternik
Fwdays
 
July Patch Tuesday
Ivanti
 
CIFDAQ Weekly Market Wrap for 11th July 2025
CIFDAQ
 
How Startups Are Growing Faster with App Developers in Australia.pdf
India App Developer
 
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
HubSpot Main Hub: A Unified Growth Platform
Jaswinder Singh
 
The Rise of AI and IoT in Mobile App Tech.pdf
IMG Global Infotech
 
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 
NewMind AI - Journal 100 Insights After The 100th Issue
NewMind AI
 
WooCommerce Workshop: Bring Your Laptop
Laura Hartwig
 
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
Blockchain Transactions Explained For Everyone
CIFDAQ
 

Data science at scale with Kafka and Flink (Razorpay)

  • 2. Introduction Shashank Agarwal Associate Director, Engineering Founder, ThirdWatch (Acquired by Razorpay) www.linkedin.com/in/shashank734/ 2 @shashank734
  • 3. Data Science @ Razorpay 3
  • 5. Key Features of Mitra 5 ● Predict results within 200 milliseconds in the distributed environment ● Generate Hundreds of features on the fly during model serving ● Serve results from deployed ML models ● Dynamic rule engine on Flink streams ● Heavily use Flink’s in memory states for storing features and data ● Use CEP (Complex event processing) to handle async events
  • 6. Core Engine 6 Data Source App Control App ● CEP ● Join Streams ● Features ● Enrichment ● ML serving ● Rule Engine ● Sink ● Database ● Model Serving Connector Async IO Data Lake & Model training app
  • 7. Why Apache flink? 7 ● Low latency ● CEP (Complex event processing) and event time streams ● Flink’s in-memory states ● Async IO ● Automatic Checkpointing for fault tolerance. ● Active community support and active development. For example, Flink just introduced the most awaited features schema evolution and State TTL.
  • 8. Model Deployment 8 On Small Scale On High Scale ● Network Load ● Resource allocation ● Limit traffic size
  • 9. Model Serving Scenarios 9 ● Static model in-memory integration (classes, packages etc.) ● Static model Standalone service (Web Service, Socket connections etc) ● Online learning service with integrated training ● Parameter server with integrated model updates Tools : Kubeflow, Airflow, Mlflow, H2O
  • 10. Some Takeaways 10 ● There is a difference between research and live models. ● Data richness increases the accuracy. ● Accuracy have different meanings for different businesses. ● Don’t over do, start with a simple approach and it’s not wrong to use business logic along with predictions. ● You should implement some scenario for A/B testing on Live data. ● Start productionize the models..
  • 11. Thanks Ready for new challenge, We are Hiring : [email protected] @shashank734www.linkedin.com/in/shashank734