SlideShare a Scribd company logo
3
Most read
5
Most read
6
Most read
Presented By: Prateek Gupta
Introduction to
Amazon Kinesis Data
Streams
Lack of etiquette and manners is a huge turn off.
KnolX Etiquettes
Punctuality
Join the session 5 minutes prior to
the session start time. We start on
time and conclude on time!
Feedback
Make sure to submit a constructive
feedback for all sessions as it is
very helpful for the presenter.
Silent Mode
Keep your mobile devices in silent
mode, feel free to move out of
session in case you need to attend
an urgent call.
Avoid Disturbance
Avoid unwanted chit chat during
the session.
Our Agenda
02 Amazon Kinesis Data Streams
03 High-Level Architecture
04 Key Concepts and Terminology
05 Basic Operations
01 What is Streaming Data?
06 Demo
What is Streaming
Data?
What is Streaming Data?
Streaming data refers to the data that is generated continuously in real time by thousands of data
sources and delivered to a system for processing.
Key Points:
● Real-time
● Continuous flow
● Variety of sources
● Variety of formats
● Requires specialized processing
Examples:
● Ecommerce purchases
● Game data
● Information from social networks
● Log data
● Stock prices
● GPS data
● IoT Sensor Data
Amazon Kinesis Data Streams
Amazon Kinesis Data Streams is a real-time streaming data service by AWS. It makes it easy to
collect and process real-time streaming data at high scale.
Some key points to understand:
● Real-time data
● Highly Scalable
● Data sources
● Processing
● Cost-effective
● Easy to use
High-Level Architecture
● The producers continually push data to Kinesis Data Streams, and the consumers process the data in
real time.
● Once the processing is done by the consumer, the result are stored using an AWS service such as
Amazon DynamoDB, Amazon Redshift, or Amazon S3.
Key Concepts and Terminology
➢ Producer: It is an application that puts the data records into Amazon Kinesis Data
Streams.
➢ Consumer: It is an application that retrieves the data records from Amazon Kinesis Data
Streams and process them.
➢ Kinesis Data Stream:
○ A Kinesis data stream is a set of shards.
○ Each shard has a sequence of data records.
○ Each data record has a sequence number.
○ Data retains for 24 hours by default.
➢ Shard:
○ A shard is a uniquely identified sequence of data records
○ A stream is composed of one or more shards, each of which provides a fixed unit of
capacity.
○ Each shard can support up to 1000 PUT records per second(or 1MB/sec), and up to
1,000 GET records per second(or 2MB/sec)
○ The data capacity of a stream is a function of the number of shards.
○ If the data rate increases, increase the number of shards allocated to the stream.
➢ Data Record:
○ A data record is the unit of data stored in a Kinesis data stream.
○ Each data record is composed of a sequence number, a partition key, and a data
blob(up to 1MB).
➢ Sequence Number:
○ A sequence number is a unique identifier for each data record.
○ Allows to read data in the order and also to determine which records have been processed
➢ Partition Key:
○ A partition key is a meaningful identifier that is associated with each record.
○ It is used by the service to determine which shard to store the record in.
○ Specified by the data producer while putting data into a data stream
○ Records with the same partition key are stored together in the same shard.
➢ Retention Period:
○ Amount of time that data records are stored in an Amazon Kinesis Data Stream.
○ Default data retention period for a stream is 24 hours(configurable upto 365 days)
➢ Capacity Mode:
○ The capacity mode determines how capacity is managed and the usage charges for a data
stream.
○ Currently, in Kinesis Data Streams, we can choose between an on-demand mode and a
provisioned mode for our data streams.
Basic Operations
Amazon Kinesis Data Streams provides a number of operations that can be performed on a data
stream. Here are some basic operations:
● create-stream
● describe-stream
● list-streams
● put-record
● get-shard-iterator
● get-records
● split-shard
● merge-shards
● delete-stream
Demo
References
● Kinesis Data Streams Official Documentation
● AWS Kinesis - Javatpoint
Thank You !

More Related Content

What's hot (20)

PPT
Performance Testing
sharmaparish
 
PPTX
AWS Cloud trail
zekeLabs Technologies
 
PPS
JUnit Presentation
priya_trivedi
 
PDF
CI CD Pipeline Using Jenkins | Continuous Integration and Deployment | DevOps...
Edureka!
 
PDF
Azure DevOps Presentation
InCycleSoftware
 
PDF
Java exception-handling
Suresh Kumar Reddy V
 
PDF
Default GitLab CI Pipeline - Auto DevOps
Rajith Bhanuka Mahanama
 
PPTX
Cloud Formation
TO THE NEW | Technology
 
PPTX
AWS S3 | Tutorial For Beginners | AWS S3 Bucket Tutorial | AWS Tutorial For B...
Simplilearn
 
PDF
AWS Control Tower
CloudHesive
 
PDF
Testcontainers - Geekout EE 2017 presentation
Richard North
 
PDF
Java Generics - by Example
CodeOps Technologies LLP
 
PDF
Code Refactoring Cheatsheet
Rachanee Saengkrajai
 
PPTX
Azure dev ops
Tomy Rhymond
 
PPTX
API Presentation
nityakulkarni
 
PPSX
Entity beans in java
Acp Jamod
 
PDF
Jenkins
Roger Xia
 
PPTX
Lambda Expressions in C# From Beginner To Expert - Jaliya Udagedara
Jaliya Udagedara
 
PDF
Android activities & views
ma-polimi
 
PDF
Java8 features
Elias Hasnat
 
Performance Testing
sharmaparish
 
AWS Cloud trail
zekeLabs Technologies
 
JUnit Presentation
priya_trivedi
 
CI CD Pipeline Using Jenkins | Continuous Integration and Deployment | DevOps...
Edureka!
 
Azure DevOps Presentation
InCycleSoftware
 
Java exception-handling
Suresh Kumar Reddy V
 
Default GitLab CI Pipeline - Auto DevOps
Rajith Bhanuka Mahanama
 
Cloud Formation
TO THE NEW | Technology
 
AWS S3 | Tutorial For Beginners | AWS S3 Bucket Tutorial | AWS Tutorial For B...
Simplilearn
 
AWS Control Tower
CloudHesive
 
Testcontainers - Geekout EE 2017 presentation
Richard North
 
Java Generics - by Example
CodeOps Technologies LLP
 
Code Refactoring Cheatsheet
Rachanee Saengkrajai
 
Azure dev ops
Tomy Rhymond
 
API Presentation
nityakulkarni
 
Entity beans in java
Acp Jamod
 
Jenkins
Roger Xia
 
Lambda Expressions in C# From Beginner To Expert - Jaliya Udagedara
Jaliya Udagedara
 
Android activities & views
ma-polimi
 
Java8 features
Elias Hasnat
 

Similar to Introduction to Amazon Kinesis Data Streams (20)

PPSX
Apache Flink, AWS Kinesis, Analytics
Araf Karsh Hamid
 
PDF
AWS Kinesis - Streams, Firehose, Analytics
Serhat Can
 
PDF
찾아가는 AWS 세미나(구로,가산,판교) - AWS 기반 빅데이터 활용 방법 (김일호 솔루션즈 아키텍트)
Amazon Web Services Korea
 
PDF
Choose Right Stream Storage: Amazon Kinesis Data Streams vs MSK
Sungmin Kim
 
PPTX
AWS Kinesis
Julian Kleinhans
 
PDF
1.0 - AWS-DAS-Collection-Kinesis.pdf
SreeGe1
 
PPTX
Amazon Kinesis Data Streams Vs Msk (1).pptx
RenjithPillai26
 
PDF
Amazon Kinesis Data Streams
Elif Nurber Karakaş
 
PDF
Barga IC2E & IoTDI'16 Keynote
Roger Barga
 
PDF
AWS를 활용한 첫 빅데이터 프로젝트 시작하기(김일호)- AWS 웨비나 시리즈 2015
Amazon Web Services Korea
 
PDF
AWS data engineer online course | AWS data engineer training
Accentfuture
 
PDF
AWS Lambda Supports Parallelization Factor for Kinesis and DynamoDB Event Sou...
Swapnil Pawar
 
PDF
Path to the future #4 - Ingestão, processamento e análise de dados em tempo real
Amazon Web Services LATAM
 
PDF
Realtime Analytics on AWS
Sungmin Kim
 
PDF
A quick introduction to AWS Kinesis
ogeisser
 
PDF
AWS Kinesis Streams
Fernando Rodriguez
 
PDF
Em tempo real: Ingestão, processamento e analise de dados
Amazon Web Services LATAM
 
PDF
SNS SQS SWF and Kinesis
Mahesh Raj
 
PPTX
Introduction to AWS Kinesis
Steven Ensslen
 
PPTX
Community day ppt_kinesisv1.0
Sridevi Murugayen
 
Apache Flink, AWS Kinesis, Analytics
Araf Karsh Hamid
 
AWS Kinesis - Streams, Firehose, Analytics
Serhat Can
 
찾아가는 AWS 세미나(구로,가산,판교) - AWS 기반 빅데이터 활용 방법 (김일호 솔루션즈 아키텍트)
Amazon Web Services Korea
 
Choose Right Stream Storage: Amazon Kinesis Data Streams vs MSK
Sungmin Kim
 
AWS Kinesis
Julian Kleinhans
 
1.0 - AWS-DAS-Collection-Kinesis.pdf
SreeGe1
 
Amazon Kinesis Data Streams Vs Msk (1).pptx
RenjithPillai26
 
Amazon Kinesis Data Streams
Elif Nurber Karakaş
 
Barga IC2E & IoTDI'16 Keynote
Roger Barga
 
AWS를 활용한 첫 빅데이터 프로젝트 시작하기(김일호)- AWS 웨비나 시리즈 2015
Amazon Web Services Korea
 
AWS data engineer online course | AWS data engineer training
Accentfuture
 
AWS Lambda Supports Parallelization Factor for Kinesis and DynamoDB Event Sou...
Swapnil Pawar
 
Path to the future #4 - Ingestão, processamento e análise de dados em tempo real
Amazon Web Services LATAM
 
Realtime Analytics on AWS
Sungmin Kim
 
A quick introduction to AWS Kinesis
ogeisser
 
AWS Kinesis Streams
Fernando Rodriguez
 
Em tempo real: Ingestão, processamento e analise de dados
Amazon Web Services LATAM
 
SNS SQS SWF and Kinesis
Mahesh Raj
 
Introduction to AWS Kinesis
Steven Ensslen
 
Community day ppt_kinesisv1.0
Sridevi Murugayen
 
Ad

More from Knoldus Inc. (20)

PPTX
Angular Hydration Presentation (FrontEnd)
Knoldus Inc.
 
PPTX
Optimizing Test Execution: Heuristic Algorithm for Self-Healing
Knoldus Inc.
 
PPTX
Self-Healing Test Automation Framework - Healenium
Knoldus Inc.
 
PPTX
Kanban Metrics Presentation (Project Management)
Knoldus Inc.
 
PPTX
Java 17 features and implementation.pptx
Knoldus Inc.
 
PPTX
Chaos Mesh Introducing Chaos in Kubernetes
Knoldus Inc.
 
PPTX
GraalVM - A Step Ahead of JVM Presentation
Knoldus Inc.
 
PPTX
Nomad by HashiCorp Presentation (DevOps)
Knoldus Inc.
 
PPTX
Nomad by HashiCorp Presentation (DevOps)
Knoldus Inc.
 
PPTX
DAPR - Distributed Application Runtime Presentation
Knoldus Inc.
 
PPTX
Introduction to Azure Virtual WAN Presentation
Knoldus Inc.
 
PPTX
Introduction to Argo Rollouts Presentation
Knoldus Inc.
 
PPTX
Intro to Azure Container App Presentation
Knoldus Inc.
 
PPTX
Insights Unveiled Test Reporting and Observability Excellence
Knoldus Inc.
 
PPTX
Introduction to Splunk Presentation (DevOps)
Knoldus Inc.
 
PPTX
Code Camp - Data Profiling and Quality Analysis Framework
Knoldus Inc.
 
PPTX
AWS: Messaging Services in AWS Presentation
Knoldus Inc.
 
PPTX
Amazon Cognito: A Primer on Authentication and Authorization
Knoldus Inc.
 
PPTX
ZIO Http A Functional Approach to Scalable and Type-Safe Web Development
Knoldus Inc.
 
PPTX
Managing State & HTTP Requests In Ionic.
Knoldus Inc.
 
Angular Hydration Presentation (FrontEnd)
Knoldus Inc.
 
Optimizing Test Execution: Heuristic Algorithm for Self-Healing
Knoldus Inc.
 
Self-Healing Test Automation Framework - Healenium
Knoldus Inc.
 
Kanban Metrics Presentation (Project Management)
Knoldus Inc.
 
Java 17 features and implementation.pptx
Knoldus Inc.
 
Chaos Mesh Introducing Chaos in Kubernetes
Knoldus Inc.
 
GraalVM - A Step Ahead of JVM Presentation
Knoldus Inc.
 
Nomad by HashiCorp Presentation (DevOps)
Knoldus Inc.
 
Nomad by HashiCorp Presentation (DevOps)
Knoldus Inc.
 
DAPR - Distributed Application Runtime Presentation
Knoldus Inc.
 
Introduction to Azure Virtual WAN Presentation
Knoldus Inc.
 
Introduction to Argo Rollouts Presentation
Knoldus Inc.
 
Intro to Azure Container App Presentation
Knoldus Inc.
 
Insights Unveiled Test Reporting and Observability Excellence
Knoldus Inc.
 
Introduction to Splunk Presentation (DevOps)
Knoldus Inc.
 
Code Camp - Data Profiling and Quality Analysis Framework
Knoldus Inc.
 
AWS: Messaging Services in AWS Presentation
Knoldus Inc.
 
Amazon Cognito: A Primer on Authentication and Authorization
Knoldus Inc.
 
ZIO Http A Functional Approach to Scalable and Type-Safe Web Development
Knoldus Inc.
 
Managing State & HTTP Requests In Ionic.
Knoldus Inc.
 
Ad

Recently uploaded (20)

PDF
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
PDF
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
PDF
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 
PDF
IoT-Powered Industrial Transformation – Smart Manufacturing to Connected Heal...
Rejig Digital
 
PDF
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
PPTX
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
PDF
From Code to Challenge: Crafting Skill-Based Games That Engage and Reward
aiyshauae
 
PDF
"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...
Fwdays
 
PDF
Transcript: New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
PPTX
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
PDF
Chris Elwell Woburn, MA - Passionate About IT Innovation
Chris Elwell Woburn, MA
 
PDF
Blockchain Transactions Explained For Everyone
CIFDAQ
 
PDF
Jak MŚP w Europie Środkowo-Wschodniej odnajdują się w świecie AI
dominikamizerska1
 
PDF
July Patch Tuesday
Ivanti
 
PPTX
WooCommerce Workshop: Bring Your Laptop
Laura Hartwig
 
PDF
Biography of Daniel Podor.pdf
Daniel Podor
 
PPTX
AUTOMATION AND ROBOTICS IN PHARMA INDUSTRY.pptx
sameeraaabegumm
 
PDF
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
PDF
What Makes Contify’s News API Stand Out: Key Features at a Glance
Contify
 
PPTX
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 
IoT-Powered Industrial Transformation – Smart Manufacturing to Connected Heal...
Rejig Digital
 
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
From Code to Challenge: Crafting Skill-Based Games That Engage and Reward
aiyshauae
 
"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...
Fwdays
 
Transcript: New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
Chris Elwell Woburn, MA - Passionate About IT Innovation
Chris Elwell Woburn, MA
 
Blockchain Transactions Explained For Everyone
CIFDAQ
 
Jak MŚP w Europie Środkowo-Wschodniej odnajdują się w świecie AI
dominikamizerska1
 
July Patch Tuesday
Ivanti
 
WooCommerce Workshop: Bring Your Laptop
Laura Hartwig
 
Biography of Daniel Podor.pdf
Daniel Podor
 
AUTOMATION AND ROBOTICS IN PHARMA INDUSTRY.pptx
sameeraaabegumm
 
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
What Makes Contify’s News API Stand Out: Key Features at a Glance
Contify
 
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 

Introduction to Amazon Kinesis Data Streams

  • 1. Presented By: Prateek Gupta Introduction to Amazon Kinesis Data Streams
  • 2. Lack of etiquette and manners is a huge turn off. KnolX Etiquettes Punctuality Join the session 5 minutes prior to the session start time. We start on time and conclude on time! Feedback Make sure to submit a constructive feedback for all sessions as it is very helpful for the presenter. Silent Mode Keep your mobile devices in silent mode, feel free to move out of session in case you need to attend an urgent call. Avoid Disturbance Avoid unwanted chit chat during the session.
  • 3. Our Agenda 02 Amazon Kinesis Data Streams 03 High-Level Architecture 04 Key Concepts and Terminology 05 Basic Operations 01 What is Streaming Data? 06 Demo
  • 5. What is Streaming Data? Streaming data refers to the data that is generated continuously in real time by thousands of data sources and delivered to a system for processing. Key Points: ● Real-time ● Continuous flow ● Variety of sources ● Variety of formats ● Requires specialized processing Examples: ● Ecommerce purchases ● Game data ● Information from social networks ● Log data ● Stock prices ● GPS data ● IoT Sensor Data
  • 6. Amazon Kinesis Data Streams Amazon Kinesis Data Streams is a real-time streaming data service by AWS. It makes it easy to collect and process real-time streaming data at high scale. Some key points to understand: ● Real-time data ● Highly Scalable ● Data sources ● Processing ● Cost-effective ● Easy to use
  • 7. High-Level Architecture ● The producers continually push data to Kinesis Data Streams, and the consumers process the data in real time. ● Once the processing is done by the consumer, the result are stored using an AWS service such as Amazon DynamoDB, Amazon Redshift, or Amazon S3.
  • 8. Key Concepts and Terminology ➢ Producer: It is an application that puts the data records into Amazon Kinesis Data Streams. ➢ Consumer: It is an application that retrieves the data records from Amazon Kinesis Data Streams and process them. ➢ Kinesis Data Stream: ○ A Kinesis data stream is a set of shards. ○ Each shard has a sequence of data records. ○ Each data record has a sequence number. ○ Data retains for 24 hours by default.
  • 9. ➢ Shard: ○ A shard is a uniquely identified sequence of data records ○ A stream is composed of one or more shards, each of which provides a fixed unit of capacity. ○ Each shard can support up to 1000 PUT records per second(or 1MB/sec), and up to 1,000 GET records per second(or 2MB/sec) ○ The data capacity of a stream is a function of the number of shards. ○ If the data rate increases, increase the number of shards allocated to the stream. ➢ Data Record: ○ A data record is the unit of data stored in a Kinesis data stream. ○ Each data record is composed of a sequence number, a partition key, and a data blob(up to 1MB).
  • 10. ➢ Sequence Number: ○ A sequence number is a unique identifier for each data record. ○ Allows to read data in the order and also to determine which records have been processed ➢ Partition Key: ○ A partition key is a meaningful identifier that is associated with each record. ○ It is used by the service to determine which shard to store the record in. ○ Specified by the data producer while putting data into a data stream ○ Records with the same partition key are stored together in the same shard. ➢ Retention Period: ○ Amount of time that data records are stored in an Amazon Kinesis Data Stream. ○ Default data retention period for a stream is 24 hours(configurable upto 365 days)
  • 11. ➢ Capacity Mode: ○ The capacity mode determines how capacity is managed and the usage charges for a data stream. ○ Currently, in Kinesis Data Streams, we can choose between an on-demand mode and a provisioned mode for our data streams.
  • 12. Basic Operations Amazon Kinesis Data Streams provides a number of operations that can be performed on a data stream. Here are some basic operations: ● create-stream ● describe-stream ● list-streams ● put-record ● get-shard-iterator ● get-records ● split-shard ● merge-shards ● delete-stream
  • 13. Demo
  • 14. References ● Kinesis Data Streams Official Documentation ● AWS Kinesis - Javatpoint