Amazon
DATABASES
CS-A
BATCH 2
GROUP 3
Prof. Anil Kadu
• Introduction to Amazon's database
• Objectives
• Amazon web services
• Scalability and availability
• Data security and privacy
• Literature Review
• References
Table of Content
COMPANY GROUP
Introduction to Amazon Platform:
•Amazon.com is one of the world's largest e-
commerce platforms, serving tens of millions of
customers globally.
•The platform offers a wide range of products, from
books and electronics to clothing and household
items.
•At Amazon, reliability, scalability, and performance
are critical to ensure an exceptional shopping
experience for customers.
COMPANY GROUP
Introduction to
Amazon Database:
• Amazon Database Services Overview: Amazon offers database
services for secure data storage and organization.
• Similar to a digital storage system, it's used for managing
various data types.
• Secure Data Storage: Amazon offers secure digital storage for
various data types, ensuring data confidentiality.
• Scalable Solutions: Businesses can easily scale resources to
manage growing data needs efficiently.
• Cost-Effective: Pay-as-you-go pricing minimizes upfront costs,
making it budget-friendly.
• Global Accessibility: Data centers worldwide provide low-
latency access, benefiting users globally.
• The objective of exploring the topic of Amazon databases is to understand their scalability, flexibility, and how
they can effectively handle large volumes of data, enabling businesses to grow and adapt to changing
demands.
• Another objective is to recognize the high availability and durability provided by Amazon databases, ensuring
that data is always accessible, protected, and can be reliably recovered in case of failures, thus ensuring the
smooth operation and continuity of businesses.
Objective
TWO TYPES OF DATABASES
RELATIONAL
DATABASES
NON-RELATIONAL
DATABASES
• A relational database organizes data into rows and columns, which collectively form a table.
• Structured Data Management : Relational databases organize data into structured tables with rows and
columns.
• ACID Properties : Relational databases follow ACID (Atomicity, Consistency, Isolation, Durability) properties,
ensuring data integrity and reliability.
• Multi-User Support : They provide concurrent access for multiple users or applications, ensuring data
consistency.
• Scalability Challenges : Scaling relational databases can be complex, especially when dealing with large
datasets or high traffic loads.
• Use Cases : Relational databases are suitable for a wide range of applications, including e-commerce, content
management systems, and financial systems.
Relational Databases
Amazon Web
Services
Amazon Aurora
Amazon RDS
Amazon Redshift
Amazon Aurora
• Amazon Aurora is a fully managed relational database service that combines
the performance and availability of a traditional enterprise database with the
simplicity and cost-effectiveness of a cloud database.
• Amazon Aurora is based on MySQL and PostgreSQL, but it adds a number of
features that make it more scalable, reliable, and secure.
• Aurora Storage Layer is durable and scalable.
• Aurora Storage Layer can store data for multiple Aurora DB
clusters.
• Aurora Replicas are identical copies of the database.
• Aurora Replicas are spread across multiple availability
zones.
• When you write data to an Aurora DB cluster, it is written
to all of the replicas.
• This ensures that your data is always available, even if one
of the replicas goes down.
Amazon RDS
• Amazon Relational Database Service (Amazon RDS) is a fully
managed relational database service that makes it easy to set up,
operate, and scale a relational database in the cloud.
• Amazon RDS provides a Number of Features that make it a good
choice for businesses of all sizes.
• High Availability: It ensures high availability through multi-Availability
Zone deployments, reducing downtime risk.
• Scalability: Amazon RDS allows easy vertical and horizontal scaling for
accommodating growing workloads.
Amazon Redshift
Amazon Redshift is a fully managed, petabyte-scale data warehouse service in
the cloud. It is designed to help you quickly and easily analyze large amounts
of data using standard SQL.
• Automatic scaling : Amazon Redshift can automatically scale up or down
based on your workload, so you only pay for the resources that you use.
• Point-in-time restore : You can restore your data to any point in time
within the past 90 days, so you can easily recover from accidental data
loss or corruption.
• Data encryption : Your data is encrypted at rest and in transit, so you can
be confident that your data is secure.
• Integration with Amazon S3 : You can easily import data from Amazon S3
into Amazon Redshift, and you can also export data from Amazon Redshift
to Amazon S3.
Comparative Analysis
Feature Amazon Aurora Amazon RDS Amazon Redshift
Database engines MySQL and PostgreSQL
MySQL, PostgreSQL, Oracle, MariaDB,
SQL Server
PostgreSQL
Data storage Shared storage layer Dedicated storage Dedicated storage
Availability zones Multi-AZ deployment Single-AZ deployment Multi-AZ deployment
Automatic scaling Automatic scaling Manual scaling Manual scaling
Performance
Up to 5 times the throughput of a
traditional MySQL database
Up to 30 times the throughput of a
traditional MySQL database
Up to 100 times the throughput of a
traditional PostgreSQL database
Cost-effectiveness
Pay for the storage and compute
resources that you use
Pay for the storage and compute
resources that you use
Pay for the storage and compute
resources that you use
Security Encryption at rest and in transit Encryption at rest and in transit Encryption at rest and in transit
Migration support
Supports migration from MySQL and
PostgreSQL databases
Supports migration from MySQL,
PostgreSQL, Oracle, MariaDB, and SQL
Server databases
Supports migration from PostgreSQL
databases
• Also known as NoSQL Databases
• Flexible Data Models : NoSQL databases support various data models, including document, key-value,
column-family, and graph databases, providing flexibility for different data types and structures.
• Schema-less : Unlike relational databases, NoSQL databases are typically schema-less, allowing developers to
store data without a predefined schema.
• Scalability : NoSQL databases are designed for horizontal scalability, making it easier to handle large volumes
of data and high traffic loads.
• High Availability : Many NoSQL databases offer built-in high availability and fault tolerance features, ensuring
data accessibility even in the presence of hardware failures.
• Use Cases : NoSQL databases are commonly used for applications like content management systems, social
media platforms, real-time analytics, and sensor data storage.
Non-Relational Databases
DynamoDB
• Fully Managed NoSQL Database : Amazon DynamoDB is a fully managed
NoSQL database service that offers seamless scalability, automatic backups,
and high availability. It eliminates the operational overhead of database
management.
• Scalability at Any Scale : DynamoDB can effortlessly handle workloads that
range from a few requests per second to millions of requests per second.
Its auto-scaling feature adjusts capacity to match traffic patterns.
• Low Latency and High Throughput : DynamoDB provides single-digit
millisecond latency for read and write operations, making it suitable for
real-time applications like gaming and IoT. It supports high-throughput
workloads with ease.
• Flexible Data Models : It supports both document and key-value data
models, enabling developers to store and query data in a way that best fits
their application requirements.
Classification on the Basis of
Requirements
Requirement Best database service
Relational database engine
Amazon Aurora, Amazon RDS, Amazon
Redshift
Data warehouse engine Amazon Redshift
Lots of storage space Amazon Aurora
More moderate amount of storage space Amazon RDS
Small amount of storage space Amazon Redshift
High performance Amazon Aurora
More moderate level of performance Amazon RDS
Good level of performance on a budget Amazon Redshift
Database service that can scale up and down as needed Amazon Aurora
Database service with a predictable level of scalability Amazon RDS
Database service that can scale up to very large sizes Amazon Redshift
Scalability
• You can adjust the computing power and memory
of your database system, with a maximum limit of
32 CPUs and 244 GiB of RAM.
• For Amazon Aurora, the database storage can
automatically grow as your data needs increase,
up to a maximum of 64 TB. Alternatively, you can
set a maximum size yourself.
• MVSOL, MariaDB, Oracle, and PostgreSQL engines
can scale up to 32 TB of storage, while SQL Server
supports up to 16 TB.
• You can create copies of your main database called
replicas, which can handle high-volume read
requests from applications.
Availability
• Always Accessible: Amazon's databases are
designed to be highly available.
• They use a Multi-Availability Zone
deployment, which means your data is
automatically replicated across different
locations.
• Automatic Backups and Recovery: Amazon's
databases regularly take backups of your
data and transaction logs.
Data Security and Privacy
• They have a number of physical, electronic,
and procedural safeguards in place to
protect customer data.
• They use industry-leading encryption
features to protect data in transit and at
rest.
• Customers have control over how their data
is stored and who can access it.
• Amazon does not access or use customer
data without their permission, except as
required to prevent fraud or abuse, or to
comply with law.
AUTHORS
RESEARCH PAPER
NAME ABSTRACT
A STUDY ON AWS
ARCHITECTURE
Prangya Prachi Samantaray
Dr. Aswini Kumar Mohanty
Dr.Arati Pradhan
• In the current era, cloud computing has become integral, offering
enhanced IT resource utilization and services.
• This growth has led to increased adoption of cloud solutions, with a
focus on inter-cloud communication and security.
• This paper centers on the hybrid cloud concept, particularly in the
context of Amazon Web Services (AWS), discussing its integration
with on-premises infrastructure and addressing key issues.
Millions of Tiny
Databases
Marc Brooker
Tao Chen
Fan Ping
• In 2013, a project was initiated to build a database for Amazon
EBS that required high availability and strong consistency.
• Recognizing the constraints of the CAP theorem and distributed
systems, the approach shifted to creating a distributed system with
numerous databases.
• Physalia, a transactional key-value store, was developed to provide
high availability and strong consistency for cloud control systems.
• It strategically places data based on transaction patterns and
infrastructure knowledge.
• This paper discusses its role in Amazon EBS and its broader
applicability to distributed system challenges.
AUTHORS
RESEARCH PAPER
NAME ABSTRACT
Database Security
Management for Healthcare
SaaS in the Amazon AWS
Cloud
Fabio Bracci,
Antonio Corradi,
Luca Foschini
• The rise of healthcare Cloud computing as an evolution of
networking, SaaS, and security best practices.
• Describes how to make healthcare SaaS services compliant in the
Cloud, ensuring data security standards are met.
• Encryption & security key management functions can be easily
added without compromising system performance.
• Future work involves conducting experiments on larger-scale
Cloud deployments and exploring different Cloud providers like
OpenStack.
Amazon Relational
Database Service Delivers
Enhanced Database
Performance at Lower Total
Cost
Carl W. Olofson,
Matthew Marden
• Amazon RDS supports six database engines the open source
MySQL, PostgreSQL, and Maria DB, as well as Oracle, Microsoft
SQL Server, and Amazon’s own cloud-native RDBMS, Aurora.
• They required databases that they could cost-effectively scale to
match business demand in a cost-effective but high-performing
manner.
• It is a cloud-native database service aligned with cloud strategies,.
IDC's study demonstrates the value of running various database
workloads and engines in RDS.
AUTHORS
RESEARCH PAPER
NAME ABSTRACT
Analysis of research on
amazon AWS cloud
computing seller data
security
Muhammad
Talha, Mishal
Sohail, Hajar
Hajji
• Big data and cloud computing are intertwined; major cloud
companies like Google and Amazon handle vast amounts of big
data.
• There are two types of RDS databases: Online Transaction
Processing (OLTP) and Online Analytical Processing (OLAP) Amazon
uses AWS cloud computing to implement data analysis and mining,
with a high level of security protection
• Significance of this big data in the amazon dataset is beyond
collection and aggregation.
• Its true value emerges during processing, analysis, visualization,
and application. Core big data concerns encompass data origin,
analysis, and commercialization.
Dynamo: Amazon’s
Highly Available Key-
value Store
Avinash Lakshman,
Alex Pilchin,
Swaminathan
Sivasubramanian
• Amazon.com faces a massive-scale reliability challenge due to the
global e-commerce operation's size.
• This paper introduces Dynamo, a highly available key-value
storage system.
• Dynamo prioritizes availability over consistency and employs
object versioning and application-assisted conflict resolution.
AUTHORS
RESEARCH PAPER
NAME ABSTRACT
Using the Amazon Metric
to Construct an Image
Database based on what
people do
Robert Columb ,
Theodor G Wheld
• The General approach of constructing an image database based on
people's activities or behaviors, using methods is commonly
employed in data analysis and computer vision.
• Defining scope & objective Data collection
• Data Annotation
• Dimensionally Reduction
• Data Construction Data Analysis
• Improvement and Application
A Feasible Schema Design
Strategy for Amazon
DynamoDB: A Nested
Normal Form Approach
Wai Yin Mok and the
University of Alabama
• Designing a schema for Amazon DynamoDB involves making
decisions about the structure of your tables and the organization of
your data to optimize performance, scalability, and cost-
effectiveness.
• First understand the DataModel ,Start with Entities, Denormalize it,
Hierarchical Data Modelling identify the PK,CK and FK, batch
operation and transaction ,estimate and optimize capacity units
,testing and optimization
AUTHORS
RESEARCH PAPER
NAME ABSTRACT
Research Paper on AWS
Cloud Infrastructure vs
Traditional On-Premise
Akshay Kushwaha
• This paper highlights the advantages of AWS Cloud over on-
premises solutions and introduces AWS services.
• AWS, launched in 2006, has seen widespread adoption, with 77%
of enterprises having cloud applications or infrastructure.
• It offers a wide range of global cloud products with over one
million active enterprise customers, featuring on-demand services
and over 140 offerings.
• This enables rapid response to changing business needs for
startups, SMBs, and the public sector.
Questions ?
THANK YOU
CS-A
BATCH 2
GROUP 3
Prasanna Bhalerao [35]
Yash Bhalerao [36]
Atharva Chawle [50]
Chirag Belani [51]
Kalyani Chopade [52]

More Related Content

PPTX
Relational Database Design
PPTX
PPTX
Elements of a Story
PDF
AWS Data Analytics on AWS
PPTX
Social and Professional Issues in Computing - Ethics
PPT
C++ Arrays
PPTX
Regular Expression Examples.pptx
PDF
AWS RDS
Relational Database Design
Elements of a Story
AWS Data Analytics on AWS
Social and Professional Issues in Computing - Ethics
C++ Arrays
Regular Expression Examples.pptx
AWS RDS

What's hot (20)

PPTX
Big data architectures
PPTX
Chapter 1 big data
PPTX
Presentation on data preparation with pandas
PDF
Efficient Spark Analytics on Encrypted Data with Gidon Gershinsky
PPTX
Multi cloud networking
PDF
Dbms vs dsms
PPTX
Services comparison among Microsoft Azure AWS and Google Cloud Platform
PPTX
PDF
Introduction to Big Data Analytics and Data Science
PPTX
INTRODUCTION TO BIG DATA AND HADOOP
PPTX
Cloud security ppt
PPTX
Amazon_SNS.pptx
PDF
Data Visualization in Exploratory Data Analysis
PPTX
Unit 4-apache pig
PDF
Data Analysis and Statistics in Python using pandas and statsmodels
PPTX
Introduction to Hadoop and Hadoop component
PPTX
Navies bayes
PPTX
Cloud computing
PPT
Cloud Computing: Hadoop
PDF
Google Cloud Platform Tutorial | GCP Fundamentals | Edureka
Big data architectures
Chapter 1 big data
Presentation on data preparation with pandas
Efficient Spark Analytics on Encrypted Data with Gidon Gershinsky
Multi cloud networking
Dbms vs dsms
Services comparison among Microsoft Azure AWS and Google Cloud Platform
Introduction to Big Data Analytics and Data Science
INTRODUCTION TO BIG DATA AND HADOOP
Cloud security ppt
Amazon_SNS.pptx
Data Visualization in Exploratory Data Analysis
Unit 4-apache pig
Data Analysis and Statistics in Python using pandas and statsmodels
Introduction to Hadoop and Hadoop component
Navies bayes
Cloud computing
Cloud Computing: Hadoop
Google Cloud Platform Tutorial | GCP Fundamentals | Edureka
Ad

Similar to amazon database (14)

PPTX
9. AWS_Databases_Databases_Aws_Cloud.pptx
PPTX
week 5 cloud security computing northumbria foudation
PPTX
Azure data platform overview
PDF
Amazon Web Services - Relational Database Service Meetup
PDF
Migrating Your Databases to AWS Deep Dive on Amazon RDS and AWS
PDF
AWS Certified Cloud Practitioner Course S11-S17
PPTX
DBMbsbsiisnsbshsjsbshsjwnsnS_PPT final.pptx
PDF
AWS Study Group - Chapter 09 - Storage Option [Solution Architect Associate G...
PPTX
Cloud Databases and Big Data - Mechlin.pptx
PDF
Bases de datos en la nube con AWS
PPTX
2016 Utah Cloud Summit: RDS
PDF
Intro to database_services_fg_aws_summit_2014
PPTX
AWS Database Services
PDF
AMAZON RELATIONAL DATABASES.pdf
9. AWS_Databases_Databases_Aws_Cloud.pptx
week 5 cloud security computing northumbria foudation
Azure data platform overview
Amazon Web Services - Relational Database Service Meetup
Migrating Your Databases to AWS Deep Dive on Amazon RDS and AWS
AWS Certified Cloud Practitioner Course S11-S17
DBMbsbsiisnsbshsjsbshsjwnsnS_PPT final.pptx
AWS Study Group - Chapter 09 - Storage Option [Solution Architect Associate G...
Cloud Databases and Big Data - Mechlin.pptx
Bases de datos en la nube con AWS
2016 Utah Cloud Summit: RDS
Intro to database_services_fg_aws_summit_2014
AWS Database Services
AMAZON RELATIONAL DATABASES.pdf
Ad

Recently uploaded (20)

PDF
VSL-Strand-Post-tensioning-Systems-Technical-Catalogue_2019-01.pdf
PPTX
Agentic Artificial Intelligence (Agentic AI).pptx
PDF
UEFA_Embodied_Carbon_Emissions_Football_Infrastructure.pdf
PPTX
Wireless sensor networks (WSN) SRM unit 2
PPT
Programmable Logic Controller PLC and Industrial Automation
PDF
Computer System Architecture 3rd Edition-M Morris Mano.pdf
PDF
LOW POWER CLASS AB SI POWER AMPLIFIER FOR WIRELESS MEDICAL SENSOR NETWORK
PPTX
Module1.pptxrjkeieuekwkwoowkemehehehrjrjrj
PPT
UNIT-I Machine Learning Essentials for 2nd years
PDF
Lesson 3 .pdf
PPTX
Cisco Network Behaviour dibuywvdsvdtdstydsdsa
PDF
Research on ultrasonic sensor for TTU.pdf
PDF
Mechanics of materials week 2 rajeshwari
PPTX
MAD Unit - 3 User Interface and Data Management (Diploma IT)
PDF
VTU IOT LAB MANUAL (BCS701) Computer science and Engineering
PPTX
CNS - Unit 1 (Introduction To Computer Networks) - PPT (2).pptx
PDF
Principles of operation, construction, theory, advantages and disadvantages, ...
PPTX
BBOC407 BIOLOGY FOR ENGINEERS (CS) - MODULE 1 PART 1.pptx
PDF
MACCAFERRY GUIA GAVIONES TERRAPLENES EN ESPAÑOL
PPTX
AI-Reporting for Emerging Technologies(BS Computer Engineering)
VSL-Strand-Post-tensioning-Systems-Technical-Catalogue_2019-01.pdf
Agentic Artificial Intelligence (Agentic AI).pptx
UEFA_Embodied_Carbon_Emissions_Football_Infrastructure.pdf
Wireless sensor networks (WSN) SRM unit 2
Programmable Logic Controller PLC and Industrial Automation
Computer System Architecture 3rd Edition-M Morris Mano.pdf
LOW POWER CLASS AB SI POWER AMPLIFIER FOR WIRELESS MEDICAL SENSOR NETWORK
Module1.pptxrjkeieuekwkwoowkemehehehrjrjrj
UNIT-I Machine Learning Essentials for 2nd years
Lesson 3 .pdf
Cisco Network Behaviour dibuywvdsvdtdstydsdsa
Research on ultrasonic sensor for TTU.pdf
Mechanics of materials week 2 rajeshwari
MAD Unit - 3 User Interface and Data Management (Diploma IT)
VTU IOT LAB MANUAL (BCS701) Computer science and Engineering
CNS - Unit 1 (Introduction To Computer Networks) - PPT (2).pptx
Principles of operation, construction, theory, advantages and disadvantages, ...
BBOC407 BIOLOGY FOR ENGINEERS (CS) - MODULE 1 PART 1.pptx
MACCAFERRY GUIA GAVIONES TERRAPLENES EN ESPAÑOL
AI-Reporting for Emerging Technologies(BS Computer Engineering)

amazon database

  • 2. • Introduction to Amazon's database • Objectives • Amazon web services • Scalability and availability • Data security and privacy • Literature Review • References Table of Content
  • 3. COMPANY GROUP Introduction to Amazon Platform: •Amazon.com is one of the world's largest e- commerce platforms, serving tens of millions of customers globally. •The platform offers a wide range of products, from books and electronics to clothing and household items. •At Amazon, reliability, scalability, and performance are critical to ensure an exceptional shopping experience for customers.
  • 4. COMPANY GROUP Introduction to Amazon Database: • Amazon Database Services Overview: Amazon offers database services for secure data storage and organization. • Similar to a digital storage system, it's used for managing various data types. • Secure Data Storage: Amazon offers secure digital storage for various data types, ensuring data confidentiality. • Scalable Solutions: Businesses can easily scale resources to manage growing data needs efficiently. • Cost-Effective: Pay-as-you-go pricing minimizes upfront costs, making it budget-friendly. • Global Accessibility: Data centers worldwide provide low- latency access, benefiting users globally.
  • 5. • The objective of exploring the topic of Amazon databases is to understand their scalability, flexibility, and how they can effectively handle large volumes of data, enabling businesses to grow and adapt to changing demands. • Another objective is to recognize the high availability and durability provided by Amazon databases, ensuring that data is always accessible, protected, and can be reliably recovered in case of failures, thus ensuring the smooth operation and continuity of businesses. Objective
  • 6. TWO TYPES OF DATABASES RELATIONAL DATABASES NON-RELATIONAL DATABASES
  • 7. • A relational database organizes data into rows and columns, which collectively form a table. • Structured Data Management : Relational databases organize data into structured tables with rows and columns. • ACID Properties : Relational databases follow ACID (Atomicity, Consistency, Isolation, Durability) properties, ensuring data integrity and reliability. • Multi-User Support : They provide concurrent access for multiple users or applications, ensuring data consistency. • Scalability Challenges : Scaling relational databases can be complex, especially when dealing with large datasets or high traffic loads. • Use Cases : Relational databases are suitable for a wide range of applications, including e-commerce, content management systems, and financial systems. Relational Databases
  • 9. Amazon Aurora • Amazon Aurora is a fully managed relational database service that combines the performance and availability of a traditional enterprise database with the simplicity and cost-effectiveness of a cloud database. • Amazon Aurora is based on MySQL and PostgreSQL, but it adds a number of features that make it more scalable, reliable, and secure. • Aurora Storage Layer is durable and scalable. • Aurora Storage Layer can store data for multiple Aurora DB clusters. • Aurora Replicas are identical copies of the database. • Aurora Replicas are spread across multiple availability zones. • When you write data to an Aurora DB cluster, it is written to all of the replicas. • This ensures that your data is always available, even if one of the replicas goes down.
  • 10. Amazon RDS • Amazon Relational Database Service (Amazon RDS) is a fully managed relational database service that makes it easy to set up, operate, and scale a relational database in the cloud. • Amazon RDS provides a Number of Features that make it a good choice for businesses of all sizes. • High Availability: It ensures high availability through multi-Availability Zone deployments, reducing downtime risk. • Scalability: Amazon RDS allows easy vertical and horizontal scaling for accommodating growing workloads.
  • 11. Amazon Redshift Amazon Redshift is a fully managed, petabyte-scale data warehouse service in the cloud. It is designed to help you quickly and easily analyze large amounts of data using standard SQL. • Automatic scaling : Amazon Redshift can automatically scale up or down based on your workload, so you only pay for the resources that you use. • Point-in-time restore : You can restore your data to any point in time within the past 90 days, so you can easily recover from accidental data loss or corruption. • Data encryption : Your data is encrypted at rest and in transit, so you can be confident that your data is secure. • Integration with Amazon S3 : You can easily import data from Amazon S3 into Amazon Redshift, and you can also export data from Amazon Redshift to Amazon S3.
  • 13. Feature Amazon Aurora Amazon RDS Amazon Redshift Database engines MySQL and PostgreSQL MySQL, PostgreSQL, Oracle, MariaDB, SQL Server PostgreSQL Data storage Shared storage layer Dedicated storage Dedicated storage Availability zones Multi-AZ deployment Single-AZ deployment Multi-AZ deployment Automatic scaling Automatic scaling Manual scaling Manual scaling Performance Up to 5 times the throughput of a traditional MySQL database Up to 30 times the throughput of a traditional MySQL database Up to 100 times the throughput of a traditional PostgreSQL database Cost-effectiveness Pay for the storage and compute resources that you use Pay for the storage and compute resources that you use Pay for the storage and compute resources that you use Security Encryption at rest and in transit Encryption at rest and in transit Encryption at rest and in transit Migration support Supports migration from MySQL and PostgreSQL databases Supports migration from MySQL, PostgreSQL, Oracle, MariaDB, and SQL Server databases Supports migration from PostgreSQL databases
  • 14. • Also known as NoSQL Databases • Flexible Data Models : NoSQL databases support various data models, including document, key-value, column-family, and graph databases, providing flexibility for different data types and structures. • Schema-less : Unlike relational databases, NoSQL databases are typically schema-less, allowing developers to store data without a predefined schema. • Scalability : NoSQL databases are designed for horizontal scalability, making it easier to handle large volumes of data and high traffic loads. • High Availability : Many NoSQL databases offer built-in high availability and fault tolerance features, ensuring data accessibility even in the presence of hardware failures. • Use Cases : NoSQL databases are commonly used for applications like content management systems, social media platforms, real-time analytics, and sensor data storage. Non-Relational Databases
  • 15. DynamoDB • Fully Managed NoSQL Database : Amazon DynamoDB is a fully managed NoSQL database service that offers seamless scalability, automatic backups, and high availability. It eliminates the operational overhead of database management. • Scalability at Any Scale : DynamoDB can effortlessly handle workloads that range from a few requests per second to millions of requests per second. Its auto-scaling feature adjusts capacity to match traffic patterns. • Low Latency and High Throughput : DynamoDB provides single-digit millisecond latency for read and write operations, making it suitable for real-time applications like gaming and IoT. It supports high-throughput workloads with ease. • Flexible Data Models : It supports both document and key-value data models, enabling developers to store and query data in a way that best fits their application requirements.
  • 16. Classification on the Basis of Requirements
  • 17. Requirement Best database service Relational database engine Amazon Aurora, Amazon RDS, Amazon Redshift Data warehouse engine Amazon Redshift Lots of storage space Amazon Aurora More moderate amount of storage space Amazon RDS Small amount of storage space Amazon Redshift High performance Amazon Aurora More moderate level of performance Amazon RDS Good level of performance on a budget Amazon Redshift Database service that can scale up and down as needed Amazon Aurora Database service with a predictable level of scalability Amazon RDS Database service that can scale up to very large sizes Amazon Redshift
  • 18. Scalability • You can adjust the computing power and memory of your database system, with a maximum limit of 32 CPUs and 244 GiB of RAM. • For Amazon Aurora, the database storage can automatically grow as your data needs increase, up to a maximum of 64 TB. Alternatively, you can set a maximum size yourself. • MVSOL, MariaDB, Oracle, and PostgreSQL engines can scale up to 32 TB of storage, while SQL Server supports up to 16 TB. • You can create copies of your main database called replicas, which can handle high-volume read requests from applications.
  • 19. Availability • Always Accessible: Amazon's databases are designed to be highly available. • They use a Multi-Availability Zone deployment, which means your data is automatically replicated across different locations. • Automatic Backups and Recovery: Amazon's databases regularly take backups of your data and transaction logs.
  • 20. Data Security and Privacy • They have a number of physical, electronic, and procedural safeguards in place to protect customer data. • They use industry-leading encryption features to protect data in transit and at rest. • Customers have control over how their data is stored and who can access it. • Amazon does not access or use customer data without their permission, except as required to prevent fraud or abuse, or to comply with law.
  • 21. AUTHORS RESEARCH PAPER NAME ABSTRACT A STUDY ON AWS ARCHITECTURE Prangya Prachi Samantaray Dr. Aswini Kumar Mohanty Dr.Arati Pradhan • In the current era, cloud computing has become integral, offering enhanced IT resource utilization and services. • This growth has led to increased adoption of cloud solutions, with a focus on inter-cloud communication and security. • This paper centers on the hybrid cloud concept, particularly in the context of Amazon Web Services (AWS), discussing its integration with on-premises infrastructure and addressing key issues. Millions of Tiny Databases Marc Brooker Tao Chen Fan Ping • In 2013, a project was initiated to build a database for Amazon EBS that required high availability and strong consistency. • Recognizing the constraints of the CAP theorem and distributed systems, the approach shifted to creating a distributed system with numerous databases. • Physalia, a transactional key-value store, was developed to provide high availability and strong consistency for cloud control systems. • It strategically places data based on transaction patterns and infrastructure knowledge. • This paper discusses its role in Amazon EBS and its broader applicability to distributed system challenges.
  • 22. AUTHORS RESEARCH PAPER NAME ABSTRACT Database Security Management for Healthcare SaaS in the Amazon AWS Cloud Fabio Bracci, Antonio Corradi, Luca Foschini • The rise of healthcare Cloud computing as an evolution of networking, SaaS, and security best practices. • Describes how to make healthcare SaaS services compliant in the Cloud, ensuring data security standards are met. • Encryption & security key management functions can be easily added without compromising system performance. • Future work involves conducting experiments on larger-scale Cloud deployments and exploring different Cloud providers like OpenStack. Amazon Relational Database Service Delivers Enhanced Database Performance at Lower Total Cost Carl W. Olofson, Matthew Marden • Amazon RDS supports six database engines the open source MySQL, PostgreSQL, and Maria DB, as well as Oracle, Microsoft SQL Server, and Amazon’s own cloud-native RDBMS, Aurora. • They required databases that they could cost-effectively scale to match business demand in a cost-effective but high-performing manner. • It is a cloud-native database service aligned with cloud strategies,. IDC's study demonstrates the value of running various database workloads and engines in RDS.
  • 23. AUTHORS RESEARCH PAPER NAME ABSTRACT Analysis of research on amazon AWS cloud computing seller data security Muhammad Talha, Mishal Sohail, Hajar Hajji • Big data and cloud computing are intertwined; major cloud companies like Google and Amazon handle vast amounts of big data. • There are two types of RDS databases: Online Transaction Processing (OLTP) and Online Analytical Processing (OLAP) Amazon uses AWS cloud computing to implement data analysis and mining, with a high level of security protection • Significance of this big data in the amazon dataset is beyond collection and aggregation. • Its true value emerges during processing, analysis, visualization, and application. Core big data concerns encompass data origin, analysis, and commercialization. Dynamo: Amazon’s Highly Available Key- value Store Avinash Lakshman, Alex Pilchin, Swaminathan Sivasubramanian • Amazon.com faces a massive-scale reliability challenge due to the global e-commerce operation's size. • This paper introduces Dynamo, a highly available key-value storage system. • Dynamo prioritizes availability over consistency and employs object versioning and application-assisted conflict resolution.
  • 24. AUTHORS RESEARCH PAPER NAME ABSTRACT Using the Amazon Metric to Construct an Image Database based on what people do Robert Columb , Theodor G Wheld • The General approach of constructing an image database based on people's activities or behaviors, using methods is commonly employed in data analysis and computer vision. • Defining scope & objective Data collection • Data Annotation • Dimensionally Reduction • Data Construction Data Analysis • Improvement and Application A Feasible Schema Design Strategy for Amazon DynamoDB: A Nested Normal Form Approach Wai Yin Mok and the University of Alabama • Designing a schema for Amazon DynamoDB involves making decisions about the structure of your tables and the organization of your data to optimize performance, scalability, and cost- effectiveness. • First understand the DataModel ,Start with Entities, Denormalize it, Hierarchical Data Modelling identify the PK,CK and FK, batch operation and transaction ,estimate and optimize capacity units ,testing and optimization
  • 25. AUTHORS RESEARCH PAPER NAME ABSTRACT Research Paper on AWS Cloud Infrastructure vs Traditional On-Premise Akshay Kushwaha • This paper highlights the advantages of AWS Cloud over on- premises solutions and introduces AWS services. • AWS, launched in 2006, has seen widespread adoption, with 77% of enterprises having cloud applications or infrastructure. • It offers a wide range of global cloud products with over one million active enterprise customers, featuring on-demand services and over 140 offerings. • This enables rapid response to changing business needs for startups, SMBs, and the public sector.
  • 27. THANK YOU CS-A BATCH 2 GROUP 3 Prasanna Bhalerao [35] Yash Bhalerao [36] Atharva Chawle [50] Chirag Belani [51] Kalyani Chopade [52]